Skip to content

Actions: huggingface/optimum-quanto

Linux examples (CPU, CUDA)

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
220 workflow runs
220 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Stop enforcing conventional commits in pull-requests
Linux examples (CPU, CUDA) #133: Pull request #302 synchronize by dacorvo
September 2, 2024 13:25 1m 33s stop_enforcing_conventional_commits
September 2, 2024 13:25 1m 33s
feat: E4M3fnuz FP8 format added
Linux examples (CPU, CUDA) #131: Pull request #281 synchronize by dacorvo
August 28, 2024 13:43 3m 49s maktukmak:add_e4m3fnuz
August 28, 2024 13:43 3m 49s
feat(examples): add an example with Whisper for speech recognition
Linux examples (CPU, CUDA) #130: Commit 9d50ea5 pushed by dacorvo
August 28, 2024 13:38 18m 18s main
August 28, 2024 13:38 18m 18s
feat(examples): add an example with Whisper for speech recognition
Linux examples (CPU, CUDA) #129: Pull request #298 opened by dacorvo
August 28, 2024 13:38 18m 38s examples/speech/whisper
August 28, 2024 13:38 18m 38s
[WIP] Whisper demo for ASR
Linux examples (CPU, CUDA) #128: Pull request #242 synchronize by dacorvo
August 28, 2024 13:12 19m 46s mattiadg:examples/speech/whisper
August 28, 2024 13:12 19m 46s
perf: faster and less memory-intensive model [re]quantization
Linux examples (CPU, CUDA) #127: Commit 4c4bff5 pushed by dacorvo
August 28, 2024 13:11 19m 32s main
August 28, 2024 13:11 19m 32s
perf: faster and less memory-intensive model [re]quantization
Linux examples (CPU, CUDA) #126: Pull request #297 opened by dacorvo
August 28, 2024 10:22 21m 4s faster-model-quantize
August 28, 2024 10:22 21m 4s
test(qlinear): increase tolerance when using Marlin FP8
Linux examples (CPU, CUDA) #125: Commit 739309f pushed by dacorvo
August 28, 2024 08:43 17m 11s main
August 28, 2024 08:43 17m 11s
Add support for Marlin fp16/fp8 kernel (refactored)
Linux examples (CPU, CUDA) #124: Pull request #296 synchronize by dacorvo
August 28, 2024 08:14 20m 27s marlin-fp8-refactored
August 28, 2024 08:14 20m 27s
Add support for Marlin fp16/fp8 kernel (refactored)
Linux examples (CPU, CUDA) #123: Pull request #296 synchronize by dacorvo
August 28, 2024 06:43 19m 52s marlin-fp8-refactored
August 28, 2024 06:43 19m 52s
Add support for Marlin fp16/fp8 kernel (refactored)
Linux examples (CPU, CUDA) #122: Pull request #296 opened by dacorvo
August 27, 2024 16:18 20m 4s marlin-fp8-refactored
August 27, 2024 16:18 20m 4s
fix: Enable non-strict loading of state dicts
Linux examples (CPU, CUDA) #121: Commit f9b71f4 pushed by dacorvo
August 27, 2024 07:07 24m 12s main
August 27, 2024 07:07 24m 12s
[WIP] Whisper demo for ASR
Linux examples (CPU, CUDA) #118: Pull request #242 synchronize by dacorvo
August 26, 2024 07:31 21m 6s mattiadg:examples/speech/whisper
August 26, 2024 07:31 21m 6s
test(requantize): also test with bfloat16
Linux examples (CPU, CUDA) #117: Commit f3b39ce pushed by dacorvo
August 24, 2024 13:01 19m 3s main
August 24, 2024 13:01 19m 3s
Implement Tensor.equal and torch.equal for QTensor
Linux examples (CPU, CUDA) #116: Pull request #294 opened by dacorvo
August 24, 2024 12:41 17m 22s qtensor_torch_equal
August 24, 2024 12:41 17m 22s
feat(qmodule): avoid random weights initialization
Linux examples (CPU, CUDA) #115: Commit a1c310b pushed by dacorvo
August 24, 2024 12:36 17m 49s main
August 24, 2024 12:36 17m 49s
Avoid random weights initialization when quantizing
Linux examples (CPU, CUDA) #114: Pull request #291 opened by dacorvo
August 23, 2024 14:17 18m 33s avoid_random_weights_quantize
August 23, 2024 14:17 18m 33s
review: add missing test guards
Linux examples (CPU, CUDA) #113: Commit 3345bef pushed by dacorvo
August 22, 2024 13:16 23m 57s main
August 22, 2024 13:16 23m 57s
feat: implement load and save support from the Hub.
Linux examples (CPU, CUDA) #112: Pull request #263 synchronize by dacorvo
August 22, 2024 12:42 19m 17s feat-hub-support
August 22, 2024 12:42 19m 17s
docs: fix typo in file name s/READMD.md/README.md/
Linux examples (CPU, CUDA) #110: Commit 084e6e7 pushed by dacorvo
August 21, 2024 12:45 22m 13s main
August 21, 2024 12:45 22m 13s
feat: implement load and save support from the Hub.
Linux examples (CPU, CUDA) #109: Pull request #263 synchronize by dacorvo
August 21, 2024 12:18 20m 7s feat-hub-support
August 21, 2024 12:18 20m 7s
docs: fix typo in file name s/READMD.md/README.md/
Linux examples (CPU, CUDA) #108: Pull request #268 synchronize by dacorvo
August 21, 2024 12:13 22m 41s dvrogozh:fixes
August 21, 2024 12:13 22m 41s
fix: adjust _convert_weight_to_int4pack_cpu input weights for pytorch…
Linux examples (CPU, CUDA) #105: Commit c02750b pushed by dacorvo
August 20, 2024 12:56 23m 42s main
August 20, 2024 12:56 23m 42s