Skip to content

Actions: huggingface/optimum-quanto

Linux examples (CPU, CUDA)

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
220 workflow runs
220 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Add marlin int4 kernel
Linux examples (CPU, CUDA) #217: Pull request #333 synchronize by dacorvo
October 10, 2024 10:07 21m 56s add_marlin_int4_kernel
October 10, 2024 10:07 21m 56s
Add marlin int4 kernel
Linux examples (CPU, CUDA) #216: Pull request #333 synchronize by dacorvo
October 10, 2024 09:15 22m 14s add_marlin_int4_kernel
October 10, 2024 09:15 22m 14s
Applied formatter changes
Linux examples (CPU, CUDA) #215: Commit b0cce24 pushed by dacorvo
October 8, 2024 14:07 17m 54s main
October 8, 2024 14:07 17m 54s
Switched linters, black -> ruff
Linux examples (CPU, CUDA) #214: Pull request #334 synchronize by ishandeva
October 8, 2024 11:09 21m 38s ishandeva:Switch-to-ruff-native-formatter-#186
October 8, 2024 11:09 21m 38s
Switched linters, black -> ruff
Linux examples (CPU, CUDA) #213: Pull request #334 synchronize by ishandeva
October 7, 2024 14:57 1m 31s ishandeva:Switch-to-ruff-native-formatter-#186
October 7, 2024 14:57 1m 31s
Switched linters, black -> ruff
Linux examples (CPU, CUDA) #212: Pull request #334 synchronize by ishandeva
October 7, 2024 12:24 20m 54s ishandeva:Switch-to-ruff-native-formatter-#186
October 7, 2024 12:24 20m 54s
Switched linters, black -> ruff
Linux examples (CPU, CUDA) #210: Pull request #334 opened by ishandeva
October 6, 2024 16:06 1m 30s ishandeva:Switch-to-ruff-native-formatter-#186
October 6, 2024 16:06 1m 30s
Add marlin int4 kernel
Linux examples (CPU, CUDA) #209: Pull request #333 opened by dacorvo
October 6, 2024 14:35 20m 25s add_marlin_int4_kernel
October 6, 2024 14:35 20m 25s
feat: add HIP support
Linux examples (CPU, CUDA) #208: Commit 843b793 pushed by dacorvo
October 4, 2024 16:40 17m 16s main
October 4, 2024 16:40 17m 16s
Add hip support
Linux examples (CPU, CUDA) #207: Pull request #330 synchronize by dacorvo
October 4, 2024 16:11 21m 18s add_hip_support
October 4, 2024 16:11 21m 18s
Add hip support
Linux examples (CPU, CUDA) #206: Pull request #330 opened by dacorvo
October 4, 2024 16:06 1m 48s add_hip_support
October 4, 2024 16:06 1m 48s
Revert "test: reduce random qweight magnitude"
Linux examples (CPU, CUDA) #205: Commit 76940cb pushed by dacorvo
October 4, 2024 15:38 20m 21s main
October 4, 2024 15:38 20m 21s
Refactor extensions
Linux examples (CPU, CUDA) #204: Pull request #329 opened by dacorvo
October 4, 2024 15:12 20m 58s refactor_extensions
October 4, 2024 15:12 20m 58s
refactor(library): remove unpack indirection
Linux examples (CPU, CUDA) #203: Commit b54737e pushed by dacorvo
October 3, 2024 09:25 17m 48s main
October 3, 2024 09:25 17m 48s
Remove overheads in library
Linux examples (CPU, CUDA) #202: Pull request #328 synchronize by dacorvo
October 3, 2024 08:58 22m 46s refactor_library
October 3, 2024 08:58 22m 46s
Remove overheads in library
Linux examples (CPU, CUDA) #201: Pull request #328 opened by dacorvo
October 3, 2024 08:38 24m 14s refactor_library
October 3, 2024 08:38 24m 14s
fix(qbytes_mm): reshape input
Linux examples (CPU, CUDA) #200: Commit 194150f pushed by dacorvo
October 1, 2024 11:15 24m 22s main
October 1, 2024 11:15 24m 22s
Fix lumina
Linux examples (CPU, CUDA) #199: Pull request #326 synchronize by dacorvo
October 1, 2024 10:07 26m 42s fix_lumina
October 1, 2024 10:07 26m 42s
Fix lumina
Linux examples (CPU, CUDA) #198: Pull request #326 opened by dacorvo
October 1, 2024 09:22 18m 2s fix_lumina
October 1, 2024 09:22 18m 2s
feat(examples): use QuantizedModelForCausalLM
Linux examples (CPU, CUDA) #197: Commit 13b2b0f pushed by dacorvo
September 30, 2024 14:28 20m 15s main
September 30, 2024 14:28 20m 15s
Fix missing call in QuantizedTransformersModel
Linux examples (CPU, CUDA) #196: Pull request #325 opened by dacorvo
September 30, 2024 12:56 18m 40s fix_call_transformers
September 30, 2024 12:56 18m 40s
refactor(library): reduce overhead in marlin op
Linux examples (CPU, CUDA) #195: Commit 7b73aae pushed by dacorvo
September 30, 2024 12:45 17m 2s main
September 30, 2024 12:45 17m 2s
refactor(library): reduce overhead in marlin op
Linux examples (CPU, CUDA) #194: Pull request #323 synchronize by dacorvo
September 30, 2024 12:22 18m 35s avoid_overhead_fp8_marlin
September 30, 2024 12:22 18m 35s
refactor(library): reduce overhead in marlin op
Linux examples (CPU, CUDA) #193: Pull request #323 synchronize by dacorvo
September 30, 2024 12:06 22m 33s avoid_overhead_fp8_marlin
September 30, 2024 12:06 22m 33s
refactor(library): reduce overhead in marlin op
Linux examples (CPU, CUDA) #192: Pull request #323 synchronize by dacorvo
September 30, 2024 08:05 8m 24s avoid_overhead_fp8_marlin
September 30, 2024 08:05 8m 24s