Actions: huggingface/optimum-quanto

Linux CUDA tests

73 workflow run results

Refactor linear dispatch to use new torch kernels
Linux CUDA tests #83: Pull request #222 synchronize by dacorvo
June 27, 2024 15:09 6m 48s _weight_int8pack_mm
Refactor linear dispatch to use new torch kernels
Linux CUDA tests #82: Pull request #222 synchronize by dacorvo
June 27, 2024 14:59 5m 46s _weight_int8pack_mm
Refactor linear dispatch to use new torch kernels
Linux CUDA tests #81: Pull request #222 synchronize by dacorvo
June 27, 2024 14:35 5m 56s _weight_int8pack_mm
Refactor linear dispatch to use new torch kernels
Linux CUDA tests #80: Pull request #222 opened by dacorvo
June 27, 2024 14:26 5m 32s _weight_int8pack_mm
ci: add OWL detection examples
Linux CUDA tests #79: Commit 759bcef pushed by dacorvo
June 12, 2024 20:16 5m 55s main
Add owlv2 detection example
Linux CUDA tests #78: Pull request #210 synchronize by dacorvo
June 12, 2024 19:59 5m 51s add_owlv2_detection_example
Add owlv2 detection example
Linux CUDA tests #77: Pull request #210 synchronize by dacorvo
June 12, 2024 19:11 5m 40s add_owlv2_detection_example
Add owlv2 detection example
Linux CUDA tests #76: Pull request #210 synchronize by dacorvo
June 12, 2024 16:58 8m 6s add_owlv2_detection_example
Add owlv2 detection example
Linux CUDA tests #75: Pull request #210 synchronize by dacorvo
June 12, 2024 16:45 5m 45s add_owlv2_detection_example
Add owlv2 detection example
Linux CUDA tests #74: Pull request #210 opened by dacorvo
June 12, 2024 16:40 1m 43s add_owlv2_detection_example
feat(cuda): compile according to capabilities
Linux CUDA tests #73: Commit 0ca8021 pushed by dacorvo
June 12, 2024 16:21 8m 28s main
feat(cuda): compile according to capabilities
Linux CUDA tests #72: Pull request #209 opened by dacorvo
June 12, 2024 16:08 5m 43s fix_cuda_regression
test(mm): only test if CUDA arch is at least sm80
Linux CUDA tests #71: Commit fef7b60 pushed by dacorvo
June 3, 2024 16:37 5m 48s main
Only use optimized CUDA kernels if arch is at least sm80
Linux CUDA tests #70: Pull request #208 synchronize by dacorvo
June 3, 2024 15:50 6m 5s detect_cuda_version
Only use optimized CUDA kernels if arch is at least sm80
Linux CUDA tests #69: Pull request #208 opened by dacorvo
June 3, 2024 15:36 1m 48s detect_cuda_version
ci: update workflows
Linux CUDA tests #68: Commit eb6b82d pushed by dacorvo
May 31, 2024 14:41 5m 4s main
Convert quanto to optimum-quanto
Linux CUDA tests #67: Pull request #205 synchronize by dacorvo
May 31, 2024 14:23 5m 9s namespace_package
Convert quanto to optimum-quanto
Linux CUDA tests #66: Pull request #205 synchronize by dacorvo
May 31, 2024 14:20 2m 16s namespace_package
Convert quanto to optimum-quanto
Linux CUDA tests #65: Pull request #205 opened by dacorvo
May 31, 2024 14:15 1m 51s namespace_package
fix(examples): pin transformers version
Linux CUDA tests #64: Commit 7de45a3 pushed by dacorvo
May 23, 2024 18:39 5m 45s main
Add latest AWQ CUDA fp16 int4 kernels
Linux CUDA tests #63: Pull request #198 synchronize by dacorvo
May 22, 2024 16:56 5m 18s awq_kernels
Add latest AWQ CUDA fp16 int4 kernels
Linux CUDA tests #62: Pull request #198 synchronize by dacorvo
May 22, 2024 13:13 5m 4s awq_kernels
Add latest AWQ CUDA fp16 int4 kernels
Linux CUDA tests #61: Pull request #198 opened by dacorvo
May 22, 2024 13:11 1m 48s awq_kernels
feat(quantize): do not use a group_size lower than 128
Linux CUDA tests #60: Commit 934c4a7 pushed by dacorvo
May 16, 2024 15:36 5m 9s main
Prepare for gemm kernels
Linux CUDA tests #59: Pull request #197 opened by dacorvo
May 16, 2024 15:19 5m 19s prepare_for_gemm_kernels