[Bug] recent change of param "block_size_x/y, unroll" in dlight/gpu/matmul.py significantly decrease q4f16_1 prefill speed on android 8gen3 device #1079

Triggered via issue October 30, 2024 06:31
Status Skipped
Total duration 32m 21s

tvmbot.yml

on: issue_comment
run-tvm-bot
0s