determine dtype based on gpu compute capability #748

zhuangqh · 2024-12-03T06:32:36Z

Is your feature request related to a problem? Please describe.

Use a different dtype for each GPU generation to achieve better performance.
Take the V100 as an example: its compute capability is 7.0, which means it does not natively support bfloat16 precision.
If we pass --torch_dtype=bfloat16 on such a GPU, PyTorch falls back to software emulation of bf16, which performs worse.

Describe the solution you'd like

Have the controller set the dtype value based on the GPU's compute capability.
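
A minimal sketch of the selection rule, not the controller's actual implementation. The function name `pick_torch_dtype` and the constant `BF16_MIN_CAPABILITY` are hypothetical, and the sketch assumes that native bfloat16 support starts at compute capability 8.0 (Ampere), with float16 as the fallback for older GPUs such as the V100.

```python
from typing import Tuple

# Assumption: native bf16 hardware support begins at compute capability 8.0.
BF16_MIN_CAPABILITY: Tuple[int, int] = (8, 0)


def pick_torch_dtype(capability: Tuple[int, int]) -> str:
    """Return a value for --torch_dtype given (major, minor) compute capability.

    Hypothetical helper: returns "bfloat16" when the GPU is assumed to
    support it natively, otherwise "float16".
    """
    # Tuples compare lexicographically, so (7, 5) < (8, 0) <= (8, 6).
    if capability >= BF16_MIN_CAPABILITY:
        return "bfloat16"
    return "float16"


if __name__ == "__main__":
    # V100 reports (7, 0); A100 reports (8, 0).
    for name, cap in [("V100", (7, 0)), ("A100", (8, 0))]:
        print(f"{name}: --torch_dtype={pick_torch_dtype(cap)}")
```

Where PyTorch is available on the node, the capability tuple can be read with `torch.cuda.get_device_capability()`. A controller that has no GPU access would instead need a lookup from GPU model or instance type to compute capability, which is left out of this sketch.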

Describe alternatives you've considered

Additional context

pytorch/pytorch#124996

zhuangqh added the enhancement New feature or request label Dec 3, 2024
