QwenVL2.5 bfloat16 Type Issue #706

qirui-chen · 2025-02-01T08:56:09Z

In the code, the tensor is forcibly cast to float32, while cos and sin may be inconsistent with the tensor due to the model type being bfloat16.

This problem occurred when using the flash attention 2 & bf16 Qwen2.5VL model.

sorenmc · 2025-02-05T10:52:15Z

Looks like they are fixing it in this pr huggingface/transformers#35837

marwankefah · 2025-02-05T12:25:15Z

Until they accept the pull request, you can modify modeling_qwen2_5_vl.py in the Transformers library by changing the following line:

tensor_ = tensor.float()

to

tensor_ = tensor

hiyouga mentioned this issue Feb 2, 2025

Qwen2.5-VL full sft dtype error hiyouga/LLaMA-Factory#6791

Open

1 task

Provide feedback