Any Methods to alleviate Curse of Multi-Modalities? #1

VincentVanNF · 2024-11-25T07:49:36Z

Hello, thank you very much for your research findings, particularly regarding the two multimodal hallucination issues mentioned in the paper: SPURIOUS INTER-MODALITY CORRELATIONS and OVERRELIANCE ON UNIMODAL PRIORS.

While performing SFT training on a specific classification task based on the Qwen2-VL-7B model, I encountered the aforementioned hallucination problems during inference on the test set. These issues significantly impact further performance improvements of the model, especially when trying to boost performance from 90 to 95. Are there any methods or references to alleviate these problems? Thank you very much for your response.

LengSicong · 2025-01-13T08:38:17Z

Hi, thanks for your interest!

You can try different decoding methods like VCD.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Any Methods to alleviate Curse of Multi-Modalities? #1

Any Methods to alleviate Curse of Multi-Modalities? #1

VincentVanNF commented Nov 25, 2024

LengSicong commented Jan 13, 2025

Any Methods to alleviate Curse of Multi-Modalities? #1

Any Methods to alleviate Curse of Multi-Modalities? #1

Comments

VincentVanNF commented Nov 25, 2024

LengSicong commented Jan 13, 2025