Introducing the MMWorld benchmark #677

jkooy · 2025-01-27T20:21:20Z

Dear Qwen team,

We are a big fan of your Qwen series and noticed that you have evaluated your models on several video-language benchmarks. We were wondering if you might be interested in evaluating your models on our MMWorld benchmark (https://arxiv.org/abs/2406.08407). MMWorld is designed to assess models' reasoning capabilities across various reasoning tasks and disciplines and could serve as a useful evaluation benchmark for your model development. Thank you!

ShuaiBai623 · 2025-01-28T04:47:29Z

Thank you for your recognition and good video evaluation work. We will test it after the Spring Festival.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introducing the MMWorld benchmark #677

Introducing the MMWorld benchmark #677

jkooy commented Jan 27, 2025

ShuaiBai623 commented Jan 28, 2025

Introducing the MMWorld benchmark #677

Introducing the MMWorld benchmark #677

Comments

jkooy commented Jan 27, 2025

ShuaiBai623 commented Jan 28, 2025