
[Fix] Address remaining issues of supporting MiniCPMV #2977

Open
wants to merge 4 commits into main from minicpmv

Conversation

mickqian
Contributor

Motivation

Address remaining issues of #2785

Modifications

  1. Update the documentation on implementing a new vision LLM
  2. Add tests comparing the logits output of SGLang and HF
  3. Code cleanup

Checklist

@mickqian mentioned this pull request Jan 19, 2025
@mickqian force-pushed the minicpmv branch 9 times, most recently from d4a7a2d to d32da67 on January 19, 2025 17:04
@zhaochenyang20
Collaborator

Great work! Once ready, please ask us to review.

@merrymercy
Copy link
Contributor

Also remove these vLLM dependencies:

from vllm.distributed import parallel_state
from vllm.distributed import utils as dist_utils
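
A minimal, illustrative sketch of what dropping those imports could look like, relying only on torch.distributed; the function names below are placeholders, not SGLang's actual API:

import torch.distributed as dist

def divide(numerator: int, denominator: int) -> int:
    # Same contract as vllm.distributed.utils.divide: exact division or fail loudly.
    assert numerator % denominator == 0, f"{numerator} is not divisible by {denominator}"
    return numerator // denominator

def get_tp_world_size() -> int:
    # Placeholder for parallel_state.get_tensor_model_parallel_world_size();
    # assumes the tensor-parallel group is the default process group.
    return dist.get_world_size() if dist.is_initialized() else 1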

@mickqian force-pushed the minicpmv branch 5 times, most recently from 3453ce7 to 323aaf7 on January 21, 2025 11:35
@mickqian force-pushed the minicpmv branch 3 times, most recently from a04062f to 6f78efe on January 21, 2025 14:09
@mickqian marked this pull request as ready for review January 22, 2025 03:56
max_seqlen,
is_causal=False,
)
if self.use_context_forward:
@mickqian
Contributor (Author)

context_attention_fwd generates different results from the HF implementation (SiglipAttention), probably because:

  1. SiglipAttention performs the softmax in float32:

attn_weights = nn.functional.softmax(attn_weights, dim=-1, dtype=torch.float32).to(query_states.dtype)

  2. SiglipAttention attends over the full padded sequence with a mask, whereas context_attention_fwd skips padding tokens, leaving 0s in the attention weights.
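
A small self-contained illustration of point 1: running the softmax in float32 and casting back, versus keeping it in float16 end to end, yields slightly different attention weights, which compounds into visible logit drift downstream. Purely illustrative; shapes and values are arbitrary.

import torch

torch.manual_seed(0)

# Attention scores in half precision, as a fused kernel would hold them.
scores = torch.randn(1, 8, 16, 16, dtype=torch.float16)

# HF SiglipAttention path: softmax in float32, then cast back to the query dtype.
probs_fp32 = torch.softmax(scores.float(), dim=-1).to(torch.float16)

# A kernel that keeps the softmax in float16.
probs_fp16 = torch.softmax(scores, dim=-1)

# The two disagree by a few ULPs per element.
print((probs_fp32 - probs_fp16).abs().max())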

Collaborator

Does the qwen2vl model have the same issue? If so, maybe we can set use_context_forward to False for qwen2vl, or even remove the use_context_forward branch entirely.

@zhaochenyang20
Collaborator

Cool. I will ask @yizhang2077 to help~

"tgt_sizes": [inputs["tgt_sizes"]],
"im_start_id": [self.tokenizer.im_start_id],
"im_end_id": [self.tokenizer.im_end_id],
"slice_start_id": [self.tokenizer.slice_start_id],
@yizhang2077
Collaborator
Jan 23, 2025

Cool! I think this test could be made more general, so that it can also be used for qwen2vl or other VLMs?
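
One way to make the check model-agnostic is to factor the comparison into a helper that only looks at the final logits tensors, so MiniCPM-V, qwen2vl, or any other VLM can share it. A rough sketch; the name and tolerances are illustrative, not an existing SGLang test utility:

import torch

def assert_logits_close(hf_logits: torch.Tensor, sgl_logits: torch.Tensor,
                        rtol: float = 1e-2, atol: float = 4e-2) -> None:
    # Compare in float32 so the check is independent of each runtime's dtype.
    hf32 = hf_logits.float()
    sgl32 = sgl_logits.float()
    max_diff = (hf32 - sgl32).abs().max().item()
    if not torch.allclose(hf32, sgl32, rtol=rtol, atol=atol):
        raise AssertionError(f"logits diverge, max abs diff = {max_diff:.4f}")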

@@ -457,6 +457,8 @@ def setUpClass(cls):
"--trust-remote-code",
"--chat-template",
"minicpmv",
"--max-total-tokens",
Collaborator

Why do we need to add these args here?
