[misc] feat: support rmpad/data-packing in FSDP with transformers #91
Conversation
Shall we add a supported-model list and raise an error if the model is not in the list?
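A minimal sketch of what such a check could look like; the registry name, function name, and model list below are illustrative placeholders, not the actual contents of this PR:

```python
# Hypothetical allowlist check; the set contents are assumed examples only.
SUPPORTED_RMPAD_MODELS = {"llama", "mistral", "gemma"}

def check_rmpad_support(model_type: str) -> None:
    """Fail fast if rmpad/data-packing is requested for an unverified model."""
    if model_type not in SUPPORTED_RMPAD_MODELS:
        raise NotImplementedError(
            f"rmpad is not verified for model type '{model_type}'. "
            f"Supported: {sorted(SUPPORTED_RMPAD_MODELS)}"
        )
```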
Try to avoid using `log_probs_from_logits_response_rmpad`, because there is an unpad op inside, and unpad is a CUDA-blocking op. Instead, we can directly use the already-unpadded `input_ids` from the input.
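A minimal sketch of the suggestion, assuming the caller already holds packed (rmpad) logits and token ids and has applied the causal shift; the function name is hypothetical, not the PR's actual API. Because both inputs are already packed, no unpad op (and hence no host-device sync from the underlying index computation) is needed:

```python
import torch

def log_probs_from_packed_logits(logits_rmpad: torch.Tensor,
                                 input_ids_rmpad: torch.Tensor) -> torch.Tensor:
    """Per-token log-probs on packed (rmpad) tensors, with no unpad op.

    logits_rmpad:    (total_nnz, vocab_size), logits from the packed forward
                     pass, assumed already shifted so position i predicts token i.
    input_ids_rmpad: (total_nnz,), token ids already unpadded by the caller.
    """
    log_probs = torch.log_softmax(logits_rmpad.float(), dim=-1)
    return log_probs.gather(dim=-1, index=input_ids_rmpad.unsqueeze(-1)).squeeze(-1)
```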
I think this list depends on the transformers lib. Not sure where to get this list; I didn't find any doc about the feature in transformers.
Simply add potential models to the CI. If a model passes CI, then add it to the supported list. I guess we can target …
Sure, I will write a new API for unpadding `input_ids`.
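A sketch of what such an API could look like, in plain torch rather than flash_attn's `bert_padding` helpers; the name `unpad_input_ids` and the exact return signature are assumptions, not the PR's final API. The idea is to pay the blocking index computation once at the input boundary so downstream rmpad code never unpads again:

```python
import torch
import torch.nn.functional as F

def unpad_input_ids(input_ids: torch.Tensor, attention_mask: torch.Tensor):
    """Flatten (batch, seqlen) input_ids into packed (total_nnz,) form.

    Returns the packed ids plus the metadata varlen attention kernels expect:
    flat indices of the real tokens, cumulative sequence lengths, and the
    longest sequence length in the batch.
    """
    mask = attention_mask.bool()
    input_ids_rmpad = input_ids[mask]                              # (total_nnz,)
    indices = torch.nonzero(mask.flatten(), as_tuple=False).flatten()
    seqlens = attention_mask.sum(dim=-1, dtype=torch.int32)        # (batch,)
    cu_seqlens = F.pad(torch.cumsum(seqlens, dim=0, dtype=torch.int32), (1, 0))
    max_seqlen = int(seqlens.max())                                # host sync, done once
    return input_ids_rmpad, indices, cu_seqlens, max_seqlen
```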
Shall we add test_transformers.py to the CI? I didn't do it, since I think it only depends on the transformers version and the flash_attn version. So I guess the goal of the CI is to test whether the latest transformers + flash_attn would break our implementation.
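A sketch of the kind of consistency check such a CI test could run: logits at real-token positions should not change when padding is added, which is the invariant rmpad relies on. The model name is a lightweight placeholder; the real test would load the flash_attn-enabled models this PR targets and compare padded vs packed forward passes:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

@torch.no_grad()
def test_padding_invariance(model_name: str = "gpt2"):  # placeholder model
    tok = AutoTokenizer.from_pretrained(model_name)
    if tok.pad_token is None:
        tok.pad_token = tok.eos_token
    model = AutoModelForCausalLM.from_pretrained(model_name).eval()

    texts = ["hello world", "a longer example sentence used to force padding"]
    batch = tok(texts, return_tensors="pt", padding=True)
    padded_logits = model(**batch).logits

    for i, text in enumerate(texts):
        single = tok(text, return_tensors="pt")
        n = single["input_ids"].shape[1]
        ref_logits = model(**single).logits[0]
        # Right padding + causal attention: the first n positions must agree.
        torch.testing.assert_close(padded_logits[i, :n], ref_logits,
                                   atol=2e-2, rtol=2e-2)
```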
After this PR, we should set a minimum required version of transformers.
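A sketch of such a guard; the version floor is a placeholder, since the thread does not state the exact minimum here:

```python
from packaging import version

import transformers

MIN_TRANSFORMERS = "4.38.0"  # placeholder floor, to be pinned once CI confirms it

if version.parse(transformers.__version__) < version.parse(MIN_TRANSFORMERS):
    raise ImportError(
        f"rmpad/data-packing needs transformers>={MIN_TRANSFORMERS}, "
        f"but found {transformers.__version__}"
    )
```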
- Users can set
  ```
  actor_rollout_ref.model.use_rmpad=True \
  +critic.model.use_rmpad=True \
  +reward_model.model.use_rmpad=True
  ```
  to enable rmpad for the different models. The default is `False`.
- Use `AutoModelForTokenClassification` for the Value and Reward Model, instead of using `SequenceClassification` (see the sketch after this list).
- Add `log_probs_from_logits_response_rmpad`.
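A sketch of the model-class switch described above: with rmpad, the critic/reward model needs one scalar per token, which matches the token-classification head with `num_labels=1` rather than the sequence-classification head. The checkpoint name is a placeholder, and whether a given architecture ships a token-classification head depends on the transformers version:

```python
from transformers import AutoModelForTokenClassification

# num_labels=1 gives one scalar output per token, which is what a per-token
# value/reward head needs once sequences are packed.
critic_model = AutoModelForTokenClassification.from_pretrained(
    "deepseek-ai/deepseek-llm-7b-base",  # placeholder checkpoint
    num_labels=1,
)
```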
Resolve: #53
Comparison using DeepSeek 7B and GSM8k:
About 1.7x speedup compared to no rmpad (the original case).