Sop plugin refactoring #75

sducouedic · 2025-02-03T12:07:28Z

This PR accompanies this other PR (see the detailed description of the refactoring there). Together they turn vLLM's Spyre support into a plugin. The only Spyre-related code remaining in vLLM is in the scheduler.
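To illustrate the pattern the commits in this PR move towards (an out-of-tree platform that declares `device_type` and `supported_quantization` as class attributes instead of relying on manual checks in the core), here is a minimal, self-contained sketch. It does not import vLLM: `PlatformEnum`, `Platform`, and `verify_quantization` below are simplified stand-ins written for this example, and the values `"cpu"` and `["gptq"]` are assumptions, not taken from the plugin's code.

```python
from enum import Enum, auto
from typing import List, Optional


class PlatformEnum(Enum):
    # Stand-in enum: only the member relevant to this sketch is modeled.
    OOT = auto()  # "out of tree", i.e. a platform provided by a plugin


class Platform:
    """Simplified stand-in for a platform base class (not vLLM's real class)."""

    _enum: PlatformEnum
    device_type: str
    # An empty list means "no restriction" in this sketch.
    supported_quantization: List[str] = []

    def verify_quantization(self, quant: Optional[str]) -> None:
        # Generic check driven entirely by the subclass attribute, so a
        # plugin does not need its own hand-written validation.
        if quant and self.supported_quantization \
                and quant not in self.supported_quantization:
            raise ValueError(
                f"{quant} quantization is not supported on this platform; "
                f"supported: {self.supported_quantization}")


class SpyrePlatform(Platform):
    """Hypothetical plugin-side platform definition."""

    _enum = PlatformEnum.OOT             # no dedicated enum member needed
    device_type = "cpu"                  # assumed value, declared once here
    supported_quantization = ["gptq"]    # assumed value, for illustration


platform = SpyrePlatform()
platform.verify_quantization("gptq")     # accepted
platform.verify_quantization(None)       # no quantization requested: accepted
```

The design point is that the core only ever talks to the base-class interface, so adding or removing a backend does not require touching core validation code.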


This PR improves code readability by:

- [commit 1](0c9bec0): removing the batch size argument in the `load_model` function [here](https://github.com/IBM/vllm/blob/124f3a961d1a9ce2628c01fe56dfcc589a49c8dd/vllm/worker/spyre_worker.py#L135), since it is unused in `SpyreModelRunner` ([here](https://github.com/IBM/vllm/blob/124f3a961d1a9ce2628c01fe56dfcc589a49c8dd/vllm/worker/spyre_model_runner.py#L104)) as well as in `SpyreEmbeddingModelRunner` ([here](https://github.com/IBM/vllm/blob/124f3a961d1a9ce2628c01fe56dfcc589a49c8dd/vllm/worker/spyre_embedding_model_runner.py#L52)).
- _[commit 2](beadbd5): function renaming to better distinguish between the imported function `init_distributed_environment` from `vllm.distributed` and our own function `self.init_distributed_environment`._ (**reverted since this seems to be a convention introduced in other worker classes**)

Signed-off-by: Yannick Schnider <[email protected]>


github-actions · 2025-02-03T12:07:41Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

- Add the `ready` label to the PR
- Enable auto-merge.

🚀

tdoublep and others added 25 commits January 17, 2025 21:02

Adapt SpyreWorker to some changes from upstream.

8dd2228

Signed-off-by: Thomas Parnell <[email protected]>

Working on CPU in TP

82bda5c

Signed-off-by: Thomas Parnell <[email protected]>

Resolve issue with TP=1

bef766f

Signed-off-by: Thomas Parnell <[email protected]>

Working on TP>1; there is a dynamic shape bug though

a0d55de

Signed-off-by: Thomas Parnell <[email protected]>

Dynamic shapes works too

20f5565

Signed-off-by: Thomas Parnell <[email protected]>

Set env vars in CI test

9f973d1

Signed-off-by: Thomas Parnell <[email protected]>

Remove debug prints

3209382

Signed-off-by: Thomas Parnell <[email protected]>

Remove unused code

3aa8196

Signed-off-by: Thomas Parnell <[email protected]>

fmt

a3af854

Signed-off-by: Thomas Parnell <[email protected]>

Fix embedding model runner + fmt

5510021

Signed-off-by: Thomas Parnell <[email protected]>

Remove comment

6e8de27

Signed-off-by: Thomas Parnell <[email protected]>

Pass template argument

9703645

Signed-off-by: Thomas Parnell <[email protected]>

enable CI tests on every PR

124f3a9

Signed-off-by: Thomas Parnell <[email protected]>

minimal changes in vllm to transform to spyre plugin

bacc8dc

Signed-off-by: Sophie du Couédic <[email protected]>

removal and built-in spyre definition, worker, and executor

c70568c

Signed-off-by: Sophie du Couédic <[email protected]>

To discuss: use 'empty' device and install 'requirements-common.txt'

d2322f1

Signed-off-by: Sophie du Couédic <[email protected]>

removed additional spyre-related, examples and tests scripts were added to plugin and tested

b1d9d1f

Signed-off-by: Sophie du Couédic <[email protected]>

use base class method to validate quantization (rather than manually) by setting attribute 'supported_quantization' in the plugin

68ce274

Signed-off-by: Sophie du Couédic <[email protected]>

remove PlatformEnum.SPYRE and use PlatformEnum.OOT in the plugin

775fdbb

Signed-off-by: Sophie du Couédic <[email protected]>

move 'VLLM_SPYRE_DYNAMO_BACKEND' to envs.py file in spyre plugin

756d33c

Signed-off-by: Sophie du Couédic <[email protected]>

remove spyre from installation configurations and files

7716448

Signed-off-by: Sophie du Couédic <[email protected]>

remove spyre tests from github workflow

0d6bee9

Signed-off-by: Sophie du Couédic <[email protected]>

remove manual setting of device to 'cpu' for Spyre, rather set 'device_type' in Platform

ca4a8e7

Signed-off-by: Sophie du Couédic <[email protected]>

reformat based on 'bash format.sh'

4e606a6

Signed-off-by: Sophie du Couédic <[email protected]>
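One of the commits above moves `VLLM_SPYRE_DYNAMO_BACKEND` into an `envs.py` file owned by the plugin. A minimal sketch of what such a plugin-local module could look like is given below, assuming a lazily evaluated lookup table. The helper `get_env` and the default value `"eager"` are hypothetical and chosen only for illustration; the plugin's real default is not stated in this PR.

```python
import os
from typing import Any, Callable, Dict

# Hypothetical default, used only for this sketch.
_DEFAULT_DYNAMO_BACKEND = "eager"

# Each entry maps a variable name to a zero-argument callable, so the
# environment is read at access time rather than at import time.
environment_variables: Dict[str, Callable[[], Any]] = {
    "VLLM_SPYRE_DYNAMO_BACKEND":
    lambda: os.getenv("VLLM_SPYRE_DYNAMO_BACKEND", _DEFAULT_DYNAMO_BACKEND),
}


def get_env(name: str) -> Any:
    """Return the current value of a plugin-owned environment variable."""
    if name not in environment_variables:
        raise AttributeError(f"unknown environment variable: {name!r}")
    return environment_variables[name]()
```

Keeping the variable in the plugin means the core package no longer needs to know about any Spyre-specific setting.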
