The Feature
When using the litellm router, I noticed that router initialization takes quite a while, scaling roughly linearly with the size of the input model_list.
After profiling the code, the bottleneck appears to be in set_model_list, where _create_deployment is called in a loop. Perhaps we could use a ProcessPoolExecutor to parallelize this part, so that machines with more CPU cores benefit from faster initialization.
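To make the idea concrete, here is a minimal sketch of what the parallelized loop could look like. This is not litellm's actual implementation: `create_deployment` below is a hypothetical stand-in for the router's internal `_create_deployment`, whose exact signature I'm assuming for illustration.

```python
# Sketch only: a process-pool version of the per-deployment setup loop.
# `create_deployment` is a placeholder for Router._create_deployment.
from concurrent.futures import ProcessPoolExecutor

def create_deployment(model_config: dict) -> dict:
    # Placeholder for the real per-model setup work (client creation, etc.).
    return {"model_name": model_config["model_name"], "deployment": model_config}

def set_model_list_parallel(model_list: list[dict]) -> list[dict]:
    # Fan the per-model work out across CPU cores instead of a serial loop.
    # Caveat: arguments and return values must be picklable for process pools.
    with ProcessPoolExecutor() as executor:
        return list(executor.map(create_deployment, model_list))

if __name__ == "__main__":
    models = [{"model_name": f"model-{i}", "litellm_params": {}} for i in range(50)]
    deployments = set_model_list_parallel(models)
    print(f"created {len(deployments)} deployments")
```

One caveat with this approach: anything created per deployment (e.g. client objects) has to survive pickling back to the parent process, so the real change may need to keep non-picklable setup in the main process.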
Motivation, pitch
Our team is mainly trying to create a router per group (one router for each group), and as the number of models grows, the initialization time becomes noticeable, so we would like to speed it up if possible. Based on my testing, naively wrapping the loop in a ProcessPoolExecutor already brings a benefit to initialization time.
Here are the test results I have on a 6-core VM (a sketch for reproducing these timings follows the numbers below):
Time to create router with 10 models: 1.3750 seconds
Time to create router with 20 models: 2.8792 seconds
Time to create router with 30 models: 3.9720 seconds
Time to create router with 40 models: 5.1541 seconds
Time to create router with 50 models: 6.8256 seconds
After parallelization:
Time to create router with 10 models: 0.7394 seconds
Time to create router with 20 models: 0.8583 seconds
Time to create router with 30 models: 1.0763 seconds
Time to create router with 40 models: 1.5589 seconds
Time to create router with 50 models: 1.8502 seconds
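For reference, a rough sketch of how timings like the above could be reproduced, using litellm's documented Router(model_list=...) constructor. The model entries and API key are placeholders, and the exact numbers will of course vary by machine:

```python
# Sketch: time Router construction for growing model lists.
import time
from litellm import Router

for n in (10, 20, 30, 40, 50):
    model_list = [
        {
            "model_name": f"group-model-{i}",
            "litellm_params": {"model": "gpt-3.5-turbo", "api_key": "sk-..."},
        }
        for i in range(n)
    ]
    start = time.perf_counter()
    Router(model_list=model_list)
    elapsed = time.perf_counter() - start
    print(f"Time to create router with {n} models: {elapsed:.4f} seconds")
```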
Are you an ML Ops Team?
Yes
Twitter / LinkedIn details
No response