
Scalability Concerns for Multi-Agent Problems in Crew AI #1989

Open
alm0ra opened this issue Jan 28, 2025 · 7 comments
Labels
bug Something isn't working

Comments


alm0ra commented Jan 28, 2025

Description

I have been researching how Crew AI addresses scalability issues for multi-agent problems but couldn’t find any detailed information or documentation on this topic. Additionally, I’ve come across discussions where people suggest that Crew AI may not be scalable in handling multi-agent scenarios effectively.

Steps to Reproduce

1. Searched the documentation and forums for scalability solutions in Crew AI.
2. Reviewed community discussions pointing out potential limitations in scalability.

Expected behavior

Screenshots/Code snippets

Operating System

macOS Sonoma

Python Version

3.12

crewAI Version

0.98.0

crewAI Tools Version

1

Virtual Environment

Venv

Evidence

Possible Solution

Additional context

alm0ra added the bug label on Jan 28, 2025

alm0ra commented Jan 28, 2025

I believe the issue lies in the kickoff() method within the Crew class. Currently, it appears to execute agents sequentially, one after another.

        for agent in self.agents:
            agent.i18n = i18n
            # type: ignore[attr-defined] # Argument 1 to "_interpolate_inputs" of "Crew" has incompatible type "dict[str, Any] | None"; expected "dict[str, Any]"
            agent.crew = self  # type: ignore[attr-defined]
            # TODO: Create an AgentFunctionCalling protocol for future refactoring
            if not agent.function_calling_llm:  # type: ignore # "BaseAgent" has no attribute "function_calling_llm"
                agent.function_calling_llm = self.function_calling_llm  # type: ignore # "BaseAgent" has no attribute "function_calling_llm"

            if not agent.step_callback:  # type: ignore # "BaseAgent" has no attribute "step_callback"
                agent.step_callback = self.step_callback  # type: ignore # "BaseAgent" has no attribute "step_callback"

            agent.create_agent_executor()

To improve scalability and bring it closer to a production-ready level, consider adopting a more asynchronous approach. For example, using a message broker like Apache Kafka would allow agents to operate independently and process messages in real time. This would not only enable high scalability but also improve availability, making the system more robust under larger workloads.
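A minimal sketch of the idea, using asyncio queues as a stand-in for broker topics (all names here are hypothetical, none of this is crewAI API): each agent runs as an independent worker that consumes from an inbox and publishes downstream, so an upstream agent is free to pick up the next request as soon as it hands off the current one.

```python
import asyncio

async def agent_worker(name: str, inbox: asyncio.Queue, outbox: asyncio.Queue) -> None:
    """Consume tasks from an inbox queue and publish results downstream."""
    while True:
        task = await inbox.get()
        if task is None:  # shutdown sentinel, forwarded down the chain
            await outbox.put(None)
            break
        # Stand-in for the agent's actual LLM call.
        await outbox.put(f"{task}->{name}")

async def main() -> list:
    # Two agents chained by queues; each can start on the next
    # request as soon as it hands the previous one downstream.
    q0, q1, q2 = asyncio.Queue(), asyncio.Queue(), asyncio.Queue()
    workers = [
        asyncio.create_task(agent_worker("researcher", q0, q1)),
        asyncio.create_task(agent_worker("writer", q1, q2)),
    ]
    for req in ["req1", "req2", "req3"]:
        await q0.put(req)
    await q0.put(None)

    results = []
    while (item := await q2.get()) is not None:
        results.append(item)
    await asyncio.gather(*workers)
    return results

results = asyncio.run(main())
print(results)
```

With a real broker, the queues become topics and each worker becomes a separately deployable (and horizontally scalable) consumer.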


alm0ra commented Jan 28, 2025

If you decide to implement this approach, I recommend building it on top of FastStream. This library simplifies integration with various message brokers, reducing the complexity typically associated with managing them.

However, keep in mind that some setup and handling will still be required for specific brokers. For instance, with Kafka, you’ll need to implement configurations such as creating topics and partitions to ensure the system operates efficiently and is production-ready.
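If that direction were pursued, a FastStream-based consumer might look roughly like the sketch below. Everything here is illustrative: the topic names, the handler body, and the localhost broker address are assumptions, not crewAI code, and it requires a running Kafka broker, so treat it as a configuration sketch rather than a runnable example.

```python
from faststream import FastStream
from faststream.kafka import KafkaBroker

broker = KafkaBroker("localhost:9092")  # assumed broker address
app = FastStream(broker)

@broker.subscriber("agent-tasks")   # hypothetical input topic
@broker.publisher("agent-results")  # hypothetical output topic
async def run_agent(task: str) -> str:
    # Stand-in for handing the task to a crewAI agent and
    # returning its output, which FastStream then publishes.
    return f"processed: {task}"
```

The app would be started with FastStream's CLI (`faststream run module:app`), and topic/partition creation would still need to be handled separately, as noted above.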

I’d be happy to assist with this issue and contribute to making Crew AI more scalable!

@joaomdmoura
@bhancockio

@Vidit-Ostwal

Hi @alm0ra, in most usage I have seen, the output of one agent is the input for another, so sequential execution is essential.

If you have a more general use case, where there is no single flow tying the entire process together, kickoff() can be run asynchronously.

Refer to this documentation


alm0ra commented Jan 28, 2025

@Vidit-Ostwal
This sentence perfectly captures the bottleneck:

“The output of one agent serves as the input for another, making sequential execution essential.”

Now, imagine having multiple agents—for example, five.

The process starts with the first agent, and in the worst-case scenario, all agents execute sequentially. This means that until the fifth agent completes its task, new requests cannot be processed because the first agent remains unavailable.

Now, extend this scenario to a production workflow involving more than ten agents. The bottleneck becomes even more pronounced.

However, if agents communicate asynchronously via messages through a broker, this limitation could be addressed efficiently.
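The throughput gap can be made concrete with a back-of-the-envelope model: if each of five agents takes one "tick" per request, strictly sequential handling of N requests costs 5 * N ticks, whereas a pipelined setup, where a new request enters the first agent as soon as it is free, costs 5 + (N - 1) ticks. A small sketch of that arithmetic (nothing here is crewAI API):

```python
def sequential_ticks(stages: int, requests: int) -> int:
    # Each request must pass through every stage before the next one starts.
    return stages * requests

def pipelined_ticks(stages: int, requests: int) -> int:
    # Once the pipeline is full, one request completes per tick.
    return stages + (requests - 1)

# Five agents, ten incoming requests:
print(sequential_ticks(5, 10))  # 50
print(pipelined_ticks(5, 10))   # 14
```

The single-request latency is unchanged (5 ticks either way); what the broker-style decoupling buys is throughput under concurrent load.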

@Vidit-Ostwal

@alm0ra, now I understand the issue: at most one agent is working at a time (or two, depending on whether they depend on each other), and until the entire run completes, no other request from the production side can be processed.
I think this is a needed feature for making Crew AI more production-ready.
Currently, scaling beyond 10 agents would also be very tricky, as it could lead to latency issues.

Let me know if you are making a PR for this; I would like to contribute along with you, if you don't mind.
I know Kafka a bit.


alm0ra commented Jan 28, 2025

The implementation of this feature ultimately depends on the maintainers of Crew AI and their decisions and plans for the future of the package.

@Vidit-Ostwal

@alm0ra yep, I understand that.
There is no point in developing something that doesn't align with the maintainers' vision.

I am currently facing the same problem, where latency becomes an issue when multiple requests hit the API.

What you have proposed partially solves my problem.

I therefore believe this kind of latency-related bug / feature request will surely come up at some point, and the maintainers will have to think about it.

The only question is how they choose to approach the problem, with Kafka or something else; that is up to them.
