Replies: 4 comments
-
Also, it would be helpful to understand more about …
-
My understanding of the current framework for how we are handling requests: the above-mentioned proposal might not directly work, as the memory functions have a direct dependency on the inner monologue, i.e. the response received from the LLM. What we can do to optimize it is split the update functions and the retrieval functions into different agents. When the agent has to retrieve some information, it needs to send … Here, we can send the new message using …

@cpacker I would love to understand your perspective. I would also like to understand the goal of doing this. What we are saying is that the memory functions are blocked by the first request, since they depend on it. I do not have a holistic understanding of the project and would like to know your thoughts!
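To make the split concrete, here is a minimal sketch of the idea, assuming a queue-based handoff between the two threads. All names here (`dialogue_loop`, `memory_loop`, `start`) are hypothetical, not MemGPT's actual API: retrieval stays on the critical path because the reply depends on it, while updates, which depend on the inner monologue, are queued to a second thread.

```python
import queue
import threading

memory_update_queue = queue.Queue()

def dialogue_loop(llm_call, retrieve_memory):
    """Dialogue thread: retrieval stays synchronous because the reply needs it."""
    while True:
        user_message = input("> ")
        # Retrieval must happen before the LLM call, so it blocks the reply.
        context = retrieve_memory(user_message)
        reply = llm_call(user_message, context)
        print(reply)
        # Memory updates depend on the inner monologue (the LLM response),
        # but the user does not need to wait for them: hand them off.
        memory_update_queue.put((user_message, reply))

def memory_loop(update_memory):
    """Memory thread: applies updates off the critical path."""
    while True:
        user_message, reply = memory_update_queue.get()
        update_memory(user_message, reply)
        memory_update_queue.task_done()

def start(llm_call, retrieve_memory, update_memory):
    # Run memory updates on a background daemon thread.
    threading.Thread(target=memory_loop, args=(update_memory,), daemon=True).start()
    dialogue_loop(llm_call, retrieve_memory)
```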
-
One advantage I see to this is that eventually we might want the memory management agent to (intelligently, with some provided contextual information) modify the retrieved information, such as filtering it down to the most relevant parts or summarizing it when it is too long. It would also make it easier to use third-party tools for those operations.
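A rough sketch of what that post-processing step might look like; this is purely illustrative, where `cheap_llm` is a stand-in for any completion function and the prompts are placeholders, not anything in MemGPT:

```python
def postprocess_retrieval(results, query, cheap_llm, max_chars=2000):
    """Filter retrieved passages for relevance, then summarize if still too long."""
    relevant = [
        r for r in results
        if cheap_llm(f"Does this passage help answer '{query}'? yes/no:\n{r}")
           .strip().lower().startswith("yes")
    ]
    combined = "\n".join(relevant)
    if len(combined) > max_chars:
        # Summarize rather than truncate, so key facts survive.
        combined = cheap_llm(f"Summarize the following for context injection:\n{combined}")
    return combined
```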
-
Also, you can probably use a cheaper model for the memory management thread.
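As a hypothetical illustration (model names and keys are placeholders, not MemGPT's actual configuration):

```python
# Per-agent model choice: the dialogue thread is user-facing and latency- and
# quality-sensitive, while memory housekeeping can tolerate a cheaper model.
AGENT_MODELS = {
    "dialogue": "gpt-4",
    "memory": "gpt-3.5-turbo",
}
```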
-
Add implicit version of MemGPT with two separate threads (dialogue thread + memory management thread)