Handling Very Long LLM Responses #529
Replies: 2 comments 2 replies
-
Just to confirm the question - basically you are using an LLM to generate content, and the maximum number of generated tokens is hit. You want a guide showing the best strategy (or strategies) to continue generating without losing quality? In your case, is the total context window (prompt inputs) exceeded as well, or just the output tokens? The best strategies depend somewhat on the model. Anthropic lets you end with an AI message, and it directly continues that, so the "continue generating" option is fairly straightforward there.
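For example, with the Anthropic Python SDK that continuation loop can look roughly like the sketch below (the model id and token limits are placeholders; adjust for your setup):

```python
# Sketch: keep calling the API, prefilling the assistant turn with what has
# already been generated, until the model stops for a reason other than
# hitting the output-token limit.
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment
prompt = "Generate the node and edge statements for the attached data ..."

completed = ""
while True:
    messages = [{"role": "user", "content": prompt}]
    if completed:
        # Prefill the assistant turn; the model continues from this text.
        # The API rejects prefills that end in whitespace, hence rstrip().
        messages.append({"role": "assistant", "content": completed.rstrip()})
    response = client.messages.create(
        model="claude-3-5-sonnet-latest",  # placeholder model id
        max_tokens=4096,
        messages=messages,
    )
    completed += response.content[0].text
    if response.stop_reason != "max_tokens":
        break  # the model finished on its own

print(completed)
```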
-
The bottleneck here might be the KV cache size; you could try different KV cache optimization/compression techniques to address this.
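For example, if you are serving the model yourself, recent Hugging Face transformers releases can quantize the KV cache during generation. A rough sketch, assuming a transformers version with quantized-cache support and the `quanto` package installed (the model id and cache settings are placeholders):

```python
# Sketch: generation with a quantized KV cache so long outputs fit in memory.
# Only relevant when running the model locally, not when using a hosted API.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Llama-3.1-8B-Instruct"  # placeholder model id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

inputs = tokenizer(
    "Generate node and edge statements for ...", return_tensors="pt"
).to(model.device)

out = model.generate(
    **inputs,
    max_new_tokens=4096,
    cache_implementation="quantized",              # store the KV cache in low precision
    cache_config={"backend": "quanto", "nbits": 4},
)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```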
-
While I completely understand encoding, text splitting, etc. for handling input into an LLM, I have no idea how to move past the output limits.
In my case, I am passing in a very large amount of data and asking the LLM to generate Nodes and Edges for a Graph database. No matter what I attempt, I keep running into the response limit and not obtaining all of the needed statements.
Is there some way to handle very long LLM responses?
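For reference, the kind of chunk-and-merge loop I have been experimenting with looks roughly like the sketch below (`call_llm` is just a stand-in for the actual API call I use):

```python
# Sketch: split the input into chunks, ask for nodes/edges per chunk, and
# merge the results so no single response has to fit everything.
import json

def call_llm(prompt: str) -> str:
    """Hypothetical wrapper around whichever LLM API is in use; returns JSON text."""
    raise NotImplementedError

def chunk(text: str, size: int = 8000) -> list[str]:
    return [text[i:i + size] for i in range(0, len(text), size)]

def extract_graph(data: str) -> dict:
    nodes, edges = {}, set()
    for piece in chunk(data):
        prompt = (
            "From the data below, return JSON with 'nodes' (id, label) and "
            "'edges' (source, target, type). Data:\n" + piece
        )
        result = json.loads(call_llm(prompt))
        for n in result.get("nodes", []):
            nodes[n["id"]] = n                          # dedupe nodes by id
        for e in result.get("edges", []):
            edges.add((e["source"], e["target"], e["type"]))
    return {
        "nodes": list(nodes.values()),
        "edges": [{"source": s, "target": t, "type": ty} for s, t, ty in edges],
    }
```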