Add notebook to demonstrate small LLM usage #642

kellyaa · 2025-01-24T21:32:26Z

Why are these changes needed?

Most AG2 examples do not work well with small LLMs (as stated in the docs). However, AG2 can be used with small models given the right techniques. This notebook demonstrates a tactic that can be used to get high performance out of small models in a dynamic RAG workflow.

Related issue number

n/a

Checks

I've included any doc changes needed for https://docs.ag2.ai/. See https://docs.ag2.ai/docs/contributor-guide/documentation to build and test documentation locally.
I've added tests (if relevant) corresponding to the changes introduced in this PR.
I've made sure all auto checks have passed.

Signed-off-by: Kelly Abuelsaad <[email protected]>

kellyaa · 2025-01-24T22:18:59Z

Ready for review!

qingyun-wu · 2025-01-26T17:23:33Z

Thank you Kelly! Could you sign the CLA via this check:

qingyun-wu

LGTM! Thank you!

davorrunje

Please fix the spelling errors:

Idenfiy documents to ingest

The Worflow

CLAassistant · 2025-01-27T15:35:02Z

All committers have signed the CLA.

kellyaa · 2025-01-27T15:38:10Z

@davorrunje Ah, thank you

marklysze · 2025-01-28T20:28:44Z

Thanks @kellyaa, love having more on local and smaller models!

Have you had a chance to try it using the AG2 Ollama client instead of the AG2 OpenAI client?

e.g. setting "api_type": "ollama" and using client_host for the server address?

[
    {
        "model": "llama3.1",
        "api_type": "ollama",
        "client_host": "http://192.168.0.1:11434"
    }
]
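A minimal sketch of how a config entry like the one above could be built and checked in Python before handing it to an agent. The helper name `build_ollama_config` is hypothetical, and the host address is just the example value from the comment; how the notebook itself wires the config into its agents is not shown in this thread, so the `llm_config` wrapper below is an assumption.

```python
import json
from typing import Dict, List


def build_ollama_config(model: str, client_host: str) -> List[Dict[str, str]]:
    """Return a one-entry config list in the shape suggested above.

    Hypothetical helper for illustration; it is not part of AG2.
    """
    # Reject hosts given without a scheme, e.g. "192.168.0.1:11434".
    if not client_host.startswith(("http://", "https://")):
        raise ValueError("client_host should include the URL scheme")
    return [
        {
            "model": model,
            "api_type": "ollama",
            "client_host": client_host,
        }
    ]


config_list = build_ollama_config("llama3.1", "http://192.168.0.1:11434")

# Assumption: the list is passed to an agent through its llm_config,
# e.g. ConversableAgent(name="assistant", llm_config=llm_config).
llm_config = {"config_list": config_list}

print(json.dumps(config_list, indent=4))
```

Keeping the host in one place makes it easy to point the same notebook at a different Ollama server without editing each agent definition.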

kellyaa · 2025-01-29T15:15:34Z

@marklysze
Works like a charm! Updated the code to use this. Thank you!

kellyaa added 4 commits January 24, 2025 16:10

Add notebook to demonstrate small LLM usage

d50124b

Signed-off-by: Kelly Abuelsaad <[email protected]>

Shorten title

36a0cd4

Signed-off-by: Kelly Abuelsaad <[email protected]>

Formatting fix

119008c

Signed-off-by: Kelly Abuelsaad <[email protected]>

Clear cell output; fix agent names

38dbdea

Signed-off-by: Kelly Abuelsaad <[email protected]>

qingyun-wu requested review from qingyun-wu and emooreatx January 26, 2025 17:22

qingyun-wu approved these changes Jan 26, 2025

davorrunje reviewed Jan 27, 2025

davorrunje self-assigned this Jan 27, 2025

Fix spelling errors

29404a1

Signed-off-by: Kelly Abuelsaad <[email protected]>

Use ollama client instead of openai

2a1adec

Signed-off-by: Kelly Abuelsaad <[email protected]>
