Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add notebook to demonstrate small LLM usage #642

Open
wants to merge 6 commits into
base: main
Choose a base branch
from

Conversation

kellyaa
Copy link
Collaborator

@kellyaa kellyaa commented Jan 24, 2025

Why are these changes needed?

Most AG2 examples do not work well with small LLMs (as stated in the docs). However, AG2 can be used with small models given the right techniques. This notebook demonstrates a tactic that can be used to get high performance out of small models in a dynamic RAG workflow.

Related issue number

n/a

Checks

Signed-off-by: Kelly Abuelsaad <[email protected]>
Signed-off-by: Kelly Abuelsaad <[email protected]>
Signed-off-by: Kelly Abuelsaad <[email protected]>
@kellyaa
Copy link
Collaborator Author

kellyaa commented Jan 24, 2025

Ready for review!

@qingyun-wu
Copy link
Collaborator

Thank you Kelly! Could you sign the CLA via this check:
Screenshot 2025-01-26 at 9 22 19 AM

Copy link
Collaborator

@qingyun-wu qingyun-wu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! Thank you!

Copy link
Collaborator

@davorrunje davorrunje left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please fix the spelling errors:

Idenfiy documents to ingest

The Worflow

@davorrunje davorrunje self-assigned this Jan 27, 2025
Signed-off-by: Kelly Abuelsaad <[email protected]>
@CLAassistant
Copy link

CLAassistant commented Jan 27, 2025

CLA assistant check
All committers have signed the CLA.

@kellyaa
Copy link
Collaborator Author

kellyaa commented Jan 27, 2025

@davorrunje Ah, thank you

@marklysze
Copy link
Collaborator

Thanks @kellyaa, love having more on local and smaller models!

Have you had a chance to try it using the AG2 Ollama client instead of the AG2 OpenAI client?

e.g. "api_type"="ollama" and using the client_host for the path?

[
    {
        "model": "llama3.1",
        "api_type": "ollama",
        "client_host": "http://192.168.0.1:11434"
    }
]

@kellyaa
Copy link
Collaborator Author

kellyaa commented Jan 29, 2025

@marklysze
Works like a charm! Updated the code to use this. Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants