-
Notifications
You must be signed in to change notification settings - Fork 35
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ADR for IBM Granite Embeddings #169
Conversation
I received some comments about this proposal in a direct message from @akashgit . The pull request is updated with additional text to address the concerns that he raised. |
@instructlab/oversight-committee , this has approval from relevant stakeholders and no request for more changes so I think it is ready for final oversight now. Can someone review this and if it meets the criteria merge it too? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please squash commits to remove the intermediate changes
Signed-off-by: Bill Murdock <[email protected]>
Squashing is complete. |
@akashgit was clear that he doesn't want to be the blocker, so I don't think we need to wait for a re-review. I let him know on January 9 that I had updated the text to address the issues he raised, so if he wanted to submit a formal review in github, I think he would have by now. He is concerned that we don't have enough data to be really data driven on this issue. I agree with that assessment and I updated the Consequences section to be more explicit about the risk that we will need to pivot to another default as more data emerges. I think that's an acceptable risk because changing the default embedding model is a fairly low cost operation. I think it is time to make a decision now for what we will do now and then pivot later if it becomes clear that there are better options available. I also think assessing the trade-offs between issues like hardware requirements, accuracy, IP/legal risk, etc. will always be very subjective. I am hoping the "key considerations" listed in this document will provide a framework for making these subjective judgements long after the specific decision about which model is the best one for January 2025 has become irrelevant. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the explanation!
This ADR proposes that we will use the IBM Granite Embeddings model for the vector retrieval component of our RAG solution.