ADR for IBM Granite Embeddings #169

jwm4 · 2024-12-20T15:46:04Z

This ADR proposes that we use the IBM Granite Embeddings model for the vector retrieval component of our RAG solution.

docs/rag/adrs/granite-embeddings.md

jwm4 · 2025-01-10T19:40:57Z

I received some comments about this proposal in a direct message from @akashgit. I have updated the pull request with additional text to address the concerns that he raised.

jwm4 · 2025-01-10T22:43:48Z

@instructlab/oversight-committee, this has approval from the relevant stakeholders and no outstanding requests for changes, so I think it is ready for final oversight review. Can someone review it and, if it meets the criteria, merge it?

danmcp

Please squash the commits to remove the intermediate changes.

Signed-off-by: Bill Murdock <[email protected]>

jwm4 · 2025-01-17T23:41:40Z

Squashing is complete.

danmcp · 2025-01-18T00:09:37Z

@jwm4 Re:

> I received some comments about this proposal in a direct message from @akashgit. The pull request is updated with additional text to address the concerns that he raised.

Are you confident these concerns were addressed or do we need @akashgit to re-review?

jwm4 · 2025-01-18T01:18:45Z

> > I received some comments about this proposal in a direct message from @akashgit. The pull request is updated with additional text to address the concerns that he raised.
>
> Are you confident these concerns were addressed or do we need @akashgit to re-review?

@akashgit was clear that he doesn't want to be a blocker, so I don't think we need to wait for a re-review. I let him know on January 9 that I had updated the text to address the issues he raised, so if he wanted to submit a formal review on GitHub, I think he would have done so by now.

He is concerned that we don't have enough data to be truly data-driven on this issue. I agree with that assessment, and I updated the Consequences section to be more explicit about the risk that we will need to pivot to another default as more data emerges. I think that's an acceptable risk because changing the default embedding model is a fairly low-cost operation.

I think it is time to decide what we will do now and then pivot later if it becomes clear that better options are available. I also think that assessing the trade-offs among hardware requirements, accuracy, IP/legal risk, etc. will always be very subjective. I hope the "key considerations" listed in this document will provide a framework for making these subjective judgments long after the specific decision about which model is best for January 2025 has become irrelevant.

danmcp

Thanks for the explanation!

anastasds reviewed Jan 3, 2025

docs/rag/adrs/granite-embeddings.md (outdated; resolved)

jwm4 force-pushed the jwm4-embed-adr branch from 0cc4cf8 to 3a0d2b5 Compare January 3, 2025 17:51

anastasds mentioned this pull request Jan 6, 2025

feat: Expose document store Python API in instructlab/instructlab rag submodule instructlab/instructlab#2832

Merged

6 tasks

anastasds approved these changes Jan 7, 2025

anastasds approved these changes Jan 10, 2025

hemajv approved these changes Jan 10, 2025

dmartinol approved these changes Jan 10, 2025

danmcp requested changes Jan 17, 2025

ADR for IBM Granite Embeddings

dd82bec

Signed-off-by: Bill Murdock <[email protected]>

jwm4 force-pushed the jwm4-embed-adr branch from 9859ed9 to dd82bec Compare January 17, 2025 23:40

danmcp approved these changes Jan 18, 2025

danmcp merged commit b18c310 into instructlab:main Jan 18, 2025
4 checks passed

jwm4 deleted the jwm4-embed-adr branch January 18, 2025 01:35
