-
Notifications
You must be signed in to change notification settings - Fork 313
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[v2] Merge main #1927
[v2] Merge main #1927
Conversation
update stella meta
* Add Summary Retrieval Task * Add FaMTEBClassification * Add FaMTEBClustering * Add FaMTEBPairClassification * Add FaMTEBRetrieval and BEIRFA and FaMTEBSTS * Add FaMTEBSummaryRetrieval * Add FaMTEB to benchmarks * fix benchmark names * temporary fix metadata * Fix dataset revisions * Update SummaryRetrievalEvaluator.py * Update task files * Update task files * add data domain and subtask description * Update AbsTaskSummaryRetrieval and FaMTEBSummaryRetrieval * Update AbsTaskSummaryRetrieval * Add mock task * Update AbsTaskSummaryRetrieval * Update AbsTaskSummaryRetrieval * make lint * Refactor SummaryRetrieval to subclass BitextMining * Add aggregated datasets --------- Co-authored-by: mehran <[email protected]> Co-authored-by: e.zeinivand <[email protected]> Co-authored-by: Erfun76 <[email protected]>
* update docs * Apply suggestions from code review Co-authored-by: Isaac Chung <[email protected]> * update readme * Update README.md Co-authored-by: Isaac Chung <[email protected]> --------- Co-authored-by: Isaac Chung <[email protected]>
* Adding a banner to the new MMTEB leaderboard * linting * Update mteb/leaderboard/app.py Co-authored-by: Isaac Chung <[email protected]> * adding reference to mteb arena --------- Co-authored-by: Isaac Chung <[email protected]>
fix: Updated citation for mteb(scandinavian)
* add three out of four datasets in CodeRAG-Bench * add verified CodeRAGStackoverflowPostsRetrieval dataset * clean up code and make some comments * fixed lint errors * addressed comments about code-rag datasets: fixed grammar and remove unnessary code and loop * roll back files which is not supposed to change * fixed the comments in split_by_first_newline() and make the methods private by adding a underscore prefix * refactor to use common args * update task descriptions * add entry in benchmarks * correct the alphanumeric order for the dataset * add in tasks.md * add in tasks.md * update task metadata * update importing path * fix lint errors * correct CodeRAG task metadata description field and id for stackoverflow-posts * fix error in test --------- Co-authored-by: Isaac Chung <[email protected]>
This reverts commit 7e8be03.
# Conflicts: # mteb/abstasks/AbsTask.py # mteb/evaluation/MTEB.py # mteb/load_results/task_results.py # mteb/tasks/Retrieval/eng/NQRetrieval.py
@isaac-chung Can I merge this? Because after that merge I want to merge main with |
Whoops, thanks for pinging me. Got buried in notifications. Will take a look at the last few commits. |
I want to merge |
While some refactoring can be done (e.g.#1944 (comment)), I'd say the core still needs #1950 and its conclusions. |
In some parts yes, but generally I want to merge it to not block development. |
I added
CodeRAGStackoverflowPosts
to exceptions for test withdescriptive_stat
for because it takes more than 128GB of memory to calculate #1595 (comment)Code Quality
make lint
to maintain consistent style.Documentation
Testing
make test-with-coverage
.make test
ormake test-with-coverage
to ensure no existing functionality is broken.Adding datasets checklist
Reason for dataset addition: ...
mteb -m {model_name} -t {task_name}
command.sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
intfloat/multilingual-e5-small
self.stratified_subsampling() under dataset_transform()
make test
.make lint
.Adding a model checklist
mteb.get_model(model_name, revision)
andmteb.get_model_meta(model_name, revision)