Accessing document vectors for msmarco-v1-passage prebuilt index #2034
-
Hello! I'm trying to view the TF-IDF or BM25 vectors for the msmarco-v1-passage prebuilt index. But when I try running
I get the following error:
I also get the same error when trying to dump out all document vectors with:
Neither of these errors occur with the Do you know what could be causing this error? Apologies if this is not an issue with pyserini and I'm just missing what to do. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
Ah, you need the version of the prebuilt index that stores the docvectors. Try Which is linked from this page: https://github.com/castorini/pyserini/blob/master/docs/prebuilt-indexes.md |
Beta Was this translation helpful? Give feedback.
Ah, you need the version of the prebuilt index that stores the docvectors. Try
msmarco-v1-passage-full
: https://github.com/castorini/pyserini/blob/master/pyserini/resources/index-metadata/lucene-inverted.msmarco-v1-passage.20221004.252b5e.README.mdWhich is linked from this page: https://github.com/castorini/pyserini/blob/master/docs/prebuilt-indexes.md