Anserini: Prebuilt Indexes
May 1, 2026
Anserini ships with a number of prebuilt indexes. This means that various indexes (inverted indexes, HNSW indexes, etc.) for common collections used in NLP and IR research have already been built and just need to be downloaded (from UWaterloo and Hugging Face servers), which Anserini will handle automatically for you.
Bindings for available prebuilt indexes can be found in the package io.anserini.index.prebuilt under the right type, e.g., PrebuiltInvertedIndex for inverted indexes.
For example, if you specify -index msmarco-v1-passage, Anserini will know that you mean the Lucene index of the MS MARCO V1 passage corpus.
It will then download the index from the specified location(s) and cache it locally.
All of this happens auto-magically!
Getting Started
To download a prebuilt index and view its statistics, you can use the following command:
bin/run.sh io.anserini.index.IndexReaderUtils -index cacm -stats
The output of the above command will be:
Index statistics
----------------
documents: 3204
documents (non-empty): 3204
unique terms: 14363
total terms: 320968
index_path: /home/jimmylin/.cache/pyserini/indexes/lucene-index.cacm.20221005.252b5e.cfe14d543c6a27f4d742fb2d0099b8e0
total_size: 2.9 MB
Note that for inverted indexes, unless the underlying index was built with the -optimize option (i.e., merging all index segments into a single segment), unique_terms will show -1.
Nope, that's not a bug.
Managing Indexes
Downloaded indexes are stored in the first matching cache location:
- If the system property pyserini.cache is set, downloaded indexes are stored in indexes/ under that base cache directory.
- Otherwise, if the environment variable $PYSERINI_CACHE is set, downloaded indexes are stored in indexes/ under that base cache directory.
- Otherwise, if a .cache/ directory exists in the current working directory, downloaded indexes are stored in .cache/pyserini/indexes/ under the current working directory.
- Otherwise, downloaded indexes are stored in ~/.cache/pyserini/indexes/.
Yes, the directory is named pyserini, and that's not a bug: it lets prebuilt indexes be shared between Pyserini and Anserini.
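The lookup order above can be sketched in a few lines of Python. This is only an illustration of the rules as stated on this page, not Anserini's actual code: the function name resolve_index_cache_dir is hypothetical, the pyserini.cache system property is a JVM setting and is therefore passed in as a plain argument, and treating an empty value as "not set" is an assumption.

```python
import os
from pathlib import Path
from typing import Mapping, Optional


def resolve_index_cache_dir(
    system_property: Optional[str],
    env: Mapping[str, str],
    cwd: Path,
    home: Path,
) -> Path:
    """Return the directory where downloaded indexes would be stored.

    `system_property` stands in for the JVM property pyserini.cache, which
    Python cannot read directly; pass None when it is not set.
    Empty strings are treated as "not set" (an assumption of this sketch).
    """
    # 1. System property pyserini.cache: indexes/ under that base directory.
    if system_property:
        return Path(system_property) / "indexes"
    # 2. Environment variable PYSERINI_CACHE: indexes/ under that base directory.
    env_cache = env.get("PYSERINI_CACHE")
    if env_cache:
        return Path(env_cache) / "indexes"
    # 3. A .cache/ directory in the current working directory.
    if (cwd / ".cache").is_dir():
        return cwd / ".cache" / "pyserini" / "indexes"
    # 4. Fallback: the per-user cache under the home directory.
    return home / ".cache" / "pyserini" / "indexes"


if __name__ == "__main__":
    # Show where indexes would land for the current shell, with no system property.
    print(resolve_index_cache_dir(None, os.environ, Path.cwd(), Path.home()))
```

Passing the environment, working directory, and home directory as arguments keeps the sketch easy to check against each of the four cases.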
Another helpful tip is to download and manage the indexes by hand.
As an example, from the metadata in msmarco-v1-passage-inverted.json, you can see that msmarco-v1-passage can be downloaded from:
https://huggingface.co/datasets/castorini/prebuilt-indexes-msmarco-v1/resolve/main/passage/original/lucene-inverted/tf/lucene-inverted.msmarco-v1-passage.20221004.252b5e.tar.gz
The tarball has an MD5 checksum of 678876e8c99a89933d553609a0fd8793.
You can download, verify, unpack, and put the index anywhere you want.
With -index /path/to/index/ you'll get exactly the same output as -index msmarco-v1-passage, except now you've got fine-grained control over managing the index.
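If you manage an index by hand, the verify-and-unpack step might look like the following sketch. It assumes the tarball has already been downloaded with a tool of your choice; the helper names md5_of_file and verify_and_unpack are hypothetical and are not part of Anserini.

```python
import hashlib
import tarfile
from pathlib import Path


def md5_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        # Read fixed-size chunks so large tarballs do not need to fit in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_and_unpack(tarball: Path, expected_md5: str, dest_dir: Path) -> None:
    """Unpack `tarball` into `dest_dir` only if its MD5 checksum matches."""
    actual = md5_of_file(tarball)
    if actual != expected_md5.lower():
        raise ValueError(f"checksum mismatch: expected {expected_md5}, got {actual}")
    dest_dir.mkdir(parents=True, exist_ok=True)
    with tarfile.open(tarball, "r:gz") as tar:
        tar.extractall(dest_dir)
```

For the example above, you would call verify_and_unpack with the downloaded lucene-inverted.msmarco-v1-passage.20221004.252b5e.tar.gz, the checksum 678876e8c99a89933d553609a0fd8793, and a destination directory of your choosing, then pass the unpacked directory to -index.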
By manually managing indexes, you can share indexes between multiple users to conserve space.
The schema of the index location in ~/.cache/pyserini/indexes/ is the tarball name (after unpacking), followed by a dot and the checksum, so msmarco-v1-passage lives in the following location:
~/.cache/pyserini/indexes/lucene-inverted.msmarco-v1-passage.20221004.252b5e.678876e8c99a89933d553609a0fd8793
You can download the index once, put it in a common location, and have each user symlink to the actual index location. The source of the symlink would conform to the schema above, and the target of the symlink would be where your index actually resides.
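As a rough sketch of that symlink setup, the snippet below derives the cache entry name from the tarball name and checksum, then creates the link. The function names cache_entry_name and link_shared_index are hypothetical, and the code assumes a platform where unprivileged users can create symlinks.

```python
import os
from pathlib import Path


def cache_entry_name(tarball_name: str, md5: str) -> str:
    """Build the cache entry name: tarball name without .tar.gz, a dot, the checksum."""
    suffix = ".tar.gz"
    if not tarball_name.endswith(suffix):
        raise ValueError(f"expected a .tar.gz name, got {tarball_name!r}")
    return f"{tarball_name[:-len(suffix)]}.{md5}"


def link_shared_index(shared_index: Path, cache_dir: Path, tarball_name: str, md5: str) -> Path:
    """Create a symlink in `cache_dir` that points at the shared index directory."""
    cache_dir.mkdir(parents=True, exist_ok=True)
    link = cache_dir / cache_entry_name(tarball_name, md5)
    # Leave any existing entry (including a dangling symlink) untouched.
    if not link.exists() and not link.is_symlink():
        os.symlink(shared_index, link, target_is_directory=True)
    return link
```

Each user would run link_shared_index once against their own cache directory; the shared index itself is never copied.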
Recovering from Partial Downloads
A common issue is recovering from partial downloads, for example, if you abort the downloading of a large index tarball. In the standard flow, Anserini downloads the tarball from the servers, verifies the checksum, and then unpacks the tarball. If this process is interrupted, you'll end up in an inconsistent state.
To recover, go to ~/.cache/pyserini/indexes/ or your custom cache directory and remove any tarballs (i.e., .tar.gz files).
If there are any partially unpacked indexes, remove those also.
Then start over (e.g., rerun the command you were running before).
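The tarball cleanup can also be scripted. The sketch below only handles leftover .tar.gz files directly inside the cache directory; partially unpacked indexes still need to be identified and removed by hand, since nothing on this page says how to tell them apart from complete ones. The function names are hypothetical, and dry_run defaults to True so that nothing is deleted until you have reviewed the list.

```python
from pathlib import Path
from typing import List


def leftover_tarballs(cache_dir: Path) -> List[Path]:
    """List .tar.gz files sitting directly in the index cache directory."""
    return sorted(p for p in cache_dir.glob("*.tar.gz") if p.is_file())


def remove_leftover_tarballs(cache_dir: Path, dry_run: bool = True) -> List[Path]:
    """Return the leftover tarballs, deleting them unless `dry_run` is True."""
    found = leftover_tarballs(cache_dir)
    if not dry_run:
        for path in found:
            path.unlink()
    return found
```

A typical session would call remove_leftover_tarballs on your cache directory once with the default dry run, check the returned paths, and then call it again with dry_run=False.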
Available Prebuilt Indexes
Below is a summary of the prebuilt indexes that are currently available.
Note that this page is automatically generated from this test case. This means that the page is updated with every (successful) build. Therefore, do not modify this page directly; modify the test case instead.
Standard Inverted Indexes
MS MARCO
- Anserini Lucene inverted index of the MS MARCO V1 passage corpus
- Anserini Lucene inverted index of the MS MARCO V1 passage corpus ('full' version)
- Anserini Lucene inverted index of the MS MARCO V1 passage corpus ('slim' version)
- Anserini Lucene inverted index of the MS MARCO V1 passage corpus with doc2query-T5 expansions
- Anserini Lucene inverted index of the MS MARCO V1 passage corpus with doc2query-T5 expansions (with stored docvectors)
- Anserini Lucene inverted index of the MS MARCO V1 document corpus
- Anserini Lucene inverted index of the MS MARCO V1 document corpus ('full' version)
- Anserini Lucene inverted index of the MS MARCO V1 document corpus ('slim' version)
- Anserini Lucene inverted index of the MS MARCO V1 document corpus with doc2query-T5 expansions
- Anserini Lucene inverted index of the MS MARCO V1 document corpus with doc2query-T5 expansions (with stored docvectors)
- Anserini Lucene inverted index of the MS MARCO V1 segmented document corpus
- Anserini Lucene inverted index of the MS MARCO V1 segmented document corpus ('full' version)
- Anserini Lucene inverted index of the MS MARCO V1 segmented document corpus ('slim' version)
- Anserini Lucene inverted index of the MS MARCO V1 segmented document corpus with doc2query-T5 expansions
- Anserini Lucene inverted index of the MS MARCO V1 segmented document corpus with doc2query-T5 expansions (with stored docvectors)
- Anserini Lucene inverted index of the MS MARCO V2 passage corpus
- Anserini Lucene inverted index of the MS MARCO V2 passage corpus ('full' version)
- Anserini Lucene inverted index of the MS MARCO V2 passage corpus ('slim' version)
- Anserini Lucene inverted index of the MS MARCO V2 passage corpus with doc2query-T5 expansions
- Anserini Lucene inverted index of the MS MARCO V2 passage corpus with doc2query-T5 expansions (with stored docvectors)
- Anserini Lucene inverted index of the MS MARCO V2 document corpus
- Anserini Lucene inverted index of the MS MARCO V2 document corpus ('full' version)
- Anserini Lucene inverted index of the MS MARCO V2 document corpus ('slim' version)
- Anserini Lucene inverted index of the MS MARCO V2 document corpus with doc2query-T5 expansions
- Anserini Lucene inverted index of the MS MARCO V2 document corpus with doc2query-T5 expansions (with stored docvectors)
- Anserini Lucene inverted index of the MS MARCO V2 segmented document corpus
- Anserini Lucene inverted index of the MS MARCO V2 segmented document corpus ('full' version)
- Anserini Lucene inverted index of the MS MARCO V2 segmented document corpus ('slim' version)
- Anserini Lucene inverted index of the MS MARCO V2 segmented document corpus with doc2query-T5 expansions
- Anserini Lucene inverted index of the MS MARCO V2 segmented document corpus with doc2query-T5 expansions (with stored docvectors)
- Anserini Lucene impact index of the MS MARCO V2 segmented document corpus for uniCOIL, with title prepended
- Anserini Lucene impact index of the MS MARCO V2 segmented document corpus for uniCOIL (noexp) with title prepended
- Anserini Lucene inverted index of the MS MARCO V2.1 document corpus
- Anserini Lucene inverted index of the MS MARCO V2.1 document corpus ('full' version)
- Anserini Lucene inverted index of the MS MARCO V2.1 document corpus ('slim' version)
- Anserini Lucene inverted index of the MS MARCO V2.1 segmented document corpus
- Anserini Lucene inverted index of the MS MARCO V2.1 segmented document corpus ('full' version)
- Anserini Lucene inverted index of the MS MARCO V2.1 segmented document corpus ('slim' version)
msmarco-v1-passage
[README]
msmarco-v1-passage-full
[README]
msmarco-v1-passage-slim
[README]
msmarco-v1-passage.d2q-t5
[README]
msmarco-v1-passage.d2q-t5-docvectors
[README]
msmarco-v1-doc
[README]
msmarco-v1-doc-full
[README]
msmarco-v1-doc-slim
[README]
msmarco-v1-doc.d2q-t5
[README]
msmarco-v1-doc.d2q-t5-docvectors
[README]
msmarco-v1-doc-segmented
[README]
msmarco-v1-doc-segmented-full
[README]
msmarco-v1-doc-segmented-slim
[README]
msmarco-v1-doc-segmented.d2q-t5
[README]
msmarco-v1-doc-segmented.d2q-t5-docvectors
[README]
msmarco-v2-passage
[README]
msmarco-v2-passage-full
[README]
msmarco-v2-passage-slim
[README]
msmarco-v2-passage.d2q-t5
[README]
msmarco-v2-passage.d2q-t5-docvectors
[README]
msmarco-v2-doc
[README]
msmarco-v2-doc-full
[README]
msmarco-v2-doc-slim
[README]
msmarco-v2-doc.d2q-t5
[README]
msmarco-v2-doc.d2q-t5-docvectors
[README]
msmarco-v2-doc-segmented
[README]
msmarco-v2-doc-segmented-full
[README]
msmarco-v2-doc-segmented-slim
[README]
msmarco-v2-doc-segmented.d2q-t5
[README]
msmarco-v2-doc-segmented.d2q-t5-docvectors
[README]
msmarco-v2-doc-segmented.unicoil-0shot
[README]
msmarco-v2-doc-segmented.unicoil-noexp-0shot
[README]
msmarco-v2.1-doc
[README]
msmarco-v2.1-doc-full
[README]
msmarco-v2.1-doc-slim
[README]
msmarco-v2.1-doc-segmented
[README]
msmarco-v2.1-doc-segmented-full
[README]
msmarco-v2.1-doc-segmented-slim
[README]
BEIR
- Anserini Lucene inverted 'flat' index of BEIR collection 'arguana'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'arguana'
- Anserini Lucene inverted 'flat' index of BEIR collection 'bioasq'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'bioasq'
- Anserini Lucene inverted 'flat' index of BEIR collection 'climate-fever'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'climate-fever'
- Anserini Lucene inverted 'flat' index of BEIR collection 'cqadupstack-android'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'cqadupstack-android'
- Anserini Lucene inverted 'flat' index of BEIR collection 'cqadupstack-english'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'cqadupstack-english'
- Anserini Lucene inverted 'flat' index of BEIR collection 'cqadupstack-gaming'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'cqadupstack-gaming'
- Anserini Lucene inverted 'flat' index of BEIR collection 'cqadupstack-gis'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'cqadupstack-gis'
- Anserini Lucene inverted 'flat' index of BEIR collection 'cqadupstack-mathematica'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'cqadupstack-mathematica'
- Anserini Lucene inverted 'flat' index of BEIR collection 'cqadupstack-physics'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'cqadupstack-physics'
- Anserini Lucene inverted 'flat' index of BEIR collection 'cqadupstack-programmers'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'cqadupstack-programmers'
- Anserini Lucene inverted 'flat' index of BEIR collection 'cqadupstack-stats'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'cqadupstack-stats'
- Anserini Lucene inverted 'flat' index of BEIR collection 'cqadupstack-tex'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'cqadupstack-tex'
- Anserini Lucene inverted 'flat' index of BEIR collection 'cqadupstack-unix'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'cqadupstack-unix'
- Anserini Lucene inverted 'flat' index of BEIR collection 'cqadupstack-webmasters'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'cqadupstack-webmasters'
- Anserini Lucene inverted 'flat' index of BEIR collection 'cqadupstack-wordpress'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'cqadupstack-wordpress'
- Anserini Lucene inverted 'flat' index of BEIR collection 'dbpedia-entity'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'dbpedia-entity'
- Anserini Lucene inverted 'flat' index of BEIR collection 'fever'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'fever'
- Anserini Lucene inverted 'flat' index of BEIR collection 'fiqa'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'fiqa'
- Anserini Lucene inverted 'flat' index of BEIR collection 'hotpotqa'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'hotpotqa'
- Anserini Lucene inverted 'flat' index of BEIR collection 'nfcorpus'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'nfcorpus'
- Anserini Lucene inverted 'flat' index of BEIR collection 'nq'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'nq'
- Anserini Lucene inverted 'flat' index of BEIR collection 'quora'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'quora'
- Anserini Lucene inverted 'flat' index of BEIR collection 'robust04'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'robust04'
- Anserini Lucene inverted 'flat' index of BEIR collection 'scidocs'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'scidocs'
- Anserini Lucene inverted 'flat' index of BEIR collection 'scifact'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'scifact'
- Anserini Lucene inverted 'flat' index of BEIR collection 'signal1m'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'signal1m'
- Anserini Lucene inverted 'flat' index of BEIR collection 'trec-covid'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'trec-covid'
- Anserini Lucene inverted 'flat' index of BEIR collection 'trec-news'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'trec-news'
- Anserini Lucene inverted 'flat' index of BEIR collection 'webis-touche2020'
- Anserini Lucene inverted 'multifield' index of BEIR collection 'webis-touche2020'
beir-v1.0.0-arguana.flat
[README]
beir-v1.0.0-arguana.multifield
[README]
beir-v1.0.0-bioasq.flat
[README]
beir-v1.0.0-bioasq.multifield
[README]
beir-v1.0.0-climate-fever.flat
[README]
beir-v1.0.0-climate-fever.multifield
[README]
beir-v1.0.0-cqadupstack-android.flat
[README]
beir-v1.0.0-cqadupstack-android.multifield
[README]
beir-v1.0.0-cqadupstack-english.flat
[README]
beir-v1.0.0-cqadupstack-english.multifield
[README]
beir-v1.0.0-cqadupstack-gaming.flat
[README]
beir-v1.0.0-cqadupstack-gaming.multifield
[README]
beir-v1.0.0-cqadupstack-gis.flat
[README]
beir-v1.0.0-cqadupstack-gis.multifield
[README]
beir-v1.0.0-cqadupstack-mathematica.flat
[README]
beir-v1.0.0-cqadupstack-mathematica.multifield
[README]
beir-v1.0.0-cqadupstack-physics.flat
[README]
beir-v1.0.0-cqadupstack-physics.multifield
[README]
beir-v1.0.0-cqadupstack-programmers.flat
[README]
beir-v1.0.0-cqadupstack-programmers.multifield
[README]
beir-v1.0.0-cqadupstack-stats.flat
[README]
beir-v1.0.0-cqadupstack-stats.multifield
[README]
beir-v1.0.0-cqadupstack-tex.flat
[README]
beir-v1.0.0-cqadupstack-tex.multifield
[README]
beir-v1.0.0-cqadupstack-unix.flat
[README]
beir-v1.0.0-cqadupstack-unix.multifield
[README]
beir-v1.0.0-cqadupstack-webmasters.flat
[README]
beir-v1.0.0-cqadupstack-webmasters.multifield
[README]
beir-v1.0.0-cqadupstack-wordpress.flat
[README]
beir-v1.0.0-cqadupstack-wordpress.multifield
[README]
beir-v1.0.0-dbpedia-entity.flat
[README]
beir-v1.0.0-dbpedia-entity.multifield
[README]
beir-v1.0.0-fever.flat
[README]
beir-v1.0.0-fever.multifield
[README]
beir-v1.0.0-fiqa.flat
[README]
beir-v1.0.0-fiqa.multifield
[README]
beir-v1.0.0-hotpotqa.flat
[README]
beir-v1.0.0-hotpotqa.multifield
[README]
beir-v1.0.0-nfcorpus.flat
[README]
beir-v1.0.0-nfcorpus.multifield
[README]
beir-v1.0.0-nq.flat
[README]
beir-v1.0.0-nq.multifield
[README]
beir-v1.0.0-quora.flat
[README]
beir-v1.0.0-quora.multifield
[README]
beir-v1.0.0-robust04.flat
[README]
beir-v1.0.0-robust04.multifield
[README]
beir-v1.0.0-scidocs.flat
[README]
beir-v1.0.0-scidocs.multifield
[README]
beir-v1.0.0-scifact.flat
[README]
beir-v1.0.0-scifact.multifield
[README]
beir-v1.0.0-signal1m.flat
[README]
beir-v1.0.0-signal1m.multifield
[README]
beir-v1.0.0-trec-covid.flat
[README]
beir-v1.0.0-trec-covid.multifield
[README]
beir-v1.0.0-trec-news.flat
[README]
beir-v1.0.0-trec-news.multifield
[README]
beir-v1.0.0-webis-touche2020.flat
[README]
beir-v1.0.0-webis-touche2020.multifield
[README]
BRIGHT
- Anserini Lucene inverted index of BRIGHT collection 'aops'
- Anserini Lucene inverted index of BRIGHT collection 'biology'
- Anserini Lucene inverted index of BRIGHT collection 'earth-science'
- Anserini Lucene inverted index of BRIGHT collection 'economics'
- Anserini Lucene inverted index of BRIGHT collection 'leetcode'
- Anserini Lucene inverted index of BRIGHT collection 'pony'
- Anserini Lucene inverted index of BRIGHT collection 'psychology'
- Anserini Lucene inverted index of BRIGHT collection 'robotics'
- Anserini Lucene inverted index of BRIGHT collection 'stackoverflow'
- Anserini Lucene inverted index of BRIGHT collection 'sustainable-living'
- Anserini Lucene inverted index of BRIGHT collection 'theoremqa-questions'
- Anserini Lucene inverted index of BRIGHT collection 'theoremqa-theorems'
bright-aops
[README]
bright-biology
[README]
bright-earth-science
[README]
bright-economics
[README]
bright-leetcode
[README]
bright-pony
[README]
bright-psychology
[README]
bright-robotics
[README]
bright-stackoverflow
[README]
bright-sustainable-living
[README]
bright-theoremqa-questions
[README]
bright-theoremqa-theorems
[README]
Other
- Anserini Lucene inverted index of the CACM corpus
cacm
Impact Indexes
MS MARCO
- Anserini Lucene impact index of the MS MARCO V1 passage corpus encoded by SPLADE++ CoCondenser-EnsembleDistil
- Anserini Lucene impact index of the MS MARCO V1 passage corpus encoded by SPLADE-v3
- Anserini Lucene impact index of the MS MARCO V1 segmented document corpus for uniCOIL, with title/segment encoding
- Anserini Lucene impact index of the MS MARCO V1 segmented document corpus for uniCOIL (noexp), with title/segment encoding
- Anserini Lucene impact index of the MS MARCO V2 passage corpus for uniCOIL
- Anserini Lucene impact index of the MS MARCO V2 passage corpus for uniCOIL (noexp)
- Anserini Lucene impact index of the MS MARCO V2.1 segmented document corpus encoded by SPLADE-v3
msmarco-v1-passage.splade-pp-ed
[README]
msmarco-v1-passage.splade-v3
[README]
msmarco-v1-doc-segmented.unicoil
[README]
msmarco-v1-doc-segmented.unicoil-noexp
[README]
msmarco-v2-passage.unicoil-0shot
[README]
msmarco-v2-passage.unicoil-noexp-0shot
[README]
msmarco-v2.1-doc-segmented.splade-v3
BEIR
- Anserini Lucene impact index of BEIR collection 'arguana' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'arguana' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'bioasq' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'bioasq' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'climate-fever' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'climate-fever' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'cqadupstack-android' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'cqadupstack-android' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'cqadupstack-english' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'cqadupstack-english' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'cqadupstack-gaming' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'cqadupstack-gaming' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'cqadupstack-gis' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'cqadupstack-gis' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'cqadupstack-mathematica' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'cqadupstack-mathematica' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'cqadupstack-physics' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'cqadupstack-physics' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'cqadupstack-programmers' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'cqadupstack-programmers' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'cqadupstack-stats' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'cqadupstack-stats' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'cqadupstack-tex' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'cqadupstack-tex' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'cqadupstack-unix' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'cqadupstack-unix' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'cqadupstack-webmasters' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'cqadupstack-webmasters' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'cqadupstack-wordpress' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'cqadupstack-wordpress' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'dbpedia-entity' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'dbpedia-entity' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'fever' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'fever' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'fiqa' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'fiqa' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'hotpotqa' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'hotpotqa' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'nfcorpus' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'nfcorpus' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'nq' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'nq' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'quora' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'quora' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'robust04' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'robust04' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'scidocs' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'scidocs' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'scifact' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'scifact' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'signal1m' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'signal1m' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'trec-covid' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'trec-covid' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'trec-news' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'trec-news' encoded by SPLADE-v3
- Anserini Lucene impact index of BEIR collection 'webis-touche2020' encoded by SPLADE++ CoCondenser-EnsembleDistil (Lucene 9)
- Anserini Lucene impact index of BEIR collection 'webis-touche2020' encoded by SPLADE-v3
beir-v1.0.0-arguana.splade-pp-ed
[README]
beir-v1.0.0-arguana.splade-v3
[README]
beir-v1.0.0-bioasq.splade-pp-ed
[README]
beir-v1.0.0-bioasq.splade-v3
[README]
beir-v1.0.0-climate-fever.splade-pp-ed
[README]
beir-v1.0.0-climate-fever.splade-v3
[README]
beir-v1.0.0-cqadupstack-android.splade-pp-ed
[README]
beir-v1.0.0-cqadupstack-android.splade-v3
[README]
beir-v1.0.0-cqadupstack-english.splade-pp-ed
[README]
beir-v1.0.0-cqadupstack-english.splade-v3
[README]
beir-v1.0.0-cqadupstack-gaming.splade-pp-ed
[README]
beir-v1.0.0-cqadupstack-gaming.splade-v3
[README]
beir-v1.0.0-cqadupstack-gis.splade-pp-ed
[README]
beir-v1.0.0-cqadupstack-gis.splade-v3
[README]
beir-v1.0.0-cqadupstack-mathematica.splade-pp-ed
[README]
beir-v1.0.0-cqadupstack-mathematica.splade-v3
[README]
beir-v1.0.0-cqadupstack-physics.splade-pp-ed
[README]
beir-v1.0.0-cqadupstack-physics.splade-v3
[README]
beir-v1.0.0-cqadupstack-programmers.splade-pp-ed
[README]
beir-v1.0.0-cqadupstack-programmers.splade-v3
[README]
beir-v1.0.0-cqadupstack-stats.splade-pp-ed
[README]
beir-v1.0.0-cqadupstack-stats.splade-v3
[README]
beir-v1.0.0-cqadupstack-tex.splade-pp-ed
[README]
beir-v1.0.0-cqadupstack-tex.splade-v3
[README]
beir-v1.0.0-cqadupstack-unix.splade-pp-ed
[README]
beir-v1.0.0-cqadupstack-unix.splade-v3
[README]
beir-v1.0.0-cqadupstack-webmasters.splade-pp-ed
[README]
beir-v1.0.0-cqadupstack-webmasters.splade-v3
[README]
beir-v1.0.0-cqadupstack-wordpress.splade-pp-ed
[README]
beir-v1.0.0-cqadupstack-wordpress.splade-v3
[README]
beir-v1.0.0-dbpedia-entity.splade-pp-ed
[README]
beir-v1.0.0-dbpedia-entity.splade-v3
[README]
beir-v1.0.0-fever.splade-pp-ed
[README]
beir-v1.0.0-fever.splade-v3
[README]
beir-v1.0.0-fiqa.splade-pp-ed
[README]
beir-v1.0.0-fiqa.splade-v3
[README]
beir-v1.0.0-hotpotqa.splade-pp-ed
[README]
beir-v1.0.0-hotpotqa.splade-v3
[README]
beir-v1.0.0-nfcorpus.splade-pp-ed
[README]
beir-v1.0.0-nfcorpus.splade-v3
[README]
beir-v1.0.0-nq.splade-pp-ed
[README]
beir-v1.0.0-nq.splade-v3
[README]
beir-v1.0.0-quora.splade-pp-ed
[README]
beir-v1.0.0-quora.splade-v3
[README]
beir-v1.0.0-robust04.splade-pp-ed
[README]
beir-v1.0.0-robust04.splade-v3
[README]
beir-v1.0.0-scidocs.splade-pp-ed
[README]
beir-v1.0.0-scidocs.splade-v3
[README]
beir-v1.0.0-scifact.splade-pp-ed
[README]
beir-v1.0.0-scifact.splade-v3
[README]
beir-v1.0.0-signal1m.splade-pp-ed
[README]
beir-v1.0.0-signal1m.splade-v3
[README]
beir-v1.0.0-trec-covid.splade-pp-ed
[README]
beir-v1.0.0-trec-covid.splade-v3
[README]
beir-v1.0.0-trec-news.splade-pp-ed
[README]
beir-v1.0.0-trec-news.splade-v3
[README]
beir-v1.0.0-webis-touche2020.splade-pp-ed
[README]
beir-v1.0.0-webis-touche2020.splade-v3
[README]
BRIGHT
- Anserini Lucene impact index of BRIGHT collection 'aops' encoded by SPLADE-v3
- Anserini Lucene impact index of BRIGHT collection 'biology' encoded by SPLADE-v3
- Anserini Lucene impact index of BRIGHT collection 'earth-science' encoded by SPLADE-v3
- Anserini Lucene impact index of BRIGHT collection 'economics' encoded by SPLADE-v3
- Anserini Lucene impact index of BRIGHT collection 'leetcode' encoded by SPLADE-v3
- Anserini Lucene impact index of BRIGHT collection 'pony' encoded by SPLADE-v3
- Anserini Lucene impact index of BRIGHT collection 'psychology' encoded by SPLADE-v3
- Anserini Lucene impact index of BRIGHT collection 'robotics' encoded by SPLADE-v3
- Anserini Lucene impact index of BRIGHT collection 'stackoverflow' encoded by SPLADE-v3
- Anserini Lucene impact index of BRIGHT collection 'sustainable-living' encoded by SPLADE-v3
- Anserini Lucene impact index of BRIGHT collection 'theoremqa-questions' encoded by SPLADE-v3
- Anserini Lucene impact index of BRIGHT collection 'theoremqa-theorems' encoded by SPLADE-v3
bright-aops.splade-v3
[README]
bright-biology.splade-v3
[README]
bright-earth-science.splade-v3
[README]
bright-economics.splade-v3
[README]
bright-leetcode.splade-v3
[README]
bright-pony.splade-v3
[README]
bright-psychology.splade-v3
[README]
bright-robotics.splade-v3
[README]
bright-stackoverflow.splade-v3
[README]
bright-sustainable-living.splade-v3
[README]
bright-theoremqa-questions.splade-v3
[README]
bright-theoremqa-theorems.splade-v3
[README]
Flat Vector Indexes
BEIR
- Anserini Lucene flat vector index of BEIR collection 'arguana' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'bioasq' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'climate-fever' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'cqadupstack-android' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'cqadupstack-english' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'cqadupstack-gaming' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'cqadupstack-gis' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'cqadupstack-mathematica' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'cqadupstack-physics' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'cqadupstack-programmers' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'cqadupstack-stats' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'cqadupstack-tex' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'cqadupstack-unix' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'cqadupstack-webmasters' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'cqadupstack-wordpress' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'dbpedia-entity' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'fever' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'fiqa' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'hotpotqa' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'nfcorpus' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'nq' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'quora' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'robust04' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'scidocs' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'scifact' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'signal1m' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'trec-covid' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'trec-news' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
- Anserini Lucene flat vector index of BEIR collection 'webis-touche2020' encoded by BGE-base-en-v1.5 (Lucene 10.4.0)
beir-v1.0.0-arguana.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-bioasq.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-climate-fever.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-cqadupstack-android.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-cqadupstack-english.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-cqadupstack-gaming.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-cqadupstack-gis.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-cqadupstack-mathematica.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-cqadupstack-physics.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-cqadupstack-programmers.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-cqadupstack-stats.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-cqadupstack-tex.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-cqadupstack-unix.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-cqadupstack-webmasters.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-cqadupstack-wordpress.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-dbpedia-entity.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-fever.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-fiqa.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-hotpotqa.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-nfcorpus.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-nq.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-quora.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-robust04.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-scidocs.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-scifact.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-signal1m.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-trec-covid.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-trec-news.bge-base-en-v1.5.flat
[README]
beir-v1.0.0-webis-touche2020.bge-base-en-v1.5.flat
[README]
BRIGHT
- Anserini Lucene flat vector index of BRIGHT collection 'aops' encoded by BGE-large-en-v1.5
- Anserini Lucene flat vector index of BRIGHT collection 'biology' encoded by BGE-large-en-v1.5
- Anserini Lucene flat vector index of BRIGHT collection 'earth-science' encoded by BGE-large-en-v1.5
- Anserini Lucene flat vector index of BRIGHT collection 'economics' encoded by BGE-large-en-v1.5
- Anserini Lucene flat vector index of BRIGHT collection 'leetcode' encoded by BGE-large-en-v1.5
- Anserini Lucene flat vector index of BRIGHT collection 'pony' encoded by BGE-large-en-v1.5
- Anserini Lucene flat vector index of BRIGHT collection 'psychology' encoded by BGE-large-en-v1.5
- Anserini Lucene flat vector index of BRIGHT collection 'robotics' encoded by BGE-large-en-v1.5
- Anserini Lucene flat vector index of BRIGHT collection 'stackoverflow' encoded by BGE-large-en-v1.5
- Anserini Lucene flat vector index of BRIGHT collection 'sustainable-living' encoded by BGE-large-en-v1.5
- Anserini Lucene flat vector index of BRIGHT collection 'theoremqa-questions' encoded by BGE-large-en-v1.5
- Anserini Lucene flat vector index of BRIGHT collection 'theoremqa-theorems' encoded by BGE-large-en-v1.5
bright-aops.bge-large-en-v1.5.flat
[README]
bright-biology.bge-large-en-v1.5.flat
[README]
bright-earth-science.bge-large-en-v1.5.flat
[README]
bright-economics.bge-large-en-v1.5.flat
[README]
bright-leetcode.bge-large-en-v1.5.flat
[README]
bright-pony.bge-large-en-v1.5.flat
[README]
bright-psychology.bge-large-en-v1.5.flat
[README]
bright-robotics.bge-large-en-v1.5.flat
[README]
bright-stackoverflow.bge-large-en-v1.5.flat
[README]
bright-sustainable-living.bge-large-en-v1.5.flat
[README]
bright-theoremqa-questions.bge-large-en-v1.5.flat
[README]
bright-theoremqa-theorems.bge-large-en-v1.5.flat
[README]
HNSW Vector Indexes
MS MARCO
- Anserini Lucene HNSW index of the MS MARCO V1 passage corpus encoded by BGE-base-en-v1.5
- Anserini Lucene quantized (int8) HNSW index of the MS MARCO V1 passage corpus encoded by BGE-base-en-v1.5
- Anserini Lucene HNSW index of the MS MARCO V1 passage corpus encoded by Cohere embed-english-v3.0
- Anserini Lucene quantized (int8) HNSW index of the MS MARCO V1 passage corpus encoded by Cohere embed-english-v3.0
- Anserini Lucene HNSW index of the MS MARCO V1 passage corpus encoded by cos-DPR Distil
- Anserini Lucene quantized (int8) HNSW index of the MS MARCO V1 passage corpus encoded by cos-DPR Distil
- Anserini Lucene quantized (int8) HNSW index of the MS MARCO V2.1 segmented document corpus (shard00) encoded by Snowflake's arctic-embed-l model
- Anserini Lucene quantized (int8) HNSW index of the MS MARCO V2.1 segmented document corpus (shard01) encoded by Snowflake's arctic-embed-l model
- Anserini Lucene quantized (int8) HNSW index of the MS MARCO V2.1 segmented document corpus (shard02) encoded by Snowflake's arctic-embed-l model
- Anserini Lucene quantized (int8) HNSW index of the MS MARCO V2.1 segmented document corpus (shard03) encoded by Snowflake's arctic-embed-l model
- Anserini Lucene quantized (int8) HNSW index of the MS MARCO V2.1 segmented document corpus (shard04) encoded by Snowflake's arctic-embed-l model
- Anserini Lucene quantized (int8) HNSW index of the MS MARCO V2.1 segmented document corpus (shard05) encoded by Snowflake's arctic-embed-l model
- Anserini Lucene quantized (int8) HNSW index of the MS MARCO V2.1 segmented document corpus (shard06) encoded by Snowflake's arctic-embed-l model
- Anserini Lucene quantized (int8) HNSW index of the MS MARCO V2.1 segmented document corpus (shard07) encoded by Snowflake's arctic-embed-l model
- Anserini Lucene quantized (int8) HNSW index of the MS MARCO V2.1 segmented document corpus (shard08) encoded by Snowflake's arctic-embed-l model
- Anserini Lucene quantized (int8) HNSW index of the MS MARCO V2.1 segmented document corpus (shard09) encoded by Snowflake's arctic-embed-l model
msmarco-v1-passage.bge-base-en-v1.5.hnsw
[README]
msmarco-v1-passage.bge-base-en-v1.5.hnsw-int8
[README]
msmarco-v1-passage.cohere-embed-english-v3.0.hnsw
msmarco-v1-passage.cohere-embed-english-v3.0.hnsw-int8
msmarco-v1-passage.cosdpr-distil.hnsw
[README]
msmarco-v1-passage.cosdpr-distil.hnsw-int8
[README]
msmarco-v2.1-doc-segmented-shard00.arctic-embed-l.hnsw-int8
[README]
msmarco-v2.1-doc-segmented-shard01.arctic-embed-l.hnsw-int8
[README]
msmarco-v2.1-doc-segmented-shard02.arctic-embed-l.hnsw-int8
[README]
msmarco-v2.1-doc-segmented-shard03.arctic-embed-l.hnsw-int8
[README]
msmarco-v2.1-doc-segmented-shard04.arctic-embed-l.hnsw-int8
[README]
msmarco-v2.1-doc-segmented-shard05.arctic-embed-l.hnsw-int8
[README]
msmarco-v2.1-doc-segmented-shard06.arctic-embed-l.hnsw-int8
[README]
msmarco-v2.1-doc-segmented-shard07.arctic-embed-l.hnsw-int8
[README]
msmarco-v2.1-doc-segmented-shard08.arctic-embed-l.hnsw-int8
[README]
msmarco-v2.1-doc-segmented-shard09.arctic-embed-l.hnsw-int8
[README]
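The ten MS MARCO V2.1 shard indexes above differ only in the two-digit shard number, so scripts that loop over all shards can generate the names instead of hard-coding them. Here is a minimal sketch; the helper `shard_index_names` is hypothetical (it is not part of Anserini) and only builds strings matching the names listed above:

```python
def shard_index_names(num_shards: int = 10) -> list[str]:
    """Hypothetical helper: build the shard index names listed above.

    Assumes shards are numbered from 00 with zero-padding to two digits,
    which matches the ten entries in this section.
    """
    return [
        f"msmarco-v2.1-doc-segmented-shard{i:02d}.arctic-embed-l.hnsw-int8"
        for i in range(num_shards)
    ]


if __name__ == "__main__":
    # Print one index name per line, e.g., to feed into a shell loop.
    for name in shard_index_names():
        print(name)
```

Each generated string can then be passed wherever a prebuilt index name is accepted.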
BEIR
- Anserini Lucene HNSW index of BEIR collection 'arguana' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'bioasq' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'climate-fever' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'cqadupstack-android' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'cqadupstack-english' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'cqadupstack-gaming' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'cqadupstack-gis' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'cqadupstack-mathematica' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'cqadupstack-physics' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'cqadupstack-programmers' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'cqadupstack-stats' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'cqadupstack-tex' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'cqadupstack-unix' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'cqadupstack-webmasters' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'cqadupstack-wordpress' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'dbpedia-entity' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'fever' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'fiqa' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'hotpotqa' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'nfcorpus' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'nq' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'quora' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'robust04' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'scidocs' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'scifact' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'signal1m' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'trec-covid' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'trec-news' encoded by BGE-base-en-v1.5 (Lucene 9)
- Anserini Lucene HNSW index of BEIR collection 'webis-touche2020' encoded by BGE-base-en-v1.5 (Lucene 9)
beir-v1.0.0-arguana.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-bioasq.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-climate-fever.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-cqadupstack-android.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-cqadupstack-english.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-cqadupstack-gaming.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-cqadupstack-gis.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-cqadupstack-mathematica.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-cqadupstack-physics.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-cqadupstack-programmers.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-cqadupstack-stats.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-cqadupstack-tex.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-cqadupstack-unix.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-cqadupstack-webmasters.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-cqadupstack-wordpress.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-dbpedia-entity.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-fever.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-fiqa.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-hotpotqa.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-nfcorpus.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-nq.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-quora.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-robust04.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-scidocs.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-scifact.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-signal1m.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-trec-covid.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-trec-news.bge-base-en-v1.5.hnsw
[README]
beir-v1.0.0-webis-touche2020.bge-base-en-v1.5.hnsw
[README]
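The vector index names in this section appear to follow the pattern `<collection>.<encoder>.<index type>`. Because collection and encoder names themselves contain dots (e.g., `beir-v1.0.0-nq`, `bge-base-en-v1.5`), a plain split on `.` does not recover the parts. The sketch below shows one way to take a name apart. The function `split_index_name` and the `KNOWN_ENCODERS` tuple are hypothetical, derived only from the names listed on this page rather than from any Anserini API:

```python
# Encoder tags as they appear in the index names on this page (an assumption:
# other prebuilt indexes may use tags that are not listed here).
KNOWN_ENCODERS = (
    "bge-base-en-v1.5",
    "bge-large-en-v1.5",
    "cohere-embed-english-v3.0",
    "cosdpr-distil",
    "arctic-embed-l",
)


def split_index_name(name: str) -> tuple[str, str, str]:
    """Hypothetical helper: split a name into (collection, encoder, index type)."""
    # The index type (flat, hnsw, hnsw-int8) contains no dots, so it is
    # everything after the last dot.
    prefix, index_type = name.rsplit(".", 1)
    # The encoder tag may contain dots, so match it as a known suffix.
    for encoder in KNOWN_ENCODERS:
        if prefix.endswith("." + encoder):
            collection = prefix[: -(len(encoder) + 1)]
            return collection, encoder, index_type
    raise ValueError(f"unrecognized encoder in index name: {name}")


if __name__ == "__main__":
    print(split_index_name("beir-v1.0.0-nq.bge-base-en-v1.5.hnsw"))
    print(split_index_name("msmarco-v1-passage.cohere-embed-english-v3.0.hnsw-int8"))
```

Matching against a fixed list of encoder tags keeps the parsing unambiguous, at the cost of needing an update whenever an index with a new encoder is added.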