Pyserini Release Notes (v0.9.3.0)

May 27, 2020 ยท View on GitHub

Release date: May 27, 2020

  • Implemented pseudo-relevance classifier reranking technique.
  • Added TfidfVectorizer to obtain vector representations of arbitrary documents from index. Verified that class works as expected by replicating classification demo on 20 Newsgroups with scikit-learn.
  • Added bindings to TREC COVID round 3 topics.
  • Added script for CORD-19 length outlier detection.
  • Added __main__ to pyserini.search to perform TREC runs from the command line.
  • Fixed issues with computing BM25 term weights and query-document scores.
  • Exposed access to basic index statistics in IndexReaderUtils.

Contributors (This Release)

Sorted by number of commits:

All Contributors

Sorted by number of commits, according to GitHub: