Skip to content

Releases: allenai/ir_datasets

v0.4.3

17 Sep 13:30
Compare
Choose a tag to compare

Added:

  • trec-fair-2021/eval topics
  • clinicaltrials/2021/trec-ct-2021
  • c4 and c4/en-noclean-tr/trec-misinfo-2021
  • wikir/en78k and wikir/ens78k
  • msmarco-passage-v2/trec-dl-2021 and msmarco-document-v2/trec-dl-2021
  • mr-tydi
  • mmarco

Misc:

  • some minor changes to clean command
  • msmarco-passage-v2 lookups now performed by ID instead of lz4
  • file linking info not shown when downloading small files
  • fixed cord19/fulltext
  • other minor fixes

v0.4.2

16 Jul 00:03
Compare
Choose a tag to compare

Adds the following datasets:

  • MS MARCO Passage version 2
  • TREC Fair Ranking 2021

A few other minor improvements:

  • Progress bars: units + totals in a few more places
  • Checks for adequate disk space before big downloads (can be disabled with an environment variable)

v0.4.1

03 Jul 15:39
Compare
Choose a tag to compare
  • Adds version 2 of the MS MARCO document collection.
  • Using mirror.ir-datsets.com as a fallback for some small files
  • More examples in the documentation (the python API is now joined by the CLI and a PyTerrier example)
  • Improved bibtex, including a master bib file that can be imported papers (e.g., in overleaf).
  • Other minor improvements

v0.4.0

04 Jun 16:24
Compare
Choose a tag to compare

New datasets:

  • BEIR suite
  • Cranfield
  • CLIRMatrix
  • DPR-W100
  • NQ
  • TREC DL Hard
  • TREC News
  • TripClick

Other:

  • Download dashboard
  • Improved documentation for non-downloadable datasets
  • A beta "more pythonic API"
  • Speeding up library load time
  • Minor bug fixes, improvements, etc.

v0.3.3

15 Mar 19:12
Compare
Choose a tag to compare

dataset migration bugfix

v0.3.2

12 Mar 15:24
Compare
Choose a tag to compare
v0.3.2 version bump

v0.3.1

02 Mar 16:05
Compare
Choose a tag to compare
bump version for release

v0.3.0

01 Mar 09:36
Compare
Choose a tag to compare
slight updates to documentation code, bump version, rebuild docs

v0.2.0

16 Feb 17:23
Compare
Choose a tag to compare

Now includes language codes for queries and docs

v0.1.7

15 Feb 19:24
Compare
Choose a tag to compare
this should finally work for GH releases to pypi