v0.5.1
What's Changed
- [MINOR FIX / TYPO] Update trec-robust04.yaml by @cakiki in #137
- .z compression support for robust04 by @seanmacavaney in #139
- moving msmarco-passage scoreddocs around by @seanmacavaney in #142
- mmarco updates (files hosted elsewhere & new version of some sources) by @seanmacavaney in #145
- new data available for mmarco (scoreddocs, docpairs, and dev/small) by @seanmacavaney in #146
- added tripclick/train/hofstaetter-triples by @seanmacavaney in #147
- additional versions of msmarco-passage triples by @seanmacavaney in #149
- mMARCO v2 by @seanmacavaney in #150
- Anchor Text for msmarco-document and msmarco-document-v2 by @seanmacavaney in #155
- mmarco source files renamed by @seanmacavaney in #153
- TREC CAsT 2019, 2020 by @seanmacavaney in #156
- HC4 by @eugene-yang in #158
- LoTTE dataset by @seanmacavaney in #159
- kilt by @seanmacavaney in #161
- some trec 2021 qrels released by @seanmacavaney in #162
- some trec 2021 qrels released by @seanmacavaney in #171
- CODEC by @seanmacavaney in #172
- improved HTML/XML parser, TREC 7 and 8 by @seanmacavaney in #173
- fixed and tested issue affecting some clueweb lookups by @seanmacavaney in #174
- cache hc4 topics/qrels by @seanmacavaney in #176
- wikiclir by @seanmacavaney in #178
- NeuCLIR Collection 1 (documents and HC4-filtered subset) by @eugene-yang in #179
- neuMARCO by @seanmacavaney in #181
New Contributors
- @cakiki made their first contribution in #137
- @eugene-yang made their first contribution in #158
Full Changelog: v0.5.0...v0.5.1