You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
BRIGHT: "A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval ... 1,398 real-world queries collected from [12] diverse domains (such as economics, psychology, robotics, software engineering, earth sciences, etc.), sourced from naturally occurring or carefully curated human data."
Each dataset would support their own queries, passages, and qrels. The StackExchange datasets (biology - pony) include both passage-level and document-level labels for the passage/long-document retrieval settings, so probably also bright/{domain}/long_documents that inherits queries from the base task, implicitly treating the passage retrieval setting as default as is the case in the paper.
Checklist
Mark each task once completed. All should be checked prior to merging a new dataset.
Dataset definition (in ir_datasets/datasets/[topid].py)
Tests (in tests/integration/[topid].py)
Metadata generated (using ir_datasets generate_metadata command, should appear in ir_datasets/etc/metadata.json)
Downloadable content (in ir_datasets/etc/downloads.json)
Download verification action (in .github/workflows/verify_downloads.yml). Only one needed per topid.
Any small public files from NIST (or other potentially troublesome files) mirrored in https://github.com/seanmacavaney/irds-mirror/. Mirrored status properly reflected in downloads.json.
Additional comments/concerns/ideas/etc.
queries, qrels (same file as queries), and docs are all stored in single .parquet files on HF (max file size: leetcode-00000-of-00001.parquet: 211 MB) unlike other datasets sources in downloads.json
The text was updated successfully, but these errors were encountered:
Dataset Information:
BRIGHT: "A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval ... 1,398 real-world queries collected from [12] diverse domains (such as economics, psychology, robotics, software engineering, earth sciences, etc.), sourced from naturally occurring or carefully curated human data."
Links to Resources:
Dataset ID(s) & supported entities:
bright/biology
bright/earth_science
bright/economics
bright/psychology
bright/robotics
bright/stackoverflow
bright/sustainable_living
bright/pony
bright/leetcode
bright/aops
bright/theoremqa_theorems
bright/theoremqa_questions
Each dataset would support their own queries, passages, and qrels. The StackExchange datasets (biology - pony) include both passage-level and document-level labels for the passage/long-document retrieval settings, so probably also
bright/{domain}/long_documents
that inherits queries from the base task, implicitly treating the passage retrieval setting as default as is the case in the paper.Checklist
Mark each task once completed. All should be checked prior to merging a new dataset.
ir_datasets/datasets/[topid].py
)tests/integration/[topid].py
)ir_datasets generate_metadata
command, should appear inir_datasets/etc/metadata.json
)ir_datasets/etc/[topid].yaml
)ir_datasets/etc/downloads.json
).github/workflows/verify_downloads.yml
). Only one needed pertopid
.downloads.json
.Additional comments/concerns/ideas/etc.
.parquet
files on HF (max file size:leetcode-00000-of-00001.parquet
: 211 MB) unlike other datasets sources indownloads.json
The text was updated successfully, but these errors were encountered: