Commit

init
orionw committed Jul 6, 2024
0 parents commit 46c0fca
Showing 13,434 changed files with 3,319,572 additions and 0 deletions.
The diff you're trying to view is too large. We only load the first 3000 changed files.
19 changes: 19 additions & 0 deletions .github/workflows/main.yml
@@ -0,0 +1,19 @@
name: Sync to Hugging Face hub
on:
  push:
    branches: [main]
  # to run this workflow manually from the Actions tab
  workflow_dispatch:

jobs:
  sync-to-hub:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v3
        with:
          fetch-depth: 0
          lfs: true
      - name: Push to hub
        env:
          HF_TOKEN: ${{ secrets.HF_TOKEN }}
        run: git push https://mteb:$HF_TOKEN@huggingface.co/datasets/mteb/results main
3 changes: 3 additions & 0 deletions CONTRIBUTING
@@ -0,0 +1,3 @@
TODO

Contributors must make sure no file is over 10 MB. If any file is, remove all whitespace from it to keep it under 10 MB. A pre-commit hook should check this.
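The size check described in this note could be sketched as follows. This is only an illustration, not part of the commit: the names `MAX_BYTES`, `find_oversized`, and `main` are hypothetical, and it assumes the hook passes the staged file paths as arguments.

```python
import os

# Assumed limit, taken from the CONTRIBUTING note (10 MB).
MAX_BYTES = 10 * 1024 * 1024


def find_oversized(paths, max_bytes=MAX_BYTES):
    """Return the existing files in `paths` whose size exceeds `max_bytes`."""
    return [p for p in paths if os.path.isfile(p) and os.path.getsize(p) > max_bytes]


def main(paths):
    """Report oversized files; return 1 if any were found, else 0."""
    oversized = find_oversized(paths)
    for path in oversized:
        print(f"{path} is larger than {MAX_BYTES} bytes")
    return 1 if oversized else 0
```

A hook script could call `sys.exit(main(sys.argv[1:]))` so that a non-zero exit code blocks the commit.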
5 changes: 5 additions & 0 deletions README.md
@@ -0,0 +1,5 @@
---
benchmark: mteb
type: evaluation
submission_name: MTEB
---
13,777 changes: 13,777 additions & 0 deletions paths.json

Large diffs are not rendered by default.

15 changes: 15 additions & 0 deletions remove_spaces_from_large_json_files.py
@@ -0,0 +1,15 @@
import os
import glob
import sys
import json

for file in glob.glob("results/*/*/*.json"):
    # if the file is 9.5 MB or larger, rewrite it without indentation
    if os.path.getsize(file) >= 9.5 * 1024 * 1024:
        print(f"Resizing {file} to have no indentations")
        # read it in as json and write it out with no indent
        with open(file, "r") as f:
            data = json.load(f)

        with open(file, "w") as f:
            json.dump(data, f, indent=None)
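Note that `indent=None` removes newlines and indentation, but the default separators of Python's `json` module still put a space after each comma and colon. If the goal is to remove all spaces, as the CONTRIBUTING note says, a variant could pass explicit `separators`. The sketch below is not part of the commit, and `to_compact_json` is a hypothetical helper name.

```python
import json


def to_compact_json(data):
    """Serialize `data` with no whitespace between items or after colons."""
    return json.dumps(data, separators=(",", ":"))


sample = {"scores": [1, 2], "task": "demo"}

# Default separators keep one space after "," and ":".
default_text = json.dumps(sample, indent=None)

# Compact separators drop those spaces as well.
compact_text = to_compact_json(sample)
```

The compact form parses back to the same data, so the change only affects file size.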