Actions: tatsu-lab/alpaca_eval

Showing runs from all workflows
1,216 workflow runs

Add PairRM best-of-16 to AlpacaEval
alpaca_eval unit tests #284: Pull request #181 opened by jdf-prog
December 5, 2023 07:41 3m 29s main
Modify configs of 01-ai/Yi-34B-Chat to make model verified
alpaca_eval unit tests #283: Pull request #179 opened by HyperdriveHustle
November 29, 2023 09:27 3m 12s main
minor
alpaca_eval unit tests #282: Commit 642cd5e pushed by YannDubs
November 28, 2023 03:35 3m 3s main
pages build and deployment
pages-build-deployment #284: by YannDubs
November 28, 2023 03:35 1m 5s main
show img in readme (#178)
alpaca_eval unit tests #281: Commit 94cd8b6 pushed by YannDubs
November 27, 2023 23:03 3m 14s main
pages build and deployment
pages-build-deployment #283: by YannDubs
November 27, 2023 23:03 1m 24s main
show img in readme
alpaca_eval unit tests #280: Pull request #178 synchronize by YannDubs
November 27, 2023 23:03 2m 53s yann/verified_img
show img in readme
alpaca_eval unit tests #279: Pull request #178 synchronize by YannDubs
November 27, 2023 23:02 2m 54s yann/verified_img
show img in readme
alpaca_eval unit tests #278: Pull request #178 synchronize by YannDubs
November 27, 2023 23:01 2m 54s yann/verified_img
show img in readme
alpaca_eval unit tests #277: Pull request #178 synchronize by YannDubs
November 27, 2023 23:00 2m 52s yann/verified_img
show img in readme
alpaca_eval unit tests #276: Pull request #178 opened by YannDubs
November 27, 2023 22:57 2m 50s yann/verified_img
pages build and deployment
pages-build-deployment #282: by YannDubs
November 27, 2023 22:54 1m 1s main
feat: add way to verify results (#177)
alpaca_eval unit tests #275: Commit 5d66c75 pushed by YannDubs
November 27, 2023 22:54 3m 9s main
feat: add way to verify results
alpaca_eval unit tests #274: Pull request #177 synchronize by YannDubs
November 27, 2023 22:54 2m 52s yann/readme_verified
feat: add way to verify results
alpaca_eval unit tests #273: Pull request #177 synchronize by YannDubs
November 27, 2023 22:53 2m 54s yann/readme_verified
feat: add way to verify results
alpaca_eval unit tests #272: Pull request #177 opened by YannDubs
November 27, 2023 22:45 2m 59s yann/readme_verified
pages build and deployment
pages-build-deployment #281: by github-pages bot
November 26, 2023 20:44 57s main
Add 01-ai/Yi-34B-Chat to AlpacaEval (#175)
Format leaderboard #53: Commit 330cf69 pushed by YannDubs
November 26, 2023 20:43 1m 20s main
Add 01-ai/Yi-34B-Chat to AlpacaEval (#175)
alpaca_eval unit tests #271: Commit 330cf69 pushed by YannDubs
November 26, 2023 20:43 2m 57s main
pages build and deployment
pages-build-deployment #280: by YannDubs
November 26, 2023 20:43 1m 0s main
fix: ensure that people use the correct baseline
alpaca_eval unit tests #270: Commit 588772c pushed by YannDubs
November 26, 2023 20:38 2m 57s main
pages build and deployment
pages-build-deployment #279: by YannDubs
November 26, 2023 20:38 59s main
pages build and deployment
pages-build-deployment #278: by github-pages bot
November 26, 2023 20:30 56s main
Add MiniChat-1.5-3B to AlpacaEval and Fix MiniChat-3B (#176)
alpaca_eval unit tests #269: Commit b226e30 pushed by YannDubs
November 26, 2023 20:29 3m 10s main
Add MiniChat-1.5-3B to AlpacaEval and Fix MiniChat-3B (#176)
Format leaderboard #52: Commit b226e30 pushed by YannDubs
November 26, 2023 20:29 1m 25s main