Skip to content

Actions: volcengine/verl

model_rmpad

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
310 workflow runs
310 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[misc]: fix ci and add warning to make sure wandb is used when loggin…
model_rmpad #357: Commit 3d566ad pushed by PeterSH6
February 9, 2025 16:21 31m 6s main
February 9, 2025 16:21 31m 6s
delete redundant append_to_dict (#236)
model_rmpad #356: Commit 6427f50 pushed by vermouth1992
February 9, 2025 14:45 1h 36m 16s main
February 9, 2025 14:45 1h 36m 16s
[misc]: fix ci and add warning to make sure wandb is used when logging val results
model_rmpad #355: Pull request #237 synchronize by PeterSH6
February 9, 2025 13:59 2h 15m 17s PeterSH6:gm/fix_log_ci
February 9, 2025 13:59 2h 15m 17s
Add option to log validation generations to wandb (#177)
model_rmpad #353: Commit d0725a6 pushed by PeterSH6
February 9, 2025 13:42 1h 1m 13s main
February 9, 2025 13:42 1h 1m 13s
delete redundant append_to_dict
model_rmpad #352: Pull request #236 opened by Cppowboy
February 9, 2025 13:24 41m 19s Cppowboy:main
February 9, 2025 13:24 41m 19s
Add stronger reward verification sandbox
model_rmpad #351: Pull request #233 synchronize by ZefanW
February 9, 2025 13:05 58m 53s ZefanW:sandbox
February 9, 2025 13:05 58m 53s
Feature/add remax support
model_rmpad #350: Pull request #234 opened by liziniu
February 9, 2025 12:56 Action required liziniu:feature/add-remax-support
February 9, 2025 12:56 Action required
Add stronger reward verification sandbox
model_rmpad #349: Pull request #233 synchronize by ZefanW
February 9, 2025 12:41 24m 51s ZefanW:sandbox
February 9, 2025 12:41 24m 51s
Add stronger reward verification sandbox
model_rmpad #348: Pull request #233 opened by ZefanW
February 9, 2025 12:11 45m 9s ZefanW:sandbox
February 9, 2025 12:11 45m 9s
add requirements (#231)
model_rmpad #347: Commit 577a341 pushed by vermouth1992
February 9, 2025 11:41 48m 30s main
February 9, 2025 11:41 48m 30s
docs: add programming model guide (#230)
model_rmpad #346: Commit e842b73 pushed by eric-haibin-lin
February 9, 2025 11:10 33m 19s main
February 9, 2025 11:10 33m 19s
add requirements
model_rmpad #345: Pull request #231 opened by ZefanW
February 9, 2025 10:03 1h 32m 19s ZefanW:dummy
February 9, 2025 10:03 1h 32m 19s
docs: add programming model guide
model_rmpad #344: Pull request #230 opened by eric-haibin-lin
February 9, 2025 10:02 36m 57s eric-haibin-lin:hlin/program-model
February 9, 2025 10:02 36m 57s
implement REINFORCE++ algorithm (#228)
model_rmpad #342: Commit bdb50ac pushed by vermouth1992
February 9, 2025 09:37 31m 26s main
February 9, 2025 09:37 31m 26s
implement REINFORCE++ algorithm
model_rmpad #341: Pull request #228 synchronize by 4332001876
February 9, 2025 09:28 30m 35s 4332001876:feat_reinforce_plus_plus
February 9, 2025 09:28 30m 35s
implement REINFORCE++ algorithm
model_rmpad #340: Pull request #228 synchronize by 4332001876
February 9, 2025 08:32 27m 14s 4332001876:feat_reinforce_plus_plus
February 9, 2025 08:32 27m 14s
implement REINFORCE++ algorithm
model_rmpad #339: Pull request #228 synchronize by 4332001876
February 9, 2025 07:47 30m 50s 4332001876:feat_reinforce_plus_plus
February 9, 2025 07:47 30m 50s
Add push to hub functionality
model_rmpad #338: Pull request #196 synchronize by NielsRogge
February 8, 2025 17:17 Action required NielsRogge:add_push_to_hub
February 8, 2025 17:17 Action required
Add push to hub functionality
model_rmpad #337: Pull request #196 synchronize by NielsRogge
February 8, 2025 17:15 Action required NielsRogge:add_push_to_hub
February 8, 2025 17:15 Action required
Add push to hub functionality
model_rmpad #336: Pull request #196 synchronize by NielsRogge
February 8, 2025 17:15 Action required NielsRogge:add_push_to_hub
February 8, 2025 17:15 Action required
[ckpt] feat: integrate checkpoint resume in RL ray trainer (#222)
model_rmpad #334: Commit 5a400bf pushed by PeterSH6
February 8, 2025 13:35 31m 39s main
February 8, 2025 13:35 31m 39s
[ckpt] feat: integrate checkpoint resume in RL ray trainer
model_rmpad #333: Pull request #222 synchronize by PeterSH6
February 8, 2025 12:38 46m 40s gm/ckpt_integrate
February 8, 2025 12:38 46m 40s
[ckpt] feat: integrate checkpoint resume in RL ray trainer
model_rmpad #332: Pull request #222 synchronize by PeterSH6
February 8, 2025 12:25 6m 47s gm/ckpt_integrate
February 8, 2025 12:25 6m 47s
[ckpt] feat: integrate checkpoint resume in RL ray trainer
model_rmpad #331: Pull request #222 synchronize by PeterSH6
February 8, 2025 11:45 8m 25s gm/ckpt_integrate
February 8, 2025 11:45 8m 25s