Skip to content

why need to calculate reward_stat? I see llvm_trainer.train use reward from sequence_example.reward #353

Unanswered
18liumin asked this question in General
Discussion options

You must be logged in to vote

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants
Converted from issue

This discussion was converted from issue #332 on September 02, 2024 13:59.