monitoring training progress #6

jelmervdl · 2023-03-02T12:47:13Z

There's already a marian-tensorboard connector. We can either plug into that or write our own version of it. Since we have direct access to Marian's stdout and stderr, we can read the training log directly from there.

The regular expressions it uses to parse Marian's log: https://github.com/marian-nmt/marian-tensorboard/blob/b9867c43472a27783611accba93adebda60ba462/src/marian_tensorboard/marian_tensorboard.py#L107-L125
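To make the "read directly from stdout/stderr" idea concrete, here is a rough sketch of parsing one training-progress line ourselves. The log-line shape (`Ep. N : Up. N : ... : Cost X`) is an assumption about what Marian prints and may differ by version and options; `UPDATE_RE` and `parse_update_line` are hypothetical names, not taken from marian-tensorboard.

```python
import re
from typing import Optional

# Assumed shape of a Marian training-progress line, e.g.
#   [2023-03-02 12:47:13] Ep. 1 : Up. 1000 : Sen. 64,000 : Cost 5.12 : Time 100.00s
# The real format depends on Marian's version and logging options.
UPDATE_RE = re.compile(
    r"Ep\.\s*(?P<epoch>\d+)\s*:\s*Up\.\s*(?P<update>\d+)\s*:"
    r".*?Cost\s+(?P<cost>[-+]?\d+(?:\.\d+)?)"
)


def parse_update_line(line: str) -> Optional[dict]:
    """Return epoch, update and cost from a progress line, or None if no match."""
    match = UPDATE_RE.search(line)
    if match is None:
        return None
    return {
        "epoch": int(match["epoch"]),
        "update": int(match["update"]),
        "cost": float(match["cost"]),
    }
```

Each line read from the Marian subprocess could be passed through `parse_update_line`, and the resulting values forwarded to whatever writer we end up using.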

An added benefit of doing the integration ourselves: we can also push dataset events to TensorBoard, such as epoch events and training stage changes.

Slightly related to #3.

XapaJIaMnu · 2023-07-06T15:08:44Z

Add to this issue:

Advance to a new stage when Marian reports a stall on a validation set.

Combined with resetting the optimizer inside Marian, this can be used to automatically find the optimal point to transition between stages, so that the new dataset mixture doesn't have its gradients penalised too hard by the change of data.
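A minimal sketch of how stall-driven stage transitions might look. It assumes Marian's validation lines contain `[valid]` and a phrase like `stalled 3 times`; that wording, the `StageTracker` class, the `patience` parameter and the stage names are all hypothetical. Resetting the optimizer inside Marian is not covered here.

```python
import re
from typing import List, Optional

# Assumed wording of a stalled validation line, e.g.
#   [valid] Ep. 1 : Up. 5000 : chrf : 45.1 : stalled 3 times (last best: 45.2)
STALL_RE = re.compile(r"\[valid\].*?stalled\s+(?P<count>\d+)\s+times")


def stall_count(line: str) -> Optional[int]:
    """Return the reported stall count, or None if the line reports no stall."""
    match = STALL_RE.search(line)
    return int(match["count"]) if match else None


class StageTracker:
    """Advance through stages once validation stalls `patience` more times."""

    def __init__(self, stages: List[str], patience: int = 3) -> None:
        self.stages = stages
        self.patience = patience
        self.index = 0
        self._baseline = 0  # stall count seen at the last transition

    @property
    def stage(self) -> str:
        return self.stages[self.index]

    def observe(self, line: str) -> bool:
        """Feed one log line; return True if it triggered a stage transition."""
        count = stall_count(line)
        if count is None:
            return False
        if count < self._baseline:
            self._baseline = 0  # Marian's counter restarted after a new best
        if count - self._baseline < self.patience:
            return False
        if self.index + 1 >= len(self.stages):
            return False  # already in the final stage
        self.index += 1
        self._baseline = count
        return True
```

The baseline is kept because Marian's stall counter would keep counting after we switch data, so only stalls that occur after the last transition count towards the next one.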

jelmervdl added the enhancement label · Mar 2, 2023
