I understand that the model uses the default hyperparameters from defaults.py for training, but why were these specific values chosen?
I ask because the Harvard paper for this model reports different values for certain parameters: for example, the token embedding size is 80 in the paper, whereas defaults.py sets it to 10, and the attention decoder has 512 hidden units in the paper, whereas defaults.py sets 128.
Are the values in defaults.py optimal for the im2latex problem, or should I stick with the ones in the paper?
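For context, this is roughly how I'd sketch overriding the two values to compare both configurations. The key names (`embedding_size`, `decoder_hidden_units`) are my own illustrative assumptions, not necessarily the identifiers actually used in defaults.py:

```python
# Illustrative only: the keys below are assumed names -- check
# defaults.py for the real identifiers used by this codebase.

PAPER_HPARAMS = {
    "embedding_size": 80,         # token embedding size reported in the Harvard paper
    "decoder_hidden_units": 512,  # attention-decoder hidden units in the paper
}

DEFAULTS_PY_HPARAMS = {
    "embedding_size": 10,         # value set in defaults.py
    "decoder_hidden_units": 128,  # value set in defaults.py
}

def with_paper_values(hparams: dict) -> dict:
    """Return a copy of hparams with the paper's values swapped in."""
    merged = dict(hparams)
    merged.update(PAPER_HPARAMS)
    return merged

if __name__ == "__main__":
    print(with_paper_values(DEFAULTS_PY_HPARAMS))
```

Training would then be launched with the merged dict in place of the stock defaults; whether the paper's larger values actually help here is exactly what I'm asking.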