the problem of lr #14

ffxz · 2019-03-28T09:40:27Z

I find that the lr of your code is 0.0001, but the paper's lr is 0.0015, which one is better? I train the model, the loss can not decrease, i don't know why.

zhr1201 · 2019-04-09T01:41:53Z

I didn't spend too much effort fine tune the model when I wrote this piece of code. I just found that this lr work for the data set and the model, but there may be better choices.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

the problem of lr #14

the problem of lr #14

ffxz commented Mar 28, 2019

zhr1201 commented Apr 9, 2019

the problem of lr #14

the problem of lr #14

Comments

ffxz commented Mar 28, 2019

zhr1201 commented Apr 9, 2019