-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
About performance #3
Comments
Hi, that is used for calculating the filtered metrics. If you use train only, it will not correctly filter out true entities in valid and test set when calculate the filtered scores.
…Sent from my iPhone
On Mar 17, 2017, at 7:27 AM, Zichao Huang ***@***.***> wrote:
In gen_ht_r, you use both the validation set and test set to generate ht_r and tr_h. When I change it to use only the training set to generate ht_r and tr_h(I run ProjE_softmax.py as you recommended in the README file), filtered mean rank and ***@***.*** only reaches 74.3 and 0.675 after 12 iters, but according to your appendix in your AAAI paper, the model should yield filtered ***@***.*** over 0.8 around 12 iters.
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub, or mute the thread.
|
But shouldn't we only use the training set during training? If we use the validation set and test set during training, how do we know if the model is overfitting or not? |
The training is using the train-hrt only. The hrt is used for evaluation.
…Sent from my iPhone
On Mar 17, 2017, at 11:13 AM, Zichao Huang ***@***.***> wrote:
But shouldn't we only use the training set during training? If we use the validation set and test set during training, how do we know if the model is overfitting or not?
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub, or mute the thread.
|
Hi I think there might be a problem, I'll look into that. |
Hi, please check data_generator_func, it took an input_queue from self.raw_training_data, which is the generated raw training data from The Hope this explanation helps. |
https://github.com/bxshi/ProjE/blob/master/ProjE_softmax.py#L546 uses |
Thanks for the explanation. |
Hi I have updated the code. Now it does not use |
Thanks for the updates. |
If you have time I would suggest lower the learning rate say 1e-3 or 4e-5 and try again. My parameters are based on the wrong set so you may need to try some other settings. Meanwhile I'm also testing and will update the arxiv version once it's done.
…Sent from my iPhone
On Mar 18, 2017, at 7:42 AM, Zichao Huang ***@***.***> wrote:
Thanks for the updates.
I run the new ProjE model for over 25 epochs and it seems that the filtered ***@***.*** on test set converges around 0.782. Is that normal?
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub, or mute the thread.
|
OK, thanks. |
Hi,your experiments show relatively consistent performance using negative sampling rates as low as 25%.Which variable is represented negative sampling rates in the code? |
@760008522 the parameter is |
Hi,can you explain how to select negative samples? Which paragraph corresponds to the code?Thank you very much. |
@760008522 For example https://github.com/bxshi/ProjE/blob/master/ProjE_softmax_noweight.py#L313, the |
@ZichaoHuang I also run the ProjE_softmax model for over 30 epochs and The result is below: |
Hi @ocsponge, I have the same issue: my final result after training is about 10 points below the Hits@10 reported in the paper. I tried reducing the learning rate, but it didn't work. |
Hi bxshi,
In gen_ht_r, it seems that you use both the validation set and test set to generate
ht_r
andtr_h
. When I change it to use only the training set to generateht_r
andtr_h
(I run ProjE_softmax.py as you recommended in the README file), filtered mean rank and hits@10 only reaches 74.3 and 0.675 after 12 iters, but according to the appendix in your AAAI paper, the model should yield filtered hits@10 over 0.8 around 12 iters.The text was updated successfully, but these errors were encountered: