About performance #3 (Open)

ZichaoHuang opened this issue Mar 17, 2017 · 17 comments

ZichaoHuang (Contributor) commented Mar 17, 2017

Hi bxshi,
In gen_hr_t, it seems that you use both the validation set and the test set to generate hr_t and tr_h. When I change it to use only the training set to generate hr_t and tr_h (running ProjE_softmax.py as recommended in the README), the filtered mean rank and filtered hits@10 only reach 74.3 and 0.675 after 12 iterations, but according to the appendix of your AAAI paper, the model should yield filtered hits@10 over 0.8 by around 12 iterations.

bxshi (Owner) commented Mar 17, 2017 via email

ZichaoHuang (Contributor, Author) commented:

But shouldn't we use only the training set during training? If we use the validation and test sets during training, how do we know whether the model is overfitting?

bxshi (Owner) commented Mar 17, 2017 via email

bxshi (Owner) commented Mar 17, 2017

Hi, I think there might be a problem; I'll look into it.

bxshi (Owner) commented Mar 17, 2017

Hi, please check data_generator_func: it takes an input_queue fed from self.raw_training_data, which is the raw training data generated from self.__train_hr_t and self.__train_tr_h.

The tr_h and hr_t dictionaries are used to filter out negative sampling results, meaning we generate a corrupted triple only if it does not exist anywhere in the entire triple universe.

Hope this explanation helps.
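For readers following along, here is a minimal sketch of the sampling rule described above (illustrative code, not the repo's implementation; hr_t is assumed to map (h, r) to the set of all true tails collected from train/valid/test):

import random

# Corrupt the tail of a (h, r, t) triple, resampling until the corrupted
# triple appears nowhere in the triple universe. `hr_t` maps (h, r) to the
# set of all true tails (train + valid + test); `n_entity` is the number
# of entities. Corrupting the head with `tr_h` works symmetrically.
def corrupt_tail(h, r, t, hr_t, n_entity):
    true_tails = hr_t.get((h, r), set())
    while True:
        t_neg = random.randrange(n_entity)
        if t_neg not in true_tails:
            return (h, r, t_neg)  # guaranteed false everywhere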

bxshi (Owner) commented Mar 17, 2017

https://github.com/bxshi/ProjE/blob/master/ProjE_softmax.py#L546 uses hr_t and tr_h for evaluation, so if you change them the model cannot report accurate results.
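To make the role of hr_t in evaluation concrete, here is a minimal sketch of filtered tail-prediction ranking (illustrative names and code, not the line referenced above): the other known true tails are masked out before the gold tail is ranked.

import numpy as np

# `scores` holds the model's score for every candidate tail entity (higher
# is better); `hr_t[(h, r)]` is the set of all true tails across
# train/valid/test. Filtering masks the *other* true tails so they cannot
# outrank the gold tail t.
def filtered_rank(scores, h, r, t, hr_t):
    masked = scores.copy()
    for other in hr_t.get((h, r), ()):
        if other != t:
            masked[other] = -np.inf       # remove competing correct answers
    return 1 + int((masked > masked[t]).sum())

# Filtered hits@10 is then the fraction of test triples with rank <= 10,
# and filtered mean rank is the average of these ranks.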

ZichaoHuang (Contributor, Author) commented Mar 17, 2017

Thanks for the explanation.
But when data_generator_func is called as a process target in main, model.hr_t and model.tr_h are also passed to the target as parameters.
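For context, the launch pattern being discussed looks roughly like the following (a sketch with an assumed argument order and a stub worker body, not the repo's exact code):

from multiprocessing import Process, Queue

# Each worker pulls raw training data from an input queue, builds batches,
# and pushes them to an output queue. hr_t / tr_h are handed to the worker
# as ordinary dicts when the process is created.
def data_generator_func(in_queue, out_queue, hr_t, tr_h):
    while True:
        raw = in_queue.get()
        if raw is None:            # sentinel value shuts the worker down
            break
        out_queue.put(raw)         # the real code builds a training batch here

if __name__ == "__main__":
    raw_queue, batch_queue = Queue(), Queue()
    hr_t, tr_h = {}, {}            # stand-ins for model.hr_t / model.tr_h
    workers = [Process(target=data_generator_func,
                       args=(raw_queue, batch_queue, hr_t, tr_h))
               for _ in range(4)]  # worker count is arbitrary in this sketch
    for w in workers:
        w.daemon = True
        w.start()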

bxshi (Owner) commented Mar 17, 2017

Hi, I have updated the code. It no longer uses hr_t and tr_h to filter out negative examples.

ZichaoHuang (Contributor, Author) commented:

Thanks for the update.
I ran the new ProjE model for over 25 epochs, and the filtered hits@10 on the test set seems to converge around 0.782. Is that normal?

bxshi (Owner) commented Mar 18, 2017 via email

ZichaoHuang (Contributor, Author) commented Mar 18, 2017

OK, thanks.

760008522 commented:

Hi, your experiments show relatively consistent performance with negative sampling rates as low as 25%. Which variable represents the negative sampling rate in the code?

aaai18openkgc commented:

@760008522 the parameter is --neg_weight.
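For example, assuming the flag is interpreted as the fraction of entities sampled as negatives per training example (check the argument parser in ProjE_softmax.py to confirm the exact semantics), a 25% rate would be set as:

python ProjE_softmax.py --neg_weight 0.25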

760008522 commented:

Hi, can you explain how negative samples are selected? Which paragraph of the paper corresponds to that code? Thank you very much.

bxshi (Owner) commented Oct 25, 2017

@760008522 For example, https://github.com/bxshi/ProjE/blob/master/ProjE_softmax_noweight.py#L313: the hr_tlist_weight is the masking weight, which has a shape of [None, model.n_entity]. It is generated inside data_generator_func: https://github.com/bxshi/ProjE/blob/master/ProjE_softmax_noweight.py#L425
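As an illustration of what such a masking weight can look like (a sketch under assumptions: build_hr_t_weight and the ±1 weighting scheme are hypothetical, only the [None, model.n_entity] shape comes from the comment above):

import numpy as np

# For each (h, r) in a batch, mark the known true tails and a randomly
# sampled fraction (neg_weight) of the remaining entities; entries left at
# zero are ignored by the loss, which is how a low negative sampling rate
# is realized.
def build_hr_t_weight(batch_hr, train_hr_t, n_entity, neg_weight=0.25):
    weights = np.zeros((len(batch_hr), n_entity), dtype=np.float32)
    n_neg = int(n_entity * neg_weight)
    for i, (h, r) in enumerate(batch_hr):
        positives = list(train_hr_t.get((h, r), ()))
        weights[i, positives] = 1.0                # true tails
        cand = np.random.randint(0, n_entity, size=n_neg)
        cand = cand[weights[i, cand] == 0.0]       # drop collisions with positives
        weights[i, cand] = -1.0                    # sampled negatives
    return weights  # shape [batch_size, n_entity], cf. [None, model.n_entity]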

ocsponge commented:

@ZichaoHuang I also ran the ProjE_softmax model for over 30 epochs, and the results are below:
[VALID] ITER 30 [HEAD PREDICTION] MEAN RANK: 276.7 FILTERED MEAN RANK 81.4 HIT@10 0.412 FILTERED HIT@10 0.733
[VALID] ITER 30 [TAIL PREDICTION] MEAN RANK: 174.7 FILTERED MEAN RANK 59.6 HIT@10 0.497 FILTERED HIT@10 0.782
[TEST] ITER 30 [HEAD PREDICTION] MEAN RANK: 273.3 FILTERED MEAN RANK 80.5 HIT@10 0.416 FILTERED HIT@10 0.735
[TEST] ITER 30 [TAIL PREDICTION] MEAN RANK: 180.9 FILTERED MEAN RANK 60.0 HIT@10 0.494 FILTERED HIT@10 0.784

AndRossi commented:

Hi @ocsponge, I have the same issue: my final result after training is about 10 points below the Hits@10 reported in the paper. I tried reducing the learning rate, but it didn't work.
Were you able to solve this issue in your environment?
