
Output is not as expected (politeness is not getting transferred) #15

Open
souro opened this issue Mar 22, 2021 · 11 comments

Comments

@souro

souro commented Mar 22, 2021

I followed your tutorial end to end and did not change a single thing; I did exactly what is described in your GitHub tutorial, but I am not getting the expected output. I also saw the other open issue similar to mine and followed your new tutorial, but I still don't get the output you describe. Could you let me know how I can track down the issue? Since I am doing exactly what you describe, with the same data and settings, please tell me if I should upload any file from my side to help debug this. Thank you.

Input:
send me the text files.
look into this issue.

Output:
send me copy of the text files.
look forward to looking into this issue.

@chenrq2005

@souro I am using the same input as yours; my output for the first sentence was OK, the same as the output in the README, but the second one was the same as yours.

@madaan
Member

madaan commented Apr 5, 2021

Hi,

As you can imagine, there are several places where randomness can play a role (i.e., you might not get the exact same output). Perhaps you can use the pre-trained models: https://drive.google.com/drive/folders/1tXLC4WbXc_WLgvQu2mTa3jDe0efZ3dz1.
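For what it's worth, run-to-run variation in a PyTorch pipeline typically comes from the Python, NumPy, torch, and CUDA random states. A minimal sketch of pinning them all is below; whether the tagger-generator scripts already expose a seed option is not confirmed here, so treat this as a generic recipe rather than a documented flag.

# Generic sketch: pin the usual sources of randomness in a PyTorch run.
# This is not a documented tagger-generator option, just a common recipe.
import random
import numpy as np
import torch

def set_seed(seed: int = 42) -> None:
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)
    torch.backends.cudnn.deterministic = True   # deterministic kernels
    torch.backends.cudnn.benchmark = False      # disable autotuner nondeterminism

set_seed(42)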

@souro
Author

souro commented Apr 8, 2021

Thank you. Let me check with this.

@souro
Author

souro commented Apr 13, 2021

@madaan
Hi,
I tried using the pre-trained models as they are, but I get the errors below:
No such file or directory: 'models/politeness/bpe/en-tagged-tagger.pt'
No such file or directory: 'models/politeness/bpe/en-generated-generator.pt'

so I renamed the pretrained models to those filenames, and then I get the errors below:

RuntimeError: Error(s) in loading state_dict for Transformer:
size mismatch for embeds.weight: copying a param with shape torch.Size([15762, 512]) from checkpoint, the shape in current model is torch.Size([15761, 512]).
size mismatch for logits.weight: copying a param with shape torch.Size([15762, 512]) from checkpoint, the shape in current model is torch.Size([15761, 512]).
size mismatch for logits.bias: copying a param with shape torch.Size([15762]) from checkpoint, the shape in current model is torch.Size([15761]).

and

RuntimeError: Error(s) in loading state_dict for Transformer:
size mismatch for embeds.weight: copying a param with shape torch.Size([15785, 512]) from checkpoint, the shape in current model is torch.Size([15782, 512]).
size mismatch for logits.weight: copying a param with shape torch.Size([15785, 512]) from checkpoint, the shape in current model is torch.Size([15782, 512]).
size mismatch for logits.bias: copying a param with shape torch.Size([15785]) from checkpoint, the shape in current model is torch.Size([15782]).
BLEU+case.mixed+numrefs.1+smooth.exp+tok.13a+version.1.5.1 = 56.08 100.0/88.9/85.7/80.0 (BP = 0.635 ratio = 0.688 hyp_len = 11 ref_len = 16)

I am using the configuration settings as given in this repo, without any changes. Can you please let me know whether any changes are required?

@TanmayParekh

It looks like there is a difference between the vocabulary sizes of the loaded checkpoint and the initialized model. This can happen if you initialize the model with the wrong vocabulary size. Could you share more details/code showing where you initialize the model and load the pre-trained model?
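A quick way to see where the mismatch comes from is to compare the vocabulary dimension stored in the checkpoint against the size of the BPE vocabulary used to build the current model. The sketch below is a diagnostic only: the parameter name embeds.weight is taken from the error messages above, while the assumption that the .pt file holds a plain state_dict (or a dict with a "model" entry) and the BPE model filename are illustrative guesses, not confirmed from the repo.

# Diagnostic sketch: compare the checkpoint's embedding rows with the
# current BPE vocabulary size. The state_dict layout and BPE model path
# below are assumptions.
import torch
import sentencepiece as spm

ckpt = torch.load("models/politeness/bpe/en-tagged-tagger.pt", map_location="cpu")
state = ckpt["model"] if isinstance(ckpt, dict) and "model" in ckpt else ckpt
print("checkpoint vocab size:", state["embeds.weight"].shape[0])

sp = spm.SentencePieceProcessor()
sp.Load("models/politeness/bpe/bpe.model")  # hypothetical path to the BPE model
print("current BPE vocab size:", sp.GetPieceSize())

A small difference like 15762 vs 15761 usually means the two sides disagree on special tokens or were built from slightly different BPE vocabularies.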

@souro
Author

souro commented Apr 13, 2021

@TanmayParekh Thank you for your reply.
I am following the instructions at the link below (without a single change):
https://github.com/tag-and-generate/tagger-generator#readme

The code I am using (again without a single change):
https://github.com/tag-and-generate/tagger-generator

I just replaced the pretrained models in /tagger-generator/tag-and-generate-train/models/politeness/bpe/
and then ran the command below for testing, as suggested in your repo:
bash scripts/inference.sh input.txt sample tagged generated politeness P_9 P_9 ../data/ 3

@TanmayParekh

To narrow down the potential cause of the difference in the vocabs, I want to confirm that you are using BPE segmentation for the words (via the prepare_bpe.sh script).
Also, which version of the sentencepiece library are you using?
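For anyone answering this, a quick way to check which sentencepiece build the scripts actually import (assuming a pip-installed package) is:

# Report the sentencepiece version that the scripts actually import.
import sentencepiece
print(sentencepiece.__version__)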

@qyang1021

(Quoting @TanmayParekh's question above about BPE segmentation and the sentencepiece version.)

So which version of sentencepiece should I use?

@jionghaolin

jionghaolin commented Jul 25, 2021

(Quoting @souro's comment from Apr 13 above: the "No such file or directory" errors for en-tagged-tagger.pt and en-generated-generator.pt, and the state_dict size-mismatch errors after renaming the pre-trained models.)

@madaan
I ran into the same issue and have shared my screenshot here. I followed the instructions at https://github.com/tag-and-generate/tagger-generator#readme and replaced the trained models with the pre-trained models, but the terminal raised the errors above. I installed sentencepiece using 'pip install sentencepiece'.

@AJR07

AJR07 commented Jun 25, 2022

@madaan
Hi, after replacing the models with the pre-trained models I am also met with the same error. Could it be because the versions of sentencepiece are different? If so, could you provide us with the version of sentencepiece you are using? Thanks!

@AJR07

AJR07 commented Nov 23, 2022

Did you all try sentencepiece version 0.1.91 with Python 3.7.0? Seems to work for me :)
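For anyone checking whether their environment matches that combination, a minimal sanity check (assuming sentencepiece was installed with pip, e.g. pip install sentencepiece==0.1.91) could be:

# Verify the combination reported to work above:
# Python 3.7 and sentencepiece 0.1.91.
import sys
import sentencepiece

assert sys.version_info[:2] == (3, 7), sys.version
assert sentencepiece.__version__ == "0.1.91", sentencepiece.__version__
print("environment matches the reported working setup")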
