lm_head.weight in convert_opt_checkpoint.py #36

Open
quaternior opened this issue Oct 31, 2024 · 0 comments
Hello authors, I would like to express my gratitude for the contributions you've made.
While reading convert_opt_checkpoint.py, I found the following code:

    item['lm_head.weight'] = model.state_dict()['model.decoder.embed_tokens.weight']
    item['final_layer_norm.weight'] = model.state_dict()['model.decoder.final_layer_norm.weight']
    item['final_layer_norm.bias'] = model.state_dict()['model.decoder.final_layer_norm.bias']

Since the target key is 'lm_head.weight', shouldn't the value also be read from the model's 'lm_head.weight' key, rather than from 'model.decoder.embed_tokens.weight'?
Thanks!
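
For context, here is a minimal sketch of how one could check whether the two keys refer to the same (tied) tensor. The checkpoint name facebook/opt-125m is only an illustration and not necessarily the repo's setup:

    # Sketch: check whether OPT's lm_head.weight is tied to the input embeddings,
    # which would make the two state_dict keys interchangeable.
    # (facebook/opt-125m is an assumed example checkpoint.)
    import torch
    from transformers import OPTForCausalLM

    model = OPTForCausalLM.from_pretrained("facebook/opt-125m")
    lm_head = model.lm_head.weight
    embed = model.model.decoder.embed_tokens.weight

    print(torch.equal(lm_head, embed))              # same values?
    print(lm_head.data_ptr() == embed.data_ptr())   # same underlying storage (tied)?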
