You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi,
It seems that the final_proj of llama_3_2_1b and the input embedding layer are the same when loading the checkpoint. Should we modify the fs2 architecture to tied weights?
Thanks,
The text was updated successfully, but these errors were encountered:
Hi,
It seems that the final_proj of llama_3_2_1b and the input embedding layer are the same when loading the checkpoint. Should we modify the fs2 architecture to tied weights?
Thanks,
The text was updated successfully, but these errors were encountered: