Skip to content

Commit

Permalink
47,678-->48,725 (#281)
Browse files Browse the repository at this point in the history
  • Loading branch information
TITC authored Jul 23, 2024
1 parent 46fcde1 commit 6cbe652
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion ch05/01_main-chapter-code/ch05.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -743,7 +743,7 @@
"id": "71ae26dd-d77e-41fd-b924-6bd103dd4ee7",
"metadata": {},
"source": [
"- The perplexity is often considered more interpretable because it can be understood as the effective vocabulary size that the model is uncertain about at each step (in the example above, that'd be 47,678 words or tokens)\n",
"- The perplexity is often considered more interpretable because it can be understood as the effective vocabulary size that the model is uncertain about at each step (in the example above, that'd be 48,725 words or tokens)\n",
"- In other words, perplexity provides a measure of how well the probability distribution predicted by the model matches the actual distribution of the words in the dataset\n",
"- Similar to the loss, a lower perplexity indicates that the model predictions are closer to the actual distribution"
]
Expand Down

0 comments on commit 6cbe652

Please sign in to comment.