forked from rasbt/LLMs-from-scratch
Commit
distinguish better between main chapter code and bonus materials
Showing 6 changed files with 37 additions and 3 deletions.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,7 +1,12 @@ | ||
# Chapter 2: Working with Text Data | ||
|
||
|
||
## Main Chapter Code | ||
|
||
- [01_main-chapter-code](01_main-chapter-code) contains the main chapter code and exercise solutions | ||
|
||
|
||
## Bonus Materials | ||
|
||
- [02_bonus_bytepair-encoder](02_bonus_bytepair-encoder) contains optional code to benchmark different byte pair encoder implementations | ||
|
||
- [03_bonus_embedding-vs-matmul](03_bonus_embedding-vs-matmul) contains optional (bonus) code to explain that embedding layers and fully connected layers applied to one-hot encoded vectors are equivalent. |
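
The equivalence mentioned in the last entry above can be illustrated with a minimal sketch. This is not the code from the bonus folder; it is a dependency-free toy in which the names `embedding_lookup`, `one_hot`, and `matmul`, as well as the weight values, are assumptions made for illustration. It shows that selecting rows of a weight matrix by token ID gives the same result as multiplying one-hot vectors by that matrix.

```python
# Toy illustration (hypothetical names and values, not taken from the repository):
# an embedding lookup is the same as a one-hot matrix multiplied by the weights.

def embedding_lookup(weight, token_ids):
    """Select one row of `weight` per token ID, as an embedding layer does."""
    return [list(weight[i]) for i in token_ids]

def one_hot(token_ids, vocab_size):
    """Encode each token ID as a vector with a single 1.0 at that index."""
    return [[1.0 if j == i else 0.0 for j in range(vocab_size)] for i in token_ids]

def matmul(a, b):
    """Plain matrix product of two nested-list matrices."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

# Weight matrix with 4 vocabulary entries and an embedding dimension of 2.
weight = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6], [0.7, 0.8]]
token_ids = [2, 0, 3]

via_lookup = embedding_lookup(weight, token_ids)
via_matmul = matmul(one_hot(token_ids, len(weight)), weight)

assert via_lookup == via_matmul
```

The lookup avoids the multiplications by zero, which is why embedding layers are usually preferred in practice even though both routes produce the same output.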
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,4 +1,9 @@ | ||
# Chapter 3: Coding Attention Mechanisms | ||
|
||
## Main Chapter Code | ||
|
||
- [01_main-chapter-code](01_main-chapter-code) contains the main chapter code. | ||
|
||
## Bonus Materials | ||
|
||
- [02_bonus_efficient-multihead-attention](02_bonus_efficient-multihead-attention) implements and compares different implementation variants of multihead-attention |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,4 +1,9 @@ | ||
# Chapter 4: Implementing a GPT Model from Scratch to Generate Text | ||
|
||
## Main Chapter Code | ||
|
||
- [01_main-chapter-code](01_main-chapter-code) contains the main chapter code. | ||
|
||
## Bonus Materials | ||
|
||
- [02_performance-analysis](02_performance-analysis) contains optional code analyzing the performance of the GPT model(s) implemented in the main chapter. |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,5 +1,11 @@ | ||
# Chapter 6: Finetuning for Classification | ||
|
||
|
||
## Main Chapter Code | ||
|
||
- [01_main-chapter-code](01_main-chapter-code) contains the main chapter code | ||
|
||
## Bonus Materials | ||
|
||
- [02_bonus_additional-experiments](02_bonus_additional-experiments) includes additional experiments (e.g., training the last vs first token, extending the input length, etc.) | ||
- [03_bonus_imdb-classification](03_bonus_imdb-classification) compares the LLM from chapter 6 with other models on a 50k IMDB movie review sentiment classification dataset |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,3 +1,11 @@ | ||
# Chapter 7: Finetuning to Follow Instructions | ||
|
||
In progress ... | ||
## Main Chapter Code | ||
|
||
- [01_main-chapter-code](01_main-chapter-code) contains the main chapter code and exercise solutions | ||
|
||
## Bonus Materials | ||
|
||
- [02_dataset-utilities](02_dataset-utilities) contains utility code that can be used for preparing an instruction dataset. | ||
|
||
- [03_model-evaluation](03_model-evaluation) contains utility code for evaluating instruction responses using a local Llama 3 model and the GPT-4 API. |