Print out embeddings for more illustrative learning #481

henrythe9th · 2025-01-13T05:01:35Z

The book only prints the shape but not the contents of the embeddings in ch02

Printing out embeddings helps to better illustrate learnings by showing the actual contents of the embeddings, and it doesn't take that much more space

review-notebook-app · 2025-01-13T05:01:40Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

rasbt · 2025-01-13T20:32:58Z

Thanks for the suggestion, @henrythe9th. I totally agree with you here. Unfortunately, I had to trim the chapter per request by the publisher because it exceeded the allowed page limits for a chapter. Hence, I was focusing on the most important essentials. (Luckily it became a bit less strict in consequent chapters as it became clear that my style was a bit different from other books, but I still had to be careful to adhere to the suggested lengths.)

Long story short, I agree with you here. At the same time I don't want to make the modification like this in the notebook because it creates a deviation from the print version then, which could confuse some readers. As a compromise, I can add these lines in a "commented out" way with a suggestion to uncomment the line to see how the embeddings look like.

Anyways, thanks so much for taking the time to try to improve the book, and I hope you like it overall!

henrythe9th · 2025-01-13T22:25:35Z

Sounds good and thank you for putting together this great book!

* Add "What's next" section (rasbt#432) * Add What's next section * Delete appendix-D/01_main-chapter-code/appendix-D-Copy2.ipynb * Delete ch03/01_main-chapter-code/ch03-Copy1.ipynb * Delete appendix-D/01_main-chapter-code/appendix-D-Copy1.ipynb * Update ch07.ipynb * Update ch07.ipynb * Add chapter names * Add missing device transfer in gpt_generate.py (rasbt#436) * Add utility to prevent double execution of certain cells (rasbt#437) * Add flexible padding bonus experiment (rasbt#438) * Add flexible padding bonus experiment * fix links * Fixed command for row 16 additional experiment (rasbt#439) * fixed command for row 16 experiment * Update README.md --------- Co-authored-by: Sebastian Raschka <[email protected]> * [minor] typo & comments (rasbt#441) * typo & comment - safe -> save - commenting code: batch_size, seq_len = in_idx.shape * comment - adding # NEW for assert num_heads % num_kv_groups == 0 * update memory wording --------- Co-authored-by: rasbt <[email protected]> * fix misplaced parenthesis and update license (rasbt#466) * Minor readability improvement in dataloader.ipynb (rasbt#461) * Minor readability improvement in dataloader.ipynb - The tokenizer and encoded_text variables at the root level are unused. - The default params for create_dataloader_v1 are confusing, especially for the default batch_size 4, which happens to be the same as the max_length. * readability improvements --------- Co-authored-by: rasbt <[email protected]> * typo fixed (rasbt#468) * typo fixed * only update plot --------- Co-authored-by: rasbt <[email protected]> * Add backup URL for gpt2 weights (rasbt#469) * Add backup URL for gpt2 weights * newline * fix ch07 unit test (rasbt#470) * adds no-grad context for reference model to DPO (rasbt#473) * Auto download DPO dataset if not already available in path (rasbt#479) * Auto download DPO dataset if not already available in path * update tests to account for latest HF transformers release in unit tests * pep 8 * fix reward margins plot label in dpo nb * Print out embeddings for more illustrative learning (rasbt#481) * print out embeddings for illustrative learning * suggestion print embeddingcontents --------- Co-authored-by: rasbt <[email protected]> * Include mathematical breakdown for exercise solution 4.1 (rasbt#483) * 04_optional-aws-sagemaker-notebook (rasbt#451) * 04_optional-aws-sagemaker-notebook * Update setup/04_optional-aws-sagemaker-notebook/cloudformation-template.yml * Update README.md --------- Co-authored-by: Sebastian Raschka <[email protected]> * Implementingthe BPE Tokenizer from Scratch (rasbt#487) * BPE: fixed typo (rasbt#492) * fixed typo * use rel path if exists * mod gitignore and use existing vocab files --------- Co-authored-by: rasbt <[email protected]> * fix: preserve newline tokens in BPE encoder (rasbt#495) * fix: preserve newline tokens in BPE encoder * further fixes * more fixes --------- Co-authored-by: rasbt <[email protected]> * add GPT2TokenizerFast to BPE comparison (rasbt#498) * added HF BPE Fast * update benchmarks * add note about performance * revert accidental changes --------- Co-authored-by: rasbt <[email protected]> * Bonus material: extending tokenizers (rasbt#496) * Bonus material: extending tokenizers * small wording update * Test for PyTorch 2.6 release candidate (rasbt#500) * Test for PyTorch 2.6 release candidate * update * update * remove extra added file * A few cosmetic updates (rasbt#504) * Fix default argument in ex 7.2 (rasbt#506) * Alternative weight loading via .safetensors (rasbt#507) * Test PyTorch nightly releases (rasbt#509) --------- Co-authored-by: Sebastian Raschka <[email protected]> Co-authored-by: Daniel Kleine <[email protected]> Co-authored-by: casinca <[email protected]> Co-authored-by: Tao Qian <[email protected]> Co-authored-by: QS <[email protected]> Co-authored-by: Henry Shi <[email protected]> Co-authored-by: rvaneijk <[email protected]> Co-authored-by: Austin Welch <[email protected]>

print out embeddings for illustrative learning

9e88891

suggestion print embeddingcontents

58e8cc0

rasbt merged commit b3150ee into rasbt:main Jan 13, 2025
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Print out embeddings for more illustrative learning #481

Print out embeddings for more illustrative learning #481

henrythe9th commented Jan 13, 2025

review-notebook-app bot commented Jan 13, 2025

rasbt commented Jan 13, 2025

henrythe9th commented Jan 13, 2025

Print out embeddings for more illustrative learning #481

Print out embeddings for more illustrative learning #481

Conversation

henrythe9th commented Jan 13, 2025

review-notebook-app bot commented Jan 13, 2025

rasbt commented Jan 13, 2025

henrythe9th commented Jan 13, 2025