Skip to content

Commit

Permalink
Deploying to gh-pages from @ 900119a 🚀
Browse files Browse the repository at this point in the history
  • Loading branch information
nerdai committed Feb 10, 2025
1 parent f7a2694 commit 740d7ea
Show file tree
Hide file tree
Showing 5 changed files with 48 additions and 14 deletions.
16 changes: 13 additions & 3 deletions nlp/models/deepseek_r1.html
Original file line number Diff line number Diff line change
Expand Up @@ -156,9 +156,19 @@ <h1 class="menu-title">AI Pocket Reference: NLP</h1>
<main>
<!-- markdownlint-disable-file MD033 -->
<h1 id="deepseek-r1"><a class="header" href="#deepseek-r1">DeepSeek-R1</a></h1>
<p align="left"><small>
(Reading time: 6 minutes)
</small></p>
<!-- markdownlint-disable MD013 -->
<div style="display: flex; justify-content: space-between; align-items: center; margin-bottom: 2em;">
<div>
<a target="_blank" href="https://github.com/VectorInstitute/ai-pocket-reference/issues/new?template=edit-request.yml">
<img src="https://img.shields.io/badge/Suggest_an_Edit-black?logo=github&style=flat" alt="Suggest an Edit"/>
</a>
<a target="_blank" href="https://colab.research.google.com/github/VectorInstitute/ai-pocket-reference-code/blob/main/notebooks/nlp/dummy.ipynb">
<img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/>
</a>
<p style="margin: 0;"><small>Reading time: 7 minutes</small></p>
</div>
</div>
<!-- markdownlint-enable MD013 -->
<p>The DeepSeek-R1 model was introduced by DeepSeek in January of 2025. It is
derived from an earlier checkpoint of <a href="../models/deepseek_v3.html">DeepSeek-V3</a>.
In particular, starting with DeepSeek-V3-base, four stages of fine-tuning were
Expand Down
13 changes: 10 additions & 3 deletions nlp/models/deepseek_v3.html
Original file line number Diff line number Diff line change
Expand Up @@ -156,9 +156,16 @@ <h1 class="menu-title">AI Pocket Reference: NLP</h1>
<main>
<!-- markdownlint-disable-file MD033 -->
<h1 id="deepseek-v3"><a class="header" href="#deepseek-v3">DeepSeek-v3</a></h1>
<p align="left"><small>
(Reading time: 6 minutes)
</small></p>
<!-- markdownlint-disable MD013 -->
<div style="display: flex; justify-content: space-between; align-items: center; margin-bottom: 2em;">
<div>
<a target="_blank" href="https://github.com/VectorInstitute/ai-pocket-reference/issues/new?template=edit-request.yml">
<img src="https://img.shields.io/badge/Suggest_an_Edit-black?logo=github&style=flat" alt="Suggest an Edit"/>
</a>
<p style="margin: 0;"><small>Reading time: 7 minutes</small></p>
</div>
</div>
<!-- markdownlint-enable MD013 -->
<p>The DeepSeek-V3 model was introduced by DeepSeek in December of 2024. It is an
LLM that leverages <a href="../llms/architecture/moe.html">MoE</a> in its design.</p>
<center>
Expand Down
29 changes: 23 additions & 6 deletions nlp/print.html
Original file line number Diff line number Diff line change
Expand Up @@ -158,9 +158,19 @@ <h1 class="menu-title">AI Pocket Reference: NLP</h1>
<h1 id="introduction"><a class="header" href="#introduction">Introduction</a></h1>
<div style="break-before: page; page-break-before: always;"></div><!-- markdownlint-disable-file MD033 -->
<h1 id="deepseek-r1"><a class="header" href="#deepseek-r1">DeepSeek-R1</a></h1>
<p align="left"><small>
(Reading time: 6 minutes)
</small></p>
<!-- markdownlint-disable MD013 -->
<div style="display: flex; justify-content: space-between; align-items: center; margin-bottom: 2em;">
<div>
<a target="_blank" href="https://github.com/VectorInstitute/ai-pocket-reference/issues/new?template=edit-request.yml">
<img src="https://img.shields.io/badge/Suggest_an_Edit-black?logo=github&style=flat" alt="Suggest an Edit"/>
</a>
<a target="_blank" href="https://colab.research.google.com/github/VectorInstitute/ai-pocket-reference-code/blob/main/notebooks/nlp/dummy.ipynb">
<img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/>
</a>
<p style="margin: 0;"><small>Reading time: 7 minutes</small></p>
</div>
</div>
<!-- markdownlint-enable MD013 -->
<p>The DeepSeek-R1 model was introduced by DeepSeek in January of 2025. It is
derived from an earlier checkpoint of <a href="models/../models/deepseek_v3.html">DeepSeek-V3</a>.
In particular, starting with DeepSeek-V3-base, four stages of fine-tuning were
Expand Down Expand Up @@ -322,9 +332,16 @@ <h4 id="references--useful-links"><a class="header" href="#references--useful-li
</div>
<div style="break-before: page; page-break-before: always;"></div><!-- markdownlint-disable-file MD033 -->
<h1 id="deepseek-v3"><a class="header" href="#deepseek-v3">DeepSeek-v3</a></h1>
<p align="left"><small>
(Reading time: 6 minutes)
</small></p>
<!-- markdownlint-disable MD013 -->
<div style="display: flex; justify-content: space-between; align-items: center; margin-bottom: 2em;">
<div>
<a target="_blank" href="https://github.com/VectorInstitute/ai-pocket-reference/issues/new?template=edit-request.yml">
<img src="https://img.shields.io/badge/Suggest_an_Edit-black?logo=github&style=flat" alt="Suggest an Edit"/>
</a>
<p style="margin: 0;"><small>Reading time: 7 minutes</small></p>
</div>
</div>
<!-- markdownlint-enable MD013 -->
<p>The DeepSeek-V3 model was introduced by DeepSeek in December of 2024. It is an
LLM that leverages <a href="models/../llms/architecture/moe.html">MoE</a> in its design.</p>
<center>
Expand Down
2 changes: 1 addition & 1 deletion nlp/searchindex.js

Large diffs are not rendered by default.

2 changes: 1 addition & 1 deletion nlp/searchindex.json

Large diffs are not rendered by default.

0 comments on commit 740d7ea

Please sign in to comment.