Doc supported architecture (#139)

* separate doc on supported archtiectures * add
huggingface · Jan 5, 2025 · 0b9cfd2 · 0b9cfd2
1 parent 060b709
commit 0b9cfd2
Show file tree

Hide file tree

Showing 3 changed files with 31 additions and 12 deletions.
diff --git a/docs/source/_toctree.yml b/docs/source/_toctree.yml
@@ -1,6 +1,8 @@
 - sections:
   - local: index
     title: 🤗 Optimum-TPU
+  - local: supported-architectures
+    title: Supported Models
   - sections:
     - local: tutorials/overview
       title: Overview

diff --git a/docs/source/howto/training.mdx b/docs/source/howto/training.mdx
@@ -2,19 +2,9 @@
 
 Welcome to the 🤗 Optimum-TPU training guide! This section covers how to fine-tune models using Google Cloud TPUs.
 
-## Currently Supported Models
+## Supported Models
 
-The following models have been tested and validated for fine-tuning on TPU `v5e` and `v6e`:
-
-- 🦙 LLaMA Family
-  - LLaMA-2 7B
-  - LLaMA-3 8B
-  - LLaMA-3.2 1B
-- 💎 Gemma Family
-  - Gemma 2B
-  - Gemma 7B
-
-Bigger models are supported, but not yet tested.
+See [Supported Models](../supported-architectures.mdx).
 
 ## Getting Started
 

diff --git a/docs/source/supported-architectures.mdx b/docs/source/supported-architectures.mdx
@@ -0,0 +1,27 @@
+# Supported Models
+
+## Inference
+The following LLMs have been tested and validated for inference on TPU v5e and v6e for text generation:
+
+- 🦙 LLaMA Family
+  - LLaMA-2 7B
+  - LLaMA-3 8B, 70B
+  - LlaMa3.1 8B, 70B
+  - LLaMA-3.2 1B, 3B (text only models)
+  - LlaMa-3.3 70B
+- 💎 Gemma Family
+  - Gemma 2B, 7B
+- 💨 Mistral Family
+  - Mistral 7B
+  - Mixtral 8x7B
+
+## Fine-tuning
+The following models have been tested and validated for fine-tuning on TPU v5e and v6e:
+
+- 🦙 LLaMA Family
+  - LLaMA-2 7B
+  - LLaMA-3 8B
+  - LLaMA-3.2 1B
+- 💎 Gemma Family
+  - Gemma 2B
+  - Gemma 7B