Commit

adding more pages, links, etc.
mrwyattii committed Dec 4, 2023
1 parent 5e2289a commit fa5f948
Showing 4 changed files with 18 additions and 0 deletions.
4 changes: 4 additions & 0 deletions docs/source/deployment.rst
@@ -102,6 +102,8 @@ override the default ``max_length``:
response_long = client(prompt, max_length=1024)
response_short = client(prompt, max_length=128)
.. _deployment_model_parallelism:

Model Parallelism
-----------------

@@ -113,6 +115,8 @@ argument to :func:`mii.serve`:
client = mii.serve("mistralai/Mistral-7B-v0.1", tensor_parallel=2)
.. _deployment_model_replicas:

Model Replicas
--------------

8 changes: 8 additions & 0 deletions docs/source/parallelism.rst
@@ -1,2 +1,10 @@
Model parallelism
=================

DeepSpeed-MII supports model parallelism via tensor parallelism for splitting models across multiple GPUs.

For model parallelism with :doc:`pipeline`, please see :ref:`Pipeline Model
Parallelism <pipeline_model_parallelism>`.

For model parallelism with :doc:`deployment`, please see :ref:`Persistent
Deployment Model Parallelism <deployment_model_parallelism>`.
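
The new parallelism.rst text describes tensor parallelism as splitting a model across multiple GPUs. As a rough illustration of that idea only, the sketch below shards the columns of one weight matrix, computes each shard's partial output separately, and concatenates the results. It is plain Python with hypothetical helper names (``matmul``, ``split_columns``, ``tensor_parallel_matmul``); it is not DeepSpeed-MII code and makes no claim about how the library shards weights internally.

```python
# Illustrative sketch of column-wise tensor parallelism in plain Python.
# All helper names here are hypothetical and not part of DeepSpeed-MII.


def matmul(x, w):
    """Multiply a vector x (length n) by a matrix w (n rows, m columns)."""
    return [sum(x[i] * w[i][j] for i in range(len(x))) for j in range(len(w[0]))]


def split_columns(w, num_shards):
    """Split the columns of w into num_shards contiguous, equal-width shards."""
    cols = len(w[0])
    assert cols % num_shards == 0, "columns must divide evenly across shards"
    width = cols // num_shards
    return [[row[s * width:(s + 1) * width] for row in w] for s in range(num_shards)]


def tensor_parallel_matmul(x, w, num_shards):
    """Compute x @ w shard by shard, then concatenate the partial outputs.

    In a real system each shard would live on its own GPU; here the
    shards are processed one after another on the CPU.
    """
    shards = split_columns(w, num_shards)
    partials = [matmul(x, shard) for shard in shards]
    return [value for part in partials for value in part]


x = [1.0, 2.0]
w = [[1.0, 2.0, 3.0, 4.0],
     [5.0, 6.0, 7.0, 8.0]]

full = matmul(x, w)
sharded = tensor_parallel_matmul(x, w, num_shards=2)
```

Both ``full`` and ``sharded`` hold the same values, which is the property that lets a layer be spread over several devices without changing its output.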
2 changes: 2 additions & 0 deletions docs/source/pipeline.rst
@@ -70,6 +70,8 @@ pipeline. For example, you can control per-prompt generation length:
response_long = pipeline(prompt, max_length=1024)
response_short = pipeline(prompt, max_length=128)
.. _pipeline_model_parallelism:

Model Parallelism
-----------------

4 changes: 4 additions & 0 deletions docs/source/replicas.rst
@@ -1,2 +1,6 @@
Model Replicas
==============

DeepSpeed-MII supports creating multiple replicas of a model with
:doc:`deployment`. Please see :ref:`Persistent Deployment Model Replicas
<deployment_model_replicas>`.