Enable Distributed Shampoo (v2) in HPC TorchRec (#1459)
Summary:
Pull Request resolved: #1459

See title. Pair programmed with Tsung-Hsien Lee (zong).

NOTE: Checkpointing in HPC TorchRec currently does not work.

Next step is to enhance TorchRec checkpointing (cc: YLGH).

Differential Revision: D50479625

fbshipit-source-id: 04fafa41a4aa1f5f0a779b3f7e34f9f146022c8c
Michael Shi authored and facebook-github-bot committed Dec 9, 2023
1 parent fba7f3f commit 6099753
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions torchrec/distributed/embedding_types.py
@@ -44,6 +44,7 @@ class OptimType(Enum):
     ADAGRAD = "ADAGRAD"
     ROWWISE_ADAGRAD = "ROWWISE_ADAGRAD"
     SHAMPOO = "SHAMPOO"
+    SHAMPOO_V2 = "SHAMPOO_V2"
     LION = "LION"
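The change adds a single string-valued member to an Enum. As a rough illustration of what that makes possible, the sketch below uses a stand-in OptimType containing only the members visible in this hunk (the real class in torchrec/distributed/embedding_types.py has more) and a hypothetical helper, parse_optim_type, which is not part of TorchRec. It shows only how a config string could resolve to the new member, not how the optimizer itself is wired in.

```python
from enum import Enum


class OptimType(Enum):
    # Stand-in limited to the members visible in the hunk; the real enum
    # in torchrec/distributed/embedding_types.py defines more members.
    ADAGRAD = "ADAGRAD"
    ROWWISE_ADAGRAD = "ROWWISE_ADAGRAD"
    SHAMPOO = "SHAMPOO"
    SHAMPOO_V2 = "SHAMPOO_V2"  # the member added by this commit
    LION = "LION"


def parse_optim_type(name: str) -> OptimType:
    """Hypothetical helper: map a config string to an OptimType member."""
    try:
        # Enum lookup by value; raises ValueError for unknown strings.
        return OptimType(name)
    except ValueError:
        valid = ", ".join(member.value for member in OptimType)
        raise ValueError(
            f"unknown optimizer type {name!r}; expected one of: {valid}"
        ) from None
```

With this stand-in, parse_optim_type("SHAMPOO_V2") returns OptimType.SHAMPOO_V2, which is a distinct member from OptimType.SHAMPOO, so code that dispatches on the enum can treat the two versions separately.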

