Hugging Face articles:
- Model parallelism basics: https://huggingface.co/docs/transformers/v4.15.0/en/parallelism#zero-data-parallel
- FSDP: https://huggingface.co/docs/accelerate/en/usage_guides/fsdp
- Deepspeed: https://huggingface.co/docs/accelerate/en/usage_guides/deepspeed
- Megatron-LM: https://huggingface.co/docs/accelerate/en/usage_guides/megatron_lm
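The FSDP, DeepSpeed, and Megatron-LM guides above share the same entry point: a standard training loop wrapped with Accelerate, with the actual parallelism backend chosen through `accelerate config` (or an Accelerate config file). Below is a minimal sketch of that pattern, assuming nothing from the linked guides; the tiny linear model, random data, and hyperparameters are placeholders.

```python
# Minimal Accelerate training-loop sketch. FSDP or DeepSpeed is selected
# via `accelerate config`; the loop itself stays the same.
import torch
from torch.utils.data import DataLoader, TensorDataset
from accelerate import Accelerator

accelerator = Accelerator()  # reads the distributed config (DDP/FSDP/DeepSpeed)

model = torch.nn.Linear(1024, 1024)   # placeholder for a real transformer
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
dataset = TensorDataset(torch.randn(64, 1024), torch.randn(64, 1024))
loader = DataLoader(dataset, batch_size=8)

# prepare() wraps the objects for the chosen parallelism backend
model, optimizer, loader = accelerator.prepare(model, optimizer, loader)

model.train()
for x, y in loader:
    optimizer.zero_grad()
    loss = torch.nn.functional.mse_loss(model(x), y)
    accelerator.backward(loss)   # replaces loss.backward()
    optimizer.step()
```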
Training Optimization: https://developer.nvidia.com/blog/mastering-llm-techniques-training
Inference Optimization: https://developer.nvidia.com/blog/mastering-llm-techniques-inference-optimization
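Among the techniques the training-optimization post discusses are mixed precision and gradient accumulation. The sketch below shows how the two are commonly combined in PyTorch; it assumes a CUDA device, and the one-layer model, random batches, and step counts are placeholders rather than anything from the linked article.

```python
# Sketch of two common training optimizations: mixed precision
# (autocast + GradScaler) and gradient accumulation. Assumes a GPU.
import torch

model = torch.nn.Linear(4096, 4096).cuda()
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
scaler = torch.cuda.amp.GradScaler()
accum_steps = 4  # accumulate gradients over 4 micro-batches

for step in range(16):
    x = torch.randn(8, 4096, device="cuda")
    with torch.autocast(device_type="cuda", dtype=torch.float16):
        loss = model(x).float().pow(2).mean() / accum_steps
    scaler.scale(loss).backward()      # fp16-safe scaled backward
    if (step + 1) % accum_steps == 0:
        scaler.step(optimizer)         # unscale gradients, then step
        scaler.update()
        optimizer.zero_grad(set_to_none=True)
```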
RAG: https://www.youtube.com/watch?v=YuRFba27_1w
Agent: https://www.youtube.com/watch?v=q1XFm21I-VQ
https://www.youtube.com/watch?v=45Zs12Xlg2g
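At its core, RAG is retrieve-then-generate: embed a corpus, retrieve the chunks closest to the query, and prepend them to the prompt. The toy sketch below illustrates only that flow; the hash-based embed(), the three-sentence corpus, and the final print are stand-ins for a real embedding model, vector store, and LLM call.

```python
# Toy retrieval-augmented generation (RAG) sketch.
import numpy as np

def embed(text: str) -> np.ndarray:
    # Placeholder: a real system would call an embedding model here.
    rng = np.random.default_rng(abs(hash(text)) % (2**32))
    v = rng.standard_normal(128)
    return v / np.linalg.norm(v)

corpus = [
    "FSDP shards parameters, gradients and optimizer states across ranks.",
    "Megatron-LM splits individual layers across GPUs (tensor parallelism).",
    "DeepSpeed ZeRO stages trade memory for communication.",
]
doc_vecs = np.stack([embed(d) for d in corpus])

def retrieve(query: str, k: int = 2) -> list[str]:
    scores = doc_vecs @ embed(query)          # cosine similarity (unit vectors)
    return [corpus[i] for i in np.argsort(scores)[::-1][:k]]

query = "How does FSDP save memory?"
context = "\n".join(retrieve(query))
prompt = f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"
print(prompt)  # a real pipeline would send this prompt to an LLM
```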
Papers:
- Two-Tree Algorithms for Full Bandwidth Broadcast, Reduction and Scan
- TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings
- Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
- Efficient Large-Scale Language Model Training on GPU Clusters Using Megatron-LM
- Reducing Activation Recomputation in Large Transformer Models
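The two Megatron-LM papers above describe tensor (intra-layer) model parallelism: the first MLP weight is split by columns across ranks, the second by rows, so each rank produces a partial output and a single all-reduce recovers the full result. The sketch below only simulates that split on one process to show the arithmetic; the dimensions and the simulated world size of 2 are arbitrary choices, not taken from the papers.

```python
# Conceptual sketch of Megatron-LM-style tensor parallelism for a 2-layer MLP,
# simulating 2 ranks in a single process (sum() stands in for the all-reduce).
import torch

torch.manual_seed(0)
d_model, d_ff, world_size = 8, 16, 2

x = torch.randn(4, d_model)       # input activations (replicated on all ranks)
A = torch.randn(d_model, d_ff)    # first linear weight
B = torch.randn(d_ff, d_model)    # second linear weight

# Reference (single-device) forward pass.
ref = torch.nn.functional.gelu(x @ A) @ B

# Tensor-parallel forward: rank r holds a column slice of A and a row slice of B.
A_shards = A.chunk(world_size, dim=1)     # column parallel
B_shards = B.chunk(world_size, dim=0)     # row parallel
partials = [torch.nn.functional.gelu(x @ A_shards[r]) @ B_shards[r]
            for r in range(world_size)]
out = sum(partials)                        # the all-reduce across ranks

print(torch.allclose(out, ref, atol=1e-5))  # True: sharded result matches
```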
Something critical:
- Do not try to use an LLM to enhance the learning process, e.g., by generating questions and answers; you will get nothing out of it.
- Consult an expert to save the most time.
Thanks to Liyue Zhang and Guangnan Feng