Skip to content
Change the repository type filter

All

    Repositories list

    • 🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.
      Python
      Apache License 2.0
      44241719Updated Oct 24, 2024Oct 24, 2024
    • Scan resources consumed during LLM training
      Python
      Apache License 2.0
      1710Updated Oct 24, 2024Oct 24, 2024
    • 🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.
      Python
      Apache License 2.0
      481583537Updated Oct 23, 2024Oct 23, 2024
    • 🚀 Guardrails orchestration server for application of various detections on text generation input and output.
      Rust
      Apache License 2.0
      153273Updated Oct 23, 2024Oct 23, 2024
    • Estimate resources needed to train LLMs
      Python
      Apache License 2.0
      61012Updated Oct 23, 2024Oct 23, 2024
    • Demonstration of MoE distributed training using various techniques
      Python
      0101Updated Oct 23, 2024Oct 23, 2024
    • 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.
      Python
      Apache License 2.0
      64112Updated Oct 23, 2024Oct 23, 2024
    • fms-fsdp

      Public
      🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash attention v2.
      Python
      Apache License 2.0
      29175137Updated Oct 22, 2024Oct 22, 2024
    • pod-vllm

      Public
      Source code to launch a number of pods, performing synthetic data generation
      Python
      Apache License 2.0
      0000Updated Oct 22, 2024Oct 22, 2024
    • fms-dgt

      Public
      Synthetic Data Generation for Foundation Models
      Python
      Apache License 2.0
      181426Updated Oct 21, 2024Oct 21, 2024
    • Go
      Apache License 2.0
      21101Updated Oct 10, 2024Oct 10, 2024
    • Dockerfile
      Apache License 2.0
      4200Updated Sep 30, 2024Sep 30, 2024
    • Python
      Apache License 2.0
      82034Updated Sep 9, 2024Sep 9, 2024
    • Go
      Apache License 2.0
      534131Updated Aug 29, 2024Aug 29, 2024
    • High-performance safetensors model loader
      Python
      Apache License 2.0
      2410Updated Jul 30, 2024Jul 30, 2024
    • avengers

      Public
      Shell
      Apache License 2.0
      0040Updated Jul 20, 2024Jul 20, 2024
    • trl

      Public
      Train transformer language models with reinforcement learning.
      Python
      Apache License 2.0
      1.2k002Updated Mar 5, 2024Mar 5, 2024
    • Training job management tool for foundation model service
      Python
      Apache License 2.0
      4560Updated Feb 28, 2024Feb 28, 2024
    • Operator that enables EFA and/or GDRCOPY in an OpenShift cluster
      Go
      Apache License 2.0
      0000Updated Nov 22, 2023Nov 22, 2023
    • Training operators on Kubernetes.
      Python
      Apache License 2.0
      696000Updated Nov 16, 2022Nov 16, 2022