Skip to content
Change the repository type filter

All

    Repositories list

    • xla

      Public
      A machine learning compiler for GPUs, CPUs, and ML accelerators
      C++
      Apache License 2.0
      4342012Updated Nov 15, 2024Nov 15, 2024
    • HIP

      Public
      HIP: C++ Heterogeneous-Compute Interface for Portability
      C++
      MIT License
      5383.8k3149Updated Nov 15, 2024Nov 15, 2024
    • ROCm

      Public
      AMD ROCm™ Software - GitHub Home
      Shell
      MIT License
      3864.6k10913Updated Nov 15, 2024Nov 15, 2024
    • AMD's graph optimization engine.
      C++
      MIT License
      8618534241Updated Nov 15, 2024Nov 15, 2024
    • triton

      Public
      Development repository for the Triton language and compiler
      C++
      MIT License
      1.6k931044Updated Nov 15, 2024Nov 15, 2024
    • This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit upstream at https://github.com/llvm/llvm-project.
      LLVM
      Other
      12k1222813Updated Nov 15, 2024Nov 15, 2024
    • HIPIFY

      Public
      HIPIFY: Convert CUDA to Portable C++ Code
      C++
      MIT License
      75523201Updated Nov 15, 2024Nov 15, 2024
    • hipBLASLt

      Public
      hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library
      Assembly
      MIT License
      8762865Updated Nov 15, 2024Nov 15, 2024
    • rocMLIR

      Public
      C++
      40128117Updated Nov 15, 2024Nov 15, 2024
    • pytorch

      Public
      Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      Other
      23k21910531Updated Nov 15, 2024Nov 15, 2024
    • rocWMMA

      Public
      rocWMMA
      C++
      MIT License
      269123Updated Nov 15, 2024Nov 15, 2024
    • Fast and memory-efficient exact attention
      Python
      BSD 3-Clause "New" or "Revised" License
      1.3k138279Updated Nov 15, 2024Nov 15, 2024
    • MIOpen

      Public
      AMD's Machine Intelligence Library
      Assembly
      Other
      2281.1k24948Updated Nov 15, 2024Nov 15, 2024
    • FBGEMM

      Public
      FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
      C++
      Other
      5011010Updated Nov 15, 2024Nov 15, 2024
    • ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime
      C++
      Other
      1092241226Updated Nov 15, 2024Nov 15, 2024
    • clr

      Public
      C++
      MIT License
      491001215Updated Nov 15, 2024Nov 15, 2024
    • ROCgdb

      Public
      This is ROCgdb, the ROCm source-level debugger for Linux, based on GDB, the GNU source-level debugger.
      C
      GNU General Public License v2.0
      105051Updated Nov 15, 2024Nov 15, 2024
    • hip-tests

      Public
      C++
      MIT License
      3130223Updated Nov 15, 2024Nov 15, 2024
    • ROCm SMI LIB
      C++
      MIT License
      501231124Updated Nov 15, 2024Nov 15, 2024
    • amdsmi

      Public
      AMD SMI
      C++
      MIT License
      264166Updated Nov 15, 2024Nov 15, 2024
    • TensorFlow ROCm port
      C++
      Apache License 2.0
      74k6889359Updated Nov 15, 2024Nov 15, 2024
    • TensorFlow Serving dockerfiles for ROCM
      Shell
      Apache License 2.0
      2010Updated Nov 15, 2024Nov 15, 2024
    • Tensile

      Public
      Stretching GPU performance for GEMMs and tensor contractions.
      Python
      MIT License
      15022293Updated Nov 15, 2024Nov 15, 2024
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      4.6k45024Updated Nov 15, 2024Nov 15, 2024
    • rocDecode

      Public
      rocDecode is a high performance video decode SDK for AMD hardware
      C++
      Other
      161345Updated Nov 15, 2024Nov 15, 2024
    • 🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
      Python
      Apache License 2.0
      27k403Updated Nov 15, 2024Nov 15, 2024
    • Python
      Other
      31295Updated Nov 15, 2024Nov 15, 2024
    • Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
      C++
      Other
      1283113244Updated Nov 15, 2024Nov 15, 2024
    • rccl

      Public
      ROCm Communication Collectives Library (RCCL)
      C++
      Other
      120268917Updated Nov 14, 2024Nov 14, 2024
    • aomp

      Public
      AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.
      Fortran
      Apache License 2.0
      47206838Updated Nov 14, 2024Nov 14, 2024