Skip to content
Change the repository type filter

All

    Repositories list

    • ROCm

      Public
      AMD ROCm™ Software - GitHub Home
      Shell
      MIT License
      3944.8k11618Updated Jan 6, 2025Jan 6, 2025
    • MIOpen

      Public
      AMD's Machine Intelligence Library
      Assembly
      Other
      2331.1k24864Updated Jan 6, 2025Jan 6, 2025
    • hipBLASLt

      Public
      hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library
      Assembly
      MIT License
      9771767Updated Jan 6, 2025Jan 6, 2025
    • Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
      C++
      Other
      1373312648Updated Jan 6, 2025Jan 6, 2025
    • triton

      Public
      Development repository for the Triton language and compiler
      C++
      MIT License
      1.7k101943Updated Jan 6, 2025Jan 6, 2025
    • This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit upstream at https://github.com/llvm/llvm-project.
      LLVM
      Other
      12k1263217Updated Jan 6, 2025Jan 6, 2025
    • aomp

      Public
      AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.
      Fortran
      Apache License 2.0
      48210142Updated Jan 6, 2025Jan 6, 2025
    • C++
      MIT License
      101786Updated Jan 6, 2025Jan 6, 2025
    • FBGEMM

      Public
      FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
      C++
      Other
      5201011Updated Jan 6, 2025Jan 6, 2025
    • pytorch

      Public
      Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      Other
      23k2207642Updated Jan 6, 2025Jan 6, 2025
    • Fast and memory-efficient exact attention
      Python
      BSD 3-Clause "New" or "Revised" License
      1.4k1502412Updated Jan 6, 2025Jan 6, 2025
    • ROCgdb

      Public
      This is ROCgdb, the ROCm source-level debugger for Linux, based on GDB, the GNU source-level debugger.
      C
      GNU General Public License v2.0
      105231Updated Jan 6, 2025Jan 6, 2025
    • jax

      Public
      Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
      Python
      Apache License 2.0
      2.8k19014Updated Jan 6, 2025Jan 6, 2025
    • TensorFlow ROCm port
      C++
      Apache License 2.0
      74k6896961Updated Jan 5, 2025Jan 5, 2025
    • rocDecode

      Public
      rocDecode is a high performance video decode SDK for AMD hardware
      C++
      Other
      172033Updated Jan 5, 2025Jan 5, 2025
    • xla

      Public
      A machine learning compiler for GPUs, CPUs, and ML accelerators
      C++
      Apache License 2.0
      4593018Updated Jan 5, 2025Jan 5, 2025
    • rocJPEG

      Public
      rocJPEG is a high-performance jpeg decode SDK for decoding jpeg images using a hardware-accelerated jpeg decoder on AMD’s GPUs.
      C++
      MIT License
      7312Updated Jan 5, 2025Jan 5, 2025
    • AMD's graph optimization engine.
      C++
      MIT License
      8819335052Updated Jan 5, 2025Jan 5, 2025
    • Advanced Profiling and Analytics for AMD Hardware
      Python
      MIT License
      511395010Updated Jan 5, 2025Jan 5, 2025
    • Libraries integrating migraphx with pytorch
      Python
      BSD 3-Clause "New" or "Revised" License
      26134Updated Jan 4, 2025Jan 4, 2025
    • aotriton

      Public
      Ahead of Time (AOT) Triton Math Library
      Python
      MIT License
      1648101Updated Jan 4, 2025Jan 4, 2025
    • xformers

      Public
      Hackable and optimized Transformers building blocks, supporting a composable construction.
      Python
      Other
      6352284Updated Jan 4, 2025Jan 4, 2025
    • ROCm Systems Profiler
      C++
      MIT License
      61302Updated Jan 4, 2025Jan 4, 2025
    • ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime
      C++
      Other
      1112331526Updated Jan 4, 2025Jan 4, 2025
    • clr

      Public
      C++
      MIT License
      531111415Updated Jan 4, 2025Jan 4, 2025
    • ROCdbgapi

      Public
      The AMD Debugger API is a library that provides all the support necessary for a debugger and other tools to perform low level control of the execution and inspection of execution state of AMD's commercially available GPU architectures.
      C++
      MIT License
      151931Updated Jan 4, 2025Jan 4, 2025
    • rocPyDecode is a set of Python bindings to rocDecode C++ library which provides full HW acceleration for video decoding on AMD GPUs.
      C++
      MIT License
      7305Updated Jan 4, 2025Jan 4, 2025
    • rccl

      Public
      ROCm Communication Collectives Library (RCCL)
      C++
      Other
      1282871422Updated Jan 4, 2025Jan 4, 2025
    • Python
      Other
      51595Updated Jan 4, 2025Jan 4, 2025
    • flang

      Public
      Mirror of flang repo: The source repo is https://github.com/flang-compiler/flang . Once a day the master branch is updated from the upstream source repo and then locked. AOMP or ROCm developers may commit or create PRs on branch aomp-dev.
      C++
      Other
      86010Updated Jan 4, 2025Jan 4, 2025