Skip to content
Change the repository type filter

All

    Repositories list

    • Python
      Other
      0423Updated Oct 1, 2024Oct 1, 2024
    • Tensile

      Public
      Stretching GPU performance for GEMMs and tensor contractions.
      Python
      MIT License
      147214156Updated Oct 1, 2024Oct 1, 2024
    • This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit upstream at https://github.com/llvm/llvm-project.
      LLVM
      Other
      12k1132511Updated Oct 1, 2024Oct 1, 2024
    • hipBLASLt

      Public
      hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library
      Assembly
      MIT License
      80491448Updated Oct 1, 2024Oct 1, 2024
    • triton

      Public
      Development repository for the Triton language and compiler
      C++
      MIT License
      1.6k89744Updated Oct 1, 2024Oct 1, 2024
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      4.1k39020Updated Oct 1, 2024Oct 1, 2024
    • TensorFlow ROCm port
      C++
      Apache License 2.0
      74k6849054Updated Oct 1, 2024Oct 1, 2024
    • Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
      C++
      Other
      1132974529Updated Oct 1, 2024Oct 1, 2024
    • AMD's graph optimization engine.
      C++
      MIT License
      8418437431Updated Oct 1, 2024Oct 1, 2024
    • ROCm SMI LIB
      C++
      MIT License
      491161827Updated Oct 1, 2024Oct 1, 2024
    • Fast and memory-efficient exact attention
      Python
      BSD 3-Clause "New" or "Revised" License
      1.2k131276Updated Oct 1, 2024Oct 1, 2024
    • rocBLAS

      Public
      Next generation BLAS implementation for ROCm platform
      C++
      Other
      16133963Updated Oct 1, 2024Oct 1, 2024
    • aomp

      Public
      AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.
      Fortran
      Apache License 2.0
      44205839Updated Sep 30, 2024Sep 30, 2024
    • MIOpen

      Public
      AMD's Machine Intelligence Library
      Assembly
      Other
      2211.1k27052Updated Sep 30, 2024Sep 30, 2024
    • rocPyDecode is a set of Python bindings to rocDecode C++ library which provides full HW acceleration for video decoding on AMD GPUs.
      C++
      MIT License
      5023Updated Sep 30, 2024Sep 30, 2024
    • jax

      Public
      Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
      Python
      Apache License 2.0
      2.8k1701Updated Sep 30, 2024Sep 30, 2024
    • HIPIFY

      Public
      HIPIFY: Convert CUDA to Portable C++ Code
      C++
      MIT License
      70503211Updated Sep 30, 2024Sep 30, 2024
    • pytorch

      Public
      Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      Other
      22k21910523Updated Sep 30, 2024Sep 30, 2024
    • aotriton

      Public
      Ahead of Time (AOT) Triton Math Library
      Python
      MIT License
      133692Updated Sep 30, 2024Sep 30, 2024
    • C
      MIT License
      91009Updated Sep 30, 2024Sep 30, 2024
    • apex

      Public
      A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
      Python
      BSD 3-Clause "New" or "Revised" License
      1.4k18136Updated Sep 30, 2024Sep 30, 2024
    • rocMLIR

      Public
      C++
      40123312Updated Sep 30, 2024Sep 30, 2024
    • DeepSpeed

      Public
      DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
      Python
      Apache License 2.0
      4.1k462Updated Sep 30, 2024Sep 30, 2024
    • ROCm

      Public
      AMD ROCm™ Software - GitHub Home
      Shell
      MIT License
      3734.5k14215Updated Sep 30, 2024Sep 30, 2024
    • HIP

      Public
      HIP: C++ Heterogeneous-Compute Interface for Portability
      C++
      MIT License
      5283.7k6649Updated Sep 30, 2024Sep 30, 2024
    • xgboost

      Public
      Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
      C++
      Apache License 2.0
      8.7k120Updated Sep 30, 2024Sep 30, 2024
    • xla

      Public
      A machine learning compiler for GPUs, CPUs, and ML accelerators
      C++
      Apache License 2.0
      4072011Updated Sep 30, 2024Sep 30, 2024
    • Jupyter Notebook
      72801Updated Sep 30, 2024Sep 30, 2024
    • hipTensor

      Public
      AMD’s C++ library for accelerating tensor primitives
      C++
      MIT License
      163404Updated Sep 30, 2024Sep 30, 2024
    • Python
      MIT License
      2300Updated Sep 30, 2024Sep 30, 2024