Skip to content

AMD ROCm™ Software

AMD ROCm software is AMD's Open Source stack for GPU computation.

To learn more about ROCm, check out our Documentation, Examples, and Developer Hub.

If you have questions or need help, reach out to us on GitHub.

Popular repositories Loading

  1. ROCm ROCm Public

    AMD ROCm™ Software - GitHub Home

    Shell 4.9k 398

  2. HIP HIP Public

    HIP: C++ Heterogeneous-Compute Interface for Portability

    C++ 3.8k 546

  3. MIOpen MIOpen Public

    AMD's Machine Intelligence Library

    Assembly 1.1k 236

  4. tensorflow-upstream tensorflow-upstream Public

    Forked from tensorflow/tensorflow

    TensorFlow ROCm port

    C++ 689 97

  5. HIPIFY HIPIFY Public

    HIPIFY: Convert CUDA to Portable C++ Code

    C++ 538 79

  6. ROCm-docker ROCm-docker Public

    Dockerfiles for the various software layers defined in the ROCm software platform

    Shell 439 69

Repositories

Showing 10 of 296 repositories
  • rocm-docs-core Public

    ROCm Documentation Python package for ReadTheDocs build standardization

    ROCm/rocm-docs-core’s past year of commit activity
    CSS 13 17 7 4 Updated Jan 23, 2025
  • composable_kernel Public

    Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

    ROCm/composable_kernel’s past year of commit activity
    C++ 337 139 22 (1 issue needs help) 51 Updated Jan 23, 2025
  • ROCm Public

    AMD ROCm™ Software - GitHub Home

    ROCm/ROCm’s past year of commit activity
    Shell 4,857 MIT 398 109 12 Updated Jan 23, 2025
  • apex Public Forked from NVIDIA/apex

    A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

    ROCm/apex’s past year of commit activity
    Python 19 BSD-3-Clause 1,436 13 6 Updated Jan 23, 2025
  • MIOpen Public

    AMD's Machine Intelligence Library

    ROCm/MIOpen’s past year of commit activity
    Assembly 1,109 236 246 (4 issues need help) 54 Updated Jan 23, 2025
  • ROCm/TransformerEngine’s past year of commit activity
    Python 15 7 9 9 Updated Jan 23, 2025
  • vllm Public Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    ROCm/vllm’s past year of commit activity
    Python 56 Apache-2.0 5,370 3 25 Updated Jan 23, 2025
  • rocSPARSE Public

    Next generation SPARSE implementation for ROCm platform

    ROCm/rocSPARSE’s past year of commit activity
    C++ 118 MIT 56 1 0 Updated Jan 23, 2025
  • flash-attention Public Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    ROCm/flash-attention’s past year of commit activity
    Python 152 BSD-3-Clause 1,439 24 12 Updated Jan 23, 2025
  • hipBLAS Public

    ROCm BLAS marshalling library

    ROCm/hipBLAS’s past year of commit activity
    C++ 126 80 1 5 Updated Jan 23, 2025