Skip to content
Change the repository type filter

All

    Repositories list

    • barney

      Public
      A Scalable (and Optionally, Data-Parallel) ANARI Multi-GPU Path Tracer
      C++
      41930Updated Dec 10, 2025Dec 10, 2025
    • cuda-python

      Public
      CUDA Python: Performance meets Productivity
      Cython
      2283.1k20913Updated Dec 10, 2025Dec 10, 2025
    • nv-redfish

      Public
      NVIDIA's Redfish next generation redfish crate
      Rust
      2601Updated Dec 10, 2025Dec 10, 2025
    • cuda-quantum

      Public
      C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
      C++
      30987140688Updated Dec 10, 2025Dec 10, 2025
    • Megatron-LM

      Public
      Ongoing research training transformer models at scale
      Python
      3.4k14k332240Updated Dec 10, 2025Dec 10, 2025
    • nvidia-resiliency-ext

      Public
      NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to failures and interruptions.
      Python
      37239118Updated Dec 10, 2025Dec 10, 2025
    • Fuser

      Public
      A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
      C++
      71365208213Updated Dec 10, 2025Dec 10, 2025
    • nvidia-code-mgmt

      Public
      Non-PLDM firmware update infrastructure
      C++
      1400Updated Dec 10, 2025Dec 10, 2025
    • C++
      1025995515Updated Dec 10, 2025Dec 10, 2025
    • cccl

      Public
      CUDA Core Compute Libraries
      C++
      3002.1k1.1k204Updated Dec 10, 2025Dec 10, 2025
    • bionemo-framework

      Public
      BioNeMo Framework: For building and adapting AI models in drug discovery at scale
      Jupyter Notebook
      10459860103Updated Dec 10, 2025Dec 10, 2025
    • aerial-framework

      Public
      A toolchain for generating high-performance, GPU-accelerated 5G/6G pipelines from Python and a modular, real-time runtime for executing the pipelines on NVIDIA Aerial™ RAN Computer platforms.
      2800Updated Dec 10, 2025Dec 10, 2025
    • Model-Optimizer

      Public
      A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
      Python
      2111.6k6854Updated Dec 10, 2025Dec 10, 2025
    • NeMo-Agent-Toolkit

      Public
      The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
      Python
      4451.6k5429Updated Dec 10, 2025Dec 10, 2025
    • OSMO

      Public
      The developer-first platform for scaling complex Physical AI workloads across heterogeneous compute—unifying training GPUs, simulation clusters, and edge devices in a simple YAML
      Python
      35776Updated Dec 10, 2025Dec 10, 2025
    • cutlass

      Public
      CUDA Templates and Python DSLs for High-Performance Linear Algebra
      C++
      1.6k8.9k41091Updated Dec 10, 2025Dec 10, 2025
    • stdexec

      Public
      `std::execution`, the proposed C++ framework for asynchronous and parallel programming.
      C++
      2202.1k11213Updated Dec 10, 2025Dec 10, 2025
    • Q2RTX

      Public
      NVIDIA’s implementation of RTX ray-tracing in Quake II
      C
      1921.3k573Updated Dec 10, 2025Dec 10, 2025
    • cuopt

      Public
      GPU accelerated decision optimization
      Cuda
      1016087526Updated Dec 10, 2025Dec 10, 2025
    • earth2studio

      Public
      Open-source deep-learning framework for exploring, building and deploying AI weather/climate workflows.
      Python
      84315109Updated Dec 10, 2025Dec 10, 2025
    • accelerated-computing-hub

      Public
      NVIDIA curated collection of educational resources related to general purpose GPU programming.
      Jupyter Notebook
      169949145Updated Dec 10, 2025Dec 10, 2025
    • NVFlare

      Public
      NVIDIA Federated Learning Application Runtime Environment
      Python
      2258461514Updated Dec 10, 2025Dec 10, 2025
    • TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
      Python
      1.9k12k589465Updated Dec 9, 2025Dec 9, 2025
    • nv-ingest

      Public
      NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.
      Python
      2792.8k10137Updated Dec 9, 2025Dec 9, 2025
    • spark-rapids-tools

      Public
      User tools for Spark RAPIDS
      Scala
      46652622Updated Dec 9, 2025Dec 9, 2025
    • maxtext-jaxpp

      Public
      Showcase JaxPP with MaxText
      Python
      436401Updated Dec 9, 2025Dec 9, 2025
    • libredfish

      Public
      A Rust Crate for interacting with DTMF Redfish endpoints
      Rust
      7901Updated Dec 9, 2025Dec 9, 2025
    • NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments
      Go
      26110358Updated Dec 9, 2025Dec 9, 2025
    • aistore

      Public
      AIStore: scalable storage for AI applications
      Go
      2301.7k00Updated Dec 9, 2025Dec 9, 2025
    • JAX-Toolbox
      Python
      683678044Updated Dec 9, 2025Dec 9, 2025