Starred repositories
Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception handling, networking and user-space IO
Parallel Computing starter project to build GPU & CPU kernels in CUDA & C++ and call them from Python without a single line of CMake using PyBind11
NVIDIA curated collection of educational resources related to general purpose GPU programming.
This repo contains CUDA-Q Academic materials, including self-paced Jupyter notebook modules for building and optimizing hybrid quantum-classical algorithms using CUDA-Q.
Iso Fortran HPC Tutorial materials
[ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl
Tutorial material for Julia basic training
Get started with your NVIDIA Arm HPC Developers Kit!
NPBench - A Benchmarking Suite for High-Performance NumPy
Fortran language support for Visual Studio Code
A plugin for Jupyter Notebook to run CUDA C/C++ code
QMCPACK miniapp: a simplified real space QMC code for algorithm development, performance portability testing, and computer science experiments
Celeritas is a new Monte Carlo transport code designed to accelerate scientific discovery in high energy physics by improving detector simulation throughput and energy efficiency using GPUs.
Benchmarks for experimentation of performance of features of OpenMP in the SOLLVE project.
Proposals for the Fortran Standard Committee
Environment modules for NGC containers
Set of 42 protein-ligand complexes for testing search algorithms and docking runtime
Training materials provided by OpenACC.org.
Sources for the Oak Ridge Leadership Computing Facility User Documentation