8000 minitu (Jaemin Choi) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View minitu's full-sized avatar

Organizations

@UIUC-PPL

Block or report minitu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 7,323 711 Updated Jul 10, 2025

A library for 10000 accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…

Python 2,545 449 Updated Jul 11, 2025

NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to fa…

Python 187 29 Updated Jul 10, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 15,063 2,987 Updated Jul 11, 2025

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Python 8,716 1,460 Updated Jul 10, 2025

PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily write your own.

Python 1,375 102 Updated Jul 11, 2025

Development repository for the Triton language and compiler

MLIR 16,115 2,105 Updated Jul 11, 2025

Ongoing research training transformer models at scale

Python 12,835 2,921 Updated Jul 11, 2025

Color effects manager for Razer devices for macOS. Supports High Sierra (10.13) to Monterey (12.0). Made by the community, based on openrazer.

JavaScript 2,503 282 Updated Apr 5, 2024

An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

4,899 495 Updated Jul 30, 2024

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

C++ 5,456 642 Updated Jul 11, 2025
Jupyter Notebook 2 2 Updated Apr 25, 2024

rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.

C++ 91 24 Updated Jul 11, 2025

An HPC-oriented, parallel programming language targeting Charm++. Aims to be to C++ as Scala is to Java.

Scala 3 Updated Mar 10, 2022

DaCe - Data Centric Parallel Programming

Python 543 138 Updated Jul 11, 2025

Near-optimal Prefetching System

34 6 Updated Nov 17, 2021

Graph Neural Network Library for PyTorch

Python 22,585 3,847 Updated Jul 11, 2025

Python package built to ease deep learning on graph, on top of existing DL frameworks.

Python 13,971 3,047 Updated Feb 11, 2025

Menubar Tool to set Charge Limits and Prolong Battery Lifespan

Swift 8,529 315 Updated Jul 2, 2025

HPC Container Maker

Python 483 98 Updated Mar 26, 2025

HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training

C++ 1,016 205 Updated Mar 23, 2025

An implementation of a deep learning recommendation model (DLRM)

Python 3,923 858 Updated May 30, 2025

Open MPI main development repository

C 2,377 906 Updated Jul 10, 2025

[ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl

C++ 2,307 189 Updated Feb 7, 2024

CLI11 is a command line parser for C++11 and beyond that provides a rich feature set with a simple and intuitive interface.

C++ 3,729 374 Updated Jun 30, 2025

GPUDirect Async support for IB Verbs

C++ 124 17 Updated Nov 10, 2022

Simple message passing library

Cuda 23 7 Updated Aug 28, 2018

Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as task graphs that are scheduled concurrently and asynchronously…

C++ 46 8 Updated Oct 12, 2021

Examples demonstrating available options to program multiple GPUs in a single node or a cluster

Cuda 752 128 Updated Feb 21, 2025

tests for ChaNGa

Perl 2 2 Updated Sep 7, 2019
Next
0