Jaemin Choi minitu

Senior Deep Learning Architect at NVIDIA

13 followers · 0 following

@NVIDIA
Santa Clara, California, United States
05:01 (UTC -07:00)
https://www.linkedin.com/in/jaemincs/

Stars

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 7,323 711 Updated Jul 10, 2025

NVIDIA / TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…

Python 2,545 449 Updated Jul 11, 2025

NVIDIA / nvidia-resiliency-ext

NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to fa…

Python 187 29 Updated Jul 10, 2025

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 15,063 2,987 Updated Jul 11, 2025

NVIDIA / apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch

Python 8,716 1,460 Updated Jul 10, 2025

Lightning-AI / lightning-thunder

PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily write your own.

Python 1,375 102 Updated Jul 11, 2025

triton-lang / triton

Development repository for the Triton language and compiler

MLIR 16,115 2,105 Updated Jul 11, 2025

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 12,835 2,921 Updated Jul 11, 2025

1kc / razer-macos

Color effects manager for Razer devices for macOS. Supports High Sierra (10.13) to Monterey (12.0). Made by the community, based on openrazer.

JavaScript 2,503 282 Updated Apr 5, 2024

cmhungsteve / Awesome-Transformer-Attention

An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

4,899 495 Updated Jul 30, 2024

NVIDIA / DALI

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

C++ 5,456 642 Updated Jul 11, 2025

minitu / starter-academic

Jupyter Notebook 2 2 Updated Apr 25, 2024

ROCm / rocSHMEM

rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.

C++ 91 24 Updated Jul 11, 2025

jszaday / ergoline

An HPC-oriented, parallel programming language targeting Charm++. Aims to be to C++ as Scala is to Java.

Scala 3 Updated Mar 10, 2022

spcl / dace

DaCe - Data Centric Parallel Programming

Python 543 138 Updated Jul 11, 2025

spcl / NoPFS

Near-optimal Prefetching System

34 6 Updated Nov 17, 2021

pyg-team / pytorch_geometric

Graph Neural Network Library for PyTorch

Python 22,585 3,847 Updated Jul 11, 2025

dmlc / dgl

Python package built to ease deep learning on graph, on top of existing DL frameworks.

Python 13,971 3,047 Updated Feb 11, 2025

AppHouseKitchen / AlDente-Battery_Care_and_Monitoring

Menubar Tool to set Charge Limits and Prolong Battery Lifespan

Swift 8,529 315 Updated Jul 2, 2025

NVIDIA / hpc-container-maker

HPC Container Maker

Python 483 98 Updated Mar 26, 2025

NVIDIA-Merlin / HugeCTR

HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training

C++ 1,016 205 Updated Mar 23, 2025

facebookresearch / dlrm

An implementation of a deep learning recommendation model (DLRM)

Python 3,923 858 Updated May 30, 2025

open-mpi / ompi

Open MPI main development repository

C 2,377 906 Updated Jul 10, 2025

NVIDIA / libcudacxx

[ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl

C++ 2,307 189 Updated Feb 7, 2024

CLIUtils / CLI11

CLI11 is a command line parser for C++11 and beyond that provides a rich feature set with a simple and intuitive interface.

C++ 3,729 374 Updated Jun 30, 2025

gpudirect / libgdsync

GPUDirect Async support for IB Verbs

C++ 124 17 Updated Nov 10, 2022

gpudirect / libmp

Simple message passing library

Cuda 23 7 Updated Aug 28, 2018

AMDResearch / DAGEE

Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as task graphs that are scheduled concurrently and asynchronously…

C++ 46 8 Updated Oct 12, 2021

NVIDIA / multi-gpu-programming-models

Examples demonstrating available options to program multiple GPUs in a single node or a cluster

Cuda 752 128 Updated Feb 21, 2025

hainest / ChaNGa_test

tests for ChaNGa

Perl 2 2 Updated Sep 7, 2019
