8000 hebiao064 (Stefan He) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View hebiao064's full-sized avatar

Block or report hebiao064

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

slime is a LLM post-training framework aiming at scaling RL.

Python 445 19 Updated Jun 25, 2025

A Quirky Assortment of CuTe Kernels

Python 116 2 Updated Jun 24, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 1 1 Updated Jun 19, 2025

Config files for my GitHub profile.

1 Updated Jun 17, 2025
SCSS 1 Updated Jun 24, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 1 1 Updated Jun 24, 2025

Allow torch tensor memory to be released and resumed later

C++ 2 Updated Jun 17, 2025

Nano vLLM

Python 3,920 413 Updated Jun 24, 2025

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

LLVM 33,179 14,292 Updated Jun 25, 2025

📄 Awesome CV is LaTeX template for your outstanding job application

TeX 24,665 4,994 Updated Feb 6, 2025

cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it

C++ 582 122 Updated Jun 12, 2025

Fast and memory-efficient exact attention

Python 10 5 Updated May 15, 2025

A toolkit to run Ray applications on Kubernetes

Go 1,844 549 Updated Jun 24, 2025

Allow torch tensor memory to be released and resumed later

C++ 48 8 Updated Jun 17, 2025

Distributed RL System for LLM Reasoning

Python 1,855 98 Updated Jun 25, 2025

Run LLMs with MLX

Python 1,139 143 Updated Jun 17, 2025

NVIDIA curated collection of educational resources related to general purpose GPU programming.

Jupyter Notebook 538 93 Updated Jun 7, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 9,951 1,628 Updated Jun 25, 2025

A next generation Python CMake adaptor and Python API for plugins

Python 338 66 Updated Jun 19, 2025

Seamless operability between C++11 and Python

C++ 16,749 2,187 Updated Jun 24, 2025

DeeperGEMM: crazy optimized version

Cuda 69 Updated May 5, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA.

Cuda 4,883 534 Updated Jun 21, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++ 10,839 1,520 Updated Jun 25, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 1,321 105 Updated Jun 24, 2025

how to optimize some algorithm in cuda.

Cuda 2,277 202 Updated Jun 25, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 3,239 352 Updated Jun 25, 2025

CUDA Templates for Linear Algebra Subroutines

C++ 7,746 1,285 Updated Jun 12, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,472 626 Updated Jun 23, 2025

A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS

190 9 Updated May 6, 2025

[ACL 2025] CoT-ICL Lab: A Synthetic Framework for Studying Chain-of-Thought Learning from In-Context Demonstrations

Python 11 1 Updated May 23, 2025
Next
0