8000 RissyRan (Ran Ran) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View RissyRan's full-sized avatar

Block or report RissyRan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 178 14 Updated May 9, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 14,200 1,715 Updated May 11, 2025

Minimalistic large language model 3D-parallelism training

Python 1,852 187 Updated May 11, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 47,045 7,341 Updated May 11, 2025

Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.

Python 61 16 Updated Apr 24, 2025
Python 352 32 Updated Apr 12, 2024
Python 109 14 Updated May 8, 2025

Expert Parallelism Load Balancer

Python 1,175 189 Updated Mar 24, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,314 585 Updated May 9, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,616 753 Updated May 8, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,756 276 Updated Apr 14, 2025

The LLM Evaluation Framework

Python 6,261 543 Updated May 10, 2025

s1: Simple test-time scaling

Python 6,352 745 Updated Apr 4, 2025

Large Language Model Text Generation Inference

Python 10,102 1,195 Updated May 11, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 7,842 892 Updated May 11, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 1,783 127 Updated May 10, 2025

Fully open reproduction of DeepSeek-R1

Python 24,357 2,238 Updated May 11, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 20,787 1,362 Updated May 9, 2025

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Python 3,040 601 Updated Jul 19, 2024

[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark

Python 377 13 Updated Jul 9, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 50,183 5,395 Updated May 11, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 48,503 6,946 Updated Apr 20, 2025

Low-code framework for building custom LLMs, neural networks, and other AI models

Python 11,444 1,207 Updated May 5, 2025

Your shell history: synced, queryable, and in context

Go 2,742 53 Updated Apr 25, 2025
C++ 85 16 Updated Mar 17, 2025

深度学习经典、新论文逐段精读

30,159 2,642 Updated Mar 22, 2025
Next
0