8000 lirundong (Rundong Li) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View lirundong's full-sized avatar
🤓
Sit down and think hard
🤓
Sit down and think hard
  • NVIDIA
  • Shanghai, China

Block or report lirundong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,337 51 Updated Apr 18, 2025

The official repository for tariff

Python 3,032 44 Updated Apr 16, 2025

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,774 105 Updated Apr 3, 2025

Distributed Triton for Parallel Systems

Python 724 43 Updated May 12, 2025

Official inference repo for FLUX.1 models

Python 21,699 1,541 Updated Feb 6, 2025

A private messenger for iOS.

Swift 11,305 3,189 Updated May 16, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 8,885 883 Updated May 7, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,770 276 Updated May 15, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,351 597 Updated May 16, 2025

Model Context Protocol Servers

JavaScript 47,015 5,287 Updated May 17, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,663 769 Updated May 12, 2025

FlashMLA: Efficient MLA decoding kernels

Cuda 11,549 834 Updated Apr 29, 2025

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,699 201 Updated May 17, 2025

My readings, my thoughts.

JavaScript 15 1 Updated May 5, 2025

Fully open reproduction of DeepSeek-R1

Python 24,449 2,251 Updated May 17, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 1,173 89 Updated May 17, 2025

Sampling profiler for Python programs

Rust 13,668 455 Updated Apr 10, 2025

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention

Python 2,659 203 Updated May 12, 2025

DSPy: The framework for programming—not prompting—language models

Python 24,297 1,873 Updated May 18, 2025

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 12,990 11,810 Updated May 12, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 41,274 6,834 Updated Dec 9, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,725 375 Updated May 13, 2025

Tile primitives for speedy kernels

Cuda 2,352 142 Updated May 18, 2025

CUDA Kernel Benchmarking Library

Cuda 642 76 Updated May 10, 2025

The simplest implementation of recent Sparse Attention patterns for efficient LLM inference.

Jupyter Notebook 61 4 Updated Jan 25, 2025

CUDA Python: Performance meets Productivity

Python 2,660 162 Updated May 17, 2025

Automatically block unwanted, leeches and abnormal BT peers with support for customized and cloud rules.| BT 反吸血工具 - 自动封禁不受欢迎、吸血和异常的 BT 客户端,并支持自定义规则。支持 qB/qBEE/Deluge/BiglyBT/BitComet

Java 4,293 107 Updated May 17, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 2,974 308 Updated May 17, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,191 50 Updated Nov 16, 2024

Graph-indexed Pandas DataFrames for analyzing hierarchical performance data

JavaScript 32 19 Updated May 16, 2025
Next
10C9
0