- Peking University
- https://light-of-hers.github.io
- https://www.zhihu.com/people/yi-guang-99-48
Starred repositories
This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding code links.
Universal battlefield-adaptive Operator Evaluation Protocol for Arknights
[DAC'25] Official implementation of "HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference"
Processed / Cleaned Data for Paper Copilot
PDF2zh for Zotero | a Zotero plugin for Chinese translation of PDFs
PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents with the original layout fully preserved; supports Google/DeepL/Ollama/OpenAI and other services; provides CLI/GUI/MCP/Docker/Zotero interfaces
A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.
[ICML 2025 Spotlight] ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference
H2-LLM: Hardware-Dataflow Co-Exploration for Heterogeneous Hybrid-Bonding-based Low-Batch LLM Inference
Open deep learning compiler stack for Kendryte AI accelerators ✨
MAGI-1: Autoregressive Video Generation at Scale
A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
This is an experimental library that has evolved into P2688
match(it): A lightweight single-header pattern-matching library for C++17 with macro-free APIs.
Python interface for MLIR - the Multi-Level Intermediate Representation
Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.
Distributed Compiler Based on Triton for Parallel Systems
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.
SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs
Simple, Elegant, Typed Argument Parsing with argparse
XAttention: Block Sparse Attention with Antidiagonal Scoring
ademeure / DeeperGEMM
Forked from deepseek-ai/DeepGEMM. DeeperGEMM: crazy optimized version
High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.
Democratizing Reinforcement Learning for LLMs