8000 Light-of-Hers (Renze Chen) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Light-of-Hers's full-sized avatar
🐶
wow
🐶
wow

Highlights

  • Pro

Block or report Light-of-Hers

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Sh 10000 owing results

Retrying library for Python

Python 7,522 295 Updated May 1, 2025

This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding code links.

150 3 Updated Feb 18, 2025

Universal battlefield-adaptive Operator Evaluation Protocol for Arknights / 泛用型环境自适应干员强度评价体系 for 明日方舟

Python 22 Updated May 10, 2025

[DAC'25] Official implement of "HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference"

Python 51 2 Updated Jun 11, 2025

Processed / Cleaned Data for Paper Copilot

Python 499 17 Updated Jun 12, 2025

PDF2zh for Zotero | Zotero PDF中文翻译插件

TypeScript 802 39 Updated Jun 12, 2025

PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero

Python 24,647 2,124 Updated Jun 11, 2025

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python 160 12 Updated Jun 12, 2025

[ICML 2025 Spotlight] ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference

Python 198 14 Updated May 1, 2025

H2-LLM: Hardware-Dataflow Co-Exploration for Heterogeneous Hybrid-Bonding-based Low-Batch LLM Inference

Python 8 Updated Apr 26, 2025

Open deep learning compiler stack for Kendryte AI accelerators ✨

C# 791 195 Updated Jun 12, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,267 189 Updated Jun 4, 2025

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 381 21 Updated Jun 12, 2025

This is an experimental library that has evolved to P2688

C++ 674 31 Updated Nov 27, 2024

match(it): A lightweight single-header pattern-matching library for C++17 with macro-free APIs.

C++ 613 20 Updated Nov 22, 2022

Python interface for MLIR - the Multi-Level Intermediate Representation

Python 257 44 Updated Nov 28, 2024

Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.

Python 133 20 Updated Jun 11, 2025
Python 1 Updated May 6, 2025

Distributed Compiler Based on Triton for Parallel Systems

Python 812 52 Updated Jun 5, 2025

Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.

Python 27,697 1,454 Updated Jun 1, 2025

SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs

Cuda 47 7 Updated Mar 25, 2025

Simple, Elegant, Typed Argument Parsing with argparse

Python 469 57 Updated Jun 3, 2025

XAttention: Block Sparse Attention with Antidiagonal Scoring

Python 163 8 Updated Jun 5, 2025

DeeperGEMM: crazy optimized version

Cuda 69 Updated May 5, 2025
C++ 539 86 Updated Jun 11, 2025

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Python 1,143 75 Updated Jun 12, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,358 306 Updated May 13, 2025
C++ 35 7 Updated May 25, 2025

Load compute kernels from the Hub

Python 148 9 Updated Jun 12, 2025
Next
0