rhmaaa

quinlan rhmaaa

N'ayez pas peur

4 followers · 87 following

Achievements

pplx-kernels Public
Forked from ppl-ai/pplx-kernels

Perplexity GPU Kernels

C++ MIT License Updated May 1, 2025
vllm Public
Forked from vllm-project/vllm

To learn vllm

Python Apache License 2.0 Updated Apr 30, 2025
sglang Public
Forked from sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

Python Apache License 2.0 Updated Apr 30, 2025
MoBA Public
Forked from MoonshotAI/MoBA

MoBA: Mixture of Block Attention for Long-Context LLMs

Python MIT License Updated Mar 31, 2025
vattention Public
Forked from microsoft/vattention

Dynamic Memory Management for Serving LLMs without PagedAttention

C MIT License Updated Mar 20, 2025
accel-sim-framework Public
Forked from accel-sim/accel-sim-framework

This is the top-level repository for the Accel-Sim framework.

Python Other Updated Feb 25, 2025
comet-25 Public

C++ 5 1 Updated Feb 11, 2025
TinyZero Public
Forked from Jiayi-Pan/TinyZero

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python Apache License 2.0 Updated Feb 1, 2025
cpp Public

Rust MIT License Updated Jan 8, 2025
yalm Public
Forked from andrewkchan/yalm

Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O

C++ Updated Dec 24, 2024
ThunderKittens Public
Forked from HazyResearch/ThunderKittens

Tile primitives for speedy kernels

Cuda MIT License Updated Dec 13, 2024
tutorials Public
Forked from triton-inference-server/tutorials

This repository contains tutorials and examples for Triton Inference Server

Python BSD 3-Clause "New" or "Revised" License Updated Dec 7, 2024
Awesome-Cute Public
Forked from CalebDu/Awesome-Cute

C++ Updated Nov 23, 2024
lectures Public
Forked from gpu-mode/lectures

Material for cuda-mode lectures

Jupyter Notebook Apache License 2.0 Updated Aug 11, 2024
interview_internal_reference Public
Forked from 0voice/interview_internal_reference

2023年最新总结，阿里，腾讯，百度，美团，头条等技术面试题目，以及答案，专家出题人分析汇总。

Python Updated May 20, 2024
any-precision-llm Public
Forked from SNU-ARC/any-precision-llm

[ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs

Python MIT License Updated May 16, 2024
triton Public
Forked from triton-lang/triton

Development repository for the Triton language and compiler

C++ MIT License Updated May 15, 2024
flash-attention Public
Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Python BSD 3-Clause "New" or "Revised" License Updated May 14, 2024
HumanSystemOptimization Public
Forked from zijie0/HumanSystemOptimization

健康学习到150岁 - 人体系统调优不完全指南

Updated May 9, 2024
llm.c Public
Forked from karpathy/llm.c

LLM training in simple, raw C/CUDA

Cuda MIT License Updated Apr 13, 2024
xv6-riscv Public
Forked from mit-pdos/xv6-riscv

Xv6 for RISC-V

C Other Updated Mar 24, 2024
cppbestpractices Public
Forked from cpp-best-practices/cppbestpractices

Collaborative Collection of C++ Best Practices. This online resource is part of Jason Turner's collection of C++ Best Practices resources. See README.md for more information.

Other Updated Feb 8, 2024
docs Public
Forked from PaddlePaddle/docs

Documentations for PaddlePaddle

Python Apache License 2.0 Updated Dec 1, 2023
Paddle Public
Forked from PaddlePaddle/Paddle

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice （『飞桨』核心框架，深度学习&机器学习高性能单机、分布式训练和跨平台部署）

C++ Updated Nov 29, 2023
jit-exportable-models Public
Forked from PaddleJitLab/jit-exportable-models

Shell Updated Nov 20, 2023
Data-Structures-and-Algorithms-in-cpp Public
Forked from amritansh22/Data-Structures-and-Algorithms-in-cpp

To learn cpp

C++ MIT License Updated Oct 24, 2023
TensorRT Public
Forked from NVIDIA/TensorRT

NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applicat…

C++ Apache License 2.0 Updated Oct 20, 2023
tvm Public
Forked from apache/tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python Apache License 2.0 Updated Oct 10, 2023
FasterTransformer Public
Forked from NVIDIA/FasterTransformer

Transformer related optimization, including BERT, GPT

C++ Apache License 2.0 Updated Oct 2, 2023
SysML Public
Forked from Jack47/hack-SysML

The road to hack SysML and become an system expert

Emacs Lisp Updated Sep 26, 2023

quinlan rhmaaa

Achievements

Achievements

pplx-kernels Public

Uh oh!

vllm Public

Uh oh!

sglang Public

Uh oh!

MoBA Public

Uh oh!

vattention Public

Uh oh!

accel-sim-framework Public

Uh oh!

comet-25 Public

Uh oh!

TinyZero Public

Uh oh!

cpp Public

Uh oh!

yalm Public

Uh oh!

ThunderKittens Public

Uh oh!

tutorials Public

Uh oh!

Awesome-Cute Public

Uh oh!

lectures Public

Uh oh!

interview_internal_reference Public

Uh E9A6 oh!

any-precision-llm Public

Uh oh!

triton Public

Uh oh!

flash-attention Public

Uh oh!

HumanSystemOptimization Public

Uh oh!

llm.c Public

Uh oh!

xv6-riscv Public

Uh oh!

cppbestpractices Public

Uh oh!

docs Public

Uh oh!

Paddle Public

Uh oh!

jit-exportable-models Public

Uh oh!

Data-Structures-and-Algorithms-in-cpp Public

Uh oh!

TensorRT Public

Uh oh!

tvm Public

Uh oh!

FasterTransformer Public

Uh oh!

SysML Public

Uh oh!