Shanghai University (SHU)
- Baoshan, Shanghai (UTC+08:00)
- https://drewjin.github.io/
- https://scholar.google.com.hk/citations?user=L220uBgAAAAJ&hl=zh-CN
Stars
HPC-SJTU / xfold
Forked from Shenggan/xfold. Democratizing AlphaFold3: a PyTorch reimplementation to accelerate protein structure prediction
Official PyTorch implementation for the paper AutoJudge: Judge Decoding Without Manual Annotation
Democratizing AlphaFold3: a PyTorch reimplementation to accelerate protein structure prediction
verl: Volcano Engine Reinforcement Learning for LLMs
A framework for few-shot evaluation of language models.
Shanghai University Computer Architecture Experiments
Machine Learning Engineering Open Book
A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache
A collection of awesome works centered on reasoning models like O1/R1 in the visual domain
😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond
PoC for "SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning" [arXiv '25]
An extremely fast Python package and project manager, written in Rust.
SGLang is a fast serving framework for large language models and vision language models.
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
A high-performance, fully-featured CSV parser and serializer for modern C++.
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA etc.🔥
Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3.
Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of GPT-Fast, a simple, PyTorch-native generation codebase.
📖A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, MLA, Parallelism, etc. 🎉🎉
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search