Stars
Supercharge Your LLM with the Fastest KV Cache Layer
[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
Open source repo for Locate 3D Model, 3D-JEPA and Locate 3D Dataset
[ICLR 2025] Palu: Compressing KV-Cache with Low-Rank Projection
Unified KV Cache Compression Methods for Auto-Regressive Models
SGLang is a fast serving framework for large language models and vision language models.
A high-throughput and memory-efficient inference and serving engine for LLMs
Dynamic Memory Management for Serving LLMs without PagedAttention
Awesome LLM compression research papers and tools.
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
📰 Must-read papers and blogs on Speculative Decoding ⚡️
A library to analyze PyTorch traces.
A curated list for Efficient Large Language Models
An implementation of a deep learning recommendation model (DLRM)
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
Writing an OS in 1,000 lines.
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
A powerful framework for building realtime voice AI agents 🤖🎙️📹
Pytorch domain library for recommendation systems
Less than 100 Kilobytes. Works for Android 5.1 and above
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Notes for software engineers getting up to speed on new AI developments. Serves as a datastore for https://latent.space writing and product brainstorming, but has cleaned-up canonical references und…