8000 KSLee7 (Kungsing Lee) / Starred · GitHub

More Web Proxy on the site http://driver.im/

KSLee7

Follow

Kungsing Lee KSLee7

Follow

1 follower · 0 following

Washington, D.C.

Stars

datawhalechina / happy-llm

📚 从零开始的大语言模型原理与实践教程

3,375 250 Updated Jun 15, 2025

bytedance / InfiniStore

KV cache store for distributed LLM inference

C++ 262 24 Updated Jun 6, 2025

wormi4ok / evernote2md

Convert Evernote .enex files to Markdown

Go 990 82 Updated Jun 14, 2025

vzhd1701 / enex2notion

Import Evernote ENEX files to Notion

Python 448 38 Updated Jan 17, 2024

SafeAILab / EAGLE

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3.

Python 1,313 161 Updated Jun 10, 2025

deepseek-ai / DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,446 619 Updated Jun 16, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 7,776 793 Updated Jun 16, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient MLA decoding kernels

Cuda 11,603 853 Updated Apr 29, 2025

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 40,971 4,522 Updated Jun 16, 2025

kvcache-ai / ktransformers

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 14,394 1,026 Updated Jun 15, 2025

ggml-org / llama.cpp

LLM inference in C/C++

C++ 81,833 12,106 Updated Jun 16, 2025

gpustack / gguf-parser-go

Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

Go 174 17 Updated Jun 13, 2025

gpustack / gpustack

Simple, scalable AI model deployment on GPU clusters

Python 2,925 300 Updated Jun 16, 2025

deepseek-ai / awesome-deepseek-integration

Integrate the DeepSeek API into popular softwares

32,869 3,628 Updated May 13, 2025

deepseek-ai / DeepSeek-R1

90,111 11,637 Updated Apr 9, 2025

deepseek-ai / DeepSeek-VL2

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,896 1,750 Updated Feb 26, 2025

deepseek-ai / DeepSeek-V3

Python 97,637 15,875 Updated Jun 16, 2025

deepseek-ai / DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 3,881 572 Updated Apr 24, 2024

OpenBMB / MiniCPM-o

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,626 1,424 Updated Jun 12, 2025

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 8,346 637 Updated May 29, 2025

16131zzzzzzzz / EveryoneNobel

A flexible framework powered by ComfyUI for generating personalized Nobel Prize images.

Python 1,351 89 Updated Nov 4, 2024

slimtoolkit / slim

Slim(toolkit): Don't change anything in your container image and minify it by up to 30x (and for compiled languages even more) making it secure too! (free and open source)

Go 21,794 774 Updated Jun 16, 2025

THUDM / GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 6,625 561 Updated Jun 16, 2025

Tencent / ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 21,633 4,260 Updated Jun 16, 2025

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 6,526 557 Updated Jun 16, 2025

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++ 10,753 1,500 Updated Jun 16, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 15,185 2,040 Updated Jun 16, 2025

comfyanonymous / ComfyUI_TensorRT

Python 621 52 Updated Oct 10, 2024

mlc-ai / mlc-llm

Universal LLM Deployment Engine with ML Compilation

Python 20,803 1,747 Updated Jun 8, 2025

triton-lang / triton

Development repository for the Triton language and compiler

MLIR 15,870 2,045 Updated Jun 16, 2025

0