KSLee7 (Kungsing Lee) / Starred · GitHub

📚 A from-scratch tutorial on the principles and practice of large language models

3,375 250 Updated Jun 15, 2025

KV cache store for distributed LLM inference

C++ 262 24 Updated Jun 6, 2025

Convert Evernote .enex files to Markdown

Go 990 82 Updated Jun 14, 2025

Import Evernote ENEX files to Notion

Python 448 38 Updated Jan 17, 2024

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3.

Python 1,313 161 Updated Jun 10, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,446 619 Updated Jun 16, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,776 793 Updated Jun 16, 2025

FlashMLA: Efficient MLA decoding kernels

Cuda 11,603 853 Updated Apr 29, 2025

Making large AI models cheaper, faster and more accessible

Python 40,971 4,522 Updated Jun 16, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 14,394 1,026 Updated Jun 15, 2025

LLM inference in C/C++

C++ 81,833 12,106 Updated Jun 16, 2025

Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

Go 174 17 Updated Jun 13, 2025

Simple, scalable AI model deployment on GPU clusters

Python 2,925 300 Updated Jun 16, 2025

Integrate the DeepSeek API into popular softwares

32,869 3,628 Updated May 13, 2025

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,896 1,750 Updated Feb 26, 2025

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 3,881 572 Updated Apr 24, 2024

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,626 1,424 Updated Jun 12, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o-level performance

Python 8,346 637 Updated May 29, 2025

A flexible framework powered by ComfyUI for generating personalized Nobel Prize images.

Python 1,351 89 Updated Nov 4, 2024

Slim(toolkit): Don't change anything in your container image and minify it by up to 30x (and for compiled languages even more) making it secure too! (free and open source)

Go 21,794 774 Updated Jun 16, 2025

GLM-4 series: Open Multilingual Multimodal Chat LMs

Python 6,625 561 Updated Jun 16, 2025

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 21,633 4,260 Updated Jun 16, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 6,526 557 Updated Jun 16, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++ 10,753 1,500 Updated Jun 16, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 15,185 2,040 Updated Jun 16, 2025

Universal LLM Deployment Engine with ML Compilation

Python 20,803 1,747 Updated Jun 8, 2025

Development repository for the Triton language and compiler

MLIR 15,870 2,045 Updated Jun 16, 2025