-
-
flash-attention Public
Forked from Dao-AILab/flash-attentionFast and memory-efficient exact attention
Python BSD 3-Clause "New" or "Revised" License UpdatedMay 22, 2025 -
LeetCUDA Public
Forked from xlite-dev/LeetCUDA📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA etc.🔥
Cuda GNU General Public License v3.0 UpdatedMay 17, 2025 -
verl Public
Forked from volcengine/verlverl: Volcano Engine Reinforcement Learning for LLMs
Python Apache License 2.0 UpdatedMay 12, 2025 -
Awesome-ML-SYS-Tutorial Public
Forked from zhaochenyang20/Awesome-ML-SYS-TutorialMy learning notes/codes for ML SYS.
Python Apache License 2.0 UpdatedMay 10, 2025 -
Mooncake Public
Forked from kvcache-ai/MooncakeMooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
C++ Apache License 2.0 UpdatedApr 27, 2025 -
sgl-learning-materials Public
Forked from sgl-project/sgl-learning-materialsMaterials for learning SGLang
MIT License UpdatedApr 25, 2025 -
HybridSearch2026 Public
A Deep Dive into Advanced Hybrid Search Architectures
C++ Apache License 2.0 UpdatedApr 25, 2025 -
open-infra-index Public
Forked from deepseek-ai/open-infra-indexProduction-tested AI infrastructure tools for efficient AGI development and community-driven innovation
Creative Commons Zero v1.0 Universal UpdatedApr 16, 2025 -
MoneyPrinterTurbo Public
Forked from harry0703/MoneyPrinterTurbo利用大模型,一键生成短视频
Python MIT License UpdatedApr 11, 2025 -
AI-Guide-and-Demos-zh_CN Public
Forked from Hoper-J/AI-Guide-and-Demos-zh_CN这是一份入门AI/LLM大模型的逐步指南,包含教程和演示代码,带你从API走进本地大模型部署和微调,代码文件会提供Kaggle或Colab在线版本,即便没有显卡也可以进行学习。项目中还开设了一个小型的代码游乐场🎡,你可以尝试在里面实验一些有意思的AI脚本。同时,包含李宏毅 (HUNG-YI LEE)2024生成式人工智能导论课程的完整中文镜像作业。
Python MIT License UpdatedMar 26, 2025 -
3FS Public
Forked from deepseek-ai/3FSA high-performance distributed file system designed to address the challenges of AI training and inference workloads.
C++ MIT License UpdatedMar 18, 2025 -
sglang Public
Forked from sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 UpdatedMar 9, 2025 -
how-to-optim-algorithm-in-cuda Public
Forked from BBuf/how-to-optim-algorithm-in-cudahow to optimize some algorithm in cuda.
Cuda UpdatedMar 8, 2025 -
ZhiLight Public
Forked from zhihu/ZhiLightA highly optimized LLM inference acceleration engine for Llama and its variants.
C++ Apache License 2.0 UpdatedJan 24, 2025 -
llm-action Public
Forked from liguodongiot/llm-action本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
HTML Apache License 2.0 UpdatedJan 4, 2025 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedJan 3, 2025 -
seastar Public
Forked from scylladb/seastarHigh performance server-side application framework
C++ Apache License 2.0 UpdatedJan 1, 2025 -
infinity Public
Forked from infiniflow/infinityThe AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text
C++ Apache License 2.0 UpdatedDec 28, 2024 -
FlagEmbedding Public
Forked from FlagOpen/FlagEmbeddingRetrieval and Retrieval-augmented LLMs
Python MIT License UpdatedDec 6, 2024 -
dive-into-llms Public
Forked from Lordog/dive-into-llms《动手学大模型Dive into LLMs》系列编程实践教程
UpdatedSep 20, 2024 -
-
ACORN Public
Forked from guestrin-lab/ACORNstate-of-the-art search over vector embeddings and structured data (SIGMOD '24)
C++ MIT License UpdatedJun 11, 2024 -
ragflow Public
Forked from infiniflow/ragflowRAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Python Apache License 2.0 UpdatedApr 12, 2024 -
databend Public
Forked from databendlabs/databendA modern cloud data warehouse focusing on reducing cost and complexity for your massive-scale analytics needs. Open source alternative to Snowflake. Also available in the cloud: https://app.databen…
Rust Other UpdatedMar 30, 2024 -
how-to-learn-deep-learning-framework Public
Forked from BBuf/how-to-learn-deep-learning-frameworkhow to learn PyTorch and OneFlow
Apache License 2.0 UpdatedMar 22, 2024 -
hnswlib Public
Forked from nmslib/hnswlibHeader-only C++/python library for fast approximate nearest neighbors
C++ Apache License 2.0 UpdatedMar 4, 2024 -
databend-docs Public
Forked from databendlabs/databend-docsOfficial repository for Databend documentation
SCSS Apache License 2.0 UpdatedFeb 20, 2024 -
mini-lsm Public
Forked from skyzh/mini-lsmA tutorial of building an LSM-Tree storage engine in a week!
Rust Apache License 2.0 UpdatedFeb 11, 2024 -