Tencent
- spring2025-lectures Public
  Forked from stanford-cs336/spring2025-lectures
  Python, Updated May 15, 2025
- LeetCUDA Public
  Forked from xlite-dev/LeetCUDA
  📚 LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners 🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA etc. 🔥
  Cuda, GNU General Public License v3.0, Updated May 8, 2025
- sglang Public
  Forked from sgl-project/sglang
  SGLang is a fast serving framework for large language models and vision language models.
- SpaceLM Public
  Forked from sengine-research/SpaceLM
  An LLM model for space understanding
  Python, MIT License, Updated Apr 11, 2025
- SpatialLM Public
  Forked from manycore-research/SpatialLM
  SpatialLM: Large Language Model for Spatial Understanding
  Python, Other, Updated Mar 28, 2025
- shallowsim Public
  Forked from zartbot/shallowsim
  DeepSeek-V3/R1 inference performance simulator
  Jupyter Notebook, Updated Mar 20, 2025
- DeepEP Public
  Forked from deepseek-ai/DeepEP
  DeepEP: an efficient expert-parallel communication library
  Cuda, MIT License, Updated Mar 17, 2025
- DeepSeek_Simulator Public
  Forked from shenh10/DeepSeek_Simulator
  Python, MIT License, Updated Mar 17, 2025
- picotron Public
  Forked from huggingface/picotron
  Minimalistic 4D-parallelism distributed training framework for education purpose
  Python, Apache License 2.0, Updated Dec 19, 2024
- flux Public
  Forked from bytedance/flux
  A fast communication-overlapping library for tensor parallelism on GPUs.
  C++, Apache License 2.0, Updated Oct 30, 2024
- vllm Public
  Forked from vllm-project/vllm
  A high-throughput and memory-efficient inference and serving engine for LLMs
  Python, Apache License 2.0, Updated Oct 24, 2024
- llama.cpp Public
  Forked from ggml-org/llama.cpp
  LLM inference in C/C++
  C++, MIT License, Updated Oct 9, 2024
- whisper.cpp Public
  Forked from ggml-org/whisper.cpp
  Port of OpenAI's Whisper model in C/C++
  C, MIT License, Updated Oct 8, 2024
- Open-MAGVIT2 Public
  Forked from TencentARC/SEED-Voken
  Open-MAGVIT2: Democratizing Autoregressive Visual Generation
  Python, Apache License 2.0, Updated Sep 27, 2024
- xDiT Public
  Forked from xdit-project/xDiT
  xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) on multi-GPU Clusters
  Python, Apache License 2.0, Updated Sep 4, 2024
- marlin Public
  Forked from IST-DASLab/marlin
  FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.
  Python, Apache License 2.0, Updated Aug 15, 2024
- exo Public
  Forked from exo-explore/exo
  Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
  Python, GNU General Public License v3.0, Updated Aug 2, 2024
- chameleon Public
  Forked from facebookresearch/chameleon
  Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
  Python, Other, Updated Jul 29, 2024
- AISystem Public
  Forked from chenzomi12/AISystem
  AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks
  Jupyter Notebook, Apache License 2.0, Updated Jul 28, 2024
- triton Public
  Forked from triton-lang/triton
  Development repository for the Triton language and compiler
  C++, MIT License, Updated Jul 5, 2024
- mmrotate Public
  Forked from open-mmlab/mmrotate
  OpenMMLab Rotated Object Detection Toolbox and Benchmark
  Python, Apache License 2.0, Updated Apr 30, 2024
- monai Public
  Forked from Project-MONAI/research-contributions
  Implementations of recent research prototypes/demonstrations using MONAI.
  Python, Apache License 2.0, Updated Mar 7, 2024
- magvit2-pytorch Public
  Forked from lucidrains/magvit2-pytorch
  Implementation of MagViT2 Tokenizer in Pytorch
  Python, MIT License, Updated Jan 18, 2024
- taming-transformers Public
  Forked from CompVis/taming-transformers
  Taming Transformers for High-Resolution Image Synthesis
  Jupyter Notebook, MIT License, Updated Nov 10, 2023
- LoRA Public
  Forked from microsoft/LoRA
  Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
  Jupyter Notebook, MIT License, Updated Oct 9, 2023
- rerope Public
  Forked from bojone/rerope
  Rectified Rotary Position Embeddings
  Python, Updated Aug 10, 2023