-
sglang Public
Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
-
flashinfer Public
Forked from flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
Cuda, Apache License 2.0, Updated Feb 28, 2025
-
BitNet Public
Forked from microsoft/BitNet
Official inference framework for 1-bit LLMs
C++, MIT License, Updated Oct 22, 2024
-
gpt-fast Public
Forked from pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Python, BSD 3-Clause "New" or "Revised" License, Updated Sep 27, 2024
-
vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
-
llama.cpp Public
Forked from ggml-org/llama.cpp
Port of Facebook's LLaMA model in C/C++
C++, MIT License, Updated Aug 12, 2024
-
llm.c Public
Forked from karpathy/llm.c
LLM training in simple, raw C/CUDA
Cuda, MIT License, Updated Jun 3, 2024
-
whisper.cpp Public
Forked from ggml-org/whisper.cpp
Port of OpenAI's Whisper model in C/C++
-
cpuinfo Public
Forked from pytorch/cpuinfo
CPU INFOrmation library (x86/x86-64/ARM/ARM64, Linux/Windows/Android/macOS/iOS)
C, BSD 2-Clause "Simplified" License, Updated Apr 17, 2024
-
SqueezeLLM Public
Forked from SqueezeAILab/SqueezeLLM
SqueezeLLM: Dense-and-Sparse Quantization
Python, MIT License, Updated Jun 16, 2023
-
extension-cpp Public
Forked from pytorch/extension-cpp
C++ extensions in PyTorch
Python, Updated Jun 12, 2023
-
ChatRWKV Public
Forked from BlinkDL/ChatRWKV
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
Python, Apache License 2.0, Updated Mar 13, 2023
-
pytorch Public
Forked from pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
-
detectron2 Public
Forked from facebookresearch/detectron2
Detectron2 is FAIR's next-generation platform for object detection and segmentation.
-
vision Public
Forked from pytorch/vision
Datasets, Transforms and Models specific to Computer Vision
Python, BSD 3-Clause "New" or "Revised" License, Updated Sep 19, 2022
-
pytorch_geometric Public
Forked from pyg-team/pytorch_geometric
Graph Neural Network Library for PyTorch
Python, MIT License, Updated Jul 24, 2022
-
tutorials Public
Forked from pytorch/tutorials
PyTorch tutorials.
Python, BSD 3-Clause "New" or "Revised" License, Updated Jun 27, 2022
-
espnet Public
Forked from espnet/espnet
End-to-End Speech Processing Toolkit
Python, Apache License 2.0, Updated Sep 9, 2021
-
ideep Public
Forked from intel/ideep
Intel® Optimization for Chainer*, a Chainer module providing numpy like API and DNN acceleration using MKL-DNN.
-
Informer2020 Public
Forked from zhouhaoyi/Informer2020
The GitHub repository for the paper "Informer" accepted by AAAI 2021.
Python, Apache License 2.0, Updated Mar 20, 2021
-
pytorch_block_sparse Public
Forked from huggingface/pytorch_block_sparse
Fast Block Sparse Matrices for Pytorch
-
inference Public
Forked from mlcommons/inference
Reference implementations of inference benchmarks
-
oneDNN Public
Forked from uxlfoundation/oneDNN
oneAPI Deep Neural Network Library (oneDNN)
-
serve Public
Forked from pytorch/serve
Model Serving on PyTorch
-
lite-transformer Public
Forked from mit-han-lab/lite-transformer
[ICLR 2020] Lite Transformer with Long-Short Range Attention
-
fairseq Public
Forked from facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.