-
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedApr 7, 2025 -
-
KernelBench Public
Forked from ScalingIntelligence/KernelBenchKernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems
Python Other UpdatedFeb 21, 2025 -
sglang Public
Forked from sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 UpdatedFeb 18, 2025 -
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLMTensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
C++ Apache License 2.0 UpdatedJan 16, 2025 -
text-generation-webui Public
Forked from oobabooga/text-generation-webuiA Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, llama.cpp (GGUF), Llama models.
Python GNU Affero General Public License v3.0 UpdatedOct 11, 2023 -
hipBLASLt Public
Forked from ROCm/hipBLASLthipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library
Assembly MIT License UpdatedOct 6, 2023 -
stable-diffusion-webui Public
Forked from AUTOMATIC1111/stable-diffusion-webuiStable Diffusion web UI
Python GNU Affero General Public License v3.0 UpdatedSep 21, 2023 -
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
-
triton Public
Forked from ROCm/tritonDevelopment repository for the Triton language and compiler
C++ MIT License UpdatedAug 22, 2023 -
-
FAMBench Public
Forked from facebookresearch/FAMBenchBenchmarks to capture important workloads.
Python Apache License 2.0 UpdatedApr 24, 2023 -
audio Public
Forked from ROCm/audioData manipulation and transformation for audio signal processing, powered by PyTorch
Python BSD 2-Clause "Simplified" License UpdatedApr 19, 2023 -
-
-
dlrm Public
Forked from facebookresearch/dlrmAn implementation of a deep learning recommendation model (DLRM)
Python MIT License UpdatedOct 17, 2022 -
FBGEMM Public
Forked from pytorch/FBGEMMFB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
C++ Other UpdatedAug 17, 2022 -
builder Public
Forked from pytorch/builderContinuous builder and binary build scripts for pytorch
Shell BSD 2-Clause "Simplified" License UpdatedNov 10, 2021 -
vision Public
Forked from pytorch/visionDatasets, Transforms and Models specific to Computer Vision
Python BSD 3-Clause "New" or "Revised" License UpdatedNov 9, 2021