-
open-r1 Public
Forked from huggingface/open-r1. Fully open reproduction of DeepSeek-R1
Python Apache License 2.0 Updated Feb 9, 2025 -
vllm Public
Forked from vllm-project/vllm. A high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 Updated Feb 8, 2025 -
sglang Public
Forked from sgl-project/sglang. SGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 Updated Feb 8, 2025 -
ollama Public
Forked from ollama/ollama. Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
Go MIT License Updated Feb 8, 2025 -
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLM. TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
C++ Apache License 2.0 Updated Aug 8, 2024 -
mergekit Public
Forked from arcee-ai/mergekit. Tools for merging pretrained large language models.
Python GNU Lesser General Public License v3.0 Updated Jul 10, 2024 -
NeMo Public
Forked from NVIDIA/NeMo. A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Python Apache License 2.0 Updated Jun 4, 2024 -
LLaVA Public
Forked from haotian-liu/LLaVA. [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Python Apache License 2.0 Updated Apr 8, 2024 -
flash-attention Public
Forked from Dao-AILab/flash-attention. Fast and memory-efficient exact attention
Python BSD 3-Clause "New" or "Revised" License Updated Dec 1, 2023 -
text-generation-webui Public
Forked from oobabooga/text-generation-webui. A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, llama.cpp (GGUF), Llama models.
Python GNU Affero General Public License v3.0 Updated Oct 23, 2023 -
AutoGPT Public
Forked from Significant-Gravitas/AutoGPT. An experimental open-source attempt to make GPT-4 fully autonomous.
JavaScript MIT License Updated Oct 20, 2023 -
LLaMA-Factory Public
Forked from hiyouga/LLaMA-Factory. Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM2)
Python Apache License 2.0 Updated Oct 13, 2023 -
FastEdit Public
Forked from hiyouga/FastEdit. 🩹 Editing large language models within 10 seconds ⚡
Python Apache License 2.0 Updated Jul 13, 2023 -
fastllm Public
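Forked from ztxz16/fastllm. A pure C++ cross-platform LLM acceleration library; ChatGLM-6B-class models can reach 10,000+ tokens/s on a single card; supports MOSS, ChatGLM, and Baichuan models; runs smoothly on mobile phones
C++ Updated Jun 16, 2023 -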
FastChat Public
Forked from lm-sys/FastChat. An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.
Python Apache License 2.0 Updated May 31, 2023 -
CPM-Live Public
Forked from OpenBMB/CPM-Live. Live Training for Open-source Big Models
Python Updated May 30, 2023 -
BMCook Public
Forked from OpenBMB/BMCook. Model Compression for Big Models
Python Apache License 2.0 Updated May 28, 2023 -
sharegpt Public
Forked from domeccleston/sharegpt. Easily share permanent links to ChatGPT conversations with your friends
TypeScript MIT License Updated May 26, 2023 -
transformers_tasks Public
Forked from HarderThenHarder/transformers_tasks. ⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.
Jupyter Notebook Updated May 23, 2023 -
Chinese-LLaMA-Alpaca Public
Forked from ymcui/Chinese-LLaMA-Alpaca. Chinese LLaMA & Alpaca large language models with local CPU/GPU deployment (Chinese LLaMA & Alpaca LLMs)
Python Apache License 2.0 Updated May 8, 2023 -
pandallm Public
Forked from dandelionsllm/pandallm. Panda: an overseas Chinese open-source large language model, built by continued pretraining on Chinese-domain data from Llama-7B, -13B, -33B, and -65B.
Python Apache License 2.0 Updated May 5, 2023 -
fairseq Public
Forked from facebookresearch/fairseq. Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
-
BELLE Public
Forked from LianjiaTech/BELLE. BELLE: Be Everyone's Large Language model Engine (an open-source Chinese dialogue large language model)
HTML Apache License 2.0 Updated Apr 17, 2023 -
tacotron2 Public
Forked from NVIDIA/tacotron2. Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Jupyter Notebook BSD 3-Clause "New" or "Revised" License Updated Mar 24, 2023 -
gpt-neox Public
Forked from EleutherAI/gpt-neox. An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Python Apache License 2.0 Updated Feb 15, 2023 -
ColossalAI Public
Forked from hpcaitech/ColossalAI. Making big AI models cheaper, easier, and scalable
Python Apache License 2.0 Updated Feb 15, 2023 -
hf-trim Public
Forked from IamAdiSri/hf-trim. Reduce the size of pretrained Hugging Face models via vocabulary trimming.
Python Mozilla Public License 2.0 Updated Dec 28, 2022 -
pyChatGPT Public
Forked from terry3041/pyChatGPT. An unofficial Python wrapper for OpenAI's ChatGPT API
Python GNU General Public License v3.0 Updated Dec 20, 2022