-
torchtitan Public
Forked from pytorch/torchtitan
A PyTorch native library for large model training
-
oat Public
Forked from sail-sg/oat
🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.
Python Apache License 2.0 Updated Feb 9, 2025
-
Megatron-LM Public
Forked from NVIDIA/Megatron-LM
Fork from Megatron-LM commit 2196398f5252ead6f036b06d45f7acb89b1308da
-
LLM-Drop Public
Forked from CASE-Lab-UMD/LLM-Drop
The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".
-
FastChat Public
Forked from lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Python Apache License 2.0 Updated Jul 11, 2024
-
wanda Public
Forked from locuslab/wanda
A simple and effective LLM pruning approach.
Python MIT License Updated Jul 9, 2024
-
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
-
SVD-LLM Public
Forked from AIoT-MLSys-Lab/SVD-LLM
Official Code for "SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression"
-
mergekit Public
Forked from arcee-ai/mergekit
Tools for merging pretrained large language models.
Python GNU Lesser General Public License v3.0 Updated Mar 27, 2024
-
bigcode-evaluation-harness Public
Forked from bigcode-project/bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
Python Apache License 2.0 Updated Feb 8, 2024
-
representation-engineering Public
Forked from andyzoujm/representation-engineering
Representation Engineering: A Top-Down Approach to AI Transparency
Jupyter Notebook MIT License Updated Nov 10, 2023
-
qlora Public
Forked from artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
Jupyter Notebook MIT License Updated Oct 3, 2023
-
LLM-Pruner Public
Forked from horseee/LLM-Pruner
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support LLaMA, Llama-2, BLOOM, Vicuna, Baichuan, etc.
Python Apache License 2.0 Updated Sep 24, 2023
-
TinyLlama Public
Forked from jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Python Apache License 2.0 Updated Sep 11, 2023
-
falcontune Public
Forked from rmihaylov/falcontune
Tune any FALCON in 4-bit
Python Apache License 2.0 Updated Jun 17, 2023
-
FasterTransformer Public
Forked from NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
C++ Apache License 2.0 Updated Jun 16, 2023
-
easy-llm-finetuner Public
Forked from Antlera/easy-llm-finetuner
Easy Environment Configuration for LLM Model Finetuning
Shell MIT License Updated Jun 10, 2023
-
transformers Public
Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 Updated Jun 9, 2023
-
alpaca-lora Public
Forked from tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
Jupyter Notebook Apache License 2.0 Updated Apr 13, 2023
-
s1ghhh.github.io Public
Forked from academicpages/academicpages.github.io
Github Pages
JavaScript MIT License Updated Mar 14, 2023
-
demix Public
Forked from kernelmachine/demix
DEMix Layers for Modular Language Modeling
Python Other Updated Aug 23, 2021