-
transformer-explainer Public
Forked from poloclub/transformer-explainerTransformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
JavaScript MIT License UpdatedApr 30, 2025 -
-
tiny-universe Public
Forked from datawhalechina/tiny-universe《大模型白盒子构建指南》:一个全手搓的Tiny-Universe
Python UpdatedDec 28, 2024 -
awesome-LLM-resourses Public
Forked from WangRongsheng/awesome-LLM-resources🧑🚀 全世界最好的LLM资料总结 | Summary of the world's best LLM resources.
Apache License 2.0 UpdatedSep 21, 2024 -
freeCodeCamp Public
Forked from freeCodeCamp/freeCodeCampfreeCodeCamp.org's open-source codebase and curriculum. Learn to code for free.
TypeScript BSD 3-Clause "New" or "Revised" License UpdatedSep 6, 2024 -
SwiftTransformer Public
Forked from LLMServe/SwiftTransformerHigh performance Transformer implementation in C++.
C++ UpdatedApr 22, 2024 -
awesome-compression Public
Forked from datawhalechina/awesome-compression模型压缩的小白入门教程
UpdatedApr 5, 2024 -
llm-foundry Public
Forked from mosaicml/llm-foundryLLM training code for MosaicML foundation models
Python Apache License 2.0 UpdatedDec 8, 2023 -
gpt-fast Public
Forked from pytorch-labs/gpt-fastSimple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Python BSD 3-Clause "New" or "Revised" License UpdatedDec 8, 2023 -
H2O Public
Forked from FMInference/H2O[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
Python UpdatedDec 2, 2023 -
AutoGPTQ Public
Forked from AutoGPTQ/AutoGPTQAn easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
Python MIT License UpdatedNov 24, 2023 -
oneflow Public
Forked from Oneflow-Inc/oneflowOneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
C++ Apache License 2.0 UpdatedNov 3, 2023 -
DB-GPT Public
Forked from eosphoros-ai/DB-GPTRevolutionizing Database Interactions with Private LLM Technology
Python MIT License UpdatedNov 2, 2023 -
xtuner Public
Forked from InternLM/xtunerA toolkit for efficiently fine-tuning LLM (InternLM, Llama, Baichuan, QWen, ChatGLM2)
Python Apache License 2.0 UpdatedNov 1, 2023 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedOct 27, 2023 -
text-generation-inference Public
Forked from huggingface/text-generation-inferenceLarge Language Model Text Generation Inference
Python Other UpdatedOct 27, 2023 -
spyder Public
Forked from spyder-ide/spyderOfficial repository for Spyder - The Scientific Python Development Environment
Python MIT License UpdatedOct 19, 2023 -
xformers Public
Forked from facebookresearch/xformersHackable and optimized Transformers building blocks, supporting a composable construction.
Python Other UpdatedOct 15, 2023 -
streaming-llm Public
Forked from mit-han-lab/streaming-llmEfficient Streaming Language Models with Attention Sinks
Python MIT License UpdatedOct 5, 2023 -
exllamav2 Public
Forked from turboderp-org/exllamav2A fast inference library for running LLMs locally on modern consumer-class GPUs
Python MIT License UpdatedSep 14, 2023 -
annotated_deep_learning_paper_implementations Public
Forked from labmlai/annotated_deep_learning_paper_implementations🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gan…
Jupyter Notebook MIT License UpdatedSep 9, 2023 -
FasterTransformer Public
Forked from NVIDIA/FasterTransformerTransformer related optimization, including BERT, GPT
C++ Apache License 2.0 UpdatedSep 8, 2023 -
DeepLearningExamples Public
Forked from NVIDIA/DeepLearningExamplesState-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Jupyter Notebook UpdatedSep 5, 2023 -
trt-samples-for-hackathon-cn Public
Forked from NVIDIA/trt-samples-for-hackathon-cnSimple samples for TensorRT programming
Python Apache License 2.0 UpdatedSep 3, 2023 -
flash-attention Public
Forked from Dao-AILab/flash-attentionFast and memory-efficient exact attention
Python BSD 3-Clause "New" or "Revised" License UpdatedSep 1, 2023 -
LLMTrainer Public
Forked from ifromeast/LLMTrainerA comparison of pretraining framework for LLM
Python UpdatedAug 14, 2023 -
ByteTransformer Public
Forked from bytedance/ByteTransformeroptimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052
C++ Apache License 2.0 UpdatedJul 24, 2023 -
lightseq Public
Forked from bytedance/lightseqLightSeq: A High Performance Library for Sequence Processing and Generation
C++ Other UpdatedMay 16, 2023 -
Automatic-Speech-Recognition Public
Forked from 30stomercury/Automatic-Speech-RecognitionEnd-to-End Speech Recognition Using Tensorflow
Python UpdatedMar 24, 2023 -
lockable-resources-plugin Public
Forked from jenkinsci/lockable-resources-pluginLock resources against concurrent use
Java MIT License UpdatedFeb 25, 2022