-
megalodon Public
Forked from XuezheMax/megalodonReference implementation of Megalodon 7B model
Cuda MIT License UpdatedApr 17, 2024 -
-
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedOct 12, 2023 -
Awesome-LLM-System-Papers Public
Forked from AmadeusChan/Awesome-LLM-System-PapersUpdatedOct 6, 2023 -
streaming-llm Public
Forked from mit-han-lab/streaming-llmEfficient Streaming Language Models with Attention Sinks
Python MIT License UpdatedOct 5, 2023 -
LLMSpeculativeSampling Public
Forked from feifeibear/LLMSpeculativeSamplingFast inference from large lauguage models via speculative decoding
Python UpdatedSep 22, 2023 -
Medusa Public
Forked from FasterDecoding/MedusaMedusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Python Apache License 2.0 UpdatedSep 18, 2023 -
FlexFlow Public
Forked from flexflow/flexflow-trainA distributed deep learning framework.
C++ Apache License 2.0 UpdatedSep 18, 2023 -
nanoGPT Public
Forked from karpathy/nanoGPTThe simplest, fastest repository for training/finetuning medium-sized GPTs.
Python MIT License UpdatedSep 17, 2023 -
RetNet_huggingfaceCompatible Public
Forked from syncdoth/RetNetHuggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent, and chunkwise forward.
Python MIT License UpdatedAug 26, 2023 -
BMTrain Public
Forked from OpenBMB/BMTrainEfficient Training (including pre-training and fine-tuning) for Big Models
Python Apache License 2.0 UpdatedAug 24, 2023 -
RetNet Public
Forked from Jamie-Stirling/RetNetAn implementation of "Retentive Network: A Successor to Transformer for Large Language Models"
Python MIT License UpdatedAug 21, 2023 -
flash-attention Public
Forked from Dao-AILab/flash-attentionFast and memory-efficient exact attention
Python BSD 3-Clause "New" or "Revised" License UpdatedJul 23, 2023 -
alpaca-lora Public
Forked from tloen/alpaca-loraInstruct-tune LLaMA on consumer hardware
Jupyter Notebook Apache License 2.0 UpdatedApr 8, 2023 -
LoRA Public
Forked from microsoft/LoRACode for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Python MIT License UpdatedApr 7, 2023 -
llama Public
Forked from meta-llama/llamaInference code for LLaMA models
Python GNU General Public License v3.0 UpdatedApr 4, 2023 -
BELLE Public
Forked from LianjiaTech/BELLEBELLE: BE Large Language model Engine(开源中文对话大模型7B)
Python Apache License 2.0 UpdatedMar 30, 2023 -
NLP-Tutorials Public
Forked from MorvanZhou/NLP-TutorialsSimple implementations of NLP models. Tutorials are written in Chinese on my website https://mofanpy.com
Python MIT License UpdatedMar 25, 2023 -
stanford_alpaca Public
Forked from tatsu-lab/stanford_alpacaAn Instruction-following LLaMA Model,Code and documentation to train Stanford's Alpaca models, and generate the data.
Python Apache License 2.0 UpdatedMar 22, 2023 -
ColossalAI Public
Forked from hpcaitech/ColossalAIMaking large AI models cheaper, faster and more accessible
Python Apache License 2.0 UpdatedMar 20, 2023 -
awesome-chatgpt-prompts Public
Forked from f/awesome-chatgpt-promptsThis repo includes ChatGPT prompt curation to use ChatGPT better.
HTML Creative Commons Zero v1.0 Universal UpdatedMar 17, 2023 -
huggingface-accelerate Public
Forked from huggingface/accelerate🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
Python Apache License 2.0 UpdatedMar 15, 2023 -
langchain Public
Forked from langchain-ai/langchain⚡ Building applications with LLMs through composability ⚡
Python MIT License UpdatedMar 12, 2023 -
Prompt-Engineering-Guide-Cn Public
Forked from prompting-work/Prompt-Engineering-Guide-Cn关于提示工程的技术文章汇总和翻译
Jupyter Notebook MIT License UpdatedMar 3, 2023 -
Prompt-Engineering-Guide Public
Forked from dair-ai/Prompt-Engineering-Guide🐙 Guides, papers, lecture, and resources for prompt engineering
Jupyter Notebook MIT License UpdatedFeb 21, 2023 -
datasets-CrossWOZ Public
Forked from thu-coai/CrossWOZA Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset
Python Apache License 2.0 UpdatedJan 10, 2023 -
OmniXAI Public
Forked from salesforce/OmniXAIOmniXAI: A Library for eXplainable AI
Jupyter Notebook BSD 3-Clause "New" or "Revised" License UpdatedNov 22, 2022 -
Agoral-API-Examples Public
Forked from AgoraIO/API-ExamplesPlay with AgoraSDK and have fun! Everything you need to start learning Agora.
C++ UpdatedAug 24, 2022 -
NDK_OpenGLES_3_0 Public
Forked from githubhaohao/NDK_OpenGLES_3_0Android OpenGL ES 3.0 从入门到精通系统性学习教程
C++ Apache License 2.0 UpdatedJul 26, 2022 -
ucx Public
Forked from openucx/ucxUnified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
C Other UpdatedJun 20, 2022