Stars
A curated list for Efficient Large Language Models
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
Model merging is a highly efficient approach for long-to-short reasoning.
Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains papers, codes, datasets, evaluations, and analyses.
A series of technical report on Slow Thinking with LLM
verl: Volcano Engine Reinforcement Learning for LLMs
Fast and memory-efficient exact attention
A high-throughput and memory-efficient inference and serving engine for LLMs
Minimal reproduction of DeepSeek R1-Zero
Fully open reproduction of DeepSeek-R1
GPT-4o-level, real-time spoken dialogue system.
Auto Garden is here to help you complete all harvesting and watering actions in your garden.
LLM API 管理 & 分发系统,支持 OpenAI、Azure、Anthropic Claude、Google Gemini、DeepSeek、字节豆包、ChatGLM、文心一言、讯飞星火、通义千问、360 智脑、腾讯混元等主流模型,统一 API 适配,可用于 key 管理与二次分发。单可执行文件,提供 Docker 镜像,一键部署,开箱即用。LLM API management & k…
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
Chat Templates for 🤗 HuggingFace Large Language Models
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
Must-read Papers on Knowledge Editing for Large Language Models.
Demonstrate all the questions on LeetCode in the form of animation.(用动画的形式呈现解LeetCode题目的思路)
「CNote」一份涵盖大部分学习 C 语言所需要掌握的核心知识,致力于打造最易懂的 C语言入门教程,让天下没有难学的 C语言。(包含C语言教程、C语言精华文章)
📰 Must-read papers and blogs on Speculative Decoding ⚡️
Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"
Easily download/autodownload torrent(s) from share.dmhy.org/acg.rip etc. sites for OS X
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters