Stars
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
llama3 implementation one matrix multiplication at a time
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
A framework for few-shot evaluation of language models.
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Awesome-LLM: a curated list of Large Language Model
Set of tools to assess and improve LLM security.
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
Open-Sora: Democratizing Efficient Video Production for All
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
4D38 Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard
Official inference library for Mistral models
Unified framework for building enterprise RAG pipelines with small, specialized models