Starred repositories
An open protocol enabling communication and interoperability between opaque agentic applications.
[ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
Visualizer for neural network, deep learning and machine learning models
The ultimate LLM/AI application development framework in Golang.
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
📄 A curated list of awesome .cursorrules files
The official Typescript SDK for Model Context Protocol servers and clients
The official Python SDK for Model Context Protocol servers and clients
本项目为xiaozhi-esp32提供后端服务,帮助您快速搭建ESP32设备控制服务器。Backend service for xiaozhi-esp32, helps you quickly build an ESP32 device control server.
MCP server to provide Figma layout information to AI coding agents like Cursor
No fortress, purely open ground. OpenManus is Coming.
Modern. Native. Delightful Web Debugging Proxy for macOS, iOS, and Android ⚡️
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
🦜🔗 Build context-aware reasoning applications
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
The official Python library for the OpenAI API
An extremely fast Python package and project manager, written in Rust.
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
Production-ready platform for agentic workflow development.
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
OpenMMLab Foundational Library for Training Deep Learning Models
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.