Stars
A programming framework for agentic AI 🤖; PyPI: autogen-agentchat; Discord: https://aka.ms/autogen-discord; Office Hour: https://aka.ms/autogen-officehour
Get your documents ready for gen AI
A rugged, minimal framework for composing JavaScript behavior in your markup.
Curated List of React Components & Libraries.
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Realtime Voice Changer
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
A connector for Claude Desktop to read and search an Obsidian vault.
MCP server for Todoist integration enabling natural language task management with Claude
Memory for AI Agents; Announcing OpenMemory MCP - local and secure memory management.
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
A TTS model capable of generating ultra-realistic dialogue in one pass.
Dockerized FastAPI wrapper for the Kokoro-82M text-to-speech model with CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
Run Orpheus 3B Locally With LM Studio
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…
Talk to any LLM with hands-free voice interaction, voice interruption, and a Live2D talking face, running locally across platforms
SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.
Silero VAD: pre-trained enterprise-grade Voice Activity Detector