Stars
mediasoup android demo https://demo.mediasoup.org
mediasoup android client side library https://mediasoup.org
TR-UDP library: Teonet Real time communications over UDP protocol
zero-shot voice conversion & singing voice conversion, with real-time support
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS…
A generative speech model for daily dialogue.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.
✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
Multilingual large voice generation model with full-stack inference, training, and deployment capabilities.
Multilingual Voice Understanding Model
Open-source framework and platform for building real-time, multimodal, low-latency conversational voice AI agents. It features a workflow builder and supports C, C++, Go, Python, JavaScript, and Ty…
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Faster Whisper transcription with CTranslate2
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node