Stars
Open Source framework for voice and multimodal conversational AI
An image picker (also supporting video and audio) for Flutter projects, based on WeChat's UI.
Firebase SDK for Apple App Development
WebRTC plugin for Flutter Mobile/Desktop/Web
Simple Objective-C wrapper for the keychain that works on Mac and iOS
Promises is a modern framework that provides a synchronization construct for Swift and Objective-C.
Google-internal core components of Firebase App Check.
Git with a cup of tea! Painless self-hosted all-in-one software development service, including Git hosting, code review, team collaboration, package registry and CI/CD
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
appr.tc has been shut down. Please use the Dockerfile to run your own test/dev instance.
Faster Whisper transcription with CTranslate2
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Implementation and Deployment of Multilingual Custom Keyword Spotting Running in Real-time on an Edge Device.
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…
Offline speech recognition for Android with Vosk library.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
FFmpeg for Android, iOS and tvOS. No longer maintained; superseded by FFmpegKit.