Stars
STT
7 repositories
Multilingual Voice Understanding Model
Faster Whisper transcription with CTranslate2
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing, etc.
Robust Speech Recognition via Large-Scale Weak Supervision
Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation