-
Idrak Ai
- Islamabad Pakistan
More
8000
details>Lists (1)
Sort Name ascending (A-Z)
Stars
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
A repo that implements various tricks to improve the inference time of VITPOSE.
This is the ✨ source code for my personal website, built with Next.js, Tailwind CSS, Contentlayer, and 🚀 deployed on Vercel 🔼. You can use this repository as a template to build your own personal w…
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
☎️ An automated answering machine build on top of Amazon Connect
A multi-voice TTS system trained with an emphasis on quality