8000 RamanHacks (Abhigyan Raman) / Starred · GitHub

More Web Proxy on the site http://driver.im/

RamanHacks

Follow

Abhigyan Raman RamanHacks

Follow

31 followers · 17 following

IIT Delhi

Achievements

Achievements

Lists (3)

Sort

🌟 ASR

⭐ MLOps

YT

Stars

FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model

Python 5,626 500 Updated Mar 23, 2025

csteinmetz1 / pyloudnorm

Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm

Python 696 57 Updated Jul 2, 2024

TomFrankly / pipedream-notion-voice-notes

Take notes with your voice and send them to Notion

JavaScript 126 62 Updated May 14, 2025

KellerJordan / modded-nanogpt

NanoGPT (124M) in 3 minutes

Python 2,550 305 Updated Apr 26, 2025

Xbozon / go-whisper-cpp-server-example

Go 20 1 Updated Apr 25, 2024

speaches-ai / speaches

Python 1,820 229 Updated May 13, 2025

aiola-lab / whisper-medusa

Whisper with Medusa heads

Python 833 52 Updated Apr 29, 2025

nyrahealth / CrisperWhisper

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

Python 704 35 Updated Dec 19, 2024

Anjok07 / ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.

Python 20,609 1,515 Updated Mar 13, 2025

huggingface / speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 4,019 440 Updated Apr 15, 2025

agiresearch / AIOS

AIOS: AI Agent Operating System

Python 4,142 507 Updated May 7, 2025

ARBML / klaam

Arabic speech recognition, classification and text-to-speech.

Jupyter Notebook 393 80 Updated Sep 30, 2023

nipponjo / arabic-speech-to-text

Python 5 1 Updated Jan 8, 2024

Rikorose / DeepFilterNet

Noise supression using deep filtering

Python 3,041 281 Updated Oct 17, 2024

sanchit-gandhi / codesnippets

Jupyter Notebook 10 3 Updated Apr 3, 2024

biboamy / TVSM-dataset

Python 84 14 Updated Oct 3, 2024

clovaai / lookwhostalking

Look Who’s Talking: Active Speaker Detection in the Wild

Python 72 3 Updated Aug 24, 2023

lhotse-speech / lhotse

Tools for handling speech data in machine learning projects.

Python 1,023 233 Updated May 14, 2025

sevagh / audio-degradation-toolbox

easy-to-use implementation of the ISMIR 2013 Audio Degradation Toolbox

Python 49 10 Updated Nov 19, 2019

DigitalPhonetics / IMS-Toucan

Controllable and fast Text-to-Speech for over 7000 languages!

Python 1,589 181 Updated Nov 7, 2024

dioco-group / jenny-tts-dataset

A high-quality, varied ~30hr voice dataset suitable for training a TTS model

59 3 Updated Jan 7, 2023

stanfordnlp / string2string

String-to-String Algorithms for Natural Language Processing

Jupyter Notebook 546 30 Updated Jul 26, 2024

slhck / ffmpeg-normalize

Audio Normalization for Python/ffmpeg

HTML 1,361 122 Updated May 8, 2025

ruizhecao96 / CMGAN

Conformer-based Metric GAN for speech enhancement

Python 355 63 Updated May 3, 2024

yuekaizhang / Triton-ASR-Client

ASR client for Triton ASR Service

Python 29 6 Updated Dec 13, 2024

HMUNACHI / nanodl

A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.

Python 287 10 Updated Aug 28, 2024

NavodPeiris / speechlib

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names

Python 214 20 Updated Apr 9, 2025

yandexdataschool / speech_course

YSDA course in Speech Processing.

Jupyter Notebook 243 76 Updated May 11, 2025

davidmartinrius / speech-dataset-generator

🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.

Python 243 23 Updated Jun 10, 2024

myshell-ai / MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Python 6,062 826 Updated Dec 24, 2024

0