8000 RamanHacks (Abhigyan Raman) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View RamanHacks's full-sized avatar

Block or report RamanHacks

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Multilingual Voice Understanding Model

Python 5,626 500 Updated Mar 23, 2025

Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm

Python 696 57 Updated Jul 2, 2024

Take notes with your voice and send them to Notion

JavaScript 126 62 Updated May 14, 2025

NanoGPT (124M) in 3 minutes

Python 2,550 305 Updated Apr 26, 2025
Python 1,820 229 Updated May 13, 2025

Whisper with Medusa heads

Python 833 52 Updated Apr 29, 2025

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

Python 704 35 Updated Dec 19, 2024

GUI for a Vocal Remover that uses Deep Neural Networks.

Python 20,609 1,515 Updated Mar 13, 2025

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 4,019 440 Updated Apr 15, 2025

AIOS: AI Agent Operating System

Python 4,142 507 Updated May 7, 2025

Arabic speech recognition, classification and text-to-speech.

Jupyter Notebook 393 80 Updated Sep 30, 2023

Noise supression using deep filtering

Python 3,041 281 Updated Oct 17, 2024
Jupyter Notebook 10 3 Updated Apr 3, 2024
Python 84 14 Updated Oct 3, 2024

Look Who’s Talking: Active Speaker Detection in the Wild

Python 72 3 Updated Aug 24, 2023

Tools for handling speech data in machine learning projects.

Python 1,023 233 Updated May 14, 2025

easy-to-use implementation of the ISMIR 2013 Audio Degradation Toolbox

Python 49 10 Updated Nov 19, 2019

Controllable and fast Text-to-Speech for over 7000 languages!

Python 1,589 181 Updated Nov 7, 2024

A high-quality, varied ~30hr voice dataset suitable for training a TTS model

59 3 Updated Jan 7, 2023

String-to-String Algorithms for Natural Language Processing

Jupyter Notebook 546 30 Updated Jul 26, 2024

Audio Normalization for Python/ffmpeg

HTML 1,361 122 Updated May 8, 2025

Conformer-based Metric GAN for speech enhancement

Python 355 63 Updated May 3, 2024

ASR client for Triton ASR Service

Python 29 6 Updated Dec 13, 2024

A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.

Python 287 10 Updated Aug 28, 2024

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names

Python 214 20 Updated Apr 9, 2025

YSDA course in Speech Processing.

Jupyter Notebook 243 76 Updated May 11, 2025

🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.

Python 243 23 Updated Jun 10, 2024

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Python 6,062 826 Updated Dec 24, 2024
Next
0