8000 vladbataev (Vlad Bataev) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View vladbataev's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report vladbataev

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

ACE-Step: A Step Towards Music Generation Foundation Model

Python 2,619 263 Updated Jun 27, 2025

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 5,169 583 Updated Jun 4, 2025

A song aesthetic evaluation toolkit trained on SongEval.

Python 197 15 Updated Jun 15, 2025

Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组

Python 13,712 1,381 Updated May 18, 2025

Perforator is a cluster-wide continuous profiling tool designed for large data centers

C++ 3,223 145 Updated Jul 3, 2025

ARCH: Audio Representations benCHmark

Python 46 5 Updated Aug 26, 2024

YaFSDP: Yet another Fully Sharded Data Parallel

Python 972 48 Updated Jun 17, 2025

Awesome speech/audio LLMs, representation learning, and codec models

1,058 63 Updated Jun 27, 2025

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Python 801 42 Updated Jun 9, 2025

Reference-aware automatic speech evaluation toolkit

Python 156 13 Updated Dec 5, 2024

A developer's guide to management: an open-sourced handbook for leading software engineering teams.

1,561 95 Updated Jan 24, 2020

A Neural Framework for MT Evaluation

Python 623 93 Updated Jun 17, 2025

The strictest and most opinionated python linter ever!

Python 2,737 404 Updated Jul 3, 2025

dataset of podcasts and episodes

Python 14 3 Updated Jan 16, 2018

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Python 2,728 364 Updated May 27, 2025

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,583 1,143 Updated Nov 14, 2024

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Python 1,475 158 Updated May 10, 2025

Machine Learning Engineering Open Book

Python 14,182 857 Updated Jul 2, 2025

Text-to-Audio/Music Generation

Python 2,457 200 Updated Sep 29, 2024

Voice Conversion With Just Nearest Neighbors

Python 492 68 Updated Mar 18, 2024

FTP client package for Go

Go 1,360 374 Updated Jun 13, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 51,386 8,478 Updated Jul 3, 2025

Noise supression using deep filtering

Python 3,159 297 Updated Oct 17, 2024

A Very Low-Bitrate Codec for Speech Compression

C++ 3,876 360 Updated Aug 20, 2024

Stable Diffusion inference benchmarks

Python 10 Updated Jun 14, 2024

A timeline of the latest AI models for audio generation, starting in 2023!

1,906 70 Updated Jan 4, 2024

Download your Spotify playlists and songs along with album art and metadata (from YouTube if a match is found).

Python 20,497 1,821 Updated May 19, 2025

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,561 276 Updated Jan 12, 2025

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 38,109 4,533 Updated Aug 19, 2024

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,688 241 Updated Jun 25, 2025
Next
0