Pi-F (Pifometricien) / Starred · GitHub

✨✨VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model

Python 613 50 Updated May 24, 2025

Rust crate for some audio utilities

Rust 26 Updated Mar 8, 2025
Python 41 3 Updated Apr 30, 2025

A Conversational Speech Generation Model

Python 13,679 1,335 Updated May 27, 2025
Python 29 4 Updated Apr 28, 2025

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 3,916 264 Updated Jun 21, 2025

Data manipulation and transformation for audio signal processing, powered by PyTorch

Python 2,685 697 Updated Jul 7, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 17,355 1,434 Updated Jul 6, 2025

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,504 147 Updated Jun 24, 2025
CMake 133 7 Updated May 6, 2025

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

Python 2,448 181 Updated Jun 10, 2025

Dataset of dry/wet pairs for audio effects research

Python 28 1 Updated Apr 17, 2025

Framework for differentiable black-box and gray-box audio effects modeling

Python 69 4 Updated Jun 30, 2025
Metal 15 Updated Nov 19, 2024

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 5,176 584 Updated Jun 4, 2025

A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch

Python 98 4 Updated Jan 22, 2024

Vector (and Scalar) Quantization, in Pytorch

Python 3,384 271 Updated Jun 16, 2025

Inspired by "Neural Networks Fail to Learn Periodic Functions and How to Fix It"

Jupyter Notebook 66 8 Updated May 15, 2024

A large-scale dataset of caption-annotated MIDI files.

Python 69 3 Updated Jul 23, 2024

A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.

Python 375 23 Updated May 30, 2025

Mamba SSM architecture

Python 15,280 1,358 Updated Jun 26, 2025

ModernBERT model optimized for Apple Neural Engine.

Python 27 1 Updated Jan 10, 2025

Text-to-Audio/Music Generation

Python 2,460 199 Updated Sep 29, 2024

Versatile Evaluation of Speech and Audio

Python 291 32 Updated Jul 5, 2025
Python 198 44 Updated May 29, 2024

Polyphonic generalisation of DDSP

Python 19 Updated Apr 30, 2024
Swift 104 7 Updated Jun 26, 2025

A full collection of Music Information Retrieval (MIR) and AI Music labs.

43 1 Updated Dec 27, 2024