A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 2,988 175 Updated May 28, 2025

HandsOnLLM / Hands-On-Large-Language-Models

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

Jupyter Notebook 12,138 2,831 Updated Jul 10, 2025

ZhangXInFD / soundstorm-speechtokenizer

Implementation of SoundStorm built upon SpeechTokenizer.

Python 112 14 Updated Nov 2, 2023

ZhangXInFD / SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Python 580 53 Updated Jun 9, 2024

Qingzheng-Wang / Dual-Window-SE

An implement of STFT-Domain Neural Speech Enhancement with Very Low Algorithmic Latency of Zhong-Qiu Wang et al.

Python 13 1 Updated Nov 21, 2023

allenzren / open-pi-zero

Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence

Python 1,016 66 Updated Jan 31, 2025

xiph / rnnoise

Recurrent neural network for audio noise reduction

C 4,836 962 Updated Feb 22, 2025

jorgehatccrma / pyagc

Python implementation for audio time-frequency automatic gain control

Python 81 23 Updated Feb 24, 2013

andyye1999 / WebRtc_NS_AGC

webrtc ns agc windows 仿真

C 4 2 Updated Jul 6, 2022

liuslevis / AdaptiveVolumeControl

Automatically control volume of songs in playlist to make a better experience.

Python 14 7 Updated Aug 31, 2017

jgaeddert / liquid-dsp

digital signal processing library for software-defined radios

C 2,027 471 Updated Jul 9, 2025

shichaog / WebRTC-audio-processing

webrtc audio processing

C++ 397 140 Updated May 10, 2020

MaxMax2016 / Preprocessing

完全独立编译 AEC, AGC, NS, VAD in WebRTC

C 13 60 Updated Jul 8, 2019

lbcgi / webrtc_agc_matlab

把webrtc的agc转成matlab代码以供科研工作者研究

MATLAB 36 9 Updated Dec 10, 2022

IDSIA / kohonen-vae

Official repository for the paper "Topological Neural Discrete Representation Learning à la Kohonen" (ICML 2023 Workshop on Sampling and Optimization in Discrete Space)

Python 10 1 Updated Jun 11, 2025

Standard-Intelligence / hertz-dev

first base model for full-duplex conversational audio

Python 1,746 111 Updated Jan 5, 2025

QiquanZhang yunzqq

Lists (3)

🔮 Future ideas

✨ Inspiration

🚀 My stack

Stars