-
-
Unconditional-Audio-Generation-Benchmark Public
Forked from state-spaces/s4Unconditional audio generation benchmark
-
Cacophony Public
Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986
-
Y-vector Public
Y-vector: Multiscale Waveform Encoder for Speaker Embedding
-
Filler-semi-CRF Public
Codebase for "Transcription free filler word detection with Neural semi-CRFs" [ICASSP2023]
-
openSFX-TFShard Public
A codebase for open source SFX data TFrecord sharding
Python MIT License UpdatedMay 26, 2023 -
Waveform-Synthesizer-with-Diffusion Public
Forked from lmnt-com/diffwavearchived
-
GenerativeSourceSeparation Public
Open source code for the paper 'Music Source Separation with Generative Flow'
-
Unofficial implementation for the paper 'Improving Diffusion Models for Inverse Problems using Manifold Constraints'[https://arxiv.org/abs/2206.00941]
-
PodcastFillers_Utils Public
Utility functions for preprocessing PodcastFillers dataset
-
TDspkr-mismatch-study Public
Code base for "A study of the robustness of raw waveform based speaker embeddings under mismatched conditions"