-
MUFFIN Public
Forked from dianwen-ng/MUFFINMulti-band Frequency Reconstruction for Neural Psychoacoustic Coding
Python MIT License UpdatedMay 5, 2025 -
R3GAN Public
Forked from brownvc/R3GANCode for NeurIPS 2024 paper - The GAN is dead; long live the GAN! A Modern Baseline GAN - by Huang et al.
-
ml-engineering Public
Forked from stas00/ml-engineeringMachine Learning Engineering Open Book
-
sgmse Public
Forked from sp-uhh/sgmseScore-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
Python MIT License UpdatedOct 18, 2024 -
-
audiocraft Public
Forked from facebookresearch/audiocraftAudiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Python MIT License UpdatedJul 18, 2024 -
pyroomacoustics Public
Forked from LCAV/pyroomacousticsPyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
Python MIT License UpdatedJul 8, 2024 -
LookOnceToHear Public
Forked from vb000/LookOnceToHearA novel human-interaction method for real-time speech extraction on headphones.
Python Other UpdatedMay 30, 2024 -
pykan Public
Forked from KindXiaoming/pykanKolmogorov Arnold Networks
Jupyter Notebook MIT License UpdatedMay 1, 2024 -
-
-
torch-audiomentations Public
Forked from iver56/torch-audiomentationsFast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
Python MIT License UpdatedApr 4, 2024 -
Academic-project-page-template Public template
Forked from eliahuhorwitz/Academic-project-page-templateA project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
JavaScript UpdatedApr 2, 2024 -
stable-audio-tools Public
Forked from Stability-AI/stable-audio-toolsGenerative models for conditional audio generation
Python MIT License UpdatedJan 30, 2024 -
ddsp-singing-vocoders Public
Forked from YatingMusic/ddsp-singing-vocodersOfficial implementation of SawSing (ISMIR'22)
Python GNU Affero General Public License v3.0 UpdatedAug 11, 2022 -
Transformers-Tutorials Public
Forked from NielsRogge/Transformers-TutorialsThis repository contains demos I made with the Transformers library by HuggingFace.
-
cargan Public
Forked from descriptinc/carganOfficial repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
Python MIT License UpdatedOct 20, 2021 -
speechbrain Public
Forked from speechbrain/speechbrainA PyTorch-based Speech Toolkit
Python Apache License 2.0 UpdatedApr 19, 2021 -
ParallelWaveGAN Public
Forked from kan-bayashi/ParallelWaveGANUnofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
Jupyter Notebook MIT License UpdatedJun 2, 2020 -
espnet Public
Forked from espnet/espnetEnd-to-End Speech Processing Toolkit
Python Apache License 2.0 UpdatedMay 31, 2020 -
FB-MelGAN Public
Forked from yanggeng1995/FB-MelGANA pytroch implementation of the FB-MelGAN
Python UpdatedMay 26, 2020 -
DeepSpeechDistances Public
Forked from mbinkowski/DeepSpeechDistancesOfficial implementation of DeepSpeech Distances.
Jupyter Notebook Apache License 2.0 UpdatedFeb 13, 2020 -
build-your-own-x Public
Forked from codecrafters-io/build-your-own-x🤓 Build your own (insert technology here)
UpdatedFeb 11, 2020 -
DeepSpeed Public
Forked from deepspeedai/DeepSpeedDeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
Python MIT License UpdatedFeb 11, 2020 -
-
melgan Public
Forked from seungwonpark/melganMelGAN vocoder (compatible with NVIDIA/tacotron2)
Python BSD 3-Clause "New" or "Revised" License UpdatedNov 4, 2019 -
melgan-neurips Public
Forked from descriptinc/melgan-neuripsGAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
Python MIT License UpdatedOct 26, 2019 -
MaximumMarginGANs Public
Forked from AlexiaJM/MaximumMarginGANsCode for paper: "Support Vector Machines, Wasserstein's distance and gradient-penalty GANs maximize a margin"
Python MIT License UpdatedOct 17, 2019 -
Python-Wrapper-for-World-Vocoder Public
Forked from JeremyCCHsu/Python-Wrapper-for-World-VocoderA Python wrapper for the high-quality vocoder "World"
Python MIT License UpdatedSep 30, 2019 -
interspeech2019-tutorial Public
Forked from espnet/interspeech2019-tutorialINTERSPEECH 2019 Tutorial Materials
Jupyter Notebook UpdatedSep 23, 2019