liusongxiang.github.io Public
Forked from yuyinzhou/yuyinzhou_old.github.io
Personal homepage:
SCSS MIT License Updated May 9, 2025
-
Large-Audio-Models Public
Keep track of big models in audio domain, including speech, singing, music etc.
-
.tmux Public
Forked from gpakosz/.tmux
🇫🇷 Oh my tmux! My self-contained, pretty & versatile tmux configuration made with ❤️
Shell MIT License Updated Aug 18, 2024
-
bigvsan Public
Forked from sony/bigvsan
PyTorch implementation of BigVSAN
-
AcademiCodec Public
Forked from yangdongchao/AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
-
phonemizer Public
Forked from bootphon/phonemizer
Simple text to phones converter for multiple languages
Python GNU General Public License v3.0 Updated Mar 23, 2023
-
audiolm-pytorch Public
Forked from lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in PyTorch
Python MIT License Updated Mar 20, 2023
-
rayeren.github.io Public
Forked from RayeRen/rayeren.github.io
My personal homepage
SCSS MIT License Updated Feb 11, 2023
-
HN-UnifiedSourceFilterGAN Public
Forked from chomeyama/HN-UnifiedSourceFilterGAN
Python MIT License Updated Jul 30, 2022
-
ppg-vc Public
PPG-Based Voice Conversion
-
cceyda Public
Forked from cceyda/cceyda
Short profile with some stats and keywords
Updated Jul 11, 2022
-
s3prl Public
Forked from s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
-
efficient_tts Public
PyTorch implementation of "EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture"
-
VQMIVC Public
Forked from Wendison/VQMIVC
Official implementation of VQMIVC: One-shot Voice Conversion @ Interspeech 2021
-
fairseq Public
Forked from facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Python MIT License Updated Jun 17, 2021
-
BNE-Seq2SeqMoL-VC Public
Demo for "Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling"
-
ForwardTacotron Public
Forked from spring-media/ForwardTacotron
⏩ Generating speech in a single forward pass without any attention!
Python MIT License Updated Mar 25, 2021
-
glow-tts Public
Forked from jaywalnut310/glow-tts
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
Python MIT License Updated Dec 7, 2020
-
CPC_audio Public
Forked from facebookresearch/CPC_audio
An implementation of the Contrastive Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
Python MIT License Updated Dec 3, 2020
-
WavAugment Public
Forked from facebookresearch/WavAugment
A library for speech data augmentation in time-domain
Python MIT License Updated Dec 2, 2020
-
aishell-3-baseline-fc Public
Forked from sos1sos2Sixteen/aishell-3-baseline-fc
The code for aishell-3 baseline acoustic model
Jupyter Notebook MIT License Updated Nov 30, 2020
-
WaveGrad Public
Forked from ivanvovk/WaveGrad
Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
-
hifi-gan Public
Forked from jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Python MIT License Updated Nov 9, 2020
-
Parselmouth Public
Forked from YannickJadoul/Parselmouth
Praat in Python, the Pythonic way
C++ GNU General Public License v3.0 Updated Sep 25, 2020