-
bark Public
Forked from suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Python Other Updated Apr 25, 2023 -
segment-anything Public
Forked from facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Jupyter Notebook Apache License 2.0 Updated Apr 7, 2023 -
AudioLDM Public
Forked from haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Python Other Updated Feb 12, 2023 -
SpeechTransProgress Public
Forked from kahne/SpeechTransProgress
Tracking the progress in end-to-end speech translation
Creative Commons Zero v1.0 Universal Updated Oct 30, 2021 -
WeTS Public
Forked from ZhenYangIACAS/WeTS
A benchmark for the task of translation suggestion
Mask The Unlicense Updated Oct 19, 2021 -
Image Restoration Toolbox (PyTorch). Training and testing codes for DPIR, USRNet, DnCNN, FFDNet, SRMD, DPSR, BSRGAN
Python MIT License Updated Sep 7, 2021 -
singing_transcription_ICASSP2021 Public
Forked from york135/singing_transcription_ICASSP2021
The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"
Python Updated May 23, 2021 -
s3prl Public
Forked from s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
Python MIT License Updated Apr 16, 2021 -
pytorch_cpp Public
Forked from koba-jon/pytorch_cpp
Deep Learning sample programs using PyTorch in C++
C++ MIT License Updated Apr 2, 2021 -
traditional-speech-enhancement Public
Forked from PandoraLS/traditional-speech-enhancement
Traditional speech enhancement methods
MATLAB MIT License Updated Mar 11, 2021 -
Subband-Music-Separation Public
Forked from haoheliu/Subband-Music-Separation
Pytorch: Channel-wise subband input for better voice and accompaniment separation
Python Updated Jan 22, 2021 -
pytorch-optimizer Public
Forked from jettify/pytorch-optimizer
torch-optimizer -- collection of optimizers for Pytorch
Python Apache License 2.0 Updated Dec 30, 2020 -
pika Public
Forked from tencent-ailab/pika
a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi
Python Apache License 2.0 Updated Dec 25, 2020 -
Listening-to-Sound-of-Silence-for-Speech-Denoising Public
Forked from henryxrl/Listening-to-Sound-of-Silence-for-Speech-Denoising
[NeurIPS 2020] Official repository for the project "Listening to Sound of Silence for Speech Denoising"
-
awesome-audio-visual Public
Forked from krantiparida/awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
Updated Nov 28, 2020 -
A 2-dimensional Self-attention-based Solution with Cooperative Gated Convolutional Modules for Speech Enhancement
-
performer-pytorch Public
Forked from lucidrains/performer-pytorch
An implementation of Performer, a linear attention-based transformer, in Pytorch
Python MIT License Updated Nov 11, 2020 -
DeepComplexCRN Public
Forked from huyanxin/DeepComplexCRN
HTML Apache License 2.0 Updated Nov 9, 2020 -
spleeter Public
Forked from deezer/spleeter
Deezer source separation library including pretrained models.
Python MIT License Updated Nov 8, 2020 -
DARCN Public
Forked from Andong-Li-speech/DARCN
The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement"
Python Updated Nov 4, 2020 -
asteroid Public
Forked from asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
Python MIT License Updated Oct 11, 2020 -
av-se Public
Forked from danmic/av-se
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
Updated Sep 9, 2020 -
DCUNetTorchSound Public
Forked from mhlevgen/DCUNetTorchSound
Implementation of Phase-aware speech enhancement with deep complex U-Net
Jupyter Notebook 6960 Updated Aug 16, 2020 -
python-pesq Public
Forked from ludlows/PESQ
PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)
C MIT License Updated Jul 26, 2020 -
recommended-books Public
Forked from fz199626/recommended-books
Recommended classic computer science books; PDF downloads are provided for some of them
MIT License Updated Jun 30, 2020 -
ganhacks Public
Forked from soumith/ganhacks
starter from "How to Train a GAN?" at NIPS2016
Updated Mar 5, 2020 -
sms_wsj Public
Forked from fgnt/sms_wsj
SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition
Python MIT License Updated Mar 3, 2020 -