- China
-
17:24
(UTC +08:00) - https://pengyvwang.github.io/
- https://orcid.org/0000-0001-5768-0658
More
Lists (1)
Sort Name ascending (A-Z)
Stars
The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]
Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".
AcadHomepage: A Modern and Responsive Academic Personal Homepage
Official PyTorch implementation of 'VINP: Variational Bayesian Inference with Neural Speech Prior for Joint ASR-Effective Speech Dereverberation and Blind RIR Identification'
Implementation of journal paper entitled 'An off-grid wideband DOA estimation method with the variational Bayes expectation-maximization framework' for DOA estimation [Signal Processing]
Implementation of journal paper entitled '1-Bit direction of arrival estimation via improved complex-valued binary iterative hard thresholding' for DOA estimation [Digital Signal Processing]
On the Variance of the Adaptive Learning Rate and Beyond
BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models
Algorithm for blind estimation of reverberation time
Python implementation of performance metrics in Loizou's Speech Enhancement book
Different implementations of "Weighted Prediction Error" for speech dereverberation
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…
Conformer-based Metric GAN for speech enhancement
Implementation of "DCCRN-Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement" by pytorch.
implementation of "DCCRN-Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement" by pytorch
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
A speech dereverberation algorithm, also called wpe
UT-Sarulab MOS prediction system using SSL models
Correctly generate plurals, ordinals, indefinite articles; convert numbers to words
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Robust Speech Recognition via Large-Scale Weak Supervision
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…