-
PDFMathTranslate Public
Forked from Byaidu/PDFMathTranslate
PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents that fully preserves the layout; supports Google/DeepL/Ollama/OpenAI and other services; provides CLI/GUI/Docker
Python GNU Affero General Public License v3.0 Updated Dec 18, 2024 -
sgmse Public
Forked from sp-uhh/sgmse
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
Python MIT License Updated Dec 8, 2023 -
conformer Public
Forked from lucidrains/conformer
Implementation of the convolutional module from the Conformer paper, for use in Transformers
Python MIT License Updated May 17, 2023 -
torchsubband Public
Forked from haoheliu/torchsubband
PyTorch implementation of subband decomposition
HTML MIT License Updated Jul 26, 2022 -
speechmetrics Public
Forked from aliutkus/speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
Python MIT License Updated Oct 7, 2021 -
music_source_separation Public
Forked from bytedance/music_source_separation
Python Other Updated Sep 25, 2021 -
COSPA Public
Forked from ModarHalimeh/COSPA
Complex-valued Spatial Autoencoders for Multichannel Speech Enhancement
Apache License 2.0 Updated Aug 13, 2021 -
Percepnet-Keras Public
Forked from cookcodes/Percepnet-Keras
PercepNet implemented using Keras; still needs to be optimized and tuned.
C BSD 3-Clause "New" or "Revised" License Updated Jul 23, 2021 -
crepe Public
Forked from marl/crepe
CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)
Python MIT License Updated Jul 19, 2021 -
TAC Public
Forked from yluo42/TAC
Transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.
Python Updated Jun 15, 2021 -
PyTorch_Tutorial Public
Forked from TingsongYu/PyTorch_Tutorial
Companion code for the book 《Pytorch模型训练实用教程》 ("A Practical Tutorial on PyTorch Model Training")
Python Updated Jun 8, 2021 -
Phase_aware_Deep_Complex_UNet Public
Forked from russellgeum/Phase-aware-Deep-Complex-UNet
Implementation of Phase-aware Speech Enhancement with Deep Complex U-Net
Python Updated May 22, 2021 -
separation_data_preparation Public
Forked from YongyuG/separation_data_preparation
Data preparation for separation
Python Updated Apr 20, 2021 -
ConferencingSpeech2021 Public
Forked from ConferencingSpeech/ConferencingSpeech2021
Conferencing Speech Challenge
Python Apache License 2.0 Updated Apr 6, 2021 -
Cone-of-Silence Public
Forked from vivjay30/Cone-of-Silence
The Cone of Silence:
Python MIT License Updated Dec 18, 2020 -
BIRD Public
Forked from Andong-Li-speech/BIRD
Big Impulse Response Dataset
Python GNU General Public License v3.0 Updated Nov 12, 2020 -
audioprocessing Public
Forked from abdfahim/audioprocessing
Standard libraries for audio processing, especially STFT and Spherical Harmonics decomposition of a soundfield.
MATLAB MIT License Updated Jul 25, 2020 -
sound-source-localization-algorithm_DOA_estimation Public
Forked from WenzheLiu-Speech/sound-source-localization-algorithm_DOA_estimation
Some traditional algorithms used for sound source localization (DOA estimation) of speech signals
MATLAB Updated Jul 14, 2020 -
awesome-speech-enhancement Public
Forked from WenzheLiu-Speech/awesome-speech-enhancement
Speech enhancement / speech separation / sound source localization
Updated Jul 6, 2020 -
DeepXi Public
Forked from anicolson/DeepXi
Deep Xi: A Deep Learning Approach to A Priori SNR Estimation. Used for Speech Enhancement and robust ASR.
Python Mozilla Public License 2.0 Updated Jun 30, 2020 -
pyannote-audio Public
Forked from pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, speaker embedding
Jupyter Notebook MIT License Updated Jun 19, 2020 -
setk Public
Forked from funcwj/setk
Tools for Speech Enhancement integrated with Kaldi
Python Apache License 2.0 Updated Jun 4, 2020 -
Good_open_source_library Public
Mainly on audio (speech/voice/sound...), along with other topics
Updated Jun 2, 2020 -
onssen Public
Forked from speechLabBcCuny/onssen
An open-source speech separation and enhancement library
Python GNU General Public License v3.0 Updated May 13, 2020 -
sse2neon Public
Forked from DLTcollab/sse2neon
C/C++ header converting Intel SSE intrinsics to Arm/Aarch64 NEON intrinsics
-
InterpretableMLBook Public
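Forked from MingchaoZhu/InterpretableMLBook
《可解释的机器学习--黑盒模型可解释性理解指南》 ("Interpretable Machine Learning: A Guide for Making Black Box Models Explainable"), the Chinese edition of "Interpretable Machine Learning"
GNU General Public License v3.0 Updated May 10, 2020 -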
cmake-examples Public
Forked from ttroy50/cmake-examples
Useful CMake Examples
CMake MIT License Updated Apr 24, 2020 -
WebRTC-audio-processing Public
Forked from shichaog/WebRTC-audio-processing
WebRTC audio processing
-
sound-separation Public
Forked from google-research/sound-separation
Python Apache License 2.0 Updated Apr 10, 2020