8000 speechdnn repositories · GitHub

More Web Proxy on the site http://driver.im/

speechdnn

All

103 repositories

Noises
Public
Noise15 , Noisex-92 and Nonspeech
10•40•2•0•Updated Nov 17, 2020Nov 17, 2020
Deep-Residual-Shrinkage-Networks
Public
The deep residual shrinkage network is a variant of deep residual networks.
Python
•107•0•0•0•Updated Nov 4, 2020Nov 4, 2020
deep-video-prior
Public
Code for NeurIPS 2020 paper: Blind Video Temporal Consistency via Deep Video Prior
Python
•39•0•0•0•Updated Nov 3, 2020Nov 3, 2020
spleeter
Public
Deezer source separation library including pretrained models.
Python
•
MIT License
•2.9k•0•0•0•Updated Oct 8, 2020Oct 8, 2020
nussl
Public
A flexible source separation library in Python
Python
•
MIT License
•96•0•0•0•Updated Aug 21, 2020Aug 21, 2020
demucs
Public
Code for the paper Music Source Separation in the Waveform Domain
Python
•
MIT License
•1.2k•0•0•0•Updated Jul 22, 2020Jul 22, 2020
StarGAN-Voice-Conversion
Public
full tensorflow implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks https://arxiv.org/abs/1806.02169
Python
•
MIT License
•54•0•0•0•Updated Mar 26, 2020Mar 26, 2020
speech_recognition
Public
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Python
•
BSD 3-Clause "New" or "Revised" License
•2.4k•0•0•0•Updated Feb 14, 2020Feb 14, 2020
seld-net
Public
Sound event localization, detection, and tracking of multiple overlapping and moving sources in 2D spherical space using convolutional recurrent neural network
Python
•
Other
•66•0•0•0•Updated Dec 16, 2019Dec 16, 2019
World
Public
A high-quality speech analysis, manipulation and synthesis system
C++
•
Other
•255•0•0•0•Updated Dec 10, 2019Dec 10, 2019
SpectralCluster
Public
Python re-implementation of the spectral clustering algorithm in the paper "Speaker Diarization with LSTM"
Python
•
Apache License 2.0
•71•0•0•0•Updated Nov 27, 2019Nov 27, 2019
uis-rnn
Public
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
Python
•
Apache License 2.0
•320•0•0•0•Updated Nov 10, 2019Nov 10, 2019
voice-builder
Public
An opensource text-to-speech (TTS) voice building tool
JavaScript
•
Apache License 2.0
•135•0•0•0•Updated Nov 2, 2019Nov 2, 2019
flite
Public
A small fast portable speech synthesis system
C
•
Other
•198•0•0•0•Updated Oct 30, 2019Oct 30, 2019
QQGroup
Public
The problems encountered in QQ Group 868373192
0•1•0•0•Updated Oct 27, 2019Oct 27, 2019
tacotron2
Public
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Jupyter Notebook
•
BSD 3-Clause "New" or "Revised" License
•1.4k•0•0•0•Updated Oct 26, 2019Oct 26, 2019
end_effects
Public
for solving the end effects in the frames after signal processing
MATLAB
•0•2•0•0•Updated Oct 25, 2019Oct 25, 2019
ImageAI
Public
A python library built to empower developers to build applications and systems with self-contained Computer Vision capabilities
Python
•
MIT License
•2.2k•0•0•0•Updated Oct 23, 2019Oct 23, 2019
Speaker-Diarization
Public
speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition
Python
•
Apache License 2.0
•119•0•0•0•Updated Oct 18, 2019Oct 18, 2019
tensorflow
Public
An Open Source Machine Learning Framework for Everyone
C++
•
Apache License 2.0
•75k•1•0•0•Updated Oct 12, 2019Oct 12, 2019
netron
Public
Visualizer for neural network, deep learning and machine learning models
JavaScript
•
MIT License
•2.9k•1•0•0•Updated Oct 11, 2019Oct 11, 2019
ncnn
Public
ncnn is a high-performance neural network inference framework optimized for the mobile platform
C++
•
Other
•4.3k•0•0•0•Updated Oct 10, 2019Oct 10, 2019
models-1
Public
Pre-trained and Reproduced Deep Learning Models （『飞桨』官方模型库，包含多种学术前沿和工业场景验证的深度学习模型）
Python
•
Apache License 2.0
•2.9k•0•0•0•Updated Oct 10, 2019Oct 10, 2019
gTTS
Public
Python library and CLI tool to interface with Google Translate's text-to-speech API
Python
•
MIT License
•369•0•0•0•Updated Oct 8, 2019Oct 8, 2019
ekho
Public
Chinese text-to-speech engine
Scheme
•
GNU General Public License v2.0
•267•0•0•0•Updated Oct 8, 2019Oct 8, 2019
Real-Time-Voice-Cloning
Public
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Python
•
Other
•9k•0•0•0•Updated Oct 3, 2019Oct 3, 2019
examples
Public
TensorFlow examples
Jupyter Notebook
•
Apache License 2.0
•7.4k•0•0•0•Updated Sep 24, 2019Sep 24, 2019
Tacotron-2
Public
DeepMind's Tacotron-2 Tensorflow implementation
Python
•
MIT License
•913•0•0•0•Updated Sep 24, 2019Sep 24, 2019
marytts
Public
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
Java
•
Other
•758•0•0•0•Updated Sep 15, 2019Sep 15, 2019
WaveRNN
Public
WaveRNN Vocoder + TTS
Python
•
MIT License
•698•0•0•0•Updated Sep 8, 2019Sep 8, 2019

0