8000 speechdnn repositories · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
Change the repository type filter

All

    Repositories list

    • Noises

      Public
      Noise15 , Noisex-92 and Nonspeech
      104020Updated Nov 17, 2020Nov 17, 2020
    • The deep residual shrinkage network is a variant of deep residual networks.
      Python
      107000Updated Nov 4, 2020Nov 4, 2020
    • Code for NeurIPS 2020 paper: Blind Video Temporal Consistency via Deep Video Prior
      Python
      39000Updated Nov 3, 2020Nov 3, 2020
    • spleeter

      Public
      Deezer source separation library including pretrained models.
      Python
      MIT License
      2.9k000Updated Oct 8, 2020Oct 8, 2020
    • nussl

      Public
      A flexible source separation library in Python
      Python
      MIT License
      96000Updated Aug 21, 2020Aug 21, 2020
    • demucs

      Public
      Code for the paper Music Source Separation in the Waveform Domain
      Python
      MIT License
      1.2k000Updated Jul 22, 2020Jul 22, 2020
    • full tensorflow implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks https://arxiv.org/abs/1806.02169
      Python
      MIT License
      54000Updated Mar 26, 2020Mar 26, 2020
    • Speech recognition module for Python, supporting several engines and APIs, online and offline.
      Python
      BSD 3-Clause "New" or "Revised" License
      2.4k000Updated Feb 14, 2020Feb 14, 2020
    • seld-net

      Public
      Sound event localization, detection, and tracking of multiple overlapping and moving sources in 2D spherical space using convolutional recurrent neural network
      Python
      Other
      66000Updated Dec 16, 2019Dec 16, 2019
    • World

      Public
      A high-quality speech analysis, manipulation and synthesis system
      C++
      Other
      255000Updated Dec 10, 2019Dec 10, 2019
    • Python re-implementation of the spectral clustering algorithm in the paper "Speaker Diarization with LSTM"
      Python
      Apache License 2.0
      71000Updated Nov 27, 2019Nov 27, 2019
    • uis-rnn

      Public
      This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
      Python
      Apache License 2.0
      320000Updated Nov 10, 2019Nov 10, 2019
    • An opensource text-to-speech (TTS) voice building tool
      JavaScript
      Apache License 2.0
      135000Updated Nov 2, 2019Nov 2, 2019
    • flite

      Public
      A small fast portable speech synthesis system
      C
      Other
      198000Updated Oct 30, 2019Oct 30, 2019
    • QQGroup

      Public
      The problems encountered in QQ Group 868373192
      0100Updated Oct 27, 2019Oct 27, 2019
    • tacotron2

      Public
      Tacotron 2 - PyTorch implementation with faster-than-realtime inference
      Jupyter Notebook
      BSD 3-Clause "New" or "Revised" License
      1.4k000Updated Oct 26, 2019Oct 26, 2019
    • for solving the end effects in the frames after signal processing
      MATLAB
      0200Updated Oct 25, 2019Oct 25, 2019
    • ImageAI

      Public
      A python library built to empower developers to build applications and systems with self-contained Computer Vision capabilities
      Python
      MIT License
      2.2k000Updated Oct 23, 2019Oct 23, 2019
    • speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition
      Python
      Apache License 2.0
      119000Updated Oct 18, 2019Oct 18, 2019
    • An Open Source Machine Learning Framework for Everyone
      C++
      Apache License 2.0
      75k100Updated Oct 12, 2019Oct 12, 2019
    • netron

      Public
      Visualizer for neural network, deep learning and machine learning models
      JavaScript
      MIT License
      2.9k100Updated Oct 11, 2019Oct 11, 2019
    • ncnn

      Public
      ncnn is a high-performance neural network inference framework optimized for the mobile platform
      C++
      Other
      4.3k000Updated Oct 10, 2019Oct 10, 2019
    • models-1

      Public
      Pre-trained and Reproduced Deep Learning Models (『飞桨』官方模型库,包含多种学术前沿和工业场景验证的深度学习模型)
      Python
      Apache License 2.0
      2.9k000Updated Oct 10, 2019Oct 10, 2019
    • gTTS

      Public
      Python library and CLI tool to interface with Google Translate's text-to-speech API
      Python
      MIT License
      369000Updated Oct 8, 2019Oct 8, 2019
    • ekho

      Public
      Chinese text-to-speech engine
      Scheme
      GNU General Public License v2.0
      267000Updated Oct 8, 2019Oct 8, 2019
    • Clone a voice in 5 seconds to generate arbitrary speech in real-time
      Python
      Other
      9k000Updated Oct 3, 2019Oct 3, 2019
    • examples

      Public
      TensorFlow examples
      Jupyter Notebook
      Apache License 2.0
      7.4k000Updated Sep 24, 2019Sep 24, 2019
    • DeepMind's Tacotron-2 Tensorflow implementation
      Python
      MIT License
      913000Updated Sep 24, 2019Sep 24, 2019
    • marytts

      Public
      MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
      Java
      Other
      758000Updated Sep 15, 2019Sep 15, 2019
    • WaveRNN

      Public
      WaveRNN Vocoder + TTS
      Python
      MIT License
      698000Updated Sep 8, 2019Sep 8, 2019
    0