8000 GitHub - ElsevierSoftwareX/SOFTX-D-20-00038: ASVtorch Toolkit: Speaker Verification with Deep-Neural Networks. To cite this software publication: https://www.sciencedirect.com/science/article/pii/S235271102100042X
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

ElsevierSoftwareX/SOFTX-D-20-00038

Repository files navigation

ASVtorch

ASVtorch is a toolkit for automatic speaker recognition.

Main features

  • Complete pipelines from audio files to speaker recognition scores
  • Multi-GPU training of deep embedding extractors
  • Fast training of i-vector extractor with GPU

Requirements

  • GPU with at least 4 GB of memory (>8GB recommended)
  • Preferably a computing server with many CPU cores and ample amount of RAM
  • Recent Kaldi installation
    • Needed in feature extraction and data augmentation
    • Also used for UBM training in i-vector systems
  • ffmpeg
  • Python environment (installation instructions below)

Installation

  1. Install ffmpeg if not yet installed
  2. Install Kaldi if not yet installed
    • http://kaldi-asr.org/doc/install.html
    • Note: Augmentation scripts in Kaldi have changed over time (for example in 2019). Thus, if you encounter problems in data augmentation, try to update your Kaldi installation.
  3. Install a python environment (instructions below are for conda):
    1. conda create -n asvtorch python=3.7

    2. conda activate asvtorch

    3. conda install -c pykaldi pykaldi-cpu

    4. conda install pytorch=1.4 cudatoolkit -c pytorch
      If you do not have cuda 10, try instead:
      conda install pytorch=1.4 cudatoolkit=9.2 -c pytorch

    5. conda install scipy matplotlib

    6. pip install wget

  4. Clone ASVtorch repository
    1. Navigate to a folder where you want ASVtorch folder to be placed to
    2. git clone https://gitlab.com/ville.vestman/asvtorch.git
    3. cd asvtorch
  • To install updates later on:
    • run git pull in asvtorch folder

Running the VoxCeleb recipe

Other instructions

License

The ASVtorch toolkit is licensed under the MIT license. See LICENSE.txt. A small proportion of the codes of the toolkit are modified from the Kaldi Toolkit. These codes are marked with comments, and they are licensed under their original Apache 2.0 License.

About

ASVtorch Toolkit: Speaker Verification with Deep-Neural Networks. To cite this software publication: https://www.sciencedirect.com/science/article/pii/S235271102100042X

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published
0