ASVtorch is a toolkit for automatic speaker recognition.
- Complete pipelines from audio files to speaker recognition scores
- Multi-GPU training of deep embedding extractors
- Fast GPU training of i-vector extractors
- GPU with at least 4 GB of memory (> 8 GB recommended)
- Preferably a computing server with many CPU cores and an ample amount of RAM
- A recent Kaldi installation
  - Needed for feature extraction and data augmentation
  - Also used for UBM training in i-vector systems
- ffmpeg
- Python environment (installation instructions below)
- Install ffmpeg if not yet installed
- Install Kaldi if not yet installed
  - http://kaldi-asr.org/doc/install.html
  - Note: the augmentation scripts in Kaldi have changed over time (for example, in 2019). If you encounter problems with data augmentation, try updating your Kaldi installation.
- Install a Python environment (the instructions below are for conda):

```
conda create -n asvtorch python=3.7
conda activate asvtorch
conda install -c pykaldi pykaldi-cpu
conda install pytorch=1.4 cudatoolkit -c pytorch
```

  If you do not have CUDA 10, try instead:

```
conda install pytorch=1.4 cudatoolkit=9.2 -c pytorch
```

  Finally, install the remaining dependencies:

```
conda install scipy matplotlib
pip install wget
```
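Which of the two PyTorch install commands above you should run depends on the CUDA version available on your machine. The selection logic can be sketched as a small helper (the function name and the version-string format are illustrative, not part of ASVtorch):

```python
# Hypothetical helper: pick the conda install command for PyTorch 1.4
# based on the CUDA version reported by your driver (e.g. "10.1" or "9.2").
def pick_cudatoolkit(cuda_version: str) -> str:
    """Return the conda command matching the detected CUDA major version."""
    major = int(cuda_version.split(".")[0])
    if major >= 10:
        # CUDA 10 or newer: use the default cudatoolkit build.
        return "conda install pytorch=1.4 cudatoolkit -c pytorch"
    # Older CUDA: fall back to the cudatoolkit 9.2 build.
    return "conda install pytorch=1.4 cudatoolkit=9.2 -c pytorch"
```

For example, `pick_cudatoolkit("9.2")` returns the fallback command with `cudatoolkit=9.2`.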
- Clone the ASVtorch repository:
  - Navigate to the folder where you want the `asvtorch` folder to be placed:

```
git clone https://gitlab.com/ville.vestman/asvtorch.git
cd asvtorch
```

- To install updates later on, run `git pull` in the `asvtorch` folder.
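After installation, a quick way to confirm the environment is complete is to check that each dependency can be imported. This is a minimal sketch, not part of the toolkit; it assumes pykaldi is importable as `kaldi`:

```python
import importlib.util

def available(module: str) -> bool:
    """Return True if the named module can be imported in this environment."""
    return importlib.util.find_spec(module) is not None

# Dependencies installed in the steps above (import names, not package names).
deps = ["torch", "kaldi", "scipy", "matplotlib", "wget"]
missing = [m for m in deps if not available(m)]
if missing:
    print("Missing modules:", ", ".join(missing))
else:
    print("All dependencies found.")
```

If anything is reported missing, re-run the corresponding conda or pip command from the steps above.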
- See the instructions in asvtorch/recipes/voxceleb/xvector/README.md
- For more information on how to execute and configure experiments, see asvtorch/src/settings/README.md
- To train neural networks by using multiple GPUs in parallel, see multigpu_readme.md
- To prepare custom datasets, see data_preparation_readme.md
- To create custom network architectures, see custom_architectures_readme.md
The ASVtorch toolkit is licensed under the MIT license; see LICENSE.txt. A small portion of the toolkit's code is adapted from the Kaldi toolkit. Such code is marked with comments and remains licensed under its original Apache 2.0 License.