Forked from yistLin/dvector

Speaker embedding (d-vector) trained with GE2E loss


KevinHua/dvector

 
 


D-Vector

This is the PyTorch implementation of speaker embedding (d-vector) trained with GE2E loss.

The original paper on the GE2E loss can be found here: Generalized End-to-End Loss for Speaker Verification
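For reference, the GE2E softmax loss from the paper can be sketched in a few lines of NumPy. This is an illustrative re-derivation, not the GE2E module used in this repository; `w` and `b` stand in for the learnable scale and bias, which are fixed here:

```python
import numpy as np

def ge2e_softmax_loss(emb, w=10.0, b=-5.0):
    """GE2E softmax loss for emb of shape [N speakers, M utterances, D].

    Illustrative NumPy sketch; w and b are the learnable scale and bias
    from the paper, treated as constants here.
    """
    N, M, _ = emb.shape
    emb = emb / np.linalg.norm(emb, axis=-1, keepdims=True)
    # speaker centroids, and leave-one-out centroids for each utterance's
    # own speaker (the paper excludes the utterance from its own centroid)
    centroids = emb.mean(axis=1)                              # [N, D]
    centroids = centroids / np.linalg.norm(centroids, axis=-1, keepdims=True)
    loo = (emb.sum(axis=1, keepdims=True) - emb) / (M - 1)    # [N, M, D]
    loo = loo / np.linalg.norm(loo, axis=-1, keepdims=True)
    # scaled cosine similarity S[j, i, k] of utterance (j, i) vs. centroid k
    S = w * np.einsum("jid,kd->jik", emb, centroids) + b
    own = w * np.einsum("jid,jid->ji", emb, loo) + b
    j = np.arange(N)
    S[j, :, j] = own                                          # own-speaker column
    # softmax loss: -S[j,i,j] + logsumexp_k S[j,i,k], averaged over utterances
    logsumexp = np.log(np.exp(S).sum(axis=-1))
    return (logsumexp - S[j, :, j]).mean()
```

Embeddings that cluster tightly per speaker and far from other speakers drive the loss toward zero, which is exactly the behavior the d-vector training encourages.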

Usage

Prepare training data

To use the script provided here, you have to organize your raw data in this way:

  • all utterances from a speaker should be placed under one directory (the speaker directory)
  • all speaker directories should be placed under one directory (the root directory)
  • a speaker directory may contain subdirectories, and utterances may be placed inside them

You have to specify two things here:

  • use -s or --save_dir to specify the directory for saving processed utterances
  • use -c or --config_path to specify the path to the configuration for the Audiotoolkit module

You can also specify the maximum number of utterances to be extracted and preprocessed for a single speaker, e.g. -m 50.

This script can also extract utterances from multiple root directories in one run. For example:

python prepare.py -s data-dir -c toolkit_config.yaml VCTK-Corpus/wav48 LibriSpeech/train-clean-360
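The layout rules above (speaker directories under one or more roots, recursive search for utterances, an optional per-speaker cap like -m) might be sketched as follows. This is a hypothetical helper, not prepare.py's actual code; the file extensions are an assumption:

```python
from pathlib import Path

# Assumed audio extensions; prepare.py may accept a different set.
AUDIO_EXTS = {".wav", ".flac"}

def collect_utterances(root_dirs, max_per_speaker=None):
    """Map each speaker directory name to its utterance files.

    Searches each root directory's immediate subdirectories (speakers),
    descending recursively so utterances may live in nested subdirectories.
    """
    speakers = {}
    for root in root_dirs:
        for spk_dir in sorted(p for p in Path(root).iterdir() if p.is_dir()):
            utts = sorted(p for p in spk_dir.rglob("*") if p.suffix in AUDIO_EXTS)
            if max_per_speaker is not None:
                utts = utts[:max_per_speaker]   # cap per speaker, like -m
            speakers.setdefault(spk_dir.name, []).extend(utts)
    return speakers
```

Passing several roots (e.g. both a VCTK and a LibriSpeech directory) simply merges their speakers into one mapping, which is what lets a single prepare.py run draw from multiple corpora.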

Start training

Only DATA_DIR, MODEL_DIR and CONFIG_PATH have to be specified here. For example:

python train.py data-dir model-dir dvector_config.yaml

Note that the configuration needed here is different from the one used for preprocessing. For more details, check the usage with python train.py -h. During training, event logs will be written under MODEL_DIR.

Continue training from saved checkpoints

To continue training from a saved checkpoint, specify the checkpoint path with -c or --checkpoint_path. Note that the other optional arguments can still be given, since they may differ from those used in the previous run.
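A command-line interface with required positionals plus an optional resume flag could be wired up as below. This is a hypothetical sketch of such an interface, not train.py's actual source; the real option list is shown by python train.py -h:

```python
import argparse

def build_parser():
    """Sketch of a trainer CLI: three required positionals plus an
    optional checkpoint path for resuming (hypothetical, for illustration)."""
    parser = argparse.ArgumentParser(description="Train a d-vector model.")
    parser.add_argument("data_dir", help="directory of preprocessed utterances")
    parser.add_argument("model_dir", help="directory for checkpoints and event logs")
    parser.add_argument("config_path", help="training configuration file")
    parser.add_argument("-c", "--checkpoint_path", default=None,
                        help="resume training from this checkpoint")
    return parser
```

With this shape, resuming is just the original command plus -c pointing at a saved checkpoint, and any other optional flags can be supplied again with new values.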

Results

The figure below shows the dimensionality-reduction result (using t-SNE) of embeddings for some utterances from LibriSpeech.

TSNE result

Credits

The GE2E-Loss module was originally borrowed from cvqluu/GE2E-Loss and then rewritten and optimized for speed.
