This repository contains the implementation of the paper:
Oneata, Dan, Leanne Nortje, Yevgen Matusevych, and Herman Kamper. The mutual exclusivity bias of bilingual visually grounded speech models. Interspeech, 2025.
The implementation relies on PyTorch, which can be installed via conda:
conda create -n me-vgs python=3.12
conda activate me-vgs
conda install pytorch torchvision torchaudio pytorch-cuda=12.1 -c pytorch -c nvidia
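To confirm that the install worked and that PyTorch can see the GPU, a quick check can help (this snippet uses only standard PyTorch calls and is not part of the repository):

```python
# Optional sanity check after the conda install: report the PyTorch version
# and whether a CUDA device is visible.
import torch

print(torch.__version__)
print("CUDA available:", torch.cuda.is_available())
```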
Then we can install the code as a library:
pip install -e .
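As a quick smoke test (a sketch, not a script shipped with the repository), the editable install can be verified by importing the package:

```python
# Confirm that the editable install of the mevgs package is importable and
# resolves to the local checkout.
import mevgs

print(mevgs.__file__)
```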
To train a visually grounded speech model, we can run the mevgs/train.py script followed by the configuration name; for example:

python mevgs/train.py en-nl_links-no_size-md_a

The list of available configurations is in the mevgs/config.py file.
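To see which configuration names are available without opening the file, a short sketch follows; it assumes that mevgs/config.py stores the configurations in a dict-like registry named CONFIGS, which is a guess at the variable name rather than a documented API:

```python
# List the available configuration names, assuming a dict named CONFIGS
# (hypothetical name) maps configuration names to their settings.
from mevgs.config import CONFIGS  # hypothetical registry name

for name in sorted(CONFIGS):
    print(name)
```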
To obtain predictions for the mutual exclusivity tests, we can run the mevgs/predict.py script, again followed by the configuration name; for example:

python mevgs/predict.py en-nl_links-no_size-md_a
The results are then obtained with the mevgs/scripts/evaluate.py script:
python mevgs/scripts/evaluate.py en-nl_links-no_size-md_a
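To run the full pipeline over several configurations in one go, a hypothetical driver is sketched below; the variant suffixes _a through _e are an assumption extrapolated from the example configuration name above:

```python
# Chain the three steps above (train, predict, evaluate) for a set of
# configurations by invoking the repository scripts as subprocesses.
import subprocess

# Assumed variant suffixes; check mevgs/config.py for the actual names.
configs = [f"en-nl_links-no_size-md_{v}" for v in "abcde"]

for config in configs:
    for script in ("mevgs/train.py", "mevgs/predict.py", "mevgs/scripts/evaluate.py"):
        subprocess.run(["python", script, config], check=True)
```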
The results in Table 1 of the paper are obtained with the following command:
python mevgs/scripts/show_interspeech25_table_1.py