ζ-mixup

This repository hosts the code supporting our two papers on ζ-mixup.

ζ-mixup is a multi-sample mixing-based data augmentation method to generate richer and more realistic outputs. ζ-mixup is a generalization of mixup with provably and demonstrably desirable properties that allows for convex combinations of N ≥ 2 samples weighted using a p-series interpolant. ζ-mixup better preserves the intrinsic dimensionality of the original datasets, is computationally efficient, and outperforms mixup, CutMix, and traditional data augmentation methods. Here are some visualizations comparing ζ-mixup to mixup:

Half-moons dataset (N = 512)

512 samples with non-linear class boundaries distributed in the shape of interleaving crescents.

1-D helix embedded in $\mathbb{R}^3$ (N = 8192)

8192 samples on a 1D helix as an example of low-D manifolds lying in high-D ambient spaces.

Repository Structure

zeta_mixup.py: Code for ζ-mixup data augmentation.
utils.py: Utility functions: codes for generating the weights for ζ-mixup and cross-entropy loss with "soft" target labels.
mixup.py: Original mixup implementation (source). Used only for the visualizations above.
demo/:
- demo_2d_halfmoons.py: Code for generating the half-moons visualizations shown above.
- demo_3d_spirals.py: Code for generating the 1-D helix visualizations shown above.
- demo_utils.py: Utility function for generating the half-moons dataset.
- demo_visualizations/: Output directory for visualizations shown above.

Abstract

Modern deep learning training procedures rely on model regularization techniques such as data augmentation methods, which generate training samples that increase the diversity of data and richness of label information. A popular recent method, mixup, uses convex combinations of pairs of original samples to generate new samples. However, as we show in our experiments, mixup can produce undesirable synthetic samples, where the data is sampled off the manifold and can contain incorrect labels. We propose ζ-mixup, a generalization of mixup with provably and demonstrably desirable properties that allows convex combinations of T ≥ 2 samples, leading to more realistic and diverse outputs that incorporate information from T original samples by using a p-series interpolant. We show that, compared to mixup, ζ-mixup better preserves the intrinsic dimensionality of the original datasets, which is a desirable property for training generalizable models. Furthermore, we show that our implementation of ζ-mixup is faster than mixup, and extensive evaluation on controlled synthetic and 26 diverse real-world natural and medical image classification datasets shows that ζ-mixup outperforms mixup, CutMix, and traditional data augmentation techniques.

Citation

If you use our code, please cite our papers:

Kumar Abhishek, Colin J. Brown, Ghassan Hamarneh, "Multi-Sample ζ-mixup: Richer, More Realistic Synthetic Samples from a p-Series Interpolant", Journal of Big Data (J Big Data), 2024.
Kumar Abhishek, Colin J. Brown, Ghassan Hamarneh, "ζ-mixup: Richer, More Realistic Mixing of Multiple Images", Medical Imaging with Deep Learning (MIDL), 2023.

The corresponding BibTeX entries are:

@article{abhishek2024multi,
author = {Abhishek, Kumar and Brown, Colin J. and Hamarneh, Ghassan},
title = {Multi-Sample $\zeta$-mixup: Richer, More Realistic Synthetic Samples from a $p$-Series Interpolant},
journal = {Journal of Big Data},
volume = {11},
number = {1},
pages = {1--41},
month = {Mar},
year = {2024},
ISSN = {2196-1115},
url = {http://dx.doi.org/10.1186/s40537-024-00898-6},
DOI = {10.1186/s40537-024-00898-6},
publisher = {Springer}
}

@inproceedings{abhishek2023zetamixup,
title = {$\zeta$-mixup: Richer, More Realistic Mixing of Multiple Images},
author = {Kumar Abhishek and Colin Joseph Brown and Ghassan Hamarneh},
booktitle = {Medical Imaging with Deep Learning, short paper track},
year = {2023},
pages = {1--5},
url = {https://openreview.net/forum?id=iXjsAarmqn}
}

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
demo		demo
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
mixup.py		mixup.py
overview.png		overview.png
utils.py		utils.py
zeta_mixup.py		zeta_mixup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ζ-mixup

Half-moons dataset (N = 512)

1-D helix embedded in $\mathbb{R}^3$ (N = 8192)

Repository Structure

Abstract

Citation

About

Uh oh!

Languages

License

kakumarabhishek/zeta-mixup

Folders and files

Latest commit

History

Repository files navigation

ζ-mixup

Half-moons dataset (N = 512)

1-D helix embedded in $\mathbb{R}^3$ (N = 8192)

Repository Structure

Abstract

Citation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages