ML4ML: automated invariance testing for machine leanring models

ML4ML invariance testing: Paper & Appendix | Model Repository | Metadata

Authors: Zukang Liao, Pengfei Zhang, Min Chen

Try visualising invariance testing results! Download the npyfiles for model 1. Specify the "data_dir" in plot_tool.py, so that the npy files are in "data_dir/mid/xxx.npy"

python plot_tool.py --mid=1 --aug_type=r

Output: You can check the plots for scaling (--aug_type=s) or brightness (--aug_type=b) by yourself.

General guideline to run the code:

(1). Standard CNN training (train.py)

Mutiple CNNs on either MNIST or CIFAR or other databases should be trained in advance. We use the code train.py to train many CNNs using the metadata specified in the metadata.txt file.

(2). Model database (model.py)

Our model database consists of 4 different partitions:

partition (a): mid: 1-100, t1-t50. VGG13bn trained on CIFAR for rotation invariance testing.
partition (b): mid: 101-200, t101-t150. VGG13bn trained on CIFAR for brightness invariance testing.
partition (c): mid: 201-300, t201-t250. VGG13bn trained on CIFAR for scaling invariance testing.
partition (d): mid: 301-400, t301-t350. CNN5 trained on MNIST for rotation invariance testing.

Mid starting with "t" is a hold-out set. When using three-fold cross validation, the hold-out set is always treated as one fold, while the rest 100 "regular" models are randomly split into two folds.

Notes: Model 1 to 15 are trained on CPU, other models are trained on GPU. Model 101 to 200, t101 to t150: preprocessing -- normalised to [0, 1], Others: [-0.5, 0.5]

(3). Invariance testing data (save_invariance_results.py)

For a given CNN (named "mid.pth" - mid short for model id, e.g., "1.pth"), please run:

python save_invariance_results.py --mid=1 --aug_type=r

where mid is the index of the CNN and aug_type: "r" for "rotation", "s" for "scaling" and "b" for "brightness". The script generates two .npy files, namely test_results1515.npy (CONF) and test_actoverall1515.npy (CONV).

For partition (d), please specify --dbname=mnist:

python save_invariance_results.py --mid=mid --aug_type=r --dbname=mnist

(4). Variance matrices (matrices_CONF.py and matrices_CONV.py)

To generate variance matrices (CONF):

python matrices_CONF.py --mid=mid --aug_type=r

To generate variance matrices (CONV):

python matrices_CONV.py --mid=mid --aug_type=r

Example at CONF level

(5). Measurements (measurements.py)

To generate a json file consisting of all measurements for the model:

python measurements.py --mid=mid --aug_type=r

For partition (d), please specify --dbname=mnist:

python measurements.py --mid=mid --aug_type=r --dbname=mnist

(6). ML4ML assessors (ML4MLassessor.py)

To train an ML4ML assessor with different types of ml algorithms. And test the performance of the assessor on the testing set of the model-database.

python ML4MLassessor.py --aug_type=r --dataset=1

For partition (d), please specify --dbname=mnist:

python ML4MLassessor.py --aug_type=r --dbname=mnist --dataset=1

Specify --dataset=1 for the first time runing the script. If the dataset.csv file already exists (which can be downloaded from the metadata link above), then please do not set any value to --dataset

Name		Name	Last commit message	Last commit date
Latest commit History 174 Commits
example_mid_73/1515		example_mid_73/1515
ML4MLassessor.py		ML4MLassessor.py
README.md		README.md
datamat.py		datamat.py
datasets.py		datasets.py
json_format.txt		json_format.txt
json_stat.py		json_stat.py
matrices_CONF.py		matrices_CONF.py
matrices_CONV.py		matrices_CONV.py
measurement.py		measurement.py
model.py		model.py
plot_tool.py		plot_tool.py
plot_tool_1.png		plot_tool_1.png
requirements.txt		requirements.txt
sampler.py		sampler.py
save_invariance_results.py		save_invariance_results.py
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ML4ML: automated invariance testing for machine leanring models

(1). Standard CNN training (train.py)

(2). Model database (model.py)

(3). Invariance testing data (save_invariance_results.py)

(4). Variance matrices (matrices_CONF.py and matrices_CONV.py)

(5). Measurements (measurements.py)

(6). ML4ML assessors (ML4MLassessor.py)

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Zukang-Liao/ML4ML-invariance-testing

Folders and files

Latest commit

History

Repository files navigation

ML4ML: automated invariance testing for machine leanring models

(1). Standard CNN training (train.py)

(2). Model database (model.py)

(3). Invariance testing data (save_invariance_results.py)

(4). Variance matrices (matrices_CONF.py and matrices_CONV.py)

(5). Measurements (measurements.py)

(6). ML4ML assessors (ML4MLassessor.py)

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages