SPANet

official codes for our WACV 2024 paper (Interpretable Object Recognition by Semantic Prototype Analysis)

Environment

Python 3.8 & PyTorch 2.0 with CUDA.

conda create -n spanet python=3.8
conda activate spanet
conda install pytorch torchvision pytorch-cuda=11.8 -c pytorch -c nvidia
pip install ftfy regex tqdm

Dataset Preparation

The instructions are from https://github.com/cfchen-duke/ProtoPNet

Instructions for preparing the data:

Download the dataset CUB_200_2011.tgz from http://www.vision.caltech.edu/visipedia/CUB-200-2011.html

Unpack CUB_200_2011.tgz

Crop the images using information from bounding_boxes.txt (included in the dataset)

Split the cropped images into training and test sets, using train_test_split.txt (included in the dataset)

Put the cropped training images in the directory ./datasets/cub200_cropped/train_cropped/

Put the cropped test images in the directory ./datasets/cub200_cropped/test_cropped/

Augment the training set using img_aug.py (included in this code package) -- this will create an augmented training set in the following directory: ./datasets/cub200_cropped/train_cropped_augmented/

Cropped CUB test dataset should look like this:

./datasets/CUB/cub200_cropped/test_cropped/001.Black_footed_Albatross/Black_Footed_Albatross_0001_796111.JPG
./datasets/CUB/cub200_cropped/test_cropped/001.Black_footed_Albatross/Black_Footed_Albatross_0002_55.JPG
...
./datasets/CUB/cub200_cropped/test_cropped/200.Common_Yellowthroat/Common_Yellowthroat_0125_190902.JPG

Model Weights Preparation

Download model weights (including pretrained weights from CLIP) from the latest published release of this repository. Unzip pretrained_models.zip to pretrained_models/clip, and unzip my_models.zip to my_models.

pretrained_models and my_models should look like this:

./pretrained_models/clip/RN50.pt
./pretrained_models/clip/RN101.zip
./pretrained_models/clip/ViT-B-16.zip
./pretrained_models/clip/ViT-B-32.zip
./my_models/CUB_RN50.pth
./my_models/CUB_RN101.pth
./my_models/CUB_ViTB16.pth
./my_models/CUB_ViTB32.pth

Evaulation

python test.py

Training

Training code is under construction, which will be released soon.

Citation

Wan, Q., Wang, R., & Chen, X. (2024). Interpretable Object Recognition by Semantic Prototype Analysis. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (pp. 800-809).

@inproceedings{wan2024interpretable,
  title={Interpretable Object Recognition by Semantic Prototype Analysis},
  author={Wan, Qiyang and Wang, Ruiping and Chen, Xilin},
  booktitle={Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision},
  pages={800--809},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
clip		clip
datasets		datasets
my_models		my_models
pretrained_models/clip		pretrained_models/clip
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
hook_features.py		hook_features.py
model.py		model.py
receptive_field.py		receptive_field.py
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SPANet

Environment

Dataset Preparation

Model Weights Preparation

Evaulation

Training

Citation

About

Uh oh!

Releases 1

Packages

Uh oh!

Languages

License

WanQiyang/SPANet

Folders and files

Latest commit

History

Repository files navigation

SPANet

Environment

Dataset Preparation

Model Weights Preparation

Evaulation

Training

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Languages

Packages