This repository contains the official code for Crane, a zero-shot anomaly detection framework built on CLIP.
- Introduction
- Results
- Getting Started
- Installation
- Datasets
- Custom Dataset
- Citation
- Acknowledgements
- Contact
Crane is a zero-shot anomaly detection (ZSAD) framework that leverages a pre-trained vision-language model, CLIP, for robust and generalizable anomaly localization. It introduces two attention refinement modules, E-Attn and D-Attn, inserted into the vision backbone to enhance patch-level alignment and fully exploit the pretrained knowledge for the zero-shot task. For image-level refinement, Crane adjusts the CLS token to improve global anomaly sensitivity and incorporates a context-guided prompt learning strategy to better model finer-grained anomalies. Together, these components strengthen both image-level and pixel-level detection. Extensive experiments across 14 datasets from industrial and medical domains show that Crane achieves state-of-the-art performance with consistent improvements across multiple evaluation metrics.
- Enhancing the sensitivity of the global (CLS) representation to anomalous cues for image-level anomaly detection
- Reinforcing patch-level alignment by extending self-correlation attention through E-Attn
- Further improving patch-level alignment using the similarity of DINO features through D-Attn
- Improving auxiliary training generalization through context-guided prompt learning
To reproduce the results, follow the instructions below to run inference and training:
All required libraries, including the correct PyTorch version, are specified in environment.yaml. Running setup.sh will automatically create the environment and install all dependencies.
git clone https://github.com/AlirezaSalehy/Crane.git && cd Crane
bash setup.sh
conda activate crane_env
The required checkpoints for CLIP and DINO will be downloaded automatically by the code and stored in ~/.cache. However, the ViT-B SAM checkpoint must be downloaded manually. Please download sam_vit_b_01ec64.pth from the official Segment Anything repository and place it in the following directory:
~/.cache/sam/sam_vit_b_01ec64.pth
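For convenience, here is a minimal sketch of the manual download step, assuming the standard checkpoint URL listed in the Segment Anything repository (please verify the URL there before use):

```bash
# Download the ViT-B SAM checkpoint into the directory the code expects.
# URL taken from the official Segment Anything repository; verify before use.
mkdir -p ~/.cache/sam
wget -O ~/.cache/sam/sam_vit_b_01ec64.pth \
  https://dl.fbaipublicfiles.com/segment_anything/sam_vit_b_01ec64.pth
```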
You can download the datasets from their official sources and use the utilities in datasets/generate_dataset_json/ to generate a compatible meta.json. Alternatively, you can obtain the datasets from the AdaCLIP repository, which provides them in a compatible format. Place all datasets under DATASETS_ROOT, which is defined in ./__init__.py.
bash test.sh default
bash train.sh default
You can easily use your custom dataset with our model by following the instructions below:
Your dataset must either include a meta.json file at the root directory, or be organized so that one can be automatically generated. The meta.json should follow this format:
- A dictionary with "train" and "test" at the highest level
- Each section contains class names mapped to a list of samples
- Each sample includes:
  - img_path: path to the image relative to the root dir
  - mask_path: path to the mask relative to the root dir (empty for normal samples)
  - cls_name: class name
  - specie_name: subclass or condition (e.g., "good", "fault1")
  - anomaly: anomaly label; 0 (normal) or 1 (anomalous)
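For illustration only, a minimal meta.json following this format might look like the example below (class, file, and specie names are hypothetical placeholders):

```json
{
  "train": {
    "c1": [
      {"img_path": "train/c1/good/000.png", "mask_path": "", "cls_name": "c1", "specie_name": "good", "anomaly": 0}
    ]
  },
  "test": {
    "c1": [
      {"img_path": "test/c1/good/000.png", "mask_path": "", "cls_name": "c1", "specie_name": "good", "anomaly": 0},
      {"img_path": "test/c1/fault1/001.png", "mask_path": "test/c1/masks/001.png", "cls_name": "c1", "specie_name": "fault1", "anomaly": 1}
    ]
  }
}
```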
If your dataset does not include the required meta.json, you can generate it automatically by organizing your data as shown below and running datasets/generate_dataset_json/custom_dataset.py:
datasets/your_dataset/
├── train/
│   ├── c1/
│   │   └── good/
│   │       └── <NAME>.png
│   └── c2/
│       └── good/
│           └── <NAME>.png
├── test/
│   ├── c1/
│   │   ├── good/
│   │   │   └── <NAME>.png
│   │   ├── fault1/
│   │   │   └── <NAME>.png
│   │   ├── fault2/
│   │   │   └── <NAME>.png
│   │   └── masks/
│   │       └── <NAME>.png
│   └── c2/
│       └── good/
...         ...
Once organized, run the script to generate a meta.json automatically at the dataset root.
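As a concrete sketch (the exact command-line arguments, if any, are an assumption; check the script itself):

```bash
# Hypothetical invocation of the provided generator script for the layout above.
# Check datasets/generate_dataset_json/custom_dataset.py for its actual arguments.
python datasets/generate_dataset_json/custom_dataset.py
# Expected result: a meta.json at datasets/your_dataset/meta.json
```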
Then place your dataset under DATASETS_ROOT, specified in datasets/generate_dataset_json/__init__.py, and run inference:
python test.py --dataset YOUR_DATASET --model_name default --epoch 5
This project is licensed under the MIT License. See the LICENSE file for details.
If you find this project helpful for your research, please consider citing our work using the following BibTeX entry.
BibTeX:
@article{salehi2025crane,
title={Crane: Context-Guided Prompt Learning and Attention Refinement for Zero-Shot Anomaly Detections},
author={Salehi, Alireza and Salehi, Mohammadreza and Hosseini, Reshad and Snoek, Cees GM and Yamada, Makoto and Sabokrou, Mohammad},
journal={arXiv preprint arXiv:2504.11055},
year={2025}
}
This project builds upon prior open-source work, and we thank the authors for their contributions and open-source support.
For questions or collaborations, please contact alireza99salehy@gmail.com.