IQE-CLIP: Instance-aware Query Embedding for Zero/Few-shot Anomaly Detection in Medical Domain

Hong Huang, Weixiang Sun, Zhijian Wu, Jingwen Niu, Donghuan Lu, Xian Wu, Yefeng Zheng

Paper Link: https://arxiv.org/abs/2506.10730

The official PyTorch implementation of the paper "IQE-CLIP: Instance-aware Query Embedding for Zero/Few-shot Anomaly Detection in Medical Domain".

Abstract: Recently, the rapid advancement of vision-language models such as CLIP has led to significant progress in zero-/few-shot anomaly detection (ZFSAD) tasks. However, most existing CLIP-based ZFSAD methods commonly assume prior knowledge of categories and rely on carefully crafted prompts tailored to specific scenarios. While such meticulously designed text prompts effectively capture semantic information in the textual space, they fall short of distinguishing normal and anomalous instances within the joint embedding space. Moreover, these ZFSAD methods have predominantly been explored in industrial scenarios, with few efforts devoted to medical tasks. To this end, we propose an innovative framework for ZFSAD tasks in the medical domain, denoted as IQE-CLIP. We reveal that query embeddings, which incorporate both textual and instance-aware visual information, are better indicators of abnormalities. Specifically, we first introduce class-based prompting tokens and learnable prompting tokens to better adapt CLIP to the medical domain. Then, we design an instance-aware query module (IQM) to extract region-level contextual information from both text prompts and visual features, enabling the generation of query embeddings that are more sensitive to anomalies. Extensive experiments conducted on six medical datasets demonstrate that IQE-CLIP achieves state-of-the-art performance on both zero-shot and few-shot tasks.

🛠️ Get Started

🔧 Installation

To set up the IQE-CLIP environment, follow the steps below:

  • Clone the repository:
    git clone https://github.com/hongh0/IQE-CLIP.git && cd IQE-CLIP
  • Create a conda environment and install dependencies:
    conda create -n IQECLIP python=3.9.5 -y
    conda activate IQECLIP
    conda install pytorch==2.1.2 torchvision==0.16.2 torchaudio==2.1.2 pytorch-cuda=12.1 -c pytorch -c nvidia
    pip install -r requirements.txt
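
As an optional sanity check (not part of the official setup instructions), the short Python snippet below confirms that the expected PyTorch build and a CUDA device are visible from the new environment; the version numbers simply mirror the conda install command above.

    import torch
    import torchvision

    # Expected to print 2.1.2 / 0.16.2, matching the conda install command above.
    print("torch:", torch.__version__)
    print("torchvision:", torchvision.__version__)
    # True if the CUDA 12.1 build can see a GPU on this machine.
    print("CUDA available:", torch.cuda.is_available())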

📦 Pretrained model

📁 Data Preparation

  1. Please follow the BMAD instructions to apply for permission to download the relevant datasets.

  2. Or use the pre-processed benchmark provided by MVFAD and download the datasets listed below.

  3. Place the downloaded archives in the data directory at the repository root and extract them:

    tar -xvf Liver.tar.gz
    tar -xvf Brain.tar.gz
    tar -xvf Histopathology_AD.tar.gz
    tar -xvf Retina_RESC.tar.gz
    tar -xvf Retina_OCT2017.tar.gz
    tar -xvf Chest.tar.gz
    
  4. The resulting file structure is as follows (a small layout check is sketched after the tree):

    data/
    ├── Brain_AD/
    │   ├── valid/
    │   └── test/
    ├── ...
    ├── Retina_RESC_AD/
    │   ├── valid/
    │   └── test/
    ...
    dataset/
    ├── fewshot_seed/
    │   ├── Brain/
    │   ├── ...
    │   └── Retina_RESC/
    ├── medical_few.py
    └── medical_zero.py
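
If you want to double-check this layout before training, the minimal sketch below (not part of the repository) walks the data directory and reports any missing valid/ or test/ split. The folder names other than Brain_AD and Retina_RESC_AD are assumptions based on the naming pattern in the tree above; adjust them to match what you extracted.

    import os

    # Folder names follow the "<Dataset>_AD" pattern shown above; all but Brain_AD
    # and Retina_RESC_AD are assumptions -- rename to match your extracted folders.
    DATASETS = ["Brain_AD", "Liver_AD", "Histopathology_AD",
                "Retina_RESC_AD", "Retina_OCT2017_AD", "Chest_AD"]

    for name in DATASETS:
        for split in ("valid", "test"):
            path = os.path.join("data", name, split)
            print(f"{path}: {'ok' if os.path.isdir(path) else 'MISSING'}")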
    

🚀 Train

Run the following command to train the model in zero-shot mode on a specific dataset (e.g., Brain).

python train_zero.py \
  --obj Brain \
  --batch_size 16 \
  --epoch 50 \
  --features_list [6,12,18,24] \
  --log_path 'Brain_zero.log' \
  --save_dir './ckpt' \
  --prompt_len 2 \
  --deep_prompt_len 1 \
  --use_global \
  --total_d_layer_len 11
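
To train zero-shot models for all six datasets in one go, a small driver script along the following lines can help. This is a sketch, not part of the repository: the --obj values other than Brain are assumptions about how the other datasets are named, and every other flag mirrors the command above.

    import subprocess

    # The object names below (except Brain) are assumptions; adjust them to the
    # --obj values that train_zero.py actually accepts.
    OBJECTS = ["Brain", "Liver", "Histopathology", "Retina_RESC", "Retina_OCT2017", "Chest"]

    for obj in OBJECTS:
        subprocess.run([
            "python", "train_zero.py",
            "--obj", obj,
            "--batch_size", "16",
            "--epoch", "50",
            "--features_list", "[6,12,18,24]",
            "--log_path", f"{obj}_zero.log",
            "--save_dir", "./ckpt",
            "--prompt_len", "2",
            "--deep_prompt_len", "1",
            "--use_global",
            "--total_d_layer_len", "11",
        ], check=True)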

Use the following command to train the model in few-shot mode. This setting assumes a small number of anomaly examples are available (e.g., K=4).

python train_few.py \
  --obj Brain \
  --shot 4 \
  --batch_size 16 \
  --epoch 50 \
  --features_list [6,12,18,24] \
  --log_path 'Brain_few.log' \
  --save_dir './ckpt' \
  --prompt_len 2 \
  --deep_prompt_len 1 \
  --use_global \
  --total_d_layer_len 11

🔍 Test

After training, run the following command to evaluate the model in zero-shot mode. Make sure to replace $YOUR_CHECKPOINT_PATH with the path to your trained model checkpoint.

python test_zero.py \
  --obj Brain \
  --batch_size 16 \
  --features_list [6,12,18,24] \
  --log_path 'Test_Brain_zero.log' \
  --ckpt_path $YOUR_CHECKPOINT_PATH \
  --prompt_len 2 \
  --deep_prompt_len 1 \
  --use_global \
  --total_d_layer_len 11
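
If you are unsure which file to pass as $YOUR_CHECKPOINT_PATH, a small helper like the one below lists the most recently written checkpoint under the ./ckpt directory used during training. This is only a sketch: the .pth extension and the layout under ./ckpt are assumptions about how the training scripts save checkpoints.

    import glob
    import os

    # './ckpt' matches the --save_dir used in the training commands above.
    # The '*.pth' pattern is an assumption about the checkpoint file extension.
    candidates = glob.glob(os.path.join("ckpt", "**", "*.pth"), recursive=True)
    if candidates:
        print("Most recent checkpoint:", max(candidates, key=os.path.getmtime))
    else:
        print("No .pth files found under ./ckpt")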

Use the following command to evaluate the model in few-shot mode.

python test_few.py \
  --obj Brain \
  --shot 4 \
  --batch_size 16 \
  --features_list [6,12,18,24] \
  --log_path 'Test_Brain_few.log' \
  --ckpt_path $YOUR_CHECKPOINT_PATH \
  --prompt_len 2 \
  --deep_prompt_len 1 \
  --use_global \
  --total_d_layer_len 11

Acknowledgement

Our work is largely inspired by the following projects. Thanks for their admirable contributions.

Citation

If you find this project helpful for your research, please consider citing the following BibTeX entry.

@misc{huang2025iqeclip,
    title={IQE-CLIP: Instance-aware Query Embedding for Zero-/Few-shot Anomaly Detection in Medical Domain}, 
    author={Hong Huang and Weixiang Sun and Zhijian Wu and Jingwen Niu and Donghuan Lu and Xian Wu and Yefeng Zheng},
    year={2025},
    eprint={2506.10730},
    archivePrefix={arXiv},
    primaryClass={cs.CV},
    url={https://arxiv.org/abs/2506.10730}, 
}
