Stars
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
The official implementation of AA-CLIP: Enhancing Zero-shot Anomaly Detection via Anomaly-Aware CLIP
Official implementation of CVPR 2024 PromptAD: Learning Prompts with Only Normal Samples for Few-Shot Anomaly Detection
Official implementation for AnomalyCLIP (ICLR 2024)
[CVPR 2023] Unofficial re-implementation of "WinCLIP: Zero-/Few-Shot Anomaly Classification and Segmentation".
[CVPR 2023 Workshop] VAND Challenge: 1st Place on Zero-shot AD and 4th Place on Few-shot AD
[ECCV2024] The Official Implementation for ''AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection''
【AAAI2024】TOP-ReID: Multi-spectral Object Re-Identification with Token Permutation
A General-purpose Person Re-identification Task with Instructions
【AAAI2025】MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt
Official implementation for "CLIP-ReID: Exploiting Vision-Language Model for Image Re-identification without Concrete Text Labels" (AAAI 2023)
(ECCV 2024) VCP-CLIP: A visual context prompting model for zero-shot anomaly segmentation
Paper list and datasets for industrial image anomaly/defect detection (updating). 工业异常/瑕疵检测论文及数据集检索库(持续更新)。
Anomaly detection related books, papers, videos, and toolboxes
Official implementation of CVPR'24 paper 'Anomaly Heterogeneity Learning for Open-set Supervised Anomaly Detection'.
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
Stable Diffusion web UI
FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, Comfy…
ControlNet++: All-in-one ControlNet for image generations and editing!
A collection of resources on controllable generation with text-to-image diffusion models.
Repository for the Paper "Multi-LoRA Composition for Image Generation"
[Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)
This repo contains the code for PreciseControl project [ECCV'24]
Code for Text2Human (SIGGRAPH 2022). Paper: Text2Human: Text-Driven Controllable Human Image Generation
StoryMaker: Towards consistent characters in text-to-image generation
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
[CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation