Starred repositories
Q-Insight: Understanding Image Quality via Visual Reinforcement Learning
[ECCV'24] Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities
Implementation of paper "Leveraging Representations from Intermediate Encoder-blocks for Synthetic Image Detection"
SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese
The released data for the paper entilted "FakeBench: Probing Explainable Fake Image Detection via Large Multimodal Models"
Information, resources and knowledges about AIGC & Anti-AIGC.
This project aims to collect the latest "call for reviewers" links from various top CS/ML/AI conferences/journals
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
BARTScore: Evaluating Generated Text as Text Generation
The official code of "DRCT: Diffusion Reconstruction Contrastive Training towards Universe Detection of Diffusion Generated Images"
Official codes for "Q-Ground: Image Quality Grounding with Large Multi-modality Models", ACM MM2024 (Oral)
The official Tensorflow implementation for ICCV'19 paper 'Attributing Fake Images to GANs: Learning and Analyzing GAN Fingerprints'
[NeurIPS 2023] Sentry-Image: Detect Any AI-generated Images
[NeurIPS 2024 D&B Spotlight🔥] ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation
Interpretable-through-prototypes deepfake detection for diffusion models
DeepSeek-VL: Towards Real-World Vision-Language Understanding
A collection of AI-generated images papers and corresponding source code/demo program, including text-to-image, image translation (e.g., text-, image, or other multimodality-guided), image inpainti…
deep learning for image processing including classification and object-detection etc.
A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.
Code for CLIB-FIQA: Face Image Quality Assessment with Confidence Calibration
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
[IEEE TCSVT'24] Study of Subjective and Objective Naturalness Assessment of AI-Generated Images
✨✨Latest Advances on Multimodal Large Language Models
A curated list of Large Language Model (LLM) Interpretability resources.