Stars
[CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization
Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers
Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents
This repo will contain scripts that automatically export paper information from openreview to ACM
Code for our paper: Learning Camera Movement Control from Real-World Drone Videos
Official Implementation for paper: Negative Token Merging: Image-based Adversarial Feature Guidance
This is the official implementation of "Vec2Face: Scaling Face Dataset Generation with Loosely Constrained Vectors", which is accepted at ICLR2025.
Repo for our CVPR 2023 paper on "High-Fidelity Guided Image Synthesis with Latent Diffusion Models"
The implementation for the paper "Assessing Model Generalization in Vicinity"
Synthesis large scale trainable data for the micro-expression recognition task
hierarchical multi-agent workflow for prompt optimazation
Implementation of the paper: "Privacy Assessment on Reconstructed Images: Are Existing Evaluation Metrics Faithful to Human Perception?” (NeurIPS 2023)
Optimizing Calibration by Gaining Aware of Prediction Correctness
The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?
Training A Small Emotional Vision Language Model for Visual Art Comprehension
[ICLR 2024 Spotlight] Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments
[ICCV2023] How Far Pre-trained Models Are from Neural Collapse on the Target Dataset Informs their Transferability
Taylor videos and Taylor-transformed skeletons (ICML 2024).
CIFAR-10-Warehouse: Towards Broad and More Realistic Testbeds in Model Generalization Analysis
Code for ICLR 2024 paper "Towards Optimal Feature-Shaping Methods for Out-of-Distribution Detection"
This repository includes various baseline techniques for label-free model evaluation task for the VDU2023 competition.
Repo for our NeurIPS 2023 paper on: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback
Training with Product Digital Twins for AutoRetail Checkout
SnP: Large-Scale Training Data Search for Object Re-Identification (CVPR 2023)