Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception
Updated Mar 4, 2025
PyTorch code for our paper "Grounding-IQA: Multimodal Language Grounding Model for Image Quality Assessment"
App-Controller: Allow users to manipulate your App with natural language
A Chinese version of CLIP that supports Chinese cross-modal retrieval and representation generation.