Stars
Easy-to-use, efficient code for extracting OpenAI CLIP (Global/Grid) features from images and text.
X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsens…
A curated list of Meta Learning papers, code, books, blogs, videos, datasets and other resources.
Meta Learning / Learning to Learn / One Shot Learning / Few Shot Learning
1st Place Solution for O2O Coupon Usage Forecast