Stars
[ICCV 2025] Referring to any person or object given a natural language description. Code base for RexSeek and the HumanRef Benchmark
This repository provides the code and model checkpoints for the AIMv1 and AIMv2 research projects.
🏅 Collection of Kaggle Solutions and Ideas 🏅
UI for your AI. Open Source Tailwind components tailored for your GPT, generative AI, and LLM projects.
Easily compute CLIP embeddings and build a CLIP retrieval system with them
Easily fine-tune, evaluate and deploy Qwen3, DeepSeek-R1, Llama 4 or any open source LLM / VLM!
Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL
Most popular metrics used to evaluate object detection algorithms.
Transformer OCR for Indian Languages
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
🩺 A comprehensive project leveraging YOLOv8 and Faster R-CNN for detecting thoracic abnormalities in chest X-rays. Optimized for medical diagnostics with CBAM attention, achieving precision and rec…
Fit interpretable models. Explain blackbox machine learning.
We write your reusable computer vision tools. 💜
VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for …
A paper list of some recent Transformer-based CV works.
ADAG (Activity Detector and Alert Generator) aims to take real-time CCTV video as input, pass it to a CNN model built with transfer learning, and detect ‘Shoplifting’, ‘R…
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
Repo with resources to pass the AWS ML Specialty exam
Awesome LeetCode resources to learn Data Structures and Algorithms and prepare for Coding Interviews.
Tips and resources to prepare for Behavioral interviews.
Learn System Design concepts and prepare for interviews using free resources.
[ICCV 2019] "DeblurGAN-v2: Deblurring (Orders-of-Magnitude) Faster and Better" by Orest Kupyn, Tetiana Martyniuk, Junru Wu, Zhangyang Wang