Highlights
- Pro
Lists (3)
Sort Name ascending (A-Z)
Stars
[ICLR 2025] OMG for material modeling in Gaussian Splatting
[ICLR 2025] Code for Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models
[WACV 2025] Code for Enhancing Vision-Language Few-Shot Adaptation with Negative Learning
📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).
[ICLR 2023] SQA3D for embodied scene understanding and reasoning
🔖 Curated list of video object segmentation (VOS) papers, datasets, and projects.
Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
Let there be clock in the beach - WACV 2022
[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and it's Applications
【CVPR2024】Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification
Code for Negative Yields Positive: Unified Dual-Path Adapter for Vision-Language Models
[CVPR 2024] Code for HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation
Collection of Summer 2026 tech internships!
A collection of full time roles in SWE, Quant, and PM for new grads.
Collection of papers on state-space models
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
医学影像数据集列表 『An Index for Medical Imaging Datasets』
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Ai edge toolbox,专门面向边端设备尤其是嵌入式RTOS平台,AI模型部署工具链,包括模型推理引擎和模型压缩工具
Hand Gesture Recognition using Deep Learning Neural Networks using YOLO algorithm