hnhbcc / Starred · GitHub

Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future

185 7 Updated Apr 3, 2025

Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Python 960 36 Updated Jan 21, 2025

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 8,181 825 Updated Aug 12, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance

Python 8,275 625 Updated May 29, 2025

Code for "Multi-view Reconstruction via SfM-guided Monocular Depth Estimation". CVPR 2025 (Oral Presentation)

Python 282 22 Updated Apr 29, 2025

The official repository for the RealSyn dataset

34 2 Updated Apr 28, 2025

CUHK-SYSU-TBPS&&PRW-TBPS

3 Updated Mar 23, 2022

Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models

Python 39 2 Updated May 27, 2025

[OpenPAR] An open-source framework for Pedestrian Attribute Recognition, based on PyTorch

Python 127 17 Updated Jun 3, 2025

A collection of awesome works on reasoning models like O1/R1 in the visual domain

28 1 Updated Jun 6, 2025

A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)

Python 50 3 Updated May 7, 2024

A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull request…

280 21 Updated Jun 6, 2025

A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating Object Detection with Flexible Expressions" (NeurIPS 2023).

Python 125 7 Updated Mar 20, 2024

The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models".

Python 266 18 Updated May 28, 2025

A curated list of 3D Vision papers relating to the Robotics domain in the era of large models (i.e., LLMs/VLMs), inspired by awesome-computer-vision, including papers, code, and related websites

708 36 Updated Nov 4, 2024
Python 23 4 Updated Feb 14, 2025

High-Resolution 3D Human Digitization from A Single Image.

Python 9,691 1,474 Updated Aug 19, 2024

The official code for "ImFace: A Nonlinear 3D Morphable Face Model with Implicit Neural Representations" presented at CVPR 2022, along with its extended version ImFace++.

Python 174 9 Updated Jul 10, 2024

3D version of the MNIST database of handwritten digits

Jupyter Notebook 14 13 Updated Nov 4, 2016

[MICCAI 2024] Easy diffusion models (optionally with segmentation guidance) for medical images and beyond.

Python 164 11 Updated Dec 2, 2024
Python 19 1 Updated Apr 14, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Python 2,797 296 Updated Mar 10, 2025

Automatic segmentation of CBCT scans with a 3D Unet

Python 45 10 Updated Jul 29, 2022
Python 12 2 Updated Oct 1, 2024

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,344 2,241 Updated Feb 1, 2025

Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)

1,833 160 Updated Jan 4, 2025

A collection of resources on applications of multi-modal learning in medical imaging.

753 68 Updated Jun 5, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 10,819 777 Updated May 15, 2025

GPT Meet Zotero.

TypeScript 6,274 263 Updated Mar 11, 2025