Stars
Ray tracing and hybrid rasterization of Gaussian particles
Pytorch Implementation of rpautrat/SuperPoint
Efficient neural feature detector and descriptor
Superpoint Implemented in PyTorch: https://arxiv.org/abs/1712.07629
PyTorch pre-trained model for real-time interest point detection, description, and sparse tracking (https://arxiv.org/abs/1712.07629)
Visual localization made easy with hloc
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
Official repository for Splatt3R: Zero-shot Gaussian Splatting from Uncalibrated Image Pairs
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
[CVPR'25] 🌟🌟 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering
This repository use GPUs to expedite the point cloud evaluation process on DTUs, utilizing Python and CUDA.
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
[CVPR 23'] GeoMVSNet: Learning Multi-View Stereo with Geometry Perception
Efficient Edge-Preserving Multi-view Stereo Network for Depth Estimation, AAAI2023
[IEEE TIP'23] Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA
[ACM TOMM'24] Graph Pooling Inference Network for Text-based VQA
✨✨Latest Research on Multimodal Large Language Models on Scene-Text VQA Tasks
Official code of PatchmatchNet (CVPR 2021 Oral)
A fast python implementation of DTU MVS 2014 evaluation
Multiview matching with deep-learning and hand-crafted local features for COLMAP and other SfM software. Supports high-resolution formats and images with rotations. Both CLI and GUI are supported.
Code for "Detector-Free Structure from Motion", CVPR 2024
[IEEE TMM'25] Scene-Text Grounding for Text-Based Video Question Answering
📑 A list of awesome learning-based multi-view stereo papers