-
SSP Public
Forked from AntXinyuan/SSPSemantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection
UpdatedJun 13, 2025 -
SceneCompleter Public
Forked from chen-wl20/SceneCompleterSceneCompleter: Dense 3D Scene Completion for Generative Novel View Synthesis
Apache License 2.0 UpdatedJun 13, 2025 -
GenWorld Public
Forked from chen-wl20/GenWorldGenWorld: Towards Detecting AI-generated Real-world Simulation Videos
Apache License 2.0 UpdatedJun 13, 2025 -
CDPruner Public
Forked from Theia-4869/CDPrunerOfficial code for paper: Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs.
Python Apache License 2.0 UpdatedJun 12, 2025 -
-
OmniBench Public
Forked from antgroup/OmniBench[ICML 2025 Oral] This is the official repository of the paper "What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities"
Python UpdatedJun 12, 2025 -
ReSim Public
Forked from OpenDriveLab/ReSimReSim: Reliable World Simulation for Autonomous Driving
Apache License 2.0 UpdatedJun 12, 2025 -
Less3Depend Public
Forked from ou524u/Less3Depend[arxiv] PyTorch implementation of "The Less You Depend, The More You Learn: Synthesizing Novel Views from Sparse, Unposed Images without Any 3D Knowledge".
Python UpdatedJun 12, 2025 -
ViLaSR Public
Forked from AntResearchNLP/ViLaSRReinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing
UpdatedJun 12, 2025 -
OctoNav-R1 Public
Forked from buaa-colalab/OctoNav-R1Code for OctoNav-R1
MIT License UpdatedJun 12, 2025 -
Vision-Matters Public
Forked from YutingLi0606/Vision-Matters(ArXiv25) Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning
Python UpdatedJun 12, 2025 -
AnimateAnyMesh Public
Forked from JarrentWu1031/AnimateAnyMeshOfficial code for AnimateAnyMesh: A Feed-Forward 4D Foundation Model for Text-Driven Universal Mesh Animation
UpdatedJun 12, 2025 -
Cosmos-Drive-Dreams Public
Forked from nv-tlabs/Cosmos-Drive-DreamsCosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models
Python UpdatedJun 12, 2025 -
PlayerOne Public
Forked from yuanpengtu/PlayerOnePlayerOne: Egocentric World Simulator
UpdatedJun 12, 2025 -
UniPre3D Public
Forked from wangzy22/UniPre3D[CVPR 2025] UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting
Python MIT License UpdatedJun 12, 2025 -
IntPhys2 Public
Forked from facebookresearch/IntPhys2This is the code repository for IntPhys 2, a video benchmark designed to evaluate the intuitive physics understanding of deep learning models.
Python Other UpdatedJun 12, 2025 -
PartPacker Public
Forked from NVlabs/PartPackerEfficient Part-level 3D Object Generation via Dual Volume Packing
Python Other UpdatedJun 12, 2025 -
ASVR Public
Forked from AlenjandroWang/ASVRAutoregressive Semantic Visual Reconstruction Helps VLMs Understand Better
Python UpdatedJun 11, 2025 -
MIL-Lab Public
Forked from mahmoodlab/MIL-LabStandardized initialization and loading of pretrained MIL models
UpdatedJun 11, 2025 -
DAWN Public
Forked from zhangye-zoe/DAWNDomain-adaptive Weakly Supervised Nuclei Segmentation via Cross-task Interaction, TCSVT.
UpdatedJun 11, 2025 -
FCIS Public
Forked from zhangye-zoe/FCISFour Color Theorem for Cell Instance Segmentation, ICML.
Python UpdatedJun 11, 2025 -
SSS Public
Forked from AIGeeksGroup/SSSSSS: Semi-Supervised SAM for Medical Imaging Segmentation
UpdatedJun 11, 2025 -
-
-
ALTA Public
Forked from DopamineLcy/ALTAOfficial code for Efficient medical vision-language alignment through adapting masked vision models (TMI 2025)
Python Other UpdatedJun 11, 2025 -
-
-
DsicoVLA Public
Forked from LunarShen/DsicoVLA[CVPR 2025 🔥] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval
Python MIT License UpdatedJun 11, 2025 -
StreamSplat Public
Forked from nickwzk/StreamSplatStreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams
UpdatedJun 11, 2025 -
4dgt.github.io Public
Forked from 4dgt/4dgt.github.io4DGT: Learning a 4D Gaussian Transformer Using Real-World Monocular Videos
HTML UpdatedJun 10, 2025