- Zhejiang University
- Zhejiang, China
- lhmd.top
- https://orcid.org/0009-0006-9088-1471
- @wjwang2003
Highlights
- Pro
Stars
[arXiv'25]🌈 Unseen 3D Geometry Reasoning from a Single Image.
🖨️ [arXiv'23] Official PyTorch Implementation of MatchNeRF
VR-based Robot Teleoperation and Data Collection System for Unitree G1
MMaDA - Open-Sourced Multimodal Large Diffusion Language Models
PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers
Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting
StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation
🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal reinforcement learning, and text-only reinforcement learning—to …
Mutual Information Neural Estimation in PyTorch
Implementation for Variational Information Bottleneck for Effective Low-resource Fine-tuning, ICLR 2021
[ICML 2025] Rethinking Latent Redundancy in Behavior Cloning: An Information Bottleneck Approach for Robot Manipulation
A 3DGS framework for omni urban scene reconstruction and simulation.
🌊 [ECCV'24 Oral] MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images
[arXiv 2025] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence
HoliTom: Holistic Token Merging for Fast Video Large Language Models
Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence
ZPressor: Bottleneck-Aware Compression for Scalable Feed-Forward 3DGS
Official Code Release for [SIGGRAPH 2025] RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
[NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up long-context LLM inference, computes attention approximately with dynamic sparsity, which reduces inference latency by up to 10x for pre-filli…
PyTorch implementation of Deep Variational Information Bottleneck
Provides .bst files for the NeurIPS LaTeX template
Official Implementation of arXiv 2025 paper 'FreeSplat++: Generalizable 3D Gaussian Splatting for Efficient Indoor Scene Reconstruction'
[ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images
Official implementation of NeurIPS 2024 paper: "FreeSplat: Generalizable 3D Gaussian Splatting Towards Free-View Synthesis of Indoor Scenes"
[ICCV 2025] Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction