8000 alvin528 (Alvin) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View alvin528's full-sized avatar
😶
I know nothing.
😶
I know nothing.
  • Tsinghua University
  • Beijing

Block or report alvin528

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

[CVPR' 2025'] Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh

Python 160 8 Updated May 5, 2025

[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions

Python 337 13 Updated May 21, 2025

PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes

19 Updated May 10, 2025

A curated list of awesome 3D scene generation papers. (arXiv 2505.05474)

355 12 Updated May 24, 2025
Python 20 1 Updated Mar 25, 2025

[NeurIPS 2024]Repos for "Visualization-of-Thought" dataset, construction code and evaluation.

Python 28 1 Updated Oct 23, 2024

Official implementation of CVPR25 paper "Decompositional Neural Scene Reconstruction with Generative Diffusion Prior"

Python 68 8 Updated Mar 27, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,724 1,442 Updated May 22, 2025

🔥GrabS in PyTorch (ICLR 2025 Spotlight)

Python 8 Updated Apr 30, 2025

PE3R: Perception-Efficient 3D Reconstruction. Take 2 - 3 photos with your phone, upload them, wait a few minutes, and then start exploring your 3D world via text!

Python 357 13 Updated Apr 1, 2025

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,377 54 Updated Apr 18, 2025

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 3,642 361 Updated Apr 27, 2025

[Lumina Embodied AI Community] 具身智能技术指南 Embodied-AI-Guide

5,434 353 Updated May 19, 2025

Code for "Multi-view Reconstruction via SfM-guided Monocular Depth Estimation". CVPR 2025 (Oral Presentation)

Python 274 23 Updated Apr 29, 2025

[CVPR 2025 Best Paper Nomination] FoundationStereo: Zero-Shot Stereo Matching

Python 1,572 92 Updated May 17, 2025

Awesome RL Reasoning Recipes ("Triple R")

578 31 Updated May 27, 2025

Calculating the actual value of your job beyond just salary

TypeScript 1,486 81 Updated May 25, 2025

[ICLR 2025] EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing

Python 14 Updated Apr 1, 2025

BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence

Python 189 3 Updated Mar 27, 2025

Code Release for CVPR (2025), "GaussianUDF: Inferring Unsigned Distance Functions through 3D Gaussian Splatting"

23 Updated Mar 25, 2025

Implementation of the project: SceneSplat - Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining

27 Updated Mar 20, 2025

[CVPR'2025] MonoInstance: Enhancing Monocular Priors via Multi-view Instance Alignment for Neural Rendering and Reconstruction

11 Updated Apr 25, 2025

Towards a Training Free Approach for 3D Scene Editing

Python 1 Updated Apr 11, 2025

SpatialLM: Large Language Model for Spatial Understanding

Python 3,205 250 Updated Mar 28, 2025

MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, realistic, and adaptive scene generation for applications in…

Python 126 5 Updated May 5, 2025

[CVPR 2025 Best Paper Award Candidate] VGGT: Visual Geometry Grounded Transformer

Python 7,044 715 Updated May 22, 2025

[CVPR 2025] Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

Python 1,071 53 Updated May 7, 2025

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 758 46 Updated May 14, 2025

[CVPR 2025] MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation

Python 661 50 Updated May 27, 2025
Next
0