- Seoul (UTC +09:00)
- www.linkedin.com/in/soobinsuh
Stars
Official PyTorch implementation of QUEEN: QUantized Efficient ENcoding of Dynamic Gaussians for Streaming Free-viewpoint Videos (NeurIPS 2024)
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
Original reference implementation of "VRSplat: Fast and Robust Gaussian Splatting for Virtual Reality"
A Code Release for Mip-NeRF 360, Ref-NeRF, and RawNeRF
An Open-Ended Embodied Agent with Large Language Models
Everything you need to build a state-of-the-art foundation multimodal desktop agent, end-to-end.
Implementation of Imagen, Google's Text-to-Image Neural Network, in PyTorch
PyTorch code and models for the DINOv2 self-supervised learning method.
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
[CVPR2024 Oral] MultiPly: Reconstruction of Multiple People from Monocular Video in the Wild
[WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think
Kalman Filter book using Jupyter Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters, extended Kalman filters, unscented Kalman filters, particle filte…
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
[CVPR'22] ICON: Implicit Clothed humans Obtained from Normals
Open-Sora: Democratizing Efficient Video Production for All
Code of ICCV 2023 paper Cross-modal Orthogonal High-rank Augmentation for RGB-Event Transformer-trackers
High-resolution models for human tasks.
[CVPR2023] Blind Video Deflickering by Neural Filtering with a Flawed Atlas
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Human Performance Capture from Monocular Video in the Wild (3DV2021)
[ICLR 2022] Official implementation of the paper "DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR"
End-to-End Object Detection with Transformers
[CVPR 2025] Multiple Object Tracking as ID Prediction
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
Official PyTorch implementation of SparseTrack
[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box