The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 50,068 5,884 Updated Sep 18, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 47,287 7,406 Updated May 14, 2025

OpenRobotLab / gs-lrm-unofficial

Python 29 Updated Mar 14, 2025

NarcissusEx / GuardSplat

[CVPR 2025] GuardSplat: Efficient and Robust Watermarking for 3D Gaussian Splatting

Python 21 2 Updated Apr 16, 2025

Stability-AI / stable-virtual-camera

Stable Virtual Camera: Generative View Synthesis with Diffusion Models

Python 1,250 76 Updated Apr 26, 2025

ChrisDong-THU / GaussianToken

Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting

Python 83 2 Updated Apr 3, 2025

kuai-lab / cvpr25_3D-GSW

Python 4 1 Updated Apr 1, 2025

NVlabs / FoundationStereo

[CVPR 2025 Best Paper Nomination] FoundationStereo: Zero-Shot Stereo Matching

Python 1,459 81 Updated May 11, 2025

VAST-AI-Research / TripoSG

TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models

Python 1,145 104 Updated Apr 18, 2025

hurunyi / VideoShield

[ICLR 2025] VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking (Official Implementation)

Python 35 1 Updated Apr 8, 2025

deepbeepmeep / Wan2GP

Forked from Wan-Video/Wan2.1

Wan 2.1 for the GPU Poor

Python 834 90 Updated May 7, 2025

kevinhuangxf

Highlights

Lists (32)

3D

Acoustic

Adversarial training

Autonomous

Avatar

Cemera

Depth Estimation

Diffusion

Distributed System

Foundation Model

google-research

HPC

Image quality

Image Restoration

Invertible Neural Network

LLM

Mesh

MultiModality

NeRF

Neural Rendering

PointCloud

Robotic

SLAM

Steganography

Stereo Vision

Text-To-Video

Transformer

Vein Biometric

Visual Representation

VLM

VTON

Watermarking

Stars