8000 kevin-ssy (Shuyang (Kevin) Sun) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View kevin-ssy's full-sized avatar
  • Google DeepMind

Organizations

@torrvision

Block or report kevin-ssy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MAGI-1: Autoregressive Video Generation at Scale

Python 3,277 191 Updated Jun 4, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,255 69 Updated May 28, 2025

[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,330 67 Updated Apr 24, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 25,281 2,267 Updated Jun 15, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 51,105 7,444 Updated Jun 13, 2025

[NeurIPS 2024] Code release for "Segment Anything without Supervision"

Jupyter Notebook 473 25 Updated May 27, 2025

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,777 78 Updated Aug 15, 2024

Official Implementation for CVPR 2024 paper: CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor

108 3 Updated Jun 23, 2024

Your image is almost there!

Python 7,642 441 Updated Jul 26, 2024

[CVPR2024, Highlight] Official code for DragDiffusion

Python 1,220 93 Updated Jan 29, 2024

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 8,219 504 Updated May 18, 2025

A family of lightweight multimodal models.

Python 1,023 74 Updated Nov 18, 2024

A Multi-Modal Large Language Model with Retrieval-augmented In-context Learning capacity designed for generalisable and explainable end-to-end driving

Python 98 9 Updated Oct 7, 2024

A471 OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]

Python 1,300 47 Updated May 30, 2025

[ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.

Python 512 27 Updated Mar 7, 2024

Official Code for MotionCtrl [SIGGRAPH 2024]

Python 1,432 76 Updated Feb 19, 2025

IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks

Python 57 6 Updated Sep 26, 2024

[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"

Jupyter Notebook 1,150 107 Updated Aug 14, 2023

[ICLR 2024] Real-Fake: Effective Training Data Synthesis Through Distribution Matching

Python 79 3 Updated Dec 9, 2023

[arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"

Python 130 10 Updated Jul 31, 2024

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python 6,938 488 Updated Feb 7, 2025

😂😂😂Official Implementation for ICCV 2023 paper: OxfordTVG-HIC: Can Machine Make Humorous Captions from Images?

Python 9 1 Updated Feb 23, 2024

[ICLR 2024 Spotlight] Unified Human-Scene Interaction via Prompted Chain-of-Contacts

Python 216 11 Updated Apr 13, 2025

[ECCV 2024 Best Paper Candidate] PointLLM: Empowering Large Language Models to Understand Point Clouds

Python 824 41 Updated May 22, 2025

[NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP

Python 321 28 Updated Feb 5, 2024

a PyTorch re-implementation of ECCV 2022 paper based on Detectron2: k-means mask Transformer.

Python 75 10 Updated Jul 28, 2023

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 16,458 1,507 Updated Sep 5, 2024
Next
0