8000 ZiYang-xie (Leo) / Starred · GitHub

More Web Proxy on the site http://driver.im/

ZiYang-xie

Follow

🦁

Work Like a Lion

Leo ZiYang-xie

🦁

Work Like a Lion

Follow

MS in CS @ UIUC | BS in CS @ Fudan Univerisity | Computer Vision Researcher

183 followers · 159 following

University of Illinois Urbana Champion
https://ziyangxie.site/

Achievements

Achievements

Highlights

Pro

Organizations

Lists (15)

Sort

3D Generation

3D Understanding

AGI

Audio Synthesis

Depth Estimation

Foundation Model

Generative Control

Image Generation

Infra

Inverse Rendering

Reconstruction

Robotics

utils

Video Generation

15 repositories

Video Tools

Stars

AvaLovelace1 / LegoGPT

Official repository for LegoGPT, the first approach for generating physically stable LEGO brick models from text prompts.

Python 1,057 53 Updated May 17, 2025

ByteDance-Seed / Seed1.5-VL

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 893 22 Updated May 15, 2025

ekonwang / VisuoThink

[Arxiv Paper 2504.09130]: VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search

Python 17 1 Updated Apr 27, 2025

Tencent / FlashVDM

8000 Unleashing Vecset Diffusion Model for Fast Shape Generation within 1 Second.

Python 218 7 Updated May 14, 2025

krillinai / KrillinAI

A video translation and dubbing tool powered by LLMs, offering professional-grade translations and one-click full-process deployment. It can generate content optimized for platforms like YouTube，T…

Go 7,281 546 Updated May 14, 2025

River-Zhang / ICEdit

Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persistence! Official ComfyUI workflow release! Only 4GB VRAM is enou…

Python 1,349 78 Updated May 16, 2025

chungmin99 / pyroki

A Modular Toolkit for Robot Kinematic Optimization

Python 478 26 Updated May 18, 2025

yzhao062 / cs-paper-checklist

A final sanity checklist to help your CS paper get accepted, not desk rejected.

926 105 Updated May 7, 2025

TencentARC / GeometryCrafter

GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors

Python 277 8 Updated Apr 28, 2025

kornia / kornia

🐍 Geometric Computer Vision Library for Spatial AI

Python 10,471 1,018 Updated May 13, 2025

NVIDIA / cuda-python

CUDA Python: Performance meets Productivity

Python 2,662 162 Updated May 18, 2025

stepfun-ai / Step1X-Edit

A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.

Python 1,245 56 Updated May 13, 2025

skyzh / tiny-llm

(🚧 WIP) a course of LLM inference serving on Apple Silicon for systems engineers.

Python 1,891 73 Updated May 18, 2025

Vector-Wangel / XLeRobot

XLeRobot: Practical Household Dual-Arm Mobile Robot for ~$660

463 41 Updated May 16, 2025

TheRobotStudio / SO-ARM100

Standard Open Arm 100

CMake 2,093 147 Updated May 12, 2025

xiahongchi / DRAWER

Jupyter Notebook 52 5 Updated Apr 22, 2025

iSEE-Laboratory / PanoDecouple

(CVPR2025 Highlight) Official repository of paper "Panorama Generation From NFoV Image Done Right"

9 1 Updated Apr 4, 2025

SandAI-org / MAGI-1

MAGI-1: Autoregressive Video Generation at Scale

Python 3,036 164 Updated May 14, 2025

ZiYang-xie / WorldGen

🌍 WorldGen - Generate Any 3D Scene in Seconds

Python 482 19 Updated May 10, 2025

astral-sh / uv

An extremely fast Python package and project manager, written in Rust.

Rust 54,674 1,531 Updated May 18, 2025

sunset1995 / py360convert

Python implementation of convertion between equirectangular, cubemap and perspective. (equirect2cube, cube2equirect, equirect2perspec)

Python 505 102 Updated May 12, 2025

lllyasviel / FramePack

Lets make video diffusion practical!

Python 13,252 1,135 Updated May 4, 2025

openai / codex

Lightweight coding agent that runs in your terminal

TypeScript 24,300 2,426 Updated May 18, 2025

eliliu2233 / occ-flow

[CoRL2024] Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction

Python 107 3 Updated Apr 20, 2025

PKU-VCL-3DV / SLAM3R

[CVPR 2025 Highlight] Real-time dense scene reconstruction with SLAM3R

Python 725 31 Updated Apr 30, 2025

lpiccinelli-eth / UniK3D

[CVPR 2025] UniK3D: Universal Camera Monocular 3D Estimation

Python 475 30 Updated Mar 24, 2025

TCXX / ObjaversePlusPlus

Repo for Objaverse++, Curated 3D Object Dataset with Quality Annotations

Python 73 1 Updated Apr 23, 2025

Yaofang-Liu / Pusa-VidGen

Pusa: Thousands Timesteps Video Diffusion Model

Python 166 4 Updated Apr 22, 2025

KwaiVGI / ReCamMaster

[ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,175 46 Updated Apr 20, 2025

stopaimme / GI-GS

[ICLR 2025] GI-GS: Global Illumination Decomposition on Gaussian Splatting for Inverse Rendering

Python 62 3 Updated Apr 8, 2025

0