8000 QuLiao1117 (Liao Qu) · GitHub

More Web Proxy on the site http://driver.im/

QuLiao1117

Follow

Liao Qu QuLiao1117

Follow

CMU -> ByteDance | Multimodal Understanding & Generation

24 followers · 45 following

Carnegie Mellon University
Pittsburgh, PA
https://scholar.google.com/citations?user=IDbqDdEAAAAJ&hl=zh-CN

Achievements

Achievements

QuLiao1117/README.md

Hi there 👋

🔭 I’m currently interested in MLLM and visual generation.
⚡ I graduated from Carnegie Mellon University and am currently an MLE at ByteDance.

Pinned Loading

ByteFlow-AI/TokenFlow ByteFlow-AI/TokenFlow Public

[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".

Python 336 1
ByteFlow-AI/DetailFlow ByteFlow-AI/DetailFlow Public

🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"

111 7
ruohaoguo/avis ruohaoguo/avis Public

[CVPR 2025] 🔥 Official impl. of "Audio-Visual Instance Segmentation".

Python 24 3
ruohaoguo/ovavss ruohaoguo/ovavss Public

Official Implementation of "Open-Vocabulary Audio-Visual Semantic Segmentation" [ACM MM 2024 Oral].

Python 29 3
bytedance/AvatarVerse bytedance/AvatarVerse Public

code repo for the paper "AvatarVerse: High-quality & Stable 3D Avatar Creation from Text and Pose" (AAAI2024)

Python 60 7
easton-cau/SOTR easton-cau/SOTR Public

SOTR: Segmenting Objects with Transformers

Python 193 33

0