TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation
Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]
Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?
Qwen2.5-Omni: an end-to-end multimodal model by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, and of generating speech in real time.
DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video Generation with Language Models
AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction
A post-training method to enhance CLIP's fine-grained visual representations with generative models.
[arXiv'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing
✨First Open-Source R1-like Video-LLM [2025/02/18]
SALMONN family: A suite of advanced multi-modal LLMs
VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o-level performance.
Building Open-Ended Embodied Agents with Internet-Scale Knowledge
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
A suite of image and video neural tokenizers
Latent Motion Token as the Bridging Language for Robot Manipulation
📖 A repository for organizing papers, code, and other resources related to unified multimodal models.
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
[ICLR 2025] MLLM for On-Demand Spatial-Temporal Understanding at Arbitrary Resolution