- Shatin, N.T., HKSAR
- https://lixin4ever.github.io/
- @lixin4ever
Stars
🌐 WebAgent for Information Seeking built by Tongyi Lab: WebWalker & WebDancer & WebSailor https://arxiv.org/pdf/2507.02592
WorldVLA: Towards Autoregressive Action World Model
MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.
Open-source coding LLM for software engineering tasks
PyTorch code and models for VJEPA2 self-supervised learning from video.
🔥🔥 First-ever hour-scale video understanding models
EOC-Bench, an innovative benchmark designed to systematically evaluate object-centric embodied cognition in dynamic egocentric scenarios.
VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning
The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning
Official code for paper "GRIT: Teaching MLLMs to Think with Images"
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI foley master: adds vivid, synchronized sound effects to silent videos 😝
Workshop: Build with Gemini
[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions
This repo contains evaluation code for the paper "Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency"
Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
Embodied Reasoning Question Answer (ERQA) Benchmark
Lightweight coding agent that runs in your terminal
[ICML 2025] Official repository for paper "Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation"
VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning
moojink / openvla-oft
Forked from openvla/openvla. Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success
Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search"
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
[ACL 2024] Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.