liusongxiang (Songxiang Liu) / Starred · GitHub
Starred repositories

verl: Volcano Engine Reinforcement Learning for LLMs

Python 10,062 1,656 Updated Jun 27, 2025
Python 555 23 Updated Jun 23, 2025

The development and future prospects of multimodal reasoning models.

399 16 Updated Jun 13, 2025

A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models

Python 57 3 Updated Feb 25, 2025

The repository for IEEE CVPR 2023 (A Light Weight Model for Active Speaker Detection)

Python 143 16 Updated Mar 23, 2025

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 3,878 260 Updated Jun 21, 2025

Let's make video diffusion practical!

Python 14,710 1,322 Updated Jun 27, 2025

Programmer's guide to cooking at home (Simplified Chinese only).

Dockerfile 90,268 10,295 Updated Jun 25, 2025
Python 6 1 Updated Oct 2, 2024

Official repo and evaluation implementation of VSI-Bench

Python 524 28 Updated Feb 28, 2025

[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,345 72 Updated Jun 24, 2025

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,547 60 Updated Jun 26, 2025

SpatialLM: Training Large Language Models for Structured Indoor Modeling

Python 3,421 256 Updated Jun 24, 2025

[ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"

Python 162 2 Updated Jun 26, 2025

The demo page for ALMTokenizer

Python 51 3 Updated Apr 14, 2025

Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]

Python 585 29 Updated May 28, 2025

[CVPR 2025] OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?

Python 66 3 Updated Apr 3, 2025

VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning

Python 153 4 Updated Jun 9, 2025

One for All Modalities Evaluation Toolkit - including text, image, video, audio tasks.

Python 2,682 318 Updated Jun 27, 2025

A feature-rich command-line audio/video downloader

Python 116,665 9,223 Updated Jun 26, 2025

Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Python 229 8 Updated May 17, 2025

Color palette recommender for academic journals

R 423 29 Updated Jan 27, 2025

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

668 20 Updated Jun 21, 2025

Awesome Reasoning LLM Tutorial/Survey/Guide

Python 1,799 128 Updated Jun 16, 2025

RWKV-SpeechChat is a real-time dialogue script based on a frozen 3B RWKV model with trained adapters and initial states. Various trained weights can be applied to perform a range of audio tasks, in…

Python 27 1 Updated Jan 1, 2025

Fully open reproduction of DeepSeek-R1

Python 24,903 2,312 Updated Jun 26, 2025

Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen

403 22 Updated Mar 8, 2025

VOCANO: A note transcription framework for singing voice in polyphonic music

Python 68 6 Updated Aug 9, 2021

A curated list of audio-visual learning methods and datasets.

263 18 Updated Dec 3, 2024