8000 beichenzbc (Beichen Zhang) / Starred · GitHub

More Web Proxy on the site http://driver.im/

beichenzbc

Follow

Beichen Zhang beichenzbc

Follow

Undergraduate in Shanghai Jiao Tong University

66 followers · 59 following

Achievements

Achievements

Stars

Thinklab-SJTU / UP2ME

Official implementation of our ICML 2024 paper "UP2ME: Univariate Pre-training to Multivariate Fine-tuning as a General-purpose Framework for Multivariate Time Series Analysis"

Python 29 Updated May 12, 2025

ByteDance-Seed / Seed1.5-VL

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 653 8 Updated May 14, 2025

pjlab-songcomposer / songcomposer

Python 213 13 Updated Nov 1, 2024

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,016 1,385 Updated May 14, 2025

SYuan03 / MM-IFEngine

MM-IFEngine: Towards Multimodal Instruction Following

Python 84 Updated Apr 26, 2025

bethgelab / sober-reasoning

Code for "A Sober Look at Progress in Language Model Reasoning" paper

Python 45 Updated May 12, 2025

3DTopia / GenDoP

GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography

53 1 Updated Apr 10, 2025

Bujiazi / HiFlow

Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance

Python 63 1 Updated May 12, 2025

Aleafy / RelightVid

RelightVid: Temporal-Consistent Diffusion Model for Video Relighting

Python 57 2 Updated Apr 2, 2025

Wiselnn570 / Wiselnn570.github.io

HTML 1 Updated Apr 9, 2025

CodeGoat24 / UnifiedReward

Official implementation of UnifiedReward & UnifiedReward-Think

Python 348 8 Updated May 14, 2025

Liuziyu77 / Visual-RFT

Official repository of ’Visual-RFT: Visual Reinforcement Fine-Tuning’

Python 1,655 77 Updated Apr 18, 2025

fengmingyang666 / RRTformer

Python 5 Updated Feb 28, 2025

PhoenixZ810 / OmniAlign-V

Official Repository of paper OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Python 147 1 Updated Mar 2, 2025

JoeLeelyf / customize-arxiv-daily

Customize your arXiv recommendation every day.

Python 102 15 Updated Mar 27, 2025

bcmi / Light-A-Video

Light-A-Video: Training-free Video Relighting via Progressive Light Fusion

Python 415 27 Updated Apr 25, 2025

Deep-Agent / R1-V

Witness the aha moment of VLM with less than $3.

Python 3,657 286 Updated Mar 1, 2025

LiuZH-19 / SongGen

Python 223 20 Updated Mar 18, 2025

PRIME-RL / PRIME

Scalable RL solution for advanced reasoning of language models

Python 1,552 91 Updated Mar 18, 2025

Wiselnn570 / VideoRoPE

[ICML 2025 Spotlight] An official implementation of VideoRoPE: What Makes for Good Video Rotary Position Embedding?

Python 143 3 Updated Apr 24, 2025

JianzeLi-114 / FluxSR

152 1 Updated May 13, 2025

Qi-Zhangyang / GPT4Scene

GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models

Python 266 6 Updated Apr 11, 2025

showlab / Awesome-GUI-Agent

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

669 44 Updated Mar 10, 2025

beichenzbc / BoostStep

official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"

Python 35 3 Updated Jan 21, 2025

Mark12Ding / Dispider

[CVPR 2025]Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction

Python 105 5 Updated Mar 23, 2025

LPengYang / Scene-Depth-Ordering

Single Image Dehazing Using Scene Depth Ordering

MATLAB 3 Updated Dec 13, 2024

lqtrung1998 / mwp_ReFT

Python 527 60 Updated Jan 2, 2025

Qi-Zhangyang / Tailor3D

This is the official code for the paper Tailor3D

Python 183 8 Updated Jul 9, 2024

wutong16 / FiVA

[ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"

Python 68 1 Updated Dec 27, 2024

SunzeY / X-Prompt

Official implementation of X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models

153 1 Updated Dec 3, 2024

0