8000 beichenzbc (Beichen Zhang) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View beichenzbc's full-sized avatar

Block or report beichenzbc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official implementation of our ICML 2024 paper "UP2ME: Univariate Pre-training to Multivariate Fine-tuning as a General-purpose Framework for Multivariate Time Series Analysis"

Python 29 Updated May 12, 2025

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 653 8 Updated May 14, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,016 1,385 Updated May 14, 2025

MM-IFEngine: Towards Multimodal Instruction Following

Python 84 Updated Apr 26, 2025

Code for "A Sober Look at Progress in Language Model Reasoning" paper

Python 45 Updated May 12, 2025

GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography

53 1 Updated Apr 10, 2025

Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance

Python 63 1 Updated May 12, 2025

RelightVid: Temporal-Consistent Diffusion Model for Video Relighting

Python 57 2 Updated Apr 2, 2025

Official implementation of UnifiedReward & UnifiedReward-Think

Python 348 8 Updated May 14, 2025

Official repository of ’Visual-RFT: Visual Reinforcement Fine-Tuning’

Python 1,655 77 Updated Apr 18, 2025
Python 5 Updated Feb 28, 2025

Official Repository of paper OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Python 147 1 Updated Mar 2, 2025

Customize your arXiv recommendation every day.

Python 102 15 Updated Mar 27, 2025

Light-A-Video: Training-free Video Relighting via Progressive Light Fusion

Python 415 27 Updated Apr 25, 2025

Witness the aha moment of VLM with less than $3.

Python 3,657 286 Updated Mar 1, 2025
Python 223 20 Updated Mar 18, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,552 91 Updated Mar 18, 2025

[ICML 2025 Spotlight] An official implementation of VideoRoPE: What Makes for Good Video Rotary Position Embedding?

Python 143 3 Updated Apr 24, 2025

GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models

Python 266 6 Updated Apr 11, 2025

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

669 44 Updated Mar 10, 2025

official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"

Python 35 3 Updated Jan 21, 2025

[CVPR 2025]Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction

Python 105 5 Updated Mar 23, 2025

Single Image Dehazing Using Scene Depth Ordering

MATLAB 3 Updated Dec 13, 2024
Python 527 60 Updated Jan 2, 2025

This is the official code for the paper Tailor3D

Python 183 8 Updated Jul 9, 2024

[ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"

Python 68 1 Updated Dec 27, 2024

Official implementation of X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models

153 1 Updated Dec 3, 2024
Next
0