Bohao Li (Bohao-Lee)

🎯 Focusing

Chase myself

39 followers · 54 following

Shenzhen
https://bohao-lee.github.io/

Stars

facebookresearch / paco

This repo contains documentation and code needed to use PACO dataset: data loaders and training and evaluation scripts for objects, parts, and attributes prediction models, query evaluation scripts…

Python · 283 stars · 13 forks · Updated Feb 12, 2024

mit-han-lab / radial-attention

Radial Attention Official Implementation

Python · 132 stars · 5 forks · Updated Jun 26, 2025

csuhan / Tar

Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations

JavaScript · 54 stars · 1 fork · Updated Jun 26, 2025

RoboTwin-Platform / RoboTwin

RoboTwin 2.0 Official Repo

Python · 1,141 stars · 123 forks · Updated Jun 27, 2025

FreedomIntelligence / ShareGPT-4o-Image

125 stars · Updated Jun 24, 2025

facebookresearch / vggt

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python · 9,131 stars · 881 forks · Updated Jun 24, 2025

microsoft / UniGenX

Python · 16 stars · 1 fork · Updated Jun 10, 2025

TencentARC / GRPO-CARE

Python · 43 stars · Updated Jun 23, 2025

AntResearchNLP / ViLaSR

Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing

Python · 34 stars · 1 fork · Updated Jun 25, 2025

zhanyong-wan / dongbei

A programming language based on the Northeastern Chinese (Dongbei) dialect

Python · 2,501 stars · 140 forks · Updated Jun 22, 2025

AV-Reasoner / AV-Reasoner

Python · 13 stars · Updated Jun 16, 2025

TencentARC / TokLIP

TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation

Python · 87 stars · 1 fork · Updated Jun 5, 2025

liyz15 / Aligning-Latent-Spaces-with-Flow-Priors

Python · 30 stars · 2 forks · Updated Jun 6, 2025

PKU-YuanGroup / UniWorld-V1

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Python · 587 stars · 20 forks · Updated Jun 26, 2025

WZDTHU / NiT

Native-resolution diffusion Transformer

Python · 252 stars · 16 forks · Updated Jun 4, 2025

XiaomiMiMo / MiMo-VL

414 stars · 20 forks · Updated Jun 5, 2025

wusize / OpenUni

Python · 117 stars · 2 forks · Updated Jun 27, 2025

PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback

Jupyter Notebook · 4,087 stars · 498 forks · Updated May 28, 2025

Paper2Poster / Paper2Poster

Open-source Multi-agent Poster Generation from Papers

Python · 2,194 stars · 124 forks · Updated Jun 17, 2025

TencentARC / Video-Holmes

Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?

Python · 52 stars · Updated Jun 3, 2025

Qinyu-Allen-Zhao / DiSA

Official Implementation of Diffusion Step Annealing (DiSA) in Autoregressive Image Generation

Jupyter Notebook · 137 stars · Updated May 27, 2025

ML-GSAI / LLaDA-V

Python · 161 stars · 6 forks · Updated Jun 25, 2025

Hhhhhhao / continuous_tokenizer

Python · 204 stars · 3 forks · Updated May 29, 2025

Gen-Verse / MMaDA

MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Python · 1,140 stars · 53 forks · Updated Jun 13, 2025

deepseek-ai / Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Python · 17,407 stars · 2,237 forks · Updated Feb 1, 2025

ByteDance-Seed / Bagel

Open-source unified multimodal model

Python · 4,360 stars · 363 forks · Updated Jun 17, 2025

TapXWorld / ChinaTextbook

PDF textbooks for all primary, middle, and high school levels, as well as university.

Roff · 41,446 stars · 9,191 forks · Updated May 18, 2025

JiuhaiChen / BLIP3o

Python · 1,236 stars · 46 forks · Updated Jun 22, 2025

ByteDance-Seed / Seed1.5-VL

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook · 1,265 stars · 47 forks · Updated Jun 14, 2025

huggingface / nanoVLM

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python · 3,565 stars · 313 forks · Updated Jun 25, 2025
