Stars
Official repo for From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models
MINT-1T: A one trillion token multimodal interleaved dataset.
Modular, scalable library to train ML models
UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, and inpainting.
[CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis
Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).
SPICE: A Synergistic, Precise, Iterative, and Customizable Image Editing Workflow
Code for Scaling Language-Free Visual Representation Learning (WebSSL)
Code for MetaMorph: Multimodal Understanding and Generation via Instruction Tuning
tulip-berkeley / open_clip
Forked from mlfoundations/open_clip. An open source implementation of CLIP (with TULIP support).
Official implementation of the Law of Vision Representation in MLLMs
Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"
Easy, no-frills PyTorch implementations of common abstractions for simple diffusion models.
Official repo and evaluation implementation of VSI-Bench
[ECCV 2024] Official Release of SILC: Improving vision language pretraining with self-distillation
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
PyTorch code and models for the DINOv2 self-supervised learning method.
[ECCV 2024] Code for VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models
Official repo for the paper: VCR: Visual Caption Restoration. See arxiv.org/pdf/2406.06462 for details.
[NeurIPS'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)
Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Enabling PyTorch on XLA Devices (e.g. Google TPU)
Large-scale text-video dataset. 10 million captioned short videos.
PyTorch code and models for V-JEPA self-supervised learning from video.