- Algomatic
- Japan (UTC +09:00)
- https://sites.google.com/view/yusukemikami
- in/yusukemikami
⭐ VLM
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA), built toward GPT-4V-level capabilities and beyond (see the inference sketch after this list).
[ICLR 2024] Fine-tuning LLaMA to follow instructions within 1 hour using only 1.2M parameters.
A state-of-the-art open visual language model | multimodal pre-trained model
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/sp…
LAVIS - A One-stop Library for Language-Vision Intelligence (see the usage sketch after this list).
Papers and resources on controllable generation with diffusion models, including ControlNet, DreamBooth, and IP-Adapter (a ControlNet sketch follows this list).
A library for advanced large language model reasoning
ChatBridge, an approach to learning a unified multimodal model to interpret, correlate, and reason about various modalities without relying on all combinations of paired data.
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
VisualGPT (CVPR 2022): GPT as a decoder for vision-language models.
Code and documentation to train Stanford's Alpaca models and generate the data.
Recent LLM-based computer vision and related work. Comments and contributions welcome!
Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"
Official code for VisProg (CVPR 2023 Best Paper!)
An open-source framework for training large multimodal models.
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!
VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks
Reading list for research topics in embodied vision
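The LLaVA entry at the top of this list ships its own training and serving code; as a minimal, hedged illustration of visual instruction following with a LLaVA-1.5 checkpoint, the sketch below assumes the Hugging Face transformers port (`llava-hf/llava-1.5-7b-hf`, `LlavaForConditionalGeneration`) rather than the original repository's CLI, and a local placeholder image `example.jpg`.

```python
import torch
from PIL import Image
from transformers import AutoProcessor, LlavaForConditionalGeneration

# Assumption: the Hugging Face port of LLaVA-1.5, not the original repo's scripts.
model_id = "llava-hf/llava-1.5-7b-hf"
processor = AutoProcessor.from_pretrained(model_id)
model = LlavaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

image = Image.open("example.jpg")  # placeholder path
prompt = "USER: <image>\nWhat is shown in this image? ASSISTANT:"

# The processor tokenizes the prompt and preprocesses the image together.
inputs = processor(text=prompt, images=image, return_tensors="pt").to(model.device, torch.float16)
output = model.generate(**inputs, max_new_tokens=100)
print(processor.decode(output[0], skip_special_tokens=True))
```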
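LAVIS pairs each model with matched preprocessors through a single `load_model_and_preprocess` entry point; the captioning sketch below follows the library's published example, with the image path as a placeholder.

```python
import torch
from PIL import Image
from lavis.models import load_model_and_preprocess

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Load a BLIP captioning model together with its matched image preprocessors.
model, vis_processors, _ = load_model_and_preprocess(
    name="blip_caption", model_type="base_coco", is_eval=True, device=device
)

raw_image = Image.open("example.jpg").convert("RGB")  # placeholder path
image = vis_processors["eval"](raw_image).unsqueeze(0).to(device)

# Generate a caption for the preprocessed image.
print(model.generate({"image": image}))
```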
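The controllable-generation reading list centers on ControlNet-style conditioning; as one concrete, hedged example of the technique (written against the diffusers library, not code from that repository), the sketch below conditions Stable Diffusion 1.5 on a Canny edge map derived from a placeholder reference image.

```python
import cv2
import numpy as np
import torch
from PIL import Image
from diffusers import ControlNetModel, StableDiffusionControlNetPipeline, UniPCMultistepScheduler

# Build a Canny edge map from a local reference image (placeholder path).
reference = np.array(Image.open("reference.jpg").convert("RGB"))
edges = cv2.Canny(reference, 100, 200)
canny_image = Image.fromarray(np.stack([edges] * 3, axis=-1))

# Stable Diffusion 1.5 conditioned on the edge map via a pretrained Canny ControlNet.
controlnet = ControlNetModel.from_pretrained("lllyasviel/sd-controlnet-canny", torch_dtype=torch.float16)
pipe = StableDiffusionControlNetPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", controlnet=controlnet, torch_dtype=torch.float16
)
pipe.scheduler = UniPCMultistepScheduler.from_config(pipe.scheduler.config)
pipe.enable_model_cpu_offload()

image = pipe("a photo of a cat", image=canny_image, num_inference_steps=20).images[0]
image.save("controlled.png")
```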