10000 GitHub - BUAAw-ML/LAVIS: LAVIS - A One-stop Library for Language-Vision Intelligence
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

BUAAw-ML/LAVIS

 
 

Repository files navigation

Knowledge-enhanced Multi-modal Model

This project builds upon LAVIS library.

Installation

cd LAVIS
pip install -e .

Improve External Knowledge Utilization

The main idea focuses on making better use of external knowledge to instruct MLLM to output more accurate generation. The detail can refer to lavis\models\blip2_models\blip2_vicuna_instruct_okvqa.py

Quick Start

bash run_scripts/blip2/train/train_okvqa.sh

Notes

This publication version was made in a rush due to intensive workload that the author currently have. We will add follow-up patches to make codes more readible and ensure reproducibility. (of course, the speed depends on the number of people who are interested in using this framework.)

License

BSD 3-Clause License

About

LAVIS - A One-stop Library for Language-Vision Intelligence

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 98.7%
  • Other 1.3%
0