Knowledge-enhanced Multi-modal Model

This project builds upon the LAVIS library.

Installation

cd LAVIS
pip install -e .
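
After the editable install, a quick sanity check is to confirm that Python can locate the `lavis` package. The snippet below is a minimal sketch using only the standard library; the helper name `is_importable` is an illustrative assumption and is not part of this repository.

```python
import importlib.util


def is_importable(package: str) -> bool:
    """Return True if `package` can be located on the current Python path."""
    # find_spec returns None when a top-level package cannot be found.
    return importlib.util.find_spec(package) is not None


# Hypothetical check: reports whether the editable install is visible.
print("lavis importable:", is_importable("lavis"))
```

Running it from a directory other than the repository root gives a more meaningful result, since Python can otherwise pick up the local `lavis` folder directly.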

Improve External Knowledge Utilization

The main idea is to make better use of external knowledge when instructing the multimodal large language model (MLLM), so that it generates more accurate output. For details, see lavis\models\blip2_models\blip2_vicuna_instruct_okvqa.py
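
To illustrate the general idea of feeding external knowledge to an instruction-following model, here is a minimal sketch that prepends retrieved passages to a question. It is not the method implemented in this repository: the function name `build_knowledge_prompt`, the prompt template, and the example passages are all assumptions made for illustration. The actual logic lives in the model file named above.

```python
from typing import List


def build_knowledge_prompt(question: str, passages: List[str], max_passages: int = 3) -> str:
    """Prepend knowledge passages to a VQA question (hypothetical template)."""
    # Drop empty passages and keep at most `max_passages` of the rest.
    selected = [p.strip() for p in passages if p.strip()][:max_passages]
    lines = [f"Knowledge {i}: {p}" for i, p in enumerate(selected, start=1)]
    lines.append(f"Question: {question.strip()}")
    lines.append("Short answer:")
    return "\n".join(lines)


# Illustrative inputs only; these are not taken from the OK-VQA dataset.
prompt = build_knowledge_prompt(
    "What sport is this equipment used for?",
    ["A racket is used in tennis, badminton, and squash.", ""],
)
print(prompt)
```

In a setup like this, the resulting string would be passed as the text instruction alongside the image, and the cap on passages keeps the prompt within the language model's context budget.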

Quick Start

bash run_scripts/blip2/train/train_okvqa.sh

Notes

This release was prepared in a rush because of the authors' current heavy workload. We will add follow-up patches to make the code more readable and to ensure reproducibility. (Of course, the pace depends on how many people are interested in using this framework.)

License

BSD 3-Clause License

