Visual-Text Question Answering

In this challenge, a model must answer a question about a given image-text pair. Diverse information, multi-step reasoning across media, and open-ended answers make this task more challenging than those posed by existing datasets. The aim of the challenge is to develop and benchmark models capable of multimedia entity alignment, multi-step reasoning, and open-ended answer generation.

Data

Coming soon.

LLLogen/vsd-challenge.github.io
