8000 GitHub - arya-domain/HyperCap
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

arya-domain/HyperCap

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

HyperCap

HyperCap: A Hyperspectral Imaging Dataset with Pixel-Level Captions and Benchmarking

HyperCap: A Benchmark Dataset for Hyperspectral Image Captioning


Overview

This repository contains the codebase and model configurations for HyperCap

Note: Complete Dataset has been provided for review. Complete training code for captioning and evaluation pipeline will be released in future under an MIT license.


Dataset

Complete Captioning Dataset is present along with the respective HSI data. Please refer to the Datasets/ folder.


Vision Encoders

We explore and support multiple 3D vision backbones that serve as the feature extractors for our VL tasks:


Text Encoders

HyperCap supports the integration of state-of-the-art pretrained language models:


VL Classification - Training Code

To train the model for Vision-Language Classification:

python train.py

Note: Complete Dataset Released. Full training support will be enabled after infuture.


Captioning - Training Code

For the Captioning task, we build upon and adapt the codebase of prior works. As of now, we have tutorial of the training code and visual encoder integration.

The current setup includes:

  • The adapted vision backbone for captioning is located in the captioning/ directory.
  • The code supports finetuning BLIP.
  • To train and understand how to integrate the Vision Encoder on BLIP Check - Tutorial_Captioning_BLIP.py

Supported Vision-Language Captioning Models:

Captioning - Evaluation

We utilized the Microsoft COCO Caption Evaluation for evaluating.


Code and Dataset

  • Complete Dataset in Datasets/ Folder
  • Tutorial captioning code and adapted vision model released (captioning/vision_model.py and Tutorial_Captioning_BLIP.py)
  • Full codebase, pretrained checkpoints, and complete dataset to be released upon acceptance of the paper

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •  

Languages

0