The PyTorch implementation of [1] (unofficial).
3D convolutional neural networks have achieved promising results for video tasks in computer vision, including video saliency prediction, which is explored in this paper. However, 3D convolution encodes visual representations only over a fixed local space-time window determined by its kernel size, while human attention is often drawn by relational visual features across different time steps. To overcome this limitation, we propose a novel Spatio-Temporal Self-Attention 3D Network (STSANet) for video saliency prediction, in which multiple Spatio-Temporal Self-Attention (STSA) modules are employed at different levels of the 3D convolutional backbone to directly capture long-range relations between spatio-temporal features of different time steps. In addition, we propose an Attentional Multi-Scale Fusion (AMSF) module to integrate multi-level features with the perception of context in semantic and spatio-temporal subspaces. Extensive experiments demonstrate the contributions of the key components of our method, and the results on the DHF1K, Hollywood-2, UCF, and DIEM benchmark datasets clearly show the superiority of the proposed model over state-of-the-art models.
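For orientation, here is a minimal sketch of the spatio-temporal self-attention idea described above: it flattens the T×H×W positions of a 3D backbone feature map into tokens and applies multi-head self-attention across them, so every position can attend to every other time step. This is an illustrative toy (class name, head count, and layer choices are placeholders), not the paper's exact STSA or AMSF design; see the code in this repo for the actual modules.

```python
import torch
import torch.nn as nn


class SpatioTemporalSelfAttention(nn.Module):
    """Toy self-attention over all spatio-temporal positions of a 5D feature map.

    Illustrative sketch only; the real STSA module in [1] differs in structure.
    """

    def __init__(self, channels: int, num_heads: int = 4):
        super().__init__()
        self.norm = nn.LayerNorm(channels)
        self.attn = nn.MultiheadAttention(channels, num_heads, batch_first=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (B, C, T, H, W) feature map from a 3D convolutional backbone
        b, c, t, h, w = x.shape
        tokens = x.flatten(2).transpose(1, 2)   # (B, T*H*W, C) token sequence
        y = self.norm(tokens)
        y, _ = self.attn(y, y, y)               # long-range relations across time steps
        tokens = tokens + y                     # residual connection
        return tokens.transpose(1, 2).reshape(b, c, t, h, w)


if __name__ == "__main__":
    feat = torch.randn(2, 64, 4, 14, 14)        # dummy backbone feature
    out = SpatioTemporalSelfAttention(64)(feat)
    print(out.shape)                            # torch.Size([2, 64, 4, 14, 14])
```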
You can download the datasets from here: dataset
To train the model, run:

```bash
$ bash run_train.sh
```
[1] Z. Wang et al., "Spatio-Temporal Self-Attention Network for Video Saliency Prediction," in IEEE Transactions on Multimedia, doi: 10.1109/TMM.2021.3139743.