Semantic Segmentation for Remote Sensing
Updated Jun 29, 2025 - Python
SuperYOLO, accepted by TGRS
Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)
Multimodal Co-Attention Transformer for Survival Prediction in Gigapixel Whole Slide Images - ICCV 2021
This repository contains the official implementation code of the paper Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis, accepted at EMNLP 2021.
MIntRec: A New Dataset for Multimodal Intent Recognition (ACM MM 2022)
E2E-MFD-OOD
Code for selecting an action based on multimodal inputs; in this case, the inputs are voice and text.
Creating multimodal multitask models
Multimodal sentiment analysis using hierarchical fusion with context modeling
[CVAMD 2021] "End-to-End Learning of Fused Image and Non-Image Features for Improved Breast Cancer Classification from MRI"
Few-shot malware classification framework using fused features from static and dynamic analysis
Multimodal object tracking and scene analytics for highly actionable, real-world contextualized data
Multimodal sentiment analysis
This repository contains the dataset and baselines described in the paper "M2H2: A Multimodal Multiparty Hindi Dataset For Humor Recognition in Conversations"
Official implementation of "Multi-scale Bottleneck Transformer for Weakly Supervised Multimodal Violence Detection"
FusionBrain Challenge 2.0: creating a multimodal multitask model
Deep-HOSeq: Deep Higher-Order Sequence Fusion for Multimodal Sentiment Analysis.
VAPOR: Legged Robot Navigation in Outdoor Vegetation using Offline Reinforcement Learning (ICRA 2024)