SOTAVerified

Multimodal Deep Learning

Multimodal deep learning is a type of deep learning that combines information from multiple modalities, such as text, image, audio, and video, to make more accurate and comprehensive predictions. It involves training deep neural networks on data that includes multiple types of information and using the network to make predictions based on this combined data.

One of the key challenges in multimodal deep learning is how to effectively combine information from multiple modalities. This can be done using a variety of techniques, such as fusing the features extracted from each modality, or using attention mechanisms to weight the contribution of each modality based on its importance for the task at hand.

Multimodal deep learning has many applications, including image captioning, speech recognition, natural language processing, and autonomous vehicles. By combining information from multiple modalities, multimodal deep learning can improve the accuracy and robustness of models, enabling them to perform better in real-world scenarios where multiple types of information are present.

Title	Date	Tasks	Status	Hype
Emotion Based Hate Speech Detection using Multimodal Learning	Feb 13, 2022	Hate Speech DetectionMultimodal Deep Learning	—Unverified	0
Geometric Multimodal Deep Learning with Multi-Scaled Graph Wavelet Convolutional Network	Nov 26, 2021	Multimodal Deep LearningNode Classification	—Unverified	0
Multimodal Approach for Metadata Extraction from German Scientific Publications	Nov 10, 2021	Multimodal Deep Learning	—Unverified	0
From Multimodal to Unimodal Attention in Transformers using Knowledge Distillation	Oct 15, 2021	Knowledge DistillationMultimodal Deep Learning	—Unverified	0
DeepStroke: An Efficient Stroke Screening Framework for Emergency Rooms with Multimodal Adversarial Deep Learning	Sep 24, 2021	AttributeMultimodal Deep Learning	—Unverified	0
Contrastive Language-Image Pre-training for the Italian Language	Aug 19, 2021	Image RetrievalMulti-label zero-shot learning	CodeCode Available	1
Pan-Cancer Integrative Histology-Genomic Analysis via Interpretable Multimodal Deep Learning	Aug 4, 2021	Deep LearningMultimodal Deep Learning	CodeCode Available	1
Multimodal Co-learning: Challenges, Applications with Datasets, Recent Advances and Future Directions	Jul 29, 2021	Multimodal Deep Learning	—Unverified	0
Bi-Bimodal Modality Fusion for Correlation-Controlled Multimodal Sentiment Analysis	Jul 28, 2021	Multimodal Deep LearningMultimodal Sentiment Analysis	CodeCode Available	1
A Multimodal Deep Learning Model for Cardiac Resynchronisation Therapy Response Prediction	Jul 20, 2021	Deep LearningMultimodal Deep Learning	—Unverified	0

Title

Status

Hype

Emotion Based Hate Speech Detection using Multimodal Learning

—Unverified

Geometric Multimodal Deep Learning with Multi-Scaled Graph Wavelet Convolutional Network

—Unverified

Multimodal Approach for Metadata Extraction from German Scientific Publications

—Unverified

From Multimodal to Unimodal Attention in Transformers using Knowledge Distillation

—Unverified

DeepStroke: An Efficient Stroke Screening Framework for Emergency Rooms with Multimodal Adversarial Deep Learning