SOTAVerified

Multi-modal Classification

Papers

Showing 110 of 31 papers

TitleStatusHype
Lightweight Joint Audio-Visual Deepfake Detection via Single-Stream Multi-Modal Learning Framework0
A Survey on Training-free Open-Vocabulary Semantic Segmentation0
A Comparative Study of Human Activity Recognition: Motion, Tactile, and multi-modal Approaches0
Multi-modal classification of forest biodiversity potential from 2D orthophotos and 3D airborne laser scanning point clouds0
Multimodal Learning with Uncertainty Quantification based on Discounted Belief FusionCode1
Hateful Meme Detection through Context-Sensitive Prompting and Fine-Grained LabelingCode0
Turbo your multi-modal classification with contrastive learning0
FungiTastic: A multi-modal dataset and benchmark for image categorization0
Language Augmentation in CLIP for Improved Anatomy Detection on Multi-modal Medical Images0
Joint-Individual Fusion Structure with Fusion Attention Module for Multi-Modal Skin Cancer Classification0
Show:102550
← PrevPage 1 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MMTTop-1 Accuracy66.2Unverified
2CAV-MAE (Audio-Visual)Top-1 Accuracy65.9Unverified
3UAVMTop-1 Accuracy65.8Unverified
4AVTTop-1 Accuracy63.9Unverified
#ModelMetricClaimedVerifiedStatus
1CAV-MAEAverage mAP0.51Unverified
2UAVMAverage mAP0.5Unverified