SOTAVerified

Multi-modal Classification

Papers

Showing 11-20 of 31 papers

| Title | Status | Hype |
| --- | --- | --- |
| PromptStyler: Prompt-driven Style Generation for Source-free Domain Generalization | Code | 1 |
| FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks | Code | 1 |
| Contrastive Audio-Visual Masked Autoencoder | Code | 2 |
| AVT: Audio-Video Transformer for Multimodal Action Recognition | | 0 |
| Multiscale Multimodal Transformer for Multimodal Action Recognition | | 0 |
| UAVM: Towards Unifying Audio and Visual Models | Code | 1 |
| Multi-Modal Hypergraph Diffusion Network with Dual Prior for Alzheimer Classification | | 0 |
| On Modality Bias Recognition and Reduction | Code | 0 |
| Multimodal Dynamics: Dynamical Fusion for Trustworthy Multimodal Classification | Code | 1 |
| Multi Task Learning based Framework for Multimodal Classification | | 0 |

Benchmark Results

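| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MMT | Top-1 Accuracy | 66.2 | | Unverified |
| 2 | CAV-MAE (Audio-Visual) | Top-1 Accuracy | 65.9 | | Unverified |
| 3 | UAVM | Top-1 Accuracy | 65.8 | | Unverified |
| 4 | AVT | Top-1 Accuracy | 63.9 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CAV-MAE | Average mAP | 0.51 | | Unverified |
| 2 | UAVM | Average mAP | 0.5 | | Unverified |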