SOTAVerified

Multimodal Text and Image Classification

Classification with both source Image and Text

Papers

No papers found.

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Early Fusion (Bert + InceptionV3)Accuracy (%)92.5Unverified
2Late Fusion (Bert + InceptionV3)Accuracy (%)84.59Unverified
#ModelMetricClaimedVerifiedStatus
1Convolutional image feature extraction and dense concatenatingAccuracy88Unverified
#ModelMetricClaimedVerifiedStatus
1Two Branch Network (Text - Bert + Image - Nts-Net)Accuracy96.81Unverified