SOTAVerified

Multimodal Reasoning

Reasoning over multimodal inputs.

Papers

Showing 251–260 of 302 papers

Title | Status | Hype
Knowledge-Aware Reasoning over Multimodal Semi-structured Tables |  | 0
Towards Holistic Disease Risk Prediction using Small Language Models |  | 0
User-in-the-loop Evaluation of Multimodal LLMs for Activity Assistance |  | 0
Controllable Contextualized Image Captioning: Directing the Visual Narrative through User-Defined Highlights | Code | 0
On scalable oversight with weak LLMs judging strong LLMs |  | 0
Improving Multi-Agent Debate with Sparse Communication Topology |  | 0
POEM: Interactive Prompt Optimization for Enhancing Multimodal Reasoning of Large Language Models |  | 0
Multimodal Reasoning with Multimodal Knowledge Graph |  | 0
Don't Buy it! Reassessing the Ad Understanding Abilities of Contrastive Multimodal Models | Code | 0
Retrieval Meets Reasoning: Even High-school Textbook Knowledge Benefits Multimodal Reasoning |  | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4V | Accuracy | 24 |  | Unverified
2 | Gemini Pro | Accuracy | 13.2 |  | Unverified
3 | LLaVa-1.5-13B | Accuracy | 1.8 |  | Unverified
4 | LLaVa-1.5-7B | Accuracy | 1.5 |  | Unverified
5 | BLIP2-FLAN-T5-XXL | Accuracy | 0.9 |  | Unverified
6 | QWEN | Accuracy | 0.9 |  | Unverified
7 | CogVLM | Accuracy | 0.9 |  | Unverified
8 | InstructBLIP | Accuracy | 0.6 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GPT4V | Accuracy | 22.76 |  | Unverified
2 | Gemini Pro | Accuracy | 17.66 |  | Unverified
3 | Qwen-VL-Max | Accuracy | 15.59 |  | Unverified
4 | InternLM-XComposer2-VL | Accuracy | 14.54 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 | Acc | 30.3 |  | Unverified