SOTAVerified

MM-Vet

Papers

Showing 110 of 19 papers

TitleStatusHype
CogVLM2: Visual Language Models for Image and Video UnderstandingCode9
CogAgent: A Visual Language Model for GUI AgentsCode5
Lyra: An Efficient and Speech-Centric Framework for Omni-CognitionCode3
MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated CapabilitiesCode3
ShapeLLM: Universal 3D Object Understanding for Embodied InteractionCode3
To See is to Believe: Prompting GPT-4V for Better Visual Instruction TuningCode2
Attention Prompting on Image for Large Vision-Language ModelsCode2
MM-Vet: Evaluating Large Multimodal Models for Integrated CapabilitiesCode2
Self-Supervised Visual Preference AlignmentCode2
Volcano: Mitigating Multimodal Hallucination through Self-Feedback Guided RevisionCode1
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.