SOTAVerified

MM-Vet

Papers

Showing 1119 of 19 papers

TitleStatusHype
ShapeLLM: Universal 3D Object Understanding for Embodied InteractionCode3
Multi-modal Preference Alignment Remedies Degradation of Visual Instruction Tuning on Language ModelsCode1
DIEM: Decomposition-Integration Enhancing Multimodal Insights0
CogAgent: A Visual Language Model for GUI AgentsCode5
Text as Images: Can Multimodal Large Language Models Follow Printed Instructions in Pixels?Code1
Volcano: Mitigating Multimodal Hallucination through Self-Feedback Guided RevisionCode1
To See is to Believe: Prompting GPT-4V for Better Visual Instruction TuningCode2
Enhancing the Spatial Awareness Capability of Multi-Modal Large Language Model0
MM-Vet: Evaluating Large Multimodal Models for Integrated CapabilitiesCode2
Show:102550
← PrevPage 2 of 2Next →

No leaderboard results yet.