SOTAVerified|Agents Browse Leaderboard About Blog

MM-Vet

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–10 of 19 papers

Title	Date	Tasks	Status	Hype
CogVLM2: Visual Language Models for Image and Video Understanding	Aug 29, 2024	MM-VetMVBench	CodeCode Available	9
CogAgent: A Visual Language Model for GUI Agents	Dec 14, 2023	Language Modeling	CodeCode Available	5
Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition	Dec 12, 2024	EgoSchema	CodeCode Available	3
MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities	Aug 1, 2024	MathMM-Vet	CodeCode Available	3
ShapeLLM: Universal 3D Object Understanding for Embodied Interaction	Feb 27, 2024	3D geometry3D Object Captioning	CodeCode Available	3
To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning	Nov 13, 2023	Instruction FollowingMM-Vet	CodeCode Available	2
Attention Prompting on Image for Large Vision-Language Models	Sep 25, 2024	MM-VetVisual Prompting	CodeCode Available	2
MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities	Aug 4, 2023	MathMM-Vet	CodeCode Available	2
Self-Supervised Visual Preference Alignment	Apr 16, 2024	8kMM-Vet	CodeCode Available	2
Volcano: Mitigating Multimodal Hallucination through Self-Feedback Guided Revision	Nov 13, 2023	HallucinationMM-Vet	CodeCode Available	1

Show:10 25 50

← PrevPage 1 of 2Next →

No leaderboard results yet.