SOTAVerified

MME

MME is a comprehensive evaluation benchmark for multimodal large language models. It measures both perception and cognition abilities across 14 subtasks: existence, count, position, color, poster, celebrity, scene, landmark, artwork, OCR, commonsense reasoning, numerical calculation, text translation, and code reasoning.
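As a rough illustration of how the 14 subtasks might be organized when tallying results, the sketch below groups them into the two ability categories named in the description. The assignment of the first ten subtasks to perception and the last four to cognition is an assumption made for this example, and `MME_SUBTASKS` and `group_totals` are hypothetical names, not part of any official MME tooling.

```python
# Subtask names come from the benchmark description; the grouping into
# perception vs. cognition is an assumption made for illustration.
MME_SUBTASKS = {
    "perception": [
        "existence", "count", "position", "color", "poster",
        "celebrity", "scene", "landmark", "artwork", "OCR",
    ],
    "cognition": [
        "commonsense reasoning", "numerical calculation",
        "text translation", "code reasoning",
    ],
}


def group_totals(scores):
    """Sum per-subtask scores within each ability group.

    `scores` maps every subtask name to a number; how that number is
    computed is outside the scope of this sketch.
    """
    totals = {}
    for group, names in MME_SUBTASKS.items():
        missing = [name for name in names if name not in scores]
        if missing:
            raise KeyError(f"missing scores for: {missing}")
        totals[group] = sum(scores[name] for name in names)
    return totals
```

Given a dictionary with one score per subtask, `group_totals` returns one total for perception and one for cognition, and it raises `KeyError` if any subtask is missing so that an incomplete evaluation run is not silently reported.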

Papers

Showing 76–95 of 95 papers

| Title | Status | Hype |
| --- | --- | --- |
| What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Instruction Tuning | Code | 1 |
| Enhancing the Spatial Awareness Capability of Multi-Modal Large Language Model | | 0 |
| Benchmarking and In-depth Performance Study of Large Language Models on Habana Gaudi Processors | | 0 |
| InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition | Code | 0 |
| MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning | Code | 2 |
| BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions | Code | 2 |
| Domain Adaptation via Minimax Entropy for Real/Bogus Classification of Astronomical Alerts | | 0 |
| Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions | Code | 2 |
| MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models | Code | 2 |
| Multi-Modal Evaluation Approach for Medical Image Segmentation | | 0 |
| MAAL: Multimodality-Aware Autoencoder-Based Affordance Learning for 3D Articulated Objects | Code | 0 |
| Masked Motion Encoding for Self-Supervised Video Representation Learning | Code | 1 |
| MM-GNN: Mix-Moment Graph Neural Network towards Modeling Neighborhood Feature Distribution | Code | 0 |
| MME-CRS: Multi-Metric Evaluation Based on Correlation Re-Scaling for Evaluating Open-Domain Dialogue | | 0 |
| Machine Learning Methods for Inferring the Number of UAV Emitters via Massive MIMO Receive Array | | 0 |
| Online Meta-Learning for Multi-Source and Semi-Supervised Domain Adaptation | | 0 |
| Learning Multilingual Meta-Embeddings for Code-Switching Named Entity Recognition | | 0 |
| Deep Learning for Hybrid 5G Services in Mobile Edge Computing Systems: Learn from a Digital Twin | | 0 |
| Scalable K-Medoids via True Error Bound and Familywise Bandits | | 0 |
| Semi-supervised Domain Adaptation via Minimax Entropy | Code | 1 |

No leaderboard results yet.