SOTAVerified

Image Comprehension

Papers

Showing 21–30 of 49 papers

Title | Status | Hype
SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models | - | 0
Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models | - | 0
RRHF-V: Ranking Responses to Mitigate Hallucinations in Multimodal Large Language Models with Human Feedback | Code | 0
EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM | - | 0
Survey of different Large Language Model Architectures: Trends, Benchmarks, and Challenges | - | 0
CLIC: Contrastive Learning Framework for Unsupervised Image Complexity Representation | Code | 0
MIRe: Enhancing Multimodal Queries Representation via Fusion-Free Modality Interaction for Multimodal Retrieval | Code | 0
Aquila: A Hierarchically Aligned Visual-Language Model for Enhanced Remote Sensing Image Comprehension | - | 0
Teach Multimodal LLMs to Comprehend Electrocardiographic Images | - | 0
FTII-Bench: A Comprehensive Multimodal Benchmark for Flow Text with Image Insertion | Code | 0

No leaderboard results yet.