SOTAVerified

Image Comprehension

Papers

Showing 11–20 of 49 papers

Title	Date	Tasks	Status	Hype
Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation	Dec 5, 2024	Image Comprehension, Representation Learning	Code Available	2
Survey of different Large Language Model Architectures: Trends, Benchmarks, and Challenges	Dec 4, 2024	Code Generation, Image Comprehension	Unverified	0
MMGenBench: Evaluating the Limits of LMMs from the Text-to-Image Generation Perspective	Nov 21, 2024	Image Comprehension, Image Generation	Code Available	2
CLIC: Contrastive Learning Framework for Unsupervised Image Complexity Representation	Nov 19, 2024	Attribute, Contrastive Learning	Code Available	0
MIRe: Enhancing Multimodal Queries Representation via Fusion-Free Modality Interaction for Multimodal Retrieval	Nov 13, 2024	Image Comprehension, Information Retrieval	Code Available	0
Aquila: A Hierarchically Aligned Visual-Language Model for Enhanced Remote Sensing Image Comprehension	Nov 9, 2024	Image Comprehension, Language Modeling	Unverified	0
StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding	Nov 6, 2024	Image Comprehension, Streaming video understanding	Code Available	2
Teach Multimodal LLMs to Comprehend Electrocardiographic Images	Oct 21, 2024	Diagnostic, Image Comprehension	Unverified	0
FTII-Bench: A Comprehensive Multimodal Benchmark for Flow Text with Image Insertion	Oct 16, 2024	Articles, Image Comprehension	Code Available	0
FineCops-Ref: A new Dataset and Task for Fine-Grained Compositional Referring Expression Comprehension	Sep 23, 2024	Image Comprehension, Referring Expression	Code Available	1

No leaderboard results yet.