SOTAVerified

Multimodal Large Language Model

Papers

Showing 11-20 of 347 papers

Title | Status | Hype
MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis | Code | 1
ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation | Code | 3
DreamJourney: Perpetual View Generation with Video Diffusion Models | | 0
The Condition Number as a Scale-Invariant Proxy for Information Encoding in Neural Units | Code | 1
ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM | | 0
CFBenchmark-MM: Chinese Financial Assistant Benchmark for Multimodal Large Language Model | | 0
VIS-Shepherd: Constructing Critic for LLM-based Data Visualization Generation | Code | 0
VGR: Visual Grounded Reasoning | | 0
PHRASED: Phrase Dictionary Biasing for Speech Translation | | 0
Parking, Perception, and Retail: Street-Level Determinants of Community Vitality in Harbin | | 0

No leaderboard results yet.