SOTAVerified

Multimodal Large Language Model

Papers

Showing 5160 of 347 papers

TitleStatusHype
mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic DataCode2
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You WantCode2
MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language ModelsCode2
ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code GenerationCode2
Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel DecodingCode2
CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual ScenariosCode2
LaVy: Vietnamese Multimodal Large Language ModelCode2
StoryTeller: Improving Long Video Description through Global Audio-Visual Character IdentificationCode2
LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal UnderstandingCode2
MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series AnalysisCode2
Show:102550
← PrevPage 6 of 35Next →

No leaderboard results yet.