SOTAVerified

Multimodal Large Language Model

Papers

Showing 3140 of 347 papers

TitleStatusHype
Referring to Any PersonCode2
Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language ModelCode2
Keeping Yourself is Important in Downstream Tuning Multimodal Large Language ModelCode2
Introducing Visual Perception Token into Multimodal Large Language ModelCode2
mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic DataCode2
LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal UnderstandingCode2
Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal UnderstandingCode2
ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code GenerationCode2
Towards a Multimodal Large Language Model with Pixel-Level Insight for BiomedicineCode2
OpenAD: Open-World Autonomous Driving Benchmark for 3D Object DetectionCode2
Show:102550
← PrevPage 4 of 35Next →

No leaderboard results yet.