SOTAVerified

Multimodal Large Language Model

Papers

Showing 61-70 of 347 papers

Title | Status | Hype
Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast | Code | 2
GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering | Code | 2
Jailbreaking Attack against Multimodal Large Language Model | Code | 2
MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning | Code | 2
LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge | Code | 2
TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding | Code | 2
LLMGA: Multimodal Large Language Model based Generation Assistant | Code | 2
MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models | Code | 2
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks | Code | 2
MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis | Code | 1
Page 7 of 35

No leaderboard results yet.