SOTAVerified

Multimodal Large Language Model

Papers

Showing 81-90 of 347 papers

Title | Status | Hype
MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis | Code | 1
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences | Code | 1
Meaning Typed Prompting: A Technique for Efficient, Reliable Structured Output Generation | Code | 1
3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding | Code | 1
Open3DVQA: A Benchmark for Comprehensive Spatial Reasoning with Multimodal Large Language Model in Open Space | Code | 1
PatentLMM: Large Multimodal Model for Generating Descriptions for Patent Figures | Code | 1
AnomalyR1: A GRPO-based End-to-end MLLM for Industrial Anomaly Detection | Code | 1
LLaSA: A Multimodal LLM for Human Activity Analysis Through Wearable and Smartphone Sensors | Code | 1
LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spatial Relations | Code | 1
FinVis-GPT: A Multimodal Large Language Model for Financial Chart Analysis | Code | 1

No leaderboard results yet.