SOTAVerified

Multimodal Large Language Model

Papers

Showing 51-60 of 347 papers

| Title | Status | Hype |
| --- | --- | --- |
| ORQA: A Benchmark and Foundation Model for Holistic Operating Room Modeling | | 0 |
| MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO | Code | 0 |
| Beyond Retrieval: Joint Supervision and Multimodal Document Ranking for Textbook Question Answering | | 0 |
| Unifying Segment Anything in Microscopy with Multimodal Large Language Model | Code | 1 |
| Batch Augmentation with Unimodal Fine-tuning for Multimodal Learning | Code | 0 |
| Is your multimodal large language model a good science tutor? | | 0 |
| MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills | | 0 |
| On Path to Multimodal Generalist: General-Level and General-Bench | | 0 |
| Consistency-aware Fake Videos Detection on Short Video Platforms | Code | 0 |
| TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation | | 0 |

No leaderboard results yet.