SOTAVerified

Multimodal Large Language Model

Papers

Showing 331–340 of 347 papers

| Title | Status | Hype |
| --- | --- | --- |
| Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources | | 0 |
| Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy | | 0 |
| Orchestrate Multimodal Data with Batch Post-Balancing to Accelerate Multimodal Large Language Model Training | | 0 |
| ORQA: A Benchmark and Foundation Model for Holistic Operating Room Modeling | | 0 |
| OrthoDoc: Multimodal Large Language Model for Assisting Diagnosis in Computed Tomography | | 0 |
| PanoSent: A Panoptic Sextuple Extraction Benchmark for Multimodal Conversational Aspect-based Sentiment Analysis | | 0 |
| Parking, Perception, and Retail: Street-Level Determinants of Community Vitality in Harbin | | 0 |
| PHRASED: Phrase Dictionary Biasing for Speech Translation | | 0 |
| PP-DocBee: Improving Multimodal Document Understanding Through a Bag of Tricks | | 0 |
| Q-Agent: Quality-Driven Chain-of-Thought Image Restoration Agent through Robust Multimodal Large Language Model | | 0 |

No leaderboard results yet.