SOTAVerified

Multimodal Large Language Model

Papers

Showing 221230 of 347 papers

TitleStatusHype
GroundingFace: Fine-grained Face Understanding via Pixel Grounding Multimodal Large Language Model0
ST^3: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming0
MLLM-SUL: Multimodal Large Language Model for Semantic Scene Understanding and Localization in Traffic ScenariosCode0
A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization0
SubstationAI: Multimodal Large Model-Based Approaches for Analyzing Substation Equipment Faults0
J-EDI QA: Benchmark for deep-sea organism-specific multimodal LLM0
Multimodal Hypothetical Summary for Retrieval-based Multi-image Question AnsweringCode0
Make Imagination Clearer! Stable Diffusion-based Visual Imagination for Multimodal Machine Translation0
MERaLiON-SpeechEncoder: Towards a Speech Foundation Model for Singapore and Beyond0
A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges0
Show:102550
← PrevPage 23 of 35Next →

No leaderboard results yet.