SOTAVerified

Multimodal Large Language Model

Papers

Showing 151175 of 347 papers

TitleStatusHype
RAGAR, Your Falsehood Radar: RAG-Augmented Reasoning for Political Fact-Checking using Multimodal Large Language Models0
RespLLM: Unifying Audio and Text with Multimodal LLMs for Generalized Respiratory Health Prediction0
S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Modelwith Spatio-Temporal Visual Representation0
S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Model with Spatio-Temporal Visual Representation0
Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation0
SemGrasp: Semantic Grasp Generation via Language Aligned Discretization0
SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model0
SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection0
SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability0
ST^3: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming0
StreetviewLLM: Extracting Geographic Information Using a Chain-of-Thought Multimodal Large Language Model0
Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization0
SubstationAI: Multimodal Large Model-Based Approaches for Analyzing Substation Equipment Faults0
TalkFashion: Intelligent Virtual Try-On Assistant Based on Multimodal Large Language Model0
The NTNU System at the S&I Challenge 2025 SLA Open Track0
The Solution for CVPR2024 Foundational Few-Shot Object Detection Challenge0
Think Before You Diffuse: LLMs-Guided Physics-Aware Video Generation0
Realistic Corner Case Generation for Autonomous Vehicles with Multimodal Large Language Model0
A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization0
AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability0
A Medical Multimodal Large Language Model for Pediatric Pneumonia0
A Neural Matrix Decomposition Recommender System Model based on the Multimodal Large Language Model0
A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions0
ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM0
A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges0
Show:102550
← PrevPage 7 of 14Next →

No leaderboard results yet.