SOTAVerified

Multimodal Large Language Model

Papers

Showing 176200 of 347 papers

TitleStatusHype
UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning0
UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation0
VGR: Visual Grounded Reasoning0
Video Emotion Open-vocabulary Recognition Based on Multimodal Large Language Model0
Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition0
Vintern-1B: An Efficient Multimodal Large Language Model for Vietnamese0
Visual Question Answering Instruction: Unlocking Multimodal Large Language Model To Domain-Specific Visual Multitasks0
Visual Text Generation in the Wild0
ViT3D Alignment of LLaMA3: 3D Medical Image Report Generation0
VL-Mamba: Exploring State Space Models for Multimodal Learning0
VMAD: Visual-enhanced Multimodal Large Language Model for Zero-Shot Anomaly Detection0
VolDoGer: LLM-assisted Datasets for Domain Generalization in Vision-Language Tasks0
Web-Scale Visual Entity Recognition: An LLM-Driven Data Approach0
What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models0
When neural implant meets multimodal LLM: A dual-loop system for neuromodulation and naturalistic neuralbehavioral research0
WSI-LLaVA: A Multimodal Large Language Model for Whole Slide Image0
Multimodal large language model for wheat breeding: a new exploration of smart breeding0
A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization0
AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability0
A Medical Multimodal Large Language Model for Pediatric Pneumonia0
A Neural Matrix Decomposition Recommender System Model based on the Multimodal Large Language Model0
A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions0
ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM0
A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges0
A Survey on Multimodal Large Language Models0
Show:102550
← PrevPage 8 of 14Next →

No leaderboard results yet.