SOTAVerified

Multimodal Large Language Model

Papers

Showing 161-170 of 347 papers

| Title | Status | Hype |
|---|---|---|
| Guard Me If You Know Me: Protecting Specific Face-Identity from Deepfakes | | 0 |
| GroundingFace: Fine-grained Face Understanding via Pixel Grounding Multimodal Large Language Model | | 0 |
| COEF-VQ: Cost-Efficient Video Quality Understanding through a Cascaded Multimodal LLM Framework | | 0 |
| Graph-based Unsupervised Disentangled Representation Learning via Multimodal Large Language Models | | 0 |
| CoDi-2: In-Context Interleaved and Interactive Any-to-Any Generation | | 0 |
| Beyond Retrieval: Joint Supervision and Multimodal Document Ranking for Textbook Question Answering | | 0 |
| GPT4Video: A Unified Multimodal Large Language Model for Instruction-Followed Understanding and Safety-Aware Generation | | 0 |
| CoDi-2: In-Context, Interleaved, and Interactive Any-to-Any Generation | | 0 |
| Gesture-Aware Zero-Shot Speech Recognition for Patients with Language Disorders | | 0 |
| CLSP: High-Fidelity Contrastive Language-State Pre-training for Agent State Representation | | 0 |

No leaderboard results yet.