SOTAVerified

Multimodal Large Language Model

Papers

Showing 276300 of 347 papers

TitleStatusHype
A Medical Multimodal Large Language Model for Pediatric Pneumonia0
DPDEdit: Detail-Preserved Diffusion Models for Multimodal Fashion Image Editing0
Balancing Performance and Efficiency: A Multimodal Large Language Model Pruning Method based Image Text Interaction0
Multimodal Multi-turn Conversation Stance Detection: A Challenge Dataset and Effective Model0
OrthoDoc: Multimodal Large Language Model for Assisting Diagnosis in Computed Tomography0
AdaptVision: Dynamic Input Scaling in MLLMs for Versatile Scene UnderstandingCode0
Vintern-1B: An Efficient Multimodal Large Language Model for Vietnamese0
MaVEn: An Effective Multi-granularity Hybrid Visual Encoding Framework for Multimodal Large Language Model0
EE-MLLM: A Data-Efficient and Compute-Efficient Multimodal Large Language Model0
Video Emotion Open-vocabulary Recognition Based on Multimodal Large Language Model0
CaRDiff: Video Salient Object Ranking Chain of Thought Reasoning for Saliency Prediction with Diffusion0
PanoSent: A Panoptic Sextuple Extraction Benchmark for Multimodal Conversational Aspect-based Sentiment Analysis0
ChatGPT Meets Iris Biometrics0
VideoQA in the Era of LLMs: An Empirical StudyCode0
VolDoGer: LLM-assisted Datasets for Domain Generalization in Vision-Language Tasks0
LLaVA-Read: Enhancing Reading Ability of Multimodal Language Models0
Graph-based Unsupervised Disentangled Representation Learning via Multimodal Large Language Models0
Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic0
Visual Text Generation in the Wild0
A Neural Matrix Decomposition Recommender System Model based on the Multimodal Large Language Model0
GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing0
MobileFlow: A Multimodal LLM For Mobile GUI Agent0
MRIR: Integrating Multimodal Insights for Diffusion-based Realistic Image Restoration0
Guardrails for avoiding harmful medical product recommendations and off-label promotion in generative AI models0
MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception0
Show:102550
← PrevPage 12 of 14Next →

No leaderboard results yet.