SOTAVerified

Multimodal Large Language Model

Papers

Showing 281290 of 347 papers

TitleStatusHype
AdaptVision: Dynamic Input Scaling in MLLMs for Versatile Scene UnderstandingCode0
Vintern-1B: An Efficient Multimodal Large Language Model for Vietnamese0
MaVEn: An Effective Multi-granularity Hybrid Visual Encoding Framework for Multimodal Large Language Model0
EE-MLLM: A Data-Efficient and Compute-Efficient Multimodal Large Language Model0
Video Emotion Open-vocabulary Recognition Based on Multimodal Large Language Model0
CaRDiff: Video Salient Object Ranking Chain of Thought Reasoning for Saliency Prediction with Diffusion0
PanoSent: A Panoptic Sextuple Extraction Benchmark for Multimodal Conversational Aspect-based Sentiment Analysis0
ChatGPT Meets Iris Biometrics0
VideoQA in the Era of LLMs: An Empirical StudyCode0
VolDoGer: LLM-assisted Datasets for Domain Generalization in Vision-Language Tasks0
Show:102550
← PrevPage 29 of 35Next →

No leaderboard results yet.