SOTAVerified

Multimodal Large Language Model

Papers

Showing 261270 of 347 papers

TitleStatusHype
LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding0
LLaVA-Read: Enhancing Reading Ability of Multimodal Language Models0
LLaVA-Ultra: Large Chinese Language and Vision Assistant for Ultrasound0
LRMR: LLM-Driven Relational Multi-node Ranking for Lymph Node Metastasis Assessment in Rectal Cancer0
Lumos : Empowering Multimodal LLMs with Scene Text Recognition0
Make Imagination Clearer! Stable Diffusion-based Visual Imagination for Multimodal Machine Translation0
ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation0
Marmot: Multi-Agent Reasoning for Multi-Object Self-Correcting in Improving Image-Text Alignment0
Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval0
MaVEn: An Effective Multi-granularity Hybrid Visual Encoding Framework for Multimodal Large Language Model0
Show:102550
← PrevPage 27 of 35Next →

No leaderboard results yet.