SOTAVerified

Multimodal Large Language Model

Papers

Showing 71-80 of 347 papers

| Title | Status | Hype |
| --- | --- | --- |
| MovSAM: A Single-image Moving Object Segmentation Framework Based on Deep Thinking | Code | 0 |
| Face-LLaVA: Facial Expression and Attribute Understanding through Instruction Tuning | | 0 |
| Q-Agent: Quality-Driven Chain-of-Thought Image Restoration Agent through Robust Multimodal Large Language Model | | 0 |
| Towards Visual Text Grounding of Multimodal Large Language Model | | 0 |
| Universal Item Tokenization for Transferable Generative Recommendation | | 0 |
| Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target Granularities | Code | 0 |
| Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources | | 0 |
| Orchestrate Multimodal Data with Batch Post-Balancing to Accelerate Multimodal Large Language Model Training | | 0 |
| Dynamic Pyramid Network for Efficient Multimodal Large Language Model | Code | 0 |
| MLLM-For3D: Adapting Multimodal Large Language Model for 3D Reasoning Segmentation | | 0 |
Page 8 of 35

No leaderboard results yet.