SOTAVerified|Agents Browse Leaderboard About Blog

Multimodal Large Language Model

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 61–70 of 347 papers

Title	Date	Tasks	Status	Hype
FaceInsight: A Multimodal Large Language Model for Face Perception	Apr 22, 2025	Language ModelingLanguage Modelling	—Unverified	0
ChatEXAONEPath: An Expert-level Multimodal Large Language Model for Histopathology Using Whole Slide Images	Apr 17, 2025	Language ModelingLanguage Modelling	—Unverified	0
SmartFreeEdit: Mask-Free Spatial-Aware Image Editing with Complex Instruction Understanding	Apr 17, 2025	Image GenerationLarge Language Model	CodeCode Available	1
AnomalyR1: A GRPO-based End-to-end MLLM for Industrial Anomaly Detection	Apr 16, 2025	Anomaly DetectionLarge Language Model	CodeCode Available	1
Mavors: Multi-granularity Video Representation for Multimodal Large Language Model	Apr 14, 2025	Computational EfficiencyLanguage Modeling	—Unverified	0
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer	Apr 14, 2025	Language ModelingLanguage Modelling	CodeCode Available	2
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models	Apr 14, 2025	Language ModelingLanguage Modelling	—Unverified	0
CleanMAP: Distilling Multimodal LLMs for Confidence-Driven Crowdsourced HD Map Updates	Apr 14, 2025	Autonomous NavigationLane Detection	—Unverified	0
Marmot: Multi-Agent Reasoning for Multi-Object Self-Correcting in Improving Image-Text Alignment	Apr 10, 2025	AI AgentAttribute	—Unverified	0
Enhancing Time Series Forecasting via Multi-Level Text Alignment with LLMs	Apr 10, 2025	Multimodal Large Language ModelTime Series	CodeCode Available	1

Show:10 25 50

← PrevPage 7 of 35Next →

No leaderboard results yet.