SOTAVerified

Descriptive

Papers

Showing 21 to 30 of 1477 papers

Title | Status | Hype
FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression | Code | 2
SensorLLM: Human-Intuitive Alignment of Multivariate Sensor Data with LLMs for Activity Recognition | Code | 2
SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion | Code | 2
SpeechCraft: A Fine-grained Expressive Speech Dataset with Natural Language Description | Code | 2
Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision | Code | 2
DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image Classification | Code | 2
MedCalc-Bench: Evaluating Large Language Models for Medical Calculations | Code | 2
RS-Agent: Automating Remote Sensing Tasks through Intelligent Agent | Code | 2
Composed Image Retrieval for Remote Sensing | Code | 2
TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning | Code | 2

No leaderboard results yet.