SOTAVerified

Descriptive

Papers

Showing 1120 of 1477 papers

TitleStatusHype
Fine-Tuning Language Models from Human PreferencesCode3
SonicVerse: Multi-Task Learning for Music Feature-Informed CaptioningCode2
CausalVQA: A Physically Grounded Causal Reasoning Benchmark for Video ModelsCode2
ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single ModelCode2
VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-TuningCode2
RuleKit 2: Faster and simpler rule learningCode2
Q-Insight: Understanding Image Quality via Visual Reinforcement LearningCode2
Teaching LMMs for Image Quality Scoring and InterpretingCode2
ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image ClassificationCode2
FlashSloth : Lightning Multimodal Large Language Models via Embedded Visual CompressionCode2
Show:102550
← PrevPage 2 of 148Next →

No leaderboard results yet.