SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 59015925 of 474278 papers

TitleStatusHype
SemEval-2025 Task 11: Bridging the Gap in Text-Based Emotion DetectionCode2
Chameleon: Fast-slow Neuro-symbolic Lane Topology ExtractionCode2
DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMsCode2
Boosting the Generalization and Reasoning of Vision Language Models with Curriculum Reinforcement LearningCode2
YOLOMG: Vision-based Drone-to-Drone Detection with Appearance and Pixel-Level Motion FusionCode2
When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token PruningCode2
FaceID-6M: A Large-Scale, Open-Source FaceID Customization DatasetCode2
MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical ReasoningCode2
AR-Diffusion: Asynchronous Video Generation with Auto-Regressive DiffusionCode2
A Multimodal Benchmark Dataset and Model for Crop Disease DiagnosisCode2
Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation ModelCode2
Agent models: Internalizing Chain-of-Action Generation into Reasoning modelsCode2
Similarity-Guided Layer-Adaptive Vision Transformer for UAV TrackingCode2
DiffAtlas: GenAI-fying Atlas Segmentation via Image-Mask DiffusionCode2
Axes that matter: PCA with a differenceCode2
Learning Few-Step Diffusion Models by Trajectory Distribution MatchingCode2
Emulating Self-attention with Convolution for Efficient Image Super-ResolutionCode2
CLIMB: Data Foundations for Large Scale Multimodal Clinical Foundation ModelsCode2
DiffCLIP: Differential Attention Meets CLIPCode2
RouterEval: A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in LLMsCode2
USP: Unified Self-Supervised Pretraining for Image Generation and UnderstandingCode2
A Noise-Robust Turn-Taking System for Real-World Dialogue Robots: A Field ExperimentCode2
X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention DistillationCode2
Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language ModelCode2
Can Atomic Step Decomposition Enhance the Self-structured Reasoning of Multimodal Large Models?Code2
Show:102550
← PrevPage 237 of 18972Next →