SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 25112520 of 474278 papers

TitleStatusHype
OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement LearningCode3
Generative AI for Autonomous Driving: Frontiers and OpportunitiesCode3
OLinear: A Linear Model for Time Series Forecasting in Orthogonally Transformed DomainCode3
Web-Bench: A LLM Code Benchmark Based on Web Standards and FrameworksCode3
CompSLAM: Complementary Hierarchical Multi-Modal Localization and Mapping for Robot Autonomy in Underground EnvironmentsCode3
LLMs Get Lost In Multi-Turn ConversationCode3
The ML.ENERGY Benchmark: Toward Automated Inference Energy Measurement and OptimizationCode3
TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and GenerationCode3
A Common Interface for Automatic DifferentiationCode3
SOAP: Style-Omniscient Animatable PortraitsCode3
Show:102550
← PrevPage 252 of 47428Next →