SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 30213030 of 474278 papers

TitleStatusHype
AutoKaggle: A Multi-Agent Framework for Autonomous Data Science CompetitionsCode3
Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse AutoencodersCode3
Centaur: a foundation model of human cognitionCode3
Improving Model Evaluation using SMART Filtering of Benchmark DatasetsCode3
OGBench: Benchmarking Offline Goal-Conditioned RLCode3
Paint Bucket Colorization Using Anime Character Color Design SheetsCode3
ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language ModelsCode3
COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 TrainingCode3
Large Spatial Model: End-to-end Unposed Images to Semantic 3DCode3
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to AdvancesCode3
Show:102550
← PrevPage 303 of 47428Next →