SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 1301–1310 of 661,570 papers

Title | Status | Hype
Trace is the Next AutoDiff: Generative Optimization with Rich Feedback, Execution Traces, and LLMs | Code | 4
Building a Culture of Reproducibility in Academic Research | Code | 4
A deep learning framework for efficient pathology image analysis | Code | 4
Story-Adapter: A Training-free Iterative Framework for Long Story Visualization | Code | 4
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention | Code | 4
CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution | Code | 4
VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation | Code | 4
CitationMap: A Python Tool to Identify and Visualize Your Google Scholar Citations Around the World | Code | 4
Real-time volumetric rendering of dynamic humans | Code | 4
Improving Parallel Program Performance with LLM Optimizers via Agent-System Interfaces | Code | 4
Page 131 of 66157