SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 29612970 of 474278 papers

TitleStatusHype
Proteina: Scaling Flow-based Protein Structure Generative ModelsCode3
A Review of Prominent Paradigms for LLM-Based Agents: Tool Use (Including RAG), Planning, and Feedback LearningCode3
AbdomenAtlas: A Large-Scale, Detailed-Annotated, & Multi-Center Dataset for Efficient Transfer Learning and Open Algorithmic BenchmarkingCode3
Self-rewarding correction for mathematical reasoningCode3
Moving Object Segmentation: All You Need Is SAM (and Flow)Code3
MDCrow: Automating Molecular Dynamics Workflows with Large Language ModelsCode3
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech RepresentationsCode3
Prompt-to-LeaderboardCode3
GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image GenerationCode3
GameBench: Evaluating Strategic Reasoning Abilities of LLM AgentsCode3
Show:102550
← PrevPage 297 of 47428Next →