SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 2751-2760 of 177340 papers

Title | Status | Hype
The Prusti project: Formal verification for Rust | Code | 3
UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation | Code | 3
RAKG:Document-level Retrieval Augmented Knowledge Graph Construction | Code | 3
ROCKET: Exceptionally fast and accurate time series classification using random convolutional kernels | Code | 3
AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls | Code | 3
RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text | Code | 3
Punica: Multi-Tenant LoRA Serving | Code | 3
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation | Code | 3
RepViT-SAM: Towards Real-Time Segmenting Anything | Code | 3
PokeLLMon: A Human-Parity Agent for Pokemon Battles with Large Language Models | Code | 3
Page 276 of 17734