SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 29012910 of 177340 papers

TitleStatusHype
Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based AgentsCode3
From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeersCode3
MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language ModelsCode3
Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A SurveyCode3
Vid2Avatar: 3D Avatar Reconstruction from Videos in the Wild via Self-supervised Scene DecompositionCode3
OrionBench: A Benchmark for Chart and Human-Recognizable Object Detection in InfographicsCode3
nnInteractive: Redefining 3D Promptable SegmentationCode3
3DIS: Depth-Driven Decoupled Instance Synthesis for Text-to-Image GenerationCode3
Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative FrameworkCode3
Ai2 Scholar QA: Organized Literature Synthesis with AttributionCode3
Show:102550
← PrevPage 291 of 17734Next →