SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 19311940 of 177340 papers

TitleStatusHype
AgentBench: Evaluating LLMs as AgentsCode4
Semantic-SAM: Segment and Recognize Anything at Any GranularityCode4
4D Gaussian Splatting for Real-Time Dynamic Scene RenderingCode4
InstanceDiffusion: Instance-level Control for Image GenerationCode4
Depth Any Video with Scalable Synthetic DataCode4
TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic DataCode4
Quality-aware Masked Diffusion Transformer for Enhanced Music GenerationCode4
LET-3D-AP: Longitudinal Error Tolerant 3D Average Precision for Camera-Only 3D DetectionCode4
Simple and Effective Masked Diffusion Language ModelsCode4
Sample-Efficient Alignment for LLMsCode4
Show:102550
← PrevPage 194 of 17734Next →