SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 201210 of 474278 papers

TitleStatusHype
StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language ModelsCode9
ORPO: Monolithic Preference Optimization without Reference ModelCode9
LLM4Decompile: Decompiling Binary Code with Large Language ModelsCode9
Divide and Conquer: High-Resolution Industrial Anomaly Detection via Memory Efficient Tiled EnsembleCode9
Yi: Open Foundation Models by 01.AICode9
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-onCode9
TripoSR: Fast 3D Object Reconstruction from a Single ImageCode9
World Model on Million-Length Video And Language With Blockwise RingAttentionCode9
UFO: A UI-Focused Agent for Windows OS InteractionCode9
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language ModelsCode9
Show:102550
← PrevPage 21 of 47428Next →