SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 201210 of 659983 papers

TitleStatusHype
StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language ModelsCode9
ORPO: Monolithic Preference Optimization without Reference ModelCode9
LLM4Decompile: Decompiling Binary Code with Large Language ModelsCode9
Divide and Conquer: High-Resolution Industrial Anomaly Detection via Memory Efficient Tiled EnsembleCode9
Yi: Open Foundation Models by 01.AICode9
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-onCode9
TripoSR: Fast 3D Object Reconstruction from a Single ImageCode9
World Model on Million-Length Video And Language With Blockwise RingAttentionCode9
UFO: A UI-Focused Agent for Windows OS InteractionCode9
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language ModelsCode9
Show:102550
← PrevPage 21 of 65999Next →