SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 1451-1460 of 661,570 papers

Title | Status | Hype
OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning | Code | 4
VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling | Code | 4
Training Software Engineering Agents and Verifiers with SWE-Gym | Code | 4
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization | Code | 4
MINIMA: Modality Invariant Image Matching | Code | 4
The Thousand Brains Project: A New Paradigm for Sensorimotor Intelligence | Code | 4
Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders | Code | 4
LLM4AD: A Platform for Algorithm Design with Large Language Model | Code | 4
OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving | Code | 4
Human-Humanoid Robots Cross-Embodiment Behavior-Skill Transfer Using Decomposed Adversarial Learning from Demonstration | Code | 4
Page 146 of 66,157