SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 961-970 of 177,340 papers

Title | Status | Hype
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations | Code | 5
NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification | Code | 5
CogAgent: A Visual Language Model for GUI Agents | Code | 5
Transformer-Squared: Self-adaptive LLMs | Code | 5
CogVLM: Visual Expert for Pretrained Language Models | Code | 5
Aria: An Open Multimodal Native Mixture-of-Experts Model | Code | 5
Advancing Humanoid Locomotion: Mastering Challenging Terrains with Denoising World Model Learning | Code | 5
τ-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains | Code | 5
A Brief Overview of AI Governance for Responsible Machine Learning Systems | Code | 5
Autoregressive Image Generation without Vector Quantization | Code | 5