SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 25512575 of 661570 papers

TitleStatusHype
CoMotion: Concurrent Multi-person 3D MotionCode3
Elucidating the Design Space of Multimodal Protein Language ModelsCode3
DataDecide: How to Predict Best Pretraining Data with Small ExperimentsCode3
DataSentinel: A Game-Theoretic Detection of Prompt Injection AttacksCode3
SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RLCode3
A Clean Slate for Offline Reinforcement LearningCode3
REAL: Benchmarking Autonomous Agents on Deterministic Simulations of Real WebsitesCode3
Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement LearningCode3
Efficient Reasoning Models: A SurveyCode3
REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion TransformersCode3
DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing ReasoningCode3
Evaluation Report on MCP ServersCode3
Ai2 Scholar QA: Organized Literature Synthesis with AttributionCode3
A Minimalist Approach to LLM Reasoning: from Rejection Sampling to ReinforceCode3
RAKG:Document-level Retrieval Augmented Knowledge Graph ConstructionCode3
The Tenth NTIRE 2025 Efficient Super-Resolution Challenge ReportCode3
Deep Reasoning Translation via Reinforcement LearningCode3
GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI AgentsCode3
REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion TransformersCode3
Syzygy of Thoughts: Improving LLM CoT with the Minimal Free ResolutionCode3
TensorNEAT: A GPU-accelerated Library for NeuroEvolution of Augmenting TopologiesCode3
DocAgent: A Multi-Agent System for Automated Code Documentation GenerationCode3
MSCCL++: Rethinking GPU Communication Abstractions for Cutting-edge AI ApplicationsCode3
GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image GenerationCode3
PixelFlow: Pixel-Space Generative Models with FlowCode3
Show:102550
← PrevPage 103 of 26463Next →