SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 276300 of 659983 papers

TitleStatusHype
Paper2Code: Automating Code Generation from Scientific Papers in Machine LearningCode7
RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement LearningCode7
Step1X-Edit: A Practical Framework for General Image EditingCode7
Skywork R1V2: Multimodal Hybrid Reinforcement Learning for ReasoningCode7
TTRL: Test-Time Reinforcement LearningCode7
Chinese-Vicuna: A Chinese Instruction-following Llama-based ModelCode7
PerceptionLM: Open-Access Data and Models for Detailed Visual UnderstandingCode7
BrowseComp: A Simple Yet Challenging Benchmark for Browsing AgentsCode7
Aligning Anime Video Generation with Human FeedbackCode7
The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree SearchCode7
A Scalable Approach to Clustering Embedding ProjectionsCode7
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-ThoughtCode7
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe SystemsCode7
Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base ModelCode7
Large Language Model Agent: A Survey on Methodology, Applications and ChallengesCode7
Qwen2.5-Omni Technical ReportCode7
Open Deep Search: Democratizing Search with Open-source Reasoning AgentsCode7
Bridging Evolutionary Multiobjective Optimization and GPU Acceleration via TensorizationCode7
Scaling Vision Pre-Training to 4K ResolutionCode7
SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the WildCode7
Enhancing Fourier Neural Operators with Local Spatial FeaturesCode7
InfiniteYou: Flexible Photo Recrafting While Preserving Your IdentityCode7
xLSTM 7B: A Recurrent LLM for Fast and Efficient InferenceCode7
LHM: Large Animatable Human Reconstruction Model from a Single Image in SecondsCode7
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement LearningCode7
Show:102550
← PrevPage 12 of 26400Next →