SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 661670 of 659983 papers

TitleStatusHype
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey5
SAMTok: Representing Any Mask with Two Words5
UQLM: A Python Package for Uncertainty Quantification in Large Language ModelsCode5
skfolio: Portfolio Optimization in PythonCode5
Energy-Based Transformers are Scalable Learners and ThinkersVerified5
Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future FrontiersCode5
RAG-R1 : Incentivize the Search and Reasoning Capabilities of LLMs through Multi-query ParallelismCode5
ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and EditingCode5
Matrix-Game: Interactive World Foundation ModelCode5
LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement LearningCode5
Show:102550
← PrevPage 67 of 65999Next →