SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 12911300 of 661570 papers

TitleStatusHype
A Survey of LLM DATACode4
LORE: Lagrangian-Optimized Robust Embeddings for Visual EncodersCode4
Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language ModelsCode4
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement LearningCode4
Qiskit Machine Learning: an open-source library for quantum machine learning tasks at scale on quantum hardware and classical simulatorsCode4
Scaling Up Biomedical Vision-Language Models: Fine-Tuning, Instruction Tuning, and Multi-Modal LearningCode4
Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPOCode4
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement LearningCode4
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory SynthesisCode4
lmgame-Bench: How Good are LLMs at Playing Games?Code4
Show:102550
← PrevPage 130 of 66157Next →