SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers | 248,104 code links | 4,818 tasks

Papers

Showing 691-700 of 659,983 papers

Title | Status | Hype
Reinforcement Fine-Tuning Powers Reasoning Capability of Multimodal Large Language Models | Code | 5
BLAST: Balanced Sampling Time Series Corpus for Universal Forecasting Models | Code | 5
Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention | Code | 5
NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification | Code | 5
SoftHGNN: Soft Hypergraph Neural Networks for General Visual Recognition | Code | 5
Benchmarking the Myopic Trap: Positional Bias in Information Retrieval | Code | 5
DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning | Code | 5
Meta-World+: An Improved, Standardized, RL Benchmark | Code | 5
Group-in-Group Policy Optimization for LLM Agent Training | Code | 5
BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset | Code | 5