SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 311320 of 659983 papers

TitleStatusHype
Visual Agentic Reinforcement Fine-TuningCode7
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object DetectionCode7
Align Anything: Training All-Modality Models to Follow Instructions with Language FeedbackCode7
LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal ModelsCode7
Measuring short-form factuality in large language modelsCode7
RedPajama: an Open Dataset for Training Large Language ModelsCode7
Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling LibraryCode7
BrowseComp: A Simple Yet Challenging Benchmark for Browsing AgentsCode7
Easy Begun is Half Done: Spatial-Temporal Graph Modeling with ST-Curriculum DropoutCode7
Pyramidal Flow Matching for Efficient Video Generative ModelingCode7
Show:102550
← PrevPage 32 of 65999Next →