SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 52265250 of 661570 papers

TitleStatusHype
OpenUni: A Simple Baseline for Unified Multimodal Understanding and GenerationCode2
SWE-bench Goes Live!Code2
Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization and Temporal Motion ModulationCode2
UniTEX: Universal High Fidelity Generative Texturing for 3D ShapesCode2
UniRL: Self-Improving Unified Multimodal Models via Supervised and Reinforcement LearningCode2
ZeroGUI: Automating Online GUI Learning at Zero Human CostCode2
TextRegion: Text-Aligned Region Tokens from Frozen Image-Text ModelsCode2
DRO: A Python Library for Distributionally Robust Optimization in Machine LearningCode2
VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation ModelsCode2
ZIPA: A family of efficient models for multilingual phone recognitionCode2
ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning EngineeringCode2
VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-TuningCode2
ZPressor: Bottleneck-Aware Compression for Scalable Feed-Forward 3DGSCode2
Zero-Shot Vision Encoder Grafting via LLM SurrogatesCode2
Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPOCode2
DistMLIP: A Distributed Inference Platform for Machine Learning Interatomic PotentialsCode2
GeoDrive: 3D Geometry-Informed Driving World Model with Precise Action ControlCode2
cadrille: Multi-modal CAD Reconstruction with Online Reinforcement LearningCode2
Reinforcing General Reasoning without VerifiersCode2
LLaMEA-BO: A Large Language Model Evolutionary Algorithm for Automatically Generating Bayesian Optimization AlgorithmsCode2
SPA-RL: Reinforcing LLM Agents via Stepwise Progress AttributionCode2
UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI AgentsCode2
HoliTom: Holistic Token Merging for Fast Video Large Language ModelsCode2
R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token RoutingCode2
SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot SegmentationCode2
Show:102550
← PrevPage 210 of 26463Next →