SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 651675 of 659983 papers

TitleStatusHype
EvoScientist: Towards Multi-Agent Evolving AI Scientists for End-to-End Scientific Discovery5
OpenTSLM: Time-Series Language Models for Reasoning over Multivariate Medical Text- and Time-Series Data5
MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE5
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length5
InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery5
FireRed-Image-Edit-1.0 Technical Report5
SAMTok: Representing Any Mask with Two Words5
CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning5
World Action Models are Zero-shot Policies5
Helios: Real Real-Time Long Video Generation Model5
Rethinking the Design of Reinforcement Learning-Based Deep Research Agents5
Kimi K2.5: Visual Agentic Intelligence5
Training Large Language Models to Reason in a Continuous Latent SpaceCode5
YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual PerceptionCode5
YOLOv6: A Single-Stage Object Detection Framework for Industrial ApplicationsCode5
FasterDiT: Towards Faster Diffusion Transformers Training without Architecture ModificationCode5
OminiControl2: Efficient Conditioning for Diffusion TransformersCode5
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8BCode5
Semantic Operators: A Declarative Model for Rich, AI-based Data ProcessingCode5
OMG-Seg: Is One Model Good Enough For All Segmentation?Code5
Ferret: Refer and Ground Anything Anywhere at Any GranularityCode5
TimeMixer: Decomposable Multiscale Mixing for Time Series ForecastingCode5
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGICode5
SoftHGNN: Soft Hypergraph Neural Networks for General Visual RecognitionCode5
Masked Completion via Structured Diffusion with White-Box TransformersCode5
Show:102550
← PrevPage 27 of 26400Next →