SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 23512375 of 661570 papers

TitleStatusHype
HY-WU (Part I): An Extensible Functional Neural Memory Framework and An Instantiation in Text-Guided Image Editing3
Human3R: Everyone Everywhere All at Once3
Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing3
LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory3
Latent Diffusion Model without Variational Autoencoder3
tttLRM: Test-Time Training for Long Context and Autoregressive 3D Reconstruction3
FireRed-OCR Technical Report3
EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering3
GEM: A Gym for Agentic LLMs3
RLP: Reinforcement as a Pretraining Objective3
Uni-cot: Towards Unified Chain-of-Thought Reasoning Across Text and Vision3
OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence3
Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding3
The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution3
Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Machine Learning Engineering3
EO-1: An Open Unified Embodied Foundation Model for General Robot Control3
A Survey of Data Agents: Emerging Paradigm or Overstated Hype?3
Much Ado About Noising: Dispelling the Myths of Generative Robotic Control3
JavisDiT++: Unified Modeling and Optimization for Joint Audio-Video Generation3
pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation3
PartUV: Part-Based UV Unwrapping of 3D Meshes3
AnyUp: Universal Feature Upsampling3
DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing3
Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution3
LLaDA2.1: Speeding Up Text Diffusion via Token Editing3
Show:102550
← PrevPage 95 of 26463Next →