SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 47764800 of 661570 papers

TitleStatusHype
Metadata Embeddings for User and Item Cold-start RecommendationsCode3
U-Net: Convolutional Networks for Biomedical Image SegmentationCode3
Supplementary Material for Efficient and Robust Automated Machine LearningCode3
Efficient Reasoning with Balanced Thinking2
GenCompositor: Generative Video Compositing with Diffusion Transformer2
FASTER: Rethinking Real-Time Flow VLAs2
The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models2
TerraScope: Pixel-Grounded Visual Reasoning for Earth Observation2
TrajBooster: Boosting Humanoid Whole-Body Manipulation via Trajectory-Centric Learning2
EffectErase: Joint Video Object Removal and Insertion for High-Quality Effect Erasing2
Open-o3-Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence2
Rolling Sink: Bridging Limited-Horizon Training and Open-Ended Testing in Autoregressive Video Diffusion2
GigaWorld-Policy: An Efficient Action-Centered World--Action Model2
LoST: Level of Semantics Tokenization for 3D Shapes2
SegviGen: Repurposing 3D Generative Model for Part Segmentation2
Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models2
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models2
WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation2
GlyphPrinter: Region-Grouped Direct Preference Optimization for Glyph-Accurate Visual Text Rendering2
SKEL-CF: Coarse-to-Fine Biomechanical Skeleton and Surface Mesh Recovery2
Autonomous Agents Coordinating Distributed Discovery Through Emergent Artifact Exchange2
Composing Concepts from Images and Videos via Concept-prompt Binding2
LASER: Layer-wise Scale Alignment for Training-Free Streaming 4D Reconstruction2
EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings2
XSkill: Continual Learning from Experience and Skills in Multimodal Agents2
Show:102550
← PrevPage 192 of 26463Next →