SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 48764900 of 661570 papers

TitleStatusHype
Solaris: Building a Multiplayer Video World Model in Minecraft2
EmbodMocap: In-the-Wild 4D Human-Scene Reconstruction for Embodied Agents2
Unified Multimodal Models as Auto-Encoders2
RebuttalAgent: Strategic Persuasion in Academic Rebuttal via Theory of Mind2
VecGlypher: Unified Vector Glyph Generation with Language Models2
Seeing the Forest and the Trees: Query-Aware Tokenizer for Long-Video Multimodal Language Models2
Training-free Mixed-Resolution Latent Upsampling for Spatially Accelerated Diffusion Transformers2
Mobile-O: Unified Multimodal Understanding and Generation on Mobile Device2
SimToolReal: An Object-Centric Policy for Zero-Shot Dexterous Tool Manipulation2
UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation2
Should We Still Pretrain Encoders with Masked Language Modeling?2
PyVision-RL: Forging Open Agentic Vision Models via RL2
NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents2
Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight2
OBS-Diff: Accurate Pruning For Diffusion Models in One-Shot2
TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics2
On Predictability of Reinforcement Learning Dynamics for Large Language Models2
Esoteric Language Models: Bridging Autoregressive and Masked Diffusion LLMs2
Follow-Your-Shape: Shape-Aware Image Editing via Trajectory-Guided Region Control2
Agent Banana: High-Fidelity Image Editing with Agentic Thinking and Tooling2
UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing2
SimVLA: A Simple VLA Baseline for Robotic Manipulation2
SAGE: Scalable Agentic 3D Scene Generation for Embodied AI2
VLANeXt: Recipes for Building Strong VLA Models2
MolmoSpaces: A Large-Scale Open Ecosystem for Robot Navigation and Manipulation2
Show:102550
← PrevPage 196 of 26463Next →