SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 48514900 of 661570 papers

TitleStatusHype
HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images2
Proact-VL: A Proactive VideoLLM for Real-Time AI Companions2
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI2
StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?2
SciDER: Scientific Data-centric End-to-end Researcher2
WorldStereo: Bridging Camera-Guided Video Generation and Scene Reconstruction via 3D Geometric Memories2
Spotlight on Token Perception for Multimodal Reinforcement Learning2
Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy2
CRISP: Contact-Guided Real2Sim from Monocular Video with Planar Scene Primitives2
VGGT-Det: Mining VGGT Internal Priors for Sensor-Geometry-Free Multi-View Indoor 3D Object Detection2
SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs2
SoFlow: Solution Flow Models for One-Step Generative Modeling2
CLiFT: Compressive Light-Field Tokens for Compute-Efficient and Adaptive Neural Rendering2
Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play?2
ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning2
OmniGAIA: Towards Native Omni-Modal AI Agents2
EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing2
Enhancing Spatial Understanding in Image Generation via Reward Modeling2
MLP Memory: A Retriever-Pretrained Memory for Large Language Models2
From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors2
Unified Multimodal Models as Auto-Encoders2
Solaris: Building a Multiplayer Video World Model in Minecraft2
The Trinity of Consistency as a Defining Principle for General World Models2
Deforming Videos to Masks: Flow Matching for Referring Video Segmentation2
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation2
EmbodMocap: In-the-Wild 4D Human-Scene Reconstruction for Embodied Agents2
G4Splat: Geometry-Guided Gaussian Splatting with Generative Prior2
LiveMCPBench: Can Agents Navigate an Ocean of MCP Tools?2
Training-free Mixed-Resolution Latent Upsampling for Spatially Accelerated Diffusion Transformers2
RebuttalAgent: Strategic Persuasion in Academic Rebuttal via Theory of Mind2
VecGlypher: Unified Vector Glyph Generation with Language Models2
Seeing the Forest and the Trees: Query-Aware Tokenizer for Long-Video Multimodal Language Models2
Should We Still Pretrain Encoders with Masked Language Modeling?2
SimToolReal: An Object-Centric Policy for Zero-Shot Dexterous Tool Manipulation2
Mobile-O: Unified Multimodal Understanding and Generation on Mobile Device2
NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents2
PyVision-RL: Forging Open Agentic Vision Models via RL2
UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation2
Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight2
OBS-Diff: Accurate Pruning For Diffusion Models in One-Shot2
On Predictability of Reinforcement Learning Dynamics for Large Language Models2
TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics2
Esoteric Language Models: Bridging Autoregressive and Masked Diffusion LLMs2
Follow-Your-Shape: Shape-Aware Image Editing via Trajectory-Guided Region Control2
Agent Banana: High-Fidelity Image Editing with Agentic Thinking and Tooling2
SAGE: Scalable Agentic 3D Scene Generation for Embodied AI2
VLANeXt: Recipes for Building Strong VLA Models2
SimVLA: A Simple VLA Baseline for Robotic Manipulation2
UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing2
MolmoSpaces: A Large-Scale Open Ecosystem for Robot Navigation and Manipulation2
Show:102550
← PrevPage 98 of 13232Next →