SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 426450 of 659983 papers

TitleStatusHype
ABot-PhysWorld: Interactive World Foundation Model for Robotic Manipulation with Physics Alignment0
FG-Portrait: 3D Flow Guided Editable Portrait Animation0
SIMART: Decomposing Monolithic Meshes into Sim-ready Articulated Assets via MLLM0
Planning over MAPF Agent Dependencies via Multi-Dependency PIBT0
Beyond Preset Identities: How Agents Form Stances and Boundaries in Generative Societies0
GeoSANE: Learning Geospatial Representations from Models, Not Data0
I3DM: Implicit 3D-aware Memory Retrieval and Injection for Consistent Video Scene Generation0
3DCity-LLM: Empowering Multi-modality Large Language Models for 3D City-scale Perception and Understanding0
Code Review Agent Benchmark0
DetPO: In-Context Learning with Multi-Modal LLMs for Few-Shot Object Detection0
CSTS: A Canonical Security Telemetry Substrate for AI-Native Cyber Detection0
End-to-End Efficient RL for Linear Bellman Complete MDPs with Deterministic Transitions0
RealMaster: Lifting Rendered Scenes into Photorealistic Video0
InverFill: One-Step Inversion for Enhanced Few-Step Diffusion Inpainting0
Byzantine-Robust and Differentially Private Federated Optimization under Weaker Assumptions0
UniFunc3D: Unified Active Spatial-Temporal Grounding for 3D Functionality Segmentation0
VTAM: Video-Tactile-Action Models for Complex Physical Interaction Beyond VLAs0
ReqFusion: A Multi-Provider Framework for Automated PEGS Analysis Across Software Domains0
SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning0
Failure of contextual invariance in gender inference with large language models0
TETO: Tracking Events with Teacher Observation for Motion Estimation and Frame Interpolation0
One View Is Enough! Monocular Training for In-the-Wild Novel View Generation0
AgentRVOS: Reasoning over Object Tracks for Zero-Shot Referring Video Object Segmentation0
Foveated Diffusion: Efficient Spatially Adaptive Image and Video Generation0
LLM Inference at the Edge: Mobile, NPU, and GPU Performance Efficiency Trade-offs Under Sustained Load0
Show:102550
← PrevPage 18 of 26400Next →