SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 41514175 of 661570 papers

TitleStatusHype
Rolling Sink: Bridging Limited-Horizon Training and Open-Ended Testing in Autoregressive Video Diffusion2
Detecting Transportation Mode Using Dense Smartphone GPS Trajectories and Transformer Models0
SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous Integration0
A Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Generation0
Context-Nav: Context-Driven Exploration and Viewpoint-Aware 3D Spatial Reasoning for Instance Navigation0
Exploiting Adaptive Channel Pruning for Communication-Efficient Split Learning0
Coherent Human-Scene Reconstruction from Multi-Person Multi-View Video in a Single Pass0
Human-AI Co-reasoning for Clinical Diagnosis with Evidence-Integrated Language Agent0
Just-in-Time: Training-Free Spatial Acceleration for Diffusion Transformers0
Multimodal Emotion Recognition via Bi-directional Cross-Attention and Temporal Modeling0
Real-World AI Evaluation: How FRAME Generates Systematic Evidence to Resolve the Decision-Maker's Dilemma0
Spatial Transcriptomics as Images for Large-Scale Pretraining0
SAATT Nav: a Socially Aware Autonomous Transparent Transportation Navigation Framework for Wheelchairs0
The Reasoning Bottleneck in Graph-RAG: Structured Prompting and Context Compression for Multi-Hop QA0
AvatarForcing: One-Step Streaming Talking Avatars via Local-Future Sliding-Window Denoising0
SemanticFace: Semantic Facial Action Estimation via Semantic Distillation in Interpretable Space0
F2HDR: Two-Stage HDR Video Reconstruction via Flow Adapter and Physical Motion Modeling0
Training-free Detection of Generated Videos via Spatial-Temporal Likelihoods0
Open Biomedical Knowledge Graphs at Scale: Construction, Federation, and AI Agent Access with Samyama Graph Database0
A Tutorial on ALOS2 SAR Utilization: Dataset Preparation, Self-Supervised Pretraining, and Semantic Segmentation0
I Know What I Don't Know: Latent Posterior Factor Models for Multi-Evidence Probabilistic Reasoning0
Theoretical Foundations of Latent Posterior Factors: Formal Guarantees for Multi-Evidence Reasoning0
A Framework and Prototype for a Navigable Map of Datasets in Engineering Design and Systems Engineering0
OMNIFLOW: A Physics-Grounded Multimodal Agent for Generalized Scientific Reasoning0
100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models0
Show:102550
← PrevPage 167 of 26463Next →