SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 46764700 of 661570 papers

TitleStatusHype
vAccSOL: Efficient and Transparent AI Vision Offloading for Mobile Robots0
CABTO: Context-Aware Behavior Tree Grounding for Robot Manipulation0
Grid-World Representations in Transformers Reflect Predictive Geometry0
Cost Trade-offs in Matrix Inversion Updates for Streaming Outlier Detection0
Learning Lineage-guided Geodesics with Finsler Geometry0
High-dimensional estimation with missing data: Statistical and computational limits0
GeMA: Learning Latent Manifold Frontiers for Benchmarking Complex Systems0
Emotion-Aware Classroom Quality Assessment Leveraging IoT-Based Real-Time Student Monitoring0
Understanding Quantization of Optimizer States in LLM Pre-training: Dynamics of State Staleness and Effectiveness of State Resets0
IQuest-Coder-V1 Technical Report0
Differential Harm Propensity in Personalized LLM Agents: The Curious Case of Mental Health Disclosure0
Semi-supervised Latent Disentangled Diffusion Model for Textile Pattern Generation0
Efficient Reasoning on the Edge0
MedCL-Bench: Benchmarking stability-efficiency trade-offs and scaling in biomedical continual learning0
Data-driven forced response analysis with min-max representations of nonlinear restoring forces0
Finding Common Ground in a Sea of Alternatives0
pADAM: A Plug-and-Play All-in-One Diffusion Architecture for Multi-Physics Learning0
SuCor: Susceptibility Distortion Correction via Parameter-Free and Self-Regularized Optimal Transport0
SOMP: Scalable Gradient Inversion for Large Language Models via Subspace-Guided Orthogonal Matching Pursuit0
Anticipatory Planning for Multimodal AI Agents0
IOSVLM: A 3D Vision-Language Model for Unified Dental Diagnosis from Intraoral Scans0
High-Dimensional Gaussian Mean Estimation under Realizable Contamination0
ODIN-Based CPU-GPU Architecture with Replay-Driven Simulation and Emulation0
WildDepth: A Multimodal Dataset for 3D Wildlife Perception and Depth Estimation0
SurgΣ: A Spectrum of Large-Scale Multimodal Data and Foundation Models for Surgical Intelligence0
Show:102550
← PrevPage 188 of 26463Next →