SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 40264050 of 661570 papers

TitleStatusHype
Q-Drift: Quantization-Aware Drift Correction for Diffusion Model Sampling0
STEP: Detecting Audio Backdoor Attacks via Stability-based Trigger Exposure Profiling0
Adaptive Domain Models: Bayesian Evolution, Warm Rotation, and Principled Training for Geometric and Neuromorphic AI0
Understanding Task Aggregation for Generalizable Ultrasound Foundation Models0
Learning-Augmented Algorithms for k-median via Online Learning0
ResNets of All Shapes and Sizes: Convergence of Training Dynamics in the Large-scale Limit0
VLM-AutoDrive: Post-Training Vision-Language Models for Safety-Critical Autonomous Driving Events0
Retrieval-Augmented LLMs for Security Incident Analysis0
Retrieval-Augmented LLM Agents: Learning to Learn from Experience0
A Computationally Efficient Learning of Artificial Intelligence System Reliability Considering Error Propagation0
MolRGen: A Training and Evaluation Setting for De Novo Molecular Generation with Reasonning Models0
CORE: Robust Out-of-Distribution Detection via Confidence and Orthogonal Residual Scoring0
ALIGN: Adversarial Learning for Generalizable Speech Neuroprosthesis0
Interpretability without actionability: mechanistic methods cannot correct language model errors despite near-perfect internal representations0
Synthetic Data Generation for Training Diversified Commonsense Reasoning Models0
Search2Motion: Training-Free Object-Level Motion Control via Attention-Consensus Search0
Fundamental Limits of Neural Network Sparsification: Evidence from Catastrophic Interpretability Collapse0
When Validation Fails: Cross-Institutional Blood Pressure Prediction and the Limits of Electronic Health Record-Based ModelsCode0
MEMO: Memory-Augmented Model Context Optimization for Robust Multi-Turn Multi-Agent LLM GamesCode0
Auditing Preferences for Brands and Cultures in LLMs0
Access Controlled Website Interaction for Agentic AI with Delegated Critical Tasks0
CeRA: Breaking the Linear Ceiling of Low-Rank Adaptation via Manifold Expansion0
Continually self-improving AI0
Impact of automatic speech recognition quality on Alzheimer's disease detection from spontaneous speech: a reproducible benchmark study with lexical modeling and statistical validation0
Optimizer-Induced Low-Dimensional Drift and Transverse Dynamics in Transformer Training0
Show:102550
← PrevPage 162 of 26463Next →