SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 35763600 of 661570 papers

TitleStatusHype
Learn for Variation: Variationally Guided AAV Trajectory Learning in Differentiable Environments0
Data-driven construction of machine-learning-based interatomic potentials for gas-surface scattering dynamics: the case of NO on graphite0
RadioDiff-FS: Physics-Informed Manifold Alignment in Few-Shot Diffusion Models for High-Fidelity Radio Map Construction0
Through the Looking-Glass: AI-Mediated Video Communication Reduces Interpersonal Trust and Confidence in Judgments0
MultihopSpatial: Multi-hop Compositional Spatial Reasoning Benchmark for Vision-Language Model0
Evaluating LLM-Generated Lessons from the Language Learning Students' Perspective: A Short Case Study on Duolingo0
Geography According to ChatGPT -- How Generative AI Represents and Reasons about Geography0
Revisiting Autoregressive Models for Generative Image Classification0
Reasoning over mathematical objects: on-policy reward modeling and test time aggregation0
A conceptual framework for ideology beyond the left and right0
Authority-Level Priors: An Under-Specified Constraint in Hierarchical Predictive Processing0
Context Bootstrapped Reinforcement Learning0
Unsupervised Contrastive Learning for Efficient and Robust Spectral Shape Matching0
Neural Galerkin Normalizing Flow for Transition Probability Density Functions of Diffusion Models0
Secure Linear Alignment of Large Language Models0
Security, privacy, and agentic AI in a regulatory view: From definitions and distinctions to provisions and reflections0
Agentic Business Process Management: A Research Manifesto0
Improving moment tensor solutions under Earth structure uncertainty with simulation-based inference0
An Optimised Greedy-Weighted Ensemble Framework for Financial Loan Default Prediction0
Entropy trajectory shape predicts LLM reasoning reliability: A diagnostic study of uncertainty dynamics in chain-of-thought0
Unified Taxonomy for Multivariate Time Series Anomaly Detection using Deep Learning0
Maximum-Entropy Exploration with Future State-Action Visitation Measures0
Evaluating 5W3H Structured Prompting for Intent Alignment in Human-AI Interaction0
PRIOR: Perceptive Learning for Humanoid Locomotion with Reference Gait Priors0
Revisiting OmniAnomaly for Anomaly Detection: performance metrics and comparison with PCA-based models0
Show:102550
← PrevPage 144 of 26463Next →