SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 39764000 of 661570 papers

TitleStatusHype
VISTA: Validation-Guided Integration of Spatial and Temporal Foundation Models with Anatomical Decoding for Rare-Pathology VCE Event Detection0
Understanding the Theoretical Foundations of Deep Neural Networks through Differential Equations0
Can LLMs Reason Like Automated Theorem Provers for Rust Verification? VCoT-Bench: Evaluating via Verification Chain of Thought0
Shifting Uncertainty to Critical Moments: Towards Reliable Uncertainty Quantification for VLA Model0
HRI-SA: A Multimodal Dataset for Online Assessment of Human Situational Awareness during Remote Human-Robot Teaming0
Epistemic Generative Adversarial Networks0
Large-Scale Analysis of Political Propaganda on Moltbook0
From Noise to Signal: When Outliers Seed New Topics0
Final Report for the Workshop on Robotics & AI in Medicine0
From Binary to Bilingual: How the National Weather Service is Using Artificial Intelligence to Develop a Comprehensive Translation Program0
CytoSyn: a Foundation Diffusion Model for Histopathology -- Tech Report0
AGRI-Fidelity: Evaluating the Reliability of Listenable Explanations for Poultry Disease Detection0
Privacy-Preserving Machine Learning for IoT: A Cross-Paradigm Survey and Future Roadmap0
LICA: Layered Image Composition Annotations for Graphic Design Research0
DarkDriving: A Real-World Day and Night Aligned Dataset for Autonomous Driving in the Dark Environment0
Transfer Learning for Contextual Joint Assortment-Pricing under Cross-Market Heterogeneity0
Intellectual Stewardship: Re-adapting Human Minds for Creative Knowledge Work in the Age of AI0
LGESynthNet: Controlled Scar Synthesis for Improved Scar Segmentation in Cardiac LGE-MRI Imaging0
Universal Skeleton Understanding via Differentiable Rendering and MLLMs0
A Structured Nonparametric Framework for Nonlinear Accelerated Failure Time Models (KAN-AFT)0
Constrained Hybrid Metaheuristic: A Universal Framework for Continuous Optimisation0
Rule-Based Explanations for Retrieval-Augmented LLM Systems0
LLM-Augmented Computational Phenotyping of Long Covid0
Multi-Trait Subspace Steering to Reveal the Dark Side of Human-AI Interaction0
Stable Deep Reinforcement Learning via Isotropic Gaussian Representations0
Show:102550
← PrevPage 160 of 26463Next →