SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 20012025 of 661570 papers

TitleStatusHype
Stepwise Variational Inference with Vine Copulas0
Asymptotic Learning Curves for Diffusion Models with Random Features Score and Manifold Data0
A Critical Review on the Effectiveness and Privacy Threats of Membership Inference Attacks0
Robustness Quantification and Uncertainty Quantification: Comparing Two Methods for Assessing the Reliability of Classifier Predictions0
VLA-IAP: Training-Free Visual Token Pruning via Interaction Alignment for Vision-Language-Action Models0
Minibal: Balanced Game-Playing Without Opponent Modeling0
Efficient Benchmarking of AI Agents0
IslamicMMLU: A Benchmark for Evaluating LLMs on Islamic Knowledge0
IJmond Industrial Smoke Segmentation Dataset0
Self Paced Gaussian Contextual Reinforcement Learning0
Learning Cross-Joint Attention for Generalizable Video-Based Seizure Detection0
Towards a general-purpose foundation model for fMRI analysis0
UniCA: Unified Covariate Adaptation for Time Series Foundation Model0
Children's Intelligence Tests Pose Challenges for MLLMs? KidGym: A 2D Grid-Based Reasoning Benchmark for MLLMs0
CRoCoDiL: Continuous and Robust Conditioned Diffusion for Language0
An Industrial-Scale Retrieval-Augmented Generation Framework for Requirements Engineering: Empirical Evaluation with Automotive Manufacturing Data0
GHOST: Ground-projected Hypotheses from Observed Structure-from-Motion Trajectories0
MKA: Memory-Keyed Attention for Efficient Long-Context Reasoning0
ScaleEdit-12M: Scaling Open-Source Image Editing Data Generation via Multi-Agent Framework0
Exponential Family Discriminant Analysis: Generalizing LDA-Style Generative Classification to Non-Gaussian Models0
Towards Intelligent Geospatial Data Discovery: a knowledge graph-driven multi-agent framework powered by large language models0
PiLoT: Neural Pixel-to-3D Registration for UAV-based Ego and Target Geo-localization0
LPNSR: Prior-Enhanced Diffusion Image Super-Resolution via LR-Guided Noise Prediction0
2Xplat: Two Experts Are Better Than One Generalist0
Cerebra: A Multidisciplinary AI Board for Multimodal Dementia Characterization and Risk Assessment0
Show:102550
← PrevPage 81 of 26463Next →