SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 2,976–3,000 of 661,570 papers

Title | Status | Hype
LLM-Driven Heuristic Synthesis for Industrial Process Control: Lessons from Hot Steel Rolling | | 0
Understanding Behavior Cloning with Action Quantization | | 0
Benchmarking Efficient & Effective Camera Pose Estimation Strategies for Novel View Synthesis | | 0
Forward and inverse problems for measure flows in Bayes Hilbert spaces | | 0
Bounded Coupled AI Learning Dynamics in Tri-Hierarchical Drone Swarms | | 0
Procedural Refinement by LLM-driven Algorithmic Debugging for ARC-AGI-2 | | 0
Hybrid Autoencoder-Isolation Forest approach for time series anomaly detection in C70XP cyclotron operation data at ARRONAX | | 0
ContractSkill: Repairable Contract-Based Skills for Multimodal Web Agents | | 0
Interpretable Multiple Myeloma Prognosis with Observational Medical Outcomes Partnership Data | | 0
The production of meaning in the processing of natural language | | 0
Uni-Classifier: Leveraging Video Diffusion Priors for Universal Guidance Classifier | | 0
Multi-Stage Fine-Tuning of Pathology Foundation Models with Head-Diverse Ensembling for White Blood Cell Classification | | 0
Jigsaw Regularization in Whole-Slide Image Classification | | 0
From Cross-Validation to SURE: Asymptotic Risk of Tuned Regularized Estimators | | 0
A chemical language model for reticular materials design | | 0
CAMA: Exploring Collusive Adversarial Attacks in c-MARL | | 0
Monocular Models are Strong Learners for Multi-View Human Mesh Recovery | | 0
SymCircuit: Bayesian Structure Inference for Tractable Probabilistic Circuits via Entropy-Regularized Reinforcement Learning | | 0
Thinking in Different Spaces: Domain-Specific Latent Geometry Survives Cross-Architecture Translation | | 0
Meta-Learning for Repeated Bayesian Persuasion | | 0
SLE-FNO: Single-Layer Extensions for Task-Agnostic Continual Learning in Fourier Neural Operators | | 0
Data-driven discovery of roughness descriptors for surface characterization and intimate contact modeling of unidirectional composite tapes | | 0
CERN: Correcting Errors in Raw Nanopore Signals Using Hidden Markov Models | | 0
Hawkeye: Reproducing GPU-Level Non-Determinism | | 0
PEARL: Personalized Streaming Video Understanding Model | | 0
Page 120 of 26,463