SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 55265550 of 661570 papers

TitleStatusHype
Feed-forward Gaussian Registration for Head Avatar Creation and Editing0
ModTrack: Sensor-Agnostic Multi-View Tracking via Identity-Informed PHD Filtering with Covariance Propagation0
Spectral Hierarchy of the Cosmic Web0
When Stability Fails: Hidden Failure Modes Of LLMS in Data-Constrained Scientific Decision-Making0
FlashSampling: Fast and Memory-Efficient Exact SamplingCode0
Interpretative Interfaces: Designing for AI-Mediated Reading Practices and the Knowledge Commons0
Electrodermal Activity as a Unimodal Signal for Aerobic Exercise Detection in Wearable Sensors0
Temporal Fact Conflicts in LLMs: Reproducibility Insights from Unifying DYNAMICQA and MULAN0
COGNAC at SemEval-2026 Task 5: LLM Ensembles for Human-Level Word Sense Plausibility Rating in Challenging Narratives0
Federated Learning for Privacy-Preserving Medical AI0
Agent-based imitation dynamics can yield efficiently compressed population-level vocabularies0
Game-Theory-Assisted Reinforcement Learning for Border Defense: Early Termination based on Analytical Solutions0
Prompt Engineering for Scale Development in Generative Psychometrics0
Auto Researching, not hyperparameter tuning: Convergence Analysis of 10,000 Experiments0
Bayesian-guided inverse design of hyperelastic microstructures: Application to stochastic metamaterials0
Sparse but not Simpler: A Multi-Level Interpretability Analysis of Vision Transformers0
Evaluating Agentic Optimization on Large Codebases0
Generative Inverse Design with Abstention via Diagonal Flow Matching0
Discovery of interaction and diffusion kernels in particle-to-mean-field multi-agent systems0
Nodule-Aligned Latent Space Learning with LLM-Driven Multimodal Diffusion for Lung Nodule Progression Prediction0
Do Not Leave a Gap: Hallucination-Free Object Concealment in Vision-Language Models0
Towards Fair and Robust Volumetric CT Classification via KL-Regularised Group Distributionally Robust Optimisation0
Argumentative Human-AI Decision-Making: Toward AI Agents That Reason With Us, Not For Us0
BANGLASOCIALBENCH: A Benchmark for Evaluating Sociopragmatic and Cultural Alignment of LLMs in Bangladeshi Social Interaction0
Protein Design with Agent Rosetta: A Case Study for Specialized Scientific Agents0
Show:102550
← PrevPage 222 of 26463Next →