SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 34763500 of 661570 papers

TitleStatusHype
The Convergence Frontier: Integrating Machine Learning and High Performance Quantum Computing for Next-Generation Drug Discovery0
TransText: Alpha-as-RGB Representation for Transparent Text Animation0
TDAD: Test-Driven Agentic Development - Reducing Code Regressions in AI Coding Agents via Graph-Based Impact AnalysisCode0
Pixel-Accurate Epipolar Guided Matching0
WASD: Locating Critical Neurons as Sufficient Conditions for Explaining and Controlling LLM Behavior0
SynQ: Accurate Zero-shot Quantization by Synthesis-aware Fine-tuning0
PowerFlow: Unlocking the Dual Nature of LLMs via Principled Distribution Matching0
From Weak Cues to Real Identities: Evaluating Inference-Driven De-Anonymization in LLM Agents0
Evolutionarily Stable Stackelberg Equilibrium0
Reflection in the Dark: Exposing and Escaping the Black Box in Reflective Prompt Optimization0
An SO(3)-equivariant reciprocal-space neural potential for long-range interactions0
AutoScreen-FW: An LLM-based Framework for Resume Screening0
Computational and Statistical Hardness of Calibration Distance0
FlowMS: Flow Matching for De Novo Structure Elucidation from Mass Spectra0
TARo: Token-level Adaptive Routing for LLM Test-time Alignment0
Statistical Testing Framework for Clustering Pipelines by Selective Inference0
The Spillover Effects of Peer AI Rinsing on Corporate Green Innovation0
AcceRL: A Distributed Asynchronous Reinforcement Learning and World Model Framework for Vision-Language-Action Models0
Mind the Rarities: Can Rare Skin Diseases Be Reliably Diagnosed via Diagnostic Reasoning?0
HOMEY: Heuristic Object Masking with Enhanced YOLO for Property Insurance Risk Detection0
From Topic to Transition Structure: Unsupervised Concept Discovery at Corpus Scale via Predictive Associative Memory0
Prune-then-Quantize or Quantize-then-Prune? Understanding the Impact of Compression Order in Joint Model Compression0
Adaptive Decoding via Test-Time Policy Learning for Self-Improving Generation0
Towards Noise-Resilient Quantum Multi-Armed and Stochastic Linear Bandits0
UT-ACA: Uncertainty-Triggered Adaptive Context Allocation for Long-Context Inference0
Show:102550
← PrevPage 140 of 26463Next →