SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 3326-3350 of 661,570 papers

| Title | Status | Hype |
| --- | --- | --- |
| Exploring the Agentic Frontier of Verilog Code Generation | | 0 |
| Anatomical Heterogeneity in Transformer Language Models | | 0 |
| A Mathematical Theory of Understanding | | 0 |
| A Novel Solution for Zero-Day Attack Detection in IDS using Self-Attention and Jensen-Shannon Divergence in WGAN-GP | | 0 |
| Warm-Start Flow Matching for Guaranteed Fast Text/Image Generation | | 0 |
| Factored Levenberg-Marquardt for Diffeomorphic Image Registration: An efficient optimizer for FireANTs | | 0 |
| Automated Membership Inference Attacks: Discovering MIA Signal Computations using LLM Agents | | 0 |
| Bridging Conformal Prediction and Scenario Optimization: Discarded Constraints and Modular Risk Allocation | | 0 |
| Optimizing Resource-Constrained Non-Pharmaceutical Interventions for Multi-Cluster Outbreak Control Using Hierarchical Reinforcement Learning | | 0 |
| Scalable Prompt Routing via Fine-Grained Latent Task Discovery | | 0 |
| Investigating In-Context Privacy Learning by Integrating User-Facing Privacy Tools into Conversational Agents | | 0 |
| The Autonomy Tax: Defense Training Breaks LLM Agents | | 0 |
| Is Evaluation Awareness Just Format Sensitivity? Limitations of Probe-Based Evidence under Controlled Prompt Structure | | 0 |
| Vocabulary shapes cross-lingual variation of word-order learnability in language models | | 0 |
| When both Grounding and not Grounding are Bad -- A Partially Grounded Encoding of Planning into SAT (Extended Version) | | 0 |
| Subspace Projection Methods for Fast Spectral Embeddings of Evolving Graphs | | 0 |
| Near-Equivalent Q-learning Policies for Dynamic Treatment Regimes | | 0 |
| LoFi: Location-Aware Fine-Grained Representation Learning for Chest X-ray | | 0 |
| TrustFlow: Topic-Aware Vector Reputation Propagation for Multi-Agent Ecosystems | | 0 |
| Adaptive Layerwise Perturbation: Unifying Off-Policy Corrections for LLM RL | | 0 |
| In-the-Wild Camouflage Attack on Vehicle Detectors through Controllable Image Editing | | 0 |
| GeoLAN: Geometric Learning of Latent Explanatory Directions in Large Language Models | | 0 |
| Deep Hilbert--Galerkin Methods for Infinite-Dimensional PDEs and Optimal Control | | 0 |
| Hyperagents | | 4 |
| Global Convergence of Multiplicative Updates for the Matrix Mechanism: A Collaborative Proof with Gemini 3 | | 0 |
Page 134 of 26,463