SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 17761800 of 659983 papers

TitleStatusHype
POET: Power-Oriented Evolutionary Tuning for LLM-Based RTL PPA Optimization0
Do Post-Training Algorithms Actually Differ? A Controlled Study Across Model Scales Uncovers Scale-Dependent Ranking Inversions0
Diffusion-Guided Semantic Consistency for Multimodal Heterogeneity0
Spectral Tempering for Embedding Compression in Dense Passage Retrieval0
Beyond Weighted Summation: Learnable Nonlinear Aggregation Functions for Robust Artificial Neurons0
Exploring the Agentic Frontier of Verilog Code Generation0
Anatomical Heterogeneity in Transformer Language Models0
A Mathematical Theory of Understanding0
A Novel Solution for Zero-Day Attack Detection in IDS using Self-Attention and Jensen-Shannon Divergence in WGAN-GP0
Warm-Start Flow Matching for Guaranteed Fast Text/Image Generation0
Factored Levenberg-Marquardt for Diffeomorphic Image Registration: An efficient optimizer for FireANTs0
Automated Membership Inference Attacks: Discovering MIA Signal Computations using LLM Agents0
Bridging Conformal Prediction and Scenario Optimization: Discarded Constraints and Modular Risk Allocation0
Optimizing Resource-Constrained Non-Pharmaceutical Interventions for Multi-Cluster Outbreak Control Using Hierarchical Reinforcement Learning0
Scalable Prompt Routing via Fine-Grained Latent Task Discovery0
Investigating In-Context Privacy Learning by Integrating User-Facing Privacy Tools into Conversational Agents0
The Autonomy Tax: Defense Training Breaks LLM Agents0
Is Evaluation Awareness Just Format Sensitivity? Limitations of Probe-Based Evidence under Controlled Prompt Structure0
Vocabulary shapes cross-lingual variation of word-order learnability in language models0
When both Grounding and not Grounding are Bad -- A Partially Grounded Encoding of Planning into SAT (Extended Version)0
Subspace Projection Methods for Fast Spectral Embeddings of Evolving Graphs0
Near-Equivalent Q-learning Policies for Dynamic Treatment Regimes0
LoFi: Location-Aware Fine-Grained Representation Learning for Chest X-ray0
TrustFlow: Topic-Aware Vector Reputation Propagation for Multi-Agent Ecosystems0
Adaptive Layerwise Perturbation: Unifying Off-Policy Corrections for LLM RL0
Show:102550
← PrevPage 72 of 26400Next →