SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 2826-2850 of 661,570 papers

Title | Status | Hype
Beyond the Academic Monoculture: A Unified Framework and Industrial Perspective for Attributed Graph Clustering | | 0
Governance-Aware Vector Subscriptions for Multi-Agent Knowledge Ecosystems | | 0
Memory-Efficient Fine-Tuning Diffusion Transformers via Dynamic Patch Sampling and Block Skipping | | 0
TAFG-MAN: Timestep-Adaptive Frequency-Gated Latent Diffusion for Efficient and High-Quality Low-Dose CT Image Denoising | | 0
ReLaMix: Residual Latency-Aware Mixing for Delay-Robust Financial Time-Series Forecasting | | 0
Incentive-Aware Federated Averaging with Performance Guarantees under Strategic Participation | | 0
RubricRAG: Towards Interpretable and Reliable LLM Evaluation via Domain Knowledge Retrieval for Rubric Generation | | 0
NoveltyAgent: Autonomous Novelty Reporting Agent with Point-wise Novelty Analysis and Self-Validation | | 0
Characterizing the onset and offset of motor imagery during passive arm movements induced by an upper-body exoskeleton | | 0
Scene Graph-guided SegCaptioning Transformer with Fine-grained Alignment for Controllable Video Segmentation and Captioning | | 0
Auto-differentiable data assimilation: Co-learning of states, dynamics, and filtering algorithms | | 0
LLM Router: Prefill is All You Need | | 0
Beyond the Birkhoff Polytope: Spectral-Sphere-Constrained Hyper-Connections | | 0
The data heat island effect: quantifying the impact of AI data centers in a warming world | | 0
Natural Gradient Descent for Online Continual Learning | | 0
Mitigating Shortcut Reasoning in Language Models: A Gradient-Aware Training Approach | | 0
The Hidden Puppet Master: A Theoretical and Real-World Account of Emotional Manipulation in LLMs | | 0
Bayesian Scattering: A Principled Baseline for Uncertainty on Image Data | | 0
LLM-ODE: Data-driven Discovery of Dynamical Systems with Large Language Models | | 0
Do LLM-Driven Agents Exhibit Engagement Mechanisms? Controlled Tests of Information Load, Descriptive Norms, and Popularity Cues | | 0
Enhancing LIME using Neural Decision Trees | | 0
Democratizing AI: A Comparative Study in Deep Learning Efficiency and Future Trends in Computational Processing | | 0
Discriminative Representation Learning for Clinical Prediction | | 0
Profit is the Red Team: Stress-Testing Agents in Strategic Economic Interactions | | 0
MOELIGA: a multi-objective evolutionary approach for feature selection with local improvement | | 0
Page 114 of 26,463