SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 11201-11250 of 661570 papers

Title | Status | Hype
Oracle-efficient Hybrid Learning with Constrained Adversaries | | 0
Adaptive Memory Admission Control for LLM Agents | | 0
Weather-Related Crash Risk Forecasting: A Deep Learning Approach for Heterogenous Spatiotemporal Data | | 0
Structure-Guided Histopathology Synthesis via Dual-LoRA Diffusion | | 0
Self-Attribution Bias: When AI Monitors Go Easy on Themselves | | 0
PinPoint: Evaluation of Composed Image Retrieval with Explicit Negatives, Multi-Image Queries, and Paraphrase Testing | | 0
PDE foundation model-accelerated inverse estimation of system parameters in inertial confinement fusion | | 0
SGR3 Model: Scene Graph Retrieval-Reasoning Model in 3D | | 0
K-Means as a Radial Basis function Network: a Variational and Gradient-based Equivalence | | 0
Spinverse: Differentiable Physics for Permeability-Aware Microstructure Reconstruction from Diffusion MRI | | 0
Coordinated Semantic Alignment and Evidence Constraints for Retrieval-Augmented Generation with Large Language Models | | 0
When Sensors Fail: Temporal Sequence Models for Robust PPO under Sensor Drift | | 0
iAgentBench: Benchmarking Sensemaking Capabilities of Information-Seeking Agents on High-Traffic Topics | | 0
GIANT - Global Path Integration and Attentive Graph Networks for Multi-Agent Trajectory Planning | | 0
Decoding the Pulse of Reasoning VLMs in Multi-Image Understanding Tasks | | 0
Direct Estimation of Tree Volume and Aboveground Biomass Using Deep Regression with Synthetic Lidar Data | | 0
Edges Are All You Need: Robust Gait Recognition via Label-Free Structure | | 0
VDCook:DIY video data cook your MLLMs | | 0
An intuitive rearranging of the Yates covariance decomposition for probabilistic verification of forecasts with the Brier score | | 0
Digital-Twin Losses for Lane-Compliant Trajectory Prediction at Urban Intersections | | 0
Latent-IMH: Efficient Bayesian Inference for Inverse Problems with Approximate Operators | | 0
Beyond Dominant Patches: Spatial Credit Redistribution For Grounded Vision-Language Models | | 0
Clinical-Injection Transformer with Domain-Adapted MAE for Lupus Nephritis Prognosis Prediction | | 0
GraphMERT: Efficient and Scalable Distillation of Reliable Knowledge Graphs from Unstructured Data | | 0
Activation Outliers in Transformer Quantization: Reproduction, Statistical Analysis, and Deployment Tradeoffs | | 0
Detecting AI-Generated Essays in Writing Assessment: Responsible Use and Generalizability Across LLMs | | 0
A Late-Fusion Multimodal AI Framework for Privacy-Preserving Deduplication in National Healthcare Data Environments | | 0
Enhancing Authorship Attribution with Synthetic Paintings | | 0
ZeSTA: Zero-Shot TTS Augmentation with Domain-Conditioned Training for Data-Efficient Personalized Speech Synthesis | | 0
SimpliHuMoN: Simplifying Human Motion Prediction | | 0
When Do Language Models Endorse Limitations on Human Rights Principles? | | 0
Projected Hessian Learning: Fast Curvature Supervision for Accurate Machine-Learning Interatomic Potentials | | 0
Attention Meets Reachability: Structural Equivalence and Efficiency in Grammar-Constrained LLM Decoding | | 0
Out-of-distribution transfer of PDE foundation models to material dynamics under extreme loading | | 0
Universal Coefficients and Mayer-Vietoris Sequence for Groupoid Homology | | 0
CoRPO: Adding a Correctness Bias to GRPO Improves Generalization | | 0
Mask-aware inference with State-Space Models | | 0
Fusion and Grouping Strategies in Deep Learning for Local Climate Zone Classification of Multimodal Remote Sensing Data | Code | 0
Towards automated data analysis: A guided framework for LLM-based risk estimation | | 0
When Agents Persuade: Propaganda Generation and Mitigation in LLMs | | 0
Out-of-Support Generalisation via Weight-Space Sequence Modelling | | 0
Learning to Drive is a Free Gift: Large-Scale Label-Free Autonomy Pretraining from Unposed In-The-Wild Videos | | 0
Causality Elicitation from Large Language Models | | 0
YuriiFormer: A Suite of Nesterov-Accelerated Transformers | | 0
Nearest-Neighbor Density Estimation for Dependency Suppression | | 0
Robustness of Agentic AI Systems via Adversarially-Aligned Jacobian Regularization | | 0
Optimal Prediction-Augmented Algorithms for Testing Independence of Distributions | | 0
A Dual-Helix Governance Approach Towards Reliable Agentic AI for WebGIS Development | | 0
InverseNet: Benchmarking Operator Mismatch and Calibration Across Compressive Imaging Modalities | | 0
Stan: An LLM-based thermodynamics course assistant | | 0
Page 225 of 13232