SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 1501 to 1525 of 659,983 papers

Title | Status | Hype
On the Ability of Transformers to Verify Plans |  | 0
HiPath: Hierarchical Vision-Language Alignment for Structured Pathology Report Prediction |  | 0
Trojan's Whisper: Stealthy Manipulation of OpenClaw through Injected Bootstrapped Guidance |  | 0
Fine-tuning Timeseries Predictors Using Reinforcement Learning |  | 0
An Empirical Study of SFT-DPO Interaction and Parameterization in Small Language Models |  | 0
Synergistic Perception and Generative Recomposition: A Multi-Agent Orchestration for Expert-Level Building Inspection |  | 0
MoCA3D: Monocular 3D Bounding Box Prediction in the Image Plane |  | 0
Pedestrian Crossing Intent Prediction via Psychological Features and Transformer Fusion |  | 0
Behavioral Engagement in VR-Based Sign Language Learning: Visual Attention as a Predictor of Performance and Temporal Dynamics |  | 0
FDARxBench: Benchmarking Regulatory and Clinical Reasoning on FDA Generic Drug Assessment |  | 0
Scalable Cross-Facility Federated Learning for Scientific Foundation Models on Multiple Supercomputers |  | 0
Verifiable Error Bounds for Physics-Informed Neural Network Solutions of Lyapunov and Hamilton-Jacobi-Bellman Equations |  | 0
Efficiency Follows Global-Local Decoupling |  | 0
Subspace Kernel Learning on Tensor Sequences |  | 0
SeeClear: Reliable Transparent Object Depth Estimation via Generative Opacification |  | 0
Plagiarism or Productivity? Students Moral Disengagement and Behavioral Intentions to Use ChatGPT in Academic Writing |  | 0
Learning to Bet for Horizon-Aware Anytime-Valid Testing |  | 0
StreetForward: Perceiving Dynamic Street with Feedforward Causal Attention |  | 0
TextReasoningBench: Does Reasoning Really Improve Text Classification in Large Language Models? |  | 0
Optimal Scalar Quantization for Matrix Multiplication: Closed-Form Density and Phase Transition |  | 0
Neural Uncertainty Principle: A Unified View of Adversarial Fragility and LLM Hallucination |  | 0
Accelerating Diffusion Decoders via Multi-Scale Sampling and One-Step Distillation |  | 0
AI Psychosis: Does Conversational AI Amplify Delusion-Related Language? |  | 0
PA2D-MORL: Pareto Ascent Directional Decomposition based Multi-Objective Reinforcement Learning |  | 0
Evolving Embodied Intelligence: Graph Neural Network-Driven Co-Design of Morphology and Control in Soft Robotics |  | 0