SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 15011550 of 659983 papers

TitleStatusHype
On the Ability of Transformers to Verify Plans0
HiPath: Hierarchical Vision-Language Alignment for Structured Pathology Report Prediction0
Trojan's Whisper: Stealthy Manipulation of OpenClaw through Injected Bootstrapped Guidance0
Fine-tuning Timeseries Predictors Using Reinforcement Learning0
An Empirical Study of SFT-DPO Interaction and Parameterization in Small Language Models0
Synergistic Perception and Generative Recomposition: A Multi-Agent Orchestration for Expert-Level Building Inspection0
MoCA3D: Monocular 3D Bounding Box Prediction in the Image Plane0
Pedestrian Crossing Intent Prediction via Psychological Features and Transformer Fusion0
Behavioral Engagement in VR-Based Sign Language Learning: Visual Attention as a Predictor of Performance and Temporal Dynamics0
FDARxBench: Benchmarking Regulatory and Clinical Reasoning on FDA Generic Drug Assessment0
Scalable Cross-Facility Federated Learning for Scientific Foundation Models on Multiple Supercomputers0
Verifiable Error Bounds for Physics-Informed Neural Network Solutions of Lyapunov and Hamilton-Jacobi-Bellman Equations0
Efficiency Follows Global-Local Decoupling0
Subspace Kernel Learning on Tensor Sequences0
SeeClear: Reliable Transparent Object Depth Estimation via Generative Opacification0
Plagiarism or Productivity? Students Moral Disengagement and Behavioral Intentions to Use ChatGPT in Academic Writing0
Learning to Bet for Horizon-Aware Anytime-Valid Testing0
StreetForward: Perceiving Dynamic Street with Feedforward Causal Attention0
TextReasoningBench: Does Reasoning Really Improve Text Classification in Large Language Models?0
Optimal Scalar Quantization for Matrix Multiplication: Closed-Form Density and Phase Transition0
Neural Uncertainty Principle: A Unified View of Adversarial Fragility and LLM Hallucination0
Accelerating Diffusion Decoders via Multi-Scale Sampling and One-Step Distillation0
AI Psychosis: Does Conversational AI Amplify Delusion-Related Language?0
PA2D-MORL: Pareto Ascent Directional Decomposition based Multi-Objective Reinforcement Learning0
Evolving Embodied Intelligence: Graph Neural Network--Driven Co-Design of Morphology and Control in Soft Robotics0
Skilled AI Agents for Embedded and IoT Systems Development0
PowerLens: Taming LLM Agents for Safe and Personalized Mobile Power Management0
Data-driven ensemble prediction of the global ocean0
ARMOR: Adaptive Resilience Against Model Poisoning Attacks in Continual Federated Learning for Mobile Indoor Localization0
All-Mem: Agentic Lifelong Memory via Dynamic Topology Evolution0
FlowScene: Style-Consistent Indoor Scene Generation with Multimodal Graph Rectified Flow0
Physics-Informed Neural Network with Adaptive Clustering Learning Mechanism for Information Popularity Prediction0
K-GMRF: Kinetic Gauss-Markov Random Field for First-Principles Covariance Tracking on Lie Groups0
Beyond Quadratic: Linear-Time Change Detection with RWKV0
Physion-Eval: Evaluating Physical Realism in Generated Video via Human Reasoning0
FB-CLIP: Fine-Grained Zero-Shot Anomaly Detection with Foreground-Background Disentanglement0
LoD-Loc v3: Generalized Aerial Localization in Dense Cities using Instance Silhouette Alignment0
ParallelVLM: Lossless Video-LLM Acceleration with Visual Alignment Aware Parallel Speculative Decoding0
Demonstrations, CoT, and Prompting: A Theoretical Analysis of ICL0
OrbitNVS: Harnessing Video Diffusion Priors for Novel View Synthesis0
CAF-Score: Calibrating CLAP with LALMs for Reference-free Audio Captioning EvaluationCode0
UniPR: Unified Object-level Real-to-Sim Perception and Reconstruction from a Single Stereo Pair0
On Performance Guarantees for Federated Learning with Personalized Constraints0
DeepStock: Reinforcement Learning with Policy Regularizations for Inventory Management0
Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement0
IUP-Pose: Decoupled Iterative Uncertainty Propagation for Real-time Relative Pose Regression via Implicit Dense Alignment v10
On the role of memorization in learned priors for geophysical inverse problems0
Alternating Diffusion for Proximal Sampling with Zeroth Order Queries0
MetaCues: Enabling Critical Engagement with Generative AI for Information Seeking and Sensemaking0
BEAVER: A Training-Free Hierarchical Prompt Compression Method via Structure-Aware Page Selection0
Show:102550
← PrevPage 31 of 13200Next →