The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Recently Added | Most Hyped | Most Active | Needs Verification | Most Verified

Showing 9,401–9,450 of 661,570 papers

Title	Date	Status
Lindbladian Learning with Neural Differential Equations	Mar 8, 2026	Unverified
Vision Transformers that Never Stop Learning	Mar 8, 2026	Unverified
Scaling Data Difficulty: Improving Coding Models via Reinforcement Learning on Fresh and Challenging Problems	Mar 8, 2026	Unverified
ProgAgent: A Continual RL Agent with Progress-Aware Rewards	Mar 8, 2026	Unverified
OrdinalBench: A Benchmark Dataset for Diagnosing Generalization Limits in Ordinal Number Understanding of Vision-Language Models	Mar 8, 2026	Unverified
Neural Precoding in Complex Projective Spaces	Mar 8, 2026	Unverified
HybridStitch: Pixel and Timestep Level Model Stitching for Diffusion Acceleration	Mar 8, 2026	Unverified
Tracking Phenological Status and Ecological Interactions in a Hawaiian Cloud Forest Understory using Low-Cost Camera Traps and Visual Foundation Models	Mar 8, 2026	Unverified
An Efficient and Effective Evaluator for Text2SQL Models on Unseen and Unlabeled Data	Mar 8, 2026	Code Available
Column Generation for the Micro-Transit Zoning Problem	Mar 8, 2026	Unverified
Gradient Iterated Temporal-Difference Learning	Mar 8, 2026	Unverified
GazeShift: Unsupervised Gaze Estimation and Dataset for VR	Mar 8, 2026	Code Available
AI Misuse in Education Is a Measurement Problem: Toward a Learning Visibility Framework	Mar 8, 2026	Unverified
DistillGuard: Evaluating Defenses Against LLM Knowledge Distillation	Mar 8, 2026	Unverified
Training-free Temporal Object Tracking in Surgical Videos	Mar 8, 2026	Unverified
Intentional Deception as Controllable Capability in LLM Agents	Mar 8, 2026	Unverified
Generalized Reduction to the Isotropy for Flexible Equivariant Neural Fields	Mar 8, 2026	Unverified
EDMFormer: Genre-Specific Self-Supervised Learning for Music Structure Segmentation	Mar 8, 2026	Unverified
On the Formal Limits of Alignment Verification	Mar 8, 2026	Unverified
Clear, Compelling Arguments: Rethinking the Foundations of Frontier AI Safety Cases	Mar 8, 2026	Unverified
Benchmarking Large Language Models for Quebec Insurance: From Closed-Book to Retrieval-Augmented Generation	Mar 8, 2026	Unverified
MathSmith: Towards Extremely Hard Mathematical Reasoning by Forging Synthetic Problems with a Reinforced Policy	Mar 8, 2026	Code Available
Hide and Find: A Distributed Adversarial Attack on Federated Graph Learning	Mar 8, 2026	Unverified
Beyond Surrogates: A Quantitative Analysis for Inter-Metric Relationships	Mar 8, 2026	Unverified
Dual-Metric Evaluation of Social Bias in Large Language Models: Evidence from an Underrepresented Nepali Cultural Context	Mar 8, 2026	Unverified
Why Adam Can Beat SGD: Second-Moment Normalization Yields Sharper Tails	Mar 8, 2026	Unverified
Global Convergence of Average Reward Constrained MDPs with Neural Critic and General Policy Parameterization	Mar 8, 2026	Unverified
Toward Global Intent Inference for Human Motion by Inverse Reinforcement Learning	Mar 8, 2026	Unverified
Deliberative Dynamics and Value Alignment in LLM Debates	Mar 8, 2026	Unverified
Rigidity in LLM Bandits with Implications for Human-AI Dyads	Mar 8, 2026	Unverified
Step-Size Decay and Structural Stagnation in Greedy Sparse Learning	Mar 8, 2026	Unverified
AI-Driven Phase Identification from X-ray Hyperspectral Imaging of cycled Na-ion Cathode Materials	Mar 8, 2026	Unverified
Learning embeddings of non-linear PDEs: the Burgers' equation	Mar 8, 2026	Unverified
Skip to the Good Part: Representation Structure & Inference-Time Layer Skipping in Diffusion vs. Autoregressive LLMs	Mar 8, 2026	Unverified
Goal Alignment in LLM-Based User Simulators for Conversational AI	Mar 8, 2026	Unverified
ARM-FM: Automated Reward Machines via Foundation Models for Compositional Reinforcement Learning	Mar 8, 2026	Unverified
Model-Free Neural State Estimation in Nonlinear Dynamical Systems: Comparing Neural and Classical Filters	Mar 8, 2026	Unverified
Bitcoin Price Prediction using Machine Learning and Combinatorial Fusion Analysis	Mar 8, 2026	Unverified
Transferable Optimization Network for Cross-Domain Image Reconstruction	Mar 8, 2026	Unverified
DropVLA: An Action-Level Backdoor Attack on Vision-Language-Action Models	Mar 8, 2026	Unverified
UniUncer: Unified Dynamic Static Uncertainty for End to End Driving	Mar 8, 2026	Unverified
FusionRegister: Every Infrared and Visible Image Fusion Deserves Registration	Mar 8, 2026	Code Available
Compressed-Domain-Aware Online Video Super-Resolution	Mar 8, 2026	Code Available
MWM: Mobile World Models for Action-Conditioned Consistent Prediction	Mar 8, 2026	Code Available
Flow Matching Meets Biology and Life Science: A Survey	Mar 8, 2026	Code Available
Reverse Distillation: Consistently Scaling Protein Language Model Representations	Mar 8, 2026	Code Available
Learning Context-Adaptive Motion Priors for Masked Motion Diffusion Models with Efficient Kinematic Attention Aggregation	Mar 8, 2026	Code Available
TDM-R1: Reinforcing Few-Step Diffusion Models with Non-Differentiable Reward	Mar 8, 2026	Code Available
AI Steerability 360: A Toolkit for Steering Large Language Models	Mar 8, 2026	Code Available
ArcLight: A Lightweight LLM Inference Architecture for Many-Core CPUs	Mar 8, 2026	Code Available