The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 5551–5600 of 661570 papers

Title	Date	Status	Hype
A Family of LLMs Liberated from Static Vocabularies	Mar 16, 2026	—Unverified	0
Robust Language Identification for Romansh Varieties	Mar 16, 2026	—Unverified	0
UMO: Unified In-Context Learning Unlocks Motion Foundation Model Priors	Mar 16, 2026	—Unverified	0
An Agentic Evaluation Framework for AI-Generated Scientific Code in PETSc	Mar 16, 2026	—Unverified	0
Standardizing Medical Images at Scale for AI	Mar 16, 2026	—Unverified	0
Aligning Paralinguistic Understanding and Generation in Speech LLMs via Multi-Task Reinforcement Learning	Mar 16, 2026	—Unverified	0
Determinism in the Undetermined: Deterministic Output in Charge-Conserving Continuous-Time Neuromorphic Systems with Temporal Stochasticity	Mar 16, 2026	—Unverified	0
The Midas Touch in Gaze vs. Hand Pointing: Modality-Specific Failure Modes and Implications for XR Interfaces	Mar 16, 2026	—Unverified	0
Mostly Text, Smart Visuals: Asymmetric Text-Visual Pruning for Large Vision-Language Models	Mar 16, 2026	—Unverified	0
Understanding Moral Reasoning Trajectories in Large Language Models: Toward Probing-Based Explainability	Mar 16, 2026	—Unverified	0
IRAM-Omega-Q: A Computational Architecture for Uncertainty Regulation in Artificial Agents	Mar 16, 2026	—Unverified	0
Agentic Exploration of Physics Models	Mar 16, 2026	—Unverified	0
Balancing Saliency and Coverage: Semantic Prominence-Aware Budgeting for Visual Token Compression in VLMs	Mar 16, 2026	—Unverified	0
Describing Agentic AI Systems with C4: Lessons from Industry Projects	Mar 16, 2026	—Unverified	0
POLAR:A Per-User Association Test in Embedding Space	Mar 16, 2026	CodeCode Available	0
GASP: Guided Asymmetric Self-Play For Coding LLMs	Mar 16, 2026	—Unverified	0
MAC: Multi-Agent Constitution Learning	Mar 16, 2026	—Unverified	0
Datasets for Verb Alternations across Languages: BLM Templates and Data Augmentation Strategies	Mar 16, 2026	—Unverified	0
RoCo Challenge at AAAI 2026: Benchmarking Robotic Collaborative Manipulation for Assembly Towards Industrial Automation	Mar 16, 2026	—Unverified	0
Learning Latent Proxies for Controllable Single-Image Relighting	Mar 16, 2026	—Unverified	0
From Text to Forecasts: Bridging Modality Gap with Temporal Evolution Semantic Space	Mar 16, 2026	—Unverified	0
Embedding Compression via Spherical Coordinates	Mar 16, 2026	—Unverified	0
OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data	Mar 16, 2026	—Unverified	3
Prompt Readiness Levels (PRL): a maturity scale and scoring framework for production grade prompt assets	Mar 16, 2026	—Unverified	0
PCodeTrans: Translate Decompiled Pseudocode to Compilable and Executable Equivalent	Mar 16, 2026	—Unverified	0
Massive Redundancy in Gradient Transport Enables Sparse Online Learning	Mar 16, 2026	—Unverified	0
AI Evasion and Impersonation Attacks on Facial Re-Identification with Activation Map Explanations	Mar 16, 2026	—Unverified	0
ELISA: An Interpretable Hybrid Generative AI Agent for Expression-Grounded Discovery in Single-Cell Genomics	Mar 16, 2026	CodeCode Available	0
Fold-CP: A Context Parallelism Framework for Biomolecular Modeling	Mar 16, 2026	—Unverified	0
Active Seriation: Efficient Ordering Recovery with Statistical Guarantees	Mar 16, 2026	—Unverified	0
A WDLoRA-Based Multimodal Generative Framework for Clinically Guided Corneal Confocal Microscopy Image Synthesis in Diabetic Neuropathy	Mar 16, 2026	—Unverified	0
OrgForge: A Multi-Agent Simulation Framework for Verifiable Synthetic Corporate Corpora	Mar 16, 2026	—Unverified	0
CyCLeGen: Cycle-Consistent Layout Prediction and Image Generation in Vision Foundation Models	Mar 16, 2026	—Unverified	0
Evolutionary Transfer Learning for Dragonchess	Mar 16, 2026	—Unverified	0
Counteractive RL: Rethinking Core Principles for Efficient and Scalable Deep Reinforcement Learning	Mar 16, 2026	—Unverified	0
Resilience Meets Autonomy: Governing Embodied AI in Critical Infrastructure	Mar 16, 2026	—Unverified	0
Persistent Autoregressive Mapping with Traffic Rules for Autonomous Driving	Mar 16, 2026	—Unverified	0
Deterministic Policy Gradient for Reinforcement Learning with Continuous Time and State	Mar 16, 2026	—Unverified	0
Rethinking LLM Watermark Detection in Black-Box Settings: A Non-Intrusive Third-Party Framework	Mar 16, 2026	—Unverified	0
Interpretable Predictability-Based AI Text Detection: A Replication Study	Mar 16, 2026	—Unverified	0
Detection of Autonomous Shuttles in Urban Traffic Images Using Adaptive Residual Context	Mar 16, 2026	—Unverified	0
Self-supervised Disentanglement of Disease Effects from Aging in 3D Medical Shapes	Mar 16, 2026	CodeCode Available	0
Learning to Recall with Transformers Beyond Orthogonal Embeddings	Mar 16, 2026	—Unverified	0
Learning Question-Aware Keyframe Selection with Synthetic Supervision for Video Question Answering	Mar 16, 2026	—Unverified	0
Machine learning for sustainable geoenergy: uncertainty, physics and decision-ready inference	Mar 16, 2026	—Unverified	0
Mathematical Foundations of Polyphonic Music Generation via Structural Inductive Bias	Mar 16, 2026	—Unverified	0
Consequentialist Objectives and Catastrophe	Mar 16, 2026	—Unverified	0
Efficient Document Parsing via Parallel Token Prediction	Mar 16, 2026	—Unverified	0
Criterion-referenceability determines LLM-as-a-judge validity across physics assessment formats	Mar 16, 2026	—Unverified	0
SWE-Skills-Bench: Do Agent Skills Actually Help in Real-World Software Engineering?	Mar 16, 2026	CodeCode Available	0