The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers | 248,104 code links | 4,818 tasks

Papers

Showing 1651–1700 of 659,983 papers

Title	Date	Status
An Agentic Approach to Generating XAI-Narratives	Mar 20, 2026	Unverified
ReViSQL: Achieving Human-Level Text-to-SQL	Mar 20, 2026	Unverified
Physics-Informed Long-Range Coulomb Correction for Machine-learning Hamiltonians	Mar 20, 2026	Unverified
AgenticRS-EnsNAS: Ensemble-Decoupled Self-Evolving Architecture Search	Mar 20, 2026	Unverified
Detached Skip-Links and R-Probe: Decoupling Feature Aggregation from Gradient Propagation for MLLM OCR	Mar 20, 2026	Unverified
Orchestrating Human-AI Software Delivery: A Retrospective Longitudinal Field Study of Three Software Modernization Programs	Mar 20, 2026	Unverified
CoverageBench: Evaluating Information Coverage across Tasks and Domains	Mar 20, 2026	Unverified
Continual Learning as Shared-Manifold Continuation Under Compatible Shift	Mar 20, 2026	Unverified
Federated Hyperdimensional Computing for Resource-Constrained Industrial IoT	Mar 20, 2026	Unverified
LoASR-Bench: Evaluating Large Speech Language Models on Low-Resource Automatic Speech Recognition Across Language Families	Mar 20, 2026	Unverified
Investigating a Policy-Based Formulation for Endoscopic Camera Pose Recovery	Mar 20, 2026	Unverified
Structured Latent Dynamics in Wireless CSI via Homomorphic World Models	Mar 20, 2026	Unverified
DIAL-KG: Schema-Free Incremental Knowledge Graph Construction via Dynamic Schema Induction and Evolution-Intent Assessment	Mar 20, 2026	Unverified
The End of Rented Discovery: How AI Search Redistributes Power Between Hotels and Intermediaries	Mar 20, 2026	Unverified
The monotonicity of the Franz-Parisi potential is equivalent with Low-degree MMSE lower bounds	Mar 20, 2026	Unverified
Antenna Array Beamforming Based on a Hybrid Quantum Optimization Framework	Mar 20, 2026	Unverified
A Unified Platform and Quality Assurance Framework for 3D Ultrasound Reconstruction with Robotic, Optical, and Electromagnetic Tracking	Mar 20, 2026	Unverified
Predicting States of Understanding in Explanatory Interactions Using Cognitive Load-Related Linguistic Cues	Mar 20, 2026	Unverified
Preference-Guided Debiasing for No-Reference Enhancement Image Quality Assessment	Mar 20, 2026	Unverified
How Out-of-Equilibrium Phase Transitions can Seed Pattern Formation in Trained Diffusion Models	Mar 20, 2026	Unverified
LLM-Enhanced Semantic Data Integration of Electronic Component Qualifications in the Aerospace Domain	Mar 20, 2026	Unverified
Pitfalls in Evaluating Interpretability Agents	Mar 20, 2026	Unverified
Spectral Alignment in Forward-Backward Representations via Temporal Abstraction	Mar 20, 2026	Unverified
Trojan horse hunt in deep forecasting models: Insights from the European Space Agency competition	Mar 20, 2026	Unverified
GO-GenZip: Goal-Oriented Generative Sampling and Hybrid Compression	Mar 20, 2026	Unverified
Var-JEPA: A Variational Formulation of the Joint-Embedding Predictive Architecture -- Bridging Predictive and Generative Self-Supervised Learning	Mar 20, 2026	Unverified
Demonstration of Adapt4Me: An Uncertainty-Aware Authoring Environment for Personalizing Automatic Speech Recognition to Non-normative Speech	Mar 20, 2026	Unverified
Current LLMs still cannot 'talk much' about grammar modules: Evidence from syntax	Mar 20, 2026	Unverified
Conditioning Protein Generation via Hopfield Pattern Multiplicity	Mar 20, 2026	Unverified
Chain-of-Adaptation: Surgical Vision-Language Adaptation with Reinforcement Learning	Mar 20, 2026	Unverified
Evolving Jailbreaks: Automated Multi-Objective Long-Tail Attacks on Large Language Models	Mar 20, 2026	Unverified
Generalizable NGP-SR: Generalizable Neural Radiance Fields Super-Resolution via Neural Graph Primitives	Mar 20, 2026	Unverified
An Agentic Multi-Agent Architecture for Cybersecurity Risk Management	Mar 20, 2026	Unverified
Revisiting Gene Ontology Knowledge Discovery with Hierarchical Feature Selection and Virtual Study Group of AI Agents	Mar 20, 2026	Unverified
Reasoning Gets Harder for LLMs Inside A Dialogue	Mar 20, 2026	Unverified
Can Large Multimodal Models Inspect Buildings? A Hierarchical Benchmark for Structural Pathology Reasoning	Mar 20, 2026	Unverified
Improving Generalization on Cybersecurity Tasks with Multi-Modal Contrastive Learning	Mar 20, 2026	Unverified
Enhancing Hyperspace Analogue to Language (HAL) Representations via Attention-Based Pooling for Text Classification	Mar 20, 2026	Unverified
Design-OS: A Specification-Driven Framework for Engineering System Design with a Control-Systems Design Case	Mar 20, 2026	Unverified
Beyond Single Tokens: Distilling Discrete Diffusion Models via Discrete MMD	Mar 20, 2026	Unverified
Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models	Mar 20, 2026	Unverified
Evaluating Evidence Grounding Under User Pressure in Instruction-Tuned Language Models	Mar 20, 2026	Unverified
The Robot's Inner Critic: Self-Refinement of Social Behaviors through VLM-based Replanning	Mar 20, 2026	Unverified
EgoForge: Goal-Directed Egocentric World Simulator	Mar 20, 2026	Unverified
Learning Dynamic Belief Graphs for Theory-of-mind Reasoning	Mar 20, 2026	Unverified
TinyML Enhances CubeSat Mission Capabilities	Mar 20, 2026	Unverified
LagerNVS: Latent Geometry for Fully Neural Real-time Novel View Synthesis	Mar 20, 2026	Unverified
AI Agents Can Already Autonomously Perform Experimental High Energy Physics	Mar 20, 2026	Unverified
Adaptive Greedy Frame Selection for Long Video Understanding	Mar 20, 2026	Unverified
VideoSeek: Long-Horizon Video Agent with Tool-Guided Seeking	Mar 20, 2026	Unverified