The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 8001–8050 of 661570 papers

Title	Date	Status
A Systematic Study of Pseudo-Relevance Feedback with LLMs	Mar 11, 2026	—Unverified
Algorithmic Capture, Computational Complexity, and Inductive Bias of Infinite Transformers	Mar 11, 2026	—Unverified
Instruction set for the representation of graphs	Mar 11, 2026	—Unverified
V2M-Zero: Zero-Pair Time-Aligned Video-to-Music Generation	Mar 11, 2026	—Unverified
Agentar-Fin-OCR	Mar 11, 2026	—Unverified
Text-Trained LLMs Can Zero-Shot Extrapolate PDE Dynamics, Revealing a Three-Stage In-Context Learning Mechanism	Mar 11, 2026	—Unverified
Domain Feature Collapse: Implications for Out-of-Distribution Detection and Solutions	Mar 11, 2026	—Unverified
MedMO: Grounding and Understanding Multimodal Large Language Model for Medical Images	Mar 11, 2026	—Unverified
Grow with the Flow: 4D Reconstruction of Growing Plants with Gaussian Flow Fields	Mar 11, 2026	—Unverified
Learning to Unscramble: Simplifying Symbolic Expressions via Self-Supervised Oracle Trajectories	Mar 11, 2026	—Unverified
On the Value of Tokeniser Pretraining in Physics Foundation Models	Mar 11, 2026	—Unverified
Human-Aware Robot Behaviour in Self-Driving Labs	Mar 11, 2026	—Unverified
Scaling Reasoning Efficiently via Relaxed On-Policy Distillation	Mar 11, 2026	—Unverified
Fingerprinting Concepts in Data Streams with Supervised and Unsupervised Meta-Information	Mar 11, 2026	—Unverified
RC-NF: Robot-Conditioned Normalizing Flow for Real-Time Anomaly Detection in Robotic Manipulation	Mar 11, 2026	—Unverified
Task-Conditioned Routing Signatures in Sparse Mixture-of-Experts Transformers	Mar 11, 2026	—Unverified
A Learning-Based Superposition Operator for Non-Renewal Arrival Processes in Queueing Networks	Mar 11, 2026	—Unverified
Monitoring and Prediction of Mood in Elderly People during Daily Life Activities	Mar 11, 2026	—Unverified
Catalogue Grounded Multimodal Attribution for Museum Video under Resource and Regulatory Constraints	Mar 11, 2026	—Unverified
High-resolution weather-guided surrogate modeling for data-efficient cross-location building energy prediction	Mar 11, 2026	—Unverified
Procedural Fairness via Group Counterfactual Explanation	Mar 11, 2026	—Unverified
Co-Diffusion: An Affinity-Aware Two-Stage Latent Diffusion Framework for Generalizable Drug-Target Affinity Prediction	Mar 11, 2026	—Unverified
Efficient Approximation to Analytic and L^p functions by Height-Augmented ReLU Networks	Mar 11, 2026	—Unverified
Beyond Barren Plateaus: A Scalable Quantum Convolutional Architecture for High-Fidelity Image Classification	Mar 11, 2026	—Unverified
Attention Gathers, MLPs Compose: A Causal Analysis of an Action-Outcome Circuit in VideoViT	Mar 11, 2026	—Unverified
GGPT: Geometry Grounded Point Transformer	Mar 11, 2026	—Unverified
DeReason: A Difficulty-Aware Curriculum Improves Decoupled SFT-then-RL Training for General Reasoning	Mar 11, 2026	—Unverified
Evidential learning driven Breast Tumor Segmentation with Stage-divided Vision-Language Interaction	Mar 11, 2026	—Unverified
Security-by-Design for LLM-Based Code Generation: Leveraging Internal Representations for Concept-Driven Steering Mechanisms	Mar 11, 2026	—Unverified
Senna-2: Aligning VLM and End-to-End Driving Policy for Consistent Decision Making and Planning	Mar 11, 2026	—Unverified
Frequency-Modulated Visual Restoration for Matryoshka Large Multimodal Models	Mar 11, 2026	—Unverified
Markovian Generation Chains in Large Language Models	Mar 11, 2026	—Unverified
Trustworthy predictive distributions for rare events via diagnostic transport maps	Mar 11, 2026	—Unverified
Cough activity detection for automatic tuberculosis screening	Mar 11, 2026	—Unverified
A Unified Latent Space Disentanglement VAE Framework with Robust Disentanglement Effectiveness Evaluation	Mar 11, 2026	—Unverified
A Standardized Framework For Evaluating Gene Expression Generative Models	Mar 11, 2026	—Unverified
Mind the Sim2Real Gap in User Simulation for Agentic Tasks	Mar 11, 2026	—Unverified
A Machine Learning-Enhanced Hopf-Cole Formulation for Nonlinear Gas Flow in Porous Media	Mar 11, 2026	—Unverified
Artificial Intelligence for Sentiment Analysis of Persian Poetry	Mar 11, 2026	—Unverified
Towards Automated Initial Probe Placement in Transthoracic Teleultrasound Using Human Mesh and Skeleton Recovery	Mar 11, 2026	—Unverified
Enhancing Value Alignment of LLMs with Multi-agent system and Combinatorial Fusion	Mar 11, 2026	—Unverified
Similarity-as-Evidence: Calibrating Overconfident VLMs for Interpretable and Label-Efficient Medical Active Learning	Mar 11, 2026	—Unverified
A Minimal Agent for Automated Theorem Proving	Mar 11, 2026	—Unverified
Bayesian Optimization of Partially Known Systems using Hybrid Models	Mar 11, 2026	—Unverified
PEEM: Prompt Engineering Evaluation Metrics for Interpretable Joint Evaluation of Prompts and Responses	Mar 11, 2026	—Unverified
Toward Closed-loop Molecular Discovery via Language Model, Property Alignment and Strategic Search	Mar 11, 2026	—Unverified
Resource-constrained Amazons chess decision framework integrating large language models and graph attention	Mar 11, 2026	—Unverified
6ABOS: An Open-Source Atmospheric Correction Framework for the EnMAP Hyperspectral Mission Based on 6S	Mar 11, 2026	—Unverified
Gradient Dynamics of Attention: How Cross-Entropy Sculpts Bayesian Manifolds	Mar 11, 2026	—Unverified
Adaptive Event Stream Slicing for Open-Vocabulary Event-Based Object Detection via Vision-Language Knowledge Distillation	Mar 11, 2026	—Unverified