The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 7451–7500 of 661,570 papers

Title	Date	Status	Hype
Adaptation of Weakly Supervised Localization in Histopathology by Debiasing Predictions	Mar 12, 2026	—Unverified	0
Unleashing Video Language Models for Fine-grained HRCT Report Generation	Mar 12, 2026	—Unverified	0
Marked Pedagogies: Examining Linguistic Biases in Personalized Automated Writing Feedback	Mar 12, 2026	—Unverified	0
One-Step Flow Policy: Self-Distillation for Fast Visuomotor Policies	Mar 12, 2026	—Unverified	0
CalliMaster: Mastering Page-level Chinese Calligraphy via Layout-guided Spatial Planning	Mar 12, 2026	—Unverified	0
Generating Expressive and Customizable Evals for Timeseries Data Analysis Agents with AgentFuel	Mar 12, 2026	—Unverified	0
Modal Logical Neural Networks for Financial AI	Mar 12, 2026	—Unverified	0
RAW-Domain Degradation Models for Realistic Smartphone Super-Resolution	Mar 12, 2026	—Unverified	0
EB-RANSAC: Random Sample Consensus based on Energy-Based Model	Mar 12, 2026	—Unverified	0
Leveraging Phytolith Research using Artificial Intelligence	Mar 12, 2026	—Unverified	0
Deep Learning-based Assessment of the Relation Between the Third Molar and Mandibular Canal on Panoramic Radiographs using Local, Centralized, and Federated Learning	Mar 12, 2026	—Unverified	0
Orientability of Causal Relations in Time Series using Summary Causal Graphs and Faithful Distributions	Mar 12, 2026	—Unverified	0
Trust Oriented Explainable AI for Fake News Detection	Mar 12, 2026	—Unverified	0
Efficient Generative Modeling with Unitary Matrix Product States Using Riemannian Optimization	Mar 12, 2026	—Unverified	0
Once4All: Skeleton-Guided SMT Solver Fuzzing with LLM-Synthesized Generators	Mar 12, 2026	—Unverified	0
Deployment-Oriented Session-wise Meta-Calibration for Landmark-Based Webcam Gaze Tracking	Mar 12, 2026	—Unverified	0
Semi-Synthetic Parallel Data for Translation Quality Estimation: A Case Study of Dataset Building for an Under-Resourced Language Pair	Mar 12, 2026	—Unverified	0
Resource-Efficient Iterative LLM-Based NAS with Feedback Memory	Mar 12, 2026	—Unverified	0
TaxBreak: Unmasking the Hidden Costs of LLM Inference Through Overhead Decomposition	Mar 12, 2026	—Unverified	0
TURA: Tool-Augmented Unified Retrieval Agent for AI Search	Mar 12, 2026	—Unverified	0
Generalist Large Language Models for Molecular Property Prediction: Distilling Knowledge from Specialist Models	Mar 12, 2026	—Unverified	0
CHiL(L)Grader: Calibrated Human-in-the-Loop Short-Answer Grading	Mar 12, 2026	—Unverified	0
CLASP: Defending Hybrid Large Language Models Against Hidden State Poisoning Attacks	Mar 12, 2026	—Unverified	0
IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse	Mar 12, 2026	—Unverified	2
NeuralOS: Towards Simulating Operating Systems via Neural Generative Models	Mar 12, 2026	—Unverified	2
Chemical Reaction Networks Learn Better than Spiking Neural Networks	Mar 12, 2026	—Unverified	0
Can LLM Aid in Solving Constraints with Inductive Definitions?	Mar 12, 2026	—Unverified	0
JOPP-3D: Joint Open Vocabulary Semantic Segmentation on Point Clouds and Panoramas	Mar 12, 2026	—Unverified	0
Stage-Adaptive Reliability Modeling for Continuous Valence-Arousal Estimation	Mar 12, 2026	—Unverified	0
The Mirror Design Pattern: Strict Data Geometry over Model Scale for Prompt Injection Detection	Mar 12, 2026	—Unverified	0
You Told Me to Do It: Measuring Instructional Text-induced Private Data Leakage in LLM Agents	Mar 12, 2026	—Unverified	0
Do LLMs Share Human-Like Biases? Causal Reasoning Under Prior Knowledge, Irrelevant Context, and Varying Compute Budgets	Mar 12, 2026	—Unverified	0
The Perfection Paradox: From Architect to Curator in AI-Assisted API Design	Mar 12, 2026	—Unverified	0
UtilityMax Prompting: A Formal Framework for Multi-Objective Large Language Model Optimization	Mar 12, 2026	—Unverified	0
Personalized Federated Learning via Gaussian Generative Modeling	Mar 12, 2026	—Unverified	0
The Orthogonal Vulnerabilities of Generative AI Watermarks: A Comparative Empirical Benchmark of Spatial and Latent Provenance	Mar 12, 2026	—Unverified	0
ShotVerse: Advancing Cinematic Camera Control for Text-Driven Multi-Shot Video Creation	Mar 12, 2026	—Unverified	2
Towards Universal Computational Aberration Correction in Photographic Cameras: A Comprehensive Benchmark Analysis	Mar 12, 2026	Code Available	0
How Does Fourier Analysis Network Work? A Mechanism Analysis and a New Dual-Activation Layer Proposal	Mar 12, 2026	—Unverified	0
Survival Meets Classification: A Novel Framework for Early Risk Prediction Models of Chronic Diseases	Mar 12, 2026	—Unverified	0
Single-View Rolling-Shutter SfM	Mar 12, 2026	—Unverified	0
Dr. SHAP-AV: Decoding Relative Modality Contributions via Shapley Attribution in Audio-Visual Speech Recognition	Mar 12, 2026	—Unverified	0
From Next Token Prediction to (STRIPS) World Models	Mar 12, 2026	—Unverified	0
RefTr: Recurrent Refinement of Confluent Trajectories for 3D Vascular Tree Centerlines	Mar 12, 2026	—Unverified	0
When Models Fabricate Credentials: Measuring How Professional Identity Suppresses Honest Self-Representation	Mar 12, 2026	—Unverified	0
Deep Eigenspace Network for Parametric Non-self-adjoint Eigenvalue Problems	Mar 12, 2026	—Unverified	0
Do LLMs Judge Distantly Supervised Named Entity Labels Well? Constructing the JudgeWEL Dataset	Mar 12, 2026	—Unverified	0
SENS-ASR: Semantic Embedding injection in Neural-transducer for Streaming Automatic Speech Recognition	Mar 12, 2026	—Unverified	0
CUAAudit: Meta-Evaluation of Vision-Language Models as Auditors of Autonomous Computer-Use Agents	Mar 12, 2026	—Unverified	0
Tiny Aya: Bridging Scale and Multilingual Depth	Mar 12, 2026	—Unverified	0