The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1951–2000 of 659983 papers

Title	Date	Status
From Topic to Transition Structure: Unsupervised Concept Discovery at Corpus Scale via Predictive Associative Memory	Mar 19, 2026	—Unverified
Prune-then-Quantize or Quantize-then-Prune? Understanding the Impact of Compression Order in Joint Model Compression	Mar 19, 2026	—Unverified
Adaptive Decoding via Test-Time Policy Learning for Self-Improving Generation	Mar 19, 2026	—Unverified
Towards Noise-Resilient Quantum Multi-Armed and Stochastic Linear Bandits	Mar 19, 2026	—Unverified
UT-ACA: Uncertainty-Triggered Adaptive Context Allocation for Long-Context Inference	Mar 19, 2026	—Unverified
AS2 -- Attention-Based Soft Answer Sets: An End-to-End Differentiable Neuro-Soft-Symbolic Reasoning Architecture	Mar 19, 2026	—Unverified
SODIUM: From Open Web Data to Queryable Databases	Mar 19, 2026	—Unverified
Seeking Universal Shot Language Understanding Solutions	Mar 19, 2026	—Unverified
MedQ-UNI: Toward Unified Medical Image Quality Assessment and Restoration via Vision-Language Modeling	Mar 19, 2026	—Unverified
Recolour What Matters: Region-Aware Colour Editing via Token-Level Diffusion	Mar 19, 2026	—Unverified
GAIN: A Benchmark for Goal-Aligned Decision-Making of Large Language Models under Imperfect Norms	Mar 19, 2026	—Unverified
Cognitive Mismatch in Multimodal Large Language Models for Discrete Symbol Understanding	Mar 19, 2026	—Unverified
Do Vision Language Models Understand Human Engagement in Games?	Mar 19, 2026	—Unverified
T-QPM: Enabling Temporal Out-Of-Distribution Detection and Domain Generalization for Vision-Language Models in Open-World	Mar 19, 2026	—Unverified
The Truncation Blind Spot: How Decoding Strategies Systematically Exclude Human-Like Token Choices	Mar 19, 2026	—Unverified
Precise Performance of Linear Denoisers in the Proportional Regime	Mar 19, 2026	—Unverified
TexEditor: Structure-Preserving Text-Driven Texture Editing	Mar 19, 2026	CodeCode Available
Cross-Domain Demo-to-Code via Neurosymbolic Counterfactual Reasoning	Mar 19, 2026	—Unverified
NymeriaPlus: Enriching Nymeria Dataset with Additional Annotations and Data	Mar 19, 2026	—Unverified
OnlinePG: Online Open-Vocabulary Panoptic Mapping with 3D Gaussian Splatting	Mar 19, 2026	—Unverified
From Snapshots to Symphonies: The Evolution of Protein Prediction from Static Structures to Generative Dynamics and Multimodal Interactions	Mar 19, 2026	—Unverified
Expert Personas Improve LLM Alignment but Damage Accuracy: Bootstrapping Intent-Based Persona Routing with PRISM	Mar 19, 2026	—Unverified
CAFlow: Adaptive-Depth Single-Step Flow Matching for Efficient Histopathology Super-Resolution	Mar 19, 2026	—Unverified
Counting Circuits: Mechanistic Interpretability of Visual Reasoning in Large Vision-Language Models	Mar 19, 2026	—Unverified
Correlation-Weighted Multi-Reward Optimization for Compositional Generation	Mar 19, 2026	—Unverified
Data-efficient pre-training by scaling synthetic megadocs	Mar 19, 2026	—Unverified
Remedying Target-Domain Astigmatism for Cross-Domain Few-Shot Object Detection	Mar 19, 2026	—Unverified
HEP Statistical Inference for UAV Fault Detection: CLs, LRT, and SBI Applied to Blade Damage	Mar 19, 2026	—Unverified
SINDy-KANs: Sparse identification of non-linear dynamics through Kolmogorov-Arnold networks	Mar 19, 2026	—Unverified
CausalVAD: De-confounding End-to-End Autonomous Driving via Causal Intervention	Mar 19, 2026	—Unverified
SpecForge: A Flexible and Efficient Open-Source Training Framework for Speculative Decoding	Mar 19, 2026	—Unverified
CAPSUL: A Comprehensive Human Protein Benchmark for Subcellular Localization	Mar 19, 2026	—Unverified
MedForge: Interpretable Medical Deepfake Detection via Forgery-aware Reasoning	Mar 19, 2026	—Unverified
ICE: Intervention-Consistent Explanation Evaluation with Statistical Grounding for LLMs	Mar 19, 2026	—Unverified
Breaking Hard Isomorphism Benchmarks with DRESS	Mar 19, 2026	—Unverified
Color image restoration based on nonlocal saturation-value similarity	Mar 19, 2026	—Unverified
Elastic Weight Consolidation Done Right for Continual Learning	Mar 19, 2026	—Unverified
myMNIST: Benchmark of PETNN, KAN, and Classical Deep Learning Models for Burmese Handwritten Digit Recognition	Mar 19, 2026	—Unverified
Complementary Text-Guided Attention for Zero-Shot Adversarial Robustness	Mar 19, 2026	—Unverified
Beyond TVLA: Anderson-Darling Leakage Assessment for Neural Network Side-Channel Leakage Detection	Mar 19, 2026	—Unverified
Improving Joint Audio-Video Generation with Cross-Modal Context Learning	Mar 19, 2026	—Unverified
AutORAN: LLM-driven Natural Language Programming for Agile xApp Development	Mar 19, 2026	—Unverified
DiscoPhon: Benchmarking the Unsupervised Discovery of Phoneme Inventories With Discrete Speech Units	Mar 19, 2026	—Unverified
Cyber-Resilient Digital Twins: Discriminating Attacks for Safe Critical Infrastructure Control	Mar 19, 2026	—Unverified
Benchmarking CNN-based Models against Transformer-based Models for Abdominal Multi-Organ Segmentation on the RATIC Dataset	Mar 19, 2026	—Unverified
GenVideoLens: Where LVLMs Fall Short in AI-Generated Video Detection?	Mar 19, 2026	—Unverified
Agentic Flow Steering and Parallel Rollout Search for Spatially Grounded Text-to-Image Generation	Mar 19, 2026	—Unverified
An Onto-Relational-Sophic Framework for Governing Synthetic Minds	Mar 19, 2026	—Unverified
SwiftGS: Episodic Priors for Immediate Satellite Surface Recovery	Mar 19, 2026	—Unverified
PhysVideo: Physically Plausible Video Generation with Cross-View Geometry Guidance	Mar 19, 2026	—Unverified