The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers


Showing 6351–6400 of 661,570 papers

Title	Date	Status
Generative Inverse Design of Cold Metals for Low-Power Electronics	Mar 14, 2026	Unverified
SmoothVLA: Aligning Vision-Language-Action Models with Physical Constraints via Intrinsic Smoothness Optimization	Mar 14, 2026	Unverified
True 4-Bit Quantized Convolutional Neural Network Training on CPU: Achieving Full-Precision Parity	Mar 14, 2026	Unverified
OmniCompliance-100K: A Multi-Domain, Rule-Grounded, Real-World Safety Compliance Dataset	Mar 14, 2026	Unverified
DCP-CLIP: A Coarse-to-Fine Framework for Open-Vocabulary Semantic Segmentation with Dual Interaction	Mar 14, 2026	Unverified
Traffic and weather driven hybrid digital twin for bridge monitoring	Mar 14, 2026	Unverified
EviAgent: Evidence-Driven Agent for Radiology Report Generation	Mar 14, 2026	Unverified
Human-like Object Grouping in Self-supervised Vision Transformers	Mar 14, 2026	Unverified
IMS3: Breaking Distributional Aggregation in Diffusion-Based Dataset Distillation	Mar 14, 2026	Unverified
vla-eval: A Unified Evaluation Harness for Vision-Language-Action Models	Mar 14, 2026	Unverified
Leveraging a Statistical Shape Model for Efficient Generation of Annotated Training Data: A Case Study on Liver Landmarks Segmentation	Mar 14, 2026	Unverified
Shapes are not enough: CONSERVAttack and its use for finding vulnerabilities and uncertainties in machine learning applications	Mar 14, 2026	Unverified
When Visual Privacy Protection Meets Multimodal Large Language Models	Mar 14, 2026	Unverified
Location Aware Embedding for Geotargeting in Sponsored Search Advertising	Mar 14, 2026	Unverified
A Systematic Evaluation Protocol of Graph-Derived Signals for Tabular Machine Learning	Mar 14, 2026	Unverified
PhyGaP: Physically-Grounded Gaussians with Polarization Cues	Mar 14, 2026	Unverified
The Taxonomies, Training, and Applications of Event Stream Modelling for Electronic Health Records	Mar 14, 2026	Unverified
Towards Generalizable Deepfake Detection via Real Distribution Bias Correction	Mar 14, 2026	Unverified
Beyond Explicit Edges: Robust Reasoning over Noisy and Sparse Knowledge Graphs	Mar 14, 2026	Unverified
Formal Abductive Explanations for Navigating Mental Health Help-Seeking and Diversity in Tech Workplaces	Mar 14, 2026	Unverified
SemEval-2026 Task 6: CLARITY -- Unmasking Political Question Evasions	Mar 14, 2026	Unverified
Schrödinger Bridge Over A Compact Connected Lie Group	Mar 14, 2026	Unverified
Intrinsic Tolerance in C-Arm Imaging: How Extrinsic Re-optimization Preserves 3D Reconstruction Accuracy	Mar 14, 2026	Unverified
Probing neural audio codecs for distinctions among English nuclear tunes	Mar 14, 2026	Unverified
A Theory of Appropriateness That Accounts for Norms of Rationality	Mar 14, 2026	Unverified
NepTam: A Nepali-Tamang Parallel Corpus and Baseline Machine Translation Experiments	Mar 14, 2026	Unverified
Demand-Driven Context: A Methodology for Building Enterprise Knowledge Bases Through Agent Failure	Mar 14, 2026	Unverified
TMPDiff: Temporal Mixed-Precision for Diffusion Models	Mar 14, 2026	Unverified
Soft Mean Expected Calibration Error (SMECE): A Calibration Metric for Probabilistic Labels	Mar 14, 2026	Unverified
OasisSimp: An Open-source Asian-English Sentence Simplification Dataset	Mar 14, 2026	Unverified
Revisiting the Perception-Distortion Trade-off with Spatial-Semantic Guided Super-Resolution	Mar 14, 2026	Unverified
Is the reconstruction loss culprit? An attempt to outperform JEPA	Mar 14, 2026	Unverified
Improving Visual Reasoning with Iterative Evidence Refinement	Mar 14, 2026	Unverified
Towards Agentic Honeynet Configuration	Mar 14, 2026	Unverified
Low-Field Magnetic Resonance Image Enhancement using Undersampled k-Space	Mar 14, 2026	Unverified
The GELATO Dataset for Legislative NER	Mar 14, 2026	Unverified
Multifidelity Surrogate Modeling of Depressurized Loss of Forced Cooling in High-temperature Gas Reactors	Mar 14, 2026	Unverified
Align Forward, Adapt Backward: Closing the Discretization Gap in Logic Gate Networks	Mar 14, 2026	Unverified
Clinician input steers frontier AI models toward both accurate and harmful decisions	Mar 14, 2026	Unverified
PA-Net: Precipitation-Adaptive Mixture-of-Experts for Long-Tail Rainfall Nowcasting	Mar 14, 2026	Unverified
Evaluation of Visual Place Recognition Methods for Image Pair Retrieval in 3D Vision and Robotics	Mar 14, 2026	Unverified
EyeWorld: A Generative World Model of Ocular State and Dynamics	Mar 14, 2026	Unverified
The Geometry of Multi-Task Grokking: Transverse Instability, Superposition, and Weight Decay Phase Structure	Mar 14, 2026	Unverified
What's the Price of Monotonicity? A Multi-Dataset Benchmark of Monotone-Constrained Gradient Boosting for Credit PD	Mar 14, 2026	Unverified
Distributionally Robust Geometric Joint Chance-Constrained Optimization: Neurodynamic Approaches	Mar 14, 2026	Unverified
Two-Step Data Augmentation for Masked Face Detection and Recognition: Turning Fake Masks to Real	Mar 14, 2026	Unverified
Empowering Future Cybersecurity Leaders: Advancing Students through FINDS Education for Digital Forensic Excellence	Mar 14, 2026	Unverified
ALTIS: Automated Loss Triage and Impact Scoring from Sentinel-1 SAR for Property-Level Flood Damage Assessment	Mar 14, 2026	Unverified
ArrayTac: A tactile display for simultaneous rendering of shape, stiffness and friction	Mar 14, 2026	Unverified
Maximin Robust Bayesian Experimental Design	Mar 14, 2026	Unverified