The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 6751–6800 of 661570 papers

Title	Date	Status	Hype
A Systematic Benchmark of GAN Architectures for MRI-to-CT Synthesis	Mar 13, 2026	CodeCode Available	0
Instructing Large Language Models for Low-Resource Languages: A Systematic Study for Basque	Mar 13, 2026	CodeCode Available	0
Position: Agentic Evolution is the Path to Evolving LLMs	Mar 13, 2026	CodeCode Available	0
MR-GNF: Multi-Resolution Graph Neural Forecasting on Ellipsoidal Meshes for Efficient Regional Weather Prediction	Mar 13, 2026	CodeCode Available	0
Visual-ERM: Reward Modeling for Visual Equivalence	Mar 13, 2026	—Unverified	1
V-Bridge: Bridging Video Generative Priors to Versatile Few-shot Image Restoration	Mar 13, 2026	—Unverified	1
MIRAGE: Model-agnostic Industrial Realistic Anomaly Generation and Evaluation for Visual Anomaly Detection	Mar 13, 2026	—Unverified	0
Benchmarking Large Language Models on Reference Extraction and Parsing in the Social Sciences and Humanities	Mar 13, 2026	—Unverified	0
Developing and evaluating a chatbot to support maternal health care	Mar 13, 2026	—Unverified	0
Global Sensitivity Analysis for Engineering Design Based on Individual Conditional Expectations	Mar 13, 2026	—Unverified	0
Synthetic Melanoma Image Generation and Evaluation Using Generative Adversarial Networks	Mar 13, 2026	—Unverified	0
NCCL EP: Towards a Unified Expert Parallel Communication API for NCCL	Mar 13, 2026	—Unverified	0
LLM-driven Multimodal Recommendation	Mar 13, 2026	—Unverified	0
mAceReason-Math: A Dataset of High-Quality Multilingual Math Problems Ready For RLVR	Mar 13, 2026	—Unverified	0
ViewMask-1-to-3: Multi-View Consistent Image Generation via Multimodal Diffusion Models	Mar 13, 2026	—Unverified	0
DeCode: Decoupling Content and Delivery for Medical QA	Mar 13, 2026	—Unverified	0
Expert Selections In MoE Models Reveal (Almost) As Much As Text	Mar 13, 2026	—Unverified	0
SPRig: Self-Supervised Pose-Invariant Rigging from Mesh Sequences	Mar 13, 2026	—Unverified	0
Context Engineering: From Prompts to Corporate Multi-Agent Architecture	Mar 13, 2026	—Unverified	0
Sobolev--Ricci Curvature	Mar 13, 2026	—Unverified	0
NeuCo-Bench: A Novel Benchmark Framework for Neural Embeddings in Earth Observation	Mar 13, 2026	—Unverified	0
Examining Users' Behavioural Intention to Use OpenClaw Through the Cognition--Affect--Conation Framework	Mar 13, 2026	—Unverified	0
Taming the Long Tail: Efficient Item-wise Sharpness-Aware Minimization for LLM-based Recommender Systems	Mar 13, 2026	—Unverified	0
DirPA: Addressing Prior Shift in Imbalanced Few-shot Crop-type Classification	Mar 13, 2026	—Unverified	0
Rethinking VLMs for Image Forgery Detection and Localization	Mar 13, 2026	CodeCode Available	0
Anchored Alignment: Preventing Positional Collapse in Multimodal Recommender Systems	Mar 13, 2026	CodeCode Available	0
Speech-Worthy Alignment for Japanese SpeechLLMs via Direct Preference Optimization	Mar 13, 2026	—Unverified	0
Spatial Reasoning is Not a Free Lunch: A Controlled Study on LLaVA	Mar 13, 2026	—Unverified	0
TERMINATOR: Learning Optimal Exit Points for Early Stopping in Chain-of-Thought Reasoning	Mar 13, 2026	—Unverified	0
Spatio-Semantic Expert Routing Architecture with Mixture-of-Experts for Referring Image Segmentation	Mar 13, 2026	—Unverified	0
Embedded Quantum Machine Learning in Embedded Systems: Feasibility, Hybrid Architectures, and Quantum Co-Processors	Mar 13, 2026	—Unverified	0
Decoding Matters: Efficient Mamba-Based Decoder with Distribution-Aware Deep Supervision for Medical Image Segmentation	Mar 13, 2026	—Unverified	0
Asymptotic and Finite-Time Guarantees for Langevin-Based Temperature Annealing in InfoNCE	Mar 13, 2026	—Unverified	0
Beyond Dense Futures: World Models as Structured Planners for Robotic Manipulation	Mar 13, 2026	—Unverified	0
Mobile-VTON: High-Fidelity On-Device Virtual Try-On	Mar 13, 2026	—Unverified	0
Building Effective AI Coding Agents for the Terminal: Scaffolding, Harness, Context Engineering, and Lessons Learned	Mar 13, 2026	—Unverified	0
TerraFlow: Multimodal, Multitemporal Representation Learning for Earth Observation	Mar 13, 2026	—Unverified	0
A Closed-Form Solution for Debiasing Vision-Language Models with Utility Guarantees Across Modalities and Tasks	Mar 13, 2026	—Unverified	0
Variational Garrote for Sparse Inverse Problems	Mar 13, 2026	—Unverified	0
A Spectral Revisit of the Distributional Bellman Operator under the Cramér Metric	Mar 13, 2026	—Unverified	0
Expert Pyramid Tuning: Efficient Parameter Fine-Tuning for Expertise-Driven Task Allocation	Mar 13, 2026	—Unverified	0
DINOLight: Robust Ambient Light Normalization with Self-supervised Visual Prior Integration	Mar 13, 2026	—Unverified	0
Node-RF: Learning Generalized Continuous Space-Time Scene Dynamics with Neural ODE-based NeRFs	Mar 13, 2026	—Unverified	0
RTD-Guard: A Black-Box Textual Adversarial Detection Framework via Replacement Token Detection	Mar 13, 2026	—Unverified	0
CA-HFP: Curvature-Aware Heterogeneous Federated Pruning with Model Reconstruction	Mar 13, 2026	—Unverified	0
Early Pruning for Public Transport Routing	Mar 13, 2026	—Unverified	0
Maximizing Incremental Information Entropy for Contrastive Learning	Mar 13, 2026	—Unverified	0
Optimize Wider, Not Deeper: Consensus Aggregation for Policy Optimization	Mar 13, 2026	—Unverified	0
Feynman: Knowledge-Infused Diagramming Agent for Scalable Visual Designs	Mar 13, 2026	—Unverified	0
Enhancing Novel View Synthesis via Geometry Grounded Set Diffusion	Mar 13, 2026	—Unverified	0