The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 15,101–15,150 of 474,278 papers

Title	Date	Tasks	Status	Hype
NeuralPDR: Neural Differential Equations as surrogate models for Photodissociation Regions	Jun 17, 2025	GPU	Code Available	0
Complete Characterization for Adjustment in Summary Causal Graphs of Time Series	Jun 17, 2025	Time Series	Unverified	0
Synthetic Data Augmentation for Table Detection: Re-evaluating TableNet's Performance with Automatically Generated Document Images	Jun 17, 2025	Data Augmentation, Table Detection	Unverified	0
On the Hardness of Bandit Learning	Jun 17, 2025	Learning Theory	Unverified	0
Structured and Informed Probabilistic Modeling with the Thermodynamic Kolmogorov-Arnold Model	Jun 17, 2025	Diversity	Code Available	0
Single-Example Learning in a Mixture of GPDMs with Latent Geometries	Jun 17, 2025	Mixture-of-Experts	Unverified	0
RL-Obfuscation: Can Language Models Learn to Evade Latent-Space Monitors?	Jun 17, 2025		Code Available	0
Integrating Radiomics with Deep Learning Enhances Multiple Sclerosis Lesion Delineation	Jun 17, 2025	Deep Learning, Lesion Segmentation	Unverified	0
Knowledge Adaptation as Posterior Correction	Jun 17, 2025	Federated Learning	Unverified	0
Foundation Model Insights and a Multi-Model Approach for Superior Fine-Grained One-shot Subset Selection	Jun 17, 2025	model	Code Available	0
GuiLoMo: Allocating Expert Number and Rank for LoRA-MoE via Bilevel Optimization with GuidedSelection Vectors	Jun 17, 2025	Bilevel Optimization, Mixture-of-Experts	Code Available	0
Adaptive Data Augmentation for Thompson Sampling	Jun 17, 2025	Data Augmentation, Multi-Armed Bandits	Unverified	0
Don't throw the baby out with the bathwater: How and why deep learning for ARC	Jun 17, 2025	ARC	Unverified	0
SIRI-Bench: Challenging VLMs' Spatial Intelligence through Complex Reasoning Tasks	Jun 17, 2025	Math, Spatial Reasoning	Unverified	0
Risk Estimation of Knee Osteoarthritis Progression via Predictive Multi-task Modelling from Efficient Diffusion Model using X-ray Images	Jun 17, 2025	Image Generation, Interpretable Machine Learning	Unverified	0
AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents	Jun 17, 2025		Code Available	1
Towards Desiderata-Driven Design of Visual Counterfactual Explainers	Jun 17, 2025	counterfactual	Unverified	0
Collaborative Editable Model	Jun 17, 2025	Domain Adaptation, model	Unverified	0
Latent Anomaly Detection: Masked VQ-GAN for Unsupervised Segmentation in Medical CBCT	Jun 17, 2025	Anomaly Detection, Segmentation	Unverified	0
Dense360: Dense Understanding from Omnidirectional Panoramas	Jun 17, 2025	ERP	Unverified	0
FocalClick-XL: Towards Unified and High-quality Interactive Segmentation	Jun 17, 2025	Interactive Segmentation	Unverified	0
ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM	Jun 17, 2025	Hallucination, Language Modeling	Unverified	0
S^4C: Speculative Sampling with Syntactic and Semantic Coherence for Efficient Inference of Large Language Models	Jun 17, 2025	Text Generation, valid	Unverified	0
Revisiting Chain-of-Thought Prompting: Zero-shot Can Be Stronger than Few-shot	Jun 17, 2025	In-Context Learning, Mathematical Reasoning	Unverified	0
Fragile Preferences: A Deep Dive Into Order Effects in Large Language Models	Jun 17, 2025	Decision Making	Unverified	0
Situational-Constrained Sequential Resources Allocation via Reinforcement Learning	Jun 17, 2025	Decision Making, reinforcement-learning	Unverified	0
Bayesian Hybrid Machine Learning of Gallstone Risk	Jun 17, 2025	Decision Making, Hybrid Machine Learning	Unverified	0
DCRM: A Heuristic to Measure Response Pair Quality in Preference Optimization	Jun 17, 2025		Code Available	0
Adjustment for Confounding using Pre-Trained Representations	Jun 17, 2025	parameter estimation, Transfer Learning	Code Available	0
Leveraging Predictive Equivalence in Decision Trees	Jun 17, 2025	Interpretable Machine Learning, Missing Values	Code Available	0
sHGCN: Simplified hyperbolic graph convolutional neural networks	Jun 17, 2025	Computational Efficiency	Code Available	0
VisText-Mosquito: A Multimodal Dataset and Benchmark for AI-Based Mosquito Breeding Site Detection and Reasoning	Jun 17, 2025	object-detection, Object Detection	Code Available	0
Scaling-Up the Pretraining of the Earth Observation Foundation Model PhilEO to the MajorTOM Dataset	Jun 17, 2025	Density Estimation, Earth Observation	Code Available	0
From What to Respond to When to Respond: Timely Response Generation for Open-domain Dialogue Agents	Jun 17, 2025	Language Modeling, Language Modelling	Code Available	0
MoORE: SVD-based Model MoE-ization for Conflict- and Oblivion-Resistant Multi-Task Adaptation	Jun 17, 2025		Code Available	0
Xolver: Multi-Agent Reasoning with Holistic Experience Learning Just Like an Olympiad Team	Jun 17, 2025	Code Generation, GSM8K	Code Available	1
When Does Meaning Backfire? Investigating the Role of AMRs in NLI	Jun 17, 2025	Abstract Meaning Representation, Natural Language Inference	Unverified	0
Mxplainer: Explain and Learn Insights by Imitating Mahjong Agents	Jun 17, 2025	Decision Making	Code Available	0
Essential-Web v1.0: 24T tokens of organized web data	Jun 17, 2025	Math	Code Available	2
YOLOv11-RGBT: Towards a Comprehensive Single-Stage Multispectral Object Detection Framework	Jun 17, 2025	Multispectral Object Detection, object-detection	Code Available	4
A Model-Mediated Stacked Ensemble Approach for Depression Prediction Among Professionals	Jun 17, 2025	Ensemble Learning	Unverified	0
TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization	Jun 17, 2025		Code Available	1
CausalDiffTab: Mixed-Type Causal-Aware Diffusion for Tabular Data Generation	Jun 17, 2025	Tabular Data Generation	Code Available	0
ASAP-FE: Energy-Efficient Feature Extraction Enabling Multi-Channel Keyword Spotting on Edge Processors	Jun 17, 2025	Keyword Spotting, Scheduling	Unverified	0
Refining music sample identification with a self-supervised graph neural network	Jun 17, 2025	Contrastive Learning, Graph Neural Network	Code Available	1
Egocentric Human-Object Interaction Detection: A New Benchmark and Method	Jun 17, 2025	Benchmarking, Human-Object Interaction Detection	Unverified	0
Cross-Modal Geometric Hierarchy Fusion: An Implicit-Submap Driven Framework for Resilient 3D Place Recognition	Jun 17, 2025	3D Place Recognition, Autonomous Driving	Unverified	0
KDMOS:Knowledge Distillation for Motion Segmentation	Jun 17, 2025	Autonomous Driving, Knowledge Distillation	Code Available	0
Causally Steered Diffusion for Automated Video Counterfactual Generation	Jun 17, 2025	counterfactual, Video Editing	Code Available	0
Decoupled Classifier-Free Guidance for Counterfactual Diffusion Models	Jun 17, 2025	Attribute, counterfactual	Unverified	0