The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 15401–15450 of 474278 papers

Title	Date	Tasks	Status	Hype
Projecting U.S. coastal storm surge risks and impacts with deep learning	Jun 16, 2025	Deep Learning	—Unverified	0
VL-GenRM: Enhancing Vision-Language Verification via Vision Experts and Iterative Training	Jun 16, 2025	HallucinationMultimodal Reasoning	—Unverified	0
StaQ it! Growing neural networks for Policy Mirror Descent	Jun 16, 2025	Reinforcement Learning (RL)	—Unverified	0
Are manual annotations necessary for statutory interpretations retrieval?	Jun 16, 2025	Retrieval	—Unverified	0
Adaptive Guidance Accelerates Reinforcement Learning of Reasoning Models	Jun 16, 2025	Mathreinforcement-learning	—Unverified	0
Prefix-Tuning+: Modernizing Prefix-Tuning by Decoupling the Prefix from Attention	Jun 16, 2025	parameter-efficient fine-tuning	—Unverified	0
ProfiLLM: An LLM-Based Framework for Implicit Profiling of Chatbot Users	Jun 16, 2025	ChatbotLarge Language Model	—Unverified	0
Discovering Temporal Structure: An Overview of Hierarchical Reinforcement Learning	Jun 16, 2025	Hierarchical Reinforcement Learningreinforcement-learning	—Unverified	0
Evolvable Conditional Diffusion	Jun 16, 2025	DenoisingDescriptive	—Unverified	0
Robustness of Reinforcement Learning-Based Traffic Signal Control under Incidents: A Comparative Study	Jun 16, 2025	BenchmarkingTraffic Signal Control	—Unverified	0
Toward Explainable Offline RL: Analyzing Representations in Intrinsically Motivated Decision Transformers	Jun 16, 2025	Decision MakingDecision Making Under Uncertainty	—Unverified	0
Bures-Wasserstein Flow Matching for Graph Generation	Jun 16, 2025	3D Molecule GenerationDrug Discovery	—Unverified	0
Scientifically-Interpretable Reasoning Network (ScIReN): Uncovering the Black-Box of Nature	Jun 16, 2025	scientific discovery	—Unverified	0
Meta Optimality for Demographic Parity Constrained Regression via Post-Processing	Jun 16, 2025	Fairnessregression	—Unverified	0
No-Regret Learning Under Adversarial Resource Constraints: A Spending Plan Is All You Need!	Jun 16, 2025	All	—Unverified	0
Graph-Convolutional-Beta-VAE for Synthetic Abdominal Aorta Aneurysm Generation	Jun 16, 2025	Data AugmentationDiversity	—Unverified	0
Load Balancing Mixture of Experts with Similarity Preserving Routers	Jun 16, 2025	Mixture-of-Experts	—Unverified	0
Integrating Knowledge Graphs and Bayesian Networks: A Hybrid Approach for Explainable Disease Risk Prediction	Jun 16, 2025	Knowledge GraphsPrediction	—Unverified	0
Self-Supervised Enhancement for Depth from a Lightweight ToF Sensor with Monocular Images	Jun 16, 2025	Depth EstimationSelf-Supervised Learning	CodeCode Available	1
Evolutionary chemical learning in dimerization networks	Jun 16, 2025		CodeCode Available	0
Sustainable Machine Learning Retraining: Optimizing Energy Efficiency Without Compromising Accuracy	Jun 16, 2025		CodeCode Available	0
Safe Domains of Attraction for Discrete-Time Nonlinear Systems: Characterization and Verifiable Neural Network Estimation	Jun 16, 2025		CodeCode Available	0
SatHealth: A Multimodal Public Health Dataset with Satellite-based Environmental Factors	Jun 16, 2025		CodeCode Available	0
Density-aware Walks for Coordinated Campaign Detection	Jun 16, 2025	Graph Classification	CodeCode Available	0
Evaluating Generalization and Representation Stability in Small LMs via Prompting, Fine-Tuning and Out-of-Distribution Prompts	Jun 16, 2025	Model Selection	—Unverified	0
Enhancing Goal-oriented Proactive Dialogue Systems via Consistency Reflection and Correction	Jun 16, 2025	Decoder	CodeCode Available	0
Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs	Jun 16, 2025	Machine Unlearning	CodeCode Available	1
ROSAQ: Rotation-based Saliency-Aware Weight Quantization for Efficiently Compressing Large Language Models	Jun 16, 2025	Quantization	—Unverified	0
Effective Stimulus Propagation in Neural Circuits: Driver Node Selection	Jun 16, 2025	Stochastic Block Model	—Unverified	0
LocationReasoner: Evaluating LLMs on Real-World Site Selection Reasoning	Jun 16, 2025	Code GenerationMathematical Problem-Solving	CodeCode Available	0
Quantum-Informed Contrastive Learning with Dynamic Mixup Augmentation for Class-Imbalanced Expert Systems	Jun 16, 2025	Contrastive LearningRobust classification	—Unverified	0
Investigating the interaction of linguistic and mathematical reasoning in language models using multilingual number puzzles	Jun 16, 2025	DiversityMathematical Reasoning	—Unverified	0
GITO: Graph-Informed Transformer Operator for Learning Complex Partial Differential Equations	Jun 16, 2025	Graph Neural NetworkSuper-Resolution	—Unverified	0
A Silent Speech Decoding System from EEG and EMG with Heterogenous Electrode Configurations	Jun 16, 2025	EEGspeech-recognition	—Unverified	0
Automatic Extraction of Clausal Embedding Based on Large-Scale English Text Data	Jun 16, 2025	Constituency Parsing	CodeCode Available	0
How Does LLM Reasoning Work for Code? A Survey and a Call to Action	Jun 16, 2025	Code GenerationGitHub issue resolution	—Unverified	0
A Regret Perspective on Online Selective Generation	Jun 16, 2025	HallucinationLEMMA	—Unverified	0
Estimation of Treatment Effects in Extreme and Unobserved Data	Jun 16, 2025	Causal Inference	—Unverified	0
Experimental Design for Semiparametric Bandits	Jun 16, 2025	Experimental Design	—Unverified	0
Constant Stepsize Local GD for Logistic Regression: Acceleration by Instability	Jun 16, 2025	regression	—Unverified	0
Connecting phases of matter to the flatness of the loss landscape in analog variational quantum algorithms	Jun 16, 2025	Visual Question Answering (VQA)	—Unverified	0
Enhancing interpretability of rule-based classifiers through feature graphs	Jun 16, 2025	DiagnosticFeature Importance	CodeCode Available	0
SeqPE: Transformer with Sequential Position Encoding	Jun 16, 2025	image-classificationImage Classification	CodeCode Available	1
MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation	Jun 16, 2025	Optical Character Recognition (OCR)	—Unverified	0
EmoNews: A Spoken Dialogue System for Expressive News Conversations	Jun 16, 2025	Language ModelingLanguage Modelling	CodeCode Available	0
Causal Mediation Analysis with Multiple Mediators: A Simulation Approach	Jun 16, 2025		CodeCode Available	0
A Production Scheduling Framework for Reinforcement Learning Under Real-World Constraints	Jun 16, 2025	Job Shop SchedulingReinforcement Learning (RL)	CodeCode Available	1
HierVL: Semi-Supervised Segmentation leveraging Hierarchical Vision-Language Synergy with Dynamic Text-Spatial Query Alignment	Jun 16, 2025	Semantic SegmentationSemi-Supervised Semantic Segmentation	—Unverified	0
DETRPose: Real-time end-to-end transformer model for multi-person pose estimation	Jun 16, 2025	2D Pose EstimationDecoder	CodeCode Available	2
Sketched Sum-Product Networks for Joins	Jun 16, 2025		CodeCode Available	0