The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 12451–12500 of 474278 papers

Title	Date	Tasks	Status	Hype
LiLM-RDB-SFC: Lightweight Language Model with Relational Database-Guided DRL for Optimized SFC Provisioning	Jul 15, 2025	Deep Reinforcement LearningLanguage Modeling	—Unverified	0
Towards Reliable Objective Evaluation Metrics for Generative Singing Voice Separation Models	Jul 15, 2025	Audio Source Separationblind source separation	CodeCode Available	0
A Risk-Aware Adaptive Robust MPC with Learned Uncertainty Quantification	Jul 15, 2025	Active LearningModel Predictive Control	—Unverified	0
Mind the Gap: Bridging Occlusion in Gait Recognition via Residual Gap Correction	Jul 15, 2025	Gait RecognitionPerson Re-Identification	—Unverified	0
A Learning Framework For Cooperative Collision Avoidance of UAV Swarms Leveraging Domain Knowledge	Jul 15, 2025	Collision AvoidanceMulti-agent Reinforcement Learning	—Unverified	0
GeoDistill: Geometry-Guided Self-Distillation for Weakly Supervised Cross-View Localization	Jul 15, 2025	Autonomous Navigation	CodeCode Available	0
DCR: Quantifying Data Contamination in LLMs Evaluation	Jul 15, 2025	Arithmetic ReasoningBenchmarking	CodeCode Available	0
High-Throughput Distributed Reinforcement Learning via Adaptive Policy Synchronization	Jul 15, 2025	reinforcement-learningReinforcement Learning	CodeCode Available	0
LLM-Driven Dual-Level Multi-Interest Modeling for Recommendation	Jul 15, 2025	Contrastive Learning	—Unverified	0
Multi-Trigger Poisoning Amplifies Backdoor Vulnerabilities in LLMs	Jul 15, 2025	Data Poisoning	—Unverified	0
Sparse Regression Codes exploit Multi-User Diversity without CSI	Jul 15, 2025	DecoderDiversity	—Unverified	0
HUG-VAS: A Hierarchical NURBS-Based Generative Model for Aortic Geometry Synthesis and Controllable Editing	Jul 15, 2025	Denoising	—Unverified	0
Stochastic Entanglement Configuration for Constructive Entanglement Topologies in Quantum Machine Learning with Application to Cardiac MRI	Jul 15, 2025	Quantum Machine Learning	—Unverified	0
Local Pairwise Distance Matching for Backpropagation-Free Reinforcement Learning	Jul 15, 2025	Policy Gradient Methodsreinforcement-learning	—Unverified	0
Fairness-Aware Secure Integrated Sensing and Communications with Fractional Programming	Jul 15, 2025	FairnessISAC	—Unverified	0
Personalized Exercise Recommendation with Semantically-Grounded Knowledge Tracing	Jul 15, 2025	Knowledge TracingMath	CodeCode Available	0
Robust-Multi-Task Gradient Boosting	Jul 15, 2025	Multi-Task LearningTransfer Learning	CodeCode Available	0
Try Harder: Hard Sample Generation and Learning for Clothes-Changing Person Re-ID	Jul 15, 2025	Person Re-Identification	CodeCode Available	0
Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation	Jul 15, 2025	3D ReconstructionAutonomous Driving	—Unverified	0
Sensing Accuracy Optimization for Multi-UAV SAR Interferometry with Data Offloading	Jul 15, 2025	Deep Reinforcement LearningEvolutionary Algorithms	—Unverified	0
Recursive Bound-Constrained AdaGrad with Applications to Multilevel and Domain Decomposition Minimization	Jul 15, 2025	Computational Efficiency	—Unverified	0
KisMATH: Do LLMs Have Knowledge of Implicit Structures in Mathematical Reasoning?	Jul 15, 2025	GSM8KLanguage Modeling	—Unverified	0
SpaRTAN: Spatial Reinforcement Token-based Aggregation Network for Visual Recognition	Jul 15, 2025		CodeCode Available	0
LRCTI: A Large Language Model-Based Framework for Multi-Step Evidence Retrieval and Reasoning in Cyber Threat Intelligence Credibility Verification	Jul 15, 2025	Language ModelingLanguage Modelling	—Unverified	0
AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air	Jul 15, 2025	DenoisingSequential Decision Making	—Unverified	0
COLIBRI Fuzzy Model: Color Linguistic-Based Representation and Interpretation	Jul 15, 2025	AttributeMarketing	—Unverified	0
A Parallelizable Approach for Characterizing NE in Zero-Sum Games After a Linear Number of Iterations of Gradient Descent	Jul 15, 2025		CodeCode Available	0
PGT-I: Scaling Spatiotemporal GNNs with Memory-Efficient Distributed Training	Jul 15, 2025	graph partitioning	CodeCode Available	0
The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs	Jul 15, 2025	Code GenerationSafety Alignment	CodeCode Available	2
Seq vs Seq: An Open Suite of Paired Encoders and Decoders	Jul 15, 2025	DecoderLarge Language Model	CodeCode Available	2
DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering	Jul 15, 2025	BenchmarkingInstruction Following	CodeCode Available	2
Fairness-Aware Grouping for Continuous Sensitive Variables: Application for Debiasing Face Analysis with respect to Skin Tone	Jul 15, 2025	Fairness	CodeCode Available	1
CharaConsist: Fine-Grained Consistent Character Generation	Jul 15, 2025	Consistent Character GenerationImage Generation	CodeCode Available	2
MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network	Jul 15, 2025	Depth EstimationDepth Prediction	CodeCode Available	1
Latent Space Consistency for Sparse-View CT Reconstruction	Jul 15, 2025	Computed Tomography (CT)Contrastive Learning	—Unverified	0
Data Augmentation in Time Series Forecasting through Inverted Framework	Jul 15, 2025	Data AugmentationTime Series	—Unverified	0
Addressing Data Imbalance in Transformer-Based Multi-Label Emotion Detection with Weighted Loss	Jul 15, 2025		CodeCode Available	0
Step-wise Policy for Rare-tool Knowledge (SPaRK): Offline RL that Drives Diverse Tool Use in LLMs	Jul 15, 2025	DiversityMMLU	CodeCode Available	0
Interpretable Bayesian Tensor Network Kernel Machines with Automatic Rank and Feature Selection	Jul 15, 2025	feature selectionUncertainty Quantification	CodeCode Available	0
Learning to Tune Like an Expert: Interpretable and Scene-Aware Navigation via MLLM Reasoning and CVAE-Based Adaptation	Jul 15, 2025	Large Language ModelScene Understanding	CodeCode Available	1
SystolicAttention: Fusing FlashAttention within a Single Systolic Array	Jul 15, 2025	Scheduling	CodeCode Available	2
A Generalizable Physics-Enhanced State Space Model for Long-Term Dynamics Forecasting in Complex Environments	Jul 14, 2025		CodeCode Available	0
RDMA: Cost Effective Agent-Driven Rare Disease Discovery within Electronic Health Record Systems	Jul 14, 2025		CodeCode Available	0
Open-Source LLMs Collaboration Beats Closed-Source LLMs: A Scalable Multi-Agent System	Jul 14, 2025		CodeCode Available	0
Democratizing High-Fidelity Co-Speech Gesture Video Generation	Jul 14, 2025		—Unverified	0
SpeakerVid-5M: A Large-Scale High-Quality Dataset for Audio-Visual Dyadic Interactive Human Generation	Jul 14, 2025		—Unverified	0
Deep Recurrence for Dynamical Segmentation Models	Jul 14, 2025		CodeCode Available	0
DeepResearch^Eco: A Recursive Agentic Workflow for Complex Scientific Question Answering in Ecology	Jul 14, 2025		CodeCode Available	0
Can Multimodal Foundation Models Understand Schematic Diagrams? An Empirical Study on Information-Seeking QA over Scientific Papers	Jul 14, 2025		—Unverified	0
WASABI: A Metric for Evaluating Morphometric Plausibility of Synthetic Brain MRIs	Jul 14, 2025		CodeCode Available	0