The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 17451–17500 of 474278 papers

Title	Date	Tasks	Status	Hype
On Efficient Estimation of Distributional Treatment Effects under Covariate-Adaptive Randomization	Jun 6, 2025	regressionvalid	CodeCode Available	0
Information Bargaining: Bilateral Commitment in Bayesian Persuasion	Jun 6, 2025	Fairness	CodeCode Available	0
Benchmarking Misuse Mitigation Against Covert Adversaries	Jun 6, 2025	BenchmarkingLanguage Modeling	CodeCode Available	0
Antithetic Noise in Diffusion Models	Jun 6, 2025	DiversityNegation	—Unverified	0
Robust sensor fusion against on-vehicle sensor staleness	Jun 6, 2025	Autonomous VehiclesData Augmentation	—Unverified	0
Object Navigation with Structure-Semantic Reasoning-Based Multi-level Map and Multimodal Decision-Making LLM	Jun 6, 2025	Decision MakingObject	—Unverified	0
Trajectory Entropy: Modeling Game State Stability from Multimodality Trajectory Prediction	Jun 6, 2025	Autonomous DrivingTrajectory Prediction	—Unverified	0
Unintended Harms of Value-Aligned LLMs: Psychological and Empirical Insights	Jun 6, 2025		CodeCode Available	0
Splat and Replace: 3D Reconstruction with Repetitive Elements	Jun 6, 2025	3DGS3D Reconstruction	—Unverified	0
ScriptDoctor: Automatic Generation of PuzzleScript Games via Large Language Models and Tree Search	Jun 6, 2025	Game DesignLarge Language Model	—Unverified	0
Proactive Assistant Dialogue Generation from Streaming Egocentric Videos	Jun 6, 2025	Dialogue Generation	—Unverified	0
DeepFake Doctor: Diagnosing and Treating Audio-Video Fake Detection	Jun 6, 2025	BenchmarkingDeepFake Detection	—Unverified	0
Hierarchical and Collaborative LLM-Based Control for Multi-UAV Motion and Communication in Integrated Terrestrial and Non-Terrestrial Networks	Jun 6, 2025	Motion Planning	—Unverified	0
Improving choice model specification using reinforcement learning	Jun 6, 2025	Deep Reinforcement Learningmodel	—Unverified	0
Saffron-1: Towards an Inference Scaling Paradigm for LLM Safety Assurance	Jun 6, 2025	Attribute	CodeCode Available	0
Variational Inference for Quantum HyperNetworks	Jun 6, 2025	Bayesian InferenceVariational Inference	—Unverified	0
Training-Free Query Optimization via LLM-Based Plan Similarity	Jun 6, 2025	Language ModelingLanguage Modelling	—Unverified	0
TADA: Training-free Attribution and Out-of-Domain Detection of Audio Deepfakes	Jun 6, 2025	DeepFake DetectionFace Swapping	CodeCode Available	0
Unlocking Chemical Insights: Superior Molecular Representations from Intermediate Encoder Layers	Jun 6, 2025	Computational chemistryComputational Efficiency	CodeCode Available	0
Recommender systems, stigmergy, and the tyranny of popularity	Jun 6, 2025	Recommendation SystemsWord Embeddings	—Unverified	0
Small Models, Big Support: A Local LLM Framework for Teacher-Centric Content Creation and Assessment using RAG and CAG	Jun 6, 2025	College PhysicsRAG	—Unverified	0
Evaluating AI-Powered Learning Assistants in Engineering Higher Education: Student Engagement, Ethical Challenges, and Policy Implications	Jun 6, 2025	Chatbot	—Unverified	0
The Geometry of Extended Kalman Filters on Manifolds with Affine Connection	Jun 6, 2025	State Estimation	—Unverified	0
Machine learning for in-situ composition mapping in a self-driving magnetron sputtering system	Jun 6, 2025	Active LearningGaussian Processes	—Unverified	0
Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning	Jun 6, 2025	Lifelong learningparameter-efficient fine-tuning	CodeCode Available	1
DesignBench: A Comprehensive Benchmark for MLLM-based Front-end Code Generation	Jun 6, 2025	Code Generation	CodeCode Available	1
Revealing hidden correlations from complex spatial distributions: Adjacent Correlation Analysis	Jun 6, 2025		CodeCode Available	1
Mapping correlations and coherence: adjacency-based approach to data visualization and regularity discovery	Jun 6, 2025	Data Visualization	CodeCode Available	1
Sequential Monte Carlo approximations of Wasserstein--Fisher--Rao gradient flows	Jun 6, 2025		CodeCode Available	0
When Better Features Mean Greater Risks: The Performance-Privacy Trade-Off in Contrastive Learning	Jun 6, 2025	Contrastive LearningInference Attack	CodeCode Available	0
Towards Efficient Multi-LLM Inference: Characterization and Analysis of LLM Routing and Hierarchical Techniques	Jun 6, 2025	BenchmarkingModel Selection	—Unverified	0
Learning Along the Arrow of Time: Hyperbolic Geometry for Backward-Compatible Representation Learning	Jun 6, 2025	Representation Learning	—Unverified	0
Scalable unsupervised feature selection via weight stability	Jun 6, 2025	feature selection	CodeCode Available	0
The Optimization Paradox in Clinical AI Multi-Agent Systems	Jun 6, 2025	Diagnostic	CodeCode Available	0
SDS-Net: Shallow-Deep Synergism-detection Network for infrared small target detection	Jun 6, 2025	Computational Efficiency	CodeCode Available	1
Domain Adaptation in Agricultural Image Analysis: A Comprehensive Review from Shallow Models to Deep Learning	Jun 6, 2025	Domain Adaptation	—Unverified	0
RecGPT: A Foundation Model for Sequential Recommendation	Jun 6, 2025	Decodermodel	CodeCode Available	2
Membership Inference Attacks for Unseen Classes	Jun 6, 2025	quantile regressionregression	—Unverified	0
DynamicMind: A Tri-Mode Thinking System for Large Language Models	Jun 6, 2025	Computational EfficiencyPrompt Engineering	—Unverified	0
Textile Analysis for Recycling Automation using Transfer Learning and Zero-Shot Foundation Models	Jun 6, 2025	SegmentationTransfer Learning	—Unverified	0
Securing Traffic Sign Recognition Systems in Autonomous Vehicles	Jun 6, 2025	Autonomous VehiclesData Augmentation	—Unverified	0
Graph Persistence goes Spectral	Jun 6, 2025	Graph Representation LearningRepresentation Learning	—Unverified	0
Large Language Models Can Be a Viable Substitute for Expert Political Surveys When a Shock Disrupts Traditional Measurement Approaches	Jun 6, 2025	Position	—Unverified	0
Distribution-Level AirComp for Wireless Federated Learning under Data Scarcity and Heterogeneity	Jun 6, 2025	Bayesian InferenceFederated Learning	—Unverified	0
Multi-Modal Multi-Task Federated Foundation Models for Next-Generation Extended Reality Systems: Towards Privacy-Preserving Distributed Intelligence in AR/VR/MR	Jun 6, 2025	Federated LearningMixed Reality	—Unverified	0
Future of Work with AI Agents: Auditing Automation and Augmentation Potential across the U.S. Workforce	Jun 6, 2025	AI Agent	—Unverified	0
When to use Graphs in RAG: A Comprehensive Analysis for Graph Retrieval-Augmented Generation	Jun 6, 2025	RAGRetrieval	CodeCode Available	3
Few Labels are all you need: A Weakly Supervised Framework for Appliance Localization in Smart-Meter Series	Jun 6, 2025	AllNon-Intrusive Load Monitoring	CodeCode Available	0
Bootstrapping World Models from Dynamics Models in Multimodal Foundation Models	Jun 6, 2025	Weakly-supervised Learning	CodeCode Available	0
Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation	Jun 6, 2025	Decoder	CodeCode Available	1