The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 18401–18450 of 474278 papers

Title	Date	Tasks	Status	Hype
Enhancing Convergence, Privacy and Fairness for Wireless Personalized Federated Learning: Quantization-Assisted Min-Max Fair Scheduling	Jun 3, 2025	FairnessFederated Learning	—Unverified	0
Reconciling Hessian-Informed Acceleration and Scalar-Only Communication for Efficient Federated Zeroth-Order Fine-Tuning	Jun 3, 2025	Federated Learning	—Unverified	0
Probabilistic Online Event Downsampling	Jun 3, 2025	object-detectionObject Detection	—Unverified	0
Multi-Spectral Gaussian Splatting with Neural Color Representation	Jun 3, 2025	3DGSCamera Calibration	—Unverified	0
TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models	Jun 3, 2025	DecoderKnowledge Distillation	—Unverified	0
FlexPainter: Flexible and Multi-View Consistent Texture Generation	Jun 3, 2025	Texture Synthesis	—Unverified	0
A Machine Learning Theory Perspective on Strategic Litigation	Jun 3, 2025	Learning Theory	—Unverified	0
KVCache Cache in the Wild: Characterizing and Optimizing KVCache Cache at a Large Cloud Provider	Jun 3, 2025		CodeCode Available	2
GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal	Jun 3, 2025	object-detectionObject Detection	CodeCode Available	1
EyeNavGS: A 6-DoF Navigation Dataset and Record-n-Replay Software for Real-World 3DGS Scenes in VR	Jun 3, 2025	3DGS	CodeCode Available	0
SViMo: Synchronized Diffusion for Video and Motion Generation in Hand-object Interaction Scenarios	Jun 3, 2025	Motion GenerationVideo Generation	CodeCode Available	1
Comparative Analysis of AI Agent Architectures for Entity Relationship Classification	Jun 3, 2025	AI AgentRelation	CodeCode Available	0
CyberGym: Evaluating AI Agents' Cybersecurity Capabilities with Real-World Vulnerabilities at Scale	Jun 3, 2025	Large Language Model	CodeCode Available	2
Causal Explainability of Machine Learning in Heart Failure Prediction from Electronic Health Records	Jun 3, 2025	Causal DiscoveryFeature Importance	—Unverified	0
Generative AI for Predicting 2D and 3D Wildfire Spread: Beyond Physics-Based Models and Traditional Deep Learning	Jun 3, 2025	Edge-computing	—Unverified	0
PartComposer: Learning and Composing Part-Level Concepts from Single-Image Examples	Jun 3, 2025	Disentanglement	—Unverified	0
From Theory to Practice with RAVEN-UCB: Addressing Non-Stationarity in Multi-Armed Bandits through Variance Adaptation	Jun 3, 2025	Multi-Armed Bandits	CodeCode Available	0
Occlusion-Aware Ground Target Tracking by a Dubins Vehicle using Visibility Volumes	Jun 3, 2025	Position	CodeCode Available	0
Comparison of different Unique hard attention transformer models by the formal languages they can recognize	Jun 3, 2025	Hard AttentionSurvey	—Unverified	0
VolTex: Food Volume Estimation using Text-Guided Segmentation and Neural Surface Reconstruction	Jun 3, 2025	ManagementNutrition	CodeCode Available	0
Designing Algorithmic Delegates: The Role of Indistinguishability in Human-AI Handoff	Jun 3, 2025	AI AgentDecision Making	—Unverified	0
How Explanations Leak the Decision Logic: Stealing Graph Neural Networks via Explanation Alignment	Jun 3, 2025	Data AugmentationDrug Discovery	CodeCode Available	0
BitBypass: A New Direction in Jailbreaking Aligned Large Language Models with Bitstream Camouflage	Jun 3, 2025	Prompt EngineeringRed Teaming	CodeCode Available	0
Labelling Data with Unknown References	Jun 3, 2025		CodeCode Available	0
HumanRAM: Feed-forward Human Reconstruction and Animation Model using Transformers	Jun 3, 2025	3D Human ReconstructionDecoder	—Unverified	0
Enhancing Automatic PT Tagging for MEDLINE Citations Using Transformer-Based Models	Jun 3, 2025	Retrieval	—Unverified	0
Overcoming Challenges of Partial Client Participation in Federated Learning : A Comprehensive Review	Jun 3, 2025	FairnessFederated Learning	—Unverified	0
A Review of Various Datasets for Machine Learning Algorithm-Based Intrusion Detection System: Advances and Challenges	Jun 3, 2025	Intrusion Detection	—Unverified	0
MISLEADER: Defending against Model Extraction with Ensembles of Distilled Models	Jun 3, 2025	Bilevel OptimizationData Augmentation	CodeCode Available	0
A Multimodal, Multilingual, and Multidimensional Pipeline for Fine-grained Crowdsourcing Earthquake Damage Evaluation	Jun 3, 2025		CodeCode Available	0
PhysGaia: A Physics-Aware Dataset of Multi-Body Interactions for Dynamic Novel View Synthesis	Jun 3, 2025	Novel View SynthesisScene Understanding	CodeCode Available	1
Impact of Rankings and Personalized Recommendations in Marketplaces	Jun 3, 2025	Navigate	—Unverified	0
Dense Match Summarization for Faster Two-view Estimation	Jun 3, 2025		CodeCode Available	1
Cell-o1: Training LLMs to Solve Single-Cell Reasoning Puzzles with Reinforcement Learning	Jun 3, 2025		CodeCode Available	1
VPI-Bench: Visual Prompt Injection Attacks for Computer-Use Agents	Jun 3, 2025		CodeCode Available	1
ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding	Jun 2, 2025	Action RecognitionVideo Understanding	—Unverified	0
Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains	Jun 2, 2025	MathReinforcement Learning (RL)	—Unverified	0
EvolveNav: Self-Improving Embodied Reasoning for LLM-Based Vision-Language Navigation	Jun 2, 2025	NavigateVision-Language Navigation	CodeCode Available	0
Medical World Model: Generative Simulation of Tumor Evolution for Treatment Planning	Jun 2, 2025	Decision MakingSpecificity	—Unverified	0
Small Language Models are the Future of Agentic AI	Jun 2, 2025	AI AgentPosition	—Unverified	0
LAM SIMULATOR: Advancing Data Generation for Large Action Model Training via Online Exploration and Trajectory Feedback	Jun 2, 2025	Large Language Model	—Unverified	0
FormFactory: An Interactive Benchmarking Suite for Multimodal Form-Filling Agents	Jun 2, 2025	BenchmarkingForm	—Unverified	0
Enhancing Interpretable Image Classification Through LLM Agents and Conditional Concept Bottleneck Models	Jun 2, 2025	image-classificationImage Classification	—Unverified	0
CVC: A Large-Scale Chinese Value Rule Corpus for Value Alignment of Large Language Models	Jun 2, 2025	Benchmarking	CodeCode Available	0
PGPO: Enhancing Agent Reasoning via Pseudocode-style Planning Guided Preference Optimization	Jun 2, 2025	Language ModelingLanguage Modelling	—Unverified	0
Can We Trust Machine Learning? The Reliability of Features from Open-Source Speech Analysis Tools for Speech Modeling	Jun 2, 2025	Fairness	—Unverified	0
MODS: Multi-source Observations Conditional Diffusion Model for Meteorological State Downscaling	Jun 2, 2025	Spatial Interpolation	—Unverified	0
Embedded Acoustic Intelligence for Automotive Systems	Jun 2, 2025	Autonomous Driving	—Unverified	0
Alternates, Assemble! Selecting Optimal Alternates for Citizens' Assemblies	Jun 2, 2025	Computational Efficiency	—Unverified	0
Cross-Lingual Transfer of Cultural Knowledge: An Asymmetric Phenomenon	Jun 2, 2025	Cross-Lingual TransferDiversity	—Unverified	0