The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 12551–12600 of 474278 papers

Title	Date	Tasks	Status	Hype
An Empirical Evaluation of AI-Powered Non-Player Characters' Perceived Realism and Performance in Virtual Reality Environments	Jul 14, 2025	Speech-to-Texttext-to-speech	—Unverified	0
Warehouse Spatial Question Answering with LLM Agent	Jul 14, 2025	Question AnsweringSpatial Reasoning	CodeCode Available	1
WhisperKit: On-device Real-time ASR with Billion-Scale Transformers	Jul 14, 2025	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation	Jul 14, 2025		—Unverified	0
WildFX: A DAW-Powered Pipeline for In-the-Wild Audio FX Graph Modeling	Jul 14, 2025	Music Generation	CodeCode Available	1
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination	Jul 14, 2025	MathMathematical Reasoning	CodeCode Available	1
Iceberg: Enhancing HLS Modeling with Synthetic Data	Jul 14, 2025	Data AugmentationHigh-Level Synthesis	CodeCode Available	0
REST: Stress Testing Large Reasoning Models by Asking Multiple Problems at Once	Jul 14, 2025		CodeCode Available	1
A Simple Approximate Bayesian Inference Neural Surrogate for Stochastic Petri Net Models	Jul 14, 2025	Bayesian InferenceEpidemiology	CodeCode Available	0
MLAR: Multi-layer Large Language Model-based Robotic Process Automation Applicant Tracking	Jul 14, 2025	BenchmarkingLanguage Modeling	—Unverified	0
Wavelet-Enhanced Neural ODE and Graph Attention for Interpretable Energy Forecasting	Jul 14, 2025	Graph AttentionTime Series Prediction	—Unverified	0
Scene-Aware Conversational ADAS with Generative AI for Real-Time Driver Assistance	Jul 14, 2025	Autonomous Driving	—Unverified	0
On Gradual Semantics for Assumption-Based Argumentation	Jul 14, 2025		CodeCode Available	0
Text-Visual Semantic Constrained AI-Generated Image Quality Assessment	Jul 14, 2025	Image DescriptionImage Quality Assessment	CodeCode Available	1
Kaleidoscopic Background Attack: Disrupting Pose Estimation with Multi-Fold Radial Symmetry Textures	Jul 14, 2025	Camera Pose EstimationPose Estimation	—Unverified	0
Overcoming catastrophic forgetting in neural networks	Jul 14, 2025	Continual LearningL2 Regularization	—Unverified	0
Bridging Robustness and Generalization Against Word Substitution Attacks in NLP via the Growth Bound Matrix Approach	Jul 14, 2025	Adversarial DefenseAdversarial Robustness	CodeCode Available	0
LifelongPR: Lifelong knowledge fusion for point cloud place recognition based on replay and prompt learning	Jul 14, 2025	Autonomous DrivingContinual Learning	CodeCode Available	0
IM-LUT: Interpolation Mixing Look-Up Tables for Image Super-Resolution	Jul 14, 2025	Image Super-ResolutionSuper-Resolution	CodeCode Available	1
VoTranhAbyssCoreMicro and PoliticalCore: A Unified Framework for Simulating Complex Economic and Political Dynamics	Jul 14, 2025		CodeCode Available	0
Predictive Modeling: BIM Command Recommendation Based on Large-scale Usage Logs	Jul 13, 2025		CodeCode Available	0
TinyTroupe: An LLM-powered Multiagent Persona Simulation Toolkit	Jul 13, 2025		CodeCode Available	0
DRPCA-Net: Make Robust PCA Great Again for Infrared Small Target Detection	Jul 13, 2025		CodeCode Available	0
Auto-Regressively Generating Multi-View Consistent Images	Jul 13, 2025		CodeCode Available	0
SeqCSIST: Sequential Closely-Spaced Infrared Small Target Unmixing	Jul 13, 2025		CodeCode Available	0
EyeSeg: An Uncertainty-Aware Eye Segmentation Framework for AR/VR	Jul 13, 2025		CodeCode Available	0
Hear-Your-Click: Interactive Object-Specific Video-to-Audio Generation	Jul 13, 2025		CodeCode Available	0
ViSP: A PPO-Driven Framework for Sarcasm Generation with Contrastive Learning	Jul 13, 2025		CodeCode Available	0
When Schrödinger Bridge Meets Real-World Image Dehazing with Unpaired Training	Jul 13, 2025		CodeCode Available	0
Generative Cognitive Diagnosis	Jul 13, 2025		CodeCode Available	0
Efficient Multi-Person Motion Prediction by Lightweight Spatial and Temporal Interactions	Jul 13, 2025		CodeCode Available	0
Inter2Former: Dynamic Hybrid Attention for Efficient High-Precision Interactive	Jul 13, 2025	CPUInteractive Segmentation	—Unverified	0
Landmark Detection for Medical Images using a General-purpose Segmentation Model	Jul 13, 2025	Anatomical Landmark DetectionDiagnostic	—Unverified	0
Memory-Augmented SAM2 for Training-Free Surgical Video Segmentation	Jul 13, 2025	SegmentationSemantic Segmentation	—Unverified	0
Federated Learning with Graph-Based Aggregation for Traffic Forecasting	Jul 13, 2025	Federated LearningGraph Learning	—Unverified	0
Lightweight Federated Learning over Wireless Edge Networks	Jul 13, 2025	Bayesian OptimizationFederated Learning	—Unverified	0
Meta-Reinforcement Learning for Fast and Data-Efficient Spectrum Allocation in Dynamic Wireless Networks	Jul 13, 2025	Deep Reinforcement LearningFairness	—Unverified	0
Self-supervised pretraining of vision transformers for animal behavioral analysis and neural encoding	Jul 13, 2025	Action SegmentationContrastive Learning	—Unverified	0
VST-Pose: A Velocity-Integrated Spatiotem-poral Attention Network for Human WiFi Pose Estimation	Jul 13, 2025	3D Pose EstimationPose Estimation	CodeCode Available	0
FedGSCA: Medical Federated Learning with Global Sample Selector and Client Adaptive Adjuster under Label Noise	Jul 13, 2025	Federated Learningimage-classification	—Unverified	0
Token Compression Meets Compact Vision Transformers: A Survey and Comparative Evaluation for Edge AI	Jul 13, 2025	AI Agent	—Unverified	0
Prompt Engineering in Segment Anything Model: Methodologies, Applications, and Emerging Challenges	Jul 13, 2025	Image SegmentationPrompt Engineering	—Unverified	0
DRAGD: A Federated Unlearning Data Reconstruction Attack Based on Gradient Differences	Jul 13, 2025	Federated LearningReconstruction Attack	—Unverified	0
Ref-Long: Benchmarking the Long-context Referencing Capability of Long-context Language Models	Jul 13, 2025	AttributeBenchmarking	CodeCode Available	0
KEN: Knowledge Augmentation and Emotion Guidance Network for Multimodal Fake News Detection	Jul 13, 2025	Fake News DetectionMisinformation	—Unverified	0
BitParticle: Partializing Sparse Dual-Factors to Build Quasi-Synchronizing MAC Arrays for Energy-efficient DNNs	Jul 13, 2025	Scheduling	—Unverified	0
AI-Enhanced Pediatric Pneumonia Detection: A CNN-Based Approach Using Data Augmentation and Generative Adversarial Networks (GANs)	Jul 13, 2025	ClassificationData Augmentation	CodeCode Available	0
Planted in Pretraining, Swayed by Finetuning: A Case Study on the Origins of Cognitive Biases in LLMs	Jul 12, 2025		—Unverified	0
Fast3D: Accelerating 3D Multi-modal Large Language Models for Efficient 3D Scene Understanding	Jul 12, 2025		CodeCode Available	0
WellPINN: Accurate Well Representation for Transient Fluid Pressure Diffusion in Subsurface Reservoirs with Physics-Informed Neural Networks	Jul 12, 2025		CodeCode Available	0