The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 15601–15650 of 474278 papers

Title	Date	Tasks	Status	Hype
A Review of the Long Horizon Forecasting Problem in Time Series Analysis	Jun 15, 2025	Multivariate Time Series ForecastingTime Series	CodeCode Available	0
The Synthetic Mirror -- Synthetic Data at the Age of Agentic AI	Jun 15, 2025	Synthetic Data Generation	—Unverified	0
Serving Large Language Models on Huawei CloudMatrix384	Jun 15, 2025	Mixture-of-ExpertsQuantization	—Unverified	0
Using Neurogram Similarity Index Measure (NSIM) to Model Hearing Loss and Cochlear Neural Degeneration	Jun 15, 2025	Phoneme Recognition	—Unverified	0
Bridging Data-Driven and Physics-Based Models: A Consensus Multi-Model Kalman Filter for Robust Vehicle State Estimation	Jun 15, 2025	Autonomous DrivingState Estimation	—Unverified	0
SC-SOT: Conditioning the Decoder on Diarized Speaker Information for End-to-End Overlapped Speech Recognition	Jun 15, 2025	Decoderspeaker-diarization	—Unverified	0
Homeostatic Coupling for Prosocial Behavior	Jun 15, 2025	Multi-agent Reinforcement Learning	—Unverified	0
MaskPro: Linear-Space Probabilistic Learning for Strict (N:M)-Sparsity on Large Language Models	Jun 15, 2025		CodeCode Available	0
SciSage: A Multi-Agent Framework for High-Quality Scientific Survey Generation	Jun 15, 2025	Language ModelingLanguage Modelling	—Unverified	0
Differentially Private Bilevel Optimization: Efficient Algorithms with Near-Optimal Rates	Jun 15, 2025	Bilevel OptimizationHyperparameter Optimization	—Unverified	0
PDEfuncta: Spectrally-Aware Neural Representation for PDE Solution Modeling	Jun 15, 2025	Meta-Learning	—Unverified	0
KungfuBot: Physics-Based Humanoid Whole-Body Control for Learning Highly-Dynamic Skills	Jun 15, 2025	Humanoid Control	—Unverified	0
Zero-shot denoising via neural compression: Theoretical and algorithmic framework	Jun 15, 2025	Denoising	CodeCode Available	0
iDiT-HOI: Inpainting-based Hand Object Interaction Reenactment via Video Diffusion Transformer	Jun 15, 2025	ObjectVideo Generation	—Unverified	0
Magnetoencephalography (MEG) Based Non-Invasive Chinese Speech Decoding	Jun 15, 2025		CodeCode Available	0
BeyondRPC: A Contrastive and Augmentation-Driven Framework for Robust Point Cloud Understanding	Jun 15, 2025	Point Cloud ClassificationRepresentation Learning	CodeCode Available	0
TCANet: A Temporal Convolutional Attention Network for Motor Imagery EEG Decoding	Jun 15, 2025	Brain Computer InterfaceEEG	CodeCode Available	1
M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation	Jun 15, 2025	ObjectSemantic Segmentation	CodeCode Available	1
Focusing on Tracks for Online Multi-Object Tracking	Jun 15, 2025	global-optimizationMulti-Object Tracking	CodeCode Available	2
MM-R5: MultiModal Reasoning-Enhanced ReRanker via Reinforcement Learning for Document Retrieval	Jun 14, 2025	Instruction FollowingMultimodal Reasoning	CodeCode Available	0
Structural feature enhanced transformer for fine-grained image recognition	Jun 14, 2025	Computational EfficiencyFine-Grained Image Classification	—Unverified	0
CORONA: A Coarse-to-Fine Framework for Graph-based Recommendation with Large Language Models	Jun 14, 2025	Collaborative FilteringRecommendation Systems	—Unverified	0
RealFactBench: A Benchmark for Evaluating Large Language Models in Real-World Fact-Checking	Jun 14, 2025	Explanation GenerationFact Checking	CodeCode Available	0
Is your batch size the problem? Revisiting the Adam-SGD gap in language modeling	Jun 14, 2025	Language ModelingLanguage Modelling	—Unverified	0
Quantizing Small-Scale State-Space Models for Edge AI	Jun 14, 2025	QuantizationState Space Models	—Unverified	0
Beyond Sin-Squared Error: Linear-Time Entrywise Uncertainty Quantification for Streaming PCA	Jun 14, 2025	Uncertainty Quantification	—Unverified	0
A Transfer Learning Framework for Multilayer Networks via Model Averaging	Jun 14, 2025	Link PredictionPrivacy Preserving	—Unverified	0
Interpretable Causal Representation Learning for Biological Data in the Pathway Space	Jun 14, 2025	Representation Learning	—Unverified	0
Understanding the Effect of Knowledge Graph Extraction Error on Downstream Graph Analyses: A Case Study on Affiliation Graphs	Jun 14, 2025	Community DetectionKnowledge Graphs	—Unverified	0
From Ground to Sky: Architectures, Applications, and Challenges Shaping Low-Altitude Wireless Networks	Jun 14, 2025	Integrated sensing and communicationISAC	—Unverified	0
Automated Heuristic Design for Unit Commitment Using Large Language Models	Jun 14, 2025	Scheduling	—Unverified	0
Instantaneous Failure, Repair and Mobility Rates for Markov Reliability Systems: A Wind-Farm application	Jun 14, 2025	Scheduling	—Unverified	0
Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction	Jun 14, 2025	cross-modal alignment	—Unverified	0
Efficient Star Distillation Attention Network for Lightweight Image Super-Resolution	Jun 14, 2025	Image Super-ResolutionRepresentation Learning	—Unverified	0
Deploying and Evaluating Multiple Deep Learning Models on Edge Devices for Diabetic Retinopathy Detection	Jun 14, 2025	Diabetic Retinopathy DetectionGPU	—Unverified	0
Wasserstein-Barycenter Consensus for Cooperative Multi-Agent Reinforcement Learning	Jun 14, 2025	Multi-agent Reinforcement Learningreinforcement-learning	—Unverified	0
ECLIP: Energy-efficient and Practical Co-Location of ML Inference on Spatially Partitioned GPUs	Jun 14, 2025	GPU	—Unverified	0
Adaptive Multi-resolution Hash-Encoding Framework for INR-based Dental CBCT Reconstruction with Truncated FOV	Jun 14, 2025	Computational EfficiencyComputed Tomography (CT)	—Unverified	0
Relative Entropy Regularized Reinforcement Learning for Efficient Encrypted Policy Synthesis	Jun 14, 2025	Model-based Reinforcement LearningPrivacy Preserving	—Unverified	0
Less Conservative Adaptive Gain-scheduling Control for Continuous-time Systems with Polytopic Uncertainties	Jun 14, 2025	Scheduling	—Unverified	0
Behavioral Generative Agents for Energy Operations	Jun 14, 2025	Decision Makingenergy management	—Unverified	0
Step-by-Step Reasoning Attack: Revealing 'Erased' Knowledge in Large Language Models	Jun 14, 2025	Misinformation	—Unverified	0
Second Order State Hallucinations for Adversarial Attack Mitigation in Formation Control of Multi-Agent Systems	Jun 14, 2025	Adversarial AttackHallucination	—Unverified	0
The Foundation Cracks: A Comprehensive Study on Bugs and Testing Practices in LLM Libraries	Jun 14, 2025	Bug fixingInference Optimization	—Unverified	0
Detecting Narrative Shifts through Persistent Structures: A Topological Analysis of Media Discourse	Jun 14, 2025	Articles	—Unverified	0
FlexRAG: A Flexible and Comprehensive Framework for Retrieval-Augmented Generation	Jun 14, 2025	Language ModelingLanguage Modelling	CodeCode Available	3
SPIRE: Conditional Personalization for Federated Diffusion Generative Models	Jun 14, 2025	Federated Learning	—Unverified	0
A Gradient Meta-Learning Joint Optimization for Beamforming and Antenna Position in Pinching-Antenna Systems	Jun 14, 2025	Meta-LearningPosition	—Unverified	0
OpenUnlearning: Accelerating LLM Unlearning via Unified Benchmarking of Methods and Metrics	Jun 14, 2025	Benchmarking	CodeCode Available	4
Cross-Domain Conditional Diffusion Models for Time Series Imputation	Jun 14, 2025	DenoisingDomain Adaptation	CodeCode Available	0