The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 14101–14150 of 474278 papers

Title	Date	Tasks	Status	Hype
MIRAGE: A Benchmark for Multimodal Information-Seeking and Reasoning in Agricultural Expert-Guided Conversations	Jun 25, 2025	World Knowledge	Code Available	0
High-Resolution Live Fuel Moisture Content (LFMC) Maps for Wildfire Risk from Multimodal Earth Observation Data	Jun 25, 2025	Earth Observation	Code Available	1
MMSearch-R1: Incentivizing LMMs to Search	Jun 25, 2025	RAG, Retrieval-augmented Generation	Code Available	3
Loss-Aware Automatic Selection of Structured Pruning Criteria for Deep Neural Network Acceleration	Jun 25, 2025		Code Available	1
Disentangled representations of microscopy images	Jun 25, 2025	Classification, Disentanglement	Code Available	0
AUTOMATIC PRONUNCIATION MISTAKE DETECTOR PROJECT REPORT	Jun 25, 2025	Mistake Detection, speech-recognition	Unverified	0
AN INTERNSHIP REPORT ON E-HELPING HOUSING SOCIETY PROJECT REPORT	Jun 25, 2025	All, Management	Unverified	0
Causal-Paced Deep Reinforcement Learning	Jun 24, 2025		Code Available	0
ICP-3DGS: SfM-free 3D Gaussian Splatting for Large-scale Unbounded Scenes	Jun 24, 2025		Code Available	0
Ark: An Open-source Python-based Framework for Robot Learning	Jun 24, 2025	Imitation Learning, Motion Planning	Unverified	0
GBGC: Efficient and Adaptive Graph Coarsening via Granular-ball Computing	Jun 24, 2025		Code Available	0
Ancient Script Image Recognition and Processing: A Review	Jun 24, 2025	Decipherment, Few-Shot Learning	Unverified	0
Open-Vocabulary Camouflaged Object Segmentation with Cascaded Vision Language Models	Jun 24, 2025	Camouflaged Object Segmentation, Segmentation	Code Available	1
NaviAgent: Bilevel Planning on Tool Dependency Graphs for Function Calling	Jun 24, 2025	Heuristic Search	Unverified	0
Progressive Size-Adaptive Federated Learning: A Comprehensive Framework for Heterogeneous Multi-Modal Data Systems	Jun 24, 2025	Federated Learning, Time Series	Unverified	0
HOIverse: A Synthetic Scene Graph Dataset With Human Object Interactions	Jun 24, 2025	Graph Generation, Human-Object Interaction Detection	Unverified	0
KunLunBaizeRAG: Reinforcement Learning Driven Inference Performance Leap for Large Language Models	Jun 24, 2025	Multi-hop Question Answering, Question Answering	Unverified	0
Learning Task Belief Similarity with Latent Dynamics for Meta-Reinforcement Learning	Jun 24, 2025	Meta Reinforcement Learning, MuJoCo	Code Available	0
MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration	Jun 24, 2025	Diagnostic, Medical Diagnosis	Code Available	1
Augmenting Multi-Agent Communication with State Delta Trajectory	Jun 24, 2025		Code Available	1
Has Machine Translation Evaluation Achieved Human Parity? The Human Reference and the Limits of Progress	Jun 24, 2025	Machine Translation	Code Available	0
Multi-Preference Lambda-weighted Listwise DPO for Dynamic Preference Alignment	Jun 24, 2025	Informativeness, reinforcement-learning	Code Available	0
From Memories to Maps: Mechanisms of In-Context Reinforcement Learning in Transformers	Jun 24, 2025	In-Context Learning, In-Context Reinforcement Learning	Unverified	0
Behavioral Anomaly Detection in Distributed Systems via Federated Contrastive Learning	Jun 24, 2025	Anomaly Detection, Contrastive Learning	Unverified	0
Tailored Conversations beyond LLMs: A RL-Based Dialogue Manager	Jun 24, 2025	Hierarchical Reinforcement Learning, Meta-Learning	Unverified	0
FAF: A Feature-Adaptive Framework for Few-Shot Time Series Forecasting	Jun 24, 2025	Meta-Learning, Time Series	Unverified	0
CoCo4D: Comprehensive and Complex 4D Scene Generation	Jun 24, 2025	Scene Generation	Unverified	0
SAM2-SGP: Enhancing SAM2 for Medical Image Segmentation via Support-Set Guided Prompting	Jun 24, 2025	Computed Tomography (CT), Image Segmentation	Code Available	0
Overtuning in Hyperparameter Optimization	Jun 24, 2025	AutoML, Hyperparameter Optimization	Code Available	0
Scaling Speculative Decoding with Lookahead Reasoning	Jun 24, 2025	GPU, GSM8K	Code Available	0
From Data Acquisition to Lag Modeling: Quantitative Exploration of A-Share Market with Low-Coupling System Design	Jun 24, 2025	Dynamic Time Warping	Unverified	0
Posterior Cramér-Rao Bounds on Localization and Mapping Errors in Distributed MIMO SLAM	Jun 24, 2025	Position, Simultaneous Localization and Mapping	Unverified	0
Revisiting R: Statistical Envelope Analysis for Lightweight RF Modulation Classification	Jun 24, 2025	Classification	Unverified	0
A Wireless Self-Calibrating Ultrasound Microphone Array with Sub-Microsecond Synchronization	Jun 24, 2025	Sound Source Localization	Unverified	0
Reconfigurable Intelligent Surfaces for 6G and Beyond: A Comprehensive Survey from Theory to Deployment	Jun 24, 2025	Survey	Unverified	0
From High-SNR Radar Signal to ECG: A Transfer Learning Model with Cardio-Focusing Algorithm for Scenarios with Limited Data	Jun 24, 2025	Transfer Learning	Unverified	0
A standard transformer and attention with linear biases for molecular conformer generation	Jun 24, 2025	Drug Discovery	Unverified	0
The time course of visuo-semantic representations in the human brain is captured by combining vision and language models	Jun 24, 2025	EEG	Unverified	0
[Beat-to-beat AV nodal assessment] ECG-based beat-to-beat assessment of AV node conduction properties during AF	Jun 24, 2025	Prognosis	Unverified	0
Generate the Forest before the Trees -- A Hierarchical Diffusion model for Climate Downscaling	Jun 24, 2025		Code Available	0
Exact Matrix Seriation through Mathematical Optimization: Stress and Effectiveness-Based Models	Jun 24, 2025	Anomaly Detection	Code Available	0
When Can We Reuse a Calibration Set for Multiple Conformal Predictions?	Jun 24, 2025	Conformal Prediction, Prediction	Unverified	0
ADDQ: Adaptive Distributional Double Q-Learning	Jun 24, 2025	Distributional Reinforcement Learning, MuJoCo	Code Available	0
Toward Decision-Oriented Prognostics: An Integrated Estimate-Optimize Framework for Predictive Maintenance	Jun 24, 2025	Decision Making	Unverified	0
The Shape of Consumer Behavior: A Symbolic and Topological Analysis of Time Series	Jun 24, 2025	Clustering, Marketing	Unverified	0
ProCaliper: functional and structural analysis, visualization, and annotation of proteins	Jun 24, 2025		Code Available	0
Training Flexible Models of Genetic Variant Effects from Functional Annotations using Accelerated Linear Algebra	Jun 24, 2025		Code Available	0
Toward the Explainability of Protein Language Models for Sequence Design	Jun 24, 2025	Explainable artificial intelligence, Explainable Artificial Intelligence (XAI)	Unverified	0
Neural Collapse based Deep Supervised Federated Learning for Signal Detection in OFDM Systems	Jun 24, 2025	Federated Learning	Unverified	0
Cross-regularization: Adaptive Model Complexity through Validation Gradients	Jun 24, 2025	Data Augmentation	Unverified	0