The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 16001–16050 of 474278 papers

Title	Date	Tasks	Status	Hype
Pushing the Limits of Extreme Weather: Constructing Extreme Heatwave Storylines with Differentiable Climate Models	Jun 12, 2025	Blocking	CodeCode Available	0
Efficiency Robustness of Dynamic Deep Learning Systems	Jun 12, 2025	Deep Learning	CodeCode Available	0
From Images to Insights: Explainable Biodiversity Monitoring with Plain Language Habitat Explanations	Jun 12, 2025	Causal Inference	CodeCode Available	0
MSTAR: Box-free Multi-query Scene Text Retrieval with Attention Recycling	Jun 12, 2025	16kRetrieval	CodeCode Available	0
IQE-CLIP: Instance-aware Query Embedding for Zero-/Few-shot Anomaly Detection in Medical Domain	Jun 12, 2025	Anomaly Detection	CodeCode Available	0
Transformer IMU Calibrator: Dynamic On-body IMU Calibration for Inertial Motion Capture	Jun 12, 2025		CodeCode Available	1
Automated Validation of Textual Constraints Against AutomationML via LLMs and SHACL	Jun 12, 2025	Language ModelingLanguage Modelling	CodeCode Available	0
A Study on Individual Spatiotemporal Activity Generation Method Using MCP-Enhanced Chain-of-Thought Large Language Models	Jun 12, 2025		CodeCode Available	0
Constructing and Evaluating Declarative RAG Pipelines in PyTerrier	Jun 12, 2025	Natural QuestionsRAG	CodeCode Available	1
Time Series Forecasting as Reasoning: A Slow-Thinking Approach with Reinforced LLMs	Jun 12, 2025	PhilosophyPrompt Engineering	CodeCode Available	2
Equivariant Neural Diffusion for Molecule Generation	Jun 12, 2025		CodeCode Available	0
AIR: Zero-shot Generative Model Adaptation with Iterative Refinement	Jun 12, 2025		CodeCode Available	0
LightKG: Efficient Knowledge-Aware Recommendations with Simplified GNN Architecture	Jun 12, 2025	Recommendation SystemsSelf-Supervised Learning	CodeCode Available	0
Technical Report with Proofs for A Full Picture in Conformance Checking: Efficiently Summarizing All Optimal Alignments	Jun 12, 2025	All	—Unverified	0
Advanced fraud detection using machine learning models: enhancing financial transaction security	Jun 12, 2025	Feature EngineeringFraud Detection	—Unverified	0
Uncertainty-Aware Deep Learning for Automated Skin Cancer Classification: A Comprehensive Evaluation	Jun 12, 2025	Cancer ClassificationLesion Classification	—Unverified	0
Dense Associative Memory with Epanechnikov Energy	Jun 12, 2025	Density Estimation	—Unverified	0
Leveraging 6DoF Pose Foundation Models For Mapping Marine Sediment Burial	Jun 12, 2025	Depth Estimation	CodeCode Available	0
Specification and Evaluation of Multi-Agent LLM Systems -- Prototype and Cybersecurity Applications	Jun 12, 2025	Code GenerationQuestion Answering	CodeCode Available	0
Precise Zero-Shot Pointwise Ranking with LLMs through Post-Aggregated Global Context Information	Jun 12, 2025	Document Ranking	CodeCode Available	0
Predicting function of evolutionarily implausible DNA sequences	Jun 12, 2025	Prediction	CodeCode Available	0
Prompts to Summaries: Zero-Shot Language-Guided Video Summarization	Jun 12, 2025	GPUQuery focused video summarization	—Unverified	0
PiPViT: Patch-based Visual Interpretable Prototypes for Retinal Image Analysis	Jun 12, 2025	Contrastive LearningDiagnostic	CodeCode Available	0
On the role of non-linear latent features in bipartite generative neural networks	Jun 12, 2025	Retrieval	—Unverified	0
Occlusion-Aware 3D Hand-Object Pose Estimation with Masked AutoEncoders	Jun 12, 2025	hand-object poseObject	—Unverified	0
M4V: Multi-Modal Mamba for Text-to-Video Generation	Jun 12, 2025	MambaText-to-Video Generation	—Unverified	0
OmniFluids: Unified Physics Pre-trained Modeling of Fluid Dynamics	Jun 12, 2025	Operator learning	—Unverified	0
ME: Trigger Element Combination Backdoor Attack on Copyright Infringement	Jun 12, 2025	Backdoor Attack	—Unverified	0
Reasoning RAG via System 1 or System 2: A Survey on Reasoning Agentic Retrieval-Augmented Generation for Industry Challenges	Jun 12, 2025	Decision MakingRAG	CodeCode Available	0
System Identification Using Kolmogorov-Arnold Networks: A Case Study on Buck Converters	Jun 12, 2025	Kolmogorov-Arnold Networksparameter estimation	—Unverified	0
ConTextTab: A Semantics-Aware Tabular In-Context Learner	Jun 12, 2025	In-Context LearningWorld Knowledge	CodeCode Available	2
ReconMOST: Multi-Layer Sea Temperature Reconstruction with Observations-Guided Diffusion	Jun 12, 2025		CodeCode Available	0
Post-Training Quantization for Video Matting	Jun 12, 2025	Image MattingModel Compression	—Unverified	0
On feature selection in double-imbalanced data settings: a Random Forest approach	Jun 12, 2025	feature selectionVariable Selection	—Unverified	0
SWDL: Stratum-Wise Difference Learning with Deep Laplacian Pyramid for Semi-Supervised 3D Intracranial Hemorrhage Segmentation	Jun 12, 2025	Image SegmentationMedical Image Segmentation	CodeCode Available	0
Towards Understanding Bias in Synthetic Data for Evaluation	Jun 12, 2025	Information Retrieval	CodeCode Available	0
Contrastive Matrix Completion with Denoising and Augmented Graph Views for Robust Recommendation	Jun 12, 2025	Contrastive LearningDenoising	CodeCode Available	0
ContextRefine-CLIP for EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2025	Jun 12, 2025	Cross-Modal RetrievalEnsemble Learning	CodeCode Available	0
Using Language and Road Manuals to Inform Map Reconstruction for Autonomous Driving	Jun 12, 2025	Autonomous DrivingAutonomous Navigation	—Unverified	0
LogiPlan: A Structured Benchmark for Logical Planning and Relational Reasoning in LLMs	Jun 12, 2025	Relational Reasoning	—Unverified	0
MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices	Jun 12, 2025	CPUGPU	—Unverified	0
LRSLAM: Low-rank Representation of Signed Distance Fields in Dense Visual SLAM System	Jun 12, 2025	Autonomous DrivingMixed Reality	—Unverified	0
DreamActor-H1: High-Fidelity Human-Product Demonstration Video Generation via Motion-designed Diffusion Transformers	Jun 12, 2025	Data AugmentationMarketing	—Unverified	0
GigaVideo-1: Advancing Video Generation via Automatic Feedback with 4 GPU-Hours Fine-Tuning	Jun 12, 2025	GPUVideo Generation	—Unverified	0
Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration	Jun 12, 2025	cross-modal alignmentImage to text	—Unverified	0
WGSR-Bench: Wargame-based Game-theoretic Strategic Reasoning Benchmark for Large Language Models	Jun 12, 2025	counterfactualCounterfactual Reasoning	—Unverified	0
Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs	Jun 12, 2025	Diversity	CodeCode Available	2
Self-learning signal classifier for decameter coherent scatter radars	Jun 12, 2025	Self-Learning	—Unverified	0
Demonstrating Multi-Suction Item Picking at Scale via Multi-Modal Learning of Pick Success	Jun 12, 2025	Robot ManipulationSemantic Segmentation	—Unverified	0
Macro Graph of Experts for Billion-Scale Multi-Task Recommendation	Jun 12, 2025	Multi-Task LearningRecommendation Systems	—Unverified	0