The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 17851–17900 of 474278 papers

Title	Date	Tasks	Status	Hype
SASVi - Segment Any Surgical Video	Feb 12, 2025	SegmentationVideo Segmentation	CodeCode Available	1
HistoSmith: Single-Stage Histology Image-Label Generation via Conditional Latent Diffusion for Enhanced Cell Segmentation and Classification	Feb 12, 2025	Cell SegmentationImage Generation	CodeCode Available	1
InTAR: Inter-Task Auto-Reconfigurable Accelerator Design for High Data Volume Variation in DNNs	Feb 12, 2025	Computational Efficiency	CodeCode Available	1
Bidirectional Diffusion Bridge Models	Feb 12, 2025	Translation	CodeCode Available	1
Enhanced Load Forecasting with GAT-LSTM: Leveraging Grid and Temporal Features	Feb 12, 2025	Graph AttentionLoad Forecasting	CodeCode Available	1
IHEval: Evaluating Language Models on Following the Instruction Hierarchy	Feb 12, 2025	Instruction Following	CodeCode Available	1
Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions	Feb 12, 2025	Contrastive LearningImage Retrieval	CodeCode Available	1
HDT: Hierarchical Discrete Transformer for Multivariate Time Series Forecasting	Feb 12, 2025	Multivariate Time Series ForecastingTime Series	CodeCode Available	1
LDC-MTL: Balancing Multi-Task Learning through Scalable Loss Discrepancy Control	Feb 12, 2025	Bilevel OptimizationMulti-Task Learning	CodeCode Available	1
Measuring Diversity in Synthetic Datasets	Feb 12, 2025	ClassificationDiversity	CodeCode Available	1
Hierarchical Learning-based Graph Partition for Large-scale Vehicle Routing Problems	Feb 12, 2025	Reinforcement Learning (RL)	CodeCode Available	1
Heterogeneous Mixture of Experts for Remote Sensing Image Super-Resolution	Feb 12, 2025	Image Super-ResolutionMixture-of-Experts	CodeCode Available	1
Out-of-Distribution Detection on Graphs: A Survey	Feb 12, 2025	Anomaly DetectionGraph Anomaly Detection	CodeCode Available	1
Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models	Feb 12, 2025	AttributeDiagnostic	CodeCode Available	1
SelfElicit: Your Language Model Secretly Knows Where is the Relevant Evidence	Feb 12, 2025	Computational EfficiencyLanguage Modeling	CodeCode Available	1
Hi-End-MAE: Hierarchical encoder-driven masked autoencoders are stronger vision learners for medical image segmentation	Feb 12, 2025	Computational EfficiencyImage Segmentation	CodeCode Available	1
From Brainwaves to Brain Scans: A Robust Neural Network for EEG-to-fMRI Synthesis	Feb 11, 2025	EEGSSIM	CodeCode Available	1
Direct Ascent Synthesis: Revealing Hidden Generative Capabilities in Discriminative Models	Feb 11, 2025	Image GenerationStyle Transfer	CodeCode Available	1
Navigating Semantic Drift in Task-Agnostic Class-Incremental Learning	Feb 11, 2025	class-incremental learningClass Incremental Learning	CodeCode Available	1
EventEgo3D++: 3D Human Motion Capture from a Head-Mounted Event Camera	Feb 11, 2025		CodeCode Available	1
Time2Lang: Bridging Time-Series Foundation Models and Large Language Models for Health Sensing Beyond Prompting	Feb 11, 2025	Time Series	CodeCode Available	1
Explaining 3D Computed Tomography Classifiers with Counterfactuals	Feb 11, 2025	Computed Tomography (CT)counterfactual	CodeCode Available	1
EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering	Feb 11, 2025	Question AnsweringVideo Question Answering	CodeCode Available	1
Graph RAG-Tool Fusion	Feb 11, 2025	RAGRetrieval	CodeCode Available	1
DarwinLM: Evolutionary Structured Pruning of Large Language Models	Feb 11, 2025	Model Compression	CodeCode Available	1
TranSplat: Surface Embedding-guided 3D Gaussian Splatting for Transparent Object Manipulation	Feb 11, 2025	Depth CompletionTransparent objects	CodeCode Available	1
Towards Efficient and Multifaceted Computer-assisted Pronunciation Training Leveraging Hierarchical Selective State Space Model and Decoupled Cross-entropy Loss	Feb 11, 2025		CodeCode Available	1
Revisiting Non-Acyclic GFlowNets in Discrete Environments	Feb 11, 2025		CodeCode Available	1
Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering	Feb 11, 2025		CodeCode Available	1
MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI Classification	Feb 11, 2025	Contrastive LearningData Augmentation	CodeCode Available	1
Stay-Positive: A Case for Ignoring Real Image Features in Fake Image Detection	Feb 11, 2025	Fake Image Detection	CodeCode Available	1
BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models	Feb 11, 2025	Code GenerationInstruction Following	CodeCode Available	1
Small Language Model Makes an Effective Long Text Extractor	Feb 11, 2025	GPULanguage Modeling	CodeCode Available	1
Generative Modeling with Bayesian Sample Inference	Feb 11, 2025	Density EstimationImage Generation	CodeCode Available	1
EIQP: Execution-time-certified and Infeasibility-detecting QP Solver	Feb 11, 2025	C++ codeModel Predictive Control	CodeCode Available	1
Instance-dependent Early Stopping	Feb 11, 2025	Transfer Learning	CodeCode Available	1
PlaySlot: Learning Inverse Latent Dynamics for Controllable Object-Centric Video Prediction and Planning	Feb 11, 2025	ObjectVideo Prediction	CodeCode Available	1
Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples	Feb 11, 2025		CodeCode Available	1
Joint Modelling Histology and Molecular Markers for Cancer Classification	Feb 11, 2025	Cancer ClassificationPrognosis	CodeCode Available	1
VINP: Variational Bayesian Inference with Neural Speech Prior for Joint ASR-Effective Speech Dereverberation and Blind RIR Identification	Feb 11, 2025	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	1
Space-Aware Instruction Tuning: Dataset and Benchmark for Guide Dog Robots Assisting the Visually Impaired	Feb 11, 2025		CodeCode Available	1
Integrating Physics and Data-Driven Approaches: An Explainable and Uncertainty-Aware Hybrid Model for Wind Turbine Power Prediction	Feb 11, 2025	Fault Detectionquantile regression	CodeCode Available	1
Flow Matching for Collaborative Filtering	Feb 11, 2025	Collaborative FilteringRecommendation Systems	CodeCode Available	1
On Iterative Evaluation and Enhancement of Code Quality Using GPT-4o	Feb 11, 2025		CodeCode Available	1
MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces	Feb 11, 2025		CodeCode Available	1
Diffusion Suction Grasping with Large-Scale Parcel Dataset	Feb 11, 2025	Denoising	CodeCode Available	1
JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed Metadata	Feb 11, 2025	Language ModelingLanguage Modelling	CodeCode Available	1
MAAT: Mamba Adaptive Anomaly Transformer with association discrepancy for time series	Feb 11, 2025	Anomaly DetectionAnomaly Localization	CodeCode Available	1
Bag of Tricks for Inference-time Computation of LLM Reasoning	Feb 11, 2025	GPU	CodeCode Available	1
MiniF2F in Rocq: Automatic Translation Between Proof Assistants -- A Case Study	Feb 11, 2025	Translation	CodeCode Available	1