The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 13801–13850 of 474278 papers

Title	Date	Tasks	Status	Hype
Few-Shot Bearing Fault Diagnosis Via Ensembling Transformer-Based Model With Mahalanobis Distance Metric Learning From Multiscale Features	Mar 25, 2024	ClassificationFault Diagnosis	CodeCode Available	2
DGFont++: Robust Deformable Generative Networks for Unsupervised Font Generation	Dec 30, 2022	Font GenerationImage-to-Image Translation	CodeCode Available	2
YOLOv5-6D: Advancing 6-DoF Instrument Pose Estimation in Variable X-Ray Imaging Geometries	Mar 22, 2024	6D Pose Estimation using RGBGPU	CodeCode Available	2
Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs	Feb 4, 2025	Code GenerationLanguage Modeling	CodeCode Available	2
Analysing the Residual Stream of Language Models Under Knowledge Conflicts	Oct 21, 2024		CodeCode Available	2
JAILJUDGE: A Comprehensive Jailbreak Judge Benchmark with Multi-Agent Enhanced Explanation Evaluation Framework	Oct 11, 2024		CodeCode Available	2
Hypergraph Neural Networks	Sep 25, 2018	Object RecognitionRepresentation Learning	CodeCode Available	2
Peeling Back the Layers: An In-Depth Evaluation of Encoder Architectures in Neural News Recommenders	Oct 2, 2024	Model SelectionNews Recommendation	CodeCode Available	2
Efficient Non-stationary Online Learning by Wavelets with Applications to Online Distribution Shift Adaptation	Jul 21, 2024		CodeCode Available	2
ViSpeak: Visual Instruction Feedback in Streaming Videos	Mar 17, 2025	Streaming video understandingVideo Understanding	CodeCode Available	2
SatMAE: Pre-training Transformers for Temporal and Multi-Spectral Satellite Imagery	Jul 17, 2022	Land Cover ClassificationSemantic Segmentation	CodeCode Available	2
Self-Prompting Polyp Segmentation in Colonoscopy using Hybrid Yolo-SAM 2 Model	Sep 14, 2024	Medical Image SegmentationPolyp Segmentation	CodeCode Available	2
Detection Transformer with Stable Matching	Apr 10, 2023	DecoderPosition	CodeCode Available	2
Chain-of-Thought Reasoning Without Prompting	Feb 15, 2024	Prompt Engineering	CodeCode Available	2
Domain Adaptation with a Single Vision-Language Embedding	Oct 28, 2024	Domain AdaptationOne-shot Unsupervised Domain Adaptation	CodeCode Available	2
An Efficient Post-hoc Framework for Reducing Task Discrepancy of Text Encoders for Composed Image Retrieval	Jun 13, 2024	Contrastive LearningImage Retrieval	CodeCode Available	2
HypoBench: Towards Systematic and Principled Benchmarking for Hypothesis Generation	Apr 15, 2025	Benchmarkingscientific discovery	CodeCode Available	2
Prototype-based Cross-Modal Object Tracking	Dec 22, 2023	ObjectObject Tracking	CodeCode Available	2
BatGPT: A Bidirectional Autoregessive Talker from Generative Pre-trained Transformer	Jul 1, 2023	Language ModelingLanguage Modelling	CodeCode Available	2
TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models	Aug 1, 2024		CodeCode Available	2
1st Place Solution of Multiview Egocentric Hand Tracking Challenge ECCV2024	Sep 28, 2024	Position	CodeCode Available	2
C^2LEVA: Toward Comprehensive and Contamination-Free Language Model Evaluation	Dec 6, 2024	Language Model EvaluationLanguage Modeling	CodeCode Available	2
Region Rebalance for Long-Tailed Semantic Segmentation	Apr 5, 2022	SegmentationSemantic Segmentation	CodeCode Available	2
NLLB-CLIP -- train performant multilingual image retrieval model on a budget	Sep 4, 2023	Image RetrievalRetrieval	CodeCode Available	2
TMR: Text-to-Motion Retrieval Using Contrastive 3D Human Motion Synthesis	May 2, 2023	Moment RetrievalMotion Generation	CodeCode Available	2
Gaussian Processes for Big Data	Sep 26, 2013	Gaussian ProcessesVariational Inference	CodeCode Available	2
DetGPT: Detect What You Need via Reasoning	May 23, 2023	Autonomous DrivingObject	CodeCode Available	2
HPT++: Hierarchically Prompting Vision-Language Models with Multi-Granularity Knowledge Generation and Improved Structure Modeling	Aug 27, 2024	Domain GeneralizationPrompt Engineering	CodeCode Available	2
GAIA: a benchmark for General AI Assistants	Nov 21, 2023	Philosophy	CodeCode Available	2
WMT24++: Expanding the Language Coverage of WMT24 to 55 Languages & Dialects	Feb 18, 2025	Machine Translation	CodeCode Available	2
Seeing through Satellite Images at Street Views	May 22, 2025		CodeCode Available	2
Large Language Models are In-Context Molecule Learners	Mar 7, 2024	Cross-Modal RetrievalIn-Context Learning	CodeCode Available	2
Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models	Dec 19, 2023	DenoisingNeural Architecture Search	CodeCode Available	2
Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent	May 12, 2025	RAGReinforcement Learning (RL)	CodeCode Available	2
Deduplicating Training Data Mitigates Privacy Risks in Language Models	Feb 14, 2022		CodeCode Available	2
RandAugment: Practical automated data augmentation with a reduced search space	Sep 30, 2019	Data AugmentationDomain Generalization	CodeCode Available	2
Mamba-R: Vision Mamba ALSO Needs Registers	May 23, 2024	MambaSemantic Segmentation	CodeCode Available	2
The Brain Tumor Segmentation (BraTS) Challenge 2023: Focus on Pediatrics (CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs)	May 26, 2023	BenchmarkingBrain Tumor Segmentation	CodeCode Available	2
Structured Denoising Diffusion Models in Discrete State-Spaces	Jul 7, 2021	DenoisingText Generation	CodeCode Available	2
Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models	May 30, 2024		CodeCode Available	2
Neural Responding Machine for Short-Text Conversation	Mar 9, 2015	DecoderRetrieval	CodeCode Available	2
Neural Lander: Stable Drone Landing Control using Learned Dynamics	Nov 19, 2018		CodeCode Available	2
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate	Jan 29, 2025	Instruction FollowingMath	CodeCode Available	2
Scaling up Differentially Private Deep Learning with Fast Per-Example Gradient Clipping	Sep 7, 2020	GPU	CodeCode Available	2
Interpreting the Latent Space of GANs for Semantic Face Editing	Jul 25, 2019	AttributeDisentanglement	CodeCode Available	2
Improving RetinaNet for CT Lesion Detection with Dense Masks from Weak RECIST Labels	Jun 5, 2019	Computed Tomography (CT)Lesion Detection	CodeCode Available	2
NeuralUQ: A comprehensive library for uncertainty quantification in neural differential equations and operators	Aug 25, 2022	Uncertainty Quantification	CodeCode Available	2
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models	Jun 29, 2023	Audio Synthesis	CodeCode Available	2
Double Difference Earthquake Location with Graph Neural Networks	Oct 25, 2024	Graph Neural Network	CodeCode Available	2
A Library for Representing Python Programs as Graphs for Machine Learning	Aug 15, 2022		CodeCode Available	2