The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers, 248,326 code links, 4,818 tasks

Papers

Showing 13901–13950 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
Retrieval Oriented Masking Pre-training Language Model for Dense Passage Retrieval	Oct 27, 2022	Language Modeling, Language Modelling	Code Available	2	5
TCMBench: A Comprehensive Benchmark for Evaluating Large Language Models in Traditional Chinese Medicine	Jun 3, 2024	Benchmarking, Question Answering	Code Available	2	5
Synchromesh: Reliable code generation from pre-trained language models	Jan 26, 2022	Code Generation, Language Modeling	Code Available	2	5
RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models	Jul 9, 2024	Decoder, Scheduling	Code Available	2	5
BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models	May 7, 2024		Code Available	2	5
AddSR: Accelerating Diffusion-based Blind Super-Resolution with Adversarial Diffusion Distillation	Apr 2, 2024	Blind Super-Resolution, Super-Resolution	Code Available	2	5
Machine Learning in Asset Management—Part 1: Portfolio Construction—Trading Strategies	Feb 10, 2020	Algorithmic Trading, Asset Management	Code Available	2	5
Towards Automatically-Tuned Deep Neural Networks	May 18, 2019	AutoML, BIG-bench Machine Learning	Code Available	2	5
Offline RL for Natural Language Generation with Implicit Language Q Learning	Jun 5, 2022	Language Modelling, Offline RL	Code Available	2	5
Fully Test-Time Adaptation for Monocular 3D Object Detection	May 30, 2024	3D Object Detection, Monocular 3D Object Detection	Code Available	2	5
DeepAR: Probabilistic Forecasting with Autoregressive Recurrent Networks	Apr 13, 2017	Multivariate Time Series Forecasting, Probabilistic Time Series Forecasting	Code Available	2	5
Open6DOR: Benchmarking Open-instruction 6-DoF Object Rearrangement and A VLM-based Approach	Oct 24, 2024	Benchmarking, Instruction Following	Code Available	2	5
A physics-informed and attention-based graph learning approach for regional electric vehicle charging demand prediction	Sep 11, 2023	Graph Learning, Meta-Learning	Code Available	2	5
Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models	Aug 1, 2024	Math	Code Available	2	5
Language Model Crossover: Variation through Few-Shot Prompting	Feb 23, 2023	In-Context Learning, Language Modeling	Code Available	2	5
Why do tree-based models still outperform deep learning on typical tabular data?	Nov 28, 2022	Benchmarking	Code Available	2	5
AutoVerus: Automated Proof Generation for Rust Code	Sep 19, 2024	Code Generation, Language Modeling	Code Available	2	5
TopoLogic: An Interpretable Pipeline for Lane Topology Reasoning on Driving Scenes	May 23, 2024	Autonomous Driving, Lane Detection	Code Available	2	5
Preference Optimization for Reasoning with Pseudo Feedback	Nov 25, 2024	GSM8K, Math	Code Available	2	5
Leveraging Pre-Trained Autoencoders for Interpretable Prototype Learning of Music Audio	Feb 14, 2024	Audio Classification, Decoder	Code Available	2	5
TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene Understanding	May 1, 2023	3D Object Detection, Monocular Depth Estimation	Code Available	2	5
Video-P2P: Video Editing with Cross-attention Control	Mar 8, 2023	Image Generation, Video Editing	Code Available	2	5
Min-K%++: Improved Baseline for Detecting Pre-Training Data from Large Language Models	Apr 3, 2024		Code Available	2	5
Focusing on Tracks for Online Multi-Object Tracking	Jun 15, 2025	global-optimization, Multi-Object Tracking	Code Available	2	5
Consistency-diversity-realism Pareto fronts of conditional image generative models	Jun 14, 2024	Diversity	Code Available	2	5
Enhancing Multi-view Stereo with Contrastive Matching and Weighted Focal Loss	Jun 21, 2022	Contrastive Learning	Code Available	2	5
DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation	Sep 18, 2023	3D geometry, Decoder	Code Available	2	5
Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation	Dec 20, 2023	Robot Manipulation, Zero-shot Generalization	Code Available	2	5
Character-Aware Models Improve Visual Text Rendering	Dec 20, 2022	Image Generation	Code Available	2	5
PetFace: A Large-Scale Dataset and Benchmark for Animal Identification	Jul 18, 2024	Face Identification, Face Verification	Code Available	2	5
MOODv2: Masked Image Modeling for Out-of-Distribution Detection	Jan 5, 2024	Out-of-Distribution Detection, Out of Distribution (OOD) Detection	Code Available	2	5
Clifford Neural Layers for PDE Modeling	Sep 8, 2022	Weather Forecasting	Code Available	2	5
LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant	Dec 2, 2024	Contrastive Learning, Information Retrieval	Code Available	2	5
Mean Deviation Similarity Index: Efficient and Reliable Full-Reference Image Quality Evaluator	Aug 26, 2016	Full reference image quality assessment, Image Compression	Code Available	2	5
WavMark: Watermarking for Audio Generation	Aug 24, 2023	Audio Generation	Code Available	2	5
CodeEditorBench: Evaluating Code Editing Capability of Large Language Models	Apr 4, 2024	Code Generation	Code Available	2	5
StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation	Aug 2, 2024	Segmentation, Semantic Segmentation	Code Available	2	5
Chemformer: a pre-trained transformer for computational chemistry	Jan 31, 2022	Computational chemistry, Retrosynthesis	Code Available	2	5
CODESIM: Multi-Agent Code Generation and Problem Solving through Simulation-Driven Planning and Debugging	Feb 8, 2025	Code Generation, HumanEval	Code Available	2	5
FSPEN: AN ULTRA-LIGHTWEIGHT NETWORK FOR REAL TIME SPEECH ENHANCEMENT	Apr 15, 2024	Speech Enhancement	Code Available	2	5
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing	Feb 21, 2022	Few-Shot Learning, Sentence	Code Available	2	5
MAUVE Scores for Generative Models: Theory and Practice	Dec 30, 2022	Quantization	Code Available	2	5
Label Efficient Visual Abstractions for Autonomous Driving	May 20, 2020	Autonomous Driving, Segmentation	Code Available	2	5
TDT-KWS: Fast And Accurate Keyword Spotting Using Token-and-duration Transducer	Mar 20, 2024	Keyword Spotting	Code Available	2	5
Maximum Entropy Heterogeneous-Agent Reinforcement Learning	Jun 19, 2023	MuJoCo, Multi-agent Reinforcement Learning	Code Available	2	5
Self-Supervised Transformers for Unsupervised Object Discovery using Normalized Cut	Feb 23, 2022	Object, object-detection	Code Available	2	5
Few-Shot Scene Classification of Optical Remote Sensing Images Leveraging Calibrated Pretext Tasks	Jul 6, 2022	Contrastive Learning, Few-Shot Learning	Code Available	2	5
Perceive-IR: Learning to Perceive Degradation Better for All-in-One Image Restoration	Aug 28, 2024	All, Image Restoration	Code Available	2	5
FROSTER: Frozen CLIP Is A Strong Teacher for Open-Vocabulary Action Recognition	Feb 5, 2024	Action Recognition, Open Vocabulary Action Recognition	Code Available	2	5