The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2676–2700 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
Open-Source Skull Reconstruction with MONAI	Nov 25, 2022	C++ codeDeep Learning	CodeCode Available	3	5
MMedAgent: Learning to Use Medical Tools with Multi-modal Agent	Jul 2, 2024		CodeCode Available	3	5
DiarizationLM: Speaker Diarization Post-Processing with Large Language Models	Jan 7, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	3	5
RelBench: A Benchmark for Deep Learning on Relational Databases	Jul 29, 2024	Deep LearningFeature Engineering	CodeCode Available	3	5
A Survey on Text-guided 3D Visual Grounding: Elements, Recent Advances, and Future Directions	Jun 9, 2024	3D visual groundingSurvey	CodeCode Available	3	5
Learning Bipedal Walking On Planned Footsteps For Humanoid Robots	Jul 26, 2022	Deep Reinforcement LearningMuJoCo	CodeCode Available	3	5
Large Language Monkeys: Scaling Inference Compute with Repeated Sampling	Jul 31, 2024	GSM8KMath	CodeCode Available	3	5
ECG-FM: An Open Electrocardiogram Foundation Model	Aug 9, 2024	Contrastive LearningDiagnostic	CodeCode Available	3	5
Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation	Aug 9, 2024	object-detectionObject Detection	CodeCode Available	3	5
SoftMatch: Addressing the Quantity-Quality Trade-off in Semi-supervised Learning	Jan 26, 2023	imbalanced classification	CodeCode Available	3	5
SGFormer: Single-Layer Graph Transformers with Approximation-Free Linear Complexity	Sep 13, 2024	Deep AttentionRepresentation Learning	CodeCode Available	3	5
CAD-Recode: Reverse Engineering CAD Code from Point Clouds	Dec 18, 2024	CAD ReconstructionDecoder	CodeCode Available	3	5
EmergentTTS-Eval: Evaluating TTS Models on Complex Prosodic, Expressiveness, and Linguistic Challenges Using Model-as-a-Judge	May 29, 2025	text-to-speechText to Speech	CodeCode Available	3	5
DeepfakeBench: A Comprehensive Benchmark of Deepfake Detection	Jul 4, 2023	DeepFake DetectionFace Swapping	CodeCode Available	3	5
FlowDock: Geometric Flow Matching for Generative Protein-Ligand Docking and Affinity Prediction	Dec 14, 2024	Blind DockingDrug Discovery	CodeCode Available	3	5
LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models	Apr 18, 2025	Feature Upsampling	CodeCode Available	3	5
ImageFolder: Autoregressive Image Generation with Folded Tokens	Oct 2, 2024	Image GenerationImage Reconstruction	CodeCode Available	3	5
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation	Feb 6, 2024	Image to Video GenerationVideo Generation	CodeCode Available	3	5
Simple linear attention language models balance the recall-throughput tradeoff	Feb 28, 2024	Language ModellingMamba	CodeCode Available	3	5
MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System	Mar 12, 2025	ChunkingComputational Efficiency	CodeCode Available	3	5
The Tabular Foundation Model TabPFN Outperforms Specialized Time Series Forecasting Models Based on Simple Features	Jan 6, 2025	Feature EngineeringTime Series	CodeCode Available	3	5
Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow	Sep 7, 2022	Domain AdaptationImage Generation	CodeCode Available	3	5
LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer	Dec 18, 2024	AttributeText Generation	CodeCode Available	3	5
IMDL-BenCo: A Comprehensive Benchmark and Codebase for Image Manipulation Detection & Localization	Jun 15, 2024	GPUImage Manipulation	CodeCode Available	3	5
IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact	Mar 2, 2024	Language ModelingLanguage Modelling	CodeCode Available	3	5