The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2101–2150 of 177339 papers

Title	Date	Tasks	Status	Hype	Score
Knowledge-tuning Large Language Models with Structured Medical Knowledge Bases for Reliable Response Generation in Chinese	Sep 8, 2023	Domain AdaptationHallucination	CodeCode Available	4	5
MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds	Dec 9, 2024	Camera CalibrationCamera Pose Estimation	CodeCode Available	4	5
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model	Nov 9, 2022	DecoderLanguage Modeling	CodeCode Available	4	5
Gender Representation in TV and Radio: Automatic Information Extraction methods versus Manual Analyses	Jun 14, 2024		CodeCode Available	4	5
BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision	Nov 18, 2022	3D Object Detection	CodeCode Available	4	5
NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as General Image Priors	Dec 6, 2022	3D Generation3D geometry	CodeCode Available	4	5
RealisDance-DiT: Simple yet Strong Baseline towards Controllable Character Animation in the Wild	Apr 21, 2025		CodeCode Available	4	5
COS-Mix: Cosine Similarity and Distance Fusion for Improved Information Retrieval	Jun 2, 2024	Information RetrievalRAG	CodeCode Available	4	5
UniScene: Unified Occupancy-centric Driving Scene Generation	Dec 6, 2024	Autonomous DrivingScene Generation	CodeCode Available	4	5
Motion-X++: A Large-Scale Multimodal 3D Whole-body Human Motion Dataset	Jan 9, 2025	Human Mesh RecoveryMotion Generation	CodeCode Available	4	5
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction	Sep 26, 2024	3D ReconstructionDenoising	CodeCode Available	4	5
Goldfish: Vision-Language Understanding of Arbitrarily Long Videos	Jul 17, 2024	RetrievalVideo Understanding	CodeCode Available	4	5
When Does Perceptual Alignment Benefit Vision Representations?	Oct 14, 2024	Depth EstimationImage Generation	CodeCode Available	4	5
MLPerf Power: Benchmarking the Energy Efficiency of Machine Learning Systems from Microwatts to Megawatts for Sustainable AI	Oct 15, 2024	Benchmarking	CodeCode Available	4	5
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models	Jan 14, 2025	BenchmarkingText-to-Video Generation	CodeCode Available	4	5
A foundation model for human-AI collaboration in medical literature mining	Jan 27, 2025	Literature MiningSystematic Literature Review	CodeCode Available	4	5
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation	Oct 9, 2023	Action RecognitionImage Generation	CodeCode Available	4	5
PRISM: A Multi-Modal Generative Foundation Model for Slide-Level Histopathology	May 16, 2024	whole slide images	CodeCode Available	4	5
FFCV: Accelerating Training by Removing Data Bottlenecks	Jun 21, 2023	CPUGPU	CodeCode Available	4	5
Trace is the Next AutoDiff: Generative Optimization with Rich Feedback, Execution Traces, and LLMs	Jun 23, 2024		CodeCode Available	4	5
Building a Culture of Reproducibility in Academic Research	Dec 27, 2022	Cultural Vocal Bursts Intensity Prediction	CodeCode Available	4	5
A deep learning framework for efficient pathology image analysis	Feb 18, 2025	BenchmarkingDeep Learning	CodeCode Available	4	5
Story-Adapter: A Training-free Iterative Framework for Long Story Visualization	Oct 8, 2024	Image GenerationStory Visualization	CodeCode Available	4	5
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention	Feb 16, 2025		CodeCode Available	4	5
CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution	Jan 5, 2024	HumanEvalPrediction	CodeCode Available	4	5
VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation	May 20, 2025	MMEMultiple-choice	CodeCode Available	4	5
CitationMap: A Python Tool to Identify and Visualize Your Google Scholar Citations Around the World	Aug 2, 2024	Citation VisualizationData Visualization	CodeCode Available	4	5
Real-time volumetric rendering of dynamic humans	Mar 21, 2023	3D ReconstructionGPU	CodeCode Available	4	5
Improving Parallel Program Performance with LLM Optimizers via Agent-System Interfaces	Oct 21, 2024	Code Generationscientific discovery	CodeCode Available	4	5
DeepFakes and Beyond: A Survey of Face Manipulation and Fake Detection	Jan 1, 2020	AttributeDeepFake Detection	CodeCode Available	4	5
Inductive Moment Matching	Mar 10, 2025		CodeCode Available	4	5
Polysemous codes	Sep 7, 2016	Quantization	CodeCode Available	4	5
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?	Oct 10, 2023	Bug fixingCode Generation	CodeCode Available	4	5
RUMI: Rummaging Using Mutual Information	Aug 19, 2024	Model Predictive ControlObject	CodeCode Available	4	5
ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks	Mar 27, 2023	text annotationText Classification	CodeCode Available	4	5
A General Theoretical Paradigm to Understand Learning from Human Preferences	Oct 18, 2023		CodeCode Available	4	5
Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry	Jun 3, 2024	Depth EstimationMonocular Depth Estimation	CodeCode Available	4	5
MUSE: Machine Unlearning Six-Way Evaluation for Language Models	Jul 8, 2024	ArticlesMachine Unlearning	CodeCode Available	4	5
Stock Price Prediction via Discovering Multi-Frequency Trading Patterns	Aug 13, 2017	PredictionStock Price Prediction	CodeCode Available	4	5
The Model Openness Framework: Promoting Completeness and Openness for Reproducibility, Transparency, and Usability in Artificial Intelligence	Mar 20, 2024		CodeCode Available	4	5
Fast Transformer Decoding: One Write-Head is All You Need	Nov 6, 2019	AllLanguage Modelling	CodeCode Available	4	5
OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data	Oct 2, 2024	Arithmetic ReasoningLarge Language Model	CodeCode Available	4	5
DisCo-DSO: Coupling Discrete and Continuous Optimization for Efficient Generative Design in Hybrid Spaces	Dec 15, 2024	Symbolic Regression	CodeCode Available	4	5
Ideas in Inference-time Scaling can Benefit Generative Pre-training Algorithms	Mar 10, 2025		CodeCode Available	4	5
Tiny-PULP-Dronets: Squeezing Neural Networks for Faster and Lighter Inference on Multi-Tasking Autonomous Nano-Drones	Jul 2, 2024	Autonomous Navigation	CodeCode Available	4	5
ReARTeR: Retrieval-Augmented Reasoning with Trustworthy Process Rewarding	Jan 14, 2025	RAGRetrieval	CodeCode Available	4	5
PointVLA: Injecting the 3D World into Vision-Language-Action Models	Mar 10, 2025	Imitation LearningSpatial Reasoning	CodeCode Available	4	5
ViViD: Video Virtual Try-on using Diffusion Models	May 20, 2024	Virtual Try-on	CodeCode Available	4	5
GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image	Mar 18, 2024	3D geometry3D Reconstruction	CodeCode Available	4	5
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis	Jan 31, 2023	Face GenerationLip Reading	CodeCode Available	4	5