The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1301–1350 of 659983 papers

Title	Date	Tasks	Status	Hype
Building a Culture of Reproducibility in Academic Research	Dec 27, 2022	Cultural Vocal Bursts Intensity Prediction	CodeCode Available	4
A deep learning framework for efficient pathology image analysis	Feb 18, 2025	BenchmarkingDeep Learning	CodeCode Available	4
Story-Adapter: A Training-free Iterative Framework for Long Story Visualization	Oct 8, 2024	Image GenerationStory Visualization	CodeCode Available	4
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention	Feb 16, 2025		CodeCode Available	4
CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution	Jan 5, 2024	HumanEvalPrediction	CodeCode Available	4
VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation	May 20, 2025	MMEMultiple-choice	CodeCode Available	4
CitationMap: A Python Tool to Identify and Visualize Your Google Scholar Citations Around the World	Aug 2, 2024	Citation VisualizationData Visualization	CodeCode Available	4
Real-time volumetric rendering of dynamic humans	Mar 21, 2023	3D ReconstructionGPU	CodeCode Available	4
Improving Parallel Program Performance with LLM Optimizers via Agent-System Interfaces	Oct 21, 2024	Code Generationscientific discovery	CodeCode Available	4
DeepFakes and Beyond: A Survey of Face Manipulation and Fake Detection	Jan 1, 2020	AttributeDeepFake Detection	CodeCode Available	4
Inductive Moment Matching	Mar 10, 2025		CodeCode Available	4
Polysemous codes	Sep 7, 2016	Quantization	CodeCode Available	4
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?	Oct 10, 2023	Bug fixingCode Generation	CodeCode Available	4
RUMI: Rummaging Using Mutual Information	Aug 19, 2024	Model Predictive ControlObject	CodeCode Available	4
ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks	Mar 27, 2023	text annotationText Classification	CodeCode Available	4
A General Theoretical Paradigm to Understand Learning from Human Preferences	Oct 18, 2023		CodeCode Available	4
Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry	Jun 3, 2024	Depth EstimationMonocular Depth Estimation	CodeCode Available	4
MUSE: Machine Unlearning Six-Way Evaluation for Language Models	Jul 8, 2024	ArticlesMachine Unlearning	CodeCode Available	4
Stock Price Prediction via Discovering Multi-Frequency Trading Patterns	Aug 13, 2017	PredictionStock Price Prediction	CodeCode Available	4
The Model Openness Framework: Promoting Completeness and Openness for Reproducibility, Transparency, and Usability in Artificial Intelligence	Mar 20, 2024		CodeCode Available	4
Fast Transformer Decoding: One Write-Head is All You Need	Nov 6, 2019	AllLanguage Modelling	CodeCode Available	4
OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data	Oct 2, 2024	Arithmetic ReasoningLarge Language Model	CodeCode Available	4
DisCo-DSO: Coupling Discrete and Continuous Optimization for Efficient Generative Design in Hybrid Spaces	Dec 15, 2024	Symbolic Regression	CodeCode Available	4
Ideas in Inference-time Scaling can Benefit Generative Pre-training Algorithms	Mar 10, 2025		CodeCode Available	4
Tiny-PULP-Dronets: Squeezing Neural Networks for Faster and Lighter Inference on Multi-Tasking Autonomous Nano-Drones	Jul 2, 2024	Autonomous Navigation	CodeCode Available	4
ReARTeR: Retrieval-Augmented Reasoning with Trustworthy Process Rewarding	Jan 14, 2025	RAGRetrieval	CodeCode Available	4
PointVLA: Injecting the 3D World into Vision-Language-Action Models	Mar 10, 2025	Imitation LearningSpatial Reasoning	CodeCode Available	4
ViViD: Video Virtual Try-on using Diffusion Models	May 20, 2024	Virtual Try-on	CodeCode Available	4
GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image	Mar 18, 2024	3D geometry3D Reconstruction	CodeCode Available	4
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis	Jan 31, 2023	Face GenerationLip Reading	CodeCode Available	4
Navigation World Models	Dec 4, 2024	Robot NavigationVideo Generation	CodeCode Available	4
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models	Apr 21, 2025	MMEVideo MME	CodeCode Available	4
Diffusion-Based Planning for Autonomous Driving with Flexible Guidance	Jan 26, 2025	Autonomous DrivingImitation Learning	CodeCode Available	4
Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed	Mar 7, 2024	3D ReconstructionImage Retrieval	CodeCode Available	4
VideoChat: Chat-Centric Video Understanding	May 10, 2023	Question AnsweringVideo-based Generative Performance Benchmarking	CodeCode Available	4
HaGRIDv2: 1M Images for Static and Dynamic Hand Gesture Recognition	Dec 2, 2024	Gesture RecognitionHand Detection	CodeCode Available	4
Contextual Multilingual Spellchecker for User Queries	May 1, 2023		CodeCode Available	4
Panoptic Feature Pyramid Networks	Jan 8, 2019	Instance SegmentationPanoptic Segmentation	CodeCode Available	4
Evolution Transformer: In-Context Evolutionary Optimization	Mar 5, 2024		CodeCode Available	4
Segment and Track Anything	May 11, 2023	Autonomous Drivingmultimodal interaction	CodeCode Available	4
SmoothGrad: removing noise by adding noise	Jun 12, 2017	Interpretable Machine LearningSensitivity	CodeCode Available	4
A Comprehensive Survey on 3D Content Generation	Feb 2, 2024	Survey	CodeCode Available	4
Autoregressive Models in Vision: A Survey	Nov 8, 2024	3D GenerationImage Generation	CodeCode Available	4
SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints	Dec 10, 2024	4D reconstructionVideo Generation	CodeCode Available	4
Ray: A Distributed Framework for Emerging AI Applications	Dec 16, 2017	reinforcement-learningReinforcement Learning	CodeCode Available	4
RegNet: Self-Regulated Network for Image Classification	Jan 3, 2021	ClassificationGeneral Classification	CodeCode Available	4
MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo	May 20, 2024	NeRFNovel View Synthesis	CodeCode Available	4
CrossWOZ: A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset	Feb 27, 2020	Dialogue State TrackingTask-Oriented Dialogue Systems	CodeCode Available	4
On the Contribution of Per-ICD Attention Mechanisms to Classify Health Records in Languages with Fewer Resources than English	Sep 1, 2021	Language Modelling	CodeCode Available	4
Dive into Deep Learning	Jun 21, 2021	Deep LearningMath	CodeCode Available	4