The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1976–2000 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model	May 30, 2024	Image AnimationVideo Generation	CodeCode Available	4	5
Generalizable Humanoid Manipulation with 3D Diffusion Policies	Oct 14, 2024	Camera CalibrationPoint Cloud Segmentation	CodeCode Available	4	5
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA	Sep 4, 2024	Question AnsweringSentence	CodeCode Available	4	5
No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images	Oct 31, 2024	3D ReconstructionGeneralizable Novel View Synthesis	CodeCode Available	4	5
Do LLMs Possess a Personality? Making the MBTI Test an Amazing Evaluation for Large Language Models	Jul 30, 2023	HallucinationPrompt Engineering	CodeCode Available	4	5
Multimodal Chain-of-Thought Reasoning in Language Models	Feb 2, 2023	HallucinationLanguage Modelling	CodeCode Available	4	5
Efficient Automated Deep Learning for Time Series Forecasting	May 11, 2022	AutoMLBayesian Optimization	CodeCode Available	4	5
SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM	Dec 4, 2023	Camera Pose EstimationNovel View Synthesis	CodeCode Available	4	5
Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection	Feb 23, 2023	Code CompletionComputer Security	CodeCode Available	4	5
Lean Workbook: A large-scale Lean problem set formalized from natural language math problems	Jun 6, 2024	Automated Theorem ProvingMath	CodeCode Available	4	5
GeoCalib: Learning Single-image Calibration with Geometric Optimization	Sep 10, 2024	3D geometryVisual Localization	CodeCode Available	4	5
ManimML: Communicating Machine Learning Architectures with Animation	Jun 29, 2023		CodeCode Available	4	5
Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop End-To-End Autonomous Driving	Jun 6, 2024	Autonomous DrivingBench2Drive	CodeCode Available	4	5
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization	Dec 30, 2024	Audio GenerationGPU	CodeCode Available	4	5
SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models	Feb 13, 2025	Question AnsweringRAG	CodeCode Available	4	5
Reasoning with Language Model is Planning with World Model	May 24, 2023	Language ModelingLanguage Modelling	CodeCode Available	4	5
Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think	Sep 17, 2024	Conditional Image GenerationDepth Estimation	CodeCode Available	4	5
DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks	May 7, 2024	BinarizationDeblurring	CodeCode Available	4	5
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis	Sep 30, 2023	GPU	CodeCode Available	4	5
Flamingo: a Visual Language Model for Few-Shot Learning	Apr 29, 2022	Few-Shot LearningGenerative Visual Question Answering	CodeCode Available	4	5
Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences	Apr 9, 2024		CodeCode Available	4	5
Prompt2Model: Generating Deployable Models from Natural Language Instructions	Aug 23, 2023	Data-free Knowledge DistillationDataset Generation	CodeCode Available	4	5
Sequential Models in the Synthetic Data Vault	Jul 28, 2022	Generative Adversarial Network	CodeCode Available	4	5
UniTS: A Unified Multi-Task Time Series Model	Feb 29, 2024	Anomaly DetectionImputation	CodeCode Available	4	5
YuLan: An Open-source Large Language Model	Jun 28, 2024	Language ModelingLanguage Modelling	CodeCode Available	4	5