The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3701–3750 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
Diffusion-TS: Interpretable Diffusion for General Time Series Generation	Mar 4, 2024	Audio SynthesisDecoder	CodeCode Available	3	5
TapeAgents: a Holistic Framework for Agent Development and Optimization	Dec 11, 2024		CodeCode Available	3	5
MixLinear: Extreme Low Resource Multivariate Time Series Forecasting with 0.1K Parameters	Oct 2, 2024	Multivariate Time Series ForecastingTime Series	CodeCode Available	3	5
DataSentinel: A Game-Theoretic Detection of Prompt Injection Attacks	Apr 15, 2025		CodeCode Available	3	5
Adversarial Cheap Talk	Nov 20, 2022	Meta-LearningReinforcement Learning (RL)	CodeCode Available	3	5
Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image	Jun 6, 2024	3D Scene ReconstructionDepth Estimation	CodeCode Available	3	5
EscherNet: A Generative Model for Scalable View Synthesis	Feb 6, 2024	3D ReconstructionGPU	CodeCode Available	3	5
3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering	Jan 9, 2025	Image GenerationText to Image Generation	CodeCode Available	3	5
Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation	Mar 4, 2025	Contact-rich ManipulationImitation Learning	CodeCode Available	3	5
Rethinking the Evaluation of Visible and Infrared Image Fusion	Oct 9, 2024	object-detectionObject Detection	CodeCode Available	3	5
Training Verifiers to Solve Math Word Problems	Oct 27, 2021	GSM8KMath	CodeCode Available	3	5
Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline	Nov 19, 2024	Image SegmentationInteractive Segmentation	CodeCode Available	3	5
Generating Long Sequences with Sparse Transformers	Apr 23, 2019	DiversityImage Generation	CodeCode Available	3	5
Towards Generalizable Tumor Synthesis	Feb 29, 2024	Computed Tomography (CT)	CodeCode Available	3	5
Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning	Apr 15, 2025	Automated Theorem ProvingLarge Language Model	CodeCode Available	3	5
Pipeline Parallelism with Controllable Memory	May 24, 2024		CodeCode Available	3	5
SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline	May 25, 2025	Speech ExtractionSpeech Separation	CodeCode Available	3	5
L0: Reinforcement Learning to Become General Agents	Jun 30, 2025	Question Answeringreinforcement-learning	CodeCode Available	3	5
MMAD: The First-Ever Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection	Oct 12, 2024	Anomaly Detection	CodeCode Available	3	5
ASFT: Aligned Supervised Fine-Tuning through Absolute Likelihood	Sep 14, 2024	Instruction FollowingText Generation	CodeCode Available	3	5
AdaWorld: Learning Adaptable World Models with Latent Actions	Mar 24, 2025	Future prediction	CodeCode Available	3	5
SIMPL: A Simple and Efficient Multi-agent Motion Prediction Baseline for Autonomous Driving	Feb 4, 2024	Autonomous DrivingAutonomous Vehicles	CodeCode Available	3	5
cmaes : A Simple yet Practical Python Library for CMA-ES	Feb 2, 2024	Transfer Learning	CodeCode Available	3	5
Emu: Generative Pretraining in Multimodality	Jul 11, 2023	Image CaptioningImage Generation	CodeCode Available	3	5
BlenderLLM: Training Large Language Models for Computer-Aided Design with Self-improvement	Dec 16, 2024	Script GenerationText to 3D	CodeCode Available	3	5
Automatically Interpreting Millions of Features in Large Language Models	Oct 17, 2024	Semantic SimilaritySemantic Textual Similarity	CodeCode Available	3	5
GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks	Sep 20, 2024	AllSinging Voice Synthesis	CodeCode Available	3	5
KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache	Feb 5, 2024	Quantization	CodeCode Available	3	5
AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents	Jun 19, 2024		CodeCode Available	3	5
AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents	Oct 31, 2024	Benchmarking	CodeCode Available	3	5
HAC++: Towards 100X Compression of 3D Gaussian Splatting	Jan 21, 2025	3DGSAttribute	CodeCode Available	3	5
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation	Sep 27, 2023	GPUText-to-Video Generation	CodeCode Available	3	5
Deep Reasoning Translation via Reinforcement Learning	Apr 14, 2025	reinforcement-learningReinforcement Learning	CodeCode Available	3	5
Segment Anything in 3D with Radiance Fields	Apr 24, 2023	Inverse RenderingSegmentation	CodeCode Available	3	5
Consistency Flow Matching: Defining Straight Flows with Velocity Consistency	Jul 2, 2024	Image Generation	CodeCode Available	3	5
PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data	Feb 20, 2025	Style Transfer	CodeCode Available	3	5
Deep Learning-Based Object Pose Estimation: A Comprehensive Survey	May 13, 2024	Deep LearningObject	CodeCode Available	3	5
MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion	May 30, 2024	DenoisingGPU	CodeCode Available	3	5
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training	Mar 23, 2022	4kAction Classification	CodeCode Available	3	5
AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction	Apr 1, 2025	Image Generation	CodeCode Available	3	5
PE3R: Perception-Efficient 3D Reconstruction	Mar 10, 2025	3D ReconstructionZero-shot Generalization	CodeCode Available	3	5
The Mighty ToRR: A Benchmark for Table Reasoning and Robustness	Feb 26, 2025		CodeCode Available	3	5
Baichuan-Omni Technical Report	Oct 11, 2024	Language ModelingLanguage Modelling	CodeCode Available	3	5
Robot Utility Models: General Policies for Zero-Shot Deployment in New Environments	Sep 9, 2024	Imitation Learning	CodeCode Available	3	5
RLVR-World: Training World Models with Reinforcement Learning	May 20, 2025	reinforcement-learningReinforcement Learning	CodeCode Available	3	5
Tool Learning with Large Language Models: A Survey	May 28, 2024	Response GenerationSurvey	CodeCode Available	3	5
DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing	Jun 26, 2023		CodeCode Available	3	5
Step-level Value Preference Optimization for Mathematical Reasoning	Jun 16, 2024	Learning-To-RankMath	CodeCode Available	3	5
Middle Architecture Criteria	Apr 27, 2024		CodeCode Available	3	5
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones	Dec 28, 2023	Computational EfficiencyImage Captioning	CodeCode Available	3	5