The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 3276–3300 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios	Oct 2, 2024	Speech Enhancement, Speech Separation	Code Available	3	5
Multi-Level Speaker Representation for Target Speaker Extraction	Oct 21, 2024	Target Speaker Extraction	Code Available	3	5
PDL: A Declarative Prompt Programming Language	Oct 24, 2024	RAG	Code Available	3	5
Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders	Oct 27, 2024	Language Modeling, Language Modelling	Code Available	3	5
When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training	Nov 20, 2024	Computational Efficiency, Position	Code Available	3	5
OSDFace: One-Step Diffusion Model for Face Restoration	Nov 26, 2024	Face Recognition, Generative Adversarial Network	Code Available	3	5
CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos	Nov 26, 2024	Common Sense Reasoning, Imitation Learning	Code Available	3	5
Time Travel is Cheating: Going Live with DeepFund for Real-Time Fund Investment Benchmarking	May 16, 2025	Benchmarking, Management	Code Available	3	5
Prithvi-EO-2.0: A Versatile Multi-Temporal Foundation Model for Earth Observation Applications	Dec 3, 2024	Benchmarking, Disaster Response	Code Available	3	5
Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization	Dec 11, 2024	Pose Estimation, Visual Localization	Code Available	3	5
Attentive Eraser: Unleashing Diffusion Model's Object Removal Potential via Self-Attention Redirection Guidance	Dec 17, 2024	Image Generation, Object	Code Available	3	5
CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up	Dec 20, 2024	8k, GPU	Code Available	3	5
UAVs Meet LLMs: Overviews and Perspectives Toward Agentic Low-Altitude Mobility	Jan 4, 2025		Code Available	3	5
LLMs can see and hear without any training	Jan 30, 2025	Audio captioning, Image Generation	Code Available	3	5
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs	Jan 10, 2025	4k, Visual Reasoning	Code Available	3	5
PETR: Position Embedding Transformation for Multi-View 3D Object Detection	Mar 10, 2022	3D Object Detection, Object	Code Available	3	5
EasyEdit: An Easy-to-use Knowledge Editing Framework for Large Language Models	Aug 14, 2023	knowledge editing	Code Available	3	5
Improved Denoising Diffusion Probabilistic Models	Feb 18, 2021	Denoising, Image Generation	Code Available	3	5
Pareto Front Approximation for Multi-Objective Session-Based Recommender Systems	Jul 23, 2024	Recommendation Systems	Code Available	3	5
Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving	Feb 11, 2025	Automated Theorem Proving, Large Language Model	Code Available	3	5
Stonefish: Supporting Machine Learning Research in Marine Robotics	Feb 17, 2025	Optical Flow Estimation	Code Available	3	5
Soundwave: Less is More for Speech-Text Alignment in LLMs	Feb 18, 2025		Code Available	3	5
Slamming: Training a Speech Language Model on One GPU in a Day	Feb 19, 2025	GPU, Language Modeling	Code Available	3	5
AlphaAgent: LLM-Driven Alpha Mining with Regularized Exploration to Counteract Alpha Decay	Feb 24, 2025		Code Available	3	5
Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs	Feb 24, 2025	Computer Security	Code Available	3	5