The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 8026–8050 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization	Jun 8, 2023	Language ModelingLanguage Modelling	CodeCode Available	2	5
GenSim: A General Social Simulation Platform with Large Language Model based Agents	Oct 6, 2024	Language ModelingLanguage Modelling	CodeCode Available	2	5
Metric Flow Matching for Smooth Interpolations on the Data Manifold	May 23, 2024	Trajectory Prediction	CodeCode Available	2	5
Harmonizer: Learning to Perform White-Box Image and Video Harmonization	Jul 4, 2022	Image HarmonizationVideo Harmonization	CodeCode Available	2	5
Android in the Zoo: Chain-of-Action-Thought for GUI Agents	Mar 5, 2024	Language ModelingLanguage Modelling	CodeCode Available	2	5
Knowledge Circuits in Pretrained Transformers	May 28, 2024	In-Context Learningknowledge editing	CodeCode Available	2	5
PyMIC: A deep learning toolkit for annotation-efficient medical image segmentation	Aug 19, 2022	Deep LearningImage Segmentation	CodeCode Available	2	5
PHemoNet: A Multimodal Network for Physiological Signals	Sep 13, 2024	Brain Computer InterfaceEEG	CodeCode Available	2	5
From Sparse to Soft Mixtures of Experts	Aug 2, 2023		CodeCode Available	2	5
ColorizeDiffusion: Adjustable Sketch Colorization with Reference Image and Text	Jan 2, 2024	ColorizationSketch Colorization	CodeCode Available	2	5
Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset	Jun 10, 2024	Instance SegmentationSalient Object Detection	CodeCode Available	2	5
DifIISR: A Diffusion Model with Gradient Guidance for Infrared Image Super-Resolution	Mar 3, 2025	Autonomous DrivingImage Super-Resolution	CodeCode Available	2	5
nuScenes: A multimodal dataset for autonomous driving	Mar 26, 2019	3D Object DetectionAutonomous Driving	CodeCode Available	2	5
An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities against Strong Detection	Jun 10, 2024	Backdoor AttackCode Completion	CodeCode Available	2	5
Shape, Light, and Material Decomposition from Images using Monte Carlo Rendering and Denoising	Jun 7, 2022	3D ReconstructionDenoising	CodeCode Available	2	5
Video Prediction Transformers without Recurrence or Convolution	Oct 7, 2024	DecoderPrediction	CodeCode Available	2	5
TinyLLaVA-Video-R1: Towards Smaller LMMs for Video Reasoning	Apr 13, 2025	Question Answeringreinforcement-learning	CodeCode Available	2	5
DeepFilterNet: A Low Complexity Speech Enhancement Framework for Full-Band Audio based on Deep Filtering	Oct 11, 2021	Speech Enhancement	CodeCode Available	2	5
PoseScript: Linking 3D Human Poses and Natural Language	Oct 21, 2022	Cross-Modal RetrievalImage Captioning	CodeCode Available	2	5
SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations	Aug 2, 2021	DenoisingImage Generation	CodeCode Available	2	5
Satellite Image Time Series Semantic Change Detection: Novel Architecture and Analysis of Domain Shift	Jul 10, 2024	Change DetectionDisaster Response	CodeCode Available	2	5
LLaMEA: A Large Language Model Evolutionary Algorithm for Automatically Generating Metaheuristics	May 30, 2024	Language ModelingLanguage Modelling	CodeCode Available	2	5
Unsupervised Universal Image Segmentation	Dec 28, 2023	Image SegmentationInstance Segmentation	CodeCode Available	2	5
VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models	May 29, 2025	Self-Supervised LearningVideo Generation	CodeCode Available	2	5
Collaborative Gym: A Framework for Enabling and Evaluating Human-Agent Collaboration	Dec 20, 2024	Human Agent Collaboration	CodeCode Available	2	5