The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 3,026–3,050 of 661,570 papers

Title	Date	Tasks	Status	Hype
IFEval-Audio: Benchmarking Instruction-Following Capability in Audio-based Large Language Models	May 22, 2025	Benchmarking, Instruction Following	Code Available	3
SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis	Nov 29, 2023	NeRF, Talking Face Generation	Code Available	3
Single-Image Shadow Removal Using Deep Learning: A Comprehensive Survey	Jul 11, 2024	Deep Learning, Image Restoration	Code Available	3
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model	Jan 21, 2025	Image Generation, Instruction Following	Code Available	3
Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud Learning	Oct 10, 2024	3D Parameter-Efficient Fine-Tuning for Classification, 3D Point Cloud Classification	Code Available	3
GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting	Nov 24, 2023	NeRF	Code Available	3
GraphStorm: all-in-one graph machine learning framework for industry applications	Jun 10, 2024	All, graph construction	Code Available	3
TokenPacker: Efficient Visual Projector for Multimodal LLM	Jul 2, 2024	Language Modelling, Large Language Model	Code Available	3
WeatherMesh-3: Fast and accurate operational global weather forecasting	Mar 28, 2025	Computational Efficiency, GPU	Code Available	3
NdLinear Is All You Need for Representation Learning	Mar 21, 2025	All, Representation Learning	Code Available	3
Bake off redux: a review and experimental evaluation of recent time series classification algorithms	Apr 25, 2023	Dynamic Time Warping, Time Series	Code Available	3
TrafficLLM: Enhancing Large Language Models for Network Traffic Analysis with Generic Traffic Representation	Apr 5, 2025		Code Available	3
CameraHMR: Aligning People with Perspective	Nov 12, 2024	3D human pose and shape estimation	Code Available	3
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge	Jul 6, 2025	Image Generation, Multimodal Reasoning	Code Available	3
DEFOM-Stereo: Depth Foundation Model Based Stereo Matching	Jan 16, 2025	Depth Estimation, Disparity Estimation	Code Available	3
Rainbow: Combining Improvements in Deep Reinforcement Learning	Oct 6, 2017	Atari Games, Deep Reinforcement Learning	Code Available	3
Mambular: A Sequential Model for Tabular Deep Learning	Aug 12, 2024	Deep Learning, Mamba	Code Available	3
Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization	Jun 6, 2024	Denoising, Image Generation	Code Available	3
WHAC: World-grounded Humans and Cameras	Mar 19, 2024	Camera Pose Estimation, Pose Estimation	Code Available	3
GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations	Feb 19, 2024	Card Games, Logical Reasoning	Code Available	3
Generative AI Act II: Test Time Scaling Drives Cognition Engineering	Apr 18, 2025	Prompt Engineering	Code Available	3
ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language Models	Oct 25, 2024		Code Available	3
Cognify: Supercharging Gen-AI Workflows With Hierarchical Autotuning	Feb 12, 2025	RAG, Text to SQL	Code Available	3
Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AI	Jan 25, 2024		Code Available	3
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents	Oct 7, 2024	Natural Language Visual Grounding, Navigate	Code Available	3