The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 6001–6025 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis	Jul 24, 2022	3D geometryNeRF	CodeCode Available	2	5
VEGS: View Extrapolation of Urban Scenes in 3D Gaussian Splatting using Learned Priors	Jul 3, 2024	Neural Rendering	CodeCode Available	2	5
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models	Oct 10, 2024	GSM8KMath	CodeCode Available	2	5
Exploring CLIP for Assessing the Look and Feel of Images	Jul 25, 2022	Image Quality AssessmentNo-Reference Image Quality Assessment	CodeCode Available	2	5
Visual Perception by Large Language Model's Weights	May 30, 2024		CodeCode Available	2	5
MCP-Solver: Integrating Language Models with Constraint Programming Systems	Dec 31, 2024	Natural Language Understanding	CodeCode Available	2	5
SegNet4D: Efficient Instance-Aware 4D Semantic Segmentation for LiDAR Point Cloud	Jun 24, 2024	Autonomous DrivingAutonomous Navigation	CodeCode Available	2	5
Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation	Nov 20, 2023	3D Human Pose EstimationPose Estimation	CodeCode Available	2	5
Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing	Apr 3, 2025	BenchmarkingLogical Reasoning	CodeCode Available	2	5
Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning	Oct 10, 2023	Language ModelingLanguage Modelling	CodeCode Available	2	5
CMB: A Comprehensive Medical Benchmark in Chinese	Aug 17, 2023		CodeCode Available	2	5
Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy	Oct 2, 2024	Motion PlanningRobot Manipulation	CodeCode Available	2	5
StructChart: On the Schema, Metric, and Augmentation for Visual Chart Understanding	Sep 20, 2023	Chart Question AnsweringChart Understanding	CodeCode Available	2	5
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making	Jun 13, 2024	Decision Making	CodeCode Available	2	5
The P^3 dataset: Pixels, Points and Polygons for Multimodal Building Vectorization	May 21, 2025		CodeCode Available	2	5
Protein Representation Learning by Geometric Structure Pretraining	Mar 11, 2022	Contrastive LearningPrediction	CodeCode Available	2	5
SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation	Sep 18, 2022	Real-Time Semantic SegmentationSegmentation	CodeCode Available	2	5
JudgeLM: Fine-tuned Large Language Models are Scalable Judges	Oct 26, 2023		CodeCode Available	2	5
DeepInteraction: 3D Object Detection via Modality Interaction	Aug 23, 2022	3D Object DetectionDecoder	CodeCode Available	2	5
Internal Consistency and Self-Feedback in Large Language Models: A Survey	Jul 19, 2024		CodeCode Available	2	5
Hybrid-SORT: Weak Cues Matter for Online Multi-Object Tracking	Aug 1, 2023	Multi-Object TrackingMultiple Object Tracking	CodeCode Available	2	5
PartIR: Composing SPMD Partitioning Strategies for Machine Learning	Jan 20, 2024		CodeCode Available	2	5
SQuARe: A Large-Scale Dataset of Sensitive Questions and Acceptable Responses Created Through Human-Machine Collaboration	May 28, 2023	Response Generation	CodeCode Available	2	5
FastVID: Dynamic Density Pruning for Fast Video Large Language Models	Mar 14, 2025		CodeCode Available	2	5
Embedding Earth: Self-supervised contrastive pre-training for dense land cover classification	Mar 11, 2022	Earth ObservationLand Cover Classification	CodeCode Available	2	5