The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1376–1400 of 177339 papers

Title	Date	Tasks	Status	Hype	Score
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection	Oct 17, 2023	Fact VerificationQuestion Answering	CodeCode Available	4	5
MedSAM2: Segment Anything in 3D Medical Images and Videos	Apr 4, 2025	SegmentationVideo Segmentation	CodeCode Available	4	5
DepthFM: Fast Monocular Depth Estimation with Flow Matching	Mar 20, 2024	Depth EstimationMonocular Depth Estimation	CodeCode Available	4	5
Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection	Jan 7, 2025	Objectobject-detection	CodeCode Available	4	5
Chain of Ideas: Revolutionizing Research Via Novel Idea Development with LLM Agents	Oct 17, 2024	Experimental Design	CodeCode Available	4	5
T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge	Jun 25, 2024	Computational EfficiencyCPU	CodeCode Available	4	5
JAX-Fluids 2.0: Towards HPC for Differentiable CFD of Compressible Two-phase Flows	Feb 7, 2024	GPU	CodeCode Available	4	5
AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities	Nov 12, 2022	Contrastive LearningCross-Modal Retrieval	CodeCode Available	4	5
Link and code: Fast indexing with graphs and compact regression codes	Apr 26, 2018	Image Similarity SearchQuantization	CodeCode Available	4	5
GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation	Mar 21, 2024	3D ReconstructionImage to 3D	CodeCode Available	4	5
Can Machines Help Us Answering Question 16 in Datasheets, and In Turn Reflecting on Inappropriate Content?	Feb 14, 2022	Language ModelingLanguage Modelling	CodeCode Available	4	5
Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs	Feb 19, 2024	Knowledge Distillation	CodeCode Available	4	5
Zero-Shot Learners for Natural Language Understanding via a Unified Multiple Choice Perspective	Oct 16, 2022	Coreference ResolutionMultiple-choice	CodeCode Available	4	5
AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising	Jun 11, 2024	Denoising	CodeCode Available	4	5
LLaMA Pro: Progressive LLaMA with Block Expansion	Jan 4, 2024	Instruction FollowingMath	CodeCode Available	4	5
Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints	Apr 15, 2025	GPUInference Optimization	CodeCode Available	4	5
Fengshenbang 1.0: Being the Foundation of Chinese Cognitive Intelligence	Sep 7, 2022		CodeCode Available	4	5
OpenCalib: A Multi-sensor Calibration Toolbox for Autonomous Driving	May 27, 2022	Autonomous DrivingAutonomous Vehicles	CodeCode Available	4	5
Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO	May 22, 2025	Domain GeneralizationImage Generation	CodeCode Available	4	5
SAMPart3D: Segment Any Part in 3D Objects	Nov 11, 2024	3D Generation3D Part Segmentation	CodeCode Available	4	5
Universal and Extensible Language-Vision Models for Organ Segmentation and Tumor Detection from Abdominal Computed Tomography	May 28, 2024	Computational EfficiencyComputed Tomography (CT)	CodeCode Available	4	5
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey	Mar 16, 2025	Autonomous Drivingmultimodal generation	CodeCode Available	4	5
RGBD GS-ICP SLAM	Mar 19, 2024	3DGSSimultaneous Localization and Mapping	CodeCode Available	4	5
Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models	Jun 4, 2024	Common Sense Reasoning	CodeCode Available	4	5
Exploring the Capabilities of Large Multimodal Models on Dense Text	May 9, 2024	Prompt EngineeringVisual Question Answering (VQA)	CodeCode Available	4	5