The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,984 papers | 248,105 code links | 4,818 tasks

Papers

Recently Added | Most Hyped | Most Active | Needs Verification | Most Verified

Showing 1,626–1,650 of 659,984 papers

Title	Date	Tasks	Status	Hype
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks	Nov 17, 2022	Decoder, Language Modelling	Code Available	4
Generating Structured Outputs from Language Models: Benchmark and Studies	Jan 18, 2025		Code Available	4
Semi-Mamba-UNet: Pixel-Level Contrastive and Pixel-Level Cross-Supervised Visual Mamba-based UNet for Semi-Supervised Medical Image Segmentation	Feb 11, 2024	Cardiac Segmentation, Contrastive Learning	Code Available	4
Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis	Mar 7, 2024	CT Reconstruction, NeRF	Code Available	4
Timer-XL: Long-Context Transformers for Unified Time Series Forecasting	Oct 7, 2024	Time Series, Time Series Forecasting	Code Available	4
TRUE: Re-evaluating Factual Consistency Evaluation	Apr 11, 2022	Question Generation, Question-Generation	Code Available	4
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection	Oct 17, 2023	Fact Verification, Question Answering	Code Available	4
MedSAM2: Segment Anything in 3D Medical Images and Videos	Apr 4, 2025	Segmentation, Video Segmentation	Code Available	4
DepthFM: Fast Monocular Depth Estimation with Flow Matching	Mar 20, 2024	Depth Estimation, Monocular Depth Estimation	Code Available	4
Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection	Jan 7, 2025	Object, object-detection	Code Available	4
Chain of Ideas: Revolutionizing Research Via Novel Idea Development with LLM Agents	Oct 17, 2024	Experimental Design	Code Available	4
T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge	Jun 25, 2024	Computational Efficiency, CPU	Code Available	4
JAX-Fluids 2.0: Towards HPC for Differentiable CFD of Compressible Two-phase Flows	Feb 7, 2024	GPU	Code Available	4
AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities	Nov 12, 2022	Contrastive Learning, Cross-Modal Retrieval	Code Available	4
Link and code: Fast indexing with graphs and compact regression codes	Apr 26, 2018	Image Similarity Search, Quantization	Code Available	4
GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation	Mar 21, 2024	3D Reconstruction, Image to 3D	Code Available	4
Can Machines Help Us Answering Question 16 in Datasheets, and In Turn Reflecting on Inappropriate Content?	Feb 14, 2022	Language Modeling, Language Modelling	Code Available	4
Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs	Feb 19, 2024	Knowledge Distillation	Code Available	4
Zero-Shot Learners for Natural Language Understanding via a Unified Multiple Choice Perspective	Oct 16, 2022	Coreference Resolution, Multiple-choice	Code Available	4
AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising	Jun 11, 2024	Denoising	Code Available	4
LLaMA Pro: Progressive LLaMA with Block Expansion	Jan 4, 2024	Instruction Following, Math	Code Available	4
Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints	Apr 15, 2025	GPU, Inference Optimization	Code Available	4
Fengshenbang 1.0: Being the Foundation of Chinese Cognitive Intelligence	Sep 7, 2022		Code Available	4
OpenCalib: A Multi-sensor Calibration Toolbox for Autonomous Driving	May 27, 2022	Autonomous Driving, Autonomous Vehicles	Code Available	4
Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO	May 22, 2025	Domain Generalization, Image Generation	Code Available	4