The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 2,826–2,850 of 177,340 papers

Title	Date	Tasks	Status	Hype	Score
TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving	May 31, 2022	Autonomous Driving, CARLA longest6	Code Available	3	5
EfficientVMamba: Atrous Selective Scan for Light Weight Visual Mamba	Mar 15, 2024	Language Modeling, Language Modelling	Code Available	3	5
ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use Capabilities	Aug 8, 2024		Code Available	3	5
Accelerating Diffusion Transformers with Dual Feature Caching	Dec 25, 2024	Video Generation	Code Available	3	5
Keypoint Promptable Re-Identification	Jul 25, 2024	Metric Learning, Occluded Person Re-Identification	Code Available	3	5
Proteus: A Self-Designing Range Filter	Jun 30, 2022		Code Available	3	5
SARATR-X: Toward Building A Foundation Model for SAR Target Recognition	May 15, 2024	2D Object Detection, Earth Observation	Code Available	3	5
AutoTimes: Autoregressive Time Series Forecasters via Large Language Models	Feb 4, 2024	Decoder, In-Context Learning	Code Available	3	5
PromptKD: Unsupervised Prompt Distillation for Vision-Language Models	Mar 5, 2024	Knowledge Distillation, Prompt Engineering	Code Available	3	5
Matbench Discovery -- A framework to evaluate machine learning crystal stability predictions	Aug 28, 2023	Benchmarking, Formation Energy	Code Available	3	5
Mipha: A Comprehensive Overhaul of Multimodal Assistant with Small Language Models	Mar 10, 2024	Visual Question Answering	Code Available	3	5
SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image Segmentation	Aug 16, 2024	Image Segmentation, Marine Animal Segmentation	Code Available	3	5
Multimodal Foundation Models: From Specialists to General-Purpose Assistants	Sep 18, 2023	Image Generation, Survey	Code Available	3	5
Aria-UI: Visual Grounding for GUI Instructions	Dec 20, 2024	Natural Language Visual Grounding, Visual Grounding	Code Available	3	5
Karatsuba Matrix Multiplication and its Efficient Custom Hardware Implementations	Jan 15, 2025		Code Available	3	5
VRT: A Video Restoration Transformer	Jan 28, 2022	Deblurring, Denoising	Code Available	3	5
A Demonstration of Adaptive Collaboration of Large Language Models for Medical Decision-Making	Oct 31, 2024	Decision Making, Diagnostic	Code Available	3	5
TinyAgent: Function Calling at the Edge	Sep 1, 2024	Language Modelling, Quantization	Code Available	3	5
Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models	Oct 16, 2024	Hallucination, Knowledge Graphs	Code Available	3	5
Graph-Augmented Normalizing Flows for Anomaly Detection of Multiple Time Series	Feb 16, 2022	Anomaly Detection, Density Estimation	Code Available	3	5
Towards An End-to-End Framework for Flow-Guided Video Inpainting	Apr 6, 2022	Hallucination, Optical Flow Estimation	Code Available	3	5
Sintel: A Machine Learning Framework to Extract Insights from Signals	Apr 19, 2022	Anomaly Detection, BIG-bench Machine Learning	Code Available	3	5
VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation	Aug 28, 2023	Instance Segmentation, Optical Flow Estimation	Code Available	3	5
TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement	Jun 14, 2023	GPU, Motion Estimation	Code Available	3	5
Playing Non-Embedded Card-Based Games with Reinforcement Learning	Apr 7, 2025	Board Games, Decision Making	Code Available	3	5