The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers


Showing 4526–4550 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
N-LTP: An Open-source Neural Language Technology Platform for Chinese	Sep 24, 2020	Chinese Word Segmentation; Dependency Parsing	Code Available	3	5
AgentBoard: An Analytical Evaluation Board of Multi-turn LLM Agents	Jan 24, 2024	Benchmarking	Code Available	3	5
EfficientDet: Scalable and Efficient Object Detection	Nov 20, 2019	AutoML; Object	Code Available	3	5
Auto-Sklearn 2.0: Hands-free AutoML via Meta-Learning	Jul 8, 2020	AutoML; BIG-bench Machine Learning	Code Available	3	5
LongRoPE2: Near-Lossless LLM Context Window Scaling	Feb 27, 2025		Code Available	3	5
A Novel Non-population-based Meta-heuristic Optimizer Inspired by the Philosophy of Yi Jing	Apr 17, 2021	Philosophy	Code Available	3	5
CarDreamer: Open-Source Learning Platform for World Model based Autonomous Driving	May 15, 2024	Autonomous Driving; Autonomous Vehicles	Code Available	3	5
Practical Video Object Detection via Feature Selection and Aggregation	Jul 29, 2024	feature selection; GPU	Code Available	3	5
LIMR: Less is More for RL Scaling	Feb 17, 2025		Code Available	3	5
WebCanvas: Benchmarking Web Agents in Online Environments	Jun 18, 2024	AI Agent; Benchmarking	Code Available	3	5
ptwt - The PyTorch Wavelet Toolbox	Mar 1, 2024		Code Available	3	5
UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond Scaling	Aug 9, 2024	GPU; Language Modeling	Code Available	3	5
DNA Family: Boosting Weight-Sharing NAS with Block-Wise Supervisions	Mar 2, 2024	Neural Architecture Search	Code Available	3	5
Syzygy of Thoughts: Improving LLM CoT with the Minimal Free Resolution	Apr 13, 2025	GSM8K; Math	Code Available	3	5
MTVCrafter: 4D Motion Tokenization for Open-World Human Image Animation	May 15, 2025	Image Animation; Video Generation	Code Available	3	5
mlpack 3: a fast, flexible machine learning library	Jun 18, 2018	Benchmarking; BIG-bench Machine Learning	Code Available	3	5
Large Language Model-Brained GUI Agents: A Survey	Nov 27, 2024	Code Generation; Language Modeling	Code Available	3	5
Rethinking Histology Slide Digitization Workflows for Low-Resource Settings	May 13, 2024	Deblurring; whole slide images	Code Available	3	5
Allo: A Programming Model for Composable Accelerator Design	Apr 7, 2024	GPU; High-Level Synthesis	Code Available	3	5
CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations	Feb 6, 2024	Visual Reasoning	Code Available	3	5
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context	Mar 8, 2024	1 Image, 2*2 Stitching; Code Generation	Code Available	3	5
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference	Oct 28, 2024	CPU	Code Available	3	5
PyTorch Metric Learning	Aug 20, 2020	Metric Learning	Code Available	3	5
ReasonIR: Training Retrievers for Reasoning Tasks	Apr 29, 2025	Information Retrieval; MMLU	Code Available	3	5
OCR-free Document Understanding Transformer	Nov 30, 2021	Document Image Classification; document understanding	Code Available	3	5