The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 8176–8200 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
Attentive Merging of Hidden Embeddings from Pre-trained Speech Model for Anti-spoofing Detection	Jun 12, 2024	Computational EfficiencySelf-Supervised Learning	CodeCode Available	2	5
ShiftwiseConv: Small Convolutional Kernel with Large Kernel Effect	Jan 1, 2025		CodeCode Available	2	5
Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning	Jun 10, 2025	Model SelectionReinforcement Learning (RL)	CodeCode Available	2	5
Institutional Books 1.0: A 242B token dataset from Harvard Library's collections, refined for accuracy and usability	Jun 10, 2025	Optical Character Recognition (OCR)	CodeCode Available	2	5
Do MIL Models Transfer?	Jun 10, 2025	Multiple Instance LearningTransfer Learning	CodeCode Available	2	5
SDialog: A Python Toolkit for Synthetic Dialogue Generation and Analysis	Jun 12, 2025	BenchmarkingDialogue Generation	CodeCode Available	2	5
Vision Transformers Don't Need Trained Registers	Jun 9, 2025		CodeCode Available	2	5
AutoMind: Adaptive Knowledgeable Agent for Automated Data Science	Jun 12, 2025	Code GenerationLarge Language Model	CodeCode Available	2	5
UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents	May 27, 2025	16k	CodeCode Available	2	5
IntPhys 2: Benchmarking Intuitive Physics Understanding In Complex Synthetic Environments	Jun 11, 2025	Benchmarking	CodeCode Available	2	5
VerIF: Verification Engineering for Reinforcement Learning in Instruction Following	Jun 11, 2025	Instruction Followingreinforcement-learning	CodeCode Available	2	5
Solving the Job Shop Scheduling Problem with Graph Neural Networks: A Customizable Reinforcement Learning Environment	Jun 10, 2025	Combinatorial OptimizationImitation Learning	CodeCode Available	2	5
AnalogNAS-Bench: A NAS Benchmark for Analog In-Memory Computing	Jun 23, 2025	Neural Architecture SearchQuantization	CodeCode Available	2	5
Confucius3-Math: A Lightweight High-Performance Reasoning LLM for Chinese K-12 Mathematics Learning	Jun 23, 2025	GPULarge Language Model	CodeCode Available	2	5
Towards In-the-wild 3D Plane Reconstruction from a Single Image	Jun 3, 2025	3D Plane Detection	CodeCode Available	2	5
Test3R: Learning to Reconstruct 3D at Test Time	Jun 16, 2025	3D ReconstructionDepth Estimation	CodeCode Available	2	5
Parallels Between VLA Model Post-Training and Human Motor Learning: Progress, Challenges, and Trends	Jun 26, 2025	Action GenerationVision-Language-Action	CodeCode Available	2	5
Flow-Anchored Consistency Models	Jul 4, 2025	Image Generation	CodeCode Available	2	5
Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion	Jul 8, 2025	3D geometryDomain Generalization	CodeCode Available	2	5
EAMamba: Efficient All-Around Vision State Space Model for Image Restoration	Jun 27, 2025	AllDeblurring	CodeCode Available	2	5
Open Source Planning & Control System with Language Agents for Autonomous Scientific Discovery	Jul 9, 2025	Language ModelingLanguage Modelling	CodeCode Available	2	5
CaRL: Learning Scalable Planning Policies with Simple Rewards	Apr 24, 2025	Autonomous DrivingCARLA longest6	CodeCode Available	2	5
HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model	Mar 17, 2025	Image SegmentationSegmentation	CodeCode Available	2	5
Detecting Spacecraft Anomalies Using LSTMs and Nonparametric Dynamic Thresholding	Feb 13, 2018	Anomaly Detection	CodeCode Available	2	5
The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation	Apr 26, 2018	Machine TranslationTranslation	CodeCode Available	2	5