The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers


Showing 9001–9025 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
BearLLM: A Prior Knowledge-Enhanced Bearing Health Management Framework with Unified Vibration Signal Representation	Aug 21, 2024	Fault Diagnosis, Management	Code Available	2	5
Scalable Autoregressive Image Generation with Mamba	Aug 22, 2024	Image Generation, Mamba	Code Available	2	5
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning	Jun 30, 2025	Math, Multi-agent Reinforcement Learning	Code Available	2	5
MLR-Copilot: Autonomous Machine Learning Research based on Large Language Models Agents	Aug 26, 2024	Language Modeling, Language Modelling	Code Available	2	5
LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token Embeddings	Aug 25, 2024	Language Modelling, Link Prediction	Code Available	2	5
Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation	Aug 27, 2024	Camouflaged Object Segmentation, Camouflaged Object Segmentation with a Single Task-generic Prompt	Code Available	2	5
Stochastic Parameter Decomposition	Jun 25, 2025		Code Available	2	5
Enhancing Privacy in Federated Learning: Secure Aggregation for Real-World Healthcare Applications	Sep 2, 2024	CPU, Federated Learning	Code Available	2	5
Boosting Vision-Language Models for Histopathology Classification: Predict all at once	Sep 3, 2024	All, zero-shot-classification	Code Available	2	5
FunctionChat-Bench: Comprehensive Evaluation of Language Models' Generative Capabilities in Korean Tool-use Dialogs	Nov 21, 2024	Relevance Detection	Code Available	2	5
Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression	Sep 1, 2024	Autonomous Driving	Code Available	2	5
Towards a Unified View of Preference Learning for Large Language Models: A Survey	Sep 4, 2024		Code Available	2	5
UniDet3D: Multi-dataset Indoor 3D Object Detection	Sep 6, 2024	3D Object Detection, Object	Code Available	2	5
A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement	Sep 8, 2024	Code Generation	Code Available	2	5
Assessing SPARQL capabilities of Large Language Models	Sep 9, 2024	Benchmarking, Knowledge Graphs	Code Available	2	5
DiffusionPen: Towards Controlling the Style of Handwritten Text Generation	Sep 9, 2024	Diversity, HTR	Code Available	2	5
ThermalGaussian: Thermal 3D Gaussian Splatting	Sep 11, 2024	3DGS, NeRF	Code Available	2	5
What is the Relationship between Tensor Factorizations and Circuits (and How Can We Exploit it)?	Sep 12, 2024		Code Available	2	5
Recent Trends of Multimodal Affective Computing: A Survey from NLP Perspective	Sep 11, 2024	Aspect-Based Sentiment Analysis, Emotion Recognition	Code Available	2	5
EZIGen: Enhancing zero-shot personalized image generation with precise subject encoding and decoupled guidance	Sep 12, 2024	Denoising, Image Generation	Code Available	2	5
SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis	Sep 11, 2024	Decoder, Speech Synthesis	Code Available	2	5
Fit and Prune: Fast and Training-free Visual Token Pruning for Multi-modal Large Language Models	Sep 16, 2024		Code Available	2	5
Large Language Models are Strong Audio-Visual Speech Recognition Learners	Sep 18, 2024	Audio-Visual Speech Recognition, Automatic Speech Recognition	Code Available	2	5
HSIGene: A Foundation Model For Hyperspectral Image Generation	Sep 19, 2024	Data Augmentation, Denoising	Code Available	2	5
Small Language Models: Survey, Measurements, and Insights	Sep 24, 2024	Benchmarking, Decoder	Code Available	2	5