The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers, 248,326 code links, 4,818 tasks

Papers

Showing 12,701–12,750 of 177,340 papers

Title	Date	Tasks	Status	Hype	Score
HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language Models	May 19, 2023	Hallucination, Hallucination Evaluation	Code Available	2	5
Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding -- A Survey	Feb 27, 2024	Language Modeling, Language Modelling	Code Available	2	5
On the representation and methodology for wide and short range head pose estimation	Jan 11, 2024	Articles, Head Pose Estimation	Code Available	2	5
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents	Jul 5, 2024	Decision Making, Multi-hop Question Answering	Code Available	2	5
Low Latency Point Cloud Rendering with Learned Splatting	Sep 24, 2024		Code Available	2	5
LLM-based SPARQL Query Generation from Natural Language over Federated Knowledge Graphs	Oct 8, 2024	Knowledge Graphs, RAG	Code Available	2	5
Understanding the Tricks of Deep Learning in Medical Image Segmentation: Challenges and Future Directions	Sep 21, 2022	Data Augmentation, Domain Adaptation	Code Available	2	5
Rethinking Test-time Likelihood: The Likelihood Path Principle and Its Application to OOD Detection	Jan 10, 2024	Out of Distribution (OOD) Detection	Code Available	2	5
DaD: Distilled Reinforcement Learning for Diverse Keypoint Detection	Mar 10, 2025	Keypoint Detection, reinforcement-learning	Code Available	2	5
ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields	Nov 21, 2022	3D Reconstruction, Camera Localization	Code Available	2	5
SMT 2.0: A Surrogate Modeling Toolbox with a focus on Hierarchical and Mixed Variables Gaussian Processes	May 23, 2023	Gaussian Processes	Code Available	2	5
Advancing 6-DoF Instrument Pose Estimation in Variable X-Ray Imaging Geometries	May 19, 2024	6D Pose Estimation, GPU	Code Available	2	5
CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model	Mar 13, 2024	General Knowledge, Instruction Following	Code Available	2	5
CURLoRA: Stable LLM Continual Fine-Tuning and Catastrophic Forgetting Mitigation	Aug 26, 2024	Continual Learning	Code Available	2	5
Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis	Nov 11, 2024	Attribute, Image Generation	Code Available	2	5
CT2Rep: Automated Radiology Report Generation for 3D Medical Imaging	Mar 11, 2024		Code Available	2	5
MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation	Dec 15, 2022	Face Swapping, Meta-Learning	Code Available	2	5
TimeMIL: Advancing Multivariate Time Series Classification via a Time-aware Multiple Instance Learning	May 6, 2024	Multiple Instance Learning, Time Series	Code Available	2	5
Modelling Non-Smooth Signals with Complex Spectral Structure	Mar 14, 2022	Variational Inference	Code Available	2	5
RapFlow-TTS: Rapid and High-Fidelity Text-to-Speech with Improved Consistency Flow Matching	Jun 20, 2025	Scheduling, Speech Synthesis	Code Available	2	5
NeuralPLexer3: Accurate Biomolecular Complex Structure Prediction with Flow Models	Dec 14, 2024	Benchmarking, Drug Design	Code Available	2	5
Salient Object-Aware Background Generation using Text-Guided Diffusion Models	Apr 15, 2024	Object	Code Available	2	5
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback	Feb 2, 2024	Code Completion, Code Generation	Code Available	2	5
Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models	Mar 26, 2024		Code Available	2	5
The Super Weight in Large Language Models	Nov 11, 2024	Language Modeling, Language Modelling	Code Available	2	5
ProtComposer: Compositional Protein Structure Generation with 3D Ellipsoids	Mar 6, 2025	Diversity	Code Available	2	5
Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation	Mar 28, 2024	6D Pose Estimation using RGB, Keypoint Detection	Code Available	2	5
When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition	Jul 23, 2022	Decoder, Handwritten Mathmatical Expression Recognition	Code Available	2	5
Towards Generalizable Scene Change Detection	Sep 10, 2024	Change Detection, Scene Change Detection	Code Available	2	5
CFMW: Cross-modality Fusion Mamba for Multispectral Object Detection under Adverse Weather Conditions	Apr 25, 2024	Mamba, Multispectral Object Detection	Code Available	2	5
WildAvatar: Web-scale In-the-wild Video Dataset for 3D Avatar Creation	Jul 2, 2024		Code Available	2	5
PointGPT: Auto-regressively Generative Pre-training from Point Clouds	May 19, 2023	3D Point Cloud Classification, Decoder	Code Available	2	5
Decentralization and Acceleration Enables Large-Scale Bundle Adjustment	May 11, 2023		Code Available	2	5
Quiver: Supporting GPUs for Low-Latency, High-Throughput GNN Serving with Workload Awareness	May 18, 2023	CPU, GPU	Code Available	2	5
HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation	Feb 17, 2025		Code Available	2	5
MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems	May 25, 2025		Code Available	2	5
SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising	May 2, 2024	Computational Efficiency, Denoising	Code Available	2	5
Large Scale Longitudinal Experiments: Estimation and Inference	Oct 13, 2024	Computational Efficiency	Code Available	2	5
Image Referenced Sketch Colorization Based on Animation Creation Workflow	Feb 27, 2025	Colorization, Sketch Colorization	Code Available	2	5
PromptReps: Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval	Apr 29, 2024	Document Ranking, Re-Ranking	Code Available	2	5
Learning Harmonized Representations for Speculative Sampling	Aug 28, 2024		Code Available	2	5
MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding	Sep 23, 2024	Language Modeling, Language Modelling	Code Available	2	5
MMSci: A Dataset for Graduate-Level Multi-Discipline Multimodal Scientific Understanding	Jul 6, 2024	Articles, Instruction Following	Code Available	2	5
Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models	May 23, 2024	Mixture-of-Experts, Visual Question Answering	Code Available	2	5
UGPhysics: A Comprehensive Benchmark for Undergraduate Physics Reasoning with Large Language Models	Feb 1, 2025	Math	Code Available	2	5
Mathematical Introduction to Deep Learning: Methods, Implementations, and Theory	Oct 31, 2023	Deep Learning	Code Available	2	5
Adaptive Probabilistic ODE Solvers Without Adaptive Memory Requirements	Oct 14, 2024	State Estimation, Time Series	Code Available	2	5
Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts	Oct 14, 2024	Mixture-of-Experts	Code Available	2	5
Enhancing Vectorized Map Perception with Historical Rasterized Maps	Sep 1, 2024	Autonomous Driving	Code Available	2	5
RoboBERT: An End-to-end Multimodal Robotic Manipulation Model	Feb 11, 2025	Data Augmentation	Code Available	2	5