The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 8951–8975 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding	Jul 2, 2024	document understandingKey Information Extraction	CodeCode Available	2	5
Centerline Boundary Dice Loss for Vascular Segmentation	Jul 1, 2024	Segmentation	CodeCode Available	2	5
Benchmarking Predictive Coding Networks -- Made Simple	Jul 1, 2024	Benchmarking	CodeCode Available	2	5
A Survey of Personalization: From RAG to Agent	Apr 14, 2025	RAGRetrieval	CodeCode Available	2	5
Discovering symbolic expressions with parallelized tree search	Jul 5, 2024	Equation Discoveryregression	CodeCode Available	2	5
TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language Models	Jul 4, 2024	RAGRetrieval-augmented Generation	CodeCode Available	2	5
See Further for Parameter Efficient Fine-tuning by Standing on the Shoulders of Decomposition	Jul 7, 2024	parameter-efficient fine-tuning	CodeCode Available	2	5
RPN: Reconciled Polynomial Network Towards Unifying PGMs, Kernel SVMs, MLP and KAN	Jul 5, 2024		CodeCode Available	2	5
Language Representations Can be What Recommenders Need: Findings and Potentials	Jul 7, 2024	Collaborative FilteringContrastive Learning	CodeCode Available	2	5
Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion Recognition	Jul 7, 2024	Emotion RecognitionMultimodal Sentiment Analysis	CodeCode Available	2	5
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps	Jul 9, 2024	ArticlesHallucination	CodeCode Available	2	5
LuSNAR:A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous Exploration	Jul 9, 2024	3D ReconstructionAutonomous Navigation	CodeCode Available	2	5
MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos	Jul 11, 2024	NeRF	CodeCode Available	2	5
Adaptive Parametric Activation	Jul 11, 2024	imbalanced classificationInstance Segmentation	CodeCode Available	2	5
WayveScenes101: A Dataset and Benchmark for Novel View Synthesis in Autonomous Driving	Jul 11, 2024	Autonomous DrivingBenchmarking	CodeCode Available	2	5
AddressCLIP: Empowering Vision-Language Models for City-wide Image Address Localization	Jul 11, 2024	Contrastive LearningTransfer Learning	CodeCode Available	2	5
xLSTMTime : Long-term Time Series Forecasting With xLSTM	Jul 14, 2024	Time SeriesTime Series Forecasting	CodeCode Available	2	5
Image Compression for Machine and Human Vision with Spatial-Frequency Adaptation	Jul 13, 2024	Image Compression	CodeCode Available	2	5
GOFA: A Generative One-For-All Model for Joint Graph Language Modeling	Jul 12, 2024	AllLanguage Modeling	CodeCode Available	2	5
TTSDS -- Text-to-Speech Distribution Score	Jul 17, 2024	text-to-speechText to Speech	CodeCode Available	2	5
UrbanWorld: An Urban World Model for 3D City Generation	Jul 16, 2024	Decision MakingLanguage Modelling	CodeCode Available	2	5
GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection	Jul 16, 2024	BenchmarkingLoop Closure Detection	CodeCode Available	2	5
A Comprehensive Survey of Mamba Architectures for Medical Image Analysis: Classification, Segmentation, Restoration and Beyond	Oct 3, 2024	MambaMedical Image Analysis	CodeCode Available	2	5
GeneralAD: Anomaly Detection Across Domains by Attending to Distorted Features	Jul 17, 2024	Anomaly DetectionSelf-Driving Cars	CodeCode Available	2	5
Weak-to-Strong Reasoning	Jul 18, 2024	GSM8KMath	CodeCode Available	2	5