The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 10951–10975 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
A Prompt-Based Knowledge Graph Foundation Model for Universal In-Context Reasoning	Oct 16, 2024	In-Context LearningKnowledge Graphs	CodeCode Available	2	5
Hacking Back the AI-Hacker: Prompt Injection as a Defense Against LLM-driven Cyberattacks	Oct 28, 2024		CodeCode Available	2	5
GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding	Mar 13, 2025	DiversityLanguage Modeling	CodeCode Available	2	5
What Limits LLM-based Human Simulation: LLMs or Our Design?	Jan 15, 2025		CodeCode Available	2	5
Zero-Shot Vision Encoder Grafting via LLM Surrogates	May 28, 2025	DecoderLanguage Modeling	CodeCode Available	2	5
OpenGlue: Open Source Graph Neural Net Based Pipeline for Image Matching	Apr 19, 2022	Graph Neural Network	CodeCode Available	2	5
Omni-Kernel Network for Image Restoration	Mar 24, 2024	DeblurringImage Defocus Deblurring	CodeCode Available	2	5
FB-OCC: 3D Occupancy Prediction based on Forward-Backward View Transformation	Jul 4, 2023	Autonomous DrivingPrediction Of Occupancy Grid Maps	CodeCode Available	2	5
StreamMapNet: Streaming Mapping Network for Vectorized Online HD Map Construction	Aug 24, 2023	Autonomous Driving	CodeCode Available	2	5
Trends, Applications, and Challenges in Human Attention Modelling	Feb 28, 2024	Language Modelling	CodeCode Available	2	5
MixFormerV2: Efficient Fully Transformer Tracking	May 25, 2023	CPUGPU	CodeCode Available	2	5
PartSTAD: 2D-to-3D Part Segmentation Task Adaptation	Jan 11, 2024	3D Part SegmentationForeground Segmentation	CodeCode Available	2	5
Tri^2-plane: Thinking Head Avatar via Feature Pyramid	Jan 17, 2024		CodeCode Available	2	5
nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space Model	Feb 5, 2024	3D Medical Imaging SegmentationImage Segmentation	CodeCode Available	2	5
LLaMP: Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and Distillation	Jan 30, 2024	HallucinationKnowledge Distillation	CodeCode Available	2	5
Interpretable Pre-Trained Transformers for Heart Time-Series Data	Jul 30, 2024	DecoderElectrocardiography (ECG)	CodeCode Available	2	5
Multi-Class Road User Detection With 3+1D Radar in the View-of-Delft Dataset	Apr 1, 2022	3D Object DetectionBenchmarking	CodeCode Available	2	5
RigNet: Neural Rigging for Articulated Characters	May 1, 2020	Skeleton Rig Prediction	CodeCode Available	2	5
Building Cooperative Embodied Agents Modularly with Large Language Models	Jul 5, 2023	Text Generation	CodeCode Available	2	5
Pretrained Transformers for Text Ranking: BERT and Beyond	Oct 13, 2020	Information RetrievalReranking	CodeCode Available	2	5
Global Convergence and Generalization Bound of Gradient-Based Meta-Learning with Deep Neural Nets	Jun 25, 2020	Few-Shot LearningMeta-Learning	CodeCode Available	2	5
MedMCQA : A Large-scale Multi-Subject Multi-Choice Dataset for Medical domain Question Answering	Mar 27, 2022	DiversityMultiple-choice	CodeCode Available	2	5
Balanced MSE for Imbalanced Visual Regression	Mar 30, 2022	Age EstimationFairness	CodeCode Available	2	5
A Review of Safe Reinforcement Learning: Methods, Theory and Applications	May 20, 2022	Autonomous DrivingDecision Making	CodeCode Available	2	5
A Unified Evaluation of Textual Backdoor Learning: Frameworks and Benchmarks	Jun 17, 2022	text similarity	CodeCode Available	2	5