The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 8026–8050 of 474278 papers

Title	Date	Tasks	Status	Hype
MMSci: A Dataset for Graduate-Level Multi-Discipline Multimodal Scientific Understanding	Jul 6, 2024	ArticlesInstruction Following	CodeCode Available	2
SCSA: Exploring the Synergistic Effects Between Spatial and Channel Attention	Jul 6, 2024	Classificationobject-detection	CodeCode Available	2
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents	Jul 5, 2024	Decision MakingMulti-hop Question Answering	CodeCode Available	2
RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation	Jul 5, 2024	Human-Object Interaction DetectionRetrieval	CodeCode Available	2
PartCraft: Crafting Creative Objects by Parts	Jul 5, 2024		CodeCode Available	2
Multi-Branch Auxiliary Fusion YOLO with Re-parameterization Heterogeneous Convolutional for accurate object detection	Jul 5, 2024	Novel Object Detectionobject-detection	CodeCode Available	2
Discovering symbolic expressions with parallelized tree search	Jul 5, 2024	Equation Discoveryregression	CodeCode Available	2
SH17: A Dataset for Human Safety and Personal Protective Equipment Detection in Manufacturing Industry	Jul 5, 2024	Benchmarkingobject-detection	CodeCode Available	2
AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation	Jul 5, 2024	Action RecognitionFew-Shot Image Classification	CodeCode Available	2
Associative Recurrent Memory Transformer	Jul 5, 2024	Retrieval	CodeCode Available	2
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models	Jul 5, 2024	HallucinationLong Form Question Answering	CodeCode Available	2
Pretraining End-to-End Keyword Search with Automatically Discovered Acoustic Units	Jul 5, 2024	Acoustic Unit DiscoveryAutomatic Speech Recognition	CodeCode Available	2
AnySR: Realizing Image Super-Resolution as Any-Scale, Any-Resource	Jul 5, 2024	Image Super-ResolutionSuper-Resolution	CodeCode Available	2
RPN: Reconciled Polynomial Network Towards Unifying PGMs, Kernel SVMs, MLP and KAN	Jul 5, 2024		CodeCode Available	2
Isomorphic Pruning for Vision Models	Jul 5, 2024		CodeCode Available	2
Benchmarking Complex Instruction-Following with Multiple Constraints Composition	Jul 4, 2024	BenchmarkingInstruction Following	CodeCode Available	2
Mixture of A Million Experts	Jul 4, 2024	Computational EfficiencyLanguage Modeling	CodeCode Available	2
Occupancy as Set of Points	Jul 4, 2024		CodeCode Available	2
DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image Classification	Jul 4, 2024	DescriptiveDiversity	CodeCode Available	2
Unraveling Molecular Structure: A Multimodal Spectroscopic Dataset for Chemistry	Jul 4, 2024		CodeCode Available	2
TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language Models	Jul 4, 2024	RAGRetrieval-augmented Generation	CodeCode Available	2
VoxAct-B: Voxel-Based Acting and Stabilizing Policy for Bimanual Manipulation	Jul 4, 2024		CodeCode Available	2
ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild	Jul 4, 2024	Chart UnderstandingDecision Making	CodeCode Available	2
MiniGPT-Med: Large Language Model as a General Interface for Radiology Diagnosis	Jul 4, 2024	DiagnosticLanguage Modeling	CodeCode Available	2
Craftium: An Extensible Framework for Creating Reinforcement Learning Environments	Jul 4, 2024	BenchmarkingMinecraft	CodeCode Available	2