Can citations tell us about a paper's reproducibility? A case study of machine learning papers

2024-05-07Code Available0· sign in to hype

Rochana R. Obadage, Sarah M. Rajtmajer, Jian Wu

Code Available — Be the first to reproduce this paper.

Code

github.com/lamps-lab/ccair-ai-reproducibility
OfficialIn papertf★ 2

Abstract

The iterative character of work in machine learning (ML) and artificial intelligence (AI) and reliance on comparisons against benchmark datasets emphasize the importance of reproducibility in that literature. Yet, resource constraints and inadequate documentation can make running replications particularly challenging. Our work explores the potential of using downstream citation contexts as a signal of reproducibility. We introduce a sentiment analysis framework applied to citation contexts from papers involved in Machine Learning Reproducibility Challenges in order to interpret the positive or negative outcomes of reproduction attempts. Our contributions include training classifiers for reproducibility-related contexts and sentiment analysis, and exploring correlations between citation context sentiment and reproducibility scores. Study data, software, and an artifact appendix are publicly available at https://github.com/lamps-lab/ccair-ai-reproducibility .

Tasks

Sentiment Analysis

Can citations tell us about a paper's reproducibility? A case study of machine learning papers

Code

Abstract

Tasks

Reproductions