SOTAVerified

Content Aware Analysis of Scholarly Networks: A Case Study on CORD19 Dataset

2024-10-31Code Available0· sign in to hype

Mehmet Emre Akbulut, Yusuf Erdem Nacar

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

This paper investigates the relationships among key elements of the scientific research network, namely articles, researchers, and journals. We introduce a novel approach to use semantic information through the HITS algorithm-based propagation of topic information in the network. The topic information is derived by using the Named Entity Recognition and Entity Linkage. In our case, MedCAT is used to extract the topics from the CORD19 Dataset, which is a corpus of academic articles about COVID-19 and the coronavirus scientific network. Our approach focuses on the COVID-19 domain, utilizing the CORD-19 dataset to demonstrate the efficacy of integrating topic-related information within the citation framework. Through the application of a hybrid HITS algorithm, we show that incorporating topic data significantly influences article rankings, revealing deeper insights into the structure of the academic community.

Tasks

Reproductions