SOTAVerified

ExpFinder: An Ensemble Expert Finding Model Integrating N-gram Vector Space Model and μCO-HITS

2021-01-18Code Available0· sign in to hype

Yong-Bin Kang, Hung Du, Abdur Rahim Mohammad Forkan, Prem Prakash Jayaraman, Amir Aryani, Timos Sellis

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Finding an expert plays a crucial role in driving successful collaborations and speeding up high-quality research development and innovations. However, the rapid growth of scientific publications and digital expertise data makes identifying the right experts a challenging problem. Existing approaches for finding experts given a topic can be categorised into information retrieval techniques based on vector space models, document language models, and graph-based models. In this paper, we propose ExpFinder, a new ensemble model for expert finding, that integrates a novel N-gram vector space model, denoted as nVSM, and a graph-based model, denoted as -HITS, that is a proposed variation of the CO-HITS algorithm. The key of nVSM is to exploit recent inverse document frequency weighting method for N-gram words and ExpFinder incorporates nVSM into -HITS to achieve expert finding. We comprehensively evaluate ExpFinder on four different datasets from the academic domains in comparison with six different expert finding models. The evaluation results show that ExpFinder is a highly effective model for expert finding, substantially outperforming all the compared models in 19% to 160.2%.

Tasks

Reproductions