
Effective Hierarchical Information Threading Using Network Community Detection

2023-03-17 · European Conference on Information Retrieval 2023

Hitarth Narvala, Graham McDonald, Iadh Ounis

Abstract

With the tremendous growth in the volume of information produced online every day (e.g. news articles), there is a need for automatic methods to identify related information about events as the events evolve over time (i.e., information threads). In this work, we propose a novel unsupervised approach, called HINT, which identifies coherent Hierarchical Information Threads. These threads can enable users to easily interpret a hierarchical association of diverse evolving information about an event or discussion. In particular, HINT deploys a scalable architecture based on network community detection to effectively identify hierarchical links between documents based on their chronological relatedness and answers to the 5W1H questions (i.e., who, what, where, when, why & how). On the NewSHead collection, we show that HINT markedly outperforms existing state-of-the-art approaches in terms of the quality of the identified threads. We also conducted a user study that shows that our proposed network-based hierarchical threads are significantly (p<0.05) preferred by users compared to cluster-based sequential threads.
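
The abstract describes linking documents by chronological relatedness and shared 5W1H answers, then applying network community detection. The following is a minimal sketch of that general idea, not the authors' HINT implementation. Everything in it is an assumption made for illustration: the toy documents, the `w5h1` term sets, the Jaccard-times-time-decay edge weight, the `tau` and `threshold` values, the use of plain label propagation as the community detection step, and the "strongest earlier neighbour" rule for deriving a hierarchy.

```python
import math
from collections import defaultdict

# Hypothetical toy documents: a publication day and a set of 5W1H terms.
docs = {
    "d1": {"day": 0, "w5h1": {"mayor", "election", "london"}},
    "d2": {"day": 1, "w5h1": {"mayor", "election", "london", "vote"}},
    "d3": {"day": 2, "w5h1": {"mayor", "election", "recount"}},
    "d4": {"day": 0, "w5h1": {"storm", "florida", "evacuation"}},
    "d5": {"day": 1, "w5h1": {"storm", "florida", "damage"}},
}


def edge_weight(a, b, tau=7.0):
    """Assumed weight: Jaccard overlap of 5W1H terms, damped by the gap in days."""
    union = a["w5h1"] | b["w5h1"]
    if not union:
        return 0.0
    jaccard = len(a["w5h1"] & b["w5h1"]) / len(union)
    return jaccard * math.exp(-abs(a["day"] - b["day"]) / tau)


def build_graph(docs, threshold=0.2):
    """Weighted undirected graph; pairs below the threshold get no edge."""
    graph = {doc_id: {} for doc_id in docs}
    ids = sorted(docs)
    for i, u in enumerate(ids):
        for v in ids[i + 1:]:
            w = edge_weight(docs[u], docs[v])
            if w >= threshold:
                graph[u][v] = w
                graph[v][u] = w
    return graph


def label_propagation(graph, max_iter=20):
    """Deterministic weighted label propagation (a simple community detector)."""
    labels = {node: node for node in graph}
    for _ in range(max_iter):
        changed = False
        for node in sorted(graph):
            if not graph[node]:
                continue  # isolated documents keep their own label
            scores = defaultdict(float)
            for neighbour, w in graph[node].items():
                scores[labels[neighbour]] += w
            # Highest total weight wins; ties go to the smallest label.
            best = min(scores, key=lambda lab: (-scores[lab], lab))
            if best != labels[node]:
                labels[node] = best
                changed = True
        if not changed:
            break
    return labels


def thread_parents(docs, graph, labels):
    """Assumed hierarchy rule: parent = strongest earlier doc in the same community."""
    parents = {}
    for node in sorted(docs):
        earlier = [
            (w, nb)
            for nb, w in graph[node].items()
            if labels[nb] == labels[node] and docs[nb]["day"] < docs[node]["day"]
        ]
        parents[node] = max(earlier)[1] if earlier else None
    return parents


graph = build_graph(docs)
labels = label_propagation(graph)
parents = thread_parents(docs, graph, labels)

for doc_id in sorted(docs):
    print(doc_id, "community:", labels[doc_id], "parent:", parents[doc_id])
```

In this toy run the election articles and the storm articles fall into separate communities, and `d1` becomes the parent of both `d2` and `d3`, which gives a branching thread rather than a single sequence. A real system would need a proper 5W1H extractor and a community detection method suited to large graphs.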
