SOTAVerified

WikiUMLS: Aligning UMLS to Wikipedia via Cross-lingual Neural Ranking

2020-05-04COLING 2020Code Available1· sign in to hype

Afshin Rahimi, Timothy Baldwin, Karin Verspoor

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

We present our work on aligning the Unified Medical Language System (UMLS) to Wikipedia, to facilitate manual alignment of the two resources. We propose a cross-lingual neural reranking model to match a UMLS concept with a Wikipedia page, which achieves a recall@1 of 72%, a substantial improvement of 20% over word- and char-level BM25, enabling manual alignment with minimal effort. We release our resources, including ranked Wikipedia pages for 700k UMLS concepts, and WikiUMLS, a dataset for training and evaluation of alignment models between UMLS and Wikipedia. This will provide easier access to Wikipedia for health professionals, patients, and NLP systems, including in multilingual settings.

Tasks

Reproductions