Phenotyping of Clinical Notes with Improved Document Classification Models Using Contextualized Neural Language Models

2019-10-30Code Available0· sign in to hype

Andriy Mulyar, Elliot Schumacher, Masoud Rouhizadeh, Mark Dredze

Code Available — Be the first to reproduce this paper.

Code

github.com/AndriyMulyar/bert_document_classification
OfficialIn paperpytorch★ 154
github.com/ArneDefauw/BERT_doc_classification
pytorch★ 0

Abstract

Clinical notes contain an extensive record of a patient's health status, such as smoking status or the presence of heart conditions. However, this detail is not replicated within the structured data of electronic health systems. Phenotyping, the extraction of patient conditions from free clinical text, is a critical task which supports avariety of downstream applications such as decision support and secondary use of medical records. Previous work has resulted in systems which are high performing but require hand engineering, often of rules. Recent work in pretrained contextualized language models have enabled advances in representing text for a variety of tasks. We therefore explore several architectures for modeling pheno-typing that rely solely on BERT representations of the clinical note, removing the need for manual engineering. We find these architectures are competitive with or outperform existing state of the art methods on two phenotyping tasks.

Tasks

Document Classification General Classification

Phenotyping of Clinical Notes with Improved Document Classification Models Using Contextualized Neural Language Models

Code

Abstract

Tasks

Reproductions