Information Extraction from German Patient Records via Hybrid Parsing and Relation Extraction Strategies

2014-05-01, LREC 2014

Hans-Ulrich Krieger, Christian Spurk, Hans Uszkoreit, Feiyu Xu, Yi Zhang, Frank Müller, Thomas Tolxdorff

Abstract

In this paper, we report on first attempts at, and findings from, analyzing German patient records using a hybrid parsing architecture and a combination of two relation extraction strategies. On a practical level, we are interested in extracting concepts and the relations among them, a necessary cornerstone for building medical information systems. The parsing pipeline consists of a morphological analyzer, a robust chunk parser adapted to Latin phrases used in medical diagnosis, a repair rule stage, and a probabilistic context-free parser that respects the output from the chunker. The relation extraction stage combines two systems: SProUT, a shallow processor that uses hand-written rules to discover relation instances in local text units, and DARE, which extracts relation instances from complete sentences using rules learned in a bootstrapping process that starts from semantic seeds. Two small experiments have been carried out for the parsing pipeline and the relation extraction stage.

Tasks

Chunking, Medical Diagnosis, Morphological Analysis, Named Entity Recognition (NER), Relation, Relation Extraction
