
A Fast and Accurate Partially Deterministic Morphological Analysis

2019-09-01 · RANLP 2019

Hajime Morita, Tomoya Iwakura


Abstract

This paper proposes a partially deterministic morphological analysis method for improved processing speed. Maximum matching is a fast deterministic method for morphological analysis, but it tends to degrade performance because it ignores contextual information. To use maximum matching safely, we propose the use of Context Independent Strings (CISs), which are strings whose morphological analysis is unambiguous. Our method first identifies CISs in a sentence using maximum matching without contextual information, then analyzes the unprocessed part of the sentence using a bi-gram-based morphological analysis model. We evaluate the method on a Japanese morphological analysis task. The experimental results show a 30% reduction in running time while maintaining improved accuracy.
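
The two-stage idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the names `find_cis_spans` and `analyze`, the toy CIS dictionary, and the `fallback` callable (a stand-in for the bi-gram-based model) are all hypothetical. The sketch also assumes that each gap between CISs can be analyzed on its own, which the abstract does not state.

```python
from typing import Callable, List, Set, Tuple


def find_cis_spans(sentence: str, cis_dict: Set[str]) -> List[Tuple[int, int]]:
    """Greedy longest-match (maximum matching) scan for CIS entries.

    Returns non-overlapping (start, end) spans, left to right.
    """
    max_len = max((len(entry) for entry in cis_dict), default=0)
    spans = []
    i = 0
    while i < len(sentence):
        matched = False
        # Try the longest candidate first, then shrink.
        for length in range(min(max_len, len(sentence) - i), 0, -1):
            if sentence[i:i + length] in cis_dict:
                spans.append((i, i + length))
                i += length
                matched = True
                break
        if not matched:
            i += 1  # no CIS starts here; leave this character for the fallback
    return spans


def analyze(
    sentence: str,
    cis_dict: Set[str],
    fallback: Callable[[str], List[str]],
) -> List[str]:
    """Segment a sentence: CIS spans deterministically, the rest via `fallback`.

    `fallback` is a hypothetical placeholder for a context-aware analyzer.
    """
    tokens: List[str] = []
    gap_start = 0
    for start, end in find_cis_spans(sentence, cis_dict):
        if gap_start < start:
            tokens.extend(fallback(sentence[gap_start:start]))
        tokens.append(sentence[start:end])
        gap_start = end
    if gap_start < len(sentence):
        tokens.extend(fallback(sentence[gap_start:]))
    return tokens


# Toy usage: the CIS entries are invented for illustration, and `list`
# (split into single characters) stands in for the real fallback model.
toy_cis = {"東京都", "に"}
print(analyze("東京都に住む", toy_cis, list))
```

In this sketch the speed-up would come from the fallback being called only on the text that lies outside the matched CIS spans; how large that share is depends on the dictionary and the corpus.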
