SOTAVerified

Evaluating Automatic Speech Recognition Quality and Its Impact on Counselor Utterance Coding

2021-06-01NAACL (CLPsych) 2021Unverified0· sign in to hype

Do June Min, Verónica Pérez-Rosas, Rada Mihalcea

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

Automatic speech recognition (ASR) is a crucial step in many natural language processing (NLP) applications, as often available data consists mainly of raw speech. Since the result of the ASR step is considered as a meaningful, informative input to later steps in the NLP pipeline, it is important to understand the behavior and failure mode of this step. In this work, we analyze the quality of ASR in the psychotherapy domain, using motivational interviewing conversations between therapists and clients. We conduct domain agnostic and domain-relevant evaluations using standard evaluation metrics and also identify domain-relevant keywords in the ASR output. Moreover, we empirically study the effect of mixing ASR and manual data during the training of a downstream NLP model, and also demonstrate how additional local context can help alleviate the error introduced by noisy ASR transcripts.

Tasks

Reproductions