On Selecting Training Corpora for Cross-Domain Claim Detection

2022-10-01 · ArgMining (ACL) 2022

Robin Schaefer, René Knaebel, Manfred Stede

Abstract

Identifying claims in text is a crucial first step in argument mining. In this paper, we investigate factors for the composition of training corpora to improve cross-domain claim detection. To this end, we use four recent argumentation corpora annotated with claims and submit them to several experimental scenarios. Our results indicate that the “ideal” composition of training corpora is characterized by a large corpus size, homogeneous claim proportions, and less formal text domains.
