
Tracing Traditions: Automatic Extraction of Isnads from Classical Arabic Texts

2020-12-01 · COLING (WANLP) 2020

Ryan Muther, David Smith

Abstract

We present our work on automatically detecting isnads, the chains of authorities for a report that serve as citations in hadith and other classical Arabic texts. We experiment with both sequence labeling methods for identifying isnads in a single pass and a hybrid “retrieve-and-tag” approach, in which a retrieval model first identifies portions of the text that are likely to contain start points for isnads, then a sequence labeling model identifies the exact starting locations within these much smaller retrieved text chunks. We find that the usefulness of full-document sequence-to-sequence models is limited by memory constraints and by the ineffectiveness of such models at modeling very long documents. We conclude by sketching future improvements on the tagging task and more in-depth analysis of the people and relationships involved in the social network that influenced the evolution of the written tradition over time.
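The two-stage "retrieve-and-tag" idea described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the window size, stride, `top_k` cutoff, and the helper names `split_into_chunks`, `toy_score_chunk`, and `toy_tag_chunk` are all assumptions made for illustration, and the cue-word scorer and tagger are toy stand-ins for the trained retrieval and sequence labeling models the paper refers to.

```python
from typing import Callable, List, Tuple

Chunk = Tuple[int, List[str]]  # (offset of the chunk in the document, chunk tokens)


def split_into_chunks(tokens: List[str], size: int, stride: int) -> List[Chunk]:
    """Cover the document with overlapping windows of `size` tokens."""
    if size <= 0 or stride <= 0:
        raise ValueError("size and stride must be positive")
    chunks: List[Chunk] = []
    start = 0
    while True:
        chunks.append((start, tokens[start:start + size]))
        if start + size >= len(tokens):
            break
        start += stride
    return chunks


def retrieve_and_tag(
    tokens: List[str],
    score_chunk: Callable[[List[str]], float],
    tag_chunk: Callable[[List[str]], List[int]],
    size: int = 8,
    stride: int = 4,
    top_k: int = 3,
) -> List[int]:
    """Stage 1: keep the top_k highest-scoring chunks (retrieval).
    Stage 2: run the tagger only on those chunks and map the predicted
    start positions back to document-level token offsets."""
    chunks = split_into_chunks(tokens, size, stride)
    ranked = sorted(chunks, key=lambda c: score_chunk(c[1]), reverse=True)[:top_k]
    starts = set()
    for offset, chunk in ranked:
        for local_index in tag_chunk(chunk):
            starts.add(offset + local_index)
    return sorted(starts)


# Toy stand-ins (assumptions, NOT the paper's models): a single transliterated
# cue word plays the role of both the retrieval signal and the tagger.
TOY_CUE = "haddathana"


def toy_score_chunk(chunk: List[str]) -> float:
    """Hypothetical retrieval score: number of cue tokens in the chunk."""
    return float(sum(1 for tok in chunk if tok == TOY_CUE))


def toy_tag_chunk(chunk: List[str]) -> List[int]:
    """Hypothetical tagger: every cue token is predicted as an isnad start."""
    return [i for i, tok in enumerate(chunk) if tok == TOY_CUE]
```

The point of the design is that the more expensive tagger only ever sees short chunks, so its memory use depends on the window size rather than on the length of the full document. The overlapping stride is one simple way to avoid losing a start point that falls on a window boundary.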
