SOTAVerified

Detecting Structured Language Alternations in Historical Documents by Combining Language Identification with Fourier Analysis

2024-01-25Unverified0· sign in to hype

Hale Sirin, Sabrina Li, Tom Lippincott

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

In this study, we present a generalizable workflow to identify documents in a historic language with a nonstandard language and script combination, Armeno-Turkish. We introduce the task of detecting distinct patterns of multilinguality based on the frequency of structured language alternations within a document.

Tasks

Reproductions