Curricular Transfer Learning for Sentence Encoded Tasks

2023-08-03

Jader Martins Camboim de Sá, Matheus Ferraroni Sanches, Rafael Roque de Souza, Júlio Cesar dos Reis, Leandro Aparecido Villas

Abstract

Fine-tuning a pre-trained language model on a downstream task is the standard approach behind many state-of-the-art methods in NLP. However, when the source-task and target-task distributions diverge, as in conversational environments, these gains tend to diminish. This article proposes a sequence of pre-training steps (a curriculum), guided by "data hacking" and grammar analysis, that allows further gradual adaptation between pre-training distributions. In our experiments on the MultiWoZ task, our method yields a considerable improvement over other known pre-training approaches.

Tasks

Sentence, Transfer Learning