
MirrorWiC: On Eliciting Word-in-Context Representations from Pretrained Language Models

2021-09-19 · CoNLL (EMNLP) 2021 · Code Available

Qianchu Liu, Fangyu Liu, Nigel Collier, Anna Korhonen, Ivan Vulić


Abstract

Recent work has indicated that pretrained language models (PLMs) such as BERT and RoBERTa can be transformed into effective sentence and word encoders even via simple self-supervised techniques. Inspired by this line of work, in this paper we propose a fully unsupervised approach to improving word-in-context (WiC) representations in PLMs, achieved via a simple and efficient WiC-targeted fine-tuning procedure: MirrorWiC. The proposed method leverages only raw texts sampled from Wikipedia, assuming no sense-annotated data, and learns context-aware word representations within a standard contrastive learning setup. We experiment with a series of standard and comprehensive WiC benchmarks across multiple languages. Our proposed fully unsupervised MirrorWiC models obtain substantial gains over off-the-shelf PLMs across all monolingual, multilingual, and cross-lingual setups. Moreover, on some standard WiC benchmarks, MirrorWiC is even on par with supervised models fine-tuned with in-task data and sense labels.
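The abstract does not spell out the training objective, but a common instantiation of this kind of self-supervised contrastive setup (in the spirit of SimCSE/Mirror-BERT-style "mirror" views) encodes the same sentence twice with dropout active, takes the target word's token embedding from each pass as a positive pair, and trains with an InfoNCE loss against in-batch negatives. The sketch below is an illustrative assumption, not the authors' released code; the model name, token spans, and temperature are placeholders.

```python
# A minimal sketch of dropout-based contrastive WiC fine-tuning.
# Model choice, span offsets, and temperature are illustrative assumptions.
import torch
import torch.nn.functional as F
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")
model.train()  # keep dropout active so the two forward passes differ

def wic_embedding(sentences, spans):
    """Mean-pool the hidden states over the target word's subword span."""
    batch = tokenizer(sentences, return_tensors="pt", padding=True)
    hidden = model(**batch).last_hidden_state  # (batch, seq_len, dim)
    vecs = [hidden[i, s:e].mean(dim=0) for i, (s, e) in enumerate(spans)]
    return torch.stack(vecs)  # (batch, dim)

def info_nce(a, b, temperature=0.05):
    """In-batch InfoNCE: matching rows of a and b are positive pairs."""
    a, b = F.normalize(a, dim=-1), F.normalize(b, dim=-1)
    logits = a @ b.t() / temperature       # pairwise cosine similarities
    labels = torch.arange(a.size(0))       # positives lie on the diagonal
    return F.cross_entropy(logits, labels)

sentences = ["The bank raised interest rates.", "She sat on the river bank."]
spans = [(2, 3), (6, 7)]  # subword spans of "bank" (assumed tokenizer offsets)

# Two stochastic passes over the identical input produce the "mirror" views.
view1 = wic_embedding(sentences, spans)
view2 = wic_embedding(sentences, spans)
loss = info_nce(view1, view2)
loss.backward()
```

Because no sense labels are needed, the positive pairs come for free from raw Wikipedia sentences, which is what makes the procedure fully unsupervised.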
