GiBERT: Enhancing BERT with Linguistic Information using a Lightweight Gated Injection Method

2021-11-01Findings (EMNLP) 2021Code Available0· sign in to hype

Nicole Peinelt, Marek Rei, Maria Liakata

Code Available — Be the first to reproduce this paper.

Code

github.com/wuningxi/gibert
OfficialIn papertf★ 4

Abstract

Large pre-trained language models such as BERT have been the driving force behind recent improvements across many NLP tasks. However, BERT is only trained to predict missing words – either through masking or next sentence prediction – and has no knowledge of lexical, syntactic or semantic information beyond what it picks up through unsupervised pre-training. We propose a novel method to explicitly inject linguistic information in the form of word embeddings into any layer of a pre-trained BERT. When injecting counter-fitted and dependency-based embeddings, the performance improvements on multiple semantic similarity datasets indicate that such information is beneficial and currently missing from the original model. Our qualitative analysis shows that counter-fitted embedding injection is particularly beneficial, with notable improvements on examples that require synonym resolution.

GiBERT: Enhancing BERT with Linguistic Information using a Lightweight Gated Injection Method

Code

Abstract

Tasks

Reproductions