Attention-based Contrastive Learning for Winograd Schemas
Tassilo Klein, Moin Nabi
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/sap-samples/emnlp2021-attention-contrastive-learningOfficialIn paperpytorch★ 8
Abstract
Self-supervised learning has recently attracted considerable attention in the NLP community for its ability to learn discriminative features using a contrastive objective. This paper investigates whether contrastive learning can be extended to Transfomer attention to tackling the Winograd Schema Challenge. To this end, we propose a novel self-supervised framework, leveraging a contrastive loss directly at the level of self-attention. Experimental analysis of our attention-based models on multiple datasets demonstrates superior commonsense reasoning capabilities. The proposed approach outperforms all comparable unsupervised approaches while occasionally surpassing supervised ones.