| BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding | Oct 11, 2018 | Citation Intent Classification, Common Sense Reasoning | Code Available | 3 |
| Scaling Instruction-Finetuned Language Models | Oct 20, 2022 | Coreference Resolution, Cross-Lingual Question Answering | Code Available | 3 |
| PAWS-X: A Cross-lingual Adversarial Dataset for Paraphrase Identification | Aug 30, 2019 | Paraphrase Identification, Sentence | Code Available | 2 |
| What Do Questions Exactly Ask? MFAE: Duplicate Question Identification with Multi-Fusion Asking Emphasis | May 7, 2020 | Community Question Answering, Natural Language Inference | Code Available | 1 |
| Self-Explaining Structures Improve NLP Models | Dec 3, 2020 | Natural Language Inference, Paraphrase Identification | Code Available | 1 |
| TSDAE: Using Transformer-based Sequential Denoising Auto-Encoder for Unsupervised Sentence Embedding Learning | Apr 14, 2021 | Denoising, Domain Adaptation | Code Available | 1 |
| NMTScore: A Multilingual Analysis of Translation-based Text Similarity Measures | Apr 28, 2022 | Data-to-Text Generation, Machine Translation | Code Available | 1 |
| Factorising Meaning and Form for Intent-Preserving Paraphrasing | May 31, 2021 | Decoder, Form | Code Available | 1 |
| RealFormer: Transformer Likes Residual Attention | Dec 21, 2020 | Language Modeling, Language Modelling | Code Available | 1 |
| Modelling Latent Translations for Cross-Lingual Transfer | Jul 23, 2021 | Cross-Lingual Transfer, Few-Shot Learning | Code Available | 1 |
| Charformer: Fast Character Transformers via Gradient-based Subword Tokenization | Jun 23, 2021 | Inductive Bias, Linguistic Acceptability | Code Available | 1 |
| An Empirical Study on Robustness to Spurious Correlations using Pre-trained Language Models | Jul 14, 2020 | Diversity, Multi-Task Learning | Code Available | 1 |
| SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization | Nov 8, 2019 | Linguistic Acceptability, Natural Language Inference | Code Available | 1 |
| Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations | Sep 27, 2021 | Contrastive Learning, Language Modelling | Code Available | 1 |
| BET: A Backtranslation Approach for Easy Data Augmentation in Transformer-based Paraphrase Identification Context | Sep 25, 2020 | Data Augmentation, MRPC | Code Available | 1 |
| data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language | Feb 7, 2022 | image-classification, Image Classification | Code Available | 1 |
| Do Multilingual Language Models Think Better in English? | Aug 2, 2023 | Common Sense Reasoning, Cross-Lingual Natural Language Inference | Code Available | 1 |
| Entailment as Few-Shot Learner | Apr 29, 2021 | Contrastive Learning, Data Augmentation | Code Available | 1 |
| Improving Paraphrase Detection with the Adversarial Paraphrasing Task | Jun 14, 2021 | Dataset Generation, Paraphrase Identification | Code Available | 1 |
| Improving word mover's distance by leveraging self-attention matrix | Nov 11, 2022 | Paraphrase Identification, Semantic Similarity | Code Available | 1 |
| Adversarial Semantic Collisions | Nov 9, 2020 | Extractive Summarization, Paraphrase Identification | Code Available | 1 |
| Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning | Dec 22, 2020 | Generalization Bounds, Language Modeling | Code Available | 1 |
| FNet: Mixing Tokens with Fourier Transforms | May 9, 2021 | Linguistic Acceptability, Machine Translation | Code Available | 1 |
| PARADE: A New Dataset for Paraphrase Identification Requiring Computer Science Domain Knowledge | Oct 8, 2020 | Paraphrase Identification | Code Available | 1 |
| XLNet: Generalized Autoregressive Pretraining for Language Understanding | Jun 19, 2019 | Audio Question Answering, Chinese Reading Comprehension | Code Available | 1 |