| Scaling Instruction-Finetuned Language Models | Oct 20, 2022 | Coreference Resolution, Cross-Lingual Question Answering | Code Available | 3 | 5 |
| BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding | Oct 11, 2018 | Citation Intent Classification, Common Sense Reasoning | Code Available | 3 | 5 |
| PAWS-X: A Cross-lingual Adversarial Dataset for Paraphrase Identification | Aug 30, 2019 | Paraphrase Identification, Sentence | Code Available | 2 | 5 |
| BET: A Backtranslation Approach for Easy Data Augmentation in Transformer-based Paraphrase Identification Context | Sep 25, 2020 | Data Augmentation, MRPC | Code Available | 1 | 5 |
| Charformer: Fast Character Transformers via Gradient-based Subword Tokenization | Jun 23, 2021 | Inductive Bias, Linguistic Acceptability | Code Available | 1 | 5 |
| RealFormer: Transformer Likes Residual Attention | Dec 21, 2020 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| TSDAE: Using Transformer-based Sequential Denoising Auto-Encoder for Unsupervised Sentence Embedding Learning | Apr 14, 2021 | Denoising, Domain Adaptation | Code Available | 1 | 5 |
| XLNet: Generalized Autoregressive Pretraining for Language Understanding | Jun 19, 2019 | Audio Question Answering, Chinese Reading Comprehension | Code Available | 1 | 5 |
| Improving Paraphrase Detection with the Adversarial Paraphrasing Task | Jun 14, 2021 | Dataset Generation, Paraphrase Identification | Code Available | 1 | 5 |
| FNet: Mixing Tokens with Fourier Transforms | May 9, 2021 | Linguistic Acceptability, Machine Translation | Code Available | 1 | 5 |
| SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization | Nov 8, 2019 | Linguistic Acceptability, Natural Language Inference | Code Available | 1 | 5 |
| An Empirical Study on Robustness to Spurious Correlations using Pre-trained Language Models | Jul 14, 2020 | Diversity, Multi-Task Learning | Code Available | 1 | 5 |
| Modelling Latent Translations for Cross-Lingual Transfer | Jul 23, 2021 | Cross-Lingual Transfer, Few-Shot Learning | Code Available | 1 | 5 |
| PARADE: A New Dataset for Paraphrase Identification Requiring Computer Science Domain Knowledge | Oct 8, 2020 | Paraphrase Identification | Code Available | 1 | 5 |
| Do Multilingual Language Models Think Better in English? | Aug 2, 2023 | Common Sense Reasoning, Cross-Lingual Natural Language Inference | Code Available | 1 | 5 |
| NMTScore: A Multilingual Analysis of Translation-based Text Similarity Measures | Apr 28, 2022 | Data-to-Text Generation, Machine Translation | Code Available | 1 | 5 |
| Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations | Sep 27, 2021 | Contrastive Learning, Language Modelling | Code Available | 1 | 5 |
| What Do Questions Exactly Ask? MFAE: Duplicate Question Identification with Multi-Fusion Asking Emphasis | May 7, 2020 | Community Question Answering, Natural Language Inference | Code Available | 1 | 5 |
| data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language | Feb 7, 2022 | image-classification, Image Classification | Code Available | 1 | 5 |
| Adversarial Semantic Collisions | Nov 9, 2020 | Extractive Summarization, Paraphrase Identification | Code Available | 1 | 5 |
| Factorising Meaning and Form for Intent-Preserving Paraphrasing | May 31, 2021 | Decoder, Form | Code Available | 1 | 5 |
| Improving word mover's distance by leveraging self-attention matrix | Nov 11, 2022 | Paraphrase Identification, Semantic Similarity | Code Available | 1 | 5 |
| Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning | Dec 22, 2020 | Generalization Bounds, Language Modeling | Code Available | 1 | 5 |
| Self-Explaining Structures Improve NLP Models | Dec 3, 2020 | Natural Language Inference, Paraphrase Identification | Code Available | 1 | 5 |
| Entailment as Few-Shot Learner | Apr 29, 2021 | Contrastive Learning, Data Augmentation | Code Available | 1 | 5 |
| Modelling Sentence Pairs with Tree-structured Attentive Encoder | Oct 10, 2016 | Paraphrase Identification, Question Selection | Code Available | 0 | 5 |
| Bilateral Multi-Perspective Matching for Natural Language Sentences | Feb 13, 2017 | Natural Language Inference, Paraphrase Identification | Code Available | 0 | 5 |
| Adaptation of Deep Bidirectional Multilingual Transformers for Russian Language | May 17, 2019 | Natural Language Inference, Paraphrase Identification | Code Available | 0 | 5 |
| Memory-efficient Stochastic methods for Memory-based Transformers | Nov 14, 2023 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Learning to Represent Bilingual Dictionaries | Aug 10, 2018 | Multi-Task Learning, Paraphrase Identification | Code Available | 0 | 5 |
| Multi-Task Deep Neural Networks for Natural Language Understanding | Jan 31, 2019 | Domain Adaptation, Language Modeling | Code Available | 0 | 5 |
| Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation | Mar 27, 2024 | Domain Adaptation, Knowledge Distillation | Code Available | 0 | 5 |
| Dice Loss for Data-imbalanced NLP Tasks | Nov 7, 2019 | Chinese Named Entity Recognition, Machine Reading Comprehension | Code Available | 0 | 5 |
| Is Prompt-Based Finetuning Always Better than Vanilla Finetuning? Insights from Cross-Lingual Language Understanding | Jul 15, 2023 | Cross-Lingual Transfer, Natural Language Inference | Code Available | 0 | 5 |
| Learning General Purpose Distributed Sentence Representations via Large Scale Multi-task Learning | Mar 30, 2018 | Multi-Task Learning, Natural Language Inference | Code Available | 0 | 5 |
| Multiway Attention Networks for Modeling Sentence Pairs | Jul 1, 2018 | Natural Language Inference, Paraphrase Identification | Code Available | 0 | 5 |
| Match-Prompt: Improving Multi-task Generalization Ability for Neural Text Matching via Prompt Learning | Apr 6, 2022 | Information Retrieval, Paraphrase Identification | Code Available | 0 | 5 |
| GAPX: Generalized Autoregressive Paraphrase-Identification X | Oct 5, 2022 | Paraphrase Identification | Code Available | 0 | 5 |
| Idiom Paraphrases: Seventh Heaven vs Cloud Nine | Sep 1, 2015 | Natural Language Inference, Paraphrase Identification | Code Available | 0 | 5 |
| Cross-functional Analysis of Generalisation in Behavioural Learning | May 22, 2023 | Paraphrase Identification, Reading Comprehension | Code Available | 0 | 5 |
| A Study of MatchPyramid Models on Ad-hoc Retrieval | Jun 15, 2016 | Machine Translation, Paraphrase Identification | Code Available | 0 | 5 |
| Convolutional Neural Network for Paraphrase Identification | May 1, 2015 | ARC, Binary Classification | Code Available | 0 | 5 |
| Assessing Word Importance Using Models Trained for Semantic Tasks | May 31, 2023 | Natural Language Inference, Paraphrase Identification | Code Available | 0 | 5 |
| Adversarial Self-Attention for Language Understanding | Jun 25, 2022 | Machine Reading Comprehension, Named Entity Recognition (NER) | Code Available | 0 | 5 |
| ERNIE: Enhanced Language Representation with Informative Entities | May 17, 2019 | Entity Linking, Entity Typing | Code Available | 0 | 5 |
| TinyBERT: Distilling BERT for Natural Language Understanding | Sep 23, 2019 | Knowledge Distillation, Language Modelling | Code Available | 0 | 5 |
| ETPC - A Paraphrase Identification Corpus Annotated with Extended Paraphrase Typology and Negation | May 1, 2018 | Natural Language Inference, Negation | Code Available | 0 | 5 |
| ABCNN: Attention-Based Convolutional Neural Network for Modeling Sentence Pairs | Dec 16, 2015 | Answer Selection, Natural Language Inference | Code Available | 0 | 5 |
| Balanced Adversarial Training: Balancing Tradeoffs between Fickleness and Obstinacy in NLP Models | Oct 20, 2022 | Contrastive Learning, Natural Language Inference | Code Available | 0 | 5 |
| Sentence Embeddings for Russian NLU | Oct 29, 2019 | Multiple-choice, Paraphrase Identification | Code Available | 0 | 5 |