| MSA Transformer | Feb 13, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SJ_AJ@DravidianLangTech-EACL2021: Task-Adaptive Pre-Training of Multilingual BERT models for Offensive Language Identification | Feb 1, 2021 | Language IdentificationLanguage Modeling | CodeCode Available | 0 |
| MERMAID: Metaphor Generation with Symbolism and Discriminative Decoding | Jan 23, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CDLM: Cross-Document Language Modeling | Jan 2, 2021 | Citation RecommendationCoreference Resolution | CodeCode Available | 1 |
| Universal Sentence Representations Learning with Conditional Masked Language Model | Jan 1, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AraELECTRA: Pre-Training Text Discriminators for Arabic Language Understanding | Dec 31, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Universal Sentence Representation Learning with Conditional Masked Language Model | Dec 28, 2020 | Language ModelingLanguage Modelling | —Unverified | 0 |
| RealFormer: Transformer Likes Residual Attention | Dec 21, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| TAP: Text-Aware Pre-training for Text-VQA and Text-Caption | Dec 8, 2020 | Caption GenerationLanguage Modeling | CodeCode Available | 1 |
| Pre-training Protein Language Models with Label-Agnostic Binding Pairs Enhances Performance in Downstream Tasks | Dec 5, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| XHate-999: Analyzing and Detecting Abusive Language Across Domains and Languages | Dec 1, 2020 | Abusive LanguageDisentanglement | —Unverified | 0 |
| StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling | Dec 1, 2020 | Constituency ParsingDependency Parsing | CodeCode Available | 1 |
| Profile Prediction: An Alignment-Based Pre-Training Task for Protein Sequence Models | Dec 1, 2020 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Self-Supervised Relationship Probing | Dec 1, 2020 | Contrastive LearningLanguage Modeling | —Unverified | 0 |
| Self-Supervised learning with cross-modal transformers for emotion recognition | Nov 20, 2020 | Emotion RecognitionLanguage Modeling | —Unverified | 0 |
| A Hierarchical Multi-Modal Encoder for Moment Localization in Video Corpus | Nov 18, 2020 | Language ModelingLanguage Modelling | —Unverified | 0 |
| POSTECH-ETRI’s Submission to the WMT2020 APE Shared Task: Automatic Post-Editing with Cross-lingual Language Model | Nov 1, 2020 | Automatic Post-EditingLanguage Modeling | —Unverified | 0 |
| Controlling the Imprint of Passivization and Negation in Contextualized Representations | Nov 1, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Effective Decoder Masking for Transformer Based End-to-End Speech Recognition | Oct 27, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| DICT-MLM: Improved Multilingual Pre-Training using Bilingual Dictionaries | Oct 23, 2020 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ST-BERT: Cross-modal Language Model Pre-training For End-to-end Spoken Language Understanding | Oct 23, 2020 | cross-modal alignmentLanguage Modeling | —Unverified | 0 |
| ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding | Oct 23, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Cold-start Active Learning through Self-supervised Language Modeling | Oct 19, 2020 | Active LearningClassification | CodeCode Available | 1 |
| Corruption Is Not All Bad: Incorporating Discourse Structure into Pre-training via Corruption for Essay Scoring | Oct 13, 2020 | AllAutomated Essay Scoring | —Unverified | 0 |
| Cross-Thought for Sentence Encoder Pre-training | Oct 7, 2020 | Information RetrievalLanguage Modeling | CodeCode Available | 1 |