| CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding | May 23, 2021 | document understanding, Domain Adaptation | Code Available | 1 |
| Scatterbrain: Unifying Sparse and Low-rank Attention | May 21, 2021 | Image Generation, Language Modeling | Code Available | 1 |
| Effective Attention Sheds Light On Interpretability | May 18, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| Stage-wise Fine-tuning for Graph-to-Text Generation | May 17, 2021 | Data-to-Text Generation, KB-to-Language Generation | Code Available | 1 |
| RetGen: A Joint framework for Retrieval and Grounded Text Generation Modeling | May 14, 2021 | Dialogue Generation, Language Modeling | Code Available | 1 |
| Not All Memories are Created Equal: Learning to Forget by Expiring | May 13, 2021 | All, Language Modeling | Code Available | 1 |
| MATE-KD: Masked Adversarial TExt, a Companion to Knowledge Distillation | May 12, 2021 | Adversarial Text, Data Augmentation | Code Available | 1 |
| BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language Models Identify Analogies? | May 11, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| Lawformer: A Pre-trained Language Model for Chinese Legal Long Documents | May 9, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| DocSCAN: Unsupervised Text Classification via Learning from Neighbors | May 9, 2021 | Classification, Clustering | Code Available | 1 |
| Understanding by Understanding Not: Modeling Negation in Language Models | May 7, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| DExperts: Decoding-Time Controlled Text Generation with Experts and Anti-Experts | May 7, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| Handwritten Mathematical Expression Recognition with Bidirectionally Trained Transformer | May 6, 2021 | Data Augmentation, Decoder | Code Available | 1 |
| When to Fold'em: How to answer Unanswerable questions | May 1, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| Learning Passage Impacts for Inverted Indexes | Apr 24, 2021 | Information Retrieval, Language Modeling | Code Available | 1 |
| Improving Biomedical Pretrained Language Models with Knowledge | Apr 21, 2021 | Entity Linking, Language Modeling | Code Available | 1 |
| Should we Stop Training More Monolingual Models, and Simply Use Machine Translation Instead? | Apr 21, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| Frustratingly Easy Edit-based Linguistic Steganography with a Masked Language Model | Apr 20, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| Differentiable Model Compression via Pseudo Quantization Noise | Apr 20, 2021 | Audio Source Separation, image-classification | Code Available | 1 |
| Operationalizing a National Digital Library: The Case for a Norwegian Transformer Model | Apr 19, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| ELECTRAMed: a new pre-trained language representation model for biomedical NLP | Apr 19, 2021 | Drug–drug Interaction Extraction, Language Modeling | Code Available | 1 |
| SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations | Apr 18, 2021 | Diversity, Language Modeling | Code Available | 1 |
| Text2App: A Framework for Creating Android Apps from Text Descriptions | Apr 16, 2021 | Code Generation, Language Modeling | Code Available | 1 |
| Probing Across Time: What Does RoBERTa Know and When? | Apr 16, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| Time-Stamped Language Model: Teaching Language Models to Understand the Flow of Events | Apr 15, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| How to Train BERT with an Academic Budget | Apr 15, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| KnowPrompt: Knowledge-aware Prompt-tuning with Synergistic Optimization for Relation Extraction | Apr 15, 2021 | Dialog Relation Extraction, Language Modeling | Code Available | 1 |
| TSDAE: Using Transformer-based Sequential Denoising Auto-Encoder for Unsupervised Sentence Embedding Learning | Apr 14, 2021 | Denoising, Domain Adaptation | Code Available | 1 |
| K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce | Apr 14, 2021 | Decoder, Knowledge Base Completion | Code Available | 1 |
| Learning How to Ask: Querying LMs with Mixtures of Soft Prompts | Apr 14, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| Paragraph-level Simplification of Medical Texts | Apr 12, 2021 | Decoder, Language Modeling | Code Available | 1 |
| On the Inductive Bias of Masked Language Modeling: From Statistical to Syntactic Dependencies | Apr 12, 2021 | Inductive Bias, Language Modeling | Code Available | 1 |
| Revisiting Simple Neural Probabilistic Language Models | Apr 8, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| AlephBERT:A Hebrew Large Pre-Trained Language Model to Start-off your Hebrew NLP Application With | Apr 8, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| Librispeech Transducer Model with Internal Language Model Prior Correction | Apr 7, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| MMBERT: Multimodal BERT Pretraining for Improved Medical VQA | Apr 3, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| NewsMTSC: A Dataset for (Multi-)Target-dependent Sentiment Classification in Political News Articles | Apr 1, 2021 | Articles, Decision Making | Code Available | 1 |
| Controllable Generation from Pre-trained Language Models via Inverse Prompting | Mar 19, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Translation | Mar 18, 2021 | Bilingual Lexicon Induction, Language Modeling | Code Available | 1 |
| Structure Inducing Pre-Training | Mar 18, 2021 | Descriptive, Inductive Bias | Code Available | 1 |
| Inductive Relation Prediction by BERT | Mar 12, 2021 | Few-Shot Learning, Inductive Learning | Code Available | 1 |
| MERMAID: Metaphor Generation with Symbolism and Discriminative Decoding | Mar 11, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| The Interplay of Variant, Size, and Task Type in Arabic Pre-trained Language Models | Mar 11, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition | Mar 11, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| OAG-BERT: Towards A Unified Backbone Language Model For Academic Knowledge Services | Mar 3, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP | Feb 28, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| Chess as a Testbed for Language Model State Tracking | Feb 26, 2021 | Game of Chess, Language Modeling | Code Available | 1 |
| ZJUKLAB at SemEval-2021 Task 4: Negative Augmentation with Language Model for Reading Comprehension of Abstract Meaning | Feb 25, 2021 | Language Model Evaluation, Language Modeling | Code Available | 1 |
| Less is More: Pre-train a Strong Text Encoder for Dense Retrieval Using a Weak Decoder | Feb 18, 2021 | Decoder, Language Modeling | Code Available | 1 |
| End-to-end lyrics Recognition with Voice to Singing Style Transfer | Feb 17, 2021 | Data Augmentation, Language Modeling | Code Available | 1 |