| Train No Evil: Selective Masking for Task-Guided Pre-Training | Apr 21, 2020 | Language Modeling | Code Available | 1 |
| Adaptive Attention Span in Computer Vision | Apr 18, 2020 | Language Modeling | Code Available | 1 |
| Transform and Tell: Entity-Aware News Image Captioning | Apr 17, 2020 | Articles, Image Captioning | Code Available | 1 |
| Fast and Accurate Deep Bidirectional Language Representations for Unsupervised Learning | Apr 17, 2020 | CPU, Language Modeling | Code Available | 1 |
| SPECTER: Document-level Representation Learning using Citation-informed Transformers | Apr 15, 2020 | Citation Prediction, Document Classification | Code Available | 1 |
| TOD-BERT: Pre-trained Natural Language Understanding for Task-Oriented Dialogue | Apr 15, 2020 | Dialogue State Tracking, Intent Detection | Code Available | 1 |
| PALM: Pre-training an Autoencoding&Autoregressive Language Model for Context-conditioned Generation | Apr 14, 2020 | Abstractive Text Summarization, Conversational Response Generation | Code Available | 1 |
| AMR Parsing via Graph-Sequence Iterative Inference | Apr 12, 2020 | AMR Parsing, Language Modeling | Code Available | 1 |
| Unsupervised Commonsense Question Answering with Self-Talk | Apr 11, 2020 | Language Modeling | Code Available | 1 |
| Injecting Numerical Reasoning Skills into Language Models | Apr 9, 2020 | Data Augmentation, Decoder | Code Available | 1 |
| Have Your Text and Use It Too! End-to-End Neural Data-to-Text Generation with Semantic Fidelity | Apr 8, 2020 | AMR-to-Text Generation, Data-to-Text Generation | Code Available | 1 |
| Downstream Model Design of Pre-trained Language Model for Relation Extraction Task | Apr 8, 2020 | Language Modeling | Code Available | 1 |
| Exploring Versatile Generative Language Model Via Parameter-Efficient Transfer Learning | Apr 8, 2020 | Language Modeling | Code Available | 1 |
| Byte Pair Encoding is Suboptimal for Language Model Pretraining | Apr 7, 2020 | Language Modeling | Code Available | 1 |
| Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-based Question Answering | Apr 7, 2020 | Language Modeling | Code Available | 1 |
| Sparse Text Generation | Apr 6, 2020 | Dialogue Generation, Diversity | Code Available | 1 |
| SelfORE: Self-supervised Relational Feature Learning for Open Relation Extraction | Apr 6, 2020 | Clustering, Language Modeling | Code Available | 1 |
| Optimus: Organizing Sentences via Pre-trained Modeling of a Latent Space | Apr 5, 2020 | Language Modeling | Code Available | 1 |
| MemCap: Memorizing Style Knowledge for Image Captioning | Apr 3, 2020 | Image Captioning, Language Modeling | Code Available | 1 |
| Felix: Flexible Text Editing Through Tagging and Insertion | Mar 24, 2020 | Automatic Post-Editing, Language Modeling | Code Available | 1 |
| ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators | Mar 23, 2020 | GPU, Language Modeling | Code Available | 1 |
| Beheshti-NER: Persian Named Entity Recognition Using BERT | Mar 19, 2020 | Language Modeling | Code Available | 1 |
| Efficient Content-Based Sparse Attention with Routing Transformers | Mar 12, 2020 | Image Generation, Language Modeling | Code Available | 1 |
| ReZero is All You Need: Fast Convergence at Large Depth | Mar 10, 2020 | All, Language Modeling | Code Available | 1 |
| ProGen: Language Modeling for Protein Generation | Mar 8, 2020 | Diversity, Language Modeling | Code Available | 1 |
| RecipeGPT: Generative Pre-training Based Cooking Recipe Generation and Evaluation System | Mar 5, 2020 | Language Modeling | Code Available | 1 |
| Talking-Heads Attention | Mar 5, 2020 | Language Modeling | Code Available | 1 |
| Data Augmentation using Pre-trained Transformer Models | Mar 4, 2020 | Data Augmentation, Diversity | Code Available | 1 |
| Understanding Contexts Inside Robot and Human Manipulation Tasks through a Vision-Language Model and Ontology System in a Video Stream | Mar 2, 2020 | Language Modeling | Code Available | 1 |
| UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training | Feb 28, 2020 | Abstractive Text Summarization, Decoder | Code Available | 1 |
| Fill in the BLANC: Human-free quality estimation of document summaries | Feb 23, 2020 | Language Modeling | Code Available | 1 |
| Addressing Some Limitations of Transformers with Feedback Memory | Feb 21, 2020 | Language Modeling | Code Available | 1 |
| LAMBERT: Layout-Aware (Language) Modeling for information extraction | Feb 19, 2020 | Key Information Extraction, Language Modeling | Code Available | 1 |
| SentenceMIM: A Latent Variable Language Model | Feb 18, 2020 | Language Modeling | Code Available | 1 |
| UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation | Feb 15, 2020 | Action Segmentation, Decoder | Code Available | 1 |
| Transformer on a Diet | Feb 14, 2020 | Language Modeling | Code Available | 1 |
| How Much Knowledge Can You Pack Into the Parameters of a Language Model? | Feb 10, 2020 | Language Modeling | Code Available | 1 |
| REALM: Retrieval-Augmented Language Model Pre-Training | Feb 10, 2020 | Language Modeling | Code Available | 1 |
| Blank Language Models | Feb 8, 2020 | Ancient Text Restoration, Language Modeling | Code Available | 1 |
| Time-aware Large Kernel Convolutions | Feb 8, 2020 | Document Summarization, Language Modeling | Code Available | 1 |
| Parsing as Pretraining | Feb 5, 2020 | Dependency Parsing, Language Modeling | Code Available | 1 |
| Explaining Relationships Between Scientific Documents | Feb 2, 2020 | Language Modeling | Code Available | 1 |
| Adversarial Training for Aspect-Based Sentiment Analysis with BERT | Jan 30, 2020 | Aspect-Based Sentiment Analysis (ABSA) | Code Available | 1 |
| DUMA: Reading Comprehension with Transposition Thinking | Jan 26, 2020 | Language Modeling | Code Available | 1 |
| Scaling Laws for Neural Language Models | Jan 23, 2020 | Language Modeling | Code Available | 1 |
| Contextualized Embeddings in Named-Entity Recognition: An Empirical Study on Generalization | Jan 22, 2020 | Language Modeling | Code Available | 1 |
| A Simple Baseline to Semi-Supervised Domain Adaptation for Machine Translation | Jan 22, 2020 | Domain Adaptation, Language Modeling | Code Available | 1 |
| Exploiting Cloze Questions for Few Shot Text Classification and Natural Language Inference | Jan 21, 2020 | Few-Shot Text Classification, General Classification | Code Available | 1 |
| Domain-Aware Dialogue State Tracker for Multi-Domain Dialogue Systems | Jan 21, 2020 | Language Modeling | Code Available | 1 |
| RobBERT: a Dutch RoBERTa-based Language Model | Jan 17, 2020 | Fairness, Language Modeling | Code Available | 1 |