| Train No Evil: Selective Masking for Task-Guided Pre-Training | Apr 21, 2020 | Language Modeling | Code Available | 1 |
| Adaptive Attention Span in Computer Vision | Apr 18, 2020 | Language Modeling | Code Available | 1 |
| Fast and Accurate Deep Bidirectional Language Representations for Unsupervised Learning | Apr 17, 2020 | CPU, Language Modeling | Code Available | 1 |
| Transform and Tell: Entity-Aware News Image Captioning | Apr 17, 2020 | Articles, Image Captioning | Code Available | 1 |
| SPECTER: Document-level Representation Learning using Citation-informed Transformers | Apr 15, 2020 | Citation Prediction, Document Classification | Code Available | 1 |
| TOD-BERT: Pre-trained Natural Language Understanding for Task-Oriented Dialogue | Apr 15, 2020 | Dialogue State Tracking, Intent Detection | Code Available | 1 |
| PALM: Pre-training an Autoencoding&Autoregressive Language Model for Context-conditioned Generation | Apr 14, 2020 | Abstractive Text Summarization, Conversational Response Generation | Code Available | 1 |
| AMR Parsing via Graph-Sequence Iterative Inference | Apr 12, 2020 | AMR Parsing, Language Modeling | Code Available | 1 |
| Unsupervised Commonsense Question Answering with Self-Talk | Apr 11, 2020 | Language Modeling | Code Available | 1 |
| Injecting Numerical Reasoning Skills into Language Models | Apr 9, 2020 | Data Augmentation, Decoder | Code Available | 1 |
| Exploring Versatile Generative Language Model Via Parameter-Efficient Transfer Learning | Apr 8, 2020 | Language Modeling | Code Available | 1 |
| Have Your Text and Use It Too! End-to-End Neural Data-to-Text Generation with Semantic Fidelity | Apr 8, 2020 | AMR-to-Text Generation, Data-to-Text Generation | Code Available | 1 |
| Downstream Model Design of Pre-trained Language Model for Relation Extraction Task | Apr 8, 2020 | Language Modeling | Code Available | 1 |
| Byte Pair Encoding is Suboptimal for Language Model Pretraining | Apr 7, 2020 | Language Modeling | Code Available | 1 |
| Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-based Question Answering | Apr 7, 2020 | Language Modeling | Code Available | 1 |
| SelfORE: Self-supervised Relational Feature Learning for Open Relation Extraction | Apr 6, 2020 | Clustering, Language Modeling | Code Available | 1 |
| Sparse Text Generation | Apr 6, 2020 | Dialogue Generation, Diversity | Code Available | 1 |
| Optimus: Organizing Sentences via Pre-trained Modeling of a Latent Space | Apr 5, 2020 | Language Modeling | Code Available | 1 |
| MemCap: Memorizing Style Knowledge for Image Captioning | Apr 3, 2020 | Image Captioning, Language Modeling | Code Available | 1 |
| Felix: Flexible Text Editing Through Tagging and Insertion | Mar 24, 2020 | Automatic Post-Editing, Language Modeling | Code Available | 1 |
| ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators | Mar 23, 2020 | GPU, Language Modeling | Code Available | 1 |
| Beheshti-NER: Persian Named Entity Recognition Using BERT | Mar 19, 2020 | Language Modeling | Code Available | 1 |
| Efficient Content-Based Sparse Attention with Routing Transformers | Mar 12, 2020 | Image Generation, Language Modeling | Code Available | 1 |
| ReZero is All You Need: Fast Convergence at Large Depth | Mar 10, 2020 | All, Language Modeling | Code Available | 1 |
| ProGen: Language Modeling for Protein Generation | Mar 8, 2020 | Diversity, Language Modeling | Code Available | 1 |