| The Curious Case of Neural Text Degeneration | Apr 22, 2019 | DiversityLanguage Modeling | CodeCode Available | 1 |
| Mask-Predict: Parallel Decoding of Conditional Masked Language Models | Apr 19, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition | Apr 18, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| fairseq: A Fast, Extensible Toolkit for Sequence Modeling | Apr 1, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SciBERT: A Pretrained Language Model for Scientific Text | Mar 26, 2019 | Citation Intent ClassificationDependency Parsing | CodeCode Available | 1 |
| A Fully Differentiable Beam Search Decoder | Feb 16, 2019 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Language Models are Unsupervised Multitask Learners | Feb 14, 2019 | Common Sense ReasoningCoreference Resolution | CodeCode Available | 1 |
| Pay Less Attention with Lightweight and Dynamic Convolutions | Jan 29, 2019 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 1 |
| Passage Re-ranking with BERT | Jan 13, 2019 | Language ModelingPassage Re-Ranking | CodeCode Available | 1 |
| Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context | Jan 9, 2019 | ArticlesLanguage Modeling | CodeCode Available | 1 |