| Neural Shuffle-Exchange Networks -- Sequence Processing in O(n log n) Time | Jul 18, 2019 | LAMBADALanguage Modeling | CodeCode Available | 0 | 5 |
| Neural Shuffle-Exchange Networks - Sequence Processing in O(n log n) Time | Dec 1, 2019 | LAMBADALanguage Modeling | CodeCode Available | 0 | 5 |
| Inconsistencies in Masked Language Models | Dec 30, 2022 | LAMBADAMMLU | CodeCode Available | 0 | 5 |
| Linguistic Knowledge as Memory for Recurrent Neural Networks | Mar 7, 2017 | LAMBADAReading Comprehension | —Unverified | 0 | 0 |
| Matryoshka Model Learning for Improved Elastic Student Models | May 29, 2025 | LAMBADAMath | —Unverified | 0 | 0 |
| E.T.: Entity-Transformers. Coreference augmented Neural Language Model for richer mention representations via Entity-Transformer blocks | Nov 10, 2020 | LAMBADALanguage Modeling | —Unverified | 0 | 0 |
| Neural Models for Reasoning over Multiple Mentions using Coreference | Apr 16, 2018 | LAMBADAReading Comprehension | —Unverified | 0 | 0 |
| Broad Context Language Modeling as Reading Comprehension | Oct 26, 2016 | coreference-resolutionCoreference Resolution | —Unverified | 0 | 0 |
| Attending to Entities for Better Text Understanding | Nov 11, 2019 | LAMBADA | —Unverified | 0 | 0 |
| Not Enough Data? Deep Learning to the Rescue! | Nov 8, 2019 | Data AugmentationDeep Learning | —Unverified | 0 | 0 |