| Specialized Transformers: Faster, Smaller and more Accurate NLP Models | Sep 29, 2021 | Hard AttentionQuantization | —Unverified | 0 | 0 |
| Text as Environment: A Deep Reinforcement Learning Text Readability Assessment Model | Dec 12, 2019 | Deep Reinforcement LearningHard Attention | —Unverified | 0 | 0 |
| Theoretical Limitations of Self-Attention in Neural Sequence Models | Jun 16, 2019 | Hard Attention | —Unverified | 0 | 0 |
| Transformers as Transducers | Apr 2, 2024 | Hard AttentionPOS | —Unverified | 0 | 0 |
| Transformers in Uniform TC^0 | Sep 20, 2024 | Hard Attention | —Unverified | 0 | 0 |
| Unique Hard Attention: A Tale of Two Sides | Mar 18, 2025 | Hard Attention | —Unverified | 0 | 0 |
| Upper, Middle and Lower Region Learning for Facial Action Unit Detection | Feb 10, 2020 | Action Unit DetectionFacial Action Unit Detection | —Unverified | 0 | 0 |
| Video Violence Recognition and Localization Using a Semi-Supervised Hard Attention Model | Feb 4, 2022 | Activity RecognitionHard Attention | —Unverified | 0 | 0 |
| Word Representation Models for Morphologically Rich Languages in Neural Machine Translation | Jun 14, 2016 | Hard AttentionMachine Translation | —Unverified | 0 | 0 |
| You Only Need One Model for Open-domain Question Answering | Dec 14, 2021 | Hard AttentionNatural Questions | —Unverified | 0 | 0 |