| Theoretical Limitations of Self-Attention in Neural Sequence Models | Jun 16, 2019 | Hard Attention | —Unverified | 0 |
| Transformers as Transducers | Apr 2, 2024 | Hard Attention, POS | —Unverified | 0 |
| Transformers in Uniform TC^0 | Sep 20, 2024 | Hard Attention | —Unverified | 0 |
| Unique Hard Attention: A Tale of Two Sides | Mar 18, 2025 | Hard Attention | —Unverified | 0 |
| Upper, Middle and Lower Region Learning for Facial Action Unit Detection | Feb 10, 2020 | Action Unit Detection, Facial Action Unit Detection | —Unverified | 0 |
| Video Violence Recognition and Localization Using a Semi-Supervised Hard Attention Model | Feb 4, 2022 | Activity Recognition, Hard Attention | —Unverified | 0 |
| Word Representation Models for Morphologically Rich Languages in Neural Machine Translation | Jun 14, 2016 | Hard Attention, Machine Translation | —Unverified | 0 |
| Hierarchical Memory Networks | May 24, 2016 | Hard Attention, Question Answering | —Unverified | 0 |
| Hierarchical Multi-scale Attention Networks for Action Recognition | Aug 25, 2017 | Action Recognition, Hard Attention | —Unverified | 0 |
| Improved Attention Models for Memory Augmented Neural Network Adaptive Controllers | Mar 19, 2020 | Hard Attention | —Unverified | 0 |