| A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization | Jan 3, 2021 | Action Localization, Hard Attention | Code Available | 1 |
| Learning Texture Transformer Network for Image Super-Resolution | Jun 7, 2020 | Hard Attention, Image Generation | Code Available | 1 |
| AMR Parsing with Action-Pointer Transformer | Apr 29, 2021 | Abstract Meaning Representation, AMR Parsing | Code Available | 1 |
| Coherent Concept-based Explanations in Medical Image and Its Application to Skin Lesion Diagnosis | Apr 10, 2023 | Diagnostic, Hard Attention | Code Available | 1 |
| NoPE: The Counting Power of Transformers with No Positional Encodings | May 16, 2025 | Hard Attention | Unverified | 0 |
| Effect of choice of probability distribution, randomness, and search methods for alignment modeling in sequence-to-sequence text-to-speech synthesis using hard alignment | Oct 28, 2019 | Hard Attention, Speech Synthesis | Unverified | 0 |
| Average-Hard Attention Transformers are Constant-Depth Uniform Threshold Circuits | Aug 6, 2023 | Hard Attention | Unverified | 0 |
| AttentionDrop: A Novel Regularization Method for Transformer Models | Apr 16, 2025 | Hard Attention | Unverified | 0 |
| Ehrenfeucht-Haussler Rank and Chain of Thought | Jan 22, 2025 | Decoder, Hard Attention | Unverified | 0 |
| A study of latent monotonic attention variants | Mar 30, 2021 | Hard Attention, speech-recognition | Unverified | 0 |