| Table Retrieval May Not Necessitate Table-specific Model Design | May 19, 2022 | Hard AttentionNatural Questions | CodeCode Available | 1 |
| FANet: A Feedback Attention Network for Improved Biomedical Image Segmentation | Mar 31, 2021 | Hard AttentionImage Segmentation | CodeCode Available | 1 |
| Learning Texture Transformer Network for Image Super-Resolution | Jun 7, 2020 | Hard AttentionImage Generation | CodeCode Available | 1 |
| Exact Hard Monotonic Attention for Character-Level Transduction | May 15, 2019 | Hard AttentionInductive Bias | CodeCode Available | 1 |
| Hard Non-Monotonic Attention for Character-Level Transduction | Aug 29, 2018 | Hard AttentionImage Captioning | CodeCode Available | 1 |
| Hard-Attention for Scalable Image Classification | Feb 20, 2021 | ClassificationDeep Attention | CodeCode Available | 1 |
| Hard-Attention Gates with Gradient Routing for Endoscopic Image Computing | Jul 5, 2024 | Binary Classificationfeature selection | CodeCode Available | 1 |
| Investigation of Architectures and Receptive Fields for Appearance-based Gaze Estimation | Aug 18, 2023 | Contrastive LearningDisentanglement | CodeCode Available | 1 |
| Self-Attention Networks Can Process Bounded Hierarchical Languages | May 24, 2021 | Hard Attention | CodeCode Available | 1 |
| A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization | Jan 3, 2021 | Action LocalizationHard Attention | CodeCode Available | 1 |
| Recurrent Models of Visual Attention | Jun 24, 2014 | Hard Attentionimage-classification | CodeCode Available | 1 |
| Mutual Distillation Learning For Person Re-Identification | Jan 12, 2024 | Hard AttentionPerson Re-Identification | CodeCode Available | 1 |
| AMR Parsing with Action-Pointer Transformer | Apr 29, 2021 | Abstract Meaning RepresentationAMR Parsing | CodeCode Available | 1 |
| Coherent Concept-based Explanations in Medical Image and Its Application to Skin Lesion Diagnosis | Apr 10, 2023 | DiagnosticHard Attention | CodeCode Available | 1 |
| NoPE: The Counting Power of Transformers with No Positional Encodings | May 16, 2025 | Hard Attention | —Unverified | 0 |
| Language-Guided Reinforcement Learning for Hard Attention in Few-Shot Learning | Oct 11, 2023 | Deep Reinforcement LearningFew-Shot Learning | —Unverified | 0 |
| Average-Hard Attention Transformers are Constant-Depth Uniform Threshold Circuits | Aug 6, 2023 | Hard Attention | —Unverified | 0 |
| Deep Pneumonia: Attention-Based Contrastive Learning for Class-Imbalanced Pneumonia Lesion Recognition in Chest X-rays | Jul 23, 2022 | Contrastive LearningHard Attention | —Unverified | 0 |
| AttentionDrop: A Novel Regularization Method for Transformer Models | Apr 16, 2025 | Hard Attention | —Unverified | 0 |
| Extractive Adversarial Networks: High-Recall Explanations for Identifying Personal Attacks in Social Media Posts | Sep 1, 2018 | Hard Attention | —Unverified | 0 |
| A study of latent monotonic attention variants | Mar 30, 2021 | Hard Attentionspeech-recognition | —Unverified | 0 |
| DanHAR: Dual Attention Network For Multimodal Human Activity Recognition Using Wearable Sensors | Jun 25, 2020 | Activity RecognitionHard Attention | —Unverified | 0 |
| Ehrenfeucht-Haussler Rank and Chain of Thought | Jan 22, 2025 | DecoderHard Attention | —Unverified | 0 |
| Continual Diffusion with STAMINA: STack-And-Mask INcremental Adapters | Nov 30, 2023 | Continual LearningHard Attention | —Unverified | 0 |
| A Differentiable Self-disambiguated Sense Embedding Model via Scaled Gumbel Softmax | Sep 27, 2018 | Hard AttentionSentence | —Unverified | 0 |