| Hard Attention Control By Mutual Information Maximization | Mar 10, 2021 | Hard AttentionPartially Observable Reinforcement Learning | —Unverified | 0 |
| You Only Need One Model for Open-domain Question Answering | Dec 14, 2021 | Hard AttentionNatural Questions | —Unverified | 0 |
| NoPE: The Counting Power of Transformers with No Positional Encodings | May 16, 2025 | Hard Attention | —Unverified | 0 |
| Achieving Explainability in a Visual Hard Attention Model through Content Prediction | Jan 1, 2021 | Hard Attentionimage-classification | —Unverified | 0 |
| A Differentiable Self-disambiguated Sense Embedding Model via Scaled Gumbel Softmax | Sep 27, 2018 | Hard AttentionSentence | —Unverified | 0 |
| AMR Parsing with Action-Pointer Transformer | Nov 24, 2020 | Abstract Meaning RepresentationAMR Parsing | —Unverified | 0 |
| An Exploration of Neural Sequence-to-Sequence Architectures for Automatic Post-Editing | Jun 13, 2017 | Automatic Post-EditingHard Attention | —Unverified | 0 |
| A study of latent monotonic attention variants | Mar 30, 2021 | Hard Attentionspeech-recognition | —Unverified | 0 |
| AttentionDrop: A Novel Regularization Method for Transformer Models | Apr 16, 2025 | Hard Attention | —Unverified | 0 |
| Average-Hard Attention Transformers are Constant-Depth Uniform Threshold Circuits | Aug 6, 2023 | Hard Attention | —Unverified | 0 |
| Characterizing the Expressivity of Transformer Language Models | May 29, 2025 | Hard Attention | —Unverified | 0 |
| CLAWS: Contrastive Learning with hard Attention and Weak Supervision | Dec 1, 2021 | Anomaly DetectionContrastive Learning | —Unverified | 0 |
| Comparison of different Unique hard attention transformer models by the formal languages they can recognize | Jun 3, 2025 | Hard AttentionSurvey | —Unverified | 0 |
| Continual Diffusion with STAMINA: STack-And-Mask INcremental Adapters | Nov 30, 2023 | Continual LearningHard Attention | —Unverified | 0 |
| DanHAR: Dual Attention Network For Multimodal Human Activity Recognition Using Wearable Sensors | Jun 25, 2020 | Activity RecognitionHard Attention | —Unverified | 0 |
| Look Harder: A Neural Machine Translation Model with Hard Attention | Jul 1, 2019 | Hard AttentionMachine Translation | —Unverified | 0 |
| Lower Bounds for Chain-of-Thought Reasoning in Hard-Attention Transformers | Feb 4, 2025 | Hard Attention | —Unverified | 0 |
| Masked Hard-Attention Transformers Recognize Exactly the Star-Free Languages | Oct 21, 2023 | Hard AttentionPosition | —Unverified | 0 |
| MESAHA-Net: Multi-Encoders based Self-Adaptive Hard Attention Network with Maximum Intensity Projections for Lung Nodule Segmentation in CT Scan | Apr 4, 2023 | Computed Tomography (CT)Decoder | —Unverified | 0 |
| Dual Attention Model with Reinforcement Learning for Classification of Histology Whole-Slide Images | Feb 19, 2023 | Hard Attentionwhole slide images | —Unverified | 0 |
| Multimodal Emergent Fake News Detection via Meta Neural Process Networks | Jun 22, 2021 | Fake News DetectionHard Attention | —Unverified | 0 |
| MultiResolution Attention Extractor for Small Object Detection | Jun 10, 2020 | Hard AttentionObject | —Unverified | 0 |
| Multi-View Unsupervised Image Generation with Cross Attention Guidance | Dec 7, 2023 | Hard AttentionImage Generation | —Unverified | 0 |
| Near-Optimal Glimpse Sequences for Improved Hard Attention Neural Network Training | Jun 13, 2019 | Experimental DesignGeneral Classification | —Unverified | 0 |
| Near-Optimal Glimpse Sequences for Training Hard Attention Neural Networks | Jan 1, 2021 | Experimental DesignGeneral Classification | —Unverified | 0 |