| AMR Parsing with Action-Pointer Transformer | Apr 29, 2021 | Abstract Meaning RepresentationAMR Parsing | CodeCode Available | 1 |
| Coherent Concept-based Explanations in Medical Image and Its Application to Skin Lesion Diagnosis | Apr 10, 2023 | DiagnosticHard Attention | CodeCode Available | 1 |
| Table Retrieval May Not Necessitate Table-specific Model Design | May 19, 2022 | Hard AttentionNatural Questions | CodeCode Available | 1 |
| Exact Hard Monotonic Attention for Character-Level Transduction | May 15, 2019 | Hard AttentionInductive Bias | CodeCode Available | 1 |
| Learning Texture Transformer Network for Image Super-Resolution | Jun 7, 2020 | Hard AttentionImage Generation | CodeCode Available | 1 |
| A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization | Jan 3, 2021 | Action LocalizationHard Attention | CodeCode Available | 1 |
| FANet: A Feedback Attention Network for Improved Biomedical Image Segmentation | Mar 31, 2021 | Hard AttentionImage Segmentation | CodeCode Available | 1 |
| Recurrent Models of Visual Attention | Jun 24, 2014 | Hard Attentionimage-classification | CodeCode Available | 1 |
| Investigation of Architectures and Receptive Fields for Appearance-based Gaze Estimation | Aug 18, 2023 | Contrastive LearningDisentanglement | CodeCode Available | 1 |
| Mutual Distillation Learning For Person Re-Identification | Jan 12, 2024 | Hard AttentionPerson Re-Identification | CodeCode Available | 1 |
| Hard Non-Monotonic Attention for Character-Level Transduction | Aug 29, 2018 | Hard AttentionImage Captioning | CodeCode Available | 1 |
| Hard-Attention Gates with Gradient Routing for Endoscopic Image Computing | Jul 5, 2024 | Binary Classificationfeature selection | CodeCode Available | 1 |
| Self-Attention Networks Can Process Bounded Hierarchical Languages | May 24, 2021 | Hard Attention | CodeCode Available | 1 |
| Hard-Attention for Scalable Image Classification | Feb 20, 2021 | ClassificationDeep Attention | CodeCode Available | 1 |
| Deep Pneumonia: Attention-Based Contrastive Learning for Class-Imbalanced Pneumonia Lesion Recognition in Chest X-rays | Jul 23, 2022 | Contrastive LearningHard Attention | —Unverified | 0 |
| Effect of choice of probability distribution, randomness, and search methods for alignment modeling in sequence-to-sequence text-to-speech synthesis using hard alignment | Oct 28, 2019 | Hard AttentionSpeech Synthesis | —Unverified | 0 |
| Ehrenfeucht-Haussler Rank and Chain of Thought | Jan 22, 2025 | DecoderHard Attention | —Unverified | 0 |
| Emergence of Fixational and Saccadic Movements in a Multi-Level Recurrent Attention Model for Vision | May 19, 2025 | Hard Attentionimage-classification | —Unverified | 0 |
| Exact Expressive Power of Transformers with Padding | May 25, 2025 | Hard Attention | —Unverified | 0 |
| Language-Guided Reinforcement Learning for Hard Attention in Few-Shot Learning | Oct 11, 2023 | Deep Reinforcement LearningFew-Shot Learning | —Unverified | 0 |
| Extractive Adversarial Networks: High-Recall Explanations for Identifying Personal Attacks in Social Media Posts | Sep 1, 2018 | Hard Attention | —Unverified | 0 |
| Formal Language Recognition by Hard Attention Transformers: Perspectives from Circuit Complexity | Apr 13, 2022 | Hard Attention | —Unverified | 0 |
| Generative Adversarial Networks Based on Collaborative Learning and Attention Mechanism for Hyperspectral Image Classification | Apr 3, 2020 | Few-Shot Image ClassificationGenerative Adversarial Network | —Unverified | 0 |
| GQHAN: A Grover-inspired Quantum Hard Attention Network | Jan 25, 2024 | Binary ClassificationHard Attention | —Unverified | 0 |
| Graph Decoupling Attention Markov Networks for Semi-supervised Graph Node Classification | Apr 28, 2021 | General ClassificationGraph Learning | —Unverified | 0 |
| Hard Attention Control By Mutual Information Maximization | Mar 10, 2021 | Hard AttentionPartially Observable Reinforcement Learning | —Unverified | 0 |
| You Only Need One Model for Open-domain Question Answering | Dec 14, 2021 | Hard AttentionNatural Questions | —Unverified | 0 |
| NoPE: The Counting Power of Transformers with No Positional Encodings | May 16, 2025 | Hard Attention | —Unverified | 0 |
| Achieving Explainability in a Visual Hard Attention Model through Content Prediction | Jan 1, 2021 | Hard Attentionimage-classification | —Unverified | 0 |
| A Differentiable Self-disambiguated Sense Embedding Model via Scaled Gumbel Softmax | Sep 27, 2018 | Hard AttentionSentence | —Unverified | 0 |
| AMR Parsing with Action-Pointer Transformer | Nov 24, 2020 | Abstract Meaning RepresentationAMR Parsing | —Unverified | 0 |
| An Exploration of Neural Sequence-to-Sequence Architectures for Automatic Post-Editing | Jun 13, 2017 | Automatic Post-EditingHard Attention | —Unverified | 0 |
| A study of latent monotonic attention variants | Mar 30, 2021 | Hard Attentionspeech-recognition | —Unverified | 0 |
| AttentionDrop: A Novel Regularization Method for Transformer Models | Apr 16, 2025 | Hard Attention | —Unverified | 0 |
| Average-Hard Attention Transformers are Constant-Depth Uniform Threshold Circuits | Aug 6, 2023 | Hard Attention | —Unverified | 0 |
| Characterizing the Expressivity of Transformer Language Models | May 29, 2025 | Hard Attention | —Unverified | 0 |
| CLAWS: Contrastive Learning with hard Attention and Weak Supervision | Dec 1, 2021 | Anomaly DetectionContrastive Learning | —Unverified | 0 |
| Comparison of different Unique hard attention transformer models by the formal languages they can recognize | Jun 3, 2025 | Hard AttentionSurvey | —Unverified | 0 |
| Continual Diffusion with STAMINA: STack-And-Mask INcremental Adapters | Nov 30, 2023 | Continual LearningHard Attention | —Unverified | 0 |
| DanHAR: Dual Attention Network For Multimodal Human Activity Recognition Using Wearable Sensors | Jun 25, 2020 | Activity RecognitionHard Attention | —Unverified | 0 |
| Look Harder: A Neural Machine Translation Model with Hard Attention | Jul 1, 2019 | Hard AttentionMachine Translation | —Unverified | 0 |
| Lower Bounds for Chain-of-Thought Reasoning in Hard-Attention Transformers | Feb 4, 2025 | Hard Attention | —Unverified | 0 |
| Masked Hard-Attention Transformers Recognize Exactly the Star-Free Languages | Oct 21, 2023 | Hard AttentionPosition | —Unverified | 0 |
| MESAHA-Net: Multi-Encoders based Self-Adaptive Hard Attention Network with Maximum Intensity Projections for Lung Nodule Segmentation in CT Scan | Apr 4, 2023 | Computed Tomography (CT)Decoder | —Unverified | 0 |
| Dual Attention Model with Reinforcement Learning for Classification of Histology Whole-Slide Images | Feb 19, 2023 | Hard Attentionwhole slide images | —Unverified | 0 |
| Multimodal Emergent Fake News Detection via Meta Neural Process Networks | Jun 22, 2021 | Fake News DetectionHard Attention | —Unverified | 0 |
| MultiResolution Attention Extractor for Small Object Detection | Jun 10, 2020 | Hard AttentionObject | —Unverified | 0 |
| Multi-View Unsupervised Image Generation with Cross Attention Guidance | Dec 7, 2023 | Hard AttentionImage Generation | —Unverified | 0 |
| Near-Optimal Glimpse Sequences for Improved Hard Attention Neural Network Training | Jun 13, 2019 | Experimental DesignGeneral Classification | —Unverified | 0 |
| Near-Optimal Glimpse Sequences for Training Hard Attention Neural Networks | Jan 1, 2021 | Experimental DesignGeneral Classification | —Unverified | 0 |