| Comparison of different Unique hard attention transformer models by the formal languages they can recognize | Jun 3, 2025 | Hard Attention, Survey | Unverified | 0 |
| Characterizing the Expressivity of Transformer Language Models | May 29, 2025 | Hard Attention | Unverified | 0 |
| Exact Expressive Power of Transformers with Padding | May 25, 2025 | Hard Attention | Unverified | 0 |
| Emergence of Fixational and Saccadic Movements in a Multi-Level Recurrent Attention Model for Vision | May 19, 2025 | Hard Attention, image-classification | Unverified | 0 |
| NoPE: The Counting Power of Transformers with No Positional Encodings | May 16, 2025 | Hard Attention | Unverified | 0 |
| Neuroevolution of Self-Attention Over Proto-Objects | Apr 30, 2025 | Hard Attention, Image Segmentation | Unverified | 0 |
| AttentionDrop: A Novel Regularization Method for Transformer Models | Apr 16, 2025 | Hard Attention | Unverified | 0 |
| Center-guided Classifier for Semantic Segmentation of Remote Sensing Images | Mar 21, 2025 | Hard Attention, Segmentation | Code Available | 0 |
| Unique Hard Attention: A Tale of Two Sides | Mar 18, 2025 | Hard Attention | Unverified | 0 |
| Lower Bounds for Chain-of-Thought Reasoning in Hard-Attention Transformers | Feb 4, 2025 | Hard Attention | Unverified | 0 |
| Ehrenfeucht-Haussler Rank and Chain of Thought | Jan 22, 2025 | Decoder, Hard Attention | Unverified | 0 |
| Simulating Hard Attention Using Soft Attention | Dec 13, 2024 | Hard Attention | Unverified | 0 |
| Transformers in Uniform TC^0 | Sep 20, 2024 | Hard Attention | Unverified | 0 |
| Soft-Hard Attention U-Net Model and Benchmark Dataset for Multiscale Image Shadow Removal | Aug 7, 2024 | Benchmarking, Hard Attention | Unverified | 0 |
| Hard-Attention Gates with Gradient Routing for Endoscopic Image Computing | Jul 5, 2024 | Binary Classification, feature selection | Code Available | 1 |
| TRIP: Trainable Region-of-Interest Prediction for Hardware-Efficient Neuromorphic Processing on Event-based Vision | Jun 25, 2024 | Event-based vision, Hard Attention | Code Available | 0 |
| Transformers as Transducers | Apr 2, 2024 | Hard Attention, POS | Unverified | 0 |
| Recurrent Alignment with Hard Attention for Hierarchical Text Rating | Feb 14, 2024 | Hard Attention | Code Available | 0 |
| GQHAN: A Grover-inspired Quantum Hard Attention Network | Jan 25, 2024 | Binary Classification, Hard Attention | Unverified | 0 |
| Mutual Distillation Learning For Person Re-Identification | Jan 12, 2024 | Hard Attention, Person Re-Identification | Code Available | 1 |
| Multi-View Unsupervised Image Generation with Cross Attention Guidance | Dec 7, 2023 | Hard Attention, Image Generation | Unverified | 0 |
| Continual Diffusion with STAMINA: STack-And-Mask INcremental Adapters | Nov 30, 2023 | Continual Learning, Hard Attention | Unverified | 0 |
| Vamos: Versatile Action Models for Video Understanding | Nov 22, 2023 | EgoSchema, Hard Attention | Code Available | 0 |
| Masked Hard-Attention Transformers Recognize Exactly the Star-Free Languages | Oct 21, 2023 | Hard Attention, Position | Unverified | 0 |
| Language-Guided Reinforcement Learning for Hard Attention in Few-Shot Learning | Oct 11, 2023 | Deep Reinforcement Learning, Few-Shot Learning | Unverified | 0 |
| Logical Languages Accepted by Transformer Encoders with Hard Attention | Oct 5, 2023 | Hard Attention | Unverified | 0 |
| Investigation of Architectures and Receptive Fields for Appearance-based Gaze Estimation | Aug 18, 2023 | Contrastive Learning, Disentanglement | Code Available | 1 |
| Average-Hard Attention Transformers are Constant-Depth Uniform Threshold Circuits | Aug 6, 2023 | Hard Attention | Unverified | 0 |
| On the Learning Dynamics of Attention Networks | Jul 25, 2023 | Hard Attention | Code Available | 0 |
| HAT-CL: A Hard-Attention-to-the-Task PyTorch Library for Continual Learning | Jul 18, 2023 | Continual Learning, Hard Attention | Code Available | 0 |
| Coherent Concept-based Explanations in Medical Image and Its Application to Skin Lesion Diagnosis | Apr 10, 2023 | Diagnostic, Hard Attention | Code Available | 1 |
| MESAHA-Net: Multi-Encoders based Self-Adaptive Hard Attention Network with Maximum Intensity Projections for Lung Nodule Segmentation in CT Scan | Apr 4, 2023 | Computed Tomography (CT), Decoder | Unverified | 0 |
| Dual Attention Model with Reinforcement Learning for Classification of Histology Whole-Slide Images | Feb 19, 2023 | Hard Attention, whole slide images | Unverified | 0 |
| Learning to Perceive in Deep Model-Free Reinforcement Learning | Jan 10, 2023 | Atari Games, Hard Attention | Code Available | 0 |
| Deep Pneumonia: Attention-Based Contrastive Learning for Class-Imbalanced Pneumonia Lesion Recognition in Chest X-rays | Jul 23, 2022 | Contrastive Learning, Hard Attention | Unverified | 0 |
| Dual Attention Networks for Few-Shot Fine-Grained Recognition | Jun 28, 2022 | Hard Attention, Meta-Learning | Code Available | 0 |
| Table Retrieval May Not Necessitate Table-specific Model Design | May 19, 2022 | Hard Attention, Natural Questions | Code Available | 1 |
| Binding Actions to Objects in World Models | Apr 27, 2022 | Hard Attention, Object | Code Available | 0 |
| Formal Language Recognition by Hard Attention Transformers: Perspectives from Circuit Complexity | Apr 13, 2022 | Hard Attention | Unverified | 0 |
| Consistency driven Sequential Transformers Attention Model for Partially Observable Scenes | Apr 1, 2022 | Hard Attention | Code Available | 0 |
| Video Violence Recognition and Localization Using a Semi-Supervised Hard Attention Model | Feb 4, 2022 | Activity Recognition, Hard Attention | Unverified | 0 |
| You Only Need One Model for Open-domain Question Answering | Dec 14, 2021 | Hard Attention, Natural Questions | Unverified | 0 |
| CLAWS: Contrastive Learning with hard Attention and Weak Supervision | Dec 1, 2021 | Anomaly Detection, Contrastive Learning | Unverified | 0 |
| A Probabilistic Hard Attention Model For Sequentially Observed Scenes | Nov 15, 2021 | Hard Attention | Code Available | 0 |
| Understanding Interlocking Dynamics of Cooperative Rationalization | Oct 26, 2021 | Hard Attention | Code Available | 0 |
| Sharp Attention for Sequence to Sequence Learning | Sep 29, 2021 | Hard Attention, Scene Text Recognition | Unverified | 0 |
| Specialized Transformers: Faster, Smaller and more Accurate NLP Models | Sep 29, 2021 | Hard Attention, Quantization | Unverified | 0 |
| Saturated Transformers are Constant-Depth Threshold Circuits | Jun 30, 2021 | Hard Attention | Unverified | 0 |
| Multimodal Emergent Fake News Detection via Meta Neural Process Networks | Jun 22, 2021 | Fake News Detection, Hard Attention | Unverified | 0 |
| Self-Attention Networks Can Process Bounded Hierarchical Languages | May 24, 2021 | Hard Attention | Code Available | 1 |