| Learning Hierarchical Structures with Differentiable Nondeterministic Stacks | Sep 5, 2021 | Inductive BiasLanguage Modeling | CodeCode Available | 1 |
| Depth-Wise Convolutions in Vision Transformers for Efficient Training on Small Datasets | Jul 28, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality | Feb 14, 2020 | Inductive BiasMetric Learning | CodeCode Available | 1 |
| Equivariance and Invariance Inductive Bias for Learning from Insufficient Data | Jul 25, 2022 | Inductive Bias | CodeCode Available | 1 |
| Learning to Encode Position for Transformer with Continuous Dynamical Model | Mar 13, 2020 | Inductive BiasLinguistic Acceptability | CodeCode Available | 1 |
| Learning to Optimize for Reinforcement Learning | Feb 3, 2023 | Inductive BiasMeta-Learning | CodeCode Available | 1 |
| ADeLA: Automatic Dense Labeling with Attention for Viewpoint Adaptation in Semantic Segmentation | Jul 29, 2021 | Domain AdaptationHallucination | CodeCode Available | 1 |
| ALMA: Hierarchical Learning for Composite Multi-Agent Tasks | May 27, 2022 | Decision MakingInductive Bias | CodeCode Available | 1 |
| EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention | Oct 10, 2023 | Computational Efficiencyimage-classification | CodeCode Available | 1 |
| Attention is Not All You Need: Pure Attention Loses Rank Doubly Exponentially with Depth | Mar 5, 2021 | AllInductive Bias | CodeCode Available | 1 |