| Charformer: Fast Character Transformers via Gradient-based Subword Tokenization | Jun 23, 2021 | Inductive BiasLinguistic Acceptability | CodeCode Available | 1 |
| A Generalization of Transformer Networks to Graphs | Dec 17, 2020 | Graph RegressionInductive Bias | CodeCode Available | 1 |
| K-Space Transformer for Undersampled MRI Reconstruction | Jun 14, 2022 | DecoderInductive Bias | CodeCode Available | 1 |
| Chunked Autoregressive GAN for Conditional Waveform Synthesis | Oct 19, 2021 | Inductive Bias | CodeCode Available | 1 |
| Arithmetic Feature Interaction Is Necessary for Deep Tabular Learning | Feb 4, 2024 | Inductive Bias | CodeCode Available | 1 |
| Convolutional Bypasses Are Better Vision Transformer Adapters | Jul 14, 2022 | Few-Shot LearningInductive Bias | CodeCode Available | 1 |
| Amortized Inference for Causal Structure Learning | May 25, 2022 | Causal DiscoveryInductive Bias | CodeCode Available | 1 |
| CMUNeXt: An Efficient Medical Image Segmentation Network based on Large Kernel and Skip Fusion | Aug 2, 2023 | Image SegmentationInductive Bias | CodeCode Available | 1 |
| Discovering and Explaining the Representation Bottleneck of Graph Neural Networks from Multi-order Interactions | May 15, 2022 | graph constructionGraph Learning | CodeCode Available | 1 |
| DeiT-LT Distillation Strikes Back for Vision Transformer Training on Long-Tailed Datasets | Apr 3, 2024 | Image ClassificationInductive Bias | CodeCode Available | 1 |