| Hierarchical Separable Video Transformer for Snapshot Compressive Imaging | Jul 16, 2024 | Inductive Bias, Long-range modeling | Code Available | 1 |
| Long Range Propagation on Continuous-Time Dynamic Graphs | Jun 4, 2024 | Long-range modeling | Code Available | 1 |
| Spatio-Spectral Graph Neural Networks | May 29, 2024 | GPU, Graph Classification | Code Available | 1 |
| A Simple LLM Framework for Long-Range Video Question-Answering | Dec 28, 2023 | EgoSchema, Language Modelling | Code Available | 1 |
| Recurrent Distance Filtering for Graph Representation Learning | Dec 3, 2023 | Graph Classification, Graph Representation Learning | Code Available | 1 |
| ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining | Jun 21, 2023 | Decoder, Long-range modeling | Code Available | 1 |
| Sparse Modular Activation for Efficient Sequence Modeling | Jun 19, 2023 | Chunking, Language Modeling | Code Available | 1 |
| The Expressive Leaky Memory Neuron: an Efficient and Expressive Phenomenological Neuron Model Can Solve Long-Horizon Tasks | Jun 14, 2023 | 16k, Classification | Code Available | 1 |
| Primal-Attention: Self-attention through Asymmetric Kernel SVD in Primal Representation | May 31, 2023 | D4RL, Language Modelling | Code Available | 1 |
| Fourier Transformer: Fast Long Range Modeling by Removing Sequence Redundancy with FFT Operator | May 24, 2023 | Abstractive Text Summarization, Document Summarization | Code Available | 1 |
| T-former: An Efficient Transformer for Image Inpainting | May 12, 2023 | Image Inpainting, Long-range modeling | Code Available | 1 |
| What Makes Convolutional Models Great on Long Sequence Modeling? | Oct 17, 2022 | Long-range modeling | Code Available | 1 |
| CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling | Oct 14, 2022 | Benchmarking, Language Modeling | Code Available | 1 |
| Multi-scale Attention Network for Single Image Super-Resolution | Sep 28, 2022 | Blocking, Image Super-Resolution | Code Available | 1 |
| Adapting Pretrained Text-to-Text Models for Long Text Sequences | Sep 21, 2022 | Long-range modeling, Question Answering | Code Available | 1 |
| U-Net vs Transformer: Is U-Net Outdated in Medical Image Registration? | Aug 7, 2022 | Image Registration, Long-range modeling | Code Available | 1 |
| Efficient Long-Text Understanding with Short-Text Models | Aug 1, 2022 | Articles, Decoder | Code Available | 1 |
| Weakly Supervised Object Localization via Transformer with Implicit Spatial Calibration | Jul 21, 2022 | Long-range modeling, Object | Code Available | 1 |
| ChordMixer: A Scalable Neural Attention Model for Sequences with Different Lengths | Jun 12, 2022 | Chunking, Document Classification | Code Available | 1 |
| UL2: Unifying Language Learning Paradigms | May 10, 2022 | Arithmetic Reasoning, Common Sense Reasoning | Code Available | 1 |
| Paramixer: Parameterizing Mixing Links in Sparse Factors Works Better than Dot-Product Self-Attention | Apr 22, 2022 | Long-range modeling | Code Available | 1 |
| SCROLLS: Standardized CompaRison Over Long Language Sequences | Jan 10, 2022 | Decoder, Long-range modeling | Code Available | 1 |
| Classification of Long Sequential Data using Circular Dilated Convolutional Neural Networks | Jan 6, 2022 | Audio Classification, Classification | Code Available | 1 |
| LongT5: Efficient Text-To-Text Transformer for Long Sequences | Dec 15, 2021 | Abstractive Text Summarization, Long-range modeling | Code Available | 1 |
| Efficiently Modeling Long Sequences with Structured State Spaces | Oct 31, 2021 | Data Augmentation, Language Modeling | Code Available | 1 |