Long-range modeling

A new task for testing the long-sequence modeling capabilities and efficiency of language models.

Image credit: SCROLLS: Standardized CompaRison Over Long Language Sequences

Papers

Showing 51–75 of 95 papers

Title	Date	Tasks	Status	Hype
Sparse Modular Activation for Efficient Sequence Modeling	Jun 19, 2023	Chunking, Language Modeling	Code Available	1
The Expressive Leaky Memory Neuron: an Efficient and Expressive Phenomenological Neuron Model Can Solve Long-Horizon Tasks	Jun 14, 2023	16k, Classification	Code Available	1
Primal-Attention: Self-attention through Asymmetric Kernel SVD in Primal Representation	May 31, 2023	D4RL, Language Modelling	Code Available	1
Fourier Transformer: Fast Long Range Modeling by Removing Sequence Redundancy with FFT Operator	May 24, 2023	Abstractive Text Summarization, Document Summarization	Code Available	1
Focus Your Attention (with Adaptive IIR Filters)	May 24, 2023	Language Modelling, Long-range modeling	Unverified	0
T-former: An Efficient Transformer for Image Inpainting	May 12, 2023	Image Inpainting, Long-range modeling	Code Available	1
A General-Purpose Multilingual Document Encoder	May 11, 2023	Cross-Lingual Transfer, Document Classification	Code Available	0
RFR-WWANet: Weighted Window Attention-Based Recovery Feature Resolution Network for Unsupervised Image Registration	May 7, 2023	Computational Efficiency, Image Registration	Code Available	0
HST-MRF: Heterogeneous Swin Transformer with Multi-Receptive Field for Medical Image Segmentation	Apr 10, 2023	Image Segmentation, Lesion Segmentation	Unverified	0
CoLT5: Faster Long-Range Transformers with Conditional Computation	Mar 17, 2023	Long-range modeling	Unverified	0
Hungry Hungry Hippos: Towards Language Modeling with State Space Models	Dec 28, 2022	8k, Coreference Resolution	Code Available	2
Token Transformer: Can class token help window-based transformer build better long-range interactions?	Nov 11, 2022	image-classification, Image Classification	Unverified	0
What Makes Convolutional Models Great on Long Sequence Modeling?	Oct 17, 2022	Long-range modeling	Code Available	1
CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling	Oct 14, 2022	Benchmarking, Language Modeling	Code Available	1
Pose Guided Human Image Synthesis with Partially Decoupled GAN	Oct 7, 2022	Decoder, Image Generation	Unverified	0
Multi-scale Attention Network for Single Image Super-Resolution	Sep 28, 2022	Blocking, Image Super-Resolution	Code Available	1
Liquid Structural State-Space Models	Sep 26, 2022	Heart rate estimation, Long-range modeling	Code Available	2
Mega: Moving Average Equipped Gated Attention	Sep 21, 2022	Image Classification, Inductive Bias	Code Available	2
Adapting Pretrained Text-to-Text Models for Long Text Sequences	Sep 21, 2022	Long-range modeling, Question Answering	Code Available	1
CNSNet: A Cleanness-Navigated-Shadow Network for Shadow Removal	Sep 6, 2022	Long-range modeling, Shadow Removal	Code Available	0
Simplified State Space Layers for Sequence Modeling	Aug 9, 2022	Computational Efficiency, ListOps	Code Available	2
Investigating Efficiently Extending Transformers for Long Input Summarization	Aug 8, 2022	16k, Long-range modeling	Code Available	3
U-Net vs Transformer: Is U-Net Outdated in Medical Image Registration?	Aug 7, 2022	Image Registration, Long-range modeling	Code Available	1
Efficient Long-Text Understanding with Short-Text Models	Aug 1, 2022	Articles, Decoder	Code Available	1
Weakly Supervised Object Localization via Transformer with Implicit Spatial Calibration	Jul 21, 2022	Long-range modeling, Object	Code Available	1

No leaderboard results yet.