SOTAVerified

Long-range modeling

Long-range modeling is a task that tests how well, and how efficiently, language models handle long sequences.

Image credit: SCROLLS: Standardized CompaRison Over Long Language Sequences

Papers

Showing 26-50 of 95 papers

Title | Status | Hype
DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning | Code | 1
Efficient Long-Text Understanding with Short-Text Models | Code | 1
Efficiently Modeling Long Sequences with Structured State Spaces | Code | 1
Fourier Transformer: Fast Long Range Modeling by Removing Sequence Redundancy with FFT Operator | Code | 1
GIRAFFE: Design Choices for Extending the Context Length of Visual Language Models | Code | 1
Hierarchical Separable Video Transformer for Snapshot Compressive Imaging | Code | 1
Image Super-Resolution With Non-Local Sparse Attention | Code | 1
JanusDNA: A Powerful Bi-directional Hybrid DNA Foundation Model | Code | 1
KM-UNet KAN Mamba UNet for medical image segmentation | Code | 1
Long Range Arena: A Benchmark for Efficient Transformers | Code | 1
Long Range Propagation on Continuous-Time Dynamic Graphs | Code | 1
LongT5: Efficient Text-To-Text Transformer for Long Sequences | Code | 1
Multi-scale Attention Network for Single Image Super-Resolution | Code | 1
Paramixer: Parameterizing Mixing Links in Sparse Factors Works Better than Dot-Product Self-Attention | Code | 1
Primal-Attention: Self-attention through Asymmetric Kernel SVD in Primal Representation | Code | 1
QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model | Code | 1
Recurrent Distance Filtering for Graph Representation Learning | Code | 1
SCROLLS: Standardized CompaRison Over Long Language Sequences | Code | 1
Sparse Modular Activation for Efficient Sequence Modeling | Code | 1
Spatio-Spectral Graph Neural Networks | Code | 1
T-former: An Efficient Transformer for Image Inpainting | Code | 1
The Expressive Leaky Memory Neuron: an Efficient and Expressive Phenomenological Neuron Model Can Solve Long-Horizon Tasks | Code | 1
U-Net vs Transformer: Is U-Net Outdated in Medical Image Registration? | Code | 1
UL2: Unifying Language Learning Paradigms | Code | 1
U-RWKV: Lightweight medical image segmentation with direction-adaptive RWKV | Code | 1
Page 2 of 4

No leaderboard results yet.