SOTAVerified

Long-range modeling

A new task for testing the long-sequence modeling capabilities and efficiency of language models.

Image credit: SCROLLS: Standardized CompaRison Over Long Language Sequences

Papers

Showing 76-95 of 95 papers

Title | Status | Hype
Token Transformer: Can class token help window-based transformer build better long-range interactions? | | 0
Focus Your Attention (with Adaptive IIR Filters) | | 0
Gated Relational Graph Attention Networks | | 0
ReGNet: Reciprocal Space-Aware Long-Range Modeling for Crystalline Property Prediction | | 0
HST-MRF: Heterogeneous Swin Transformer with Multi-Receptive Field for Medical Image Segmentation | | 0
A General-Purpose Multilingual Document Encoder | Code | 0
Advancing Regular Language Reasoning in Linear Recurrent Neural Networks | Code | 0
How to Train Your HiPPO: State Space Models with Generalized Orthogonal Basis Projections | Code | 0
CNSNet: A Cleanness-Navigated-Shadow Network for Shadow Removal | Code | 0
On the Parameterization and Initialization of Diagonal State Space Models | Code | 0
Sparse Factorization of Large Square Matrices | Code | 0
Part Representation Learning with Teacher-Student Decoder for Occluded Person Re-identification | Code | 0
CDPDNet: Integrating Text Guidance with Hybrid Vision Encoders for Medical Image Segmentation | Code | 0
Hybrid-Emba3D: Geometry-Aware and Cross-Path Feature Hybrid Enhanced State Space Model for Point Cloud Classification | Code | 0
vGamba: Attentive State Space Bottleneck for efficient Long-range Dependencies in Visual Recognition | Code | 0
RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation | Code | 0
LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models | Code | 0
Dimension Mixer: Group Mixing of Input Dimensions for Efficient Function Approximation | Code | 0
Diagonal State Spaces are as Effective as Structured State Spaces | Code | 0
RFR-WWANet: Weighted Window Attention-Based Recovery Feature Resolution Network for Unsupervised Image Registration | Code | 0
Page 4 of 4

No leaderboard results yet.