SOTAVerified

Long-range modeling

A new task for testing the long-sequence modeling capabilities and efficiency of language models.

Image credit: SCROLLS: Standardized CompaRison Over Long Language Sequences

Papers

Showing 1-50 of 95 papers

Title | Status | Hype
Mamba: Linear-Time Sequence Modeling with Selective State Spaces | Code | 6
MedMamba: Vision Mamba for Medical Image Classification | Code | 4
LION: Linear Group RNN for 3D Object Detection in Point Clouds | Code | 3
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection | Code | 3
Investigating Efficiently Extending Transformers for Long Input Summarization | Code | 3
MambaFusion: Height-Fidelity Dense Global Fusion for Multi-modal 3D Object Detection | Code | 2
Emulating Self-attention with Convolution for Efficient Image Super-Resolution | Code | 2
PoseMamba: Monocular 3D Human Pose Estimation with Bidirectional Global-Local Spatio-Temporal State Space Model | Code | 2
MambaVC: Learned Visual Compression with Selective State Spaces | Code | 2
DVMSR: Distillated Vision Mamba for Efficient Super-Resolution | Code | 2
LKM-UNet: Large Kernel Vision Mamba UNet for Medical Image Segmentation | Code | 2
nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space Model | Code | 2
MambaMorph: a Mamba-based Framework for Medical MR-CT Deformable Registration | Code | 2
TaskExpert: Dynamically Assembling Multi-Task Representations with Memorial Mixture-of-Experts | Code | 2
Hungry Hungry Hippos: Towards Language Modeling with State Space Models | Code | 2
Liquid Structural State-Space Models | Code | 2
Mega: Moving Average Equipped Gated Attention | Code | 2
Simplified State Space Layers for Sequence Modeling | Code | 2
U-RWKV: Lightweight medical image segmentation with direction-adaptive RWKV | Code | 1
JanusDNA: A Powerful Bi-directional Hybrid DNA Foundation Model | Code | 1
VADMamba: Exploring State Space Models for Fast Video Anomaly Detection | Code | 1
KM-UNet KAN Mamba UNet for medical image segmentation | Code | 1
GIRAFFE: Design Choices for Extending the Context Length of Visual Language Models | Code | 1
CT-Mamba: A Hybrid Convolutional State Space Model for Low-Dose CT Denoising | Code | 1
QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model | Code | 1
Hierarchical Separable Video Transformer for Snapshot Compressive Imaging | Code | 1
Long Range Propagation on Continuous-Time Dynamic Graphs | Code | 1
Spatio-Spectral Graph Neural Networks | Code | 1
A Simple LLM Framework for Long-Range Video Question-Answering | Code | 1
Recurrent Distance Filtering for Graph Representation Learning | Code | 1
ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining | Code | 1
Sparse Modular Activation for Efficient Sequence Modeling | Code | 1
The Expressive Leaky Memory Neuron: an Efficient and Expressive Phenomenological Neuron Model Can Solve Long-Horizon Tasks | Code | 1
Primal-Attention: Self-attention through Asymmetric Kernel SVD in Primal Representation | Code | 1
Fourier Transformer: Fast Long Range Modeling by Removing Sequence Redundancy with FFT Operator | Code | 1
T-former: An Efficient Transformer for Image Inpainting | Code | 1
What Makes Convolutional Models Great on Long Sequence Modeling? | Code | 1
CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling | Code | 1
Multi-scale Attention Network for Single Image Super-Resolution | Code | 1
Adapting Pretrained Text-to-Text Models for Long Text Sequences | Code | 1
U-Net vs Transformer: Is U-Net Outdated in Medical Image Registration? | Code | 1
Efficient Long-Text Understanding with Short-Text Models | Code | 1
Weakly Supervised Object Localization via Transformer with Implicit Spatial Calibration | Code | 1
ChordMixer: A Scalable Neural Attention Model for Sequences with Different Lengths | Code | 1
UL2: Unifying Language Learning Paradigms | Code | 1
Paramixer: Parameterizing Mixing Links in Sparse Factors Works Better than Dot-Product Self-Attention | Code | 1
SCROLLS: Standardized CompaRison Over Long Language Sequences | Code | 1
Classification of Long Sequential Data using Circular Dilated Convolutional Neural Networks | Code | 1
LongT5: Efficient Text-To-Text Transformer for Long Sequences | Code | 1
Efficiently Modeling Long Sequences with Structured State Spaces | Code | 1

No leaderboard results yet.