SOTAVerified

Long-range modeling

A new task for testing the long-sequence modeling capabilities and efficiency of language models.

Image credit: SCROLLS: Standardized CompaRison Over Long Language Sequences

Papers

Showing 150 of 95 papers

TitleStatusHype
U-RWKV: Lightweight medical image segmentation with direction-adaptive RWKVCode1
LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language ModelsCode0
MambaFusion: Height-Fidelity Dense Global Fusion for Multi-modal 3D Object DetectionCode2
Med-URWKV: Pure RWKV With ImageNet Pre-training For Medical Image Segmentation0
M2Restore: Mixture-of-Experts-based Mamba-CNN Fusion Framework for All-in-One Image Restoration0
CDPDNet: Integrating Text Guidance with Hybrid Vision Encoders for Medical Image SegmentationCode0
JanusDNA: A Powerful Bi-directional Hybrid DNA Foundation ModelCode1
Hybrid-Emba3D: Geometry-Aware and Cross-Path Feature Hybrid Enhanced State Space Model for Point Cloud ClassificationCode0
ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration0
vGamba: Attentive State Space Bottleneck for efficient Long-range Dependencies in Visual RecognitionCode0
VADMamba: Exploring State Space Models for Fast Video Anomaly DetectionCode1
Emulating Self-attention with Convolution for Efficient Image Super-ResolutionCode2
An Uncertainty Principle for Linear Recurrent Neural Networks0
Is Long Range Sequential Modeling Necessary For Colorectal Tumor Segmentation?0
ReGNet: Reciprocal Space-Aware Long-Range Modeling for Crystalline Property Prediction0
KM-UNet KAN Mamba UNet for medical image segmentationCode1
Exploring Historical Information for RGBE Visual Tracking with Mamba0
GIRAFFE: Design Choices for Extending the Context Length of Visual Language ModelsCode1
MambaXCTrack: Mamba-based Tracker with SSM Cross-correlation and Motion Prompt for Ultrasound Needle Tracking0
CT-Mamba: A Hybrid Convolutional State Space Model for Low-Dose CT DenoisingCode1
QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space ModelCode1
S7: Selective and Simplified State Space Layers for Sequence Modeling0
Shuffle Mamba: State Space Models with Random Shuffle for Multi-Modal Image Fusion0
PoseMamba: Monocular 3D Human Pose Estimation with Bidirectional Global-Local Spatio-Temporal State Space ModelCode2
Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network0
LION: Linear Group RNN for 3D Object Detection in Point CloudsCode3
Hierarchical Separable Video Transformer for Snapshot Compressive ImagingCode1
Short-Long Convolutions Help Hardware-Efficient Linear Attention to Focus on Long Sequences0
Long Range Propagation on Continuous-Time Dynamic GraphsCode1
Spatio-Spectral Graph Neural NetworksCode1
SMR: State Memory Replay for Long Sequence Modeling0
MambaLLIE: Implicit Retinex-Aware Low Light Enhancement with Global-then-Local State Space0
MambaVC: Learned Visual Compression with Selective State SpacesCode2
DVMSR: Distillated Vision Mamba for Efficient Super-ResolutionCode2
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly DetectionCode3
RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic SegmentationCode0
LKM-UNet: Large Kernel Vision Mamba UNet for Medical Image SegmentationCode2
MedMamba: Vision Mamba for Medical Image ClassificationCode4
nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space ModelCode2
MambaMorph: a Mamba-based Framework for Medical MR-CT Deformable RegistrationCode2
A Simple LLM Framework for Long-Range Video Question-AnsweringCode1
Part Representation Learning with Teacher-Student Decoder for Occluded Person Re-identificationCode0
Recurrent Distance Filtering for Graph Representation LearningCode1
Mamba: Linear-Time Sequence Modeling with Selective State SpacesCode6
Dimension Mixer: Group Mixing of Input Dimensions for Efficient Function ApproximationCode0
Advancing Regular Language Reasoning in Linear Recurrent Neural NetworksCode0
Improving FHB Screening in Wheat Breeding Using an Efficient Transformer Model0
TaskExpert: Dynamically Assembling Multi-Task Representations with Memorial Mixture-of-ExpertsCode2
AICT: An Adaptive Image Compression Transformer0
ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM PretrainingCode1
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.