Spatial Token Mixer

Spatial Token Mixer (STM) is a module for vision transformers that aims to make token mixing more efficient. It is a form of depthwise convolution applied over the spatial dimensions of the token grid, and it serves as a drop-in replacement for the token-mixing layers in a vision transformer.
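As a rough illustration of the idea described above, the sketch below mixes tokens with one small kernel per channel over a 2D token grid. It is not taken from any of the listed papers: the function name `depthwise_token_mixer`, the row-major token layout, the zero padding, and the example kernels are all assumptions made for this example, and a real model would use a learned, framework-provided depthwise convolution instead of Python loops.

```python
def depthwise_token_mixer(tokens, height, width, kernels):
    """Hypothetical sketch: depthwise k x k convolution over a token grid.

    tokens:  list of height*width token vectors (row-major), each of length C
    kernels: list of C kernels, one per channel, each a k x k list (k odd)
    Returns a new list of height*width token vectors of length C.
    Positions outside the grid are treated as zeros (zero padding).
    """
    if len(tokens) != height * width:
        raise ValueError("expected height * width tokens")
    num_channels = len(kernels)
    k = len(kernels[0])
    pad = k // 2
    out = [[0.0] * num_channels for _ in range(height * width)]
    for c in range(num_channels):          # each channel is mixed independently
        kernel = kernels[c]
        for y in range(height):
            for x in range(width):
                acc = 0.0
                for dy in range(k):
                    for dx in range(k):
                        ny, nx = y + dy - pad, x + dx - pad
                        if 0 <= ny < height and 0 <= nx < width:
                            acc += kernel[dy][dx] * tokens[ny * width + nx][c]
                out[y * width + x][c] = acc
    return out


# Example: 6 tokens with 2 channels, arranged as a 2 x 3 grid.
grid = [[float(i), 10.0 * i] for i in range(6)]
identity = [[0.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 0.0]]
box = [[1.0] * 3 for _ in range(3)]       # sums each 3 x 3 neighbourhood
mixed = depthwise_token_mixer(grid, 2, 3, [box, identity])
```

Because each channel has its own kernel and channels never interact, the cost grows with the kernel area rather than with the square of the token count, which is the usual efficiency argument for replacing attention-style mixing with a depthwise convolution.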

Papers

Showing 1-4 of 4 papers

Title | Status | Hype
UniNeXt: Exploring A Unified Architecture for Vision Recognition | Code | 1
CARD: Semantic Segmentation with Efficient Class-Aware Regularized Decoder | Code | 1
Demystify Transformers & Convolutions in Modern Image Deep Networks | Code | 1
WaveMix: A Resource-efficient Neural Network for Image Analysis | Code | 1

No leaderboard results yet.