Spatial Token Mixer
Spatial Token Mixer (STM) is a module for vision transformers that aims to make token mixing more efficient. It is a form of depthwise convolution applied along the spatial dimension of the tokens, and it is designed as a drop-in replacement for the token-mixing layers in vision transformers.
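To make the idea of depthwise convolution over tokens concrete, here is a minimal NumPy sketch. It is an illustration under stated assumptions, not the published STM implementation: the description above does not give the kernel size, padding scheme, or handling of a class token, so the function name `depthwise_token_mixer`, the zero padding, the odd square kernel, and the row-major token layout are all hypothetical choices.

```python
import numpy as np


def depthwise_token_mixer(tokens, kernels, grid_hw):
    """Mix tokens spatially with one k x k filter per channel.

    Hypothetical helper for illustration only.
    tokens:  (N, C) array of N = H * W patch tokens in row-major order
    kernels: (k, k, C) array, one k x k filter per channel (k odd)
    grid_hw: (H, W) spatial layout of the tokens
    """
    h, w = grid_hw
    n, c = tokens.shape
    k = kernels.shape[0]
    assert n == h * w, "token count must match the grid size"
    assert kernels.shape == (k, k, c) and k % 2 == 1

    pad = k // 2
    # Restore the 2D grid so that neighbouring tokens are spatial neighbours.
    grid = tokens.reshape(h, w, c)
    # Assumption: zero padding keeps the output the same size as the input.
    padded = np.pad(grid, ((pad, pad), (pad, pad), (0, 0)))

    out = np.zeros_like(grid, dtype=float)
    # Depthwise: each channel is filtered independently, with no channel mixing.
    # As in most deep learning libraries, this is a cross-correlation.
    for i in range(k):
        for j in range(k):
            out += padded[i:i + h, j:j + w, :] * kernels[i, j, :]
    return out.reshape(n, c)


# Usage: a 3 x 4 grid of tokens with 2 channels and a 3 x 3 averaging filter.
tokens = np.ones((12, 2))
kernels = np.full((3, 3, 2), 1.0 / 9.0)
mixed = depthwise_token_mixer(tokens, kernels, (3, 4))
```

Because the input and output both have shape `(N, C)`, a layer like this can sit where a token-mixing layer would otherwise go, which is what "drop-in replacement" refers to. Each output token depends only on a k x k neighbourhood and each channel has its own filter, so the cost grows linearly with the number of tokens.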