SOTAVerified

PatchMixer: A Patch-Mixing Architecture for Long-Term Time Series Forecasting

2023-10-01Code Available2· sign in to hype

Zeying Gong, Yujin Tang, Junwei Liang

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Although the Transformer has been the dominant architecture for time series forecasting tasks in recent years, a fundamental challenge remains: the permutation-invariant self-attention mechanism within Transformers leads to a loss of temporal information. To tackle these challenges, we propose PatchMixer, a novel CNN-based model. It introduces a permutation-variant convolutional structure to preserve temporal information. Diverging from conventional CNNs in this field, which often employ multiple scales or numerous branches, our method relies exclusively on depthwise separable convolutions. This allows us to extract both local features and global correlations using a single-scale architecture. Furthermore, we employ dual forecasting heads encompassing linear and nonlinear components to better model future curve trends and details. Our experimental results on seven time-series forecasting benchmarks indicate that compared with the state-of-the-art method and the best-performing CNN, PatchMixer yields 3.9\% and 21.2\% relative improvements, respectively, while being 2-3x faster than the most advanced method.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
ETTh1 (192) MultivariatePatchMixerMSE0.37Unverified
ETTh1 (192) UnivariatePatchMixerMSE0.07Unverified
ETTh1 (336) MultivariatePatchMixerMSE0.39Unverified
ETTh1 (336) UnivariatePatchMixerMSE0.08Unverified
ETTh1 (720) MultivariatePatchMixerMSE0.45Unverified
ETTh1 (720) UnivariatePatchMixerMSE0.09Unverified
ETTh1 (96) MultivariatePatchMixerMSE0.35Unverified
ETTh1 (96) UnivariatePatchMixerMSE0.05Unverified
ETTh2 (192) MultivariatePatchMixerMSE0.27Unverified
ETTh2 (192) UnivariatePatchMixerMSE0.15Unverified
ETTh2 (336) MultivariatePatchMixerMSE0.32Unverified
ETTh2 (336) UnivariatePatchMixerMSE0.17Unverified
ETTh2 (720) MultivariatePatchMixerMSE0.39Unverified
ETTh2 (720) UnivariatePatchMixerMSE0.22Unverified
ETTh2 (96) MultivariatePatchMixerMSE0.23Unverified
ETTh2 (96) UnivariatePatchMixerMSE0.12Unverified

Reproductions