Are Transformers Effective for Time Series Forecasting?

2022-05-26Code Available4· sign in to hype

Ailing Zeng, Muxi Chen, Lei Zhang, Qiang Xu

Code Available — Be the first to reproduce this paper.

Code

github.com/cure-lab/ltsf-linear
In paperpytorch★ 2,435
github.com/cure-lab/DLinear
In paperpytorch★ 2,435
github.com/taohan10200/weather-5k
pytorch★ 88
github.com/ioannislivieris/dlinear
pytorch★ 41
github.com/honeywell21/DLinear
pytorch★ 29
github.com/Hannibal046/GridTST
pytorch★ 18
github.com/jafarbakhshaliyev/wave-augs
pytorch★ 10
github.com/remigenet/TLN
jax★ 1

Abstract

Recently, there has been a surge of Transformer-based solutions for the long-term time series forecasting (LTSF) task. Despite the growing performance over the past few years, we question the validity of this line of research in this work. Specifically, Transformers is arguably the most successful solution to extract the semantic correlations among the elements in a long sequence. However, in time series modeling, we are to extract the temporal relations in an ordered set of continuous points. While employing positional encoding and using tokens to embed sub-series in Transformers facilitate preserving some ordering information, the nature of the permutation-invariant self-attention mechanism inevitably results in temporal information loss. To validate our claim, we introduce a set of embarrassingly simple one-layer linear models named LTSF-Linear for comparison. Experimental results on nine real-life datasets show that LTSF-Linear surprisingly outperforms existing sophisticated Transformer-based LTSF models in all cases, and often by a large margin. Moreover, we conduct comprehensive empirical studies to explore the impacts of various design elements of LTSF models on their temporal relation extraction capability. We hope this surprising finding opens up new research directions for the LTSF task. We also advocate revisiting the validity of Transformer-based solutions for other time series analysis tasks (e.g., anomaly detection) in the future. Code is available at: https://github.com/cure-lab/LTSF-Linear.

Tasks

Anomaly Detection Relation Extraction Temporal Relation Extraction Time Series Time Series Analysis Time Series Forecasting

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
Electricity (192)	DLinear	MSE	0.15	—	Unverified
Electricity (336)	DLinear	MSE	0.17	—	Unverified
Electricity (720)	DLinear	MSE	0.2	—	Unverified
Electricity (96)	DLinear	MSE	0.14	—	Unverified
ETTh1 (192) Multivariate	NLinear	MSE	0.41	—	Unverified
ETTh1 (192) Multivariate	DLinear	MSE	0.41	—	Unverified
ETTh1 (192) Univariate	DLinear	MSE	0.07	—	Unverified
ETTh1 (336) Multivariate	DLinear	MSE	0.44	—	Unverified
ETTh1 (336) Multivariate	NLinear	MSE	0.43	—	Unverified
ETTh1 (336) Univariate	DLinear	MSE	0.1	—	Unverified
ETTh1 (336) Univariate	NLinear	MSE	0.08	—	Unverified
ETTh1 (720) Multivariate	NLinear	MSE	0.44	—	Unverified
ETTh1 (720) Multivariate	DLinear	MSE	0.47	—	Unverified
ETTh1 (720) Univariate	DLinear	MSE	0.19	—	Unverified
ETTh1 (720) Univariate	NLinear	MSE	0.08	—	Unverified
ETTh1 (96) Univariate	DLinear	MSE	0.06	—	Unverified
ETTh1 (96) Univariate	NLinear	MSE	0.05	—	Unverified
ETTh2 (192) Multivariate	NLinear	MSE	0.34	—	Unverified
ETTh2 (192) Multivariate	DLinear	MSE	0.38	—	Unverified
ETTh2 (192) Univariate	NLinear	MSE	0.17	—	Unverified
ETTh2 (192) Univariate	DLinear	MSE	0.18	—	Unverified
ETTh2 (336) Multivariate	DLinear	MSE	0.45	—	Unverified
ETTh2 (336) Multivariate	NLinear	MSE	0.36	—	Unverified
ETTh2 (336) Univariate	DLinear	MSE	0.21	—	Unverified
ETTh2 (336) Univariate	NLinear	MSE	0.19	—	Unverified
ETTh2 (720) Multivariate	NLinear	MSE	0.39	—	Unverified
ETTh2 (720) Multivariate	DLinear	MSE	0.61	—	Unverified
ETTh2 (720) Univariate	NLinear	MSE	0.23	—	Unverified
ETTh2 (720) Univariate	DLinear	MSE	0.28	—	Unverified
ETTh2 (96) Multivariate	NLinear	MSE	0.28	—	Unverified
ETTh2 (96) Multivariate	DLinear	MSE	0.29	—	Unverified
ETTh2 (96) Univariate	NLinear	MSE	0.13	—	Unverified
ETTh2 (96) Univariate	DLinear	MSE	0.13	—	Unverified
Weather (192)	DLinear	MSE	0.22	—	Unverified
Weather (336)	DLinear	MSE	0.27	—	Unverified
Weather (720)	DLinear	MSE	0.32	—	Unverified
Weather (96)	DLinear	MSE	0.18	—	Unverified

Are Transformers Effective for Time Series Forecasting?

Code

Abstract

Tasks

Benchmark Results

Reproductions