SOTAVerified

Mixture-of-Linear-Experts for Long-term Time Series Forecasting

2023-12-11Code Available1· sign in to hype

Ronghao Ni, Zinan Lin, Shuaiqi Wang, Giulia Fanti

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Long-term time series forecasting (LTSF) aims to predict future values of a time series given the past values. The current state-of-the-art (SOTA) on this problem is attained in some cases by linear-centric models, which primarily feature a linear mapping layer. However, due to their inherent simplicity, they are not able to adapt their prediction rules to periodic changes in time series patterns. To address this challenge, we propose a Mixture-of-Experts-style augmentation for linear-centric models and propose Mixture-of-Linear-Experts (MoLE). Instead of training a single model, MoLE trains multiple linear-centric models (i.e., experts) and a router model that weighs and mixes their outputs. While the entire framework is trained end-to-end, each expert learns to specialize in a specific temporal pattern, and the router model learns to compose the experts adaptively. Experiments show that MoLE reduces forecasting error of linear-centric models, including DLinear, RLinear, and RMLP, in over 78% of the datasets and settings we evaluated. By using MoLE existing linear-centric models can achieve SOTA LTSF results in 68% of the experiments that PatchTST reports and we compare to, whereas existing single-head linear-centric models achieve SOTA results in only 25% of cases.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
Electricity (192)MoLE-DLinearMSE0.15Unverified
Electricity (336)MoLE-DLinearMSE0.16Unverified
Electricity (720)MoLE-RMLPMSE0.18Unverified
Electricity (720)MoLE-DLinearMSE0.18Unverified
Electricity (96)MoLE-DLinearMSE0.13Unverified
Electricity (96)MoLE-RMLPMSE0.13Unverified
ETTh1 (192) MultivariateMoLE-RLinearMSE0.4Unverified
ETTh1 (192) MultivariateMoLE-DLinearMSE0.45Unverified
ETTh1 (336) MultivariateMoLE-RLinearMSE0.43Unverified
ETTh1 (336) MultivariateMoLE-DLinearMSE0.47Unverified
ETTh1 (720) MultivariateMoLE-RLinearMSE0.45Unverified
ETTh1 (720) MultivariateMoLE-DLinearMSE0.51Unverified
ETTh1 (96) MultivariateMoLE-RLinearMSE0.38Unverified
ETTh1 (96) MultivariateMoLE-DLinearMSE0.38Unverified
ETTh2 (192) MultivariateMoLE-RLinearMSE0.34Unverified
ETTh2 (192) MultivariateMoLE-DLinearMSE0.36Unverified
ETTh2 (336) MultivariateMoLE-RLinearMSE0.37Unverified
ETTh2 (336) MultivariateMoLE-DLinearMSE0.42Unverified
ETTh2 (720) MultivariateMoLE-RLinearMSE0.41Unverified
ETTh2 (720) MultivariateMoLE-DLinearMSE0.61Unverified
ETTh2 (96) MultivariateMoLE-RLinearMSE0.27Unverified
ETTh2 (96) MultivariateMoLE-DLinearMSE0.29Unverified
ETTm1 (192) MultivariateMoLE-DLinearMSE0.33Unverified
ETTm1 (336) MultivariateMoLE-DLinearMSE0.38Unverified
ETTm1 (720) MultivariateMoLE-DLinearMSE0.45Unverified
ETTm1 (96) MultivariateMoLE-DLinearMSE0.29Unverified
ETTm2 (192) MultivariateMoLE-DLinearMSE0.23Unverified
ETTm2 (336) MultivariateMoLE-DLinearMSE0.29Unverified
ETTm2 (720) MultivariateMoLE-DLinearMSE0.4Unverified
ETTm2 (96) MultivariateMoLE-DLinearMSE0.17Unverified
Weather (192)MoLE-DLinearMSE0.2Unverified
Weather (192)MoLE-RMLPMSE0.19Unverified
Weather2K114 (192)MoLE-DLinearMSE0.41Unverified
Weather2K114 (336)MoLE-DLinearMSE0.42Unverified
Weather2K114 (720)MoLE-DLinearMSE0.43Unverified
Weather2K114 (96)MoLE-DLinearMSE0.39Unverified
Weather2K1786 (192)MoLE-DLinearMSE0.6Unverified
Weather2K1786 (192)MoLE-RLinearMSE0.58Unverified
Weather2K1786 (336)MoLE-DLinearMSE0.6Unverified
Weather2K1786 (720)MoLE-RLinearMSE0.63Unverified
Weather2K1786 (720)MoLE-DLinearMSE0.66Unverified
Weather2K1786 (96)MoLE-RLinearMSE0.54Unverified
Weather2K1786 (96)MoLE-DLinearMSE0.54Unverified
Weather2K79 (192)MoLE-DLinearMSE0.57Unverified
Weather2K79 (336)MoLE-DLinearMSE0.55Unverified
Weather2K79 (720)MoLE-DLinearMSE0.54Unverified
Weather2K79 (96)MoLE-DLinearMSE0.56Unverified
Weather2K850 (192)MoLE-DLinearMSE0.48Unverified
Weather2K850 (336)MoLE-DLinearMSE0.47Unverified
Weather2K850 (720)MoLE-DLinearMSE0.46Unverified

Reproductions