Next Article in Journal
Property (h) of Banach Lattice and Order-to-Norm Continuous Operators
Next Article in Special Issue
Calculation of the Electrostatic Field of a Circular Cylinder with a Slot by the Wiener–Hopf Method
Previous Article in Journal
New Results on the Unimodular Equivalence of Multivariate Polynomial Matrices
Previous Article in Special Issue
Magnetohydrodynamics Williamson Nanofluid Flow over an Exponentially Stretching Surface with a Chemical Reaction and Thermal Radiation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multi-Step-Ahead Wind Speed Forecast Method Based on Outlier Correction, Optimized Decomposition, and DLinear Model

1
School of Qianhu, Nanchang University, Nanchang 330031, China
2
School of Information Engineering, Nanchang University, Nanchang 330031, China
*
Author to whom correspondence should be addressed.
Mathematics 2023, 11(12), 2746; https://doi.org/10.3390/math11122746
Submission received: 22 May 2023 / Revised: 7 June 2023 / Accepted: 8 June 2023 / Published: 17 June 2023

Abstract

:
Precise and dependable wind speed forecasting (WSF) enables operators of wind turbines to make informed decisions and maximize the use of available wind energy. This study proposes a hybrid WSF model based on outlier correction, heuristic algorithms, signal decomposition methods, and DLinear. Specifically, the hybrid model (HI-IVMD-DLinear) comprises the Hampel identifier (HI), the improved variational mode decomposition (IVMD) optimized by grey wolf optimization (GWO), and DLinear. Firstly, outliers in the wind speed sequence are detected and replaced with the HI to mitigate their impact on prediction accuracy. Next, the HI-processed sequence is decomposed into multiple sub-sequences with the IVMD to mitigate the non-stationarity and fluctuations. Finally, each sub-sequence is predicted by the novel DLinear algorithm individually. The predictions are reconstructed to obtain the final wind speed forecast. The HI-IVMD-DLinear is utilized to predict the real historical wind speed sequences from three regions so as to assess its performance. The experimental results reveal the following findings: (a) HI could enhance prediction accuracy and mitigate the adverse effects of outliers; (b) IVMD demonstrates superior decomposition performance; (c) DLinear has great prediction performance and is suited to WSF; and (d) overall, the HI-IVMD-DLinear exhibits superior precision and stability in one-to-four-step-ahead forecasting, highlighting its vast potential for application.

1. Introduction

The finite and non-renewable nature of fossil fuels has rendered the development and utilization of renewable energy an indispensable choice [1]. A report published in 2021 stated that the cumulative installed capacity of wind farms globally skyrocketed to 744 GW [2]. However, the inherent volatility and instability of wind energy led to frequent fluctuations in wind power, thereby causing continuous oscillations in grid voltage and frequency, which severely impairs power quality. Precise and dependable WSF are important for all aspects of wind power systems, including electricity market operation, energy storage system management, network planning, etc.
Recently, numerous forecasting methods have been developed to achieve WSF based on different time scales. These methods primarily include physical methods, statistical methods, machine learning methods, and neural network models. Physical methods rely on physical factors such as altitude and atmospheric pressure to construct models to predict the changes in wind speed. However, most physical factors are difficult to obtain in the vast majority of situations. Additionally, the construction and the calculation of physical models are complex, which makes it difficult to obtain accurate WSF in a short period [3,4,5]. Luckily, statistical methods such as the autoregressive (AR) [6] and autoregressive integrated moving average (ARIMA) methods [7,8] operate at a fast pace. These models were constructed mainly based on historical wind speed operational data. However, statistical methods can only deal with linear data, but not with nonlinear wind speed series. In addition, machine learning methods such as XGBoost [9], support vector regression (SVR) [10,11], and least squares support vector machine (LSSVM) [12] can identify more complex nonlinear relationships in sequences with good generalization abilities. Usually, the predictive performance of machine learning models on large datasets is limited. Compared with conventional machine learning models, neural networks are favorite due to their unique fully connected structure, which promises better prediction accuracy on large datasets. However, neural networks have not fully considered the temporal properties of wind speed sequences. There exists a loss of temporal information in wind speed sequences since neural networks treat time series as unordered and assign equal weight to all time points [13,14,15].
To address this issue, the recurrent neural network (RNN) was proposed. The self-connection among hidden layers in the RNN enables the retention of prior states, which are then incorporated into the current step. This mechanism facilitates the consideration of temporal information during the processing of sequential data [16]. It has two main variations: long short-term memory (LSTM) [17,18,19] and gated recurrent unit (GRU) [20,21,22]. LSTM resolves the problem of gradient vanishing and exploding encountered in RNN by introducing gate mechanisms and a special memory cell structure. Satyam et al. designed a wind speed prediction method with LSTM [17]. GRU is an improvement of LSTM based on a simpler memory cell structure with only two gates (reset gate and update gate) than LSTM. GRU outperforms LSTM in terms of computational efficiency and storage space [20,21,22]. Syu et al. introduced a WSF model based on GRU to provide more precise WSF than RNN and LSTM [23]. Although LSTM and GRU have shown good performance, they usually only model short-term dependencies, while the transformer can handle longer sequences of dependencies since it does not have a recurrent structure and can simultaneously view the entire sequence at all time steps. Furthermore, the self-attention mechanism of the transformer can capture local and global dependencies in a sequence, which can better handle key information in the sequence [24,25,26]. Wu et al. devised a multi-step WSF model based on the transformer, treating the problem as a sequence-to-sequence mapping. The transformer-based model has better prediction performance than the GRU [26]. However, Zeng et al. indicated that the comparatively elevated long-term forecasting accuracy of the transformer does not substantially correlate with its capability to extract temporal dependencies, and proposed a structurally simple DLinear model with better performance than the complex transformer model in most cases [27]. The effectiveness of WSF based on the transformer model should be reevaluated. Currently, the transformer model is widely employed in the field of WSF, yet its effectiveness in this domain is questionable. Its intricate structure does not improve the forecasting accuracy. To validate this proposition about the transformer, this study utilizes the DLinear as the foundational model for prediction, considering the transformer model as a comparable model. Furthermore, another reason to adopte the DLinear mode is its remarkably simple structure, which has exceptional forecasting accuracy.
Numerous studies have shown that, because of the non-stationarity and strong volatility of wind speed sequences, models with decomposition methods perform better in predicting wind speed than those without decomposition [25,28,29,30]. Decomposition-based models usually decompose the wind speed sequence into multiple sub-sequences, then forecast each sub-sequence, and then the ultimate prediction can be obtained by aggregating the results. Currently, the popular decomposition methods include wavelet transform (WT) [31,32], empirical mode decomposition (EMD) [33,34], and ensemble empirical mode decomposition (EEMD) [35,36]. Zhang et al. decomposed the initial wind speed sequence into finite sub-sequences by the complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) algorithm [37]. Subsequently, prediction models were applied to each sub-sequence to make individual forecasts. On the other hand, Pan et al. introduced the VMD method to decompose wind speed signals and exploit their latent information for more accurate forecasting results [38]. Li et al. decomposed ship-radiated noise signals, extracting feature information of different frequencies and amplitudes with successive variational mode decomposition (SVMD) [39]. Furthermore, VMD displays superior performance in decomposing non-stationary signals when compared to EMD and its improved methods [40]. Nevertheless, VMD lacks adaptability because critical parameters, e.g., the number of decomposition modes and regularization, require manual adjustment. The choice of these two parameters can impact the decomposition results and performance significantly. Moreover, the grey wolf optimization (GWO) algorithm exhibits superior optimization capabilities compared to renowned algorithms including particle swarm optimization, gravitational search algorithm, and evolution strategy [41]. Consequently, in this paper, the hyperparameters of VMD will be optimized by the GWO algorithm, thereby addressing the challenge of selecting the appropriate hyperparameters of VMD.
Additionally, there are few researches focusing on detecting and correcting outliers in the original wind speed sequence. It is reported that the predictive accuracy could be enhanced by rectifying outliers within the original sequence [42]. To detect and rectify outliers in the original wind speed sequence, the HI algorithm [43] is introduced to enhance the final accuracy of wind speed prediction.
In recent years, various metrics such as entropy [44] and correlation dimension [45] have been extensively employed in signal analysis across various research domains. Li et al. introduced an innovative technique known as simplified coded dispersion entropy (SCDE) to identify nonlinear dynamic transitions in signals [44]. A novel approach called FuzzDEα was developed to detect dynamic changes in time series data for signal analysis and fault diagnosis in bearings [46]. To assess the level of optimization of the hyperparameters of VMD by GWO, the envelope entropy (EE) [47] as the fitness function for GWO was employed. The magnitude of the EE serves as a criterion to evaluate the quality of the hyperparameters obtained by GWO. The magnitude of SampEn reflects the complexity level of a time series. If the series exhibits higher complexity, the corresponding SampEn value will be larger; conversely, a lower complexity will result in a smaller SampEn value [48]. Therefore, in this study, the SampEn will be utilized to assess the effectiveness of the HI.
Synthetically speaking, to achieve high accuracy and stability in WSF, a hybrid model is proposed based on outlier correction, heuristic algorithms, signal decomposition methods, and DLinear. The model begins by employing the HI to detect and rectify outliers in the original wind speed sequence. Subsequently, the GWO algorithm is utilized to optimize the hyperparameters of the VMD. Then, employing the VMD algorithm based on the optimal hyperparameters, the sequence processed by HI is decomposed into several sub-sequences. Lastly, each sub-sequence is forecasted by the DLinear algorithm individually. The final wind speed forecast is obtained by reconstructing the predictions. The primary contributions of this study are as follows:
(1)
To detect and rectify outliers in the wind speed sequence, an outlier detection technique based on the Hampel identifier (HI) is utilized to enhance the accuracy of WSF.
(2)
To optimize the hyperparameters of VMD, the variational mode decomposition is improved by the grey wolf optimization (GWO). The decomposition of the complex non-stationary windspeed sequence with the improved VMD (IVMD) algorithm can reduce the non-stationarity and the complexity of the sequence, thus improving the prediction stability and accuracy.
(3)
DLinear is introduced as a fundamental prediction model including only one decomposition scheme and two linear networks. Its performance is significantly superior to both LSTM and the currently popular transformer models.
(4)
The proposed method combining HI and IVMD with DLinear is utilized for the multi-step WSF of three real windspeed sequences. The performance of the HI-IVMD-DLinear is validated with comparative experiments from various aspects.
The rest of the paper is organized as follows: In Section 2, HI, GWO, VMD, DLinear, and the proposed method are described in detail. Section 3 elucidates the experimental configuration and elaborates, based on multiple evaluation criteria, on the performance of the proposed model. Section 4 provides a detailed discussion on the computational efficiency and the complexity of the HI-IVMD-DLinear. Finally, a concise conclusion is stated in Section 5.

2. Materials and Methods

2.1. Hampel Identifier

Hampel identifier (HI) is a robust algorithm to detect and replace outliers in datasets [43]. This method identifies any value that falls outside of a certain distance window from the median as an outlier and replaces it with the median value within that window. For dataset D = y 1 , y 2 , , y n , let the window size be w = 2 k + 1 . Typically, window sizes of 5 or 7 are commonly used. The evaluation parameter α is set as 0.6745. By utilizing the median absolute deviation (MAD) and α , the standard deviation σ i can be determined [49].
The HI method is composed of the following steps:
(1)
Computing median, MAD, and standard deviation: For each data point, the median and the MAD of the neighboring points within the window size are calculated, and then the standard deviation based on the median and MAD can be computed as [42]:
m i = median y i n , y i n + 1 , , y i , , y i + n 1 , y i + n
MAD i = median y i n m i , y i n + 1 m i , , y i m i , , y i + n 1 m i , y i + n m i
σ i = MAD i / α
(2)
Detecting outlier points: A sample point is considered as an outlier if its value satisfies [50]:
y i m i > 3 σ i
(3)
Substituting outlier points: For the identified outlier points, the median of the window is used for substitution.
(4)
Performing steps (1)–(3) for each sample point.
The HI method has more advantages over other similar methods in terms of robustness to outliers. Additionally, the HI method is highly efficient in computation, making it suitable for large-scale datasets. Processing the dataset with HI can effectively correct its outliers and enhance the accuracy in WSF.

2.2. Variational Mode Decomposition

The VMD is an adaptive decomposition algorithm [51]. Compared to traditional modal decomposition methods, the VMD could avoid aliasing and is more robust to noise.
The VMD method is capable of decomposing complex raw sequences into several relatively simple intrinsic mode functions (IMFs). The VMD is composed of the following steps:
(1)
Construct the variational problem: It is essential for the variational problem to minimize the sum of central frequencies of the IMFs [51]:
min u k , ω k k = 1 K t δ ( t ) + j π t u k ( t ) e j ω k t 2 2
s . t .   k = 1 K u k ( t ) = f ( t )
where u k and ω k are the k - th IMF and its corresponding center frequency, respectively; δ ( t ) is Dirac function; f ( t ) is the original input signal; and K is the number of IMFs.
(2)
Transform variational problems: To make it easier to solve the variational problem above, a Lagrange function is introduced [51]:
L ( { u k } , { ω k } , λ ) = α k t δ ( t ) + j π t u k ( t ) e j ω k t 2 2 + f ( t ) Σ k u k ( t ) 2 2 + λ ( t ) , f ( t ) k u k ( t )
where α represents the penalty factor, and λ represents the Lagrange multiplier.
(3)
Solve the variational problem: To achieve the best solution to the variational problem, the decomposition signal u k and their corresponding center frequencies ω k were updated by the alternate direction method of multipliers (ADMM). The cyclic updating rules and termination conditions for u k and ω k are as follows [51]:
k u k n + 1 u k n 2 2 u k n 2 2 < ε , n < N
u k n + 1 ( ω ) = f ( ω ) i k k u i n ( ω ) + λ n ( ω ) 2 1 + 2 a ω ω k n 2
ω k n + 1 = 0 ω u k n ( ω ) 2 d ω 0 u k n ( ω ) 2 d ω
where f ( ω ) , u i n ( ω ) , and λ n ( ω ) denote the Fourier transform of f ( t ) , u i n ( t ) , and λ n ( t ) , respectively; and n and N are the number and the maximal number of iterations, respectively.

2.3. Grey Wolf Optimization

As a novel heuristic intelligent algorithm, Grey wolf optimization (GWO) [41] seeks the best solution based on the hunting characteristics of wolf packs and the social hierarchy system of grey wolves. There are four social ranks within a wolf pack: the alpha wolf ( α ), the wolves that obey the alpha ( β ), the wolves that obey the top two wolves ( δ ), and the wolves that obey higher-ranked wolves ( ω ). Their hunting process is:
(1)
Wolves surround their prey:
D = C Y p ( i ) Y ( i )
Y ( i + 1 ) = Y p ( i ) A D
where i denotes the current iteration, and D represents the distance between Y p ( i ) (prey) and Y ( i ) (grey wolves). A and C (the coefficient vector) can be represented as:
A = 2 b r 1 b
C = 2 b r 2
where b is linearly diminished from 2 to 0 throughout the iterations; and r 1 and r 2 represent vector compositions comprising random elements, with the values of these elements ranging from 0 to 1. The grey wolves change their positions according to Equation (12).
(2)
Capturing prey: As the location of prey cannot be determined, the optimal strategy cannot be identified either. Therefore, assuming that the α wolf is closest to the prey, followed by β and δ wolves, their distances from the prey are calculated with Equation (11). By iteratively updating the positions of these three types of wolves with Equation (12), the other wolves will also gradually approach the prey. Ultimately, the position of the α wolf is considered to be the location of the prey, leading to the optimal solution.
D α = C 1 Y α Y
D β = C 2 Y β Y
D δ = C 3 Y δ Y
Y 1 = Y α A 1 D α
Y 2 = Y β A 2 D β
Y 3 = Y δ A 2 D δ
Y ( i + 1 ) = Y 1 + Y 2 + Y 3 3
where Y j ( j = α , β , δ ) represents the position of the corresponding individual.

2.4. VMD Optimized by GWO

In practical applications, the hyperparameters K and α of VMD are directly related to the quality of the decomposition results and are often difficult to determine, although the VMD technique exhibits exceptional decomposition capabilities for wind speed sequences. An appropriate value of K can fully decompose the modal sequence, circumventing the emergence of mode-blending issues. α determines the accuracy of signal reconstruction. Therefore, appropriate K and α are crucial for the wind speed sequence decomposition process.
Therefore an improved VMD (IVMD) based on the GWO is proposed. The IVMD method determines K and α with the GWO. The range of K is set as [3, 12] and that of α is set as [0, 2000]. When the decomposed signal has less noise, the EE is smaller, and vice versa. Therefore, the minimal EE E p is utilized as the fitness function for the GWO.
E p = i = 1 N p i lg p i
p i = c ( i ) i = 1 N c ( i )
c ( i ) = x 2 ( i ) + x 2 ( t )
where N is the length of the signal, x ( i ) represents the value of the i - th sample point of the decomposed sequence (IMF), and x ( i ) represents the demodulated result of Hibert of x ( i ) . The minimal envelope entropy is:
min { E p } = min { E p 1 , E p 2 , , E p K }
where E p j represents the value of the EE of the j - th IMF.
The flowchart of IVMD is shown in Figure 1, and the steps of IVMD are as follows:
(1)
Initialize the search space, encompass the ranges of K and α . Additionally, initiate the parameters of the grey wolf optimization algorithm, such as population size, maximum number of iterations, and so forth.
(2)
Generate the initial population of grey wolves randomly within the provided search space. For each grey wolf denoted by i = 1 , 2 , , N (where N represents the total number of grey wolves), the position Y i is initialized as ( K i , α i ).
(3)
Calculate the envelope entropy of each grey wolf with Equation (22). The positions of the three grey wolves with the lowest envelope entropy values are updated by Y α , Y β , and Y δ , respectively. Y α with the best fitness value is recognized as the optimal solution.
(4)
Compute the distance between the remaining grey wolf individuals ( ω ) and the top three grey wolf individual locations Y α , Y β and Y δ according to Equations (15)–(17).
(5)
According to Equations (18)–(21), update the position of individual grey wolves.
(6)
If the iteration of GWO reaches maximum, the algorithm ends and outputs an optimal solution Y α ; otherwise, return to (3) and continue the optimization search.

2.5. DLinear

DLinear is a novel high-precision time-series forecasting model proposed by Zeng et al. in 2022 [27]. Despite its simple structure, consisting solely of a decomposition scheme and two linear networks, its predictive accuracy exceeds that of the more complex transformer model.
During the prediction process, the DLinear first decomposes the original sequence X into a trend component X t and a residual one X r ( X r = X X t ) . Subsequently, two single-layer linear networks are utilized to forecast each of these decomposed components, respectively.
The foundational architecture of DLinear is depicted in Figure 2a. The output results of the two single-layer linear networks are combined to yield the final predicted outcome X ^ [27].
H r = W r X r
H t = W t X t
X ^ = H t + H r
where H r and H t are the output values of the single-layer linear networks for the residual and trend components, respectively. Similarly, W r and W t represent the single-layer linear networks for the residual and the trend components, as depicted in Figure 2b.

2.6. Framework of the Proposed Model

The HI-IVMD-DLinear model is designed to achieve accurate multi-step WSF. The basic framework of this hybrid WSF model is illustrated in Figure 3, which mainly consists of three steps as follows:
Step 1: Outlier detection and replacement based on HI. Due to equipment malfunctions, human factors, and other reasons, it is inevitable that the collected wind speed data will contain outliers during the data collection process. The HI method is used to detect and replace outliers in the dataset, which is beneficial to improve the accuracy of WSF.
Step 2: Decomposition of wind speed sequences. The sequence processed by the HI is considered as input for the VMD. The GWO is then employed to optimize the hyperparameters K and α of the VMD with the minimal envelope entropy as the fitness function. Based on the optimized values of K and α , the VMD decomposes the sequence into K IMFs.
Step 3: Prediction with DLinear. The DLinear model is constructed to predict each of the K IMFs obtained from the decomposition. Subsequently, the predicted results are summed to derive the ultimate wind speed prediction.

3. Results

3.1. Design of the Experiment

3.1.1. Data Source

The historical wind speed datasets collected from three regions in China serve as the experimental dataset. These three regions are, respectively, located in Shijiazhuang, Hebei Province; Lanzhou, Gansu Province; and Nanjing, Jiangsu Province. Their latitudes and longitudes are significantly different. Lanzhou and Shijiazhuang are situated in the northwestern and northern inland regions, respectively, both possessing abundant wind energy resources. On the other hand, Nanjing is located in the southeastern coastal area and consistently ranks among the top in terms of offshore wind power installed capacity nationwide. The wind speed sequences from all three regions were measured at a height of ten meters above ground level at hourly intervals. The basic information of the three wind speed datasets is presented in Table 1.

3.1.2. Evaluation Metrics

To evaluate the accuracy of the prediction methods, the mean absolute error (MAE), root mean square error (RMSE), and mean absolute percentage error (MAPE) are employed
MAE = 1 N t = 1 N Y ^ t Y t
RMSE = 1 N t = 1 N Y ^ t Y t 2
MAPE = t = 1 N Y ^ t Y t Y t N × 100 %
where Y ^ t represents the predicted value of the wind speed, Y t represents the observed value, and N refers to the number of test-set samples. Generally speaking, as the values of these metrics decrease, the prediction accuracy of the model increases.
Furthermore, improvement percentage is utilized to quantitatively evaluate the proposed model. P RMSE , P MAE , and P MAPE are the improvement percentages for RMSE, MAE, and MAPE, respectively.
P RMSE = RMSE i RMSE j RMSE i × 100 %
P MAE = MAE i MAE j MAE i × 100 %
P MAPE = MAPE i MAPE j MAPE i × 100 %
where RMSE i , MAE i , and MAPE i represent the errors of the comparative methods, while RMSE j , MAE j , and MAPE j represent the errors of the HI-IVMD-DLinear method. The larger the P RMSE , P MAE , and P MAPE are, the more superior the precision of the proposed model is.
In addition, the variance of absolute error (VAE) is introduced to assess the stability of the model.
VAE = Var Y t Y ^ t
Simultaneously, the improvement percentage of VAE is also introduced to compare the proposed model with the comparative model.
P VAE = VAE i VAE j VAE i × 100 %
where VAE i and VAE j represent the VAE of the comparative model and the proposed one, respectively.

3.1.3. Model Development

To assess the performance of HI-IVMD-DLinear, a machine learning model, SVR, two prevalent neural network models, namely, back propagation neural network (BPNN) and LSTM, as well as a popular deep learning algorithm, the transformer, are incorporated as comparative models.
The input for predicting the output values includes the true values from the previous 24 h; i.e., the time window size is 24 h. Table 2 provides the parameter settings for all relevant models, along with the methods used to confirm these parameters. The dataset is divided into training, validation, and testing sets at a ratio of 7:1:2. Additionally, all models employ mean squared error (MSE) as the loss function. To optimize the weights and enhance the prediction performance, the Adam algorithm is employed as an optimizer [52].

3.2. Analysis of Hampel Identifier

The performance of utilizing HI for the data cleaning of the wind speed sequence is explored in this section. As illustrated in Figure 4, all three wind speed datasets exhibit some outliers. Failure to clean these outliers would adversely impact the accuracy of the final WSF. Therefore, the HI method can be utilized to handle the outliers in the wind speed sequences. The effectiveness of the HI can generally be evaluated by calculating the sample entropy of the sequences. The magnitude of the SampEn value reflects the complexity of the sequence [48]. If the complexity of the sequence is greater, the SampEn value will be larger, and vice versa. The SampEn values of the original wind speed sequences and the HI-processed wind speed sequences are presented in Table 3. It is evident that the SampEn values of all three wind speed sequences are reduced after applying the HI method. It indicates that the HI method can reduce the complexity of the original sequences.
The predictive performances of models with and without HI are also compared. Table 4 gives the forecasting accuracy of models with HI processing and without HI processing under the three datasets. It can be observed that the improvement percentages of MAPE are 1.2316%, 2.1240%, and 2.1531% compared with the HI-IVMD-DLinear with IVMD-DLinear, respectively. Other HI-based models also reduce the RMSE, MAE, and MAPE values. Therfore the HI can enhance the accuracy of WSF, since HI is able to identify and rectify outliers in the original wind speed sequence, which can efficiently mitigate the interference caused by such outliers.

3.3. Decomposition Results

According to the augmented Dickey–Fuller (ADF) test results presented in Table 5, the ADF statistics of the three datasets, after undergoing HI processing, are all below the critical values at the 1%, 5%, and 10% confidence levels. Additionally, their p-values are greater than 0.1. It is evident that the three wind speed sequences are non-stationary. Hence, it is imperative to decompose the wind speed sequences appropriately to reduce their complexity. The parameters of VMD are optimized by the GWO. The obtained hyperparameters K and α for VMD on the three datasets are 4 and 1.2771, 4 and 0.4501, 5 and 0.2580, respectively.
Taking Lanzhou as an example, the decomposition results of the wind speed sequence, after undergoing HI processing, are illustrated in Figure 5a. IVMD decomposes the sequence of Lanzhou into four IMFs. Among them, IMF1 has the lowest frequency and displays the long-term trend of the wind speed sequence. IMF2 and IMF3 belong to the mid-frequency range signals, reflecting the fluctuations within smaller periods. IMF4 represents the high-frequency range signal. Evidently, the sequence exhibits a more regular pattern after the IVMD decomposition.

3.4. Forecasting Results

The HI-processed dataset is utilized for IVMD decomposition, with the subsequent prediction of each modal component with DLinear. Summing the predicted outcomes of these components yields the final WSF. Specifically, during training, only the training set should undergo decomposition, ensuring the data in the test set remain unknown. This safeguards against the inflated accuracy resulting from test set leakage. Subsequently, the IMFs derived from the training set decomposition are employed to train the model, while hyperparameter selection is performed on the validation set. Finally, test is conducted on the designated subsequence of the complete dataset. To assess the performance of the HI-IVMD-DLinear, HI-SVR, HI-BPNN, HI-LSTM, HI-Transformer, HI-DLinear, HI-IVMD-BPNN, HI-IVMD-LSTM, and HI-IVMD-Transformer are considered.
During the process of WSF, it is important to forecast wind speeds for multiple hours in advance. For instance, multi-step WSF assists wind power generation companies in accurately anticipating changes in wind speed over a specific period. This enables them to devise more efficient power generation plans and scheduling strategies to enhance the capacity and the efficiency of wind power generation. Therefore, the introduction of multi-step WSF is crucial. Given the wind speed sequence { Y 1 , Y 2 , Y 3 , , Y T } , the forecasting value for the k - th step can be calculated as
Y ^ t + k = f Y t , Y t 1 , , Y t ( w 1 ) , t = 1 , 2 , , T
where Y ^ t + k represents the predicted value at time t + k , Y t represents the observed value at time t , and w refers to the lag order. The value of w is set as 24, in other words, the model takes the past 24 h wind speed sequence as its input. The horizon k ranges from 1 to 4.

3.4.1. Forecasting Accuracy

The forecasting results of three metrics for HI-IVMD-DLinear and other comparative models are presented in Table 6, Table 7 and Table 8. In all three datasets, the HI-IVMD-DLinear model outperforms the others in terms of MAE, RMSE, and MAPE. The HI-IVMD-DLinear exhibits the best predictive accuracy among the comparative models and is better suitable for the WSF task.
Taking Lanzhou as an example, compared with other models, the HI-IVMD-DLinear model achieves lower MAE, RMSE, and MAPE values. The MAE, RMSE, and MAPE values for the HI-IVMD-DLinear’s one-step-ahead prediction are 0.0705, 0.0881, and 0.0479, respectively, which are smaller than those of other models. Specifically, the MAPE values for HI-SVR, HI-BPNN, HI-LSTM, HI-Transformer, HI-DLinear, HI-IVMD-BPNN, HI-IVMD-LSTM, HI-IVMD-Transformer, and HI-IVMD-DLinear are 0.3839, 0.3082, 0.2920, 0.2705, 0.2418, 0.0917, 0.0792, 0.0632, and 0.0479, respectively. Among them, the HI-IVMD-DLinear achieves the smallest MAPE value. Moreover, for one-step-ahead to four-steps-ahead predictions, the HI-IVMD-DLinear obtains the optimal MAPE values of 0.0773, 0.0956, and 0.1266, respectively. Additionally, the HI-IVMD-DLinear also obtains the best MAE and RMSE values.
Furthermore, according to Table 6, Table 7 and Table 8, it is evident that the hybrid methods based on IVMD outperform the individual models in terms of accuracy. Taking the one-step-ahead forecasting in Lanzhou as an example, the MAPE value of the HI-DLinear is 0.2418, while that of the HI-IVMD-DLinear is 0.0479. Therefore the IVMD method helps diminish the complexity of the wind speed series enables the forecasting model to capture valuable patterns within the wind speed series, and effectively improves the forecasting performance.
The comparison between the predicted and the observed wind speed values in the three datasets is illustrated in Figure 6. It can be observed that the prediction curve of the HI-IVMD-DLinear hybrid model closely resembles that of the observed wind speed values. This highlights better predictive accuracy of the HI-IVMD-DLinear model in the field of WSF than other comparative models.
Meanwhile, compared with the individual HI-SVR, HI-BPNN, HI-LSTM, HI-Transformer, and HI-DLinear models, the prediction curves of HI-IVMD-SVR, HI-IVMD-BPNN, HI-IVMD-LSTM, HI-IVMD-Transformer, and HI-IVMD-DLinear across the three datasets exhibit a higher degree of similarity to the observed wind speed curves. Thus the IVMD decomposition is helpful in WSF.

3.4.2. Improvement Percentage in Accuracy

The improvement percentages of accuracy metrics for the HI-IVMD-DLinear model are presented in Table 9, Table 10 and Table 11. It can be seen that the proposed model consistently achieves the lowest prediction errors across all four forecasting horizons on the three wind speed datasets. Thus the accuracy of the proposed model is acceptable.
As shown in Table 10, for the Nanjing dataset, the improvement percentages of MAPE for one-step-ahead prediction by the HI-IVMD-DLinear relative to HI-SVR, HI-BPNN, HI-LSTM, HI-Transformer, and HI-DLinear are 87.5236%, 84.4584%, 83.5969%, 82.2938%, and 80.1936% respectively. This indicates the necessity of decomposing the original sequence with IVMD in the WSF process.
Furthermore, like HI-IVMD-BPNN, HI-IVMD-LSTM, and HI-IVMD-Transformer, HI-IVMD-DLinear also demonstrates significantly higher prediction accuracy. As shown in Table 10, compared with HI-IVMD-BPNN, HI-IVMD-LSTM, and HI-IVMD-Transformer, the HI-IVMD-DLinear exhibits improvement percentages 58.2622%, 52.9276%, and 36.0750% in MAPE for the one-step-ahead and two-steps-ahead predictions, respectively.
Therefore the HI-IVMD-DLinear outperforms the other models in the field of WSF in terms of accuracy.

3.4.3. Analysis of Forecasting Errors

Figure 7 is the frequency distribution of the predictive errors with the proposed model and eight other comparative models, regarding one-step-ahead predictions. The graph indicates that the error of the model based on the IVMD is small. Additionally, it can be observed that the HI-IVMD-DLinear presents the smallest errors among the majority of the data points in the test set.
Figure 8 illustrates the distribution of errors for each model. It is noticeable that the HI-IVMD-DLinear exhibits a higher concentration of prediction errors around zero for each dataset than the other models, with a smaller range of error variation. This implies that the HI-IVMD-DLinear possesses exceptional predictive accuracy and robustness.

3.4.4. Stability Analysis

Table 12 is the variance of absolute errors (VAE) with the proposed model and all comparative models across three datasets for multi-step ahead predictions. It reveals that the proposed model has good stability since it obtains the lowest VAE for one-to-four-steps-ahead predictions across all three datasets. For instance, in the Lanzhou dataset, the HI-IVMD-DLinear achieves VAE values of 0.0028, 0.0084, 0.0178, and 0.0264 for one-to-four-steps-ahead predictions, respectively. These values are consistently lower than those of the other comparative models.
The improvement percentages of VAE for the HI-IVMD-DLinear together with other comparative models are depicted in Figure 9. Compared to HI-SVR, HI-BPNN, HI-LSTM, HI-Transformer, and HI-DLinear, the HI-IVMD-DLinear exhibits a reduction in VAE of over 50% under the three datasets,. The IVMD decomposition also enhances the stability of predictions. Furthermore, compared to HI-IVMD-BPNN, HI-IVMD-LSTM, and HI-IVMD-Transformer, the HI-IVMD-DLinear achieves a reduction in VAE ranging from 9.0707% to 85.8084%. Hence, the DLinear-based model is more stable than BPNN, LSTM, and transformer.

3.5. Comparative Analysis of Decomposition Strategies

To validate the decomposition performance of IVMD, IVMD is compared with the decomposition strategies including EMD [53], CEEMDAN [37], and the recently published CEEMDAN-VMD [54] and CEEMDAN-LMD [55]. Additionally, the strategy without decomposition methods is also compared. The CEEMDAN-VMD method begins by decomposing the wind speed sequence into several IMFs with CEEMDAN, followed by another decomposition of the highest-frequency IMF with VMD. Similarly, the CEEMDAN-LMD method involves decomposing the wind speed sequence into multiple IMFs with CEEMDAN, and subsequently decomposing the IMF1 generated from the CEEMDAN decomposition with LMD. The comparative results are presented in Table 13.
  • Based on the data presented in Table 13, the following conclusions can be inferred:
  • Compared with the other decomposition strategies, the predictive models based on IVMD demonstrate the minimal RMSE values, specifically, 0.1712, 0.1668, 0.1472, 0.1253, and 0.0881. This further validates the superior performance of IVMD over the other decomposition strategies. CEEMDAN-VMD and CEEMDAN-LMD fail to address the inherent mode-mixing issue in the CEEMDAN algorithm, although they employ secondary decomposition, which reduces the complexity of sequences once again to some extent. This is why both have lower performance than IVMD.
  • Compared to traditional machine learning methods like SVR, deep learning methods including BPNN, LSTM, transformer, and DLinear present significant improvement in predictive accuracy when combined with decomposition methods. For instance, the RMSE of IVMD-SVR and the SVR are 0.3015 and 0.5533, respectively. The RMSE is reduced by only 45.50% when incorporating IVMD. However, IVMD-DLinear and DLinear achieve an RMSE of 0.4332 and 0.0881, respectively. It is demonstrated that a remarkable RMSE reduction of 79.66% is achieved when combined with IVMD.
  • For the same decomposition strategy, DLinear consistently obtains the lowest RMSE, implying DLinear generally has optimal accuracy.
  • Among different combinations of decomposition strategies and original prediction models, IVMD-DLinear achieves the lowest RMSE of 0.0881. Therefore IVMD-DLinear has best predictive performance than the aforementioned combinations.

4. Discussion

WSF is a complex task influenced by various factors, such as temperature, humidity, and air pressure. These factors contribute to the non-stationary and nonlinear characteristics of wind speed sequences. It is challenging to forecast wind speeds with a single prediction model accurately. Precise WSF holds significant importance in the energy industry, as higher accuracy forecasts can help reduce operational costs of power systems.

4.1. Discussion of Computational Efficiency

In terms of computational efficiency, the proposed method outperforms the other prediction models. Specifically, the VMD allows for higher computational efficiency with distributed storage and parallel computing techniques, since each IMF’s prediction is independent of the others. Furthermore, the DLinear model has high efficiency, with each branch containing only a single linear layer. Significantly lower memory and fewer parameters are involved than the transformer, and faster calculation speeds.

4.2. Discussion of Computational Complexity

Compared with the other non-decomposition methods, the IVMD-DLinear increases the computational complexity within an acceptable range. The predictions for each IMF are obtained by applying the DLinear model to each IMF and the hyperparameter in VMD is optimized by the GWO algorithm, which significantly increases the computational complexity. However, it is reasonable since the substantial improvement in prediction accuracy augments the economic efficacy of wind power systems significantly.

5. Conclusions

Accurate and robust WSF is of great importance for the advancement of the wind power industry. Nevertheless, the intricate and non-stationary nature of wind speed sequences poses a significant challenge to achieve precise predictions. Therefore, a WSF model (HI-IVMD-DLinear) based on outlier correction, heuristic algorithms, and sequence decomposition is proposed to achieve high precision and robust wind speed forecasting. Firstly, the outliers in the wind speed sequence are detected and corrected with the outlier correction method HI to reduce the adverse effects of outliers on prediction accuracy. Secondly, the hyperparameters K and α of the VMD are optimized by the GWO. Thirdly, with the optimized K and α , the wind speed sequence processed by HI is decomposed into several IMFs by the VMD, and the non-stationarity and the complexity of the sequence are reduced. Finally, each IMF is individually predicted by the novel DLinear algorithm, and the predicted outputs are summed to obtain the final wind speed prediction.
The experimental results conducted on wind speed datasets from three cities in China validate the predictive performance of the HI-IVMD-DLinear. Based on the experiments, the following conclusions can be drawn:
  • HI assists in mitigating the detrimental effects of outliers on prediction accuracy, and enhances the overall precision of the predictions. HI can detect and correct outliers in wind speed series and reduce their interference in prediction.
  • The IVMD algorithm demonstrates significant advantages compared to the EEMD, CEEMDAN, CEEMDAN-VMD, and CEEMDAN-LMD algorithms. The CEEMDAN algorithm shows spurious modes during decomposition, which can affect the accuracy of predictions to some extent. CEEMDAN-VMD and CEEMDAN-LMD fail to address the mode-mixing issue in CEEMDAN, although they employ secondary decomposition to reduce sequence complexity to some extent.
  • The DLinear model has better optimal performance than the SVR, BPNN, LSTM, and transformer models. Simultaneously, DLinear is stable with higher prediction accuracy than that of the widely used and highly accurate transformer or LSTM models in the field of WSF, and it is not necessary to adjust its hyperparameters. Therefore, DLinear is more suitable for WSF than transformer and LSTM.
  • In the one-to-four-steps-ahead forecasting on the three datasets, the HI-IVMD-DLinear model demonstrates excellent prediction accuracy compared with the other eight comparative models. This hybrid model utilizes HI for outlier correction, IVMD for sequence decomposition, and DLinear for prediction. The performance of the hybrid model has been validated at each stage.
Nevertheless, our study does possess certain limitations. Primarily, it relies heavily on simulations due to the current cost constraints that prevent us from conducting field measurements.

Author Contributions

Conceptualization, J.L.; methodology, J.L.; investigation, J.L., C.G., S.C. and N.Z.; data curation, J.L. and S.C.; writing—original draft preparation, J.L. and C.G.; validation, C.G. and N.Z.; writing—reviewing and editing, S.C. and N.Z.; supervision, N.Z.; funding acquisition, N.Z.; project administration, N.Z.; resources, N.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported in part by the National Natural Science Foundation of China, grant number 62162041; in part by the Science & Technology Planning Project of Shanghai, grant number 23010501800; and in part by the Top Double 1000 Talent Programme of Jiangxi Province, grant number JXSQ2019201055.

Institutional Review Board Statement

The study did not involve humans or animals.

Informed Consent Statement

The study did not involve humans.

Data Availability Statement

Publicly available datasets and code were analyzed in this study. These data and code can be found here: https://github.com/2076959577/PaperCode (accessed on 7 June 2023).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Dong, F.; Li, W. Research on the Coupling Coordination Degree of “Upstream-Midstream-Downstream” of China’s Wind Power Industry Chain. J. Clean. Prod. 2021, 283, 124633. [Google Scholar] [CrossRef]
  2. World Wind Energy Association. Worldwide Wind Capacity Reaches 744 Gigawatts—An Unprecedented 93 Gigawatts Added in 2020; World Wind Energy Association: Bonn, Germany, 2021. [Google Scholar]
  3. Wang, L.; Li, J. Estimation of Extreme Wind Speed in SCS and NWP by a Non-Stationary Model. Theor. Appl. Mech. Lett. 2016, 6, 131–138. [Google Scholar] [CrossRef] [Green Version]
  4. Zhao, J.; Guo, Z.-H.; Su, Z.-Y.; Zhao, Z.-Y.; Xiao, X.; Liu, F. An Improved Multi-Step Forecasting Model Based on WRF Ensembles and Creative Fuzzy Systems for Wind Speed. Appl. Energy 2016, 162, 808–826. [Google Scholar] [CrossRef]
  5. Yang, J.; Astitha, M.; Delle Monache, L.; Alessandrini, S. An Analog Technique to Improve Storm Wind Speed Prediction Using a Dual NWP Model Approach. Mon. Weather. Rev. 2018, 146, 4057–4077. Available online: https://journals.ametsoc.org/view/journals/mwre/146/12/mwr-d-17-0198.1.xml (accessed on 30 April 2023). [CrossRef]
  6. Karakuş, O.; Kuruoğlu, E.E.; Altınkaya, M.A. One-day Ahead Wind Speed/Power Prediction Based on Polynomial Autoregressive Model. IET Renew. Power Gener. 2017, 11, 1430–1439. Available online: https://ietresearch.onlinelibrary.wiley.com/doi/10.1049/iet-rpg.2016.0972 (accessed on 30 April 2023). [CrossRef] [Green Version]
  7. Aasim; Singh, S.N.; Mohapatra, A. Repeated Wavelet Transform Based ARIMA Model for Very Short-Term Wind Speed Forecasting. Renew. Energy 2019, 136, 758–768. [Google Scholar] [CrossRef]
  8. Yatiyana, E.; Rajakaruna, S.; Ghosh, A. Wind Speed and Direction Forecasting for Wind Power Generation Using ARIMA Model. In Proceedings of the 2017 Australasian Universities Power Engineering Conference (AUPEC), Melbourne, Australia, 19–22 November 2017; pp. 1–6. [Google Scholar]
  9. Phan, Q.T.; Wu, Y.K.; Phan, Q.D. A Hybrid Wind Power Forecasting Model with XGBoost, Data Preprocessing Considering Different NWPs. Appl. Sci. 2021, 11, 1100. [Google Scholar] [CrossRef]
  10. Santamaría-Bonfil, G.; Reyes-Ballesteros, A.; Gershenson, C. Wind Speed Forecasting for Wind Farms: A Method Based on Support Vector Regression. Renew. Energy 2016, 85, 790–809. [Google Scholar] [CrossRef]
  11. Chen, J.; Zeng, G.-Q.; Zhou, W.; Du, W.; Lu, K.-D. Wind Speed Forecasting Using Nonlinear-Learning Ensemble of Deep Learning Time Series Prediction and Extremal Optimization. Energy Convers. Manag. 2018, 165, 681–695. [Google Scholar] [CrossRef]
  12. Wang, G.; Wang, X.; Wang, Z.; Ma, C.; Song, Z. A VMD–CISSA–LSSVM Based Electricity Load Forecasting Model. Mathematics 2021, 10, 28. [Google Scholar] [CrossRef]
  13. Ren, C.; An, N.; Wang, J.; Li, L.; Hu, B.; Shang, D. Optimal Parameters Selection for BP Neural Network Based on Particle Swarm Optimization: A Case Study of Wind Speed Forecasting. Knowl.-Based Syst. 2014, 56, 226–239. [Google Scholar] [CrossRef]
  14. Yang, Z.; Wang, J. A Hybrid Forecasting Approach Applied in Wind Speed Forecasting Based on a Data Processing Strategy and an Optimized Artificial Intelligence Algorithm. Energy 2018, 160, 87–100. [Google Scholar] [CrossRef]
  15. Wang, Z.; Wang, B.; Liu, C.; Wang, W. Improved BP Neural Network Algorithm to Wind Power Forecast. J. Eng. 2017, 2017, 940–943. [Google Scholar] [CrossRef]
  16. Duan, J.; Zuo, H.; Bai, Y.; Duan, J.; Chang, M.; Chen, B. Short-Term Wind Speed Forecasting Using Recurrent Neural Networks with Error Correction. Energy 2021, 217, 119397. [Google Scholar] [CrossRef]
  17. Gangwar, S.; Bali, V.; Kumar, A. Comparative Analysis of Wind Speed Forecasting Using LSTM and SVM. EAI Endorsed Trans. Scalable Inf. Syst. 2020, 7, e1. [Google Scholar] [CrossRef]
  18. Ying, X.; Zhao, K.; Liu, Z.; Gao, J.; He, D.; Li, X.; Xiong, W. Wind Speed Prediction via Collaborative Filtering on Virtual Edge Expanding Graphs. Mathematics 2022, 10, 1943. [Google Scholar] [CrossRef]
  19. Mousavi, S.M.; Ghasemi, M.; Dehghan Manshadi, M.; Mosavi, A. Deep Learning for Wave Energy Converter Modeling Using Long Short-Term Memory. Mathematics 2021, 9, 871. [Google Scholar] [CrossRef]
  20. Liu, H.; Mi, X.; Li, Y.; Duan, Z.; Xu, Y. Smart Wind Speed Deep Learning Based Multi-Step Forecasting Model Using Singular Spectrum Analysis, Convolutional Gated Recurrent Unit Network and Support Vector Regression. Renew. Energy 2019, 143, 842–854. [Google Scholar] [CrossRef]
  21. Xiang, L.; Li, J.; Hu, A.; Zhang, Y. Deterministic and Probabilistic Multi-Step Forecasting for Short-Term Wind Speed Based on Secondary Decomposition and a Deep Learning Method. Energy Convers. Manag. 2020, 220, 113098. [Google Scholar] [CrossRef]
  22. Zhang, G.; Liu, D. Causal Convolutional Gated Recurrent Unit Network with Multiple Decomposition Methods for Short-Term Wind Speed Forecasting. Energy Convers. Manag. 2020, 226, 113500. [Google Scholar] [CrossRef]
  23. Syu, Y.-D.; Wang, J.-C.; Chou, C.-Y.; Lin, M.-J.; Liang, W.-C.; Wu, L.-C.; Jiang, J.-A. Ultra-Short-Term Wind Speed Forecasting for Wind Power Based on Gated Recurrent Unit. In Proceedings of the 2020 8th International Electrical Engineering Congress (iEECON), Chiang Mai, Thailand, 4–6 March 2020; pp. 1–4. [Google Scholar]
  24. Wu, N.; Green, B.; Ben, X.; O’Banion, S. Deep Transformer Models for Time Series Forecasting: The Influenza Prevalence Case. arXiv 2020, arXiv:2001.08317. [Google Scholar]
  25. Bommidi, B.S.; Teeparthi, K.; Kosana, V. Hybrid Wind Speed Forecasting Using ICEEMDAN and Transformer Model with Novel Loss Function. Energy 2023, 265, 126383. [Google Scholar] [CrossRef]
  26. Wu, H.; Meng, K.; Fan, D.; Zhang, Z.; Liu, Q. Multistep Short-Term Wind Speed Forecasting Using Transformer. Energy 2022, 261, 125231. [Google Scholar] [CrossRef]
  27. Zeng, A.; Chen, M.; Zhang, L.; Xu, Q. Are Transformers Effective for Time Series Forecasting? arXiv 2022, arXiv:2205.13504. [Google Scholar]
  28. Büyükşahin, Ü.Ç.; Ertekin, Ş. Improving Forecasting Accuracy of Time Series Data Using a New ARIMA-ANN Hybrid Method and Empirical Mode Decomposition. Neurocomputing 2019, 361, 151–163. [Google Scholar] [CrossRef] [Green Version]
  29. Zhang, J.; Wei, Y.; Tan, Z. An Adaptive Hybrid Model for Short Term Wind Speed Forecasting. Energy 2020, 190, 115615. [Google Scholar] [CrossRef]
  30. Shang, Z.; He, Z.; Chen, Y.; Chen, Y.; Xu, M. Short-Term Wind Speed Forecasting System Based on Multivariate Time Series and Multi-Objective Optimization. Energy 2022, 238, 122024. [Google Scholar] [CrossRef]
  31. Liu, D.; Niu, D.; Wang, H.; Fan, L. Short-Term Wind Speed Forecasting Using Wavelet Transform and Support Vector Machines Optimized by Genetic Algorithm. Renew. Energy 2014, 62, 592–597. [Google Scholar] [CrossRef]
  32. Liu, H.; Mi, X.; Li, Y. Wind Speed Forecasting Method Based on Deep Learning Strategy Using Empirical Wavelet Transform, Long Short Term Memory Neural Network and Elman Neural Network. Energy Convers. Manag. 2018, 156, 498–514. [Google Scholar] [CrossRef]
  33. Wang, J.; Zhang, W.; Li, Y.; Wang, J.; Dang, Z. Forecasting Wind Speed Using Empirical Mode Decomposition and Elman Neural Network. Appl. Soft Comput. 2014, 23, 452–459. [Google Scholar] [CrossRef]
  34. Liu, H.; Chen, C.; Tian, H.; Li, Y. A Hybrid Model for Wind Speed Prediction Using Empirical Mode Decomposition and Artificial Neural Networks. Renew. Energy 2012, 48, 545–556. [Google Scholar] [CrossRef]
  35. A Complete Ensemble Empirical Mode Decomposition with Adaptive Noise. Available online: https://ieeexplore.ieee.org/document/5947265/ (accessed on 7 April 2023).
  36. Wang, S.; Zhang, N.; Wu, L.; Wang, Y. Wind Speed Forecasting Based on the Hybrid Ensemble Empirical Mode Decomposition and GA-BP Neural Network Method. Renew. Energy 2016, 94, 629–636. [Google Scholar] [CrossRef]
  37. Zhang, W.; Qu, Z.; Zhang, K.; Mao, W.; Ma, Y.; Fan, X. A Combined Model Based on CEEMDAN and Modified Flower Pollination Algorithm for Wind Speed Forecasting. Energy Convers. Manag. 2017, 136, 439–451. [Google Scholar] [CrossRef]
  38. Zhang, Y.; Pan, G.; Chen, B.; Han, J.; Zhao, Y.; Zhang, C. Short-Term Wind Speed Prediction Model Based on GA-ANN Improved by VMD. Renew. Energy 2020, 156, 1373–1388. [Google Scholar] [CrossRef]
  39. Li, Y.; Tang, B.; Jiao, S. SO-Slope Entropy Coupled with SVMD: A Novel Adaptive Feature Extraction Method for Ship-Radiated Noise. Ocean Eng. 2023, 280, 114677. [Google Scholar] [CrossRef]
  40. Liu, H.; Mi, X.; Li, Y. Smart Multi-Step Deep Learning Model for Wind Speed Forecasting Based on Variational Mode Decomposition, Singular Spectrum Analysis, LSTM Network and ELM. Energy Convers. Manag. 2018, 159, 54–64. [Google Scholar] [CrossRef]
  41. Mirjalili, S.; Mirjalili, S.M.; Lewis, A. Grey Wolf Optimizer. Adv. Eng. Softw. 2014, 69, 46–61. [Google Scholar] [CrossRef] [Green Version]
  42. Liu, H.; Xu, Y.; Chen, C. Improved Pollution Forecasting Hybrid Algorithms Based on the Ensemble Method. Appl. Math. Model. 2019, 73, 473–486. [Google Scholar] [CrossRef]
  43. Liu, H.; Shah, S.; Jiang, W. On-Line Outlier Detection and Data Cleaning. Comput. Chem. Eng. 2004, 28, 1635–1647. [Google Scholar] [CrossRef]
  44. Li, Y.; Geng, B.; Tang, B. Simplified Coded Dispersion Entropy: A Nonlinear Metric for Signal Analysis. Nonlinear Dyn. 2023, 111, 9327–9344. [Google Scholar] [CrossRef]
  45. Grassberger, P.; Procaccia, I. Characterization of Strange Attractors. Phys. Rev. Lett. 1983, 50, 346–349. [Google Scholar] [CrossRef]
  46. Li, Y.; Tang, B.; Geng, B.; Jiao, S. Fractional Order Fuzzy Dispersion Entropy and Its Application in Bearing Fault Diagnosis. Fractal Fract. 2022, 6, 544. [Google Scholar] [CrossRef]
  47. Zhu, K.; Song, X.; Xue, D. Fault Diagnosis of Rolling Bearings Based on IMF Envelope Sample Entropy and Support Vector Machine. J. Inf. Comput. Sci. 2013, 10, 5189–5198. [Google Scholar] [CrossRef]
  48. Chen, W.; Zhuang, J.; Yu, W.; Wang, Z. Measuring Complexity Using FuzzyEn, ApEn, and SampEn. Med. Eng. Phys. 2009, 31, 61–68. [Google Scholar] [CrossRef]
  49. Pearson, R.K. Outliers in Process Modeling and Identification. IEEE Trans. Control Syst. Technol. 2002, 10, 55–63. [Google Scholar] [CrossRef] [PubMed]
  50. Wang, J.; Du, P.; Hao, Y.; Ma, X.; Niu, T.; Yang, W. An Innovative Hybrid Model Based on Outlier Detection and Correction Algorithm and Heuristic Intelligent Optimization Algorithm for Daily Air Quality Index Forecasting. J. Environ. Manag. 2020, 255, 109855. [Google Scholar] [CrossRef]
  51. Dragomiretskiy, K.; Zosso, D. Variational Mode Decomposition. IEEE Trans. Signal Process. 2014, 62, 531–544. [Google Scholar] [CrossRef]
  52. Kingma, D.P.; Ba, J. Adam: A Method for Stochastic Optimization. arXiv 2017, arXiv:1412.6980. [Google Scholar]
  53. Huang, N.E.; Shen, Z.; Long, S.R.; Wu, M.C.; Shih, H.H.; Zheng, Q.; Yen, N.-C.; Tung, C.C.; Liu, H.H. The Empirical Mode Decomposition and the Hilbert Spectrum for Nonlinear and Non-Stationary Time Series Analysis. Proc. R. Soc. London. Ser. A Math. Phys. Eng. Sci. 1998, 454, 903–995. [Google Scholar] [CrossRef]
  54. Peng, T. Multi-Step Ahead Wind Speed Forecasting Using a Hybrid Model Based on Two-Stage Decomposition Technique and AdaBoost-Extreme Learning Machine. Energy Convers. Manag. 2017, 153, 589–602. [Google Scholar] [CrossRef]
  55. Emeksiz, C.; Tan, M. Multi-Step Wind Speed Forecasting and Hurst Analysis Using Novel Hybrid Secondary Decomposition Approach. Energy 2022, 238, 121764. [Google Scholar] [CrossRef]
Figure 1. Flowchart of IVMD.
Figure 1. Flowchart of IVMD.
Mathematics 11 02746 g001
Figure 2. Illustration of DLinear. (a): architecture of DLinear; (b): architecture of single-layer linear networks.
Figure 2. Illustration of DLinear. (a): architecture of DLinear; (b): architecture of single-layer linear networks.
Mathematics 11 02746 g002
Figure 3. The basic framework of HI-IVMD-DLinear model.
Figure 3. The basic framework of HI-IVMD-DLinear model.
Mathematics 11 02746 g003
Figure 4. Original wind speed sequences of Lanzhou, Nanjing, and Shijiazhuang.
Figure 4. Original wind speed sequences of Lanzhou, Nanjing, and Shijiazhuang.
Mathematics 11 02746 g004
Figure 5. Decomposition results of the three wind speed sequences: (a) Lanzhou, (b) Nanjing, and (c) Shijiazhuang. Res is the noise after decomposition.
Figure 5. Decomposition results of the three wind speed sequences: (a) Lanzhou, (b) Nanjing, and (c) Shijiazhuang. Res is the noise after decomposition.
Mathematics 11 02746 g005aMathematics 11 02746 g005b
Figure 6. Results of one-step-ahead prediction: (a) Lanzhou; (b) Nanjing; (c) Shijiazhuang.
Figure 6. Results of one-step-ahead prediction: (a) Lanzhou; (b) Nanjing; (c) Shijiazhuang.
Mathematics 11 02746 g006
Figure 7. One-step-ahead prediction errors on the three datasets: (a) Lanzhou; (b) Nanjing; (c) Shijiazhuang.
Figure 7. One-step-ahead prediction errors on the three datasets: (a) Lanzhou; (b) Nanjing; (c) Shijiazhuang.
Mathematics 11 02746 g007aMathematics 11 02746 g007b
Figure 8. The error distribution for one-step-ahead predictions across the three datasets: (a) Lanzhou; (b) Nanjing; (c) Shijiazhuang.
Figure 8. The error distribution for one-step-ahead predictions across the three datasets: (a) Lanzhou; (b) Nanjing; (c) Shijiazhuang.
Mathematics 11 02746 g008
Figure 9. Improvement percentages of VAE for HI-IVMD-DLinear compared to other comparative models.
Figure 9. Improvement percentages of VAE for HI-IVMD-DLinear compared to other comparative models.
Mathematics 11 02746 g009
Table 1. Basic information of three datasets.
Table 1. Basic information of three datasets.
DatasetTime IntervalSample SizeMinimumMeanMaximumStandard Deviation
Lanzhou1 January 2021–31 March 202121600.0001.8306.7651.317
Nanjing1 August 2021–1 November 202122320.0002.8497.6571.705
Shijiazhuang1 July 2021–1 October 202122320.0001.8446.4081.585
Table 2. Parameters of all related methods.
Table 2. Parameters of all related methods.
MethodsParametersValues
IVMDPopulation size50
Maximum iterations30
K [3, 11]
α [0, 1000]
SVRC[0, 10]
Epsilon[0, 1]
Gamma[0, 2]
BPNNDropout[0.05, 0.2]
Batchsize64
Epochs100
Initial lr0.1
Hidden_units[10, 100]
LSTMDropout[0.05, 0.2]
Batchsize64
Epochs100
Initial lr0.1
Hidden_units[10, 100]
TransformerDropout[0.05, 0.2]
Batchsize64
Epochs100
Initial lr0.1
Model dimension[64, 256]
Feedforward dimension[128, 256]
Heads number[1, 5]
Enc_layers[1, 5]
Dec_layers[1, 5]
DLinearBatchsize64
Epochs100
Initial lr0.1
Table 3. The SampEn value of original and HI-processed datasets.
Table 3. The SampEn value of original and HI-processed datasets.
SampEnLanzhouNanjingShijiazhuang
Original sequence1.05621.02301.0658
Sequence after HI1.04970.95340.9570
Table 4. Improvement percentages with HI.
Table 4. Improvement percentages with HI.
DatasetModelPMAE (%)PRMSE (%)PMAPE (%)
LanzhouHI-SVR vs. SVR2.12064.14722.5125
HI-LSTM vs. LSTM1.24523.56122.0106
HI-Transformer vs. Transformer0.89213.51252.2215
HI-DLinear vs. DLinear0.99151.13051.7683
HI-IVMD-DLinear vs. IVMD-DLinear0.76241.06141.2316
NanjingHI-SVR vs. SVR1.51253.89031.7246
HI-LSTM vs. LSTM2.20925.21373.0165
HI-Transformer vs. Transformer1.28755.17513.1062
HI-DLinear vs. DLinear2.17852.18672.1554
HI-IVMD-DLinear vs. IVMD-DLinear1.01261.87512.1240
ShijiazhuangHI-SVR vs. SVR3.56133.14516.1246
HI-LSTM vs. LSTM2.51460.89154.1256
HI-Transformer vs. Transformer1.87451.32714.6012
HI-DLinear vs. DLinear2.07611.05123.1251
HI-IVMD-DLinear vs. IVMD-DLinear1.56120.79512.1531
Table 5. ADF test results.
Table 5. ADF test results.
Datasetst-Statisticp-Value1% Level5% Level10% Level
Lanzhou−1.7140.3704−3.2334−2.6828−2.3674
Nanjing−1.2270.5513−2.8910−2.2150−1.9674
Shijiazhuang−1.8270.3207−3.3517−2.7124−2.4512
Table 6. Results of three evaluation metrics of multi-step-ahead prediction in Lanzhou.
Table 6. Results of three evaluation metrics of multi-step-ahead prediction in Lanzhou.
Estimation HorizonMetricHI-SVRHI-BPNNHI-LSTMHI-TransformerHI-DLinearHI-IVMD-BPNNHI-IVMD-LSTMHI-IVMD-TransformerHI-IVMD-DLinear
1-stepMAE0.31790.27730.26890.23660.20640.16880.12130.07670.0501
RMSE0.41160.37500.35920.32610.25820.20380.14520.10690.0641
MAPE0.15350.14940.13620.12130.10230.07450.07000.04210.0237
2-stepMAE0.43910.43800.33450.33010.28260.22730.20250.15120.1207
RMSE0.60310.58320.45330.44260.37790.31520.28980.21240.1578
MAPE0.22400.22510.17850.18440.15960.12230.10650.08140.0601
3-stepMAE0.45120.44050.3850.38160.34340.28770.25870.22230.1398
RMSE0.60010.58130.51640.51730.45670.40450.36490.28820.1909
MAPE0.23560.23040.2120.20980.17990.17650.15280.11030.0687
4-stepMAE0.52400.51140.44120.41330.37130.34370.33810.25120.1666
RMSE0.68610.67320.59420.55590.48980.41250.35380.30510.2136
MAPE0.25230.25810.23450.22950.20340.20100.17820.13120.0839
Table 7. Results of three evaluation metrics of multi-step-ahead prediction in Nanjing.
Table 7. Results of three evaluation metrics of multi-step-ahead prediction in Nanjing.
Estimation HorizonMetricHI-SVRHI-BPNNHI-LSTMHI-TransformerHI-DLinearHI-IVMD-BPNNHI-IVMD-LSTMHI-IVMD-TransformerHI-IVMD-DLinear
1-stepMAE0.44630.40800.36910.35720.32950.13250.11520.09110.0705
RMSE0.55330.53240.4710.46940.43320.16680.14720.12530.0881
MAPE0.38390.30820.2920.27050.24180.09170.07920.06320.0479
2-stepMAE0.52440.50350.47850.46960.37970.22580.2080.17170.1113
RMSE0.68140.66150.61720.61180.46990.29890.27520.2290.1477
MAPE0.43560.40450.39050.37660.25570.15620.15110.11180.0773
3-stepMAE0.57240.56200.53680.53040.45310.25920.2110.18710.1381
RMSE0.76210.73940.69770.70050.60140.34880.30010.26240.1834
MAPE0.47610.43000.44770.44590.33460.18250.16290.14120.0956
4-stepMAE0.63480.60340.57160.56150.44950.30710.25090.22150.1648
RMSE0.85000.81080.77180.74570.62710.40240.35830.29180.2289
MAPE0.52980.50090.48260.46820.35320.22060.20120.18110.1266
Table 8. Results of three evaluation metrics of multi-step-ahead prediction in Shijiazhuang.
Table 8. Results of three evaluation metrics of multi-step-ahead prediction in Shijiazhuang.
Estimation HorizonMetricHI-SVRHI-BPNNHI-LSTMHI-TransformerHI-DLinearHI-IVMD-BPNNHI-IVMD-LSTMHI-IVMD-TransformerHI-IVMD-DLinear
1-stepMAE0.30730.30410.26970.25400.20600.14410.12770.09390.0669
RMSE0.39370.39080.35640.33830.28470.20130.17650.12830.0861
MAPE0.25060.22810.19350.19360.15100.12250.10220.06610.0480
2-stepMAE0.41210.41390.40260.37450.29760.22550.17680.12020.0814
RMSE0.52610.52860.51880.50400.39450.31320.23710.16120.1054
MAPE0.27990.28230.33790.26420.22150.19460.12680.09120.0569
3-stepMAE0.46250.45740.43960.39950.32460.27520.22480.15120.1000
RMSE0.60310.59970.56570.54010.42240.36740.29030.23420.1338
MAPE0.34120.32750.33920.31140.25510.21570.17350.10230.0690
4-stepMAE0.49510.49410.48710.45190.38140.35020.28610.22150.1439
RMSE0.64310.63350.61950.57610.47450.42640.36830.30120.2141
MAPE0.36200.36390.35300.36140.30950.28490.25430.22020.1023
Table 9. Improvement percentages on three metrics of HI-IVMD-DLinear compared with comparable models in Lanzhou.
Table 9. Improvement percentages on three metrics of HI-IVMD-DLinear compared with comparable models in Lanzhou.
Estimation HorizonMetricHI-SVRHI-BPNNHI-LSTMHI-TransformerHI-DLinearHI-IVMD-BPNNHI-IVMD-LSTMHI-IVMD-Transformer
1-stepPMAE (%)84.203382.719680.900580.265978.603246.784138.810422.6125
PRMSE (%)84.076683.453581.294781.229679.661247.168140.137229.6887
PMAPE (%)87.523684.458483.596982.293880.193647.760539.547524.2089
2-stepPMAE (%)78.774077.895276.739876.300670.689150.704046.479135.1776
PRMSE (%)78.322877.672276.069375.858368.568550.591446.335335.5022
PMAPE (%)82.254980.889980.204979.473969.764550.526648.846730.8587
3-stepPMAE (%)75.873575.427574.273273.963369.524146.730434.539626.1892
PRMSE (%)75.936375.196873.714873.820269.504147.412938.896830.1067
PMAPE (%)79.921277.769978.644778.562271.430447.604041.331332.2946
4-stepPMAE (%)74.038072.686671.170770.651863.333446.333934.308225.5982
PRMSE (%)73.071371.767170.340569.302363.498743.115436.122121.5559
PMAPE (%)76.104774.723573.767372.958864.156642.605837.085830.0939
Table 10. Improvement percentages on three metrics of HI-IVMD-DLinear compared with comparable models in Nanjing.
Table 10. Improvement percentages on three metrics of HI-IVMD-DLinear compared with comparable models in Nanjing.
Estimation HorizonMetricHI-SVRHI-BPNNHI-LSTMHI-TransformerHI-DLinearHI-IVMD-BPNNHI-IVMD-LSTMHI-IVMD-Transformer
1-stepPMAE (%)84.240981.933381.365778.826075.722470.319958.683134.6806
PRMSE (%)84.427282.906082.156480.346075.172768.544255.852940.0374
PMAPE (%)84.559184.131382.599880.455676.835668.167866.164543.7055
2-stepPMAE (%)72.513472.444663.920563.435857.288246.887340.382620.1904
PRMSE (%)73.836272.942865.185864.345858.239449.943245.553825.6908
PMAPE (%)73.171173.295066.330267.408462.351950.871643.563026.1879
3-stepPMAE (%)69.018468.266763.684063.362359.289151.404345.957237.1233
PRMSE (%)68.190867.161663.034963.095758.199052.806047.689233.7700
PMAPE (%)70.841670.182267.590367.258161.819161.077855.027637.7379
4-stepPMAE (%)68.206367.422262.239259.687655.126951.524050.721933.6865
PRMSE (%)68.868468.271164.054861.575056.387048.220239.626829.9954
PMAPE (%)66.751867.493564.222163.450858.755458.262252.927636.0750
Table 11. Improvement percentages on three metrics of HI-IVMD-DLinear compared with comparable models in Shijiazhuang.
Table 11. Improvement percentages on three metrics of HI-IVMD-DLinear compared with comparable models in Shijiazhuang.
Estimation HorizonMetricHI-SVRHI-BPNNHI-LSTMHI-TransformerHI-DLinearHI-IVMD-BPNNHI-IVMD-LSTMHI-IVMD-Transformer
1-stepPMAE (%)78.216777.9975.1873.6567.5053.5447.5928.72
PRMSE (%)78.133477.9775.8474.5569.7657.2451.2132.90
PMAPE (%)80.859078.9875.2275.2368.2460.8653.0927.44
2-stepPMAE (%)80.249180.3379.7878.2672.6563.9053.9632.25
PRMSE (%)79.966580.0679.6879.0973.2866.3555.5434.62
PMAPE (%)79.672279.8483.1678.4674.3170.7755.1137.63
3-stepPMAE (%)78.379078.1477.2574.9769.1963.6655.5133.88
PRMSE (%)77.814677.6976.3575.2368.3263.5953.9142.88
PMAPE (%)79.779378.9379.6677.8472.9668.0160.2232.56
4-stepPMAE (%)70.936370.8770.4668.1662.2758.9149.7135.02
PRMSE (%)66.705666.2065.4462.8454.8849.7941.8728.93
PMAPE (%)71.736571.8971.0271.7066.9464.0959.7753.53
Table 12. Predictive stability results (VAE) in three datasets.
Table 12. Predictive stability results (VAE) in three datasets.
Estimation HorizonHI-SVRHI-BPNNHI-LSTMHI-TransformerHI-DLinearHI-IVMD-BPNNHI-IVMD-LSTMHI-IVMD-TransformerHI-IVMD-DLinear
Lanzhou
1-step0.12000.11000.08930.11020.05130.01970.01290.02110.0028
2-step0.19200.18900.15410.18990.11270.03830.03130.03160.0084
3-step0.21120.22160.18730.20120.15770.05120.05440.04120.0178
4-step0.26350.25390.23670.25240.20250.05510.06760.05180.0264
Nanjing
1-step0.06670.05710.05360.06100.05060.03270.01140.01100.0052
2-step0.10980.08350.09970.08110.07100.04770.03270.04110.0173
3-step0.13710.13800.11780.12250.10480.09020.06690.06240.0353
4-step0.16200.16100.15030.15030.13030.13330.10470.10090.0603
Shijiazhuang
1-step0.06510.06100.06520.07820.05140.01370.01000.04160.0044
2-step0.10210.10190.09220.09560.07810.04120.02660.02100.0096
3-step0.12410.12210.11700.10180.09210.08100.05910.04110.0336
4-step0.14070.13950.13010.13120.11390.10650.08720.07210.0655
Table 13. The RMSE in Lanzhou obtained by basic forecasting models combined with different decomposition strategies.
Table 13. The RMSE in Lanzhou obtained by basic forecasting models combined with different decomposition strategies.
StrategySVRBPNNLSTMTransformerDLinear
Non-decomposition0.55330.53240.47100.46940.4332
EMD0.41190.39920.36110.35740.3192
CEEMDAN0.36330.27310.26320.24270.174
CEEMDAN-VMD0.32110.20310.17540.16410.1259
CEEMDAN-LMD0.32750.18320.16180.15170.1187
IVMD0.30150.16680.14720.12530.0881
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, J.; Gong, C.; Chen, S.; Zhou, N. Multi-Step-Ahead Wind Speed Forecast Method Based on Outlier Correction, Optimized Decomposition, and DLinear Model. Mathematics 2023, 11, 2746. https://doi.org/10.3390/math11122746

AMA Style

Liu J, Gong C, Chen S, Zhou N. Multi-Step-Ahead Wind Speed Forecast Method Based on Outlier Correction, Optimized Decomposition, and DLinear Model. Mathematics. 2023; 11(12):2746. https://doi.org/10.3390/math11122746

Chicago/Turabian Style

Liu, Jialin, Chen Gong, Suhua Chen, and Nanrun Zhou. 2023. "Multi-Step-Ahead Wind Speed Forecast Method Based on Outlier Correction, Optimized Decomposition, and DLinear Model" Mathematics 11, no. 12: 2746. https://doi.org/10.3390/math11122746

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop