Abstract

As a clean energy source, the role of wind power in the energy mix is becoming increasingly important. Reliable and high-quality wind speed prediction results are key to wind energy utilization. The traditional point prediction method cannot effectively analyze the uncertainty of wind speed, and the interval prediction model can provide the possible variation range of wind speed under a certain confidence probability and supply more uncertain information to decision makers. However, the previous interval prediction models generally ignore the random characteristics of capturing wind speed and the importance of objective selection of prediction submodels, leading to poor prediction results. To address these problems, a combined model based on data preprocessing, multi-neural network models, multi-objective optimization, and an improved interval prediction method is proposed. The model is applied to five wind speed forecasting examples in Dalian to test the prediction accuracy, multi-step prediction ability, and universality and generalization ability of the model. The experimental results show that the model proposed in this study is satisfactory for various performance evaluation indexes, has high stability and accuracy, and all the solutions obtained by the model are Pareto optimal solutions. Thus, it provides a reliable reference for the effective utilization of wind energy.

1. Introduction

For a long time, the development mode with overreliance on coal, oil, and other traditional fossil fuels has not only brought economic growth to a majority of countries around the world, but also caused problems that cannot be ignored, such as environmental pollution, abnormal climate change, and frequent natural disasters [1].

At present, countries worldwide are trying to find renewable energy to promote the green, efficient, and sustainable economic development. As a safe and sustainable energy, wind energy has naturally become an effective alternative to traditional energy [2].

In 2020, the new installed capacity of global wind power was still set a new record, reaching 93 GW [3], owing to the two major markets, China and the United States. In China, by the end of 2020, the cumulative installed capacity of wind power had reached , including of onshore wind power and approximately of offshore wind power. This makes China the largest country in wind power, with 130% more power than the United States, which ranks second, and 1.2 times the installed capacity all of Europe. However, in terms of the proportion of wind power in total power demand, China lags behind Europe and the United States, and it is also slightly lower than the global average [4]. Figure 1 shows the power generation of each large wind power country.

At present, there are still many difficulties hindering the development of wind power generation, such as unstable and fluctuating wind speed, substandard technology, and difficulty in predicting wind speed, among which the prediction of wind speed is the most important issue. As we all know, wind speed has the characteristics of randomness and instability, which leads to an unsustainable and unstable output of the power grid, which may result in the collapse of the power grid. Therefore, accurate wind speed prediction is the key to improving wind power generation capacity.

In the past 10 to 20 years, researchers have provided a number of alternative solutions for accurate wind speed prediction, including physical methods [5] and statistical methods [6]. The physical method does not require the support of historical power data from wind farms and is suitable for new wind farms. However, it requires a large number of accurate numerical weather forecast (NWP) data [7]. Further, the physical information of wind power decreases the accuracy of wind power prediction, and thus it is not suitable for countries with large land areas and complex terrain conditions. The statistical method requires the support of abundant historical data and has high requirements for the consistency of the change law of historical data. Most wind farms find it difficult to complete the establishment of statistical models because the position of measurement is not accurate.

As a result, artificial intelligence models have received significant attention. The advantage of the artificial intelligence model is that it does not require high position accuracy for measuring wind speed and does not need any prior assumptions. Common artificial intelligence models for predicting wind speed include the NN neural network, Elman neural network [8], LSTM deep learning [9], and GRU deep learning [10]. For example, Humfeld et al. predicted the wind turbine generation capacity using a neural network (NN) and achieved good results [11]. Choe et al. successfully predicted the power generation efficiency of floating offshore wind turbine blades using LSTM and GRU deep learning [12]. Niu and Wang used the combined model of point prediction to predict the wind speed and achieved good results [13]. Niu et al. proposed a carbon price forecasting system based on error correction and divide-conquer strategies; the use of multi-objective optimization algorithm effectively improves the prediction accuracy [14]. Hao et al. proposed a new combination forecasting model for the prediction of the impact of haze on China’s economy [15]. Gao et al. proposed a multicomponent hybrid system based on predictability recognition and modified multi-objective optimization for ultra-short-term onshore wind speed forecasting [16]. Yang et al. proposed a new novel machine learning-based electricity price forecasting model based on optimal model selection strategy [17]. However, a single artificial intelligence model will fall into the problem of a local minimum or lead to overfitting. Therefore, some experts have proposed a combined model based on artificial intelligence models. For example, the EMD + DBN method [14], after high- and low-frequency data decomposition through the EMD method, predicts and recombines each layer of data to solve the problem of wind speed prediction lag [18]. However, owing to the lack of optimization of the prediction results, there is often a large gap with the real value.

In the current research, most wind speed forecast studies are based on point prediction [19]. Point prediction cannot ensure the reliability and controllability of wind speed prediction, whereas interval prediction can quantify the changes of prediction results caused by uncertainty and observe the changes in wind speed at different significance levels. Most importantly, effective interval prediction results can provide more uncertain information for decision-makers and help them carry out risk prediction and assessment.

At present, the point prediction (PF) accuracy of the combined model has made great progress, while the accuracy of interval prediction has been improved to a lesser extent. Interval prediction methods mainly include the Mean Variance Estimation Method [20], Delta Method [21], Bayesian Method [22], and Bootstrap Method [23, 24]. In this study, the Bootstrap Method is adopted; however, its performance largely depends on the prediction accuracy of a single neural network. To solve this problem, a combined model based on multiple neural networks is proposed, and a heuristic optimization algorithm is used to obtain the optimal super parameters and weight parameters [25, 26]. The evaluation of existing wind speed forecast models is presented in Table 1.

From the review of a large number of studies, we find that the existing wind speed prediction mostly uses single neural network prediction supplemented by multi-objective optimization model or multiple neural network prediction supplemented with a single-objective optimization model. The single-objective optimization model often ignores the stability of prediction while improving the accuracy, whereas single neural network prediction may not be suitable for all wind speed situations. We develop a combined model based on multiple single neural network prediction models and multi-objective optimization. A number of experiments show that, under the condition that the data requirements are not too stringent, the combined model proposed in this study improves the accuracy of wind speed interval prediction and can effectively improve the wind power generation capacity. In addition, in the comprehensive evaluation module, we use several evaluation indicators of interval prediction ability to evaluate the effectiveness of the combination model proposed in this study on a comprehensive manner and prove its superiority.

The contributions and innovations of this study are as follows:(i)The data preprocessing method is used to reduce the volatility and randomness of wind speed series in an effective way. As the application of decomposition and reconstruction technology, the data preprocessing strategy is used to delete the high-frequency noise and leave the low-frequency effective wind speed data. After decomposition and reconstruction, the volatility and randomness of the original wind speed series are effectively reduced, and the prediction performance is improved.(ii)A combined system that includes data preprocessing, combined forecasting of multiple neural networks, and a multi-objective optimization and evaluation module is proposed. Four different neural network models are used for interval prediction of data, overcoming the limitations of a single neural network, and the prediction results of the different neural networks are optimized through a multi-objective optimization model. Thus, the different objectives tend to be Pareto optimal solutions at the same time, and the problem that stability and accuracy cannot coexist is overcome.(iii)Interval forecasts can observe the changes in wind speed at different significant levels, which provides more accurate information on wind speed for decision-makers. In the interval prediction, the combined neural network prediction model is adopted, and a heuristic optimization algorithm is used to improve the Bootstrap method, thus obtaining accurate interval prediction results.(iv)A comprehensive and exhaustive evaluation system is established to validate the performance and prediction accuracy of the forecasting models. Five experiments are performed to verify the superiority, universality, and generalization of the model.

The remaining of this paper is organized as follows: Section 2 introduces the selection of data and data processing; Section 3 introduces the combined neural network and its optimization principle and constructs and optimizes the models; Section 4 evaluates and verifies the superiority of the model through multiple groups of data and multiple indicators; and Section 5 discusses the advantages and disadvantages of the model and the directions for further improvement.

2. Data and Data Processing

Dalian, which is located at the southern end of the Liaodong Peninsula of China (38.43°–40.12° N and 120.58°–123.31° E), has a wind speed suitable for wind power generation throughout the year, making it an ideal location for wind power plants. In fact, wind power development in Dalian City has been at the leading level in China. In 2015, the power grid received 152.4 million kilowatt-hours of wind power in Dalian City, accounting for 5% of the electricity generation in Dalian City. Wind power has become an important energy source for Dalian.

In this study, wind speed monitoring data obtained in January 2011 from four different monitoring sites (WTG03, WTG04, WTG05, and WTG06) in a wind power plant in Dalian are selected. The datasets include four groups, and each group has 1463 sets of wind speed data, at which there is an interval of 10 minutes. Figure 2 shows the basics of each dataset.

As the commonly used data preprocessing strategies, the empirical mode decomposition (EMD) and ensemble empirical mode decomposition (EEMD) have the drawbacks of modal aliasing and involve a certain white noise fluctuation after reducing the noise in raw datasets, the complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) proposed by Torres [27] is utilized. CEEMDAN solves the above problems from two aspects.(1)The intrinsic mode function (IMF) component with auxiliary noise is added after decomposition by EMD instead of the white Gaussian noise signal being directly added to the original signal.(2)EMD and EEMD [28] are used to calculate the average value of the modal components obtained from empirical mode decomposition, whereas CEEMDAN calculates the average value when the first-order IMF [29] components are obtained; then, the above operations are repeated for the residual parts. In this way, the CEEMDAN has effectively solved the transfer and transmission problem of white noise from high frequency to low frequency.

Let be the ith eigenmode component obtained by EMD and the ith eigenmode component obtained by CEEMDAN. is the Gaussian white noise signal satisfying the standard normal distribution, is the number of added white noise, is the standard table of white noise, and is the signal to be decomposed. The EMD process is summarized as follows:(1)White noise, which is positive and negative pairwise, is added to the signal to be decomposed, and a new signal is obtained: , where . Then, decompose the new signal using EMD to obtain the first-order eigenmode component :(2)Obtain the first eigenmode component of CEEMDAN by averaging generated N-modal components:(3)Secondary calculation residual:(4)The way to obtain a new signal is to add the positive and negative pairwise Gaussian white noise .Then, the first modal component of the EMD () decomposition and the second modal component of the CEEMDAN are obtained:(5)The residual after removing the second modal component is calculated:(6)Repeat the above steps until the obtained residual signal is a monotone function and cannot be decomposed, and the algorithm ends. The number of eigenmode components obtained is and the original signal is decomposed into

Taking the first set of data (Site 1) as an example, a data preprocessing technique is performed, and the CEEMDAN method is implemented in MATLAB. The parameters of the CEEMDAN are set as follows: Nstd value is 0.2, NR value is 50, and Maxlter value is 500. The original data are divided into 10 different stats with different frequencies, the two groups of data with the highest frequency are filtered out, and the others are used to reconstruct a series that is more stable for the subsequent forecasting stage, as shown in Figure 3.

3. Theory and Construction of Combined Model

A combined model is used to achieve more accurate forecasting results. In this study, four AI models are selected as the component models for our proposed combined model: limit learning machine (ELM) [30], support vector machine (SVM) [31], back-propagation neuron network (BP) [32], and wavelet neural network (WAVENN) [33]. To improve its accuracy, the four groups of prediction results from the four AI models are weighted again using the multi-objective locust optimization algorithm (MOGOA). The basic theory of each model is as follows.

3.1. Theory of Combined Models

The network structure of the ELM is the same as that of a single hidden layer feedforward neural network (SLFN) [34], but in the training stage, the traditional neural network (back propagation) algorithm is no longer used, and the weights and deviations of the input layer are randomly set. In this study, we set the implicit layer neuronal activation function , sample size , and network output value .

The objective function is as follows:

According to the linear algebra and the matrix theory, the optimal solution can be deduced as follows:where is the Moore–Penrose generalized inverse matrix of matrix .

The above problem is transformed into calculating the Moore–Penrose generalized inverse matrix of matrix using the singular value decomposition method (SVD) [35]. If is nonsingular, we can use the orthogonal projection method, and the calculated results are as follows:

SVM was first proposed by Vapnik, and it is a two-class classification model. Its basic model is to find a linear classifier with a maximum separation hyperplane in the feature space. The SVM model mainly comprises four parts:(1)Set the known training as(2)Choose an appropriate kernel function and parameter to construct the optimization problem:Obtain the optimal solution: .(3)Choose a positive component of , and calculate the threshold accordingly:(4)Construct a decision function:(5)Add a relaxation term :

The parameters used in the regression are and , and their formulas are

We use the LIBSVM-FarutoUltimate Ultimate toolbox to optimize the SVM. The scale for the SVM function in the toolbox can optimize and normalize the parameters reasonably, which further improves the accuracy of prediction [36].

WAVENN was originally used to predict traffic flow series. Wavelet analysis is developed to solve the deficiency of the Fourier transform, which has no resolution in time domain. The wavelet basis function of Morlet is used in this study: .

In the neural network, we use the first four sets of data of each time node to make a rolling prediction of the node to improve the prediction effect: . The calculation formula and prediction error of the output layer of it are as follows:

The key point of the WAVENN neural network lies in the use of wavelet transform:

Suppose the data can be written in vector form: , :where , .

The BP neural network is a neural network which is based on an error back-propagation algorithm. The prediction function of the BP neural network is . BP neural network is solved through the following steps:(1)Hidden layer output calculation:(2)Output layer result calculation:(3)Error calculation, weight and threshold update, and output of results:

The gradient matrix is mainly used in the BP neural network:

MOGOA is a new type of optimization algorithm [37]. The GOA algorithm has a strong exploration advantage on the whole [38], which ensures that the algorithm has a strong global search ability, and avoids stopping in the local optimization. We use this algorithm to fit the results of the above four models to make the prediction results close to the true value.

The algorithm divides the search into exploration and development by simulating the population migration and foraging behavior of locusts in nature. The mathematical formula for the locust algorithm is as follows:where is the position of the ith locust, refers to the mutual influence between individuals, is the gravity of the ith locust, is the wind force on the ith locust, and is calculated as follows:where is the distance between the ith and the jth locusts and is the unit vector pointing to the jth locust for the ith locust.

However, the function has defects: that is, when the distance between locusts is excessively large, the force will not exist, and thus the distance between locusts should be standardized.where is the gravity constant and is the unit vector pointing to the center of the Earth.

can be calculated as follows:

Usually, when the locust swarm reaches the comfort zone, the swarm does not converge. To adapt to the solution of the optimization problem and coordinate the global and local optimization processes, parameters can be introduced to distinguish different stages of optimization. The original model and the improved mathematical models are as follows:where is the decreasing coefficient, which determines the size of the comfort zone, repulsion zone, and attraction zone, is the number of locusts, and is the best solution of locust position in the -dimensional space thus far. In the formula, the influence of gravity is not considered, and it is assumed that the wind direction always points to the optimal solution. The internal helps to reduce the repulsion or attraction between locusts, which is directly proportional to the number of iterations. The external will reduce the search area around the target with an increase in the number of iterations.

The following formula is used to reduce the global search and increase the local precision search.

Proof. It is proved that when the algorithm finds the optimal solution, it will be stored in Archive, and those with high fitness will be selected in the front position. If there is a , and satisfies , then will be sorted in front of . When Archive is full, the value with low fitness will be proposed, so the first few individuals of the final selected Archive are Pareto optimal solutions of MOGOA.

3.2. Construction of Combined Model

A new wind speed interval prediction framework is proposed using the model introduced in the previous section. The system mainly includes a combined prediction module and evaluation module, as presented in Figure 4 and Tables 2 and 3. We selected the first 1200 data as the training set and the last 203data as the test set. The whole process of the combined prediction model is as follows:(1)Use the CEEMDAN method to decompose the original data into components with different frequencies and delete the component with the highest frequency as noise to achieve the effect of data noise reduction.(2)Use 4 different neural networks to predict the processed data and use the multi-objective optimization algorithm to perform unconstrained weighting on the prediction results to obtain a new prediction solution.(3)Use different indicators to evaluate the prediction results of the combined model to verify its advanced nature.

3.2.1. Combined Prediction Module

The combined model we proposed selects four artificial intelligence models (WAVENN, SVM, ELM, and BP) as component models [39]. The four neural networks have a good prediction accuracy. Through continuous debugging of the number of hidden layer nodes, learning rate, and other parameters, they are suitable for long-term prediction. After obtaining the point prediction, we form different confidence intervals according to the results of point prediction, and set the upper boundary of the confidence interval as and the lower boundary as [40].where is the estimated value of the real regression mean, is the variance of the uncertainty of the NN model, and is the variance of the noise.

To make the interval prediction of the time series more accurate, we use MOGOA to optimize the single predicted value of the above four neural networks and make it as close to the real results as possible (Algorithm 1).

Objective functions:
Parameters:
: The maximum number of iterations
N: The number of grasshoppers
: The position of ith grasshoppers
: The position ofjth grasshoppers
t: The current iteration number
d: The number of dimensions
  WHILE () DO
   Update using
   For all agents DO
     Normal distance between grasshoppers in [1, 4]
     Update grasshoppers-position with
     
     Calculate the fitness function for each search agent
     Find the non-dominated solutions
     Update A
     IF (A is full) THEN
      Run the grid mechanism to one of the currents achieve members
      Add the new solution to A
     END IF
     END FOR
     
   END WHILE
   RETURE A (Contains PO)
3.2.2. Evaluation Module

After the above process, we apply the combination strategy to predict the data, and through a number of interval prediction indicators for comprehensive evaluation, we verify the superiority of the model performance.

We take 1200 sets of data (1th–1200th) in each group of data as the training set and the data (1201th–1403th) as the test set, use the proposed combination model to predict, and draw the prediction results of Site 1, as shown in Figure 5. Confidence level is 0.95, respectively.

4. Verification and Evaluation of the Model

After the construction and optimization of the model, we need to establish an evaluation system to verify the superiority of the model. This is divided into five experiments: (1) comparison between the combined model and the single neural network model; (2) comparison of different data preprocessing methods (EMD and EEMD), CEEMDAN, and data nonpreprocessing; (3) universality test of the combined model proposed in this study; (4) generalization ability test of the combined model; and (5) use of multi-step prediction to verify the prediction ability of the model.

4.1. Experiment I

To verify the superiority of the combined model over the component neural networks (SVM, BP, WAVENN, and ELM), seven indexes, namely, MAPE [41], PICP [42], MPIW [43], CWC [44], ACE [45], MPICD [46], and NMPICD (min-max) [36] are selected for the comprehensive evaluation. The confidence levels of interval prediction are 0.8, 0.9, and 0.95, respectively. The formula of each evaluation index is presented in Table 4, and the evaluation results are presented in Table 5 and Figure 5.

It is obvious that the accuracy of the combined model after multi-objective optimization is better than that of the single-component model:(a)The MAPE value can evaluate the accuracy of point prediction, and the lower the MAPE value is, the better the prediction effect of the model is. It can be found that the MAPE value of the combined model is significantly lower than that of each single neural network. For Data 1, the , which is 31.9% higher than , 38.4% higher than , 125% higher than , and 25.6% higher than that .(b)The PICP value indicates the probability of the real value contained in the interval prediction at different confidence levels. When the real coverage is higher than the confidence level, , and when the real coverage is less than it, :These indicators show the coverage of the interval prediction to the real value. It can be observed from the real data that the actual probability of a small part of the WAVENN neural network is slightly lower than the Confidence Level = 0.8. The other neural network predictions reach the expected level. The PICP value of the combined model proposed in this study is significantly higher than that of the single neural network, which demonstrates the superiority of the combined model. For Data 1, at a confidence level of 0.8, the PICP value of the combined model is , which is 7.22% higher than that predicted by the BP neural network (), 4.70% higher than that of the SVM neural network (), 15.58% higher than , and 6.47% higher than that of the ELM neural network ().(c)MPICD is a measure of the difference between the real level and the intermediate value of the interval, and NMPICD is the normalized value of the former. For the NMPICD, the study uses the min-max normalization method:

It can be observed from the data that, for the combined model, is all approximately 0.1 and the minimum value (standard value) also appears in the optimized model; at the same time, is approximately 0.2, is approximately 0.3, and is approximately 0.7. This indicates that the real value appears more centrally in the middle of the interval prediction, and there is a significant improvement compared with the single model, as shown in Figure 6.

Remark 1. In experiment I, we can observe that the real value coverage and the distance between the real value and the interval center of the combined model proposed in this study are much higher than those of the single neural network model, and the accuracy of the combined model is significantly better than that of the single neural network model.

4.2. Experiment II

In this experiment, we verified the improvement of the wind speed prediction accuracy by data preprocessing and the advantages of the CEEMDAN method over EMD and EEMD. First, the component neural network model is directly used to predict the nonpreprocessed data; second, the EMD and EEMD methods are used to denoise the data and compare them with the results of the CEEMDAN method. In this experiment, the training set, test set, and evaluation indicators are the same as above.

The proposed combination model is directly used to predict the data without denoising. The results are listed in Table 6, and the prediction results of Data1 (confidence level 0.95) are shown in Figure 7.

It is obvious from the chart and table that the prediction ability of the model decreases sharply without data preprocessing. is the ability to contain real values in the prediction interval, and is the difference between and confidence level:

Except for some of the optimized models, the value is negative, indicating that most of the interval forecasts do not reach the expected level, and the ability of the model to predict wind speed is not good, where , , , and are the evaluation indexes of whether the real value is in the center of the confidence interval. From the table, we can clearly see that the of the prediction result without data preprocessing is significantly larger than that after data preprocessing, which shows that data preprocessing plays a significant role in improving the prediction accuracy of the model.(a)For MAPE, the MAPE value of the model with data preprocessing is much lower than that of MAPE without data preprocessing. For Site 1, in the single neural network prediction model, the decreased by 64.28% compared with ; the decreased by 66.13% compared with ; the decreased by 71.72% compared with ; and the decreased by 68.19% compared with . For the prediction results of the four sets of data, the MAPE value is reduced by more than 50% in both the single and combined models.(b)For PICP, the PICP value of the model without data preprocessing, whether a single neural network or the combined model is significantly lower than expected. For Site 1, in a single neural network prediction model, the value of the BP neural network () increases by 13.69% compared with when the confidence level is 0.8, 6.52% when the confidence level is 0.9, and 4.23% when the confidence level is 0.95; the value of SVM neural network () increases by 19.69% compared with when the confidence level is 0.8, 11.52% when the confidence level is 0.9, and 5.96% when the confidence level is 0.95. The PICP value of WAVENN neural network () increases by 17.58% compared with when the confidence level is 0.8, 9.32% when the confidence level is 0.9, and 8.33% when the confidence level is 0.95. The PICP value of the combined model () increases by 14.10% compared with when the confidence level is 0.8, 6.19% when the confidence level is 0.9, and 5.77% when the confidence level is 0.95, which shows that data preprocessing is key to the prediction accuracy of the model.(c)For MPICD, the BP neural network upshifted the true value toward the center of the confidence interval by 0.6 after data preprocessing, and the other neural networks also upshifted it to various degrees, and the combined neural network upshifted it by 0.3. The EMD method is used instead of the CEEMDAN method, and the results are presented in Table 7.

After EMD denoising, although most of the models meet the expectations, the prediction accuracy is still very different from that of CEEMDAN denoising. Regardless of the single neural network prediction or the combination model, MAPE, PICP, and MPICD are significantly smaller than the CEEMDAN method, which further verifies the superiority and advanced nature of the model proposed in the study, as presented in Figure 7.

Remark 2. Through experiment II, it is obvious that data preprocessing plays a significant effect in improving the prediction accuracy of the combined model, and the superiority of the data preprocessing method used in the model is verified by comparing different data preprocessing methods (EMD and CEEMDN).

4.3. Experiment III

To verify the universality of the proposed model [47], based on the wind speed test of 10 min each time in this study, the wind speed predictions of 20 min each time and 30 min each time are added to examine the universality and reliability of the model at different test frequencies.

To ensure the reliability of the experiment, the time of the experimental data is the same as that in the previous article. The specific training set and test set are listed in Table 8, and the test results are presented in Table 9. According to the previous evaluation indexes, the prediction ability of the model at different frequencies is evaluated, as shown in Figure 8.

From Table 9, it can be observed that the models proposed in this study show high accuracy in the prediction of wind speed at different frequencies. The value is higher than the expected confidence level, and the value is greater than 0:

When the frequency is every 20 min, the values of the combined model are 5.1676, 4.9853, 5.7851, and 5.2294, respectively; when the frequency is every 30 min, the values of the combined model are 6.7619, 5.9863, 6.8870, and 6.3812, respectively; when the frequency is 20 min once, the values are 0.85, 0.86, 0.84, and 0.84, respectively, which are 0.05, 0.06, 0.04, and 0.04 higher than the confidence level, respectively. When the frequency is 30 min once, the values are 0.86, 0.83, 0.85, 0.84, and 0.84, which are 0.06, 0.03, 0.05, 0.04, and 0.04, respectively, higher than the confidence level, respectively, and the prediction accuracy is good. For and , the value is 0.5, 0.4, 0.4, and 0.3 when the frequency is 20 min, and the value is 0.6, 0.6, 0.6, and 0.5 at a frequency is 30 min. It can be observed that most of the real values are located in the center of the confidence interval, indicating that the accuracy of the combined model is high. Although the accuracy of the model will be affected to a certain extent with the decrease in the frequency, the model proposed in this study still maintains a good prediction accuracy, which shows that it is reliable and effective.

Remark 3. Experiment III verifies the universality of the combined model, which can play stably under different wind speed frequencies, and gives accurate predicted values, thus laying a good foundation for the final use of the model.

4.4. Experiment IV

To verify the generalization ability of the combined model prediction proposed in the study, we use the method of 10-fold cross-validation to redistribute the data. The data are randomly divided into 10 parts, one of them is selected as the test set and the other nine as the training set, and they are cycled until each has done the test set. The results of cross-validation are shown in Table 10 (each set of data shows five sets of cross-validation). In the 10-fold cross-validation, the prediction accuracy of the combined model proposed in the study is always stable. Take Site 1 as an example, the  =   which are all keeping at approximately 3. value and value also reach the expected level, the interval coverage is greater than the confidence level, and the difference between the real value and the center of the interval is approximately 0.3. The prediction accuracy and stability are very satisfactory, which well verifies the generalization ability of the combined model proposed in this study [48].

Remark 4. Experiment IV validates the generalization ability of the model and shows the reliability of the model through 10-fold cross-validation. The prediction effect of the model is higher than expected under different training sets and test sets.

4.5. Experiment V

In the previous experiment, we verified the single-step prediction ability of the model proposed in this study. However, in the actual wind speed prediction, we need to predict the values of multiple time points in the future. In this experiment, we introduced the concept of multi-step prediction, and the prediction ability of the model is further verified by the accuracy of the multi-step prediction.

In the experiment, we adopted a direct multi-step prediction strategy. The direct method developed a separate model for each predicted time point. For example, when forecasting the wind speed at the next two time points, we established an independent model to predict it at the first time point and another independent model to predict it at the second time point, as shown in Figure 9.

We realize single-step prediction, two-step prediction, and three-step prediction for four groups of data and verify the prediction ability of the model through the evaluation indicators of multiple interval forecasts. The results are presented in Table 11.(a)With an increase in the number of prediction steps, the prediction ability of the model decreases to a certain extent. In terms of , the two-step prediction () is 25.67% lower than , and the three-step prediction is lower than 30.91% (). In terms of , the two-step prediction () is approximately 20% lower than the one-step prediction (), and the three-step prediction () is approximately 35% lower. This shows that the accuracy of the combined model proposed in this study needs to be improved when dealing with multi-step prediction.(b)Although the accuracy of the multi-step prediction of the model has declined to a certain extent, the coverage of interval prediction is still satisfactory. For both the two-step prediction and three-step prediction, most of the values are higher than the confidence level. At Site 1, in the two-step prediction, when the confidence level is 0.8, the interval coverage of the combined model () is 5% higher than the confidence level; at Site 2, the interval coverage of the combined model () is 6.25% higher than the confidence level; at Site 3, the interval coverage of the combined model () is 5% higher than the confidence level; and at Site 4, the interval coverage of the combined model () is 7.5% higher than the confidence level. At Site 1, in the three-step prediction, the interval coverage of the combined model () is 2.5% higher than the confidence level. At Site 2, the interval coverage of the combined model () is 3.75% higher than the confidence level. At Site 3, the interval coverage of the combined model () is 2.5% higher than the confidence level. At Site 4, the interval coverage of the combined model () is 2.5% higher than the confidence level. A few values of the component prediction model are less than 0, but the values of the combined model are all greater than 0. This level is higher than expected.

4.5.1. Sensitivity Analysis

To further explore the influence of the key parameters of the MOGOA algorithm on the sensitivity of the combined model, we propose a sensitivity analysis mechanism. Under the condition of keeping other parameters unchanged, the mechanism only changes one parameter value at a time, and then the fluctuation degrees of the prediction results are analyzed. We measured the sensitivity using the standard deviation of the four evaluation indicators.

The equation is defined aswhere refers to the value of metric (MAE, RMSE, MAPE, and SSE), refers to the average of all testing times, and a smaller value indicates stronger stability.

Definition 1. It is assumed that , , and . When the value of is tested from the first value to the last value, the values of and remain the same; that is, , , and it is defined as .
We set the vectors of the Tunicate Number, Iteration Number, and Archive Size as ; ; , respectively, and the number marked with an asterisk () is the best value.
We test , , and , respectively, and calculate their standard deviations according to the differences in the number of prediction steps, for example, , , and . Let us take Data 1 as an example, and the results are listed in Table 12. We can clearly observe from the table that the volatility of is obviously smaller than that of and .
From the above table, we can observe that the SD values all fluctuate in a small range, indicating that the sensitivity of the three parameter models is low, which shows that our model has high stability in practical applications.

Remark 5. Experiment V verified the prediction ability of the combined model using two-step and three-step predictions. Although the forecast accuracy of the model decreases as the number of prediction steps increases, the coverage rate and other indicators are still higher than the expected value. Therefore, the combined model is still stable and reliable.

4.6. Experiment

In this experiment, the traditional official test functions are used to test the ability of the optimization algorithm. We selected (ZDT1, ZDT2, ZDT3, ZDT1-Line) to test the MOGOA. The test function is shown in Table 13, and the picture of Pareto frontier is shown in Figure 10.

Remark 6. Experiment VI tests the performance of the multi-objective algorithm (MOGOA) through four groups of classical test functions. From the image, it is clear that the performance of the algorithm is good, it has a strong optimization ability when multiple objective functions exist at the same time.

5. Practical Application and Conclusion

With the breakthrough of the key technology of wind power and the expansion of the wind farm construction scale, the large fluctuation range of wind farm output power needs to be taken into account when analyzing the wind farm connected to the power system. Wind energy has intermittent and random fluctuations. The change in wind speed directly leads to a change in the active power and reactive power of the wind farm, and the output power becomes very unstable. When the wind power penetrates the full operation, it not only has a serious impact, but also affects the power quality, economic dispatching, and power bidding. Therefore, it is of practical significance to conduct research on wind power forecasting and improve the accuracy of wind speed prediction. This is of practical significance to power grid dispatching, improving the access capacity of wind power, and reducing the operating cost of the system.

In this paper, CEEMDAN is used for data preprocessing, and then four kinds of neural networks are used to predict the data after noise reduction. Finally, the unconstrained weighting algorithm based on multi-objective optimization is used to weight the results of four groups of neural networks, and the final interval prediction results are obtained. It is proved that the interval prediction results are advanced and can provide more effective information for wind farm decision-makers. Our combination strategy effectively combines the method of machine learning and improves the prediction accuracy of the combination forecasting model. The generalization performance of the model is good, and it has good accuracy in all kinds of complex situations. It is obvious that the model has high application significance.

In view of the shortcomings of the existing wind speed prediction, this study proposes a combined forecasting model and carries out a multi-objective optimization of the combined neural network on the basis of wind speed data noise reduction. The accuracy and superiority of the model are demonstrated through the verification of several indicators.

Although the combined model has achieved good prediction results, there are still some shortcomings, which may be considered in our future work and are the development direction of wind speed prediction:(1)The operating time of our model is a little longer than some other models, but a high-profile computer may simplify the procedures to address this issue.(2)We only study one-dimensional time series data and lack consideration of other factors that affect wind speed, such as humidity and temperature. Moreover, environmental factors or physical conditions may not be predicable and controllable and may be considered in future work.

Abbreviations

AI:Artificial intelligence
BP:Back-propagation neuron network
SVM:Support vector machine
ELM:Extreme learning machine
WAVENN:Wavelet transform
CEEMDAN:Complementary ensemble empirical mode decomposition with adaptive noise
MOGOA:Multi-objective grasshopper optimization algorithm
MAPE:Mean absolute percentage error
PICP:PI coverage probability
MPIW:Mean PI width
CWC:Coverage width-based criterion
ACE:Average coverage error
MPICD:Mean PI centre deviation
NMPICD:Standardized mean PI centre deviation
DBN:Deep belief network
ZDT1:Test function 1
ZDT3:Test function 3
:Original wind speed data
:Time indicator
:Upper limit of interval
:Lower limit of interval
:The range of target values
:Degree of amplification
:Confidence interval
:Estimate of y
:The mean of x
:The weight of hidden layer to output layer
:Hidden layer node output
:Number of hidden layer nodes
:Number of nodes at the output layer
:The generalized inverse
r:Residual error
ZDT2:Test function 2
ZDT1-Line:Test function 1 line.

Data Availability

All data used in the study are publicly available from the corresponding author upon request. The repositories are refenced in the text.

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Authors’ Contributions

Conceptualization was carried out by Jiyang Wang; methodology and software by Zhirui Tian; and writing by Zhirui Tian and Jiyang Wang. Both authors have read and agreed to the published version of the manuscript.

Supplementary Materials

Graphical Abstract. Because of the volatility and instability of wind speed, it is difficult to accurately predict wind speed by using a single neural network algorithm. In order to correct this problem, we propose a combination model based on data preprocessing, combined neural network algorithm, and multi-objective optimization. The superiority and accuracy of the model are verified by several groups of experiments. (Supplementary Materials)