Rayleigh Lidar Signal Denoising Method Combined with WT, EEMD and LOWESS to Improve Retrieval Accuracy

Zhang, Yijian; Wu, Tong; Zhang, Xianzhong; Sun, Yue; Wang, Yu; Li, Shijie; Li, Xinqi; Zhong, Kai; Yan, Zhaoai; Xu, Degang; Yao, Jianquan

doi:10.3390/rs14143270

Open AccessTechnical Note

Rayleigh Lidar Signal Denoising Method Combined with WT, EEMD and LOWESS to Improve Retrieval Accuracy

by

Yijian Zhang

¹,

Tong Wu

²,

Xianzhong Zhang

¹,

Yue Sun

¹,

Yu Wang

¹,

Shijie Li

¹,

Xinqi Li

¹,

Kai Zhong

^1,*

,

Zhaoai Yan

³,

Degang Xu

¹ and

Jianquan Yao

¹

Key Laboratory of Optoelectronic Information Science and Technology (Ministry of Education), School of Precision Instruments and Opto-Electronic Engineering, Tianjin University, Tianjin 300072, China

²

School of Marine Science and Technology, Tianjin University, Tianjin 300072, China

³

Key Laboratory of Science and Technology on Environmental Space Situation Awareness, National Space Science Center, Chinese Academy of Sciences, Beijing 100190, China

^*

Author to whom correspondence should be addressed.

Remote Sens. 2022, 14(14), 3270; https://doi.org/10.3390/rs14143270

Submission received: 28 May 2022 / Revised: 4 July 2022 / Accepted: 5 July 2022 / Published: 7 July 2022

(This article belongs to the Special Issue Remote Sensing on Theoretical and Observational Issues in Atmospheric Sciences)

Download

Browse Figures

Versions Notes

Abstract

:

Lidar is an active remote sensing technology that has many advantages, but the echo lidar signal is extremely susceptible to noise and complex atmospheric environment, which affects the effective detection range and retrieval accuracy. In this paper, a wavelet transform (WT) and locally weighted scatterplot smoothing (LOWESS) based on ensemble empirical mode decomposition (EEMD) for Rayleigh lidar signal denoising was proposed. The WT method was used to remove the noise in the signal with a signal-to-noise ratio (SNR) higher than 16 dB. The EEMD method was applied to decompose the remaining signal into a series of intrinsic modal functions (IMFs), and then detrended fluctuation analysis (DFA) was conducted to determine the threshold for distinguishing whether noise or signal was the main component of the IMFs. Moreover, the LOWESS method was adopted to remove the noise in the IMFs component containing the signal, and thus, finely extract the signal. The simulation results showed that the denoising effect of the proposed WT-EEMD-LOWESS method was superior to EEMD-WT, EEMD-SVD and VMD-WOA. Finally, the use of WT-EEMD-LOWESS on the measured lidar signal led to significant improvement in retrieval accuracy. The maximum error of density and temperature retrievals was decreased from 1.36% and 125.79 K to 1.1% and 13.84 K, respectively.

Keywords:

rayleigh lidar signal denoising; ensemble empirical mode decomposition; wavelet transform; locally weighted scatterplot smoothing; detrended fluctuation analysis; density retrieval; temperature retrieval

1. Introduction

As an active remote sensing technology, lidar is widely used in autonomous driving, target detection and atmospheric detection, which has the characteristics of high spatial resolution and detection sensitivity [1,2,3,4]. In recent years, the middle atmosphere has received increasing attention. Rayleigh lidar, employing Rayleigh backscattering from atmospheric molecules, has become an effective means for studying middle atmosphere [5,6,7]. However, both the exponential decrease in number density with altitude and the spherical scattered light result in a decrease in lidar echo signal with the altitude squared, severely limiting the effective detection range and retrieval accuracy [8]. Moreover, strong background sunlight noise, dark current noise, and thermal noise interfere with the lidar echo signal [9]. Consequently, the signal-to-noise ratio at high altitudes is too low to retrieve correctly, so the lidar backscattering signal must be denoised.

In the last few decades, many researchers have paid attention to the processing of nonlinear and non-stationary lidar echo signal. The moving averages technique, a low-pass filtering method, is the simplest and most direct denoising method. However, on the one hand, the ideal filter will inevitably depend on the particular scene. On the other hand, this method will create extra errors due to the smearing of resolvable atmospheric structure [10]. Han et al. used the Kalman filtering method to derive water vapor profiles using surface in situ and Raman measurements, but this method may cause retrieval errors in cases of rapidly changing environments [11]. The WT method can decompose the signal into different frequency terms through multiscale analysis, but the main limitations are the selection of suitable wavelet basis functions and the decomposition level according to a particular scene [12,13].

Empirical mode decomposition (EMD) is widely used in the processing of non-stationary signals, due to the fact that its decomposition entirely depends on the signal itself and does not need any priori information [14,15]. Through EMD, a complex signal can be decomposed into a series of IMFs and a residual. The research on EMD mostly focuses on distinguishing whether noise or signal is the main component of the IMFs, or overcoming the shortcomings of EMD, such as modal aliasing and end effects. To solve these problems, Tian et al. suggested typical range (TR) and low-frequency fraction (LFF) as the reference principles to decide how many high-frequency IMFs should be removed as noise [16], and Wu et al. proposed EEMD to effectively suppress modal aliasing [17].

Variational mode decomposition (VMD) is a new method for decomposing signals adaptively [18]. Compared with EMD, VMD is a non-recursive decomposition algorithm, which can overcome the modal aliasing problem and embody Wiener filtering characteristics. In the VMD technique, the selection of the decomposition mode number and quadratic penalty is performed through an optimization algorithm [19]. In addition, the application of machine learning and neural networks for signal denoising has received extensive attention due to their adaptability to include prior information [20]. Sreekanth et al. adopted the dictionary learning technique in denoising lidar signal [21].

In the past, most studies focused on the lidar echo signal with a low signal-to-noise ratio, while ignoring the overall relevance. However, in engineering, it is always necessary to process the lidar echo signal within a certain height range, and therefore the signal-to-noise ratio has a large dynamic range. Since the single denoising method has its own advantages and shortcomings as described above, it is necessary to combine several denoising algorithms for improving the denoising performance. Furthermore, it is more convincing to use the simulated lidar echo signal to verify the effectiveness of the proposed algorithm than to use the synthetic signal.

In this paper, a WT and LOWESS based on the EEMD algorithm, called WT-EEMD-LOWESS, was proposed to extend the effective detection range and improve retrieval accuracy. The proposed method utilized the WT method to remove the noise in the signal with a signal-to-noise ratio higher than 16 dB. The EEMD method was applied to decompose the remaining signal into a series of IMFs, and the signal was reconstructed by using the DFA. Moreover, the LOWESS method was adopted to finely extract the signal in the reconstructed signal. The lidar simulation system with a typical integration time of 1200 s validated the feasibility of the WT-EEMD-LOWESS model. Then, the simulation experiment verified that our algorithm can be effectively applied to the lidar echo signal with different integration times. Finally, the denoising algorithm was investigated on the real Rayleigh lidar signal and experimental results showed that the WT-EEMD-LOWESS method can be directly adapted to engineering.

2. Rayleigh Lidar Simulation System

To select different yet available denoising methods for echo signal with different signal-to-noise ratios, the noisy lidar echo signal was simulated for comparison experiments. In the case of the Rayleigh lidar system, the return signal can be described as follows:

N_{R} (λ, R) = (\frac{E_{0}}{h c / λ}) [σ_{180} (λ, R) n (R) Δ R] \frac{A}{R^{2}} [η (λ) T_{t} T_{r} T^{2} (λ, R) G (R)] + N_{B}

(1)

where

N_{R} (λ, R)

is the number of received photons, when the wavelength of emitted laser is

λ

and the detection distance is R;

E_{0}

is pulse energy of the emitted laser; h is Plank’s constant;

σ_{180} (λ, R)

is the backscatter cross-section,

σ_{180} (λ, R) = \frac{1}{4 π} \times \frac{3}{2} \times 5.16 \times 10^{- 31} m^{- 2}

;

n (R)

is the number density of atmospheric molecules;

Δ R

is range resolution; A is receiving area of telescope;

η (λ)

is detector quantum efficiency;

T_{t}

and

T_{r}

represent transmitting and receiving optical efficiencies, respectively;

T^{2} (λ, R)

is two-way atmospheric transmittance;

G (R)

is geometric overlap factor;

N_{B}

is the noise. The first term of the expression represents the number of emitted photons, the second term indicates the probability of backscattering of atmospheric molecules, the third term represents the ability of the detector to accept backscattered light and the last term stands for the system efficiency and transmittance. The schematic of the Rayleigh lidar system is illustrated in Figure 1.

The lidar signal is inevitably affected by noise during the detection process. The main sources of noise are background radiation noise, detector thermal noise and shot noise. The calculation formulas of background radiation noise

N_{b}

and detector thermal noise

N_{d}

are as follows:

N_{b} = \frac{η λ}{h c} P_{b} {(\frac{θ_{R}}{2})}^{2} Δ λ A Δ t T_{r}

(2)

N_{d} = C P S \cdot Δ t

(3)

where

P_{b}

is the radiances of the sky, which were calculated using the MODTRAN software [22];

θ_{R}

is the field of view of the receiving telescope;

Δ λ

is the bandwidth of the filter;

C P S

is the dark count of the detector;

Δ t

is the acquisition time of the system. Table 1 lists the parameters of our simulation system.

Shot noise follows a Poisson distribution. Therefore, when using the lidar equation to simulate the echo signal, Poisson noise needs to be added. Since the Rayleigh signal per laser pulse is too weak, averaging the pulses or integrating them is often used to increase the signal-to-noise ratio. Figure 2 shows the ideal echo signal obtained from the simulation and the noisy signal after subtracting the background radiation noise under the condition that the platform height is 20 km, the measurement altitude range is 30–70 km, and the integration time is 1200 s.

3. Methodology Descriptions

3.1. Brief Description of the WT Algorithm

Wavelet transform is developed on the basis of short-time Fourier transform and has good time-frequency localization characteristics. Compared with Fourier transform, which only has sine or cosine function as a basis, the types of wavelet basis are diverse. Previous experiments have proved that “

d B 4

” and “

S y m 4

” have achieved good results in the field of lidar signal processing.

In the wavelet threshold denoising, the raw signal is divided into approximation and detail coefficients. The orthogonal wavelet basis can mainly concentrate useful signal energy in some large wavelet coefficients. Therefore, the employment of threshold can retain the large useful signal coefficients and zero or shrink the the small noise coefficients, so as to denoise. In practice, the selection of the wavelet basis, the decomposition level and threshold value is a key point in algorithm design. The specific steps of wavelet threshold denoising are as follows:

Step 1: According to the characteristics of Rayleigh scattering echo signal, the proper wavelet basis and decomposition layer number N are selected, and the original return lidar data is decomposed with N layers based on wavelet transformation.

Step 2: Selecting the soft threshold method to carry on the threshold quantization processing for the high frequency coefficients from layer 1 to layer N by different threshold value of each decomposition layer.

Step 3: The low-frequency coefficient of the Nth layer and all high-frequency coefficients of each layer after wavelet threshold processing are reconstructed by inverse wavelet transformation, and the required signal is obtained.

3.2. Basic Theory of EEMD

EEMD is an improved algorithm of EMD, so the EMD algorithm is introduced first. The EMD method can adaptively decompose signals into a series of IMFs and a residual component based on the local characteristic time scale of the signal itself, which is given by

x (t) = \sum_{i = 1}^{n} i m f_{i} + r_{n}

(4)

Each IMF stands for the information on different scales of the original signal and must satisfy two conditions: (1) the number of extrema and zero crossings must either be equal or differ by one at most; (2) at any point, the mean value of the envelope defined by the local maxima and minima is zero. The first condition is a common signal analysis requirement for narrowband signals and the second condition ensures the local symmetry of the IMFs.

For any original signals

x (t)

, the procedure of EMD can be described as follows:

1.: Identifying the local extrema of $x (t)$ .
2.: Using the cubic spline interpolation function to form the upper envelope $U (t)$ and the lower envelope $L (t)$ .
3.: Calculating the mean value $m (t)$ of the upper and lower envelopes.

$m (t) = [U (t) + L (t)] / 2$

(5)
4.: Denoting the difference in value between $x (t)$ and $m (t)$ as $h (t)$ . If $h (t)$ satisfies conditions for IMFs, then $h (t)$ is defined as the first IMF. If not, $x (t)$ is represented as $h (t)$ . Repeating the steps 1 to 3 K times until $h_{K} (t) = h_{K - 1} (t) - m_{K - 1} (t)$ meets IMF conditions and $h_{K} (t)$ will be the first IMF of the signal.

$h (t) = x (t) - m (t)$

(6)
5.: Separating the first IMF from $x (t)$ . The residue item $r (t)$ can be computed with the following equation:

$r (t) = x (t) - im f_{1} (t)$

(7)

If

r (t)

does not become a monotone function or the number of extrema is less than one or equal to one,

x (t)

is replaced by

r (t)

. Repeating steps 1 to 4 n times, the n IMFs of

x (t)

will be computed.

In order to solve modal aliasing problems of EMD, the EEMD algorithm, an effective noise-aided data analysis (NADA) method, adds white noise to the original signal by using the zero-mean characteristic of white noise. Because the white noise spectrum is evenly distributed, the white-noise signals will be automatically distributed to the appropriate reference scales. The specific procedure of the EEMD method is expressed as follows:

1.: Adding white noise, which follows a normal distribution with mean 0 and standard deviation $σ$ , N times to the original signal $x (t)$ .Then, the new data series can be computed as follows:

$x_{i} (t) = x (t) + \sum_{j = 1}^{N} σ n_{j} (t), n_{j} (t) \sim N (0, 1)$

(8)
2.: Decomposing the signal added with Gaussian white noise by EMD to obtain the ith set of decomposition results, which includes IMF components $c_{i j} (t)$ (j = 1, 2, …, m) and a residual $r_{i} (t)$ .
3.: Repeating the above steps n times, the overall average values $c_{j} (t)$ and $r (t)$ are used as the results of the jth IMF component and the residual decomposed by EEMD, respectively.

$c_{j} (t) = \frac{1}{n} \sum_{i = 1}^{n} c_{i j} (t)$

(9)

$r (t) = \frac{1}{n} \sum_{i = 1}^{n} r_{i} (t)$

(10)

3.3. Principle of DFA

Selecting specific IMF components for superposition can form high-pass, low-pass and band-pass filters. Because the signal is concentrated on the low-frequency IMFs while noise is devoted to the high-frequency IMFs, denoising can be achieved by discarding some high-frequency components. DFA is a scaling analysis method to study long-range dependency for the non-stationary signal. DFA is utilized as a reliable metric to determine noisy IMFs resulting from the EMD [23]. Therefore, in this paper, DFA is used to distinguish whether noise or signal is the main component of the IMFs.

For a given time series

\{v (i), i = 1, 2, \dots, N\}

with the length N, the main procedure of the DFA method is expressed as follows:

1.: Integrating the time series of the mean detached signal and the cumulative series $y (n)$ can be computed as follows:

$y (k) = \sum_{i = 1}^{N} [v (i) - \bar{v}], k = 1, 2, \dots, N$

(11)

where $\bar{v}$ is the average of $v (i)$ , $\bar{v} = \frac{1}{N} \sum_{i = 1}^{N} v (i)$ .
2.: The integrated series $y (n)$ was then divided into equal and non-overlapping windows of length n. For each window, the local linear trend $y_{n} (k)$ is calculated by a polynomial fitting of order l. The root mean square fluctuation $F (n)$ is given by the equation

$F (n) = \sqrt{\frac{1}{N} \sum_{k = 1}^{N} {[y (k) - y_{n} (k)]}^{2}}$

(12)
3.: The scaling exponent $α$ is calculated as a slope of the curve $l o g (F (n)) / l o g (n)$ as follows:

$F (n) \propto n^{α}$

(13)

The scaling exponent

α

is related to the Hurst exponent H, which reflects the correlation of time series. The value

α = 0.5

indicates that the time series is uncorrelated (white noise).

0 < α < 0.5

indicates that the time series is noise, which has short-range correlation and poor self-similarity between local and global data sequences.

α > 0.5

represents the time series signal, which has long-range correlation.

3.4. Brief Description of the LOWESS Method

When processing lidar echo signals, it is expected to be less sensitive to photon fluctuations mainly caused by shot noise without distorting the signal. The LOWESS algorithm, first proposed by Cleveland in 1979 [24], effectively solves this problem. With the LOWESS method, the value of each point is an iterative weighted least square regression through a nearby data point within a specified span. The steps of the LOWESS method are as follows:

1.: Estimating the span width, which usually contains more than 13 data points and the initial weight of points $(x_{i}, y_{i})$ within the span width is defined as a cubic weighting function as follows:

$w_{i} = \{\begin{matrix} (1 - {|\frac{x - x_{i}}{d (x)}|}^{3}, |\frac{x - x_{i}}{d (x)}| \leq 1) \\ 0, |\frac{x - x_{i}}{d (x)}| > 1 \end{matrix}\}$

(14)

where $d (x)$ is the furthest distance between x and $x_{i}$ in the specified span width.
2.: The first-order polynomial fit with the formula $y = a x + b$ is used to estimate the value of $\hat{y}$ at x. The slope a and constant b are defined as:

$a = \frac{\sum w_{i} (x - \bar{x}) \cdot (y - \bar{y})}{\sum {w_{i}}^{2} {(x - \bar{x})}^{2}}$

(15)

$b = \bar{y} - a \bar{x}$

(16)

where $\bar{x}$ and $\bar{y}$ are the weighted averages of x and y, respectively.
3.: The robust weight function is defined to calculate the new weight using the residual of the estimated formula. The robust weighting is:

$δ_{i} = \{\begin{matrix} {(1 - {|e_{i}|}^{2})}^{2}, e_{i} \leq 1 \\ 0, o t h e r w i s e \end{matrix}\}$

(17)

where $e_{i} = \frac{r_{i}}{6 M e d i a n (|r_{1}|, |r_{2}|, \dots, |r_{n}|)}$ and $r_{i} = y_{i} - {\hat{y}}_{i}$ is the error of each y value.
4.: Repeating steps 2–3 with the new weighting so that the smoothed value of any point $(x, y)$ could be obtained after several cycles. The theoretical cycles were repeated less than four times.

3.5. Proposed WT-EEMD-LOWESS Algorithm

A new denoising method, called the WT-EEMD-LOWESS method, for Rayleigh lidar echo signals with a large SNR range is proposed. Figure 3 shows the flowchart of the WT-EEMD-LOWESS algorithm and the specific steps is expressed as follows:

1.: The echo signal is divided into two segments according to Equation (20), and the high SNR signals are imported into the WT method for denoising.
2.: The low SNR signals are decomposed by EEMD and the scaling exponent $α$ is calculated through the DFA algorithm to select IMFs for the signal reconstruction. Then a LOWESS algorithm is performed on the reconstructed signal for further denoising.
3.: The two denoised signals are spliced to obtain the final signals.

4. Results and Discussion

4.1. Experiments with Simulated Signals

In order to evaluate and compare the performance of the proposed method and the other signal processed methods, the evaluation indexes were set as signal-to-noise ratio (SNR) and the root mean square error (RMSE).

S N R = 10 lg (\frac{\sum_{n = 1}^{N} f^{2} (n)}{\sum_{n = 1}^{N} {[f^{'} (n) - f (n)]}^{2}})

(18)

R M S E = \sqrt{\frac{1}{n} \sum_{n = 1}^{N} {[f^{'} (n) - f (n)]}^{2}}

(19)

where

f (n)

is the original signal,

f^{'} (n)

is the denoised signal and N is the length of the signal.

In engineering, the SNR of the lidar signal that needs to be processed often has a large dynamic range, and single denoising methods do not have good performance for the whole echo profile. So we plan to divide the signal to be processed into two parts according to the SNR and explore which method is more suitable in the case of integration time of 1200 s. The denoising performance of different algorithms, namely, moving averages (MA), wavelet hard thresholding (WTH), wavelet soft thresholding (WTS), EMD combined with DFA (EMD), EEMD combined with DFA (EEMD) and VMD combined with whale optimization algorithm (VMD-WOA) were compared for different evaluation indexes. As shown as Table 2, the WT method could achieve better results when dealing with high SNR signal, while the decomposition algorithm had good performance when processing low SNR signal.

In order to make the algorithm as simple and efficient as possible on the basis of the above conclusions, we continued to explore the maximum height range that the WTS method can effectively denoise under the condition that the absolute error of density retrieval was less than 5% and the absolute error of temperature retrieval was less than ±10 K by the CH method [25]. The following paragraphs were also calculated under this condition. Here, the WT method used the “

d B 4

” wavelet basis and had three decomposition layers. Since the lidar echo signal obey Poisson distribution so that the signal are random, the mean value of the absolute errors of 100 profiles determined denoising performance of the following paragraphs. As shown in Figure 4, when the maximum detection range was 59 km and the

S N R_{m}

of the echo signal was generally above 16 dB, the WTS method could obtain valid data with the maximum absolute error of density retrieval of 1.02% and the maximum absolute error of temperature retrieval of 8.36 K.

It should be noted that the calculation of the SNR here is different from the previous one, due to the ideal signal not being able to be acquired in actual measurement. Considering the statistical properties of the detector, the SNR under Poisson distribution is defined as the ratio of the target backscatter echo signal to the square root of the original echo signal, where the original echo signal includes the target backscatter echo signal and the background noise. Usually, after a certain distance, the background counts of the lidar do not change with height, then it is considered that the photon counts collected by lidar is only the background noise. Thus, the average value of the photon counts after the certain distance can be taken as the background noise. The expression of

S N R_{m}

is as follows:

S N R_{m} = \frac{P_{r a w} (z) - B G}{\sqrt{P_{r a w} (z)}}

(20)

where

P_{r a w} (z)

is the original echo signal and

B G

is the background noise. Figure 5 shows the change

S N R_{m}

with height.

Taking the simulated signal with the

S N R_{m}

below 16 dB in Section 2 as an example, the DFA method was used to calculate the scaling exponent

α_{i}

of of each IMF for verification, and the results are given in Table 3. Here, the length of non-overlapping windows was an eighth of the length of the input signal. The scaling exponent of

I M F_{1} - I M F_{3}

were all less than 0.5, so they were considered as noise; The scaling exponent of

I M F_{4} - I M F_{6}

were all higher than 0.5, so they were considered as signal.

For the purpose of validating the accuracy of the DFA method in distinguishing whether noise or signal was the main component of the IMFs, it was accumulated layer-by-layer from

I M F_{2}

to

I M F_{6}

. Meanwhile, SNR and RMSE were calculated, respectively. Comparing the original signal with the accumulated signal shown in Table 4, it was found that the accumulated signal of 4–6 layer IMFs was the closest to the original signal with the largest SNR and the smallest RMSE. This result is consistent with the results in Table 3, proving that the DFA method could be used as a reliable algorithm for qualification between noise and signal.

To explore which decomposition algorithm was more suitable for low

S N R_{m}

signal, the EMD, EEMD, CEEMDAN, and VMD-WOA algorithms were selected for comparison. In the experiments with simulated signal, it was found that since the echo signals obey Poisson distribution and have a certain randomness, there was no method that always had the best denoising performance, which was also related to the selected SNR threshold. So as to solve this problem and find a generally suitable denoising method, the average SNR (

S N R_{a v e}

) was used as the evaluation index to determine the denoising performance on the basis of multiple profiles. When selecting the signal with

S N R_{m}

below 16 dB, after 100 statistics, the EEMD algorithm showed the best performance in 55 of them, and ranked first with 28.56 dB as shown in Table 5. For EEMD, the standard deviation of Gaussian white noise was 0.1, and Gaussian white noise was added 50 times.

Based on the above findings, we proposed to combine EEMD and LOWESS to achieve better denoising performance. Table 6 shows that the proposed EEMD-LOWESS outperformed all the other proposed methods [26] which could retain more effective signal. In addition, it was found that it was more effective at removing noise in low-frequency IMFs than extracting signal in high-frequency IMFs.

The denoising performance of the WT-EEMD-LOWESS algorithm is shown in Figure 6. The echo signals after denoising were smoother, more stable and closer to the ideal signal. The denoised data obtained through the above process were used as the input data of the CH method, and the density and temperature retrieval of the simulated signal could be gained, as presented in Figure 7a,b and then in contrast with the model temperature, the reliability of the proposed algorithm could be evaluated. It was not practical to calculate the absolute value of density due to several parameters fluctuating with measurement, so the relative density was calculated instead. As shown in Figure 8, under the condition that the error of density and temperature retrieval does not exceed 5% and 10 K, the maximum detection range was 67.5 km and 58.4 km after denoising, which were 12.8 km and 11.1 km higher than those without denoising, respectively.

Futhermore, the algorithm was applied to simulated the lidar echo signal with integration times of 300–3000 s. The variation of the maximum detection range with the integration time is shown in Figure 9. It could be found that the maximum detection range increased with the accumulation of the integration time, which was conformed to the law. When the error of density retrieval was less than

5 %

, the maximum detection range increased first fast and then slowly with the integration time. The reason is that when the integration time was too long, the detection range was close to the limit of 70 km. The maximum detection range changed slowly with the integration time at 60 km when the error of temperature retrievals was less than 10 K. The reason is that the measurement was inaccurate within 10 km below the seed temperature, which is an inherent shortcoming of the CH method.

4.2. Real Experiment on a Lidar Echo Signal

In order to further investigate the denoising performance of the algorithm proposed in this paper, we used the measurement data provided by the National Space Science Center, CAS. The signal was obtained by continuously observing the atmosphere at 40

^{\circ}

18

^{'}

N latitude and 116

^{\circ}

40

^{'}

E longitude by ground-based lidar and the parameters of lidar system are shown in Table 7 below. The observation time was 3 September 2018, 17:30. The measurement results and denoising performance are shown in Figure 10.

Based on the denoised echo signal, the density and temperature retrievals of the measured data can be obtained by the CH method, as shown in Figure 11. The maximum error between density retrieval and model was reduced from

1.36 %

to

1.1 %

, and the maximum error of temperature retrieval and model was reduced from 125.79 K to 13.84 K. The improvement of the maximum detection distance was not obvious with error of density and temperature retrievals less than

5 %

and

10

K separately, due to the fact that the measured original signal is different from the model when the SNR is high. It can also be seen that when the SNR is low, the performance improvement of the proposed algorithm is more obvious, and when the SNR is high, the discrepancy between the retrievals and model depends more on the original data itself.

5. Conclusions

Based on WT and EEMD, a new WT-EEMD-LOWESS denoising method for signal with large SNR dynamic range was proposed by using the simulated lidar signal. The echo signal was divided into two parts with SNR of 16 dB as the boundary. The high SNR part was denoised by the WT method; the low SNR part was decomposed by the EEMD algorithm first, and the scaling exponent

α

of each IMF was calculated by DFA, then the IMF with

α > 0.5

was used to obtain the reconstructed signal. The reconstructed signal still contained noise, so the LOWESS method was performed on the reconstructed signal to futher extract the signal. Moreover, in order to verify the effectiveness of the algorithm, the simulated denoised signal with a typical integration time of 1200 s was used as the input data of the CH method for density and temperature retrievals. In the case that the absolute error of density retrieval was less than

5 %

and the absolute error of temperature retrieval was less than

5

K, the maximum detection range improved 12.8 km and 11.1 km, respectively. This algorithm was also verified to be effective with an integration time of 300–3000 s. In addition, the WT-EEMD-LOWESS method was applied to the measured lidar echo signal. The experimental results showed that the algorithm achieved a good denoising performance in the low SNR signal part on the basis of not distorting the signal. The maximum density error was reduced from 1.36% to 1.1% and the maximum temperature error was reduced from 125.79 K to 13.84 K which indicated that the proposed algorithm has good application value in Rayleigh scattering lidar signal processing.

Author Contributions

Conceptualization, Y.Z. and D.X.; methodology, Y.Z. and K.Z.; software, Y.Z. and X.Z.; validation, Y.Z. and T.W.; investigation, Y.Z.; data curation, Z.Y.; writing—original draft preparation, Y.Z.; writing—review and editing, K.Z. and T.W.; visualization, Y.S., Y.W., S.L. and X.L.; project administration, K.Z. and J.Y.; funding acquisition, K.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Natural Science Foundation of China (NSFC) grant number 62175184 and Tianjin Key Laboratory of Optoelectronic Sensor and Sensing Network Technology.

Conflicts of Interest

The authors declare no conflict of interest.

References

Li, Z.P.; Huang, X.; Cao, Y.; Wang, B.; Li, Y.H.; Jin, W.; Yu, C.; Zhang, J.; Zhang, Q.; Peng, C.Z.A. Single-photon computational 3D imaging at 45 km. Photon. Res. 2020, 8, 1532–1540. [Google Scholar] [CrossRef]
Bu, L.; Wang, Q.; Xu, J.; Zhu, S.; Tian, L. Validation of an Airborne High Spectral Resolution Lidar and Its Measurement for Aerosol Optical Properties over Qinhuangdao, China. Opt. Express 2020, 28, 24471–24488. [Google Scholar]
Xue, W.; Liu, L.; Dai, X.; Luo, Y. Moving target ranging method for a photon-counting system. Opt. Express 2018, 26, 34161–34178. [Google Scholar] [CrossRef]
Kaifler, B.; Rempel, D.; Roi, P.; Büdenbender, C.; Baturkin, V. A technical description of the Balloon Lidar Experiment BOLIDE. Atmos. Meas. Tech. 2020, 13, 5681–5695. [Google Scholar] [CrossRef]
Pan, D.; Zhang, T.; Chen, W.; Liu, J. Measurements of density, pressure and temperature in the middle atmosphere with Rayleigh lidar. In Proceedings of the International Symposium on Optoelectronic Technology and Application, Beijing, China, 9–11 May 2016. [Google Scholar]
Jalali, A.; Sica, R.J.; Haefele, A. Improvements to a long-term Rayleigh-scatter lidar temperature climatology by using an optimal estimation method. Atmos. Meas. Tech. 2018, 11, 6043–6058. [Google Scholar] [CrossRef] [Green Version]
Khanna, J.; Bandoro, J.; Sica, R.J.; Mcelroy, C.T. New technique for retrieval of atmospheric temperature profiles from Rayleigh-scatter lidar measurements using nonlinear inversion. Appl. Opt. 2012, 51, 7945–7952. [Google Scholar] [CrossRef]
Chang, J.; Zhu, L.; Li, H.; Fan, X.; Yang, Z. Noise reduction in Lidar signal using correlation-based EMD combined with soft thresholding and roughness penalty. Opt. Commun. 2018, 407, 290–295. [Google Scholar] [CrossRef]
Fang, H.T.; Huang, D.S. Noise reduction in lidar signal based on discrete wavelet transform. Opt. Commun. 2004, 233, 67–76. [Google Scholar] [CrossRef]
Sarvani, M.; Raghunath, K.; Rao, S. Lidar signal denoising methods- application to NARL Rayleigh lidar. J. Opt. 2015, 44, 164–171. [Google Scholar] [CrossRef]
Han, Y.; Westwater, E.R.; Ferrare, R.A. Applications of Kalman Filtering to Derive Water Vapor Profiles from Raman Lidar and Microwave Radiometers. In Proceedings of the International Geoscience & Remote Sensing Symposium, Honolulu, HI, USA, 25–30 July 2010. [Google Scholar]
Mao, J. Noise reduction for lidar returns using local threshold wavelet analysis. Opt. Quantum Electron. 2012, 43, 59–68. [Google Scholar] [CrossRef]
Zhou, Z.; Hua, D.; Wang, Y.; Yan, Q.; Li, S.; Yan, L.; Wang, H. Improvement of the signal to noise ratio of Lidar echo signal based on wavelet de-noising technique. Opt. Lasers Eng. 2013, 51, 961–966. [Google Scholar] [CrossRef]
Boudraa, A.O.; Cexus, J.C. EMD-based Signal Filtering. IEEE Trans. Instrum. Meas. 2007, 56, 2196–2202. [Google Scholar] [CrossRef]
Gong, W.; Li, J.; Mao, F.; Zhang, J. Comparison of simultaneous signals obtained from a dual-field-of-view lidar and its application to noise reduction based on empirical mode decomposition. Chin. Opt. Lett. 2011, 9, 4. [Google Scholar]
Tian, P.; Cao, X.; Liang, J.; Zhang, L.; Yi, N.; Wang, L.; Cheng, X. Improved empirical mode decomposition based denoising method for lidar signals. Opt. Commun. 2014, 325, 54–59. [Google Scholar] [CrossRef]
Zhaohua, W.U.; Huang, N.E. Ensemble Empirical Mode Decomposition: A Noise-Assisted Data Analysis Method. Adv. Adapt. Data Anal. 2009, 1, 1–41. [Google Scholar]
Dragomiretskiy, K.; Zosso, D. Variational Mode Decomposition. IEEE Trans. Signal Process. 2014, 62, 531–544. [Google Scholar] [CrossRef]
Li, H.; Chang, J.; Fan, X.; Liu, Z.; Liu, B. Efficient Lidar Signal Denoising Algorithm Using Variational Mode Decomposition Combined with a Whale Optimization Algorithm. Remote Sens. 2019, 11, 126. [Google Scholar] [CrossRef] [Green Version]
Sun, B.Y.; Huang, D.S.; Fang, H.T. Lidar signal denoising using least-squares support vector machine. IEEE Signal Process. Lett. 2005, 12, 101–104. [Google Scholar]
Sreekanth, V.S.; Raghunath, K.; Mishra, D. Dictionary learning technique and penalized maximum likelihood for extending measurement range of a Rayleigh lidar. J. Appl. Remote Sens. 2021, 14, 034529. [Google Scholar] [CrossRef]
Berk, A.; Conforti, P.; Kennett, R.; Perkins, T.; Hawes, F.; van den Bosch, J. MODTRAN® 6: A major upgrade of the MODTRAN® radiative transfer code. In Proceedings of the 2014 6th Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing (WHISPERS), Lausanne, Switzerland, 24–27 June 2014; pp. 1–4. [Google Scholar] [CrossRef]
Mert, A.; Akan, A. Detrended fluctuation analysis for empirical mode decomposition based denoising. In Proceedings of the 2014 22nd European Signal Processing Conference (EUSIPCO), Lisbon, Portugal, 1–5 September 2014. [Google Scholar]
Cleveland, W.S. Robust Locally Weighted Regression and Smoothing Scatterplots. J. Am. Stat. Assoc. 1979, 74, 829–836. [Google Scholar] [CrossRef]
Alain, H.; Marie-Lise, C. Density and temperature profiles obtained by lidar between 35 and 70 km. Geophys. Res. Lett. 1980, 7, 565–568. [Google Scholar]
Cheng, X.; Mao, J.; Li, J.; Zhao, H.; Rao, Z. An EEMD-SVD-LWT algorithm for denoising a lidar signal. Measurement 2021, 168, 108405. [Google Scholar] [CrossRef]

Figure 1. Schematic diagram of the Rayleigh lidar system.

Figure 2. The ideal echo signals and the noisy signals in the case of integration time of 1200 s.

Figure 3. Flowchart of the proposed WT-EEMD-LOWESS algorithm.

Figure 4. The absolute error of (a) density and (b) temperature retrieval with the WT method.

Figure 5. The

S N R_{m}

changes with height.

Figure 5. The

S N R_{m}

changes with height.

Figure 6. Denoising performance of WT-EEMD-LOWESS algorithm.

Figure 7. The variation of (a) density and (b) temperature retrievals with WT-EEMD-LOWESS algorithm.

Figure 8. The absolute error of (a) density and (b) temperature retrievals with the WT-EEMD-LOWESS method.

Figure 9. The variation of the maximum detection range with the integration time under the condition that (a) the error of density retrieval was less than 5% and (b) the error of temperature retrieval was less than 10 K.

Figure 10. The noisy echo signal and the signal after denoising of ground-based lidar.

Figure 11. The variation of (a) density and (b) temperature retrievals with WT-EEMD-LOWESS algorithm of ground-based lidar.

Table 1. Simulation parameters of lidar system.

System Parameter	Value
Laser pulse energy/mJ	40
Telescope diameter/mm	350
Field of view/ $μ$ rad	165
Detection quantum efficiency	0.5
Total optical transmittance	0.4
Dark counts/counts per second	50
Range resolution/m	100
Filter bandwidth/nm	0.25
Pulse repetition frequency/Hz	50
Platform height/km	20

Table 2. Denoising performance with different heights for each algorithm.

Height Range/km	RAW	MA	WTH	WTS	EEMD	VMD-WOA
30–40	59.7109/1233.5	38.9091/1352.8	58.1694/1473.0	60.7532/1094.0	57.1453/1657.4	23.8776/7634.9
30–50	60.1029/837.7	34.2279/1647.4	60.9346/761.2	62.7611/616.8	56.6552/837.7	22.4360/6403.4
30–60	58.9623/780.6	32.1997/1700.4	59.1537/763.6	60.3729/663.6	55.7952/1124.0	22.4334/5234.4
30–70	59.2095/657.3	31.2823/1637.3	59.6589/624.2	61.2158/521.7	55.5927/996.8	22.4416/4530.8
40–70	42.6718/349.4	34.0989/937.5	44.6919/276.9	47.8397/192.7	47.2425/206.4	24.6075/2795.9
50–70	28.4629/297.1	35.5781/130.9	32.1896/193.4	35.8858/132.5	35.0250/139.6	26.3373/379.4
60–70	16.3705/282.9	26.1785/91.4	18.1742/229.8	20.8954/167.9	23.7415/121.1	26.1806/91.4

Table 3. The scaling exponent

α

of each IMF.

Table 3. The scaling exponent

α

of each IMF.

	$IM F_{1}$	$IM F_{2}$	$IM F_{3}$	$IM F_{4}$	$IM F_{5}$	$IM F_{6}$
$α$	0.19	0.12	0.42	1.00	1.48	1.99

Table 4. Denoising performance of layer-by-layer accumulating of IMFs.

	1–6 IMFs	2–6 IMFs	3–6 IMFs	4–6 IMFs	5–6 IMFs	6th IMF
SNR/dB	18.22	21.43	25.11	28.88	25.00	24.73
RMSE	286.97	198.32	129.87	84.11	131.53	135.67

Table 5. Denoising performance of different algorithms for low

S N R_{m}

signals.

Table 5. Denoising performance of different algorithms for low

S N R_{m}

signals.

	RAW	EMD	EEMD	CEEMDAN	VMD-WOA
$S N R_{a v e}$ /dB	18.56	27.34	28.56	27.71	26.47
Best times		12	55	18	15

Table 6. Denoising performance of proposed algorithms compared with other methods.

	RAW	EEMD	EEMD-NLM	EEMD-WTS	EEMD-SVD	EEMD-LOWESS
SNR/dB	18.56	28.56	20.92	28.97	26.46	30.54

Table 7. Ground-based lidar system.

System Parameter	Value
Laser pulse energy/mJ	500
Telescope diameter/m	1
Average power/W	>10
Wavelength/nm	532
Pulse width/ns	8
Quantum efficiency	0.4
Filter transmittance	0.8
System optical efficiency	0.02
Pulse repetition frequency/Hz	30
Degree from zenith/°	0

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhang, Y.; Wu, T.; Zhang, X.; Sun, Y.; Wang, Y.; Li, S.; Li, X.; Zhong, K.; Yan, Z.; Xu, D.; et al. Rayleigh Lidar Signal Denoising Method Combined with WT, EEMD and LOWESS to Improve Retrieval Accuracy. Remote Sens. 2022, 14, 3270. https://doi.org/10.3390/rs14143270

AMA Style

Zhang Y, Wu T, Zhang X, Sun Y, Wang Y, Li S, Li X, Zhong K, Yan Z, Xu D, et al. Rayleigh Lidar Signal Denoising Method Combined with WT, EEMD and LOWESS to Improve Retrieval Accuracy. Remote Sensing. 2022; 14(14):3270. https://doi.org/10.3390/rs14143270

Chicago/Turabian Style

Zhang, Yijian, Tong Wu, Xianzhong Zhang, Yue Sun, Yu Wang, Shijie Li, Xinqi Li, Kai Zhong, Zhaoai Yan, Degang Xu, and et al. 2022. "Rayleigh Lidar Signal Denoising Method Combined with WT, EEMD and LOWESS to Improve Retrieval Accuracy" Remote Sensing 14, no. 14: 3270. https://doi.org/10.3390/rs14143270

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Rayleigh Lidar Signal Denoising Method Combined with WT, EEMD and LOWESS to Improve Retrieval Accuracy

Abstract

1. Introduction

2. Rayleigh Lidar Simulation System

3. Methodology Descriptions

3.1. Brief Description of the WT Algorithm

3.2. Basic Theory of EEMD

3.3. Principle of DFA

3.4. Brief Description of the LOWESS Method

3.5. Proposed WT-EEMD-LOWESS Algorithm

4. Results and Discussion

4.1. Experiments with Simulated Signals

4.2. Real Experiment on a Lidar Echo Signal

5. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI