Next Article in Journal
A Model for Generating Workplace Procedures Using a CNN-SVM Architecture
Next Article in Special Issue
Flexible Birnbaum–Saunders Distribution
Previous Article in Journal
Multi-Aspect Embedding for Attribute-Aware Trajectories
Previous Article in Special Issue
An Asymmetric Bimodal Distribution with Application to Quantile Regression
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Asymmetric Distribution with Heavy Tails and Its Expectation–Maximization (EM) Algorithm Implementation

1
Departamento de Matemáticas, Facultad de Ciencias Básicas, Universidad de Antofagasta, Antofagasta 1240000, Chile
2
Departamento de Ciencias Matemáticas y Físicas, Facultad de Ingeniería, Universidad Católica de Temuco, Temuco 4780000, Chile
3
Departamento de Matemática, Facultad de Ingeniería, Universidad de Atacama, Copiapó 1530000, Chile
*
Author to whom correspondence should be addressed.
Symmetry 2019, 11(9), 1150; https://doi.org/10.3390/sym11091150
Submission received: 8 August 2019 / Revised: 31 August 2019 / Accepted: 2 September 2019 / Published: 10 September 2019

Abstract

:
In this paper we introduce a new distribution constructed on the basis of the quotient of two independent random variables whose distributions are the half-normal distribution and a power of the exponential distribution with parameter 2 respectively. The result is a distribution with greater kurtosis than the well known half-normal and slashed half-normal distributions. We studied the general density function of this distribution, with some of its properties, moments, and its coefficients of asymmetry and kurtosis. We developed the expectation–maximization algorithm and present a simulation study. We calculated the moment and maximum likelihood estimators and present three illustrations in real data sets to show the flexibility of the new model.

1. Introduction

In recent years, for data with positive support, specifically, lifetime, or reliability, the half-normal (HN) model has been widely used. The probability density function (pdf) is given by
f ( x ; σ ) = 2 σ ϕ x σ I { x > 0 } ,
where σ > 0 is the scale parameter and ϕ ( · ) represents the standard normal pdf. We denote this by writing X H N ( σ ) .
Some generalizations for this model are proposed by Cooray and Ananda [1], Cordeiro et al. [2], Bolfarine and Gómez [3] and Gómez and Vidal [4].
Olmos et al. [5] extended the HN distribution by incorporating a kurtosis parameter q, with the purpose of obtaining heavier tails, i.e., it has greater kurtosis than the base model. They called this model the slashed half-normal (SHN) distribution. Its construction is based on considering the quotient of two independent random variables, with random variable X H N ( σ ) in the numerator and the U U ( 0 , 1 ) in the denominator (See Rogers and Tukey [6] and Mosteller and Tukey [7] for more details). Thus a model is obtained that has more flexible coefficients of asymmetry and kurtosis than the HN model. We say that a random variable T follows a SHN if its pdf is given by
f T ( t ; σ , q ) = q 2 q π σ q Γ ( ( q + 1 ) / 2 ) t ( q + 1 ) G t 2 ; ( q + 1 ) / 2 , 1 2 σ 2 , t > 0 ,
where σ > 0 is a scale parameter, q > 0 is a kurtosis parameter, G ( z ; a , b ) = 0 2 g ( x ; a , b ) d x is the cumulative distribution function (cdf) of the gamma distribution and g ( · ; a , b ) is the pdf of the gamma model with shape and rate parameters a and b, respectively.
Reyes et al. [8] introduced the modified slash (MS) distribution. We say that M has a MS distribution if
M = Z / E 1 q ,
the construction of which is based on considering an exponential (Exp) distribution with parameter 2 in the denominator, i.e., they consider that E E x p ( 2 ) . The motivation of the selection of the E x p ( 2 ) distribution is given in Reyes et al. [8]. The result of this work shows that the MS model has a greater coefficient of kurtosis and this characteristic is very important for modeling data sets when they contain atypical observations.
The principal goal of this article is to use the idea published by Reyes et al. [8] to construct an extension of the half-normal model with a greater range in the coefficient of kurtosis than the SHN model, in order to use it to model atypical data. This will allow us obtain a new model generated on the basis of a scale mixture between an HN and a Weibull (Wei) distribution.
The rest of the paper is organized as follows: Section 2 contains the representation of this model and we generate the density of the new family, its basic properties and moments, and its coefficients of asymmetry and kurtosis. In Section 3 we make inferences using the moments and maximum likelihood (ML) methods. In Section 4 we implement the expectation–maximization (EM) algorithm. In Section 5 we carry out a simulation study for parameter recovery. We show three illustrations in real datasets in Section 6 and finally in Section 7 we present our conclusions.

2. An Asymmetric Distribution

In this section we introduce the representation, its pdf, and some important properties and graphs to show the flexibility of the new model.

2.1. New Distribution

The representation of this new distribution is
T = X Y 1 / q ,
where X H N ( σ ) and Y E x p ( 2 ) are independent, σ > 0 , q > 0 . We call the distribution of T the modified slashed half-normal (MSHN) distribution. This is denoted by T M S H N ( σ , q ) .

2.2. Density Function

The following Proposition shows the pdf of the MSHN distribution with scale parameter σ and kurtosis parameter q, generated using the representation given in (3).
Proposition 1.
Let T M S H N ( σ , q ) . Then, the pdf of T is given by
f T ( t ; σ , q ) = 2 q 2 π σ 2 t q + 1 N q + 1 2 , 2 t q , q 2 , 1 2 σ 2 ,
where t > 0 , σ > 0 , q > 0 , and N ( · , · , · , · ) is defined in Lemma 1 in the Appendix A.
Proof. 
Using the stochastic representation given in (3) and the Jacobian method, we obtain that the density function associated with T is given by
f T ( t ; σ , q ) = 4 q 2 π σ 2 0 w q exp t 2 w 2 2 σ 2 + 2 w q d w .
Making the change of variable u = t 2 w 2 we have,
f T ( t ; σ , q ) = 2 q 2 π σ 2 t q + 1 0 u q 1 2 exp u 2 σ 2 + 2 u q / 2 t q d u .
Hence, applying the Lemma 1 as set forth in the Appendix A, we obtain the result. □
Figure 1 depicts plots of the density of the MSHN distribution for different values of parameter q.
We perform a brief comparison illustrating that the tails of the MSHN distribution are heavier than those of the SHN distribution.
Table 1 shows the tail probability for different values in the SHN and MSHN models. It is immediately apparent that the MSHN tails are heavier than those of the SHN distribution.

2.3. Properties

In this sub-section we study some properties of the MSHN distribution.
Proposition 2.
Let T M S H N ( σ , q ) , then when σ = q = 1 the density is
f T ( t ) = 4 t 2 1 2 π 2 t exp ( 2 / t 2 ) Φ 2 t , t > 0 ,
where Φ ( · ) is the cdf of the standard normal.
Proof. 
Using Proposition 1 for σ = q = 1 , we have,
f T ( t ) = 2 2 π t 2 N 1 , 2 z , 1 2 , 1 2 = 2 2 π t 2 0 exp 2 z x 1 / 2 1 2 σ 2 x d x , t > 0 .
Changing the variable x = u 2 we obtain the result. □
Proposition 3.
If T | W = w H N σ w and Y 1 / q = W W e i ( q , 1 / 2 ) then T M S H N ( σ , q ) .
Proof. 
Since the marginal pdf of T is given by
f T ( t ; σ , q ) = 0 f T | W ( t | w ) f W ( w ) d w = 4 q σ 2 π 0 w q e w 2 t 2 2 σ 2 2 w q d w ,
and using the Lemma 1 in the Appendix A, the result is obtained. □
Proposition 4.
Let T M S H N ( σ , q ) . If q then T converges in law to a random variable T H N ( σ ) .
Proof. 
Let T M S H N ( σ , q ) and T = X Y 1 / q , where X H N ( σ ) and Y E x p ( 2 ) .
We study the convergence in law of T, since Y E x p ( 2 ) then W = Y 1 / q W e i ( q , 1 / 2 ) , we have that E W 1 2 = 1 2 2 / q Γ ( 1 + 2 / q ) 2 2 1 / q Γ ( 1 + 1 / q ) + 1 . If q then E W 1 2 0 , i.e., we have W P 1 (see Lehmann [9]).
Since T M S H N ( σ , q ) , by applying Slutsky’s Lemma (see Lehmann [9]) to T = X W , we have
T L X H N ( σ ) , q ,
that is, for increasing values of q, T converges in law to a H N ( σ ) distribution. □
Remark 1.
Proposition 2 shows us that the M S H N ( 1 , 1 ) distribution has a closed-form expression. Proposition 3 shows that an M S H N ( σ , q ) distribution can also be obtained as a scale mixture of an HN and a Wei distribution. This property is very important since it makes it possible to generate random numbers and implement the EM algorithm. Proposition 4 implies that, if q then the cdf of an M S H N ( σ , q ) model approaches to the cdf of a H N ( σ ) distribution.

2.4. Moments

In this sub-section, the following proposition shows the computation of the moments of a random variable T M S H N ( σ , q ) . Hence, it also displays the coefficients of asymmetry and kurtosis.
Proposition 5.
Let T MSHN ( σ , q ) . Then the r-th moment of T is given by
μ r = E ( T r ) = 2 r 1 q + 1 2 π σ r Γ r + 1 2 Γ q r q , q > r ,
where Γ ( · ) denotes the gamma function.
Proof. 
Let W W e i ( q , 1 / 2 ) and using Proposition 3, we have
μ r = E ( T r ) = E E ( X r | W r ) = E 2 r π Γ r + 1 2 σ r W r = 2 r π Γ r + 1 2 σ r E W r ,
where E ( W r ) = 2 r / q Γ ( q r ) / q , q > r is the r-th moment of the inverse Weibull distribution. □
Corollary 1.
Let T MSHN ( σ , q ) . Then the expectation and variance of T are given respectively by
E ( T ) = 2 1 q + 1 2 π σ Γ q 1 q , q > 1 , and
V a r ( T ) = 2 2 q + 1 σ 2 1 2 Γ q 2 q 1 π Γ 2 q 1 q , q > 2 .
Corollary 2.
Let T MSHN ( σ , q ) . Then the coefficients of asymmetry ( β 1 ) and kurtosis ( β 2 ) are given by
β 1 = 1 π Γ q 3 q 3 2 π Γ q 1 q Γ q 2 q + 2 π 3 Γ 3 q 1 q 1 2 Γ q 2 q 1 π Γ 2 q 1 q 3 / 2 , q > 3 , and
β 2 = 3 4 Γ q 4 q 4 π Γ q 1 q Γ q 3 q + 3 π Γ 2 q 1 q Γ q 2 q 3 π 2 Γ 4 q 1 q 1 2 Γ q 2 q 1 π Γ 2 q 1 q 2 , q > 4 .
Remark 2.
Figure 2 shows graphs of the coefficients of the MSHN distribution compared with those of the SHN distribution. Note that the MSHN distribution presents higher asymmetry and kurtosis values than the SHN distribution. Furthermore, in both distributions when q the coefficients of asymmetry and kurtosis converge to 2 ( 4 π ) ( π 2 ) 3 / 2 and ( 3 π 2 4 π 12 ) ( π 2 ) 2 , respectively; they coincide with the coefficients of the HN distribution.

3. Inference

Proposition 6.
Let T 1 , , T n be a random sample of size n of the T MSHN ( σ , q ) distribution. Then for q > 2 , the moment estimators of σ and q are given by
σ ^ M = π T ¯ 2 1 q + 1 2 Γ q ^ M 1 q ^ M ,
π T ¯ 2 Γ q ^ M 2 q ^ M 2 T 2 ¯ Γ 2 q ^ M 1 q ^ M = 0 ,
where T ¯ is the mean of the sample and T 2 ¯ is the mean of the sample for the square of the observations.
Proof. 
From Proposition 5, and considering the first two equations in the moments method, we have
T ¯ = 2 1 q + 1 2 π σ Γ q 1 q and T 2 ¯ = 2 2 q σ 2 Γ q 2 q .
Solving the first equation above for σ we obtain σ ^ M given in (9). Substituting σ ^ M in the second equation above, we obtain the result given in (10). □

4. Em Algorithm

The EM algorithm (Dempster et al. [10]) is a useful method for ML estimation in the presence of latent variables.
To facilitate the estimation process, we introduce latent variables W 1 , , W n through the following hierarchical representation of the MSHN model:
T i W i = w i H N σ w and W i W e i ( q , 1 / 2 ) .
In this setting, we have that
f c ( w | t ) w q exp w 2 t 2 2 σ 2 + 2 w q .
Therefore, the complete log-likelihood function can be expressed as
l c ( θ | t c ) n log ( σ ) i = 1 n w i 2 t i 2 2 σ 2 + l c ( q | w c ) ,
where l c ( q | w c ) = n log ( q ) + q i = 1 n log ( w i ) 2 i = 1 n w i q .
Letting w ^ i = E [ W i | t i , θ = θ ^ ] , it follows that the conditional expectation of the complete log-likelihood function has the form
Q ( θ | θ ^ ) n log ( σ ) i = 1 n w ^ i 2 t i 2 2 σ 2 + Q ( q | θ ^ ) ,
where Q ( q | θ ^ ) = n log ( q ) + q S 1 n 2 S 2 n , q , with S 1 n = i = 1 n E [ log ( W i ) | t i ] and S 2 n , q = i = 1 n E [ W i q | t i ] .
As both quantities S 1 n and S 2 n , q have no explicit forms in the context of the MSHN model, they have to be computed numerically. Thus to compute Q ( q | θ ^ ) we use an approach similar to that of Lee and Xu ([11], Section 3.1), i.e., considering { w r ; r = 1 , , R } to be a random sample from the conditional distribution W | ( T = t , θ = θ ^ ) , then Q ( q | θ ^ ) can be approximated as
Q ( q | θ ^ ) 1 R r = 1 R c ( q | w r ) .
Therefore, the EM algorithm for the MSHN model is given by
  • E-step: Given θ = θ ^ ( k ) = ( σ ^ ( k ) , q ^ ( k ) ) , calculate w i ^ ( k ) , for i = 1 , , n .
  • CM-step I: Update σ ^ ( k )
    σ ^ 2 ( k + 1 ) = S u ( k ) 2 ,
  • CM-step II: Fix α = σ ^ ( k + 1 ) , update q ( k ) by optimizing q ^ ( k + 1 ) = arg max q Q ( σ ^ ( k + 1 ) , q | θ ^ ( k ) ) , where S u ( k ) = 1 n i = 1 n w ^ i ( k ) t i .
The E, CM-I and CM-II steps are repeated until a convergence rule is satisfied, say | l ( θ ^ ( k + 1 ) ) l ( θ ^ ( k ) ) | is sufficiently small. Finally, standard errors (SE) can be estimated using the inverse of the observed information matrix.
Remark 3.
i. 
For q , σ ^ in M-step reduces to those obtained when the HN distribution is used;
ii. 
An alternative to the CM-Steps II is obtained considering the idea in Lin et al. ([12], Section 3), by using the following estimation:
CML-step: Update q ( k ) by maximizing the constrained actual log-likelihood function, i.e.,
q ^ ( k + 1 ) = arg max q ( σ ^ ( k + 1 ) , q ) .

5. Simulation

We present a simulation study to assess the performance of the EM algorithm for the parameters σ and q in the MSHN model. We consider 1000 samples of three sample sizes generated from the MSHN model: n = 30 , 50 and 100. To generate T M S H N ( σ ; q ) the following algorithm was used:
  • Simulate X N ( 0 , σ 2 ) and Y E x p ( 2 ) .
  • Compute T = | X | Y 1 / q .
For each sample generated, the ML estimates were computed using the EM algorithm. Table 2 shows the mean of the bias estimated for each parameter (bias), its SE and the estimated root of the mean squared error (RMSE). From Table 2, we conclude that the ML estimates are quite stable. The bias is reasonable and diminishes as the sample size is increased. As expected, the terms SE and RMSE are closer when the sample size is increased, suggesting that the SE of the estimators is well estimated.

6. Aplications

In this section we provide three applications to real datasets that illustrate the flexibility of the proposed model.

6.1. Application 1

Lyu [13] presents a data set related 104 times with programming in the Centre for Software Reliability (CSR). Some descriptive statistics are: mean = 147.8, variance = 60,071.7, skewness = 3, and kurtosis = 14.6. The moment estimators for the MSHN model were σ ^ M = 74.085 and q ^ M = 2.402 , which were used as initial values to compute the M L estimator in Table 3.
For each distribution we report the estimated log-likelihood. To compare the competing models, we consider the Akaike information criterion (AIC) (Akaike [14]) and the Bayesian information criterion (BIC) (Schwarz [15]), which are defined as AIC = 2 k 2 log lik and BIC = k log ( n ) 2 log lik , respectively, where k is the number of parameters in the model, n is the sample size and log lik is the maximum value for the log-likelihood function. Table 4 displays the AIC and BIC for each model fitted. Figure 3 presents the histogram of the data fitted with the HN, SHN and MSHN distributions, provided with the ML estimations. The QQ-plot for the MSHN and SHN distributions are presented in Figure 4.

6.2. Application 2

The second dataset is taken from Von Alven [16], and represents 46 instances of active repairs (in hours) for an airborne communication transceiver. Some descriptive statistics are: mean = 3.607, variance = 24.445, skewness = 2.888, and kurtosis = 11.802.
Initially we computed the moment estimators for the MSHN distribution, obtaining the following estimations: σ ^ M = 2.407 and q ^ M = 2.635 . We used these estimations as initial values in computing the ML estimators presented in Table 5. For each distribution we report the estimated log-likelihood.
Table 6 displays the AIC and BIC for each model fitted. Figure 5 presents the histogram of the data fitted with the HN, SHN and MSHN distributions, provided with the ML estimations.

6.3. Application 3

The third data set (Linhart and Zucchini [17]) represents 31 times of air conditioning system failure of an aeroplane. Some descriptive statistics are: mean = 55.35, variance = 5132.503, skewness = 1.805, and kurtosis = 5.293.
Initially we computed the moment estimators for the MSHN distribution, and obtained the following estimations: σ ^ M = 38.125 and q ^ M = 2.743 . We used these estimations as initial values in computing the ML estimators presented in Table 7. For each distribution we report the estimated log-likelihood.
Table 8 displays the AIC and BIC for each model fitted. Figure 6 presents the histogram of the data fitted with the HN, SHN and MSHN distributions, provided with the ML estimations.

7. Conclusions

In this paper, we have introduced a new and more flexible model, as it increases kurtosis and contains, as a particular case, the HN distribution. The EM algorithm is implemented, obtaining acceptable results for the maximum likelihood estimators. In applications using real data it performs very well, better than competing models. Some further characteristics of the MSHN distribution are:
  • The MSHN distribution has a greater kurtosis than the SHN distribution, as is clearly reflected in Table 1.
  • The proposed model has a closed-form expression and presents more flexible asymmetry and kurtosis coefficients than that of the HN model.
  • Two stochastic representations for the MSHN model are presented. One is defined as the quotient between two independent random variables: An HN in the numerator and Exp(2) in the denominator. The other shows that the MSHN distribution is a scale mixture of an HN and a Wei distribution.
  • Using the mixed scale representation, the EM algorithm was implemented to calculate the ML estimators.
  • Results from a simulation study indicate that with a reasonable sample size, an acceptable bias is obtained.
  • Three illustrations using real data show that the MSHN model achieves a better fit in terms of the AIC and BIC criteria.

Author Contributions

N.M.O., O.V., Y.M.G. and Y.A.I. contributed significantly to this research article.

Funding

The research of Neveka M. Olmos and Yuri A. Iriarte was supported by SEMILLERO UA-2019. The research of Yolanda M. Gómez was supported by proyecto DIUDA programa de inserción No. 22367 of the Universidad de Atacama.

Acknowledgments

The authors would like to thank the editor and the anonymous referees for their comments and suggestions, which significantly improved our manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Density function of the gamma, exponential and Weibull distributions, respectively, are given by
  • Gamma distribution:
    f ( x ; α , β ) = β α Γ ( α ) x α 1 e β x ,
    with, x > 0 , α > 0 and β > 0 .
  • Exponential distribution:
    f ( x ; β ) = 1 β e x / β ,
    with, x > 0 and β > 0 .
  • Weibull distribution:
    f ( x ; γ , β ) = γ β x γ 1 e x γ / β ,
    with, x > 0 , γ > 0 and β > 0 .
In the following, Lemma presents an important result used in the derivation of the pdf for the MSHN distribution.
Lemma A1.
Prudnikov et al. [18], Equation (2.3.1.13) For γ > 0 , a > 0 , r > 0 and s > 0 . Then
0 x γ 1 exp ( a x r s x ) d x = N ( γ , a , r , s ) ,
where
N = j = 0 q 1 ( a ) j j ! s γ + r j Γ ( γ + r j ) p + 1 F q ( 1 , Δ ( p , γ + r j ) ; Δ ( q , 1 + j ) ; ( 1 ) q z ) , i f   0 < r < 1 h = 0 p 1 ( s ) h r h ! a ( γ + h ) / r Γ ( γ + h r ) q + 1 F p ( 1 , Δ ( q , γ + h r ) ; Δ ( p , 1 + h ) ; ( 1 ) p z ) , i f   r > 1 Γ ( γ ) ( a + s ) γ , i f   r = 1 ,
Considering γ = p / q , p 1 and q 1 are coprime integers, where z = ( p s ) p ( a q ) q , Δ ( k , a ) = a k , ( a + 1 ) k , , ( a + k 1 ) k and p F q ( . , . , . ) is the generalized hypergeometric function defined by
p F q ( a 1 , , a p ; b 1 , , b q ; x ) = k = 0 ( a 1 ) k ( a 2 ) k ( a p ) k x k ( b 1 ) k ( b 2 ) k ( b p ) k k !
where ( c ) k = c ( c + 1 ) ( c + k 1 ) .

References

  1. Cooray, K.; Ananda, M.M.A. A Generalization of the Half-Normal Distribution with Applications to Lifetime Data. Commun. Stat. Theory Methods 2008, 37, 1323–1337. [Google Scholar] [CrossRef]
  2. Cordeiro, G.M.; Pescim, R.R.; Ortega, E.M.M. The Kumaraswamy Generalized Half-Normal Distribution for Skewed Positive Data. J. Data Sci. 2012, 10, 195–224. [Google Scholar]
  3. Gómez, Y.M.; Bolfarine, H. Likelihood-based inference for power half-normal distribution. J. Stat. Theory Appl. 2015, 14, 383–398. [Google Scholar] [CrossRef]
  4. Gómez, Y.M.; Vidal, I. A generalization of the half-normal distribution. Appl. Math. J. Chin. Univ. 2016, 31, 409–424. [Google Scholar] [CrossRef]
  5. Olmos, N.M.; Varela, H.; Gómez, H.W.; Bolfarine, H. An extension of the half-normal distribution. Stat. Pap. 2012, 53, 875–886. [Google Scholar] [CrossRef]
  6. Rogers, W.H.; Tukey, J.W. Understanding some long-tailed symmetrical distributions. Stat. Neerl. 1972, 26, 211–226. [Google Scholar] [CrossRef]
  7. Mosteller, F.; Tukey, J.W. Data Analysis And Regression; Addison-Wesley: Reading, MA, USA, 1977. [Google Scholar]
  8. Reyes, J.; Gómez, H.W.; Bolfarine, H. Modified slash distribution. Statistics 2013, 47, 929–941. [Google Scholar] [CrossRef]
  9. Lehmann, E.L. Elements of Large-Sample Theory; Springer: New York, NY, USA, 1999. [Google Scholar]
  10. Dempster, A.P.; Laird, N.M.; Rubin, D.B. Maximum likelihood from incomplete data via the EM algorithm. J. R. Statist. Soc. Ser. B 1977, 39, 1–38. [Google Scholar] [CrossRef]
  11. Lee, S.Y.; Xu, L. Influence analyses of nonlinear mixed-effects models. Comput. Stat. Data Anal. 2004, 45, 321–341. [Google Scholar] [CrossRef]
  12. Lin, T.I.; Lee, J.C.; Yen, S.Y. Finite mixture modeling using the skew-normal distribution. Stat. Sin. 2007, 17, 909–927. [Google Scholar]
  13. Lyu, M. Handbook of Software Reliability Engineering; IEEE Computer Society Press: Washington, DC, USA, 1996. [Google Scholar]
  14. Akaike, H. A new look at the statistical model identification. IEEE Trans. Auto. Contr. 1974, 19, 716–723. [Google Scholar] [CrossRef]
  15. Schwarz, G. Estimating the dimension of a model. Ann. Stat. 1978, 6, 461–464. [Google Scholar] [CrossRef]
  16. Von Alven, W.H. Reliability Engineering by ARINC; Prentice-Hall, Inc.: Upper Saddle River, NJ, USA, 1964. [Google Scholar]
  17. Linhart, H.; Zucchini, W. Model Selection; Wiley Series in Probability and Statistics; Wiley: Hoboken, NJ, USA, 1986. [Google Scholar]
  18. Prudnikov, A.P.; Brychkov, Y.A.; Marichev, O.I. Integrals and Series; Gordon & Breach Science Publishers: Amsterdam, The Netherlands, 1986. [Google Scholar]
Figure 1. The density function for different values of parameter q and σ = 1 in the MSHN distribution.
Figure 1. The density function for different values of parameter q and σ = 1 in the MSHN distribution.
Symmetry 11 01150 g001
Figure 2. Graph of the coefficients of asymmetry and kurtosis for the MSHN and SHN distributions.
Figure 2. Graph of the coefficients of asymmetry and kurtosis for the MSHN and SHN distributions.
Symmetry 11 01150 g002
Figure 3. Histogram fitted with the HN, SHN and MSHN distributions provided with the ML estimations.
Figure 3. Histogram fitted with the HN, SHN and MSHN distributions provided with the ML estimations.
Symmetry 11 01150 g003
Figure 4. QQ plots: (a) MSHN distribution and (b) SHN distribution.
Figure 4. QQ plots: (a) MSHN distribution and (b) SHN distribution.
Symmetry 11 01150 g004
Figure 5. Histogram fitted with the HN, SHN and MSHN distributions provided with the ML estimations.
Figure 5. Histogram fitted with the HN, SHN and MSHN distributions provided with the ML estimations.
Symmetry 11 01150 g005
Figure 6. Histogram fitted with the HN, SHN and MSHN distributions provided with the ML estimations.
Figure 6. Histogram fitted with the HN, SHN and MSHN distributions provided with the ML estimations.
Symmetry 11 01150 g006
Table 1. Tails comparison for different slashed half-normal (SHN) and modified slashed half-normal (MSHN) distributions.
Table 1. Tails comparison for different slashed half-normal (SHN) and modified slashed half-normal (MSHN) distributions.
Distribution P ( T > 3 ) P ( T > 4 ) P ( T > 5 ) P ( T > 6 ) P ( T > 7 )
SHN(1, 0.5)0.37810.34970.32390.30090.2805
MSHN(1, 0.5)0.53040.482890.44660.41760.3936
SHN(1, 1)0.17770.15700.13850.12240.1086
MSHN(1, 1)0.36780.29920.25190.21730.19102
SHN(1, 3)0.03500.02050.01200.00440.0034
MSHN(1, 3)0.09010.04380.02380.01420.0091
Table 2. Maximum likelihood (ML) estimations for parameters σ and q of the MSHN distribution. Standard error (SE), root of the mean squared error (RMSE).
Table 2. Maximum likelihood (ML) estimations for parameters σ and q of the MSHN distribution. Standard error (SE), root of the mean squared error (RMSE).
True Value n = 30 n = 50 n = 100
σ qEstimatorBiasSERMSEBiasSERMSEBiasSERMSE
11 σ 0.1780.4300.5280.1220.3200.3780.0890.2060.279
q0.1990.4220.6680.0970.2190.2630.0590.1380.163
2 σ 0.1110.3550.4070.0780.2580.2950.0420.1720.186
q1.0062.5002.6030.4801.1051.5190.1820.4580.562
5 σ 0.0260.2770.2390.0330.2220.1890.0230.1590.149
q2.2278.8333.8712.0126.7433.5501.3334.0922.905
21 σ 0.2840.8350.9730.1920.6170.6650.1040.4140.481
q0.1680.3560.5710.0940.2150.2630.0580.1410.166
2 σ 0.2940.7270.8150.1220.5070.5720.0740.3430.383
q1.2102.8212.8350.4651.0671.5340.1740.4540.623
5 σ 0.0570.5440.4540.0660.4410.3710.0440.3050.290
q2.4568.9913.9342.0896.7123.6151.5454.1503.075
51 σ 0.8342.1112.5480.4941.5271.8260.3861.0381.233
q0.2170.4140.7400.1190.2250.2870.0830.1440.174
2 σ 0.6581.7822.0650.2931.2851.4750.2090.8720.966
q1.2182.8722.8360.4131.0181.4140.1880.4890.694
5 σ 0.0941.3791.1600.1461.0960.9500.1230.7790.731
q2.2668.8943.8801.9526.5263.5571.3704.1182.948
Table 3. ML estimations with the corresponding SE for the models fitted. Half-normal (HN).
Table 3. ML estimations with the corresponding SE for the models fitted. Half-normal (HN).
ParametersHN (SE)SHN (SE)MSHN (SE)
σ ^ 285.191 (19.774)20.977 (5.674)19.874 (4.867)
q ^ -0.687 (0.118)0.872 (0.115)
Log-likelihood−663.411−605.102600.876
Table 4. The Akaike information criterion (AIC) and the Bayesian information criterion (BIC) for each model fitted.
Table 4. The Akaike information criterion (AIC) and the Bayesian information criterion (BIC) for each model fitted.
CriterionHNSHNMSHN
AIC1328.8221214.2041205.752
BIC1331.4661219.4931211.041
Table 5. ML estimations with the corresponding SE for the models fitted.
Table 5. ML estimations with the corresponding SE for the models fitted.
ParametersHN (SE)SHN (SE)MSHN (SE)
σ ^ 6.07 (0.6335)1.6251 (0.4777)1.5108 (0.3179)
q ^ -1.3539 (0.4347)1.6365 (0.3425)
Log-likelihood−116.3881−103.1834102.65
Table 6. AIC and BIC for each model fitted.
Table 6. AIC and BIC for each model fitted.
CriterionHNSHNMSHN
AIC234.7762210.3668209.302
BIC236.6048214.0241212.9573
Table 7. ML estimations with the corresponding SE for the models fitted.
Table 7. ML estimations with the corresponding SE for the models fitted.
ParametersHN (SE)SHN (SE)MSHN (SE)
σ ^ 89.616 (11.381)13.785 (6.047)16.148(5.128)
q ^ -0.859 (0.285)1.233 (0.251)
Log-likelihood−161.861−154.857−153.954
Table 8. AIC and BIC for each model fitted.
Table 8. AIC and BIC for each model fitted.
CriterionHNSHNMSHN
AIC325.7224313.715311.908
BIC327.1564316.583314.776

Share and Cite

MDPI and ACS Style

Olmos, N.M.; Venegas, O.; Gómez, Y.M.; Iriarte, Y.A. An Asymmetric Distribution with Heavy Tails and Its Expectation–Maximization (EM) Algorithm Implementation. Symmetry 2019, 11, 1150. https://doi.org/10.3390/sym11091150

AMA Style

Olmos NM, Venegas O, Gómez YM, Iriarte YA. An Asymmetric Distribution with Heavy Tails and Its Expectation–Maximization (EM) Algorithm Implementation. Symmetry. 2019; 11(9):1150. https://doi.org/10.3390/sym11091150

Chicago/Turabian Style

Olmos, Neveka M., Osvaldo Venegas, Yolanda M. Gómez, and Yuri A. Iriarte. 2019. "An Asymmetric Distribution with Heavy Tails and Its Expectation–Maximization (EM) Algorithm Implementation" Symmetry 11, no. 9: 1150. https://doi.org/10.3390/sym11091150

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop