Article

Generalized Truncation Positive Normal Distribution

by Héctor J. Gómez 1,*, Diego I. Gallardo 2 and Osvaldo Venegas 1
1 Departamento de Ciencias Matemáticas y Físicas, Facultad de Ingeniería, Universidad Católica de Temuco, Temuco 4780000, Chile
2 Departamento de Matemática, Facultad de Ingeniería, Universidad de Atacama, Copiapó 1530000, Chile
* Author to whom correspondence should be addressed.
Symmetry 2019, 11(11), 1361; https://doi.org/10.3390/sym11111361
Submission received: 27 September 2019 / Revised: 25 October 2019 / Accepted: 29 October 2019 / Published: 3 November 2019

Abstract

In this article we study the properties, inference and statistical applications of a parametric generalization of the truncation positive normal distribution, introducing a new parameter so as to increase the flexibility of the new model. For certain combinations of the parameters, the model includes both symmetric and asymmetric shapes. We study the model's basic properties, maximum likelihood estimators and Fisher information matrix. Finally, we apply it to two real data sets to show the model's good performance compared to other models with positive support: the first related to heights measured along the drum of a roller and the second to daily cholesterol consumption.

1. Introduction

The half-normal (HN) distribution is a very important model in the statistical literature. Its density function has a closed form and its cumulative distribution function (cdf) depends on the cdf of the standard normal model (or the error function), which is implemented in practically all mathematical and statistical software. Pewsey [1,2] provides maximum likelihood (ML) estimation for the general location-scale HN distribution and its asymptotic properties. Wiper et al. [3] and Khan and Islam [4] carry out analyses and applications of the HN model within a Bayesian framework. Moral et al. [5] present the hnp R package, which produces half-normal plots with simulated envelopes using diagnostics from a range of fitted models. The HN model also appears in the stochastic representation of the skew-normal distribution in Azzalini [6,7] and Henze [8]. In recent years this distribution has been used to model positive data, and it is becoming an important model in reliability theory despite the fact that it accommodates only increasing hazard rates. Some generalizations of this distribution can be found in Cooray and Ananda [9], Olmos et al. [10], Cordeiro et al. [11] and Gómez and Bolfarine [12], among others.
In particular, we focus on the extension of Cooray and Ananda [9]. The authors provided a motivation related to static fatigue life to consider the transformation $Z=\sigma Y^{1/\alpha}$, where $Y\sim HN(1)$. This model was named the generalized half-normal (GHN) distribution. An alternative way to extend the HN model was introduced by Gómez et al. [13], considering a normal distribution with mean μ and standard deviation σ, truncated to the interval (0, +∞), and considering the reparametrization λ = μ/σ. This model was named the truncated positive normal (TPN) distribution, with density function given by
$$f(z;\sigma,\lambda)=\frac{1}{\sigma\,\Phi(\lambda)}\,\phi\!\left(\frac{z}{\sigma}-\lambda\right),\quad z>0,\ \sigma>0,\ \lambda\in\mathbb{R},\tag{1}$$
where ϕ(·) and Φ(·) denote the density and cdf of the standard normal model, respectively. We use TPN(σ, λ) to refer to a random variable (r.v.) with density function as in Equation (1). Note that TPN(σ, 0) ≡ HN(σ).
In this work we consider an idea similar to that used in Cooray and Ananda [9] to extend the TPN model, including the transformation $Y=\sigma X^{1/\alpha}$, where $X\sim TPN(1,\lambda)$. We will refer to this distribution as the generalized truncation positive normal (GTPN) distribution.
The rest of the manuscript is organized as follows. Section 2 is devoted to the study of some important properties of the model, such as its moments, quantile and hazard functions and its entropy. In Section 3 we discuss inference and present the Fisher information matrix for the proposed model. Section 4 discusses model selection for the GTPN distribution against nested and non-nested alternatives. In Section 5 we carry out a simulation study of the properties of the ML estimators in finite samples. Section 6 presents two applications to real data sets to illustrate that the proposed model is competitive with other common models for positive data in the literature. Finally, in Section 7, we present some concluding remarks.

2. Model Properties

In this section we introduce the main properties of the GTPN model such as density, quantile and hazard functions, moments, among others.

2.1. Stochastic Representation and Particular Cases

As mentioned previously, we say that an r.v. Z has a GTPN(σ, λ, α) distribution if $Z=\sigma Y^{1/\alpha}$, where $Y\sim TPN(1,\lambda)$. By construction, the following models are particular cases of the GTPN distribution:
  • GTPN(σ, λ, α = 1) ≡ TPN(σ, λ).
  • GTPN(σ, λ = 0, α) ≡ GHN(σ, α).
  • GTPN(σ, λ = 0, α = 1) ≡ HN(σ).
Figure 1 summarizes the relationships among the GTPN and its particular cases. We highlight that λ = 0 and α = 1 lie within the parameter space (not on its boundary). Therefore, to decide between the GTPN and the TPN, GHN or HN distributions we can use classical hypothesis tests such as the likelihood ratio test (LRT), score test (ST) or gradient test (GT).

2.2. Density, Cdf and Hazard Functions

Proposition 1.
For $Z\sim GTPN(\sigma,\lambda,\alpha)$, the density function is given by
$$f(z;\sigma,\lambda,\alpha)=\frac{\alpha}{\sigma^{\alpha}\,\Phi(\lambda)}\,z^{\alpha-1}\,\phi\!\left(\left(\frac{z}{\sigma}\right)^{\alpha}-\lambda\right),\quad z\geq 0,\tag{2}$$
where σ, α > 0 and λ ∈ ℝ.
Proof. 
Considering the stochastic representation discussed in Section 2.1, we have that $z=g(y)=\sigma y^{1/\alpha}$ and
$$f_Z(z)=f_Y\!\left(g^{-1}(z)\right)\left|\frac{d\,g^{-1}(z)}{dz}\right|=\frac{1}{\Phi(\lambda)}\,\phi\!\left(g^{-1}(z)-\lambda\right)\frac{\alpha z^{\alpha-1}}{\sigma^{\alpha}}.$$
Replacing $g^{-1}(z)=(z/\sigma)^{\alpha}$, the result is obtained. □
Proposition 2.
For $Z\sim GTPN(\sigma,\lambda,\alpha)$, the cdf and hazard function are given by
$$F(z;\sigma,\lambda,\alpha)=\frac{\Phi\!\left(\left(\frac{z}{\sigma}\right)^{\alpha}-\lambda\right)+\Phi(\lambda)-1}{\Phi(\lambda)}\quad\text{and}\quad h(z)=\frac{f(z)}{1-F(z)}=\frac{\alpha\,z^{\alpha-1}\,\phi\!\left(\left(\frac{z}{\sigma}\right)^{\alpha}-\lambda\right)}{\sigma^{\alpha}\left[1-\Phi\!\left(\left(\frac{z}{\sigma}\right)^{\alpha}-\lambda\right)\right]},$$
respectively, for all z ≥ 0.
Figure 2 shows the density and hazard functions for the GTPN ( σ = 1 , λ , α ) model, considering some combinations for ( λ , α ) . Note that the GTPN model can assume decreasing and unimodal shapes for the density function and decreasing or increasing shapes for the hazard function.
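For illustration, the density, cdf and hazard functions above can be evaluated directly with standard software. The following Python/SciPy sketch is not part of the original paper; the function names and argument order (sigma, lam, alpha) are ours.

```python
# Sketch of the GTPN density, cdf and hazard functions (Propositions 1 and 2).
# Function names and parameterization (sigma, lam, alpha) are ours, not the paper's.
import numpy as np
from scipy.stats import norm

def gtpn_pdf(z, sigma, lam, alpha):
    z = np.asarray(z, dtype=float)
    return (alpha / (sigma**alpha * norm.cdf(lam))) * z**(alpha - 1) \
        * norm.pdf((z / sigma)**alpha - lam)

def gtpn_cdf(z, sigma, lam, alpha):
    z = np.asarray(z, dtype=float)
    return (norm.cdf((z / sigma)**alpha - lam) + norm.cdf(lam) - 1) / norm.cdf(lam)

def gtpn_hazard(z, sigma, lam, alpha):
    # h(z) = f(z) / (1 - F(z))
    return gtpn_pdf(z, sigma, lam, alpha) / (1.0 - gtpn_cdf(z, sigma, lam, alpha))

if __name__ == "__main__":
    z = np.linspace(0.01, 3.0, 5)
    print(gtpn_pdf(z, 1.0, 1.0, 2.0))
    print(gtpn_cdf(z, 1.0, 1.0, 2.0))
    print(gtpn_hazard(z, 1.0, 1.0, 2.0))
```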

2.3. Mode

Proposition 3.
The mode of the GTPN(σ, λ, α) model is attained:
1. at $z=\sigma\left[\tfrac{1}{2}\left(\lambda+\sqrt{\lambda^{2}+4\left(1-\tfrac{1}{\alpha}\right)}\right)\right]^{1/\alpha}$ whenever α ≥ 1, or 0 < α < 1 and λ > 0;
2. at z = 0 otherwise.
Proof. 
Let $l=\log(f)$, where f is the density function defined in (2). A direct computation gives
$$\frac{\partial l}{\partial z}=l_z=-\frac{1}{z}\left[\alpha\left(\frac{z}{\sigma}\right)^{2\alpha}-\alpha\lambda\left(\frac{z}{\sigma}\right)^{\alpha}-(\alpha-1)\right].$$
Note that $l_z$ vanishes when $\alpha\left(\frac{z}{\sigma}\right)^{2\alpha}-\alpha\lambda\left(\frac{z}{\sigma}\right)^{\alpha}-(\alpha-1)=0$. Using the auxiliary variable $w=\left(\frac{z}{\sigma}\right)^{\alpha}$, the last equation is rewritten as follows:
$$\alpha w^{2}-\alpha\lambda w-(\alpha-1)=0.\tag{3}$$
In the rest of the proof we use the discriminant of the quadratic equation in Equation (3), which is given by $\Delta=\alpha^{2}\lambda^{2}+4\alpha(\alpha-1)=\alpha^{2}\left[\lambda^{2}+4\left(1-\tfrac{1}{\alpha}\right)\right]$, and its zeros, which are given by $w=\left(\lambda\pm\sqrt{\lambda^{2}+4\left(1-\tfrac{1}{\alpha}\right)}\right)/2$.
If α ≥ 1 then Δ ≥ λ². In consequence, the mode is attained at $z_1=\sigma\left[\tfrac{1}{2}\left(\lambda+\sqrt{\lambda^{2}+4\left(1-\tfrac{1}{\alpha}\right)}\right)\right]^{1/\alpha}$.
If 0 < α < 1 then Δ < λ². Here, two cases may occur. The first is when 0 < Δ < λ²; in this case, if λ > 0 the mode is attained at $z=\sigma\left[\tfrac{1}{2}\left(\lambda+\sqrt{\lambda^{2}+4\left(1-\tfrac{1}{\alpha}\right)}\right)\right]^{1/\alpha}$, since $l_{zz}<0$, and if λ < 0, then the zeros of Equation (3) are negative, implying that the function l is strictly decreasing; its mode is therefore attained at zero. The other case is when Δ < 0; then $\alpha w^{2}-\alpha\lambda w-(\alpha-1)>0$ for all w ≥ 0, implying that $l_z<0$ for all z ≥ 0. Therefore, l is strictly decreasing and thus its mode is zero. □
Remark 1.
Note that α ≥ 1 or λ > 0 implies that the mode of the GTPN model is attained at a positive value.

2.4. Quantiles

Proposition 4.
The quantile function of the GTPN(σ, λ, α) distribution is given by
$$Q(p)=\sigma\left[\Phi^{-1}\bigl(1-(1-p)\,\Phi(\lambda)\bigr)+\lambda\right]^{1/\alpha}.$$
Proof. 
Follows from a direct computation, applying the definition of quantile function. □
Corollary 1.
The quartiles of the GTPN distribution are
1. First quartile: $\sigma\left[\Phi^{-1}\!\left(1-\tfrac{3}{4}\Phi(\lambda)\right)+\lambda\right]^{1/\alpha}$.
2. Median: $\sigma\left[\Phi^{-1}\!\left(1-\tfrac{\Phi(\lambda)}{2}\right)+\lambda\right]^{1/\alpha}$.
3. Third quartile: $\sigma\left[\Phi^{-1}\!\left(1-\tfrac{1}{4}\Phi(\lambda)\right)+\lambda\right]^{1/\alpha}$.

2.5. Moments

Proposition 5.
Let $Z\sim GTPN(\sigma,\lambda,\alpha)$ and $r=1,2,\ldots$. The r-th non-central moment is given by
$$\mu_r=E(Z^{r})=\frac{\left(\sigma\lambda^{1/\alpha}\right)^{r}\,b_r(\lambda,\alpha)}{2\,\Phi(\lambda)\sqrt{\pi}},$$
where $b_r(\lambda,\alpha)=\sum_{k=0}^{\infty}\binom{r/\alpha}{k}\left(\frac{\sqrt{2}}{\lambda}\right)^{k}\Gamma\!\left(\frac{k+1}{2},\frac{\lambda^{2}}{2}\right)$ and $\binom{r/\alpha}{k}=\frac{1}{k!}\prod_{n=0}^{k-1}\left(\frac{r}{\alpha}-n\right)$ is the generalized binomial coefficient. When $r/\alpha\in\mathbb{N}$, the sum in $b_r(\lambda,\alpha)$ stops at $r/\alpha$.
Proof. 
Considering the stochastic representation of the GTPN model in Section 2.1, it is immediate that $E(Z^{r})=\sigma^{r}E\!\left(Y^{r/\alpha}\right)$, where $Y\sim TPN(1,\lambda)$. This expected value can be computed using Proposition 2.2 in Gómez et al. [13]. □
Remark 2.
When $r/\alpha\in\mathbb{N}$, closed forms can be obtained for $\mu_r$.
Figure 3 illustrates the mean, variance, skewness, and kurtosis coefficients for the GTPN ( σ = 1 , λ , α ) model for some combinations of its parameters.
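In practice the moments can also be checked by one-dimensional numerical integration of the density in Proposition 1, which avoids the series form. The sketch below is our code, not the authors'; it computes the mean, variance, skewness and kurtosis for an illustrative parameter combination.

```python
# Numerical check of the GTPN moments: E(Z^r) is obtained by quadrature from the
# density of Proposition 1 (a sketch; function names and parameter values are ours).
import numpy as np
from scipy.integrate import quad
from scipy.stats import norm

def gtpn_pdf(z, sigma, lam, alpha):
    return (alpha / (sigma**alpha * norm.cdf(lam))) * z**(alpha - 1) \
        * norm.pdf((z / sigma)**alpha - lam)

def gtpn_moment(r, sigma, lam, alpha):
    val, _ = quad(lambda z: z**r * gtpn_pdf(z, sigma, lam, alpha), 0, np.inf)
    return val

sigma, lam, alpha = 1.0, 1.0, 1.5
m1, m2, m3, m4 = (gtpn_moment(r, sigma, lam, alpha) for r in (1, 2, 3, 4))
mean = m1
var = m2 - m1**2
skew = (m3 - 3*m1*m2 + 2*m1**3) / var**1.5
kurt = (m4 - 4*m1*m3 + 6*m1**2*m2 - 3*m1**4) / var**2
print(mean, var, skew, kurt)
```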

2.6. Bonferroni and Lorenz Curves

In this subsection we present the Bonferroni and Lorenz curves (see Bonferroni [14]). These curves have applications not only in economics to study income and poverty, but also in medicine, reliability, etc. The Bonferroni curve is defined as
$$B(p)=\frac{1}{p\,\mu}\int_{0}^{q}z\,f(z)\,dz,\quad 0<p<1,$$
where μ = E(Z) and q = F^{-1}(p). The Lorenz curve is obtained from the relation L(p) = pB(p). In particular, it can be checked that for the GTPN model the Bonferroni curve is given by
$$B(p)=\frac{1}{p\,\mu}\left[E(Z)-\frac{\sigma}{\Phi(\lambda)\sqrt{2\pi}}\sum_{k=0}^{\infty}\binom{1/\alpha}{k}\lambda^{\frac{1}{\alpha}-k}\,2^{\frac{k-1}{2}}\,\Gamma\!\left(\frac{k+1}{2},\frac{\left(\left(\frac{q}{\sigma}\right)^{\alpha}-\lambda\right)^{2}}{2}\right)\right].$$
These curves serve as graphical methods for the analysis and comparison of, for example, the inequality of non-negative distributions; see [15] for a more detailed discussion.
Figure 4 shows the Bonferroni curve for the GTPN ( σ = 1 , λ , α ) model, considering different values for λ and α .
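Both curves are easily evaluated numerically from the density and quantile function. The following sketch (our code, not the paper's) computes B(p) and L(p) = pB(p) by quadrature.

```python
# Numerical sketch of the Bonferroni and Lorenz curves for the GTPN model
# (our implementation; function names and example parameters are assumptions).
import numpy as np
from scipy.integrate import quad
from scipy.stats import norm

def gtpn_pdf(z, sigma, lam, alpha):
    return (alpha / (sigma**alpha * norm.cdf(lam))) * z**(alpha - 1) \
        * norm.pdf((z / sigma)**alpha - lam)

def gtpn_quantile(p, sigma, lam, alpha):
    return sigma * (norm.ppf(1 - (1 - p) * norm.cdf(lam)) + lam) ** (1 / alpha)

def bonferroni(p, sigma, lam, alpha):
    mu, _ = quad(lambda z: z * gtpn_pdf(z, sigma, lam, alpha), 0, np.inf)
    q = gtpn_quantile(p, sigma, lam, alpha)
    partial, _ = quad(lambda z: z * gtpn_pdf(z, sigma, lam, alpha), 0, q)
    return partial / (p * mu)

for p in (0.1, 0.25, 0.5, 0.75, 0.9):
    B = bonferroni(p, 1.0, 1.0, 2.0)
    print(p, B, p * B)   # B(p) and the Lorenz curve L(p) = p * B(p)
```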

2.7. Shannon Entropy

Shannon entropy (see Shannon [16]) measures the amount of uncertainty for a random variable. It is defined as:
$$S(Z)=-E\left[\log f(Z)\right].$$
Therefore, it can be checked that the Shannon entropy for the GTPN model is
$$S(Z)=\log\!\left(\frac{\sqrt{2\pi}\,\sigma^{\alpha}\,\Phi(\lambda)}{\alpha}\right)-(\alpha-1)\,E\!\left[\log(Z)\right]+\frac{\mu_{2\alpha}}{2\sigma^{2\alpha}}-\frac{\lambda\,\mu_{\alpha}}{\sigma^{\alpha}}+\frac{\lambda^{2}}{2},$$
where $E\!\left[\log(Z)\right]=\int_{0}^{\infty}\log(z)\,f(z;\sigma,\lambda,\alpha)\,dz$. Figure 5 shows the entropy curve for the GTPN(σ = 1.5, λ, α) model, considering different values for λ and α; we note that this function is increasing in λ and α. For α = 1, the Shannon entropy reduces to
$$S(Z)=\log\!\left(\sqrt{2\pi}\,\sigma\,\Phi(\lambda)\right)+\frac{\mu_{2}}{2\sigma^{2}}-\frac{\lambda\,\mu}{\sigma}+\frac{\lambda^{2}}{2},$$
which corresponds to the Shannon entropy for the TPN model; and for α = 1 and λ = 0 , S ( Z ) is reduced to
$$S(Z)=\log\!\left(\sqrt{\frac{\pi}{2}}\,\sigma\right)+\frac{\mu_{2}}{2\sigma^{2}},$$
which corresponds to the Shannon entropy for the HN distribution.
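The entropy can also be checked by numerical integration of −f log f. The short sketch below is our own code under the same illustrative parameter values used above.

```python
# Numerical evaluation of the Shannon entropy S(Z) = -E[log f(Z)] by quadrature
# (a sketch; function names and the example parameters are ours).
import numpy as np
from scipy.integrate import quad
from scipy.stats import norm

def gtpn_pdf(z, sigma, lam, alpha):
    return (alpha / (sigma**alpha * norm.cdf(lam))) * z**(alpha - 1) \
        * norm.pdf((z / sigma)**alpha - lam)

def shannon_entropy(sigma, lam, alpha):
    def integrand(z):
        f = gtpn_pdf(z, sigma, lam, alpha)
        return -f * np.log(f) if f > 0 else 0.0  # guard against f = 0 in the tails
    val, _ = quad(integrand, 0, np.inf)
    return val

print(shannon_entropy(1.5, 1.0, 2.0))
```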

3. Inference

In this section we discuss the ML method for parameter estimation in the GTPN model.

3.1. Maximum Likelihood Estimators

For a random sample z 1 , , z n from the GTPN ( σ , λ , α ) model, the log-likelihood function for θ = ( σ , λ , α ) is given by
$$\ell(\theta)=n\log(\alpha)+(\alpha-1)\sum_{i=1}^{n}\log(z_i)-n\alpha\log(\sigma)-n\log\bigl(\Phi(\lambda)\bigr)-\frac{n}{2}\log(2\pi)-\frac{1}{2}\sum_{i=1}^{n}\left[\left(\frac{z_i}{\sigma}\right)^{\alpha}-\lambda\right]^{2}.$$
Therefore, the score vector assumes the form $S(\theta)=\left(S_\sigma(\theta),S_\lambda(\theta),S_\alpha(\theta)\right)^{\top}$, where
$$S_\sigma(\theta)=-\frac{n\alpha}{\sigma}+\sum_{i=1}^{n}\left[\left(\frac{z_i}{\sigma}\right)^{\alpha}-\lambda\right]\frac{\alpha\,z_i}{\sigma^{2}}\left(\frac{z_i}{\sigma}\right)^{\alpha-1},\qquad S_\lambda(\theta)=-n\,\xi(\lambda)+\sum_{i=1}^{n}\left[\left(\frac{z_i}{\sigma}\right)^{\alpha}-\lambda\right]$$
$$\text{and}\qquad S_\alpha(\theta)=\frac{n}{\alpha}+\sum_{i=1}^{n}\log z_i-n\log(\sigma)-\sum_{i=1}^{n}\left[\left(\frac{z_i}{\sigma}\right)^{\alpha}-\lambda\right]\left(\frac{z_i}{\sigma}\right)^{\alpha}\log\!\left(\frac{z_i}{\sigma}\right),$$
where $\xi(\lambda)=\phi(\lambda)/\Phi(\lambda)$ is the negative of the inverse Mills ratio. The ML estimators are obtained by solving the equation $S(\theta)=\mathbf{0}_3$, where $\mathbf{0}_p$ denotes a vector of zeros of dimension p. The component $S_\sigma(\theta)=0$ yields the following solution for λ:
$$\hat{\lambda}(\hat{\sigma},\hat{\alpha})=\frac{\sum_{i=1}^{n}\left(\frac{z_i}{\hat{\sigma}}\right)^{2\hat{\alpha}}-n}{\sum_{i=1}^{n}\left(\frac{z_i}{\hat{\sigma}}\right)^{\hat{\alpha}}}.\tag{4}$$
Replacing Equation (4) in $S_\lambda(\theta)=0$ and $S_\alpha(\theta)=0$, the problem is reduced to two equations, which must be solved by numerical methods such as Newton–Raphson. Below we discuss initial values for the vector θ to initialize the algorithm.
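Equivalently, the ML estimates can be obtained by direct numerical maximization of ℓ(θ). The sketch below is not part of the paper: the function names, the use of SciPy's L-BFGS-B optimizer and the simulated example are our own choices; only the log-likelihood expression follows the formula above.

```python
# Maximum likelihood fitting sketch for the GTPN model: the negative log-likelihood
# implements l(theta) above; the optimizer and starting point are our assumptions.
import numpy as np
from scipy.optimize import minimize
from scipy.stats import norm

def neg_loglik(theta, z):
    sigma, lam, alpha = theta
    if sigma <= 0 or alpha <= 0:
        return np.inf
    w = (z / sigma) ** alpha
    return -(len(z) * np.log(alpha) + (alpha - 1) * np.sum(np.log(z))
             - len(z) * alpha * np.log(sigma) - len(z) * np.log(norm.cdf(lam))
             - len(z) / 2 * np.log(2 * np.pi) - 0.5 * np.sum((w - lam) ** 2))

def fit_gtpn(z, start=None):
    if start is None:
        # "naive" starting point based on the HN sub-model (Section 3.2.1)
        start = (np.sqrt(np.mean(z ** 2)), 0.0, 1.0)
    res = minimize(neg_loglik, start, args=(z,), method="L-BFGS-B",
                   bounds=[(1e-6, None), (None, None), (1e-6, None)])
    return res.x, -res.fun   # estimates and maximized log-likelihood

# Example with data simulated by inverse transform (see the scheme in Section 5)
rng = np.random.default_rng(1)
u = rng.uniform(size=500)
sigma_true, lam_true, alpha_true = 1.0, 3.0, 2.0
y = norm.ppf(1 - (1 - u) * norm.cdf(lam_true)) + lam_true
z = sigma_true * y ** (1 / alpha_true)
theta_hat, loglik = fit_gtpn(z)
print(theta_hat, loglik)
```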

3.2. Initial Point to Obtain the Maximum Likelihood Estimators

In this subsection, we discuss the initial points for the iterative methods to find the ML estimators in the GTPN distribution.

3.2.1. A Naive Point Based on the HN Model

In Section 2 we discussed that GTPN(σ, λ = 0, α = 1) ≡ HN(σ). Based on this fact, and considering that the ML estimator for σ in the HN distribution has a closed form, we can consider as an initial point $\theta_{naive}=\left(\sqrt{\tfrac{1}{n}\sum_{i=1}^{n}z_i^{2}},\ 0,\ 1\right)$.

3.2.2. An Initial Point Based on Centiles

Let $q_t$, t = 1, …, 99, denote the t-th sample centile based on $z_1,\ldots,z_n$. An initial point for θ can be obtained by matching $q_u$, $q_{50}$ and $q_{100-u}$, with u ∈ {1, 2, …, 48, 49}, with their respective theoretical counterparts. Defining p = u/100, the equations obtained are
$$q_u=\sigma\left[\Phi^{-1}\bigl(1-(1-p)\,\Phi(\lambda)\bigr)+\lambda\right]^{1/\alpha},\quad q_{50}=\sigma\left[\Phi^{-1}\bigl(1-0.5\,\Phi(\lambda)\bigr)+\lambda\right]^{1/\alpha}\quad\text{and}\quad q_{100-u}=\sigma\left[\Phi^{-1}\bigl(1-p\,\Phi(\lambda)\bigr)+\lambda\right]^{1/\alpha}.$$
The solutions for σ and α are
$$\tilde{\sigma}=\tilde{\sigma}(\tilde{\lambda})=\frac{q_{100-u}}{\left[\Phi^{-1}\bigl(1-p\,\Phi(\tilde{\lambda})\bigr)+\tilde{\lambda}\right]^{1/\tilde{\alpha}(\tilde{\lambda})}}\quad\text{and}\quad\tilde{\alpha}=\tilde{\alpha}(\tilde{\lambda})=\frac{\log\!\left(\dfrac{\Phi^{-1}\bigl(1-(1-p)\,\Phi(\tilde{\lambda})\bigr)+\tilde{\lambda}}{\Phi^{-1}\bigl(1-p\,\Phi(\tilde{\lambda})\bigr)+\tilde{\lambda}}\right)}{\log\!\left(\dfrac{q_u}{q_{100-u}}\right)},$$
where $\tilde{\lambda}$ is obtained from the non-linear equation
$$\tilde{\sigma}(\tilde{\lambda})\left[\Phi^{-1}\bigl(1-0.5\,\Phi(\tilde{\lambda})\bigr)+\tilde{\lambda}\right]^{1/\tilde{\alpha}(\tilde{\lambda})}=q_{50}.$$
Therefore, the initial point based on this method is given by $\theta_{cent}=\left(\tilde{\sigma},\tilde{\lambda},\tilde{\alpha}\right)$.
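A minimal sketch of this centile-based starting point follows. It is our own implementation, not the authors' code: the choice u = 25, the bracketing interval for λ̃ and the use of SciPy's brentq root finder are assumptions, and the bracket must be adjusted if the median equation does not change sign on it.

```python
# Sketch of the centile-based starting point: sigma~ and alpha~ are written as
# functions of lambda~, and lambda~ solves the median equation (our assumptions:
# u = 25 and the root-search bracket).
import numpy as np
from scipy.optimize import brentq
from scipy.stats import norm

def centile_start(z, u=25):
    p = u / 100.0
    qu, q50, q100u = np.quantile(z, [p, 0.5, 1 - p])

    def alpha_of(lam):
        A = norm.ppf(1 - (1 - p) * norm.cdf(lam)) + lam
        B = norm.ppf(1 - p * norm.cdf(lam)) + lam
        return np.log(A / B) / np.log(qu / q100u)

    def sigma_of(lam):
        B = norm.ppf(1 - p * norm.cdf(lam)) + lam
        return q100u / B ** (1 / alpha_of(lam))

    def median_eq(lam):
        C = norm.ppf(1 - 0.5 * norm.cdf(lam)) + lam
        return sigma_of(lam) * C ** (1 / alpha_of(lam)) - q50

    lam0 = brentq(median_eq, 0.05, 20.0)   # bracket chosen for positive lambda
    return sigma_of(lam0), lam0, alpha_of(lam0)
```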

3.3. An Initial Point Based on the Method of Moments

A more robust initial point can be obtained using the method of moments. The equations to solve are $\mu_r=\overline{z^{r}}$, r = 1, 2, 3. The solution for σ is
$$\tilde{\sigma}=\tilde{\sigma}(\tilde{\lambda},\tilde{\alpha})=\frac{2\sqrt{\pi}\,\bar{z}\,\Phi(\tilde{\lambda})}{\tilde{\lambda}^{1/\tilde{\alpha}}\,b_1(\tilde{\lambda},\tilde{\alpha})}.$$
The solutions for λ and α (say $\tilde{\lambda}$ and $\tilde{\alpha}$, respectively) are obtained from the non-linear equations
$$\left(\tilde{\sigma}(\lambda,\alpha)\,\lambda^{1/\alpha}\right)^{2}\frac{b_2(\lambda,\alpha)}{2\,\Phi(\lambda)\sqrt{\pi}}=\overline{z^{2}}\quad\text{and}\quad\left(\tilde{\sigma}(\lambda,\alpha)\,\lambda^{1/\alpha}\right)^{3}\frac{b_3(\lambda,\alpha)}{2\,\Phi(\lambda)\sqrt{\pi}}=\overline{z^{3}}.$$
Therefore, the initial point based on this method is given by $\theta_{mom}=\left(\tilde{\sigma},\tilde{\lambda},\tilde{\alpha}\right)$.

3.4. Fisher Information Matrix

The Fisher information (FI) matrix for the GTPN distribution is given by $I_F(\theta)=\left(I_{ab}\right)$, with a, b ∈ {σ, λ, α}. Consider the notation
$$T_{jk}=T_{jk}(\lambda)=\int_{0}^{\infty}\left[\log(w)\right]^{k}\,w^{j}\,\phi(w-\lambda)\,dw,\quad j\in\{1,2\},\ k\in\{0,1,2\}.$$
Therefore,
$$I_{\sigma\sigma}=-\frac{\alpha}{\sigma^{2}}+\frac{\alpha(2\alpha+1)}{\sigma^{2}\,\Phi(\lambda)}T_{20}-\frac{\lambda\,\alpha(\alpha+1)}{\sigma^{2}\,\Phi(\lambda)}T_{10},\qquad I_{\sigma\lambda}=\frac{\alpha}{\sigma\,\Phi(\lambda)}T_{10},$$
$$I_{\sigma\alpha}=\frac{1}{\sigma}+\frac{1}{\sigma\,\Phi(\lambda)}\left[\lambda\left(T_{10}+T_{11}\right)-T_{20}-2T_{21}\right],\qquad I_{\lambda\lambda}=1-\lambda\,\xi(\lambda)-\xi^{2}(\lambda),$$
$$I_{\lambda\alpha}=-\frac{1}{\alpha\,\Phi(\lambda)}T_{11},\qquad I_{\alpha\alpha}=\frac{1}{\alpha^{2}}+\frac{2}{\alpha^{2}\,\Phi(\lambda)}T_{22}-\frac{\lambda}{\alpha^{2}\,\Phi(\lambda)}T_{12}.$$
We observe that $T_{jk}(\lambda)$ is a continuous function with $\lim_{|\lambda|\to+\infty}T_{jk}(\lambda)=0$, so that $T_{jk}(\lambda)<+\infty$. Note that for λ = 0 and α = 1 this matrix reduces to
$$FI(\sigma,0,1)=\begin{pmatrix}\dfrac{2}{\sigma^{2}} & \dfrac{1}{\sigma}\sqrt{\dfrac{2}{\pi}} & -\dfrac{4}{\sigma}\hat{T}_{21}\\[6pt] \cdot & 1-\dfrac{2}{\pi} & -2\hat{T}_{11}\\[6pt] \cdot & \cdot & 1+4\hat{T}_{22}\end{pmatrix},$$
where $\hat{T}_{jk}=T_{jk}(0)$. Additionally, we note that
$$\det\bigl(FI(\sigma,0,1)\bigr)=\frac{2}{\sigma^{2}}\left[\frac{(\pi-3)}{\pi}\left(1+4\hat{T}_{22}+4\hat{T}_{21}\right)-4\left(\hat{T}_{11}-\sqrt{\tfrac{2}{\pi}}\,\hat{T}_{21}\right)^{2}\right]\approx\frac{0.1021506}{\sigma^{2}}>0,\quad\sigma>0,$$
where det(·) denotes the determinant operator. Therefore, the FI matrix for the reduced model (HN) is invertible.
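The integrals $T_{jk}(\lambda)$ have no simple closed form, but they are one-dimensional and can be evaluated numerically. The sketch below is our own code; it computes $T_{jk}(\lambda)$ by quadrature and prints the $\hat{T}_{jk}=T_{jk}(0)$ values that appear in the reduced matrix FI(σ, 0, 1).

```python
# Numerical evaluation of the T_jk(lambda) integrals entering the Fisher
# information matrix (our code; the integration routine is SciPy's quad).
import numpy as np
from scipy.integrate import quad
from scipy.stats import norm

def T(j, k, lam):
    integrand = lambda w: (np.log(w) ** k) * (w ** j) * norm.pdf(w - lam)
    val, _ = quad(integrand, 0, np.inf)
    return val

for (j, k) in [(1, 0), (1, 1), (1, 2), (2, 0), (2, 1), (2, 2)]:
    print(f"T_{j}{k}(0) = {T(j, k, 0.0):.6f}")
```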

4. Model Discrimination

In this section we discuss some techniques to discriminate among the GTPN distribution and other models.

4.1. GTPN Versus Submodels

An interesting problem to solve is the discrimination between GTPN and the three submodels represented in Figure 1. In other words, we are interested in testing the following hypotheses:
  • $H_0^{(1)}:\alpha=1$ versus $H_1^{(1)}:\alpha\neq 1$ (TPN versus GTPN distribution).
  • $H_0^{(2)}:\lambda=0$ versus $H_1^{(2)}:\lambda\neq 0$ (GHN versus GTPN distribution).
  • $H_0^{(3)}:(\alpha,\lambda)=(1,0)$ versus $H_1^{(3)}:(\alpha,\lambda)\neq(1,0)$ (HN versus GTPN distribution).
The three hypotheses can be tested considering the LRT, ST, and GT. Below we present the statistics for the three tests considered and for the three hypotheses of interest.

4.1.1. Likelihood Ratio Test

The statistic for the LRT (say SLR) to test $H_0^{(j)}$, j = 1, 2, 3, is defined as
$$\mathrm{SLR}_j=2\left[\ell\bigl(\hat{\sigma},\hat{\lambda},\hat{\alpha}\bigr)-\ell\bigl(\hat{\sigma}_{0j},\hat{\lambda}_{0j},\hat{\alpha}_{0j}\bigr)\right],$$
where $\hat{\sigma}_{0j}$, $\hat{\lambda}_{0j}$ and $\hat{\alpha}_{0j}$ denote the ML estimators of σ, λ and α restricted to $H_0^{(j)}$, j = 1, 2, 3. Under $H_0^{(j)}$, j = 1, 2, $\mathrm{SLR}_j\sim\chi^2_{(1)}$, and under $H_0^{(3)}$, $\mathrm{SLR}_3\sim\chi^2_{(2)}$, where $\chi^2_{(p)}$ denotes the chi-squared distribution with p degrees of freedom. For $H_0^{(3)}$, we obtain
$$\hat{\sigma}_{03}=\sqrt{\frac{1}{n}\sum_{i=1}^{n}z_i^{2}},\quad\hat{\lambda}_{03}=0\quad\text{and}\quad\hat{\alpha}_{03}=1,$$
whereas to test $H_0^{(1)}$ and $H_0^{(2)}$ the ML estimators under the null hypotheses need to be computed numerically. However, in both cases the problem reduces to a one-dimensional maximization. For details see Cooray and Ananda [9] and Gómez et al. [13], respectively.
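As an illustration, testing $H_0^{(3)}$ with the LRT only requires the two maximized log-likelihoods. The minimal sketch below is our code (not the authors'); the GTPN log-likelihood is assumed to come from a fit such as the one sketched in Section 3.1.

```python
# Sketch of the likelihood ratio test of H0^(3) (HN versus GTPN): SLR_3 is
# referred to a chi-squared distribution with 2 degrees of freedom (our code).
import numpy as np
from scipy.stats import chi2, norm

def hn_loglik(z):
    # ML fit of the HN sub-model: sigma_hat = sqrt(mean(z^2)), lambda = 0, alpha = 1
    sigma0 = np.sqrt(np.mean(z ** 2))
    return np.sum(norm.logpdf(z / sigma0) + np.log(2.0) - np.log(sigma0))

def lrt_hn_vs_gtpn(z, gtpn_loglik):
    slr = 2.0 * (gtpn_loglik - hn_loglik(z))
    return slr, chi2.sf(slr, df=2)   # statistic and asymptotic p-value
```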

4.1.2. Score Test

The statistic for the ST (say SR) to test $H_0^{(j)}$, j = 1, 2, 3, is defined as
$$\mathrm{SR}_j=S\bigl(\hat{\theta}_{0j}\bigr)^{\top}\,I_F\bigl(\hat{\theta}_{0j}\bigr)^{-1}\,S\bigl(\hat{\theta}_{0j}\bigr),$$
where $\hat{\theta}_{0j}$ is the ML estimator under $H_0^{(j)}$. Under $H_0^{(j)}$, j = 1, 2, $\mathrm{SR}_j\sim\chi^2_{(1)}$ and under $H_0^{(3)}$, $\mathrm{SR}_3\sim\chi^2_{(2)}$.

4.1.3. Gradient Test

The statistic for the GT (say ST) to test $H_0^{(j)}$, j = 1, 2, 3, is defined as
$$\mathrm{ST}_j=S\bigl(\hat{\theta}_{0j}\bigr)^{\top}\bigl(\hat{\theta}-\hat{\theta}_{0j}\bigr).$$
Again, under $H_0^{(j)}$, j = 1, 2, $\mathrm{ST}_j\sim\chi^2_{(1)}$ and under $H_0^{(3)}$, $\mathrm{ST}_3\sim\chi^2_{(2)}$. After some algebraic manipulation, we obtain
$$\mathrm{ST}_1=\left(\hat{\alpha}-1\right)\left[n\left(1-\log\hat{\sigma}_{01}\right)+\sum_{i=1}^{n}\left(\log z_i-\left(\frac{z_i}{\hat{\sigma}_{01}}-\hat{\lambda}_{01}\right)\frac{z_i}{\hat{\sigma}_{01}}\log\frac{z_i}{\hat{\sigma}_{01}}\right)\right],$$
$$\mathrm{ST}_2=\hat{\lambda}\left[-n\sqrt{\frac{2}{\pi}}+\sum_{i=1}^{n}\left(\frac{z_i}{\hat{\sigma}_{02}}\right)^{\hat{\alpha}_{02}}\right],$$
$$\mathrm{ST}_3=\left(\hat{\alpha}-1\right)\left[n\left(1-\log\hat{\sigma}_{03}\right)+\sum_{i=1}^{n}\left(\log z_i-\left(\frac{z_i}{\hat{\sigma}_{03}}\right)^{2}\log\frac{z_i}{\hat{\sigma}_{03}}\right)\right]+\hat{\lambda}\left[-n\sqrt{\frac{2}{\pi}}+\sum_{i=1}^{n}\frac{z_i}{\hat{\sigma}_{03}}\right].$$

4.2. Non-Nested Models

The comparison of non-nested models can be performed based on the AIC criterion (Akaike [17]), where the model with the lowest AIC is preferred. However, in practice we may be facing a set of inappropriate models for a certain data set. For this reason, we also need to perform a goodness-of-fit validation. This can be performed, for instance, based on the quantile residuals (QR); for more details see Dunn and Smyth [18]. These residuals are defined as
$$QR_i=\Phi^{-1}\bigl(G(z_i;\hat{\psi})\bigr),\quad i=1,\ldots,n,$$
where G(·; ψ̂) is the cdf of the specified distribution evaluated at the estimator of ψ. If the model is correctly specified, these residuals are a random sample from the standard normal distribution. This can be assessed using, for instance, the Anderson–Darling (AD), Cramér–von Mises (CVM) and Shapiro–Wilk (SW) tests. A discussion of these tests can be found in Yazici and Yolacan [19].
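A minimal sketch of this residual check follows. It is our own code: the choice of SciPy's Shapiro–Wilk and Cramér–von Mises implementations is ours, and any fitted cdf with the signature used below can be plugged in.

```python
# Sketch of the quantile-residual check: QR_i = Phi^{-1}(G(z_i; psi_hat)) is
# computed from a fitted cdf and then tested for standard normality (our code).
import numpy as np
from scipy.stats import norm, shapiro, cramervonmises

def quantile_residuals(z, fitted_cdf):
    u = np.clip(fitted_cdf(z), 1e-10, 1 - 1e-10)  # guard against exact 0 or 1
    return norm.ppf(u)

def qr_normality_pvalues(z, fitted_cdf):
    qr = quantile_residuals(z, fitted_cdf)
    return {"SW": shapiro(qr).pvalue,
            "CVM": cramervonmises(qr, "norm").pvalue}
```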

5. Simulation

In this section we present a Monte Carlo (MC) simulation study in order to illustrate the behavior of the ML estimators. We consider four sample sizes: n = 50, 150, 300 and 1000; two values for σ: 1 and 2; two values for λ: 3 and 4; and three values for α: 0.8, 1 and 2. For each combination of n, σ, λ and α, we draw 10,000 samples of size n from the GTPN(σ, λ, α) model. To simulate a value from this distribution, we consider the following scheme:
  • Simulate X ∼ Uniform(0, 1).
  • Compute $Y=\Phi^{-1}\bigl(1+(X-1)\,\Phi(\lambda)\bigr)+\lambda$.
  • Compute $Z=\sigma Y^{1/\alpha}$.
For each sample generated, the ML estimators were computed numerically using the Newton–Raphson algorithm. Table 1 presents the means and standard deviations of the estimates in each case. Notice that the bias and the standard deviations are reduced as the sample size increases, suggesting that the ML estimators are consistent.
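A minimal implementation of the inverse-transform generator described above is sketched below; the function name and example parameter values are ours, not the authors'.

```python
# Sketch of the inverse-transform generator for the GTPN model:
# X ~ Uniform(0,1), Y = Phi^{-1}(1 + (X - 1) Phi(lambda)) + lambda, Z = sigma * Y^(1/alpha).
import numpy as np
from scipy.stats import norm

def rgtpn(n, sigma, lam, alpha, rng=None):
    rng = np.random.default_rng() if rng is None else rng
    x = rng.uniform(size=n)
    y = norm.ppf(1.0 + (x - 1.0) * norm.cdf(lam)) + lam
    return sigma * y ** (1.0 / alpha)

sample = rgtpn(10_000, sigma=1.0, lam=3.0, alpha=0.8, rng=np.random.default_rng(2019))
print(sample.mean(), sample.std())
```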

6. Applications

In this section we present two real data applications to illustrate the good performance of the GTPN model relative to other well-known models in the literature. For these comparisons we also consider the Weibull (WEI) and generalized Lindley (GL, Zakerzadeh and Dolati [20]) models. The density function of the Weibull distribution is given by
$$f(x;\sigma,\lambda)=\frac{\sigma}{\lambda}\,x^{\sigma-1}e^{-x^{\sigma}/\lambda},$$
with x > 0, σ > 0 and λ > 0, whereas the density of the GL model is given by
$$f(x;\alpha,\theta,\gamma)=\frac{\theta^{2}(\theta x)^{\alpha-1}(\alpha+\gamma x)\,e^{-\theta x}}{(\gamma+\theta)\,\Gamma(\alpha+1)},$$
with x > 0, θ > 0, α > 0 and γ > 0.

6.1. Application 1

The data set was taken from Laslett [21], and consisted of n = 115 heights measured at 1 micron intervals along the drum of a roller (i.e., parallel to the axis of the roller). This was part of an extensive study of the surface roughness of the rollers. A statistical summary of the data set is presented in Table 2.
Initially, we calculate the starting values for the GTPN distribution based on the centiles, the naive approach and the moments, which are θ_cent = (2.326, 2.741, 2.376), θ_naive = (3.557, 0, 1) and θ_mom = (3.075, 1.572, 3.203). We used these values as initial points in computing the ML estimators for the GTPN model. The results are presented in Table 3. Note the high estimated standard error for the γ parameter in the GL model. In addition, note that, based on the AIC and BIC [22] criteria, the GTPN model is preferred (among the fitted models) for this data set. Figure 6 shows the estimated density for each model in this data set, where the GTPN model appears to provide a better fit. Finally, Figure 7 presents the qq-plots of the QR for the same models and the p-values of the three normality tests discussed in Section 4.2. The results suggest that the GTPN model is an appropriate model for this data set, while the rest of the models are not.

6.2. Application 2

The data set investigated here was taken from Nierenberg et al. [23] and relates to a study of plasma retinol and beta-carotene levels in a sample of n = 315 subjects. More specifically, the response variable is the amount of cholesterol consumed per day (in milligrams). Descriptive statistics for the variable are provided in Table 4.
Initially, we calculate the starting values for the GTPN distribution based on the centiles, the naive approach and the moments, which are θ_cent = (2.326, 2.741, 2.376), θ_naive = (275.960, 0, 1) and θ_mom = (0.003, 86.024, 1.854). We used these values as initial points in computing the ML estimators for the GTPN model. Table 5 summarizes the fit for this data set. As in the last application, we note the high estimated standard error for the γ parameter in the GL model. Again, based on the AIC and BIC criteria, the preferred model is the GTPN. Figure 8 shows the estimated density for each model in the cholesterol data set; the GTPN model appears to provide a better fit. For this data set, we also present the hypothesis tests for the three particular models of the GTPN distribution discussed in Section 4.1. Specifically, for $H_0^{(1)}:\alpha=1$ versus $H_1^{(1)}:\alpha\neq 1$ we obtained p-values < 0.0001, 0.0157 and < 0.0001 for the SLR, SR and ST statistics, respectively; therefore, at the 5% significance level we prefer the GTPN over the TPN model. For $H_0^{(2)}:\lambda=0$ versus $H_1^{(2)}:\lambda\neq 0$ we obtained p-values < 0.0001, 0.0008 and < 0.0001 for the SLR, SR and ST statistics, respectively; hence, at the 5% significance level we prefer the GTPN over the GHN model. Finally, for $H_0^{(3)}:(\alpha,\lambda)=(1,0)$ versus $H_1^{(3)}:(\alpha,\lambda)\neq(1,0)$ we obtained p-values < 0.0001 for all three tests; therefore, at the 5% significance level we prefer the GTPN over the HN model. Figure 9 also presents the qq-plots of the QR for the fitted models and the p-values of the three normality tests. The results suggest that the GTPN model is appropriate for this data set, while the rest of the models are not.

7. Conclusions

In this work we have introduced a new distribution for positive data, named the GTPN model. This new distribution includes as particular cases three models that are well known in the literature: the TPN, GHN and HN models. The basic properties of the model and ML estimation were studied. We performed a simulation study in finite samples and two real data applications, showing the good performance of the model compared with other usual models in the literature.

Author Contributions

All of the authors contributed significantly to this research article.

Funding

The research of H.J.G. was supported by Proyecto de Investigación de Facultad de Ingeniería. Universidad Católica de Temuco. UCT-FDI012019. The research of O.V. was supported by Vicerrectoría de Investigación y Postgrado de la Universidad Católica de Temuco, Projecto FEQUIP 2019-INRN-03.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Pewsey, A. Large-sample inference for the general half-normal distribution. Commun. Stat. Theory Methods 2002, 31, 1045–1054.
  2. Pewsey, A. Improved likelihood based inference for the general half-normal distribution. Commun. Stat. Theory Methods 2004, 33, 197–204.
  3. Wiper, M.P.; Girón, F.J.; Pewsey, A. Objective Bayesian inference for the half-normal and half-t distributions. Commun. Stat. Theory Methods 2008, 37, 3165–3185.
  4. Khan, M.A.; Islam, H.N. Bayesian analysis of system availability with half-normal life time. Math. Sci. 2012, 9, 203–209.
  5. Moral, R.A.; Hinde, J.; Demétrio, C.G.B. Half-normal plots and overdispersed models in R: The hnp package. J. Stat. Softw. 2017, 81, 1–23.
  6. Azzalini, A. A class of distributions which includes the normal ones. Scand. J. Stat. 1985, 12, 171–178.
  7. Azzalini, A. Further results on a class of distributions which includes the normal ones. Statistics 1986, 12, 199–208.
  8. Henze, N. A probabilistic representation of the skew-normal distribution. Scand. J. Stat. 1986, 13, 271–275.
  9. Cooray, K.; Ananda, M.M.A. A generalization of the half-normal distribution with applications to lifetime data. Commun. Stat. Theory Methods 2008, 10, 195–224.
  10. Olmos, N.M.; Varela, H.; Gómez, H.W.; Bolfarine, H. An extension of the half-normal distribution. Stat. Pap. 2012, 53, 875–886.
  11. Cordeiro, G.M.; Pescim, R.R.; Ortega, E.M.M. The Kumaraswamy generalized half-normal distribution for skewed positive data. J. Data Sci. 2012, 10, 195–224.
  12. Gómez, Y.M.; Bolfarine, H. Likelihood-based inference for power half-normal distribution. J. Stat. Theory Appl. 2015, 14, 383–398.
  13. Gómez, H.J.; Olmos, N.M.; Varela, H.; Bolfarine, H. Inference for a truncated positive normal distribution. Appl. Math. J. Chin. Univ. 2015, 33, 163–176.
  14. Bonferroni, C.E. Elementi di Statistica Generale; Libreria Seber: Firenze, Italy, 1930.
  15. Arcagni, A.; Porro, F. The graphical representation of inequality. Rev. Colomb. Estad. 2014, 37, 419–436.
  16. Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423.
  17. Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Control 1974, 19, 716–723.
  18. Dunn, P.K.; Smyth, G.K. Randomized quantile residuals. J. Comput. Graph. Stat. 1996, 5, 236–244.
  19. Yazici, B.; Yolacan, S. A comparison of various tests of normality. J. Stat. Comput. Simul. 2007, 77, 175–183.
  20. Zakerzadeh, H.; Dolati, A. Generalized Lindley distribution. J. Math. Ext. 2009, 3, 1–17.
  21. Laslett, G.M. Kriging and splines: An empirical comparison of their predictive performance in some applications. J. Am. Stat. Assoc. 1994, 89, 391–400.
  22. Schwarz, G. Estimating the dimension of a model. Ann. Stat. 1978, 6, 461–464.
  23. Nierenberg, D.W.; Stukel, T.A.; Baron, J.A.; Dain, B.J.; Greenberg, E.R. Determinants of plasma levels of beta-carotene and retinol. Am. J. Epidemiol. 1989, 130, 511–521.
Figure 1. Particular cases for the GTPN distribution.
Figure 2. Density and hazard functions for the GTPN ( σ = 1 , λ , α ) model with different combinations of λ and α .
Figure 3. Plots of the (a) expectation, (b) variance, (c) skewness and (d) kurtosis for GTPN ( σ = 1 , λ , α ) for α { 0.75 , 1 , 1.5 } as a function of λ . In (d), the dashed line represents the kurtosis of the normal distribution.
Figure 4. Bonferroni curve for the generalized truncation positive normal (GTPN) ( σ , λ , α ) model.
Figure 5. Entropy for the G T P N ( σ = 1 , α , λ ) model.
Figure 6. Fit of the distributions for the Laslett data set.
Figure 7. QR for the fitted models in the Laslett data set. The p-values for the Anderson–Darling (AD), Cramér–von Mises (CVM) and Shapiro–Wilk (SW) normality tests are also presented to check whether the QR came from the standard normal distribution.
Figure 8. Fit of the distributions for the cholesterol data set.
Figure 9. QR for the fitted models in the cholesterol data set. The p-values for the AD, CVM and SW normality tests are also presented to check whether the QR came from the standard normal distribution.
Table 1. Monte Carlo (MC) simulation study for the maximum likelihood (ML) estimators in the GTPN(σ, λ, α) model for 12 combinations of σ, λ and α. The results summarize the mean and the standard deviation (sd) of the respective estimators obtained in the 10,000 replicates.
n = 50n = 150n = 300n = 1000
True value σ̂ λ̂ α̂ σ̂ λ̂ α̂ σ̂ λ̂ α̂ σ̂ λ̂ α̂
σ λ α meansdmeansdmeansdmeansdmeansdmeansdmeansdmeansdmeansdmeansdmeansdmeansd
130.81.5231.5733.0891.7970.9871.0291.2271.0293.0001.0520.8750.3501.0810.6143.01706340.8300.2021.0110.2333.0140.2670.8030.071
11.2861.1053.1181.8431.2270.7051.1110.7053.0271.0401.0830.4251.0440.4443.0200.6291.0310.2501.0060.1853.0120.2661.0040.089
21.0060.5103.2202.0362.4320.3091.0110.3093.0191.0552.1710.8501.0010.1963.0220.6142.0580.4810.9980.0933.0130.2662.0080.177
40.81.4891.7994.3912.0100.9710.8601.0870.8604.2801.1990.8170.2821.0170.5134.1640.7860.7980.1481.0030.3014.0490.4130.7990.082
11.2361.2194.4392.1821.2030.6311.0180.6314.2951.2221.0170.3430.9860.4184.1870.8260.9950.1910.9930.2304.0550.4040.9980.099
20.9620.5434.6422.4952.3600.3210.9570.3214.3231.3262.0330.6960.9740.2234.1680.8392.0000.3820.9930.1184.0450.4042.0000.198
230.83.0063.1313.1571.8760.9741.9962.4231.9963.0181.0450.8690.3402.1521.1853.0230.6240.8230.1942.0340.4723.0070.2690.8050.072
12.5122.1983.2151.9421.2071.4372.2371.4373.0281.0771.0860.4312.0940.8973.0170.6421.0330.2502.0090.3733.0150.2691.0040.089
21.9991.0393.3182.2232.4170.6312.0170.6313.0351.0962.1710.8622.0000.4053.0270.6382.0610.5041.9950.1863.0160.2672.0060.177
40.82.8803.5444.5622.2830.9701.7142.1611.7144.3261.2880.8130.2822.0241.0784.1940.8490.7960.1602.0030.5734.0530.4040.7990.079
12.4022.4344.6592.4191.1761.2832.0331.2834.3441.3461.0150.3551.9740.8354.1900.8440.9950.1911.9900.4654.0530.4080.9980.100
21.9131.1324.8022.8482.3780.6681.8980.6684.3931.4692.0240.7081.9380.4604.1980.9011.9930.3911.9840.2364.0470.4012.0000.198
Table 2. Descriptive statistics of the Laslett data set.
Data set | n | x̄ | S² | b₁ | b₂
Heights measured | 115 | 3.48 | 0.52 | −1.24 | 6.30
Table 3. Estimation of the parameters and their standard errors (in parentheses) for the GTPN, TPN, WEI, and GL models for the data set. The AIC and BIC criteria are also included.
Estimated | GTPN | TPN | WEI | GL
σ | 2.354 (0.379) | 0.720 (0.047) | 5.855 (0.435) | -
λ | 2.512 (0.495) | 4.842 (0.333) | 3.744 (0.062) | 4.093 (0.539)
α | 2.227 (0.401) | - | - | 13.467 (1.902)
γ | - | - | - | 15.528 (29.638)
AIC | 238.43 | 254.63 | 251.74 | 308.67
BIC | 246.67 | 260.12 | 257.23 | 316.90
Table 4. Descriptive statistics for the cholesterol data set.
Data set | n | x̄ | S² | b₁ | b₂
Cholesterol consumed | 315 | 242.46 | 17421.79 | 1.47 | 6.34
Table 5. Estimated parameters and their standard errors (in parentheses) for the GTPN, TPN, WEI, and GL models for the cholesterol data set. The AIC and BIC criteria are also presented.
Estimated | GTPN | TPN | WEI | GL
σ | 0.040 (0.027) | 151.106 (8.734) | 1.964 (0.080) | -
λ | 7.888 (0.459) | 1.455 (0.138) | 274.717 (8.353) | 0.016 (0.001)
α | 0.240 (0.013) | - | - | 2.831 (0.293)
γ | - | - | - | 2.067 (6.720)
AIC | 3874.92 | 3943.25 | 3905.68 | 3879.07
BIC | 3886.17 | 3950.75 | 3913.18 | 3890.42
