1. Introduction
Several empirical studies confirm that the real financial market data are often skewed and heavy-tailed (e.g., [
1,
2,
3]). Therefore, a Gaussian distribution rarely fits the data—for example, stock returns or risk factors are badly fit by this law [
4,
5]. Note that the
-stable distribution offers a reasonable improvement, if not the best choice, among the alternative distributions that have been proposed in the literature (e.g., [
6,
7]). Thus, normal distributions are usually substituted by more general stable distributions that allow us to model both the effects of leptokurtosis and asymmetry in the data.
In the one-dimensional case, the distribution of the random
-stable variable is characterized by four parameters: the tail (stability) parameter
, skewness
, scale
and position
. The stability parameter
is the most important, because it describes the behavior of the tails [
2]. Moreover, when
, the distribution of the random variable is equivalent to a normal distribution and
(normal tail behavior). If
, then the distribution becomes degenerate (extremely heavy tails). For example, Fielitz and Smith [
8] showed that a symmetric univariate
-stable distribution appears to be appropriate when stock prices are modeled. Rachev and Mittnik [
2] estimated the parameters of autoregressive-moving-average models with asymmetric stable Paretian distributions. Kabasinskas et al. [
9] proposed a mixed-stable model to model the intra-daily data from German DAX component stock returns. Furthermore, the parameters of the stable distribution collectively and the skewed t-distribution, calculated on the basis of stock data, were applied to predict the direction of the change in future Accounting and Governance Risk rating [
10]. Ogata [
11] revealed that cryptocurrencies are a special class of assets. During various clustering procedures, parameters of
-stable distribution (together with other) were used as the main features.
One of the most recent applications of
-stable and truncated
-stable distributions in pension fund selection was published by Kopa [
12]. They used first-, second- and third-order stochastic dominance rules to select the best pension fund from second pillar in Lithuania on the basis of the risk profile of the participant.
Furthermore, the multivariate
-stable symmetric vector with the stability parameter
is expressed through a normally distributed random vector whose variance is distributed by an
-stable distribution [
13,
14,
15,
16]
where
is a subordinator with the stability parameter
;
is a random vector, distributed by the
d-variate normal law
; and
is a random vector of the means. The random
-stable variable with
and
is called a subordinator. In general, a subordinator is a one-dimensional non-decreasing Lévy process [
17]. The estimation of parameters of multivariate
-stable distributions has been a possibility for long time [
18,
19,
20,
21,
22,
23]; however, researchers emphasize that methods of the estimation of parameters still have weaknesses. For example, the authors of [
20] gave an example of an application of the elliptical sub-Gaussian distribution to DAX 30 and reported how to deal with issues when the
values of underlying assets are different and data are asymmetric; however, they could not incorporate time dependencies. Daul [
24] explained how to deal with large portfolios when assets have very different marginal distributions (
s are very different). Furthermore, Nolan [
25] gave an example of how to use it for Dow Jones 30 portfolios and introduced how to solve tail and serial dependency issues. Moreover, Kouaissah [
26] showed how to use parametric multivariate stochastic dominance in the case of a sub-Gaussian distribution, even with different tail parameters. Such results are crucial in portfolio selection and management problems. However, practical applications of the multivariate stable law are limited by the fact that its distribution and density functions are not expressed through elementary functions. That said, Ravishanker’s group and Rezakhah’s group [
15,
27] reported the estimation of parameters of sub-Gaussian vectors by stochastic EM algorithms. The MCMC approach for the maximum likelihood estimation of model parameters (
) was considered by Ravishanker [
15]. However, the accuracy of stochastic methods (e.g., [
27]) is highly dependent on internal sample size, the error of stochastic integration and convergence criteria.
The analytical EM parameter estimation approach is considered in this paper. Since the probability density function (see the next section) of a random symmetric multivariate -stable vector might be expressed through bivariate integrals, it may be calculated using Gaussian and Gauss–Laguerre quadratures instead of using stochastic integration. This approach increases accuracy, simplifies the calculation of likelihood functions and increases computational efficiency compared to regular MLE methods and stochastic EM algorithms.
Now, let us discuss goodness-of-fit tests that were used in similar studies. The well known Kolmogorov–Smirnov test [
28,
29] is sensitive to differences in the position and shape of the sample distribution. It is extremely difficult to implement the proposed methods, as they require a transformation whose derivation is analytically intractable for most multivariate distributions. Another famous goodness-of-fit test is the Anderson–Darling test [
30], which is used to estimate statistical differences between observed values and theoretical univariate distributions. However, there has been no multivariate case for this test created yet. Goodness-of-fit tests for univariate, symmetric,
-stable distributions were considered by Matsui and Takemura [
31]. Their methodology requires complicated numerical integration in order to preform goodness-of-fit tests for symmetric stable distributions. A completely different approach based on the empirical characteristic function tests for the multivariate stable distributions was considered by Meintanis [
32]. However, they used weighted
-type statistics that are difficult to compute. In [
33], specific goodness-of-fit test for multivariate normal distributions are proposed. The authors claimed that their test is overall the most powerful and is effective against skewed, long-tailed and short-tailed symmetric alternatives. Moreover, they claimed that this test can be used for testing fitness for an assumed
p-variate non-normal distribution. However, the major weakness of the test is related to the proposed test statistics. McAssey [
34] introduced a technique of a goodness-of-fit test for general multivariate distributions based on multivariate normal, uniform, Student’s t and beta distributions. This method is simple to implement, involves a reasonable computational burden and does not require analytically intractable derivations. It has sufficient power to detect a poor fit in most situations. However, the authors did not provide evidence that such a technique could be used for a multuvariate
-stable distribution.
As an alternative to the mentioned goodness-of-fit tests, in this paper, we suggest a likelihood ratio test (developed in [
35]). This test has never been used in the scientific literature for the goodness-of-fit of multivariate
-stable distributions. However, this likelihood ratio test is computationally fast and does not require specific knowledge. The only two things that must be known are how to generate random variables and how to compute a likelihood function.
The paper is structured as follows: the likelihood function and the maximum likelihood method are described in
Section 2. The EM algorithm is considered and applications of the Gaussian and Gauss–Laguerre quadrature formulas for bivariate integration are discussed in
Section 2.3. The likelihood ratio test for distribution fitting with the estimated parameters is given in
Section 2.4. In
Section 3, we provide evidence of the convergence of the EM algorithm and real world examples with stock data, biomedical data and cryptocurrency data. Finally, we provide empirical recommendations in
Section 3.5 and discuss our findings in
Section 4.
3. Results
The estimation of MLE
-stable parameters by the EM algorithm is an iterative process that requires one to choose initial values, (
10) and (
11), and perform a certain number of iterations—(
7)–(
9)—while the values in adjacent steps differ insignificantly. To test the behavior of the designed algorithm, the experiments were performed with simulated and real world data. The integrals were calculated using the Gaussian (
12) and the Gauss–Laguerre (
13) quadratures.
3.1. Test Data Modeling
To test the convergence of the proposed analytical EM algorithm, the following experiment was conducted.
First, we generated
two-dimensional,
-stable random vectors of length
or
(for the simulation algorithm, see [
1,
2]). All samples were distributed by a bivariate,
-stable distribution with parameters
,
and
Then, for each two-dimensional vector, we estimated parameters of a bivariate,
-stable distribution using the proposed EM algorithm and direct minimization of (
3). We assume that the direct MLE method gives true estimates of a bivariate,
-stable distribution. During each iteration, we registered EM estimates of all parameters until their values deviated from the previous estimate by no more than
. Moreover, the terminal estimate was compared with the corresponding true value (obtained by direct method), and the statistical significance of differences was tested. It turns out that such precision ensures that there are no statistically significant differences between the true value and the corresponding terminal estimate. Thus, the analytical EM algorithm converged for all 100 samples. The averaged results are shown in
Figure 1.
The averages of LLF and EM estimates are shown as solid lines for each iteration; corresponding values of direct MLE are dashed lines; and dotted lines are 95% confidence intervals calculated using the normal approximation. As shown in
Figure 1, the convergence of the EM algorithm is rather fast independently of the sample size. The likelihood function and location parameter converged in 5–7 iterations; however, tail parameter
and covariance matrix converged only in slightly fewer than 20 iterations. This is not a surprise, because LLF is much more sensitive to changes in
and
. In the case of shorter samples (
,
Figure 1a), the average LLF converged to 268.38, which is equal to the average value of the direct method (the average is calculated from
estimates of random two-dimensional samples). The corresponding values of other parameters on average converged to
,
and
. If the samples were of length
(
Figure 1b), then the average LLF converged to 567.69, which on average is the same as that obtained using the direct MLE method. The corresponding values of distribution parameters on average converged to
,
and
. The presented results illustrate the convergence of the created EM algorithm with respect to (
3), (
5) and (
6).
It must be noted that the same convergence (as seen in the average case) was observed in each of the 100 cases, too. This means that, independently of the sample selected, the EM estimates always converged to the values obtained by the direct MLE method.
Finally, the likelihood ratio test was performed for all simulated datasets, where a particular LLF value in each case was compared with the empirical distribution function of the LLF with original parameters
,
and
. If we set the significance level
p to 0.01, then, according to the likelihood ratio test (
Section 2.4), the empirical probability should be within the interval
, i.e.,
. In this case, we cannot reject the hypothesis that the simulated data fit a symmetrical, multivariate
-stable distribution with given parameters
,
and
. During our experiment, there were no cases when the hypothesis could be rejected.
It must be noted that analytical EM algorithm is 30 times faster than the direct MLE algorithm and could be used in practice.
3.2. Stock Market Data
In the previous section, we showed that the EM algorithm is fast and converges in approximately 20 iterations independently of the length of the dataset. Theoretically, if the datasets are symmetrically distributed with the same parameter
, then the EM algorithm should converge quickly and the parameter estimates should converge to true values. We now show a practical example with real stock price data from Yahoo [
46]. The algorithm was applied to daily stock returns of four companies, AAPL, TSLA, LUMIN and ATnT, collected during 2019–2021. This experiment dataset consisted of four-dimensional vectors of length
.
First, we investigate the marginal characteristics of the underlying returns (AAPL, TSLA, LUMIN and ATnT).
Table 1 presents the empirical characteristics and estimates of marginal
-stable distributions.
As we can see, the mean returns of AAPL and TSLA were positive, while those of LUMIN and ATnT were negative, which indicates that prices of the first two were mostly increasing during the period analyzed, while the prices of the latter two mainly decreased. We use the vector of empirical means as one of input parameters for the EM algorithm. Furthermore, negative skewness and kurtosis above three indicate that the marginal empirical distributions for all the assets deviate from normality/symmetries. This assumption is supported by the fact that estimated marginal parameters are different from two and are different from zero. Moreover, Anderson–Darling statistics (A-D) that is calculated for each sample, if we assume an -stable distribution with given parameter estimates, indicate that we cannot reject the non-parametric hypothesis about data fitness with -stable distribution. It must be noted that was slightly different for all assets analyzed, which indicates that we deviated from theoretical assumptions, but not too far.
Next, in
Table 2, we show the correlation and covariance among the assets analyzed.
The covariance matrix is input parameter for the EM algorithm, while the correlation allows us to understand how strong the linear relation between variables is. As can be seen, stock returns are positively but weakly correlated. Only ATnT and LUMIN have correlations above 0.5.
Figure 2 shows that the LLF for fixed
and
is unimodal with respect to the tail parameter
. Moreover, this property was observed in all iterations.
Therefore, the application of the golden section search method is reasonable in order to minimize LLF with respect to
. In this case, the EM algorithm converged in fewer than 30 iterations.
Figure 3 shows the dependence of the id of iterations and LLF and the estimates of parameters
,
and
.
As it may be observed, the convergence is slightly slower in the stock data case compared with the simulated datasets. Such behavior might be due to the fact that the real datasets are asymmetric and have different marginal parameters . However, this assumption is not supported by any kind of experiments yet.
According to results of the EM algorithm, the parameter estimates of the multivariate -stable distribution for stock data are as follows:
,
while the terminal value of the LLF
function was equal to
. The subscript above indicates the iteration at which the parameter estimate converged. As we can see, the LLF converged after 29 iterations so we say that the algorithm converged after 29 iterations too. The estimate
confirms the assumption (from literature) that
for stock returns from developed markets is above 1.5.
We demonstrated that, even if the data are not purely symmetrically distributed, EM converges quickly and allows us to obtain ML estimates of parameters of multivariate -stable random variables. However, it was not shown yet how good the the estimation is. Now, let us show the likelihood ratio test as a goodness-of-fit test.
First,
four-dimensional testing samples with estimated parameters
,
and
were generated, and LLF values for each sample were calculated. Then, the empirical distribution function from LLF values was constructed (shown as a solid line in
Figure 4).
Secondly, the empirical probability that the random value of LLF is less than or equal to the terminal value of
(test statistics) was calculated (shown as a dashed line in
Figure 4). In the case of stock market data, the empirical probability of the test statistics was equal to 0.020927.
Finally, if we set the significance level to 0.01, according to the likelihood ratio test, this probability is within the interval (0.005, 0.995). This means that we cannot reject the hypothesis that the data fit a multivariate, symmetric, -stable distribution with parameters , and .
3.3. AIS Data Modeling
In a similar manner, we give an example of the analysis of data from the Australian Institute of Sport (AIS), examined in [
47] (the database is often used for statistical calculations). This dataset contains various biomedical measurements (body fat (Bfat, %), lean body mass (LBM, kg), height (Ht, cm) and total weight (Wt, kg)) of a group of female Australian athletes [
48]. In this experiment, the data consisted of a four-dimensional vector of length
.
Table 3 gives the empirical characteristics and estimates of marginal
-stable distributions for female athletes.
As shown in
Table 3, the data are not leptokurtic (kurtosis < 3), which means that the data are light-tailed or lack outliers. Moreover, the negative kurtosis of Bfat indicates that its distribution has a lighter tail than the normal distribution. Bfat, LBM and Wt are fairly symmetrical, because their skewness is between −0.5 and 0.5. All mentioned observations indicate that Bfat, LBM and Wt could be normally distributed. This assumption is supported by goodness-of-fit statistics (A-D statistics), which, being less that 2, indicate that Bfat, LBM and Wt are
-stable distributed with
(normal distribution). However, the marginal estimate of parameter
for Ht is equal to 1.6858, and we cannot state that the four-dimensional vector is distributed by a multivariate normal distribution anymore. That is why we proceed with the estimation of parameters of the multivariate
-stable distribution. If the terminal
of the EM algorithm turned out to be close to 2, then we would run a test for multivariate normality.
Next, we give estimates of correlation and covariance matrices for AIS data (see
Table 4).
As shown in
Table 4, AIS data are strongly correlated, with the exception of Bfat, which correlates weakly with LBM and Ht. When we initiated the EM algorithm (in the same vein as in previous section), we used the vector of means as the initial value of
and the covariance matrix as the initial value of
. Similar to
Figure 2, in
Figure 5, we show that the LLF for a fixed
and
is unimodal with respect to the stability parameter
.
Using the EM algorithm described in this paper, the LLF value and parameter estimates were registered during 100 iterations (see
Figure 6).
The optimal value of the parameter
was obtained after 14 iterations. However, the EM algorithm converged with respect to the mean and covariance matrix, respectively, after 40 and 47 (slightly more than in stock case) iterations. From the figure of parameters
(middle column in
Figure 6), it may appear that the convergence of the location parameter was achieved very early; however, as the initial value of
was luckily selected very close to true value, the oscillations were very small. The greatest absolute deviation of
in sequential iterations was in the interval
and visually cannot be seen. According to the EM algorithm, the estimates of the parameters of the multivariate
-stable distribution for AIS data are as follows:
,
while the terminal value of the LLF
function is equal to
. It is interesting to note that the sample size in the AIS data case was five times smaller than that of the stock data; however, the number of iterations was nearly double.
As we can see again, terminal is not very different form the smallest marginal . Furthermore, the estimated was different from two, which is why we can reject the hypothesis about the multivariate normality of AIS data.
Now, let us show a likelihood ratio test in the same manner as in the previous section. First, we constructed empirical distribution function from LLF values with parameters
,
and
(shown as a solid line in
Figure 7).
Secondly, we calculated the empirical probability that a random value of LLF is less than or equal to the terminal value
(shown as a dashed line in
Figure 7). In the case of AIS data, the empirical probability is equal to 0.3662.
Finally, we cannot reject the hypothesis that AIS data fit a symmetric multivariate -stable distribution with parameters , and because this probability is within the interval (0.005, 0.995).
3.4. Cryptocurrency Data
Finally, we show how the EM algorithm can be applied to estimate the parameters of a four-dimensional cryptocurrency dataset. As an example, Bitcoin, Ethereum, Ripple and ADA (during 2019–2021 [
46]) were used. Hence, the length of the vector is
(exchanges operate 24/7 the whole year).
Table 5 gives empirical characteristics and estimates of marginal
-stable distributions of four cryptocurrencies.
There is no big surprise that the characteristics of cryptocurrency data are different from those analyzed in
Section 3.2 and
Section 3.3. The first obvious things that are very different from the previous sections are the skewness and kurtosis. The negative skewness is nearly twice greater than for stocks and AIS data, while the kurtosis is extremely large and exceeds 20. Such a composition indicates that we should expect fat left tails, i.e.,
and
. However, marginal estimates of the
-stable distribution show that such an assumption is not completely true. Parameter
is mainly close to 0, which indicates fair symmetry of the tails. Moreover, marginal
s are not as extreme as expected, but they are significantly different from stocks and AIS data.
Next, in
Table 6, we check for empirical correlation and covariance.
As we can see, cryptocurrencies are average (above 0.5) to strongly (above 0.8) correlated, which indicates that there are high chances that the prices of underlying assets will move in the same direction. Such a portfolio would have a higher risk than a less correlated one.
Next, we ran the EM algorithm to estimate the parameters of the multivariate
-stable distribution. There is no surprise that the LLF for fixed
and
is unimodal with respect to the stability parameter
(see
Figure 8).
Using the EM algorithm, the LLF value and parameter estimates were registered for the first 100 iterations. However, the convergence conditions were reached much more quickly (see
Figure 9).
Figure 9 shows that LLF and parameter estimates by the EM algorithm converge to the optimal maximum likelihood value after 50 iterations. The number of iterations is similar to the case of the AIS data case.
The estimates of parameters of a multivariate
-stable distribution for cryptocurrency data are as follows:
,
while the terminal value of the LLF
function is equal to
. By contrast to the stock market, the estimate of parameter
for cryptocurrency is below 1.5, which is not a surprise.
Now, let us show a likelihood ratio test in the same manner as in the two previous sections. First, we constructed an empirical distribution function from LLF values with parameters
,
and
(shown as a solid line in
Figure 10).
Secondly, we calculated the empirical probability that a random value of LLF is less than or equal to the terminal value
(shown as dashed line in
Figure 10). In the case of cryptocurrency data, the empirical probability is equal to 0.08119 and is near the lower bound of non-rejection (0.005, 0.995) region. However, we cannot reject hypothesis that cryptocurrency data fits multivariate symmetric
-stable distribution with parameters
,
and
.
3.5. Recommendations
In the previous three sections, we demonstrate examples of how to estimate parameters of a multivariate
-stable distribution using an analytical EM algorithm. In the case of stock data, the estimated tail parameter
of the multivariate case was very close to the smallest marginal
estimate (given in
Table 1). Moreover, a similar situation was observed in AIS and cryptocurrency data. This gives us an idea of how to chose the initial
for applications of the EM algorithm in the future.
Furthermore, the estimate in the case of stock data was more similar to the vector of medians or vector of marginal estimates rather than the vector of empirical means. However, for AIS data, marginal empirical distributions were symmetrical (all empirical location parameters were very similar) so there was no difference if we chose a vector of empirical means, medians or marginal parameters as the initial value for .
By summarizing two latest assumptions, we can recommend one to chose the smallest marginal , vector of marginal parameters (or medians) and covariance matrix as initial values for the EM algorithm. This could increase efficiency and reduce the number of iterations required to converge.
By summarizing the findings from the last four sections, we can give the following recommendations for the implementation of an analytical EM algorithm:
Check for empirical characteristics of datasets. The vectors of means or medians can be used as initial values for , and the covariance matrix as an input parameter for .
If estimates of a marginal -stable distribution are available, then choose the minimal as the initial value of and the vector of location parameters as .
Run the EM algorithm and check for convergence.
(Optional) When one or more parameter estimates converge, stop updating them and optimize only with respect to parameters that did not converge. However, such a modification may cause the algorithm to get stuck in a local minimum.
Use the likelihood ratio test to check the goodness-of-fit.
4. Discussion and Conclusions
Today, the research of distributions related to
-stable ones, including sub-Gaussian vectors, is more relevant, because they often occur in the analysis of economic data, information flows along computer networks, cryptocurrency markets, etc. The statistical maximum likelihood approach is constructed by the analytical EM algorithm in this paper, which allows us to estimate the parameters of the symmetric multivariate
-stable distribution with high precision. Computational experiments showed that multivariate
-stable distribution parameter estimates, obtained by the developed numerical method, are statistically adequate, because, after a certain number of iterations, the value of the likelihood function and estimates of the parameters converged to their corresponding (“true”) values. However, due to the fact that real data are often asymmetric, it is very important to perform goodness-of-fit tests. The proposed likelihood ratio test (for assessment of the goodness-of-fit) showed that the data fit a multivariate
-stable model well with obtained estimates in all three cases analyzed (stock market data, biomedical measurements data and cryptocurrency data). In future research, this test could be compared with other multivariate goodness-of-fit tests, e.g., the one proposed in [
34] or based on characteristic functions [
32].
Moreover, it is more important to make sure that the methodology is robust with respect to the asymmetry and the fact that marginal s in real data are different. This is also the future trend of our research.
Furthermore, in the Results Section, we emphasized that the number of iterations is nearly independent of sample size; however, the duration of calculations (real CPU time) is highly dependent on that. This is due to the fact that the likelihood function is sample dependent. We plan (in the near future) to implement parallel summation in LLF and parallel quadrature integration to reduce overall processing time.
Finally, it must be noted that the analytical EM algorithm is 30 times faster than the regular ML algorithm and could be very useful in practice. However, it is currently not available in R or Python, where the stochastic EM algorithm is already implemented.