Minimum message length analysis of multiple short time series
Introduction
Consider data comprised of sequences generated by the following stationary first order Gaussian autoregressive model: where , are the sequence means, is a common autoregressive parameter and denotes the innovations which are independently and identically distributed as . The starting point of this paper is to make inferences about the parameters given data sampled from the model (1)–(2). The sequences are considered exchangeable, in the sense that inferences made about the model parameters should be invariant under the interchange of any pair of sequences in the matrix . This model appears frequently in epidemiological and medical studies in which several measurements have been made over time on a large number of people. In this case, the autocorrelation parameter is of particular interest, as it represents how well the physical quantity “tracks” over time.
Making inferences about in this setting is complicated by the fact that the number of parameters grows with the number of sequences and a straightforward application of the maximum likelihood principle leads to inconsistent estimates of both and . A likelihood-based solution to the problem of estimating in the model (1)–(2) using the method of approximate conditional likelihood was presented in Cruddas et al. (1989) and shown to yield significant improvements over the standard maximum likelihood estimates. Two frequentist test procedures for the presence of autocorrelation are also discussed in Cox and Solomon (1988).
A solution within the Bayesian framework of inference would be of great value. Unfortunately, with the choice of sensible priors that reflect the invariance properties required of the problem, the usual method of analysing the posterior distribution formed from the product of the prior distribution and likelihood is unsatisfactory. The posterior distribution does not concentrate probability mass around the true parameter values even as the number of sequences , and parameter estimation based on this posterior is subsequently inconsistent. This paper demonstrates that estimation based on the alternative information-theoretic Bayesian principle of minimum message length (Wallace, 2005) leads to satisfactory estimates of all parameters as well as providing a simple basis for testing for autocorrelation.
This paper has three aims: (1) to produce satisfactory point estimates for all parameters of the first order Gaussian autoregressive model, (2) to produce a suitable test for autocorrelation, and (3) demonstrate the resolution of a difficult estimation problem using the minimum message length principle.
Section snippets
Minimum message length
The minimum message length (MML) principle (Wallace, 2005, Wallace and Boulton, 1968, Wallace and Freeman, 1987) is a Bayesian principle for inductive inference based on information theory. The essential idea behind the minimum message length principle is that compressing data is equivalent to learning structure in the data. The key measure of the quality of fit of a model to data is the length of the data after it has been compressed by the model under consideration. As the compressed data
Wallace–Freeman estimates
Inference using the Wallace–Freeman estimator requires specifying a likelihood function, the corresponding Fisher information matrix and prior densities over all parameters. In the multiple short time series setting specified by (1)–(2), the negative log-likelihood function for the parameters is where The determinant of the Fisher information matrix for is
Estimation of
The Wallace–Freeman estimates of were compared against the approximate conditional likelihood estimates derived in Cruddas et al. (1989) and the regular maximum likelihood estimates. Due to the translation invariance of the estimates for given by (12), and the fact that the estimates for and are based on the residuals for both the Wallace–Freeman procedure and approximate conditional likelihood, the particular choice of will have no effect on the behaviour of the
Comparison with approximate conditional likelihood
The approximate conditional likelihood procedure can be shown to be equivalent to a restricted Wallace–Freeman estimator. In particular, the approximate conditional likelihood estimate for a parameter of interest orthogonal to nuisance parameters can be written as with prior densities of where is the maximum likelihood estimate of the orthogonalised parameters given . Although this estimate is permissible in the sense that
References (15)
- et al.
Noninformative priors and Bayesian testing for the AR(1) model
Econ. Theory
(1994) - et al.
Sphere Packing, Lattices and Groups
(1998) - et al.
On testing for serial correlation in large numbers of small samples
Biometrika
(1988) - et al.
A time series illustration of approximate conditional likelihood
Biometrika
(1989) - et al.
Resolving the Neyman–Scott problem by minimum message length
- et al.
Invariant HPD credible sets and MAP estimators
Bayesian Anal.
(2007) A new message length formula for parameter estimation and model selection
Cited by (5)
Minimum Message Length Inference of the Weibull Distribution with Complete and Censored Data
2024, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)MML is not consistent for neyman-scott
2020, IEEE Transactions on Information Theory