Elsevier

Journal of Econometrics

Volume 170, Issue 2, October 2012, Pages 331-349
Journal of Econometrics

GEL statistics under weak identification

https://doi.org/10.1016/j.jeconom.2012.05.009Get rights and content

Abstract

The central concern of this paper is the provision in a time series moment condition framework of practical recommendations of confidence regions for parameters whose coverage probabilities are robust to the strength or weakness of identification. To this end we develop Pearson-type test statistics based on GEL implied probabilities formed from general kernel smoothed versions of the moment indicators. We also modify the statistics suggested in Guggenberger and Smith (2008) for a general kernel smoothing function. Importantly for our conclusions, we provide GEL time series counterparts to GMM and GEL conditional likelihood ratio statistics given in Kleibergen (2005) and Smith (2007). Our analysis not only demonstrates that these statistics are asymptotically (conditionally) pivotal under both classical asymptotic theory and weak instrument asymptotics of Stock and Wright (2000) but also provides asymptotic power results in the weakly identified time series context. Consequently, the empirical null rejection probabilities of the associated tests and, thereby, the coverage probabilities of the corresponding confidence regions, should not be affected greatly by the strength or otherwise of identification. A comprehensive Monte Carlo study indicates that a number of the tests proposed here represent very competitive choices in comparison with those suggested elsewhere in the literature.

Introduction

In many situations empirical researchers are confronted with instrumental variables only weakly correlated with the endogenous variables with the consequence that structural parameters are only weakly identified. This paper contributes to what is now a voluminous literature on the topic of partial and weak identification initiated in Sargan (1983), Phillips (1989), and Staiger and Stock (1997). Those papers demonstrate that the correspondence between the limiting and finite-sample behaviours of estimators and test statistics may be poor in such circumstances.1 The particular aim of this paper then is the practical recommendation of test statistics appropriate for nonlinear moment condition models with time series data that possess finite sample size properties relatively unaffected by the weakness or strength of identification and that also display good power. Consequently their acceptance regions may be inverted to provide reliable and robust confidence regions for parameters of interest.

The contributions of the paper are as follows. We propose new test statistics for a time series setting based on the implied probabilities generated by generalised empirical likelihood (GEL) estimation, see, e.g., Smith, 1997, Smith, 2011 and Newey and Smith (2004), appropriate for circumstances in which identification may be weak as discussed in Stock and Wright (2000), henceforth SW. These statistics generalise to the time series context versions of the Pearson-type GEL statistics suggested in Ramalho and Smith (2004) for independently and identically distributed (i.i.d.) observations and are defined in terms of smoothed moment indicators obtained using kernel function based weights which incorporate a bandwidth parameter; cf. Kitamura and Stutzer (1997) and Smith, 1997, Smith, 2000, Smith, 2011. We also reconsider the statistics developed in Guggenberger and Smith (2005), hereafter GSa, for i.i.d. observations and those proposed for time series data in Guggenberger and Smith (2008), henceforth GSb, that use a truncated kernel function for the smoothed moment indicators. In particular, since their finite sample behaviour may be affected by the choice of kernel function, we reformulate the GSb statistics for a general form of kernel function. Likewise, our Pearson-type GEL statistics also incorporate a general kernel function. The tests discussed here deal with both simple hypotheses on the full parameter vector and composite hypotheses on a subvector of parameters, the latter case, apart from one particular statistic, requiring that the parameters not under test be strongly identified, an assumption adopted in other papers, see, e.g., Kleibergen (2005), GSa, Otsu (2006) and GSb. In addition, and importantly for our conclusions, we propose GEL time series counterparts to the GMM conditional likelihood ratio (CLR) statistic in Kleibergen (2005) and the GEL CLR statistics developed in Smith (2007) for the i.i.d context; cf. Moreira (2003).

We investigate the large sample behaviour of the statistics proposed here at fixed alternative values for the weakly identified parameters, a scenario that permits a comparison of their relative asymptotic power properties in the weakly identified context. Importantly, under the null hypothesis, the statistics proposed here, in GSb and elsewhere, e.g., in Kleibergen (2005) and Otsu (2006), are asymptotically (conditionally) pivotal, unlike the corresponding Wald and likelihood ratio statistics. Indeed, with the notable exception of the GEL CLR statistics, whose large sample conditional null distribution may be straightforwardly simulated, our statistics are asymptotically chi-square under the null hypothesis. Thus, the empirical rejection probabilities under the null hypothesis (NRPs) of tests formed from these statistics may be expected to be relatively invariant to the strength or weakness of identification. Our elucidation of the asymptotic properties of the statistics also demonstrates certain asymptotic equivalences among classes of statistics. Hence, without recourse to higher order analysis, reliance must be placed on simulation experiments for statistical choice. Consequently, in order to inform our recommendations for econometric practice, we undertake a comprehensive Monte Carlo study to assess the finite sample size and power characteristics of the statistics proposed here based on a similar design to that in GSb. For comparative purposes we also include in this study a number of additional tests favourably reported in the recent weak instrument literature; see Kleibergen (2005), Otsu (2006) and GSb. To anticipate our conclusions, particular implementations of the GEL CLR and Pearson-type tests have satisfactory finite sample size properties and power characteristics that are very competitive with existing tests.

The paper is organised as follows. Section 2 reviews GEL estimation and associated constructs. Statistics appropriate for tests on the full vector of parameters are introduced in Section 3 and their limiting behaviour stated whereas those appropriate for the subvector case are discussed in Section 4. Section 5 describes the results of the Monte Carlo study. Section 6 concludes. The appendices contain the technical assumptions and proofs of results contained in the text.

The following notation is used in the paper. The symbols “p”, “d” and “” denote convergence in distribution, convergence in probability and weak convergence of empirical processes, respectively, with “a” indicating that the limiting distribution of the left hand side expression is bounded above by that of the right hand side. Convergence “with probability approaching 1” is written as “w.p.a.1”. The space Ci(S) contains all functions that are i times continuously differentiable on the set S. The notation vec(M) stands for the column vectorisation of the k×p matrix M, i.e., if M=(m1,,mp) then vec(M)(m1,,mp), “M” denotes the transpose matrix of M and (M)i,j the element in the i-th row and j-th column. Furthermore, for a symmetric matrix M, “M>0” means that M is positive definite, λmin(M) and λmax(M) are the smallest and largest eigenvalues of M, respectively, and Mλmax(MM). The vector inequality yz is to be interpreted as the inequality holding element-wise. For a full column rank matrix ARk×p and 0<KRk×k, we denote by PA(K) the oblique projection matrix A(AK1A)1AK1 and define MA(K)IkPA(K), where Ik is the k-dimensional identity matrix. We abbreviate this notation to PA and MA if K=Ik and, if p=0, set MA=Ik. Finally χ2(k) and χ2(k;δ) respectively denote central chi-square and noncentral chi-square random variables with k degrees of freedom and noncentrality parameter δ.

Section snippets

Generalised empirical likelihood

This section outlines GEL estimation for time series observations based on kernel smoothed counterparts of the moment indicators; see Kitamura and Stutzer (1997) and Smith, 1997, Smith, 2011.2 For i.i.d. observations, Newey and Smith (2004) show that GEL estimators possess attractive higher order properties as compared to those based on GMM (Hansen, 1982, Newey, 1985, Newey and West, 1987).

Full vector test statistics

The primary concern of this section is the provision of robust confidence regions for θ0 in a weakly identified and time series setting by the consideration of tests of the simple null hypothesis H0:θ=θ0 against the composite alternative H1:θθ0.

We obtain versions of the Pearson-type class of statistics due to Ramalho and Smith (2004) for these circumstances utilising the implied probabilities π̂i(θ),(i=1,,n), of (2.10). We also adapt the statistics considered in GSb for the general kernel

Subvector test statistics

Our concern now is the construction of robust confidence regions for the subvector α0 of θ0=(α0,β0) where α0RpA. Consequently our interest is in testing H0:α=α0versusH1:αα0.

As in much of the literature, see, e.g., Kleibergen (2005), GSa and GSb, a critical limitation of our analysis, with the exception of the GEL criterion function statistic (4.5) discussed below, is that we require the parameter vector β not under test to be strongly identified. Thus, only the subvector α01 (say) of α0=(α

Monte Carlo study

This section focuses on tests for the null hypothesis H0:θ=0 of (3.1) against the alternative hypothesis H1:θ0; see Section 3. In particular, it reports on a comprehensive Monte Carlo study that assesses the finite sample size and power characteristics of the new tests suggested in Section 3 whose asymptotic behaviour is detailed in Theorem 3.1, Theorem 3.2, Theorem 3.3 above. For comparative purposes we include a number of additional tests that have been shown to perform favourably in the

Concluding remarks

The main purpose of this paper is the recommendation of various tests appropriate for inference in a weakly identified time series environment.

A comprehensive Monte Carlo study compares the tests proposed here with a number of other candidate tests that the literature has suggested possess satisfactory finite sample size and power properties. On the basis of these experiments, in terms of results for both size and power for nonrandom choices of bandwidth Sn and partition number s, the GEL test

References (62)

  • T.W. Anderson et al.

    Estimators of the parameters of a single equation in a complete set of stochastic equations

    Annals of Mathematical Statistics

    (1949)
  • D.W.K. Andrews

    Chi-square diagnostic tests for econometric models: theory

    Econometrica

    (1988)
  • D.W.K. Andrews

    Heteroskedasticity and autocorrelation consistent covariance matrix estimation

    Econometrica

    (1991)
  • D.W.K. Andrews et al.

    Hybrid and size-corrected subsampling methods

    Econometrica

    (2009)
  • D.W.K. Andrews et al.

    An improved heteroskedasticity and autocorrelation consistent covariance matrix estimator

    Econometrica

    (1992)
  • D.W.K. Andrews et al.

    Optimal invariant similar tests for instrumental variables regression

    Econometrica

    (2006)
  • D.W.K. Andrews et al.

    Inference with weak instruments

  • K. Back et al.

    Implied probabilities in GMM estimators

    Econometrica

    (1993)
  • Brown, B.W., Newey, W.K., 1992. Bootstrapping for GMM. Unpublished Manuscript,...
  • B.W. Brown et al.

    Efficient semiparametric estimation of expectations

    Econometrica

    (1998)
  • B.W. Brown et al.

    Generalized method of moments, efficient bootstrapping, and improved inference

    Journal of Business and Economic Statistics

    (2002)
  • M. Caner

    Exponential tilting with weak instruments: estimation and testing

    Oxford Bulletin of Economics and Statistics

    (2010)
  • J.C. Chao et al.

    Consistent estimation with a large number of weak instruments

    Econometrica

    (2005)
  • Chaudhuri, S., Renault, E., 2010. Finite-sample improvements of score tests by the use of implied probabilities from...
  • J.G. Cragg et al.

    On the asymptotic properties of ldu-based tests of the rank of a matrix

    Journal of the American Statistical Association

    (1996)
  • J.-M. Dufour

    Some impossibility theorems in econometrics with applications to structural and dynamic models

    Econometrica

    (1997)
  • J.-M. Dufour

    Identification, weak instruments and statistical inference in econometrics. Presidential Address to the Canadian Economics Association

    Canadian Journal of Economics

    (2003)
  • J.-M. Dufour et al.

    Projection–based statistical inference in linear structural models with possibly weak instruments

    Econometrica

    (2005)
  • Guggenberger, P., 2003. Econometric essays on generalized empirical likelihood, long–memory time series, and...
  • P. Guggenberger et al.

    Generalized empirical likelihood estimators and tests under partial, weak and strong identification

    Econometric Theory

    (2005)
  • L.P. Hansen

    Large sample properties of generalized method of moments estimators

    Econometrica

    (1982)
  • Cited by (14)

    • Generic results for establishing the asymptotic size of confidence sets and tests

      2020, Journal of Econometrics
      Citation Excerpt :

      This arises in tests of stochastic dominance and CS’s based on conditional moment inequalities, e.g., see Andrews and Shi (2013, 2014) and Linton et al. (2010). Selected references in the literature regarding uniformity issues in the models discussed above include the following: for unit roots, Bobkowski (1983), Cavanagh (1985), Chan and Wei (1987), Phillips (1987), Stock (1991), Park (2002), Giraitis and Phillips (2006), Phillips and Magdalinos (2007), and Andrews and Guggenberger (2012); for weak identification due to weak IV’s, Staiger and Stock (1997), Stock and Wright (2000), Moreira (2003), Kleibergen (2005), Guggenberger et al. (2012a), Guggenberger et al. (2012b, 2019); for weak identification in other models, Andrews and Cheng (2012a, 2013, 2014), Qu (2014), Andrews and Mikusheva (2015, 2016), Cox (2016), and Han and McCloskey (2019); for parameters near a boundary, Chernoff (1954), Self and Liang (1987), Shapiro (1989), Geyer (1994), Andrews (1999, 2001, 2002), Andrews and Guggenberger (2010b), and McCloskey (2017); and for post-model selection inference, Kabaila (1995), Leeb and Pötscher (2005), Leeb (2006), Andrews and Guggenberger (2009a, b) and McCloskey (2017). This paper is organized as follows.

    View all citing articles on Scopus

    We are grateful for the helpful and constructive comments on and criticisms of previous versions of the paper by the Editors, a Co-Editor, two referees and S. Chaudhuri. Earlier versions of this paper were presented at the Tinbergen Institute, Amsterdam, the CIREQ Conference on Generalized Method of Moments, Université de Montréal, 2007, the Econometric Society European Meetings, Milan, 2008, the Conference in Honour of M.R. Wickens, University of York, 2008, the University of Hong Kong and the New Zealand Econometric Study Group Conference 2011, University of Otago, Dunedin.

    View full text