Skip to main content
Log in

Is the Magic Still There? The Use of the Heckman Two-Step Correction for Selection Bias in Criminology

  • Original Paper
  • Published:
Journal of Quantitative Criminology Aims and scope Submit manuscript

Abstract

Issues of selection bias pervade criminological research. Despite their ubiquity, considerable confusion surrounds various approaches for addressing sample selection. The most common approach for dealing with selection bias in criminology remains Heckman’s [(1976) Ann Econ Social Measure 5:475–492] two-step correction. This technique has often been misapplied in criminological research. This paper highlights some common problems with its application, including its use with dichotomous dependent variables, difficulties with calculating the hazard rate, misestimated standard error estimates, and collinearity between the correction term and other regressors in the substantive model of interest. We also discuss the fundamental importance of exclusion restrictions, or theoretically determined variables that affect selection but not the substantive problem of interest. Standard statistical software can readily address some of these common errors, but the real problem with selection bias is substantive, not technical. Any correction for selection bias requires that the researcher understand the source and magnitude of the bias. To illustrate this, we apply a diagnostic technique by Stolzenberg and Relles [(1997) Am Sociol Rev 62:494–507] to help develop intuition about selection bias in the context of criminal sentencing research. Our investigation suggests that while Heckman’s two-step correction can be an appropriate technique for addressing this bias, it is not a magic solution to the problem. Thoughtful consideration is therefore needed before employing this common but overused technique.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Notes

  1. These topics range from those concerned with sample attrition or non-response (e.g. Gondolf 2000; Maxwell et al. 2002; Robertson et al. 2002; Worrall 2002) to studies of racial profiling in police stops (Lundman and Kaufman 2003), to research examining the effects of race on criminal justice outcomes (Klepper et al. 1983; Wooldredge 1998).

  2. We identified articles by electronically searching for papers in Criminology that cite the seminal article by Berk (1983) and/or included the word Heckman. This process resulted in 25 articles that use the Heckman in some shape or form (7 other studies cite Heckman or Berk but were not directly relevant).

  3. Part of the reason for the limited application of Heckman in the treatment equation approach is the availability of a number of other approaches for dealing with selection, including experiments, instrumental variable estimation and panel models. For useful in-depth discussions of alternative approaches to the treatment effects model, see Halaby (2004), Angrist (2001), and Heckman et al. (1999).

  4. It is important to distinguish between the terms conditional and marginal. A slope coefficient is an estimate of the marginal impact of x on y. We can usually estimate two different terms—the marginal effect of x in the conditional and unconditional models. As Greene (1993) makes clear, the estimate of interest will vary by case.

  5. An additional distinction can be made between the simple Two Part Model (TPM) and the censored two stage model (CTSM). Whereas the substantive regression in the TPM is based on the sub-sample of selected cases, for the CTSM, it includes all observations (including zeros for unobserved values). The TPM provides the conditional estimate given selection into the sub-sample of the population whereas the CTSM provides the unconditional estimate based on all observations. This distinction can be useful in certain applications, but it is often confusing because the CTSM requires that censored observations have meaningful 0 values (e.g. sentence lengths of 0 months of incarceration). This distinction is additionally muddled by the common convention of using the TPM to obtain estimates that ostensibly represent the unconditional population of interest (Puhani 2000). We constrain our discussion to the TPM given its focus in the literature, but further comparison of these two estimators offers an additional line of potentially interesting future research.

  6. While it is possible to estimate the TPM with alternative specifications, such as the logit instead of the probit, it is important to recognize that the estimator then assumes a different error structure (i.e. errors are distributed log normal).

  7. This distinction is relevant to the extent that slightly different modeling procedures are appropriate for the two types of samples. Truncated regression models are more appropriate for situations with truncated samples whereas limited-dependent-variable models or Tobit models are generally preferred for censored samples (see Fig. 3).

  8. The Tobit model also has some limitations. These limitations include restrictive normality assumptions regarding the dependent variable (Chay and Powell 2001; Chesher and Irish 1987) restrictive homoskeasticity assumptions regarding error terms (Wooldridge 2005), and the key assumption that the effects of independent variables are constant for the selection process and the outcome of interest (Smith and Brame 2003). See Osgood et al. (2002) for a recent overview of Tobit models.

  9. There is also a large statistical literature that attempts to loosen the assumption of bivariate normality by estimating the first stage semi-parametrically, see especially Ahn and Powell (1993) and Blundell and Powell (2001) as well as Vella (1999) and Newey (1999).

  10. STATA 8.2 provides for both the FIML and Heckman two-step estimators, while LIMDEP 7.0 also provides for a third maximum likelihood estimator of the Heckman two-step, sometimes called the Limited Information Maximum Likelihood (LIML).

  11. We are unaware of any case in which the FIML (or the related LIML model) has been used in criminology. This is somewhat ironic because Heckman originally recommended using the two-step approach only to generate starting values for the FIML estimator (Heckman 1976).

  12. Considerable confusion surrounds the predicted probability, inverse Mills ratio, and what is often referred to as the “hazard rate”. The probability is simply the predicted value from the probit equation. To calculate the inverse Mills ratio, this value is multiplied by negative one and inserted into (Eq. 5). This inverse Mills ratio, then, represents the hazard rate, or the instantaneous probability of exclusion for each observation conditional on being at risk (see Berk 1983).

  13. Dubin and Rivers (1990) discuss alternative formulations of both a two-step and maximum likelihood adaptation of Heckman’s model to the case of dichotomous outcomes, but we are unaware of any criminological application of these procedures.

  14. As one example, Nobiling et al. (1998, p. 470) describe this convention as follows: “We used logistic regression to estimate the likelihood that the offender would be sentenced to prison. For each case, the logistic regression produced its predicted probability of exclusion from the sentence length model—the hazard rate.” An anonymous referee suggested this approach represents an altogether different estimator based on propensity scoring, and should therefore not be confused with the Heckman approach. Given that each study reviewed cited Heckman (1976, 1979) or Berk (1983) and no study made explicit reference to the literature on propensity scoring, we feel justified in associating this approach with the Heckman correction. It is possible that there is statistical merit in such an approach, but we leave it to other researchers to verify it in this context.

  15. In the treatment effects version of the Heckman estimator, there are actually two different formulas for the inverse Mills ratio depending on whether an individual case is selected. Failure to correctly specify the inverse Mills ratio can bias one’s findings in the face of selection. Spohn and Holleran (2002) is an example of an application of the treatment model where the probability was substituted for the inverse Mills ratio. Correspondence with the authors indicated that the coefficient on the inverse Mills ratio was not significant when the model was re-estimated correctly, suggesting an absence of selection bias in their study. We thank Cassia Spohn and David Holleran for their willingness to engage in a constructive discussion regarding their paper.

  16. The majority of studies in Table 1 use logit rather than probit, a feature not available in the standard software programs. Also, it is unlikely that any paper that describes using the probability rather than the IMR used standard software.

  17. Desirable estimators have three properties—they are unbiased, efficient and consistent. An unbiased estimate means the difference between the true parameter and the expected value of the estimator of the parameter is 0. This has to do with how well our point estimate represents the true population value of interest. Efficiency has to do with the standard error of the estimate. More efficient estimates have smaller standard errors, meaning any one estimate is more likely to approximate the true population value. Finally, consistent estimates are estimates that become unbiased as N approaches infinity. While the TPM estimates are biased, they will be more efficient, whereas the Heckman estimates will be unbiased and consistent, but less efficient. Which estimate better captures the true parameter of interest depends on the degree of bias and inefficiency that exists in each.

  18. This list includes articles outside of our Criminology sample. This list was compiled using a search of articles that cite Stolzenberg and Relles.

  19. Exclusion restrictions are very similar to instrumental variables (Angrist 2001). In each case, we try to identify factors that affect one variable but do not affect a second variable. The ultimate exclusion restriction would be experimental or random assignment. Clearly, if people are randomly assigned to prison, then there will be no correlation between the error terms of the prison and sentence length equations, and there will be no selection bias. In the absence of experiments, we need to find quasi-experiments or elements of the selection process that are uncorrelated with the substantive decision of interest.

  20. One possible exclusion restriction for sentencing research is strength of evidence. Early studies suggest that this factor affects the likelihood of conviction, but not the severity of the sentence post-conviction (Albonetti 1991).

  21. To be more comprehensive, we attempted to identify additional papers outside the journal Criminology that used the Heckman with exclusion restrictions. We did find four other examples of models using the Heckman correction and incorporating exclusion restrictions (Albonetti 1991; Maxwell et al. 2002; Robertson et al. 2002; Worrall 2002), but none of the articles we reviewed defended or discussed their exclusion restrictions in any detail.

  22. In Table 1, we counted those articles which arbitrarily excluded variables that were highly correlated with the correction factors in order to estimate the model as not having exclusion restrictions.

  23. There is a substantial literature on alternative estimation frameworks, including non-parametric frameworks, which avoid some of the problems associated with the restrictive parametric assumptions required by the bivariate normal models (Ahn and Powell 1993, Chay and Lee 2001). While we do not deny the potential usefulness of these alternative frameworks, we have chosen to remain within a framework known to most criminologists.

  24. The assumption of a two-stage decision making process in criminal sentencing largely grew out of early research on white collar offenders in federal districts that argued “the first and hardest decision the judge makes is whether the person will go to prison or not,” which is “experienced as qualitatively different from the decision as to how long an inmate should stay in prison” (Wheeler et al. 1982, pp. 642, 652). This assumption has been challenged recently and may be less valid for cases sentenced in states with presumptive sentencing guidelines (Bushway and Piehl 2001).

  25. The STATA do file used to estimate these coefficients is available from the authors by request.

  26. Statistically, the condition number is the ratio of the maximum Eigenvalue to the minimum Eigenvalue for the matrix of independent variables in an analysis (see Belsley et al. 1980, pp. 100–104, for a technical treatment of the condition number). It can be calculated in STATA 8.2 with the collin() function and is also provided in standard output for collinearity diagnostics in SPSS and other statistical software packages.

  27. This is commonly understood by criminologists in another context. The logit and probit models will provide very similar effect size estimates when the probability of the dependent variable being a success is not close to 1 or 0. However, as the proportion of successes approaches 1 or 0, which requires extreme values of Z, the two models can provide very different results because of the different parametric assumptions.

  28. A similar problem is endemic in comparisons of coefficients across different models affected by selection. In sentencing, key modeling decisions, such as the decision to examine incarceration and sentence length as two distinct outcomes, rest on the finding that coefficients exert different effects on the two outcomes. However, if the error terms in the two models are correlated, the coefficients for sentence length will be biased. This will lead to flawed comparisons if a simple TPM is utilized or if the Heckman is implemented incorrectly. This observation may also apply to the current debate about whether jail and prison can be treated as the same outcome (Holleran and Spohn 2004). To the extent that selection bias exists in the coefficients for jail and prison sentences (i.e. non-random selection processes determine conviction and therefore offender placement), any comparison of these effects across models without good controls for selection must be conducted with caution.

References

  • Ahn H, Powell J (1993) Semiparametric estimation of censored selection models with a nonparametric selection mechanism. J Econom 58:3–29

    Article  Google Scholar 

  • Albonetti C (1986) Criminality, prosecutorial screening, and uncertainty: toward a theory of discretionary decision making in felony case processing. Criminology 24:623–644

    Article  Google Scholar 

  • Albonetti C (1991) An integration of theories to explain judicial discretion. Soc Probl 38:247–266

    Google Scholar 

  • Albonetti C (1997) Sentencing under the federal sentencing guidelines: an analysis of the effects of defendant characteristics, guilty pleas, and departures on sentencing outcomes for drug offenses. Law Soc Rev 31:601–634

    Article  Google Scholar 

  • Amemiya T (1985) Advanced econometrics. Harvard University Press, Cambridge, MA

    Google Scholar 

  • Angrist J (2001) Estimation of limited dependent variable models with dummy endogenous regressors: simple strategies for empirical practice. J Bus Econ Stat 19:2–16

    Article  Google Scholar 

  • Belsley David, Kuh E, Welsch R (1980) Regression diagnostics: identifying influential data and sources of collinearity. John Wiley & Sons, New York

    Google Scholar 

  • Berk R (1983) An introduction to sample selection bias in sociological data. Am Soc Rev 48:386–398

    Article  Google Scholar 

  • Berk R, Ray S (1982) Selection bias in sociological data. Soc Sci Res 11:352–398

    Article  Google Scholar 

  • Blundell R, Powell J (2001) Endogeneity in nonparametric and semiparametric regression models. CEMMAP Working Paper No CWP09/01

  • Bontrager S, Bales W, Chiricos T (2005) Race, ethnicity, threat and the labeling of convicted felons. Criminology 43:589–622

    Article  Google Scholar 

  • Bushway S, Piehl A (2001) Judging judicial discretion: legal factors and racial discrimination in sentencing. Law Soc Rev 35:733–767

    Article  Google Scholar 

  • Chay KY, Powell JL (2001) Semiparametric censored regression models. J Econ Persp 15:29–42

    Article  Google Scholar 

  • Chesher A, Irish M (1987) Residual analysis in the grouped and censored normal linear model. J Econom 34:33–61

    Article  Google Scholar 

  • Chiricos T, Bales W (1991) Unemployment and punishment: an empirical assessment. Criminology 29:701–724

    Article  Google Scholar 

  • D’Alessio S, Stolzenberg L (2003) Race and the probability of arrest. Soc Forces 84:1381–1398

    Article  Google Scholar 

  • Demuth S (2003) Racial and ethnic differences in pretrial release decisions and outcomes: a comparison of Hispanic, black, and white felony arrestees. Criminology 41:873–908

    Article  Google Scholar 

  • Duan N, Manning WG, Morris CN, Newhouse JP (1983) A comparison of alternative models for the demand for medical care. J Econ Bus Stat 1:115–126

    Article  Google Scholar 

  • Duan N, Manning WG, Morris CN, Newhouse JP (1984) Choosing between the sample-selection model and the multi-part model. J Bus Econ Stat 2:283–289

    Article  Google Scholar 

  • Dubin JA, Rivers D (1990) Selection bias in linear regression, logit and probit models. In: Fox J, Long JS (eds) Modern methods of data analysis. Sage Publications, Thousand Oaks, CA, pp 411–443

    Google Scholar 

  • Engen RL, Gainey RR (2000) Modeling the effects of legally relevant and extralegal factors under sentencing guidelines: the rules have changed. Criminology 38:1207–1228

    Article  Google Scholar 

  • Erez E, Tontodonato P (1990) The effect of victim participation in sentencing on sentencing outcome. Criminology 28:451–474

    Article  Google Scholar 

  • Felson RB (1996) Big people hit little people: sex differences in physical power and interpersonal violence. Criminology 34:433–453

    Article  Google Scholar 

  • Felson RB, Messner S (1996) To kill or not to kill: lethal outcomes in injurious attacks. Criminology 36:519–545

    Article  Google Scholar 

  • Gondolf EW (2000) A 30-month follow-up of court-referred batterers in four cities. Int J Offender Ther Comp Criminol 44:111–128

    Article  Google Scholar 

  • Greene WH (1993) Econometric analysis, 2nd edn. Macmillan, New York

    Google Scholar 

  • Hagan J, Palloni A (1986) ‘Club Fed’ and the sentencing of white-collar offenders before and after Watergate. Criminology 24:603–622

    Article  Google Scholar 

  • Hagan J, Parker P (1985) White-collar crime and punishment: the class structure and legal sanctioning of security violations. Am Sociol Rev 50:302–316

    Article  Google Scholar 

  • Halaby C (2004) Panel models in sociological research: theory into practice. Ann Rev Sociol 30

  • Heckman JJ (1976) The common structure of statistical models of truncation, sample selection and limited dependent variables. Ann Econ Soc Measure 5:475–492

    Google Scholar 

  • Heckman JJ (1979) Sample selection bias as a specification error. Econometrica 47:153–161

    Article  Google Scholar 

  • Heckman JJ, LaLonde R, Smith J. (1999) The economics and econometrics of active labor market programs. In: Ashenfelter O, Card D (eds) Handbook of Labor Economics, vol 3A. North-Holland, Amsterdam, pp 1865–2097

    Google Scholar 

  • Holleran D, Spohn C (2004) On the use of the total incarceration variable in sentencing research. Criminology 42(1):211–240

    Article  Google Scholar 

  • Johnson BD (2006) The Multilevel context of criminal sentencing: integrating judge and county level influences in the study of courtroom decision making. Criminology 44(2):259–298

    Article  Google Scholar 

  • Kautt PM (2002) Location, location, location: interdistrict and intercircuit variation in sentencing outcomes for federal drug-trafficking offenses. Justice Q 19:633–671

    Article  Google Scholar 

  • Keil TJ, Vito GF (1989) Race, homicide severity, and application of the death penalty: a consideration of the Barnett scale. Criminology 27:511–536

    Article  Google Scholar 

  • Kingsnorth R, Lopez J, Wentworth J (1998) Adult sexual assault: the role of racial/ethnic composition in prosecution and sentencing. J Crim Justice 26:359–371

    Article  Google Scholar 

  • Kingsnorth R, MacIntosh R, Sutherland S (2002) Criminal charge or probation violation: Prosecutorial discretion and implications for research. Criminology 40:553–578

    Article  Google Scholar 

  • Kingsnorth R, MacIntosh R, Wentworth J (1999) Sexual assault: the role of prior relationship and victim characteristics in case processing. Justice Q 16:275–302

    Article  Google Scholar 

  • Klepper S, Nagin D, Tierney LJ (1983) Discrimination in the criminal justice system: a critical appraisal of the literature. In: Blumstein A, Cohen J, Martin S, Tonry M (eds) Research on sentencing: the search for reform, vol II. National Academy Press, Washington, DC

    Google Scholar 

  • Kurlychek M, Johnson BD (2004) The juvenile penalty: a comparison of juvenile and young adult sentencing outcomes. Criminology 42:485–517

    Article  Google Scholar 

  • Leiber M, Mack K (2003) The individual and joint effects of race, gender, and family status on juvenile justice decision making. J Res Crime Delinq 40:34–71

    Article  Google Scholar 

  • Leiber M, Jamieson K (1995) Race and decision making within juvenile justice: the importance of context. J Quantitative Criminol 11:363

    Google Scholar 

  • Leung SF, Yu S (1996) On the choice between sample selection and two-part models. J Econometrics 72:197–229

    Article  Google Scholar 

  • Leung SF, Yu S (2000) Collinearity and two-step estimation of sample selection models: problems, origins, and remedies. Comput Econ 15:173–199

    Article  Google Scholar 

  • Loeffler, Charles (2006) Using inter-judge sentencing disparity to estimate the effect of imprisonment on criminal recidivism. Harvard University Working Paper

  • Lundman R, Kaufman R (2003) Driving while black: effects of race, ethnicity, and gender on citizen self-reports of traffic stops and police actions. Criminology 41:195–220

    Article  Google Scholar 

  • MacMillan R (2000) Adolescent victimization and income deficits in adulthood: rethinking the costs of criminal violence from a life-course perspective. Criminology 38:553–587

    Article  Google Scholar 

  • Maddala GS (1985) A survey of the literature on selectivity bias as it pertains to health care markets. Adv Health Econ Health Serv Res 6:3–18

    Google Scholar 

  • Maxwell C, Garner J, Fagan J (2002) The preventative effects of arrest on intimate partner violence. Criminol Publ Policy 2:51–80

    Article  Google Scholar 

  • McCarthy B, Smith BL (1986) The conceptualization of discrimination in the juvenile justice process: the impact of administrative factors and screening decisions on juvenile court dispositions. Criminology 24:41–64

    Article  Google Scholar 

  • Messner SF, Deane G, Beaulieu M (2002) A log-multiplicative association model for allocating homicides with unknown victim–offender relationships. Criminology 40:457–479

    Article  Google Scholar 

  • Messner SF, Deane GD, Anselin L, Pearson-Nelson B (2005) Locating the vanguard in rising and falling homicide rates across US cities. Criminology 43(3):661–696

    Article  Google Scholar 

  • Miethe TD, Moore CA (1985) Socioeconomic disparities under determinant sentencing systems: a comparison of preguideline and postguideline practices in Minnesota. Criminology 23(2):337–363

    Article  Google Scholar 

  • Moffitt R (1999) Econometric methods for labor market analysis. In: Ashenfelter O, Card D (eds) Handbook of labor economics, vol 3A. North-Holland, Amsterdam

    Google Scholar 

  • Myers MA (1988) Social background and the sentencing behavior of judges. Criminology 26:649–675

    Article  Google Scholar 

  • Myers MA, Talarico SM (1986) Urban justice, rural injustice? Urbanization and its effects on sentencing. Criminology 24:367–391

    Article  Google Scholar 

  • Newey WK (1999) Consistency of two step sample selection estimators despite misspecifications of distribution. Econ Lett 63:129–132

    Article  Google Scholar 

  • Nobiling T, Spohn C, Delone M (1998) A tale of two counties: unemployment and sentence severity. Justice Q 15:459–485

    Article  Google Scholar 

  • Osgood DW, Finken L, McMorris B (2002) Analyzing multiple item measures of crime and deviance II: Tobit regression analysis of transformed scores. J Quantitative Criminol 18:859–866

    Google Scholar 

  • Owens, EG (2006) More time, less crime? The incapacitative effect of sentence enhancements. University of Maryland working paper

  • Peterson RD, Hagan J (1984) Changing conceptions of race: toward an account of anomalous findings of sentencing research. Am Sociol Rev 49:56–70

    Article  Google Scholar 

  • Puhani PA (2000) The Heckman correction for sample selection and its critique. J Econ Surveys 14:53–68

    Article  Google Scholar 

  • Reitz K (1998) Modeling discretion in American sentencing systems. Law & Policy 20:389–428

    Article  Google Scholar 

  • Rhodes W (1991) Federal criminal sentencing: some measurement issues with application to pre-guideline sentencing disparity. J Criminal Law Criminol 81:1002

    Article  Google Scholar 

  • Robertson AA, Grimes PW, Rogers KE (2002) A short-run cost-benefit analysis of community based interventions for juvenile offenders. Crime Delinq 47:265–284

    Article  Google Scholar 

  • Smith DA (1986) The plea bargaining controversy. J Criminal Law Criminol 77:949–967

    Article  Google Scholar 

  • Smith DA, Brame R (2003) Tobit models in social science research: some limitations and a more general alternative. Sociol Methods Res 31:364–388

    Article  Google Scholar 

  • Sorenson J, Wallace DH (1999) Prosecutorial discretion in seeking death: an analysis of racial disparity in the pretrial stages of case processing in a Midwestern county. Justice Q 16:559–578

    Article  Google Scholar 

  • Spohn C, Holleran D (2002) The effect of imprisonment on recidivism rates of felony offenders: a focus on drug offenders. Criminology 40:329–358

    Article  Google Scholar 

  • Spohn C, Horney J (1996) The impact of rape law reform on the processing of simple and aggravated rape cases. J Criminal Law Criminol 86:861–886

    Article  Google Scholar 

  • Stata Corp. (2003) Stata Statistical Software Release 8.0. Reference Manual G-M. Stata Corporation, College Station, TX, p 71

    Google Scholar 

  • Steen S, Engen RL, Gainey RR (2005) Images of danger and culpability: racial stereotyping, case processing, and criminal sentences. Criminology 43:435–467

    Article  Google Scholar 

  • Steffensmeier D, Kramer JH, Streifel C (1993) Gender and imprisonment decisions. Criminology 31:411–446

    Article  Google Scholar 

  • Steffensmeier D, Ulmer JT, Kramer JH (1998) The interaction of race, gender, and age in criminal sentencing: the punishment cost of being young, black, and male. Criminology 36:763–798

    Article  Google Scholar 

  • Steffensmeier D, DeMuth S (2001) Ethnicity and judges’ sentencing decisions: Hispanic-Black-White comparisons. Criminology 39:145–176

    Article  Google Scholar 

  • Stolzenberg RM, Relles DA (1990) Theory testing in a world of constrained research design: the significance of Heckman’s censored sampling bias correction for nonexperimental research. Sociol Methods Res 18:395–415

    Article  Google Scholar 

  • Stolzenberg RM, Relles DA (1997) Tools for intuition about sample selection bias and its correction. Am Sociol Rev 62:494–507

    Article  Google Scholar 

  • Stolzenberg L, D’Alessio SJ, Eitle D (2004) A multilevel test of racial threat theory. Criminology 42:673–698

    Article  Google Scholar 

  • Tibbets SG, Piquero AR (1999) The influence of gender, low birth weight, and disadvantaged environment in predicting early onset of offending: a test of Moffitt’s interactional hypothesis. Criminology 37:843–878

    Article  Google Scholar 

  • Tobin J (1958) Estimation of relationships for limited dependent-variables. Econometrica 26:24–36

    Article  Google Scholar 

  • Ulmer JT (2000) The rules have changed—so proceed with caution: a comment on Engen and Gainey’s methods for modeling sentencing outcomes under guidelines. Criminology 38:1231–1243

    Article  Google Scholar 

  • Ulmer JT, Johnson BD (2004) Sentencing in context: a multilevel analysis. Criminology 42:137–178

    Article  Google Scholar 

  • Ulmer JT, Kramer JH (1996) Court communities under sentencing guidelines: dilemmas of formal rationality and sentencing disparity. Criminology 34:383–408

    Article  Google Scholar 

  • Vella F (1999) Estimating models with sample selection bias: a survey. J Hum Resour 33:127–169

    Article  Google Scholar 

  • Wheeler S, Weisburd D, Bode N (1982) Sentencing the white collar offender: rhetoric and reality. Am Sociol Rev 47:641–659

    Article  Google Scholar 

  • Winship C, Mare RD (1992) Models for selection bias. Ann Rev Sociol 18:327–350

    Article  Google Scholar 

  • Wooldredge JD (1998) Analytic rigor in studies of disparities in criminal case processing. J Quantitative Criminol 14:155–179

    Article  Google Scholar 

  • Wooldridge J (2005) Introductory economics: a modern approach. South Western College Publishing

  • Wooldredge JD, Thistlewaite A (2004) Bilevel disparities in court dispositions for intimate assault. Criminology 42:417–456

    Article  Google Scholar 

  • Worrall JL (2002) If you build it, they will come: consequences of improved citizen complaint review procedures. Crime Delinq 48:355–379

    Article  Google Scholar 

  • Wright JP, Cullen FT (2000) Juvenile involvement in occupational delinquency. Criminology 38:863–892

    Article  Google Scholar 

  • Zatz MS (2000) The convergence of race, ethnicity, gender, and class on court decision making: looking toward the 21st century. Policies, processes, and decisions of the criminal justice system, vol. 3. US Department of Justice, Washington, DC, pp 503–552

  • Zatz MS, Hagan J (1985) Crime, time, and punishment: an exploration of selection bias in sentencing research. J Quantitative Criminol 1:103–126

    Article  Google Scholar 

Download references

Acknowledgements

The authors would like to thank Justin McCrary, Wayne Osgood, Anne Piehl, Jeff Smith and seminar participants at the Economics and Crime summer workshop for helpful comments. We would also like to thank Gary Sweeten for research assistance and useful discussions. Most of the research for the paper was completed while Bushway was a professor at the University of Maryland. The paper is dedicated to the memory of our friend, mentor, and former colleague Douglas A. Smith. Doug helped us to understand the relevant selection issues, and encouraged us to pursue the paper in its formative stages. He is sorely missed.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Brian D. Johnson.

Appendix

Appendix

Table 4

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bushway, S., Johnson, B.D. & Slocum, L.A. Is the Magic Still There? The Use of the Heckman Two-Step Correction for Selection Bias in Criminology. J Quant Criminol 23, 151–178 (2007). https://doi.org/10.1007/s10940-007-9024-4

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10940-007-9024-4

Keywords

Navigation