Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

The bi-factor structure of the 17-item Hamilton Depression Rating Scale in persistent major depression; dimensional measurement of outcome

  • Neil Nixon,

    Roles Data curation, Resources, Validation, Writing – review & editing

    Affiliations Division of Psychiatry and Applied Psychology, School of Medicine, University of Nottingham, Nottingham, United Kingdom, Adult Mental Health Directorate, Nottinghamshire Healthcare Trust, Nottingham, United Kingdom

  • Boliang Guo,

    Roles Conceptualization, Formal analysis, Methodology, Writing – original draft

    Affiliations Division of Psychiatry and Applied Psychology, School of Medicine, University of Nottingham, Nottingham, United Kingdom, ARC EM, School of Medicine, University of Nottingham, Nottingham, United Kingdom

  • Anne Garland,

    Roles Data curation, Funding acquisition, Writing – review & editing

    Affiliation Adult Mental Health Directorate, Nottinghamshire Healthcare Trust, Nottingham, United Kingdom

  • Catherine Kaylor-Hughes,

    Roles Data curation, Project administration, Writing – review & editing

    Affiliations Division of Psychiatry and Applied Psychology, School of Medicine, University of Nottingham, Nottingham, United Kingdom, ARC EM, School of Medicine, University of Nottingham, Nottingham, United Kingdom

  • Elena Nixon,

    Roles Investigation, Project administration, Writing – review & editing

    Affiliations Division of Psychiatry and Applied Psychology, School of Medicine, University of Nottingham, Nottingham, United Kingdom, ARC EM, School of Medicine, University of Nottingham, Nottingham, United Kingdom

  • Richard Morriss

    Roles Conceptualization, Funding acquisition, Resources, Supervision, Writing – review & editing

    mczrkm@exmail.nottingham.ac.uk

    Affiliations Division of Psychiatry and Applied Psychology, School of Medicine, University of Nottingham, Nottingham, United Kingdom, ARC EM, School of Medicine, University of Nottingham, Nottingham, United Kingdom

Abstract

Background

The 17-item Hamilton Depression Rating Scale (HDRS17) is used world-wide as an observer-rated measure of depression in randomised controlled trials (RCTs) despite continued uncertainty regarding its factor structure. This study investigated the dimensionality of HDRS17 for patients undergoing treatment in UK mental health settings with moderate to severe persistent major depressive disorder (PMDD).

Methods

Exploratory Structural Equational Modelling (ESEM) was performed to examine the HDRS17 factor structure for adult PMDD patients with HDRS17 score ≥16. Participants (n = 187) were drawn from a multicentre RCT conducted in UK community mental health settings evaluating the outcomes of a depression service comprising CBT and psychopharmacology within a collaborative care model, against treatment as usual (TAU). The construct stability across a 12-month follow-up was examined through a measurement equivalence/invariance (ME/I) procedure via ESEM.

Results

ESEM showed HDRS17 had a bi-factor structure for PMDD patients (baseline mean (sd) HDRS17 22.6 (5.2); 87% PMDD >1 year) with an overall depression factor and two group factors: vegetative-worry and retardation-agitation, further complicated by negative item loading. This bi-factor structure was stable over 12 months follow up. Analysis of the HDRS6 showed it had a unidimensional structure, with positive item loading also stable over 12 months.

Conclusions

In this cohort of moderate-severe PMDD the HDRS17 had a bi-factor structure stable across 12 months with negative item loading on domain specific factors, indicating that it may be more appropriate to multidimensional assessment of settled clinical states, with shorter unidimensional subscales such as the HDRS6 used as measures of change.

Introduction

The 17-item Hamilton Depression Rating Scale (HDRS17), which was developed in late 1950s has been the most frequently used observer-rated measure of depression for research including randomised controlled trials (RCTs) of treatments for depression [13]. The positive and negative features of the HDRS17 have been comprehensively reviewed [46]. One of its most serious problems, poor inter-rater and test-retest reliability has been addressed with the development of the GRID- HDRS17 version [5]. Overall despite its flaws, it continues to be recommended by licensing and treatment guideline bodies such as the Federal Drug Administration in the US [7] and the National Institute for Care Excellence [8] because of its longitudinal continuity for historical comparison in more than 1500 randomised controlled trials, widespread use for meta-analysis and the lack of a superior measure despite many attempts and considerable resources including the National Institute of Mental Health and the World Health Organisation [46].

However, concerns persist about the widespread use of the of the HDRS17 as a unidimensional measure of depression severity, given indications that it has a more complex factor structure that is not fully captured by a single, total score [913]. Evidence supporting a multidimensional structure has been reviewed by Fried et al [9] and demonstrated across different methodologies, including hierarchical confirmatory factor analysis (CFA) showing a general 2nd order depression factor [13] and exploratory factor analysis in ‘treatment naïve’ mainly non-persistent depression [14, 15]. Fried et al [9], went further to show that this multifactorial structure became more pronounced as depression severity increased, indicating the potential importance of assessing HDRS17 structure in clearly defined clinical groups identified by levels of persistence and severity.

In fact, since Hamilton’s original factor analysis [2], relatively little work has been done on the HDRS17 factor structure in patient groups with more severe, persistent major depressive disorder (PMDD) under treatment in mental health settings. Findings have instead emerged from a variety of other clinical settings [11] and populations, including people whose primary health problem was not depression [12]; and since the nature and complexity of depression has been shown to vary widely across these populations, including the degree of persistence, melancholia, anxiety and other associated co-morbidity [1618], it follows from Fried et al [9] that these findings cannot be assumed to give a true impression of how the HDRS17 functions within more severe PMDD. Additionally, methods of statistical analysis have changed over the 60 years since Hamilton’s original work and earlier reports often lacked the more robust analytical approach now available for establishing factor structure through Exploratory Structural Equational Modelling (ESEM) [19].

Current modelling via ESEM also allows assessment of the related issue of measurement invariance, assessed through the consistency of construct measurement across time. Whilst Fried et al [9] showed this was generally poor for a range of depression measures including the HDRS17, more recent work using ESEM has shown invariance over 12 months for a patient completed outcome measure in more severe PMDD (the 9-item Personal Health Questionnnaire; PHQ-9) [15, 20]. However, there has been no equivalent assessment of clnician outcome measures, such as the HDRS17 in this PMDD popultation.

Previous statistical approaches assessing the dimensionality of the HDRS17 have included both exploratory factor analysis (EFA) and CFA [13]. However, recent literature has shown that both EFA and CFA have methodological limitations [21, 22]. In EFA modelling it is impossible to incorporate latent EFA factors into subsequent analyses and it is not easy to test measure invariance across groups and/or times [22]. In CFA modelling, each item is strictly allowed to load on one factor and all non-target loadings are constrained to zero. The latest factor analytical ESEM, integrates the best features of both EFA and CFA together by applying EFA rigorously to specify more appropriately the underlying factor structure together with the advanced statistical methods typically associated with CFAs [22]. ESEM allows cross item factor loadings which are coherent with the underlying theory and/or item contents that item(s) could cross load on different latent factors; ESEM could reduce the bias in parameter estimates due to zero loading restriction which generally results in inflated CFA factor correlation because items might not be perfect factor indicators with some degree of irrelevant association with other constructs [2224].

Following on from these findings, in order to explore whether the HDRS17 has a general depression factor and additional domain specific factors a bi-factor model exploring psychometric multidimensionality within ESEM, is now recommended, rather than the traditional second order factor analytical model [23, 25, 26]. Compared with hierarchical factor analysis model (Fig 1), bi-factor models have statistical advantages such as fitting data better and allowing external prediction by group factors with or without overall factors [27]. Conceptually, as group factors in a bi-factor model are not subsumed by the overall factor [28], they represent factors explaining items variances which were not accounted for by the overall factor [27]. Therefore the group specific factors have influence over and above the general factor that might help explain the clinical heterogeneity observed among individual patients with depression [29], providing valuable clarity for future research and practice.

thumbnail
Fig 1. Schematic example of 2nd order factor and bi-factor model: G = general factor, F = group factor.

https://doi.org/10.1371/journal.pone.0241370.g001

There is therefore an opportunity and need to re-assess the factor structure and measurement invariance of the HDRS17 in more severe PMDD, made more pressing by the fact that treatment guidelines, including those currently in preparation [30], continue to use the HDRS17 as a single total score across a range of depression severity and persistence.

We address this issue here via ESEM bi-factor modelling in a well-defined patient population with moderate to severe PMDD, recruited from UK mental health care settings in a previously published RCT [18], assessing construct stability across 12-month follow-up using a measurement equivalence/invariance (ME/I) procedure. The chosen 12-month period is clinically relevant through the extended clinical treatment often necessary in patients with PMDD.

Materials and methods

Patients and instruments

Patients (N = 187) were drawn from a multicentre pragmatic randomised controlled trial (RCT) evaluating outcomes of a Special Depression Service (SDS; specialist pharmacotherapy and psychotherapy within a collaborative care model) against treatment as usual (TAU) within UK mental health services [18]. At the time of recruitment participants were all adults receiving community treatment for persistent depression in one of three UK mental health centres (Nottingham, Derby and Cambridge). Ethics approval was obtained from the Trent Research Ethics Service in Derby, England. Approval number 09/H0405/42. Oral and written informed consent was obtained from each participant.

Participants were eligible for the study if they were: thought by the referrer to have primary unipolar depression; aged 18 years or over; able and willing to give oral and written informed consent to participate in the study; had been offered or received direct and continuous care from one or more health professionals in the preceding 6 months and currently be under the care of a secondary care mental health team; had a diagnosis of major depressive disorder with a current major depressive episode according to the structured clinical interview for DSM-IV (SCID) [31]; met five of nine NICE criteria for symptoms of moderate depression; had a score of ≥16 on the 17-item GRID version of the Hamilton Depression Rating Scale (HDRS17) [5]; and had a Global Assessment of Functioning (GAF) [32] score ≤ 60. Referrals were excluded if they: were in receipt of emergency care for suicide risk; were at risk of severe neglect, or posed a homicide risk, unless that risk was adequately contained in their current care setting; were not fluent English speakers; were pregnant; had unipolar depression secondary to a primary psychiatric or medical disorder, except when bipolar disorder was identified by the research team after referral with unipolar depression because an SDS would be expected to manage bipolar depression in clinical practice (n = 8, 4.3%).

The mean age of patients was 46.8 years (sd 11.4) and 61.1% (114 of total 187) were female. Following randomisation 93 (49.7%) patients were allocated to the SDS treatment arm and 94 (50.3%) to treatment as usual (TAU). In the treatment arm, participants received specialist pharmacological and cognitive behaviour therapy within a collaborative care model structured and planned over 12 months. TAU comprised multidisciplinary, community-based care delivered by general mental health services. The primary clinical outcome measure in this trial was the HDRS17 assessed at baseline, 6 and 12 month follow up time points [33]. One hundred and sixty-three (87%) participants entering the RCT suffered depression for more than 1 year with the median (interquartile range) duration of the current episode of 6.5 (2.6–16.0) years. The mean (sd) severity of the HDRS17 at baseline was 22.6 (5.2) years. Melancholia was present in 105 (56.1%) participants and 146 (78.1%) also had a comorbid anxiety disorder. The study design, data collection procedures, treatment offered and trial results can be found from the published protocol [33] and trial report [18].

The HDRS17 evaluates depression severity through items on: 1) depressed mood, 2) guilt, 3) suicidal thought or action, 4) insomnia initial, 5) insomnia middle, 6) insomnia late, 7) work and interests (assessing pleasure and functioning), 8) motor retardation, 9) motor agitation, 10) psychic anxiety, 11) somatic anxiety, 12) appetite, 13) tiredness, 14) sexual interest, 15) hypochondriasis, 16) weight loss, 17) insight. Among these 17 items, 9 items are scored on a 5-point scale (0–4) and 8 items on 3-point scale (0–2) with higher scores indicating greater depressive severity for all items. In keeping with current practice, the total item score was used to quantify the severity of depression and treatment effect estimates in the RCT [2].

Statistics

We first examined the frequency of patients’ response on each HDRS17 item across three time points (baseline, 6 and 12 months). ESEM was then used to explore the factor structure of the HDRS17 [22]. With reference to existing evidence on the factor structure of the HDRS17, we tested separately one to five first order factors and also bi-factor models with two-three domain specific factors for data measured at each time point. Data measured at each time point were stored in wide format for ESEM modelling with alike items measured at adjacent time correlated to take into account the non-independence of data due to the nature of longitudinal design [34]. Ordinal item score was analysed with the WLSMV estimator using Delta parameterization; missing values were automatically accounted for using the full-information maximum likelihood approach built into Mplus [35, 36]. Measurement invariance across all follow-up time points for the best fitted factor structure was further tested using ESEM by comparing configural invariance model and scalar invariance (item factor loading and item threshold invariance) model fittings [9, 34, 37]. All ESEM models were performed using software Mplus 8 and in keeping with standard practice correlation between item residuals was set as 0 [37].

Several fitting indices along with chi-square (χ2) test were used to judge model fit as χ2 tests are sensitive to large sample sizes and non-normal data [38]. The criterion are both comparative fit index (CFI) and the non-normed fit index (NNFI) > 0.90, Root Mean Square Error of Approximation (RMSEA) < 0.08 [39]. The factor loading and item-factor mapping pattern were additionally examined by two senior psychiatrists (RM, NN) to make the factor structure clinically plausible and meaningful. Model comparisons were evaluated by reference to the χ2 change test using Mplus DIFFTEST function to conduct χ2 difference tests, as the WLSMV estimator was used to analyse ordinal items scores [37]. Since the χ2 change tests are influenced by sample size and data non-normality [34, 40, 41], the CFI change is independent of both model complexity and sample size and it is not correlated with the overall fit measurements. A reduction of 0.01 or more in CFI suggests the null hypothesis of no difference should be rejected [41]. We therefore mainly judged model improvement on the CFI change [34, 41] A number of specific modelling details are presented alongside the results.

Results

Frequency of item response

The frequency of each item by arm across measurement time are presented as an appendix. There is an extreme response pattern for the item “insight loss”, for which all but one response was recorded as 0 across measurement time. This extreme response on item “insight loss” would result in it being excluded from all ESEM modelling due to 0 variability. Hence all ESEM models in this study were performed using 16 items.

HDRS17 factor structure

Model fitting indices of structure included one to five first order factors and bi-factor models with two or three domain specific factors for measures at each time (Table 1). Although the model fitting increased with an increased number of latent factors, the items-factors association mapping showed that the bi-factor model with two domain specific factors (bi-2factor) had the most meaningful factor structure in term of model fitting and item-factor mapping pattern. A similar pattern was shown when all models in Table 1 were rerun with alike item loading set to be equal across measurement time (Table 2). The item-factor association mapping also showed that a bi-factor model with two domain specific factors (Table 4) had the most meaningful factor structure (Table 3). By examining the factor loading pattern shown in Table 3, it was suggested HDRS17 measured a general depression factor for patients with moderate-severe PMDD, which comprised all items except “motor retardation” together with a vegetative-worry factor comprising positively loading items “insomnia” (early, middle and late), ‘weight loss”, “appetite loss” and negative loading items “psychic anxiety” and “hypochondriasis”; and a retardation-agitation factor comprising positive loading items “motor retardation”, “depressed mood”, diminished pleasure (“work and interests”), “suicidal thoughts” and negative loading for “agitation”. Item factor loadings for all models shown in Table 2 are presented as supplementary material (appendix).

thumbnail
Table 1. Modelling fitting indices for model with different 1st order and bi-factor structures.

https://doi.org/10.1371/journal.pone.0241370.t001

thumbnail
Table 2. Modelling fitting indices for various models with equal loading across measurement time.

https://doi.org/10.1371/journal.pone.0241370.t002

thumbnail
Table 4. Fitting indices of ME/I across measurement time.

https://doi.org/10.1371/journal.pone.0241370.t004

Stability of factor structure across measure time

The fitting indices of ME/I test models for configural and scalar invariance across measurement time are presented for comparison in Table 4, indicating that the scalar invariant model should be retained as the CFI drop is 0.001 with χ2 increase at 147.674 (df = 119), p = 0.038. These results evidence that the bi-2factor structure is stable through follow up from baseline to 6 and 12 months.

In view of this stable but complex bi-2factor structure, including negative item loadings on both domain specific factors, we conducted a further post-hoc analysis of the most commonly used HDRS subscale, the HDRS6 in the same cohort to investigate its potential as an alternative change measure to the full HDRS17 in moderate-severe PMDD [42]. The HDRS6 comprises 6 items: depressed mood, work and interests (pleasure), general somatic (tiredness), psychic anxiety, guilt feelings and psychomotor retardation; and since it was not plausible to perform an exploratory analysis testing a model with 1 to 3 factors on a 6-item scale, we instead used a one factor model to test its unidimensional factor structure. Results given in Tables 5 and 6 show that all 6 items of the HDRS6 subscale loaded positively and significantly, with time invariance; supporting this as a stable, unidimensional outcome measure in moderate-severe PMDD, in contrast to the 17-item scale.

thumbnail
Table 6. Fitting indices of ME/I across measurement time, HDRS6 subscale.

https://doi.org/10.1371/journal.pone.0241370.t006

Discussion

In light of findings that the HDRS17 is not a unidimensional measure of depression [9, 14, 43, 44], that the factor structure may differ between clinical populations [9] and may not be stable over time, we aimed to assess the HDRS17 in a well-defined group of patients with moderate to severe PMDD, using contemporary ESEM modelling. Consistent with much of this earlier work, our results in moderate-severe PMDD showed that the HDRS17 had a bi-factor, rather than unidimensional structure. We additionally showed that this structure was time-invariant through the full 12-month period of study. The bi-factor structure comprised a general depression factor and two domain specific factors, which we refer to as ‘vegetative-worry’ and ‘retardation-agitation’. The bi-factor structure was further complicated by the two domain specific factors including both positively and negatively loading items, problematising use of the HDRS17 as an outcome measure in moderate to severe PMDD–even allowing for multiple domain scoring within a bi-factor structure, we are left with the problem of incorporating domain factor items with opposite directionality. This problem was previously encountered within development of the 6-item subscale (HDRS6) where agitation was excluded due to reciprocal interaction with the other items [45]; and opposite directionality cannot be surprising when applying the GRID-HDRS17 to severe PMDD, when severe retardation is described by as ‘all movements very slowed’ and severe agitation as ‘cannot sit still…pacing’ [5].

Given these findings on the complex multidimensional structure of the HDRS17 in moderate-severe PMDD and the associated question of its legitimacy as an outcome measure for this patient group, we ran a further post-hoc analysis of the most commonly used 6-item subscale to test its dimensionality and potential as an alternative measure of change to the 17-item scale [42]. The HDRS6 subscale was derived through item analysis of the HDRS17 against global assessment of depression by experienced psychiatrists and it has already demonstrated a unidimensional structure in some clinical populations [43, 45]. Our results confirm this unidimensionality in moderate-severe PMDD, additionally showing time-invariance over 12 months; supporting use of the HDRS6 as an appropriate outcome measure in this group. In contrast our findings on the HDRS17 do not support its use in this way.

What then for the 17-item scale? Firstly, it seems likely that this was initially conceived as a state measure, rather than a measure of change [2]. It’s more complex structure, including concepts now understood as near polar opposites (e.g. agitation and retardation as operationalised in the GRID-HDRS17) may still be more relevant to the assessment of settled clinical states, where the domain factors we have identified may further clarify depression type, acting as predictor variables to assist development of treatment strategies [45]. A patient loading high on worry (psychic anxiety, hypochondriasis) rather than vegetative disturbance (sleep, appetite, weight), may for example benefit from more targeted initial clinical interventions reflecting this delineated state rather than non-specific depression treatments [46]. The HDRS17 might then be repeated later on for this individual, not as a measure of change, but to re-conceptualise a later settled state (such as a limited but stable treatment response) in order to develop next-step treatment strategies–in this model outcome change would be assessed through more parsimonious, evidence-based item-sets, such as the HDRS6.

This approach seems in keeping with the initial history of the Hamilton scale. An awareness of the multidimensionality of the HDRS17 dates back 60 years to Hamilton’s original work, also based in observations on patients suffering severe depression within mental health treatment; identifying four hierarchical factors (“general”, “endogenous”, “anxious” and “insomnia”) that show parallels with the bi-factor model derived here; including a main “general depression” factor, a retarded-depressed factor, a broadly vegetative factor and a separate factor including psychic anxiety [2]. Subsequent use of the HDRS17 to report a single, total item score risks missing the potential richness and purpose of this scale; confirmed again by the ESEM structure presented here. Similarly, use of the HDRS17 to measure change seems both unintended and unsupported by the growing evidence base.

The strengths of our study include a well characterised sample; the systematic application of a standardised interview version of the HDRS17; the multicentre design; and assessment over three-time intervals across 12 months with adequate retention. The systematic application of both psychiatric and psychological treatment over this time period in one group versus usual care provided both a test of the robustness of the factor structure of the HDRS17 and data from a broad group within UK mental health service care. Analysis included the first use of the most advanced ESEM modelling which allows cross factor loading and bi-factor modelling to simultaneously explore the overall latent factor and specific sub-factors for PMDD patients HDRS17 measures, incorporating the ME/I test of invariance [22, 23, 40].

Our findings on the HDRS17 and HDRS6 are however limited to a single UK cohort of patients with moderate-severe PMDD. Given previous findings that factor structure may change with clinical characteristics of depression, such as severity [9], our findings do not presume that the same structure holds for other populations with less persistent, complex or severe depression. This caution fits with recognised features of PMDD, such as rumination/worry [47] and high comorbidity (e.g. 78.1% of the current cohort had a separate anxiety disorder), which may not be present in less severe, less persistent depression. Equally, psychomotor disturbance (through agitation or retardation) identified within our PMDD cohort may be much less prevalent in patients recruited from primary care or other general medical settings [1, 17]. It seems quite plausible in this regard that a different factor structure may emerge in these different clinical groups and whilst our preliminary findings in PMDD remain important, they cannot be assumed to generalise. Rather, important differences between clinical groups may be reflected in real changes to the underlying factor structure of the measurement tool, accounting for some observed differences between this and earlier studies conducted in predominately non-persistent depression [14]. Other important limitations include: the lack of a specific power calculation for the purposes of the current analysis [18, 33], though its size was sufficient to perform factor analysis modelling based in previous work on the methodology used here [48]; and the 40 per cent attrition over 12 months follow up, though again this left a sufficient sample for invariance analysis.

Data from the current study could be meta-analysed in future with other studies with similar designs and analysis methods to provide more robust results on the HDRS17 factor structure in patients with moderate-severe PMDD.

Conclusions

These preliminary findings in patients with moderate-severe PMDD indicate the HDRS17 has a bi-factorial structure characterised by a general depression factor with two additional factors, ‘vegetative-worry’ and ‘retardation-agitation’. This conceptual structure was found to be relatively stable across a 12-month follow up period but negative item loading on the HDRS17 domain specific factors does not support its use as an outcome measure in this clinical population. Instead, the HDRS17 may be more appropriate in the multidimensional assessment of settled clinical states, helping to guide targeted interventions; with shorter unidimensional subscales such as the HDRS6 used as measures of change.

Acknowledgments

We wish to acknowledge the contribution of all the staff who obtained the data and participants in the original randomised controlled trial from which this analysis was conducted.

References

  1. 1. Rush AJ. STAR*D: what have we learned? Am J Psychiatry. 2007;164:201–4. pmid:17267779
  2. 2. Hamilton M. A rating scale for depression. Journal of Neurology, Neurosurgery & Psychiatry. 1960;23(1):56–62. pmid:14399272
  3. 3. Salagre E, Fernandes BS, Dodd S, Brownstein DJ, Berk M. Statins for the treatment of depression: A meta-analysis of randomized, double-blind, placebo-controlled trials. Journal of Affective Disorders. 2016;200(Supplement C):235–42. https://doi.org/10.1016/j.jad.2016.04.047.
  4. 4. Zimmerman M, Posternak MA, Chelminski I. Is It time to replace the Hamilton Depression Rating Scale as the primary outcome measure in treatment studies of depression? Journal of Clinical Psychopharmacology. 2005;25(2):105–10. 00004714-200504000-00001. pmid:15738740
  5. 5. Williams JBW, Kobak KA, Bech P, Engelhardt N, Evans K, Lipsitz J, et al. The GRID-HAMD: standardization of the Hamilton Depression Rating Scale. International Clinical Psychopharmacology. 2008;23(3):120–9. 00004850-200805000-00002. pmid:18408526
  6. 6. Vaccarino AL, Evans KR, Kalali AH, Kennedy SH, Engelhardt N, Frey BN, et al. The Depression Inventory Development Workgroup: A Collaborative, Empirically Driven Initiative to Develop a New Assessment Tool for Major Depressive Disorder. Innovations in Clinical Neuroscience. 2016;13(9–10):20–31. PMC5141593. pmid:27974997
  7. 7. Guidance for industry: Patient-reported outcome measures: use in medical product development to support labeling claims, (2009).
  8. 8. NICE. National Clinical Practice Guideline 90: The Treatment and Management of Depression in Adults (updated version). In: Excellence NIfHC, editor. The Royal College of Psychiatrists, London E1 8AA: The British Psychological Society & The Royal College of Psychiatrists; 2010.
  9. 9. Fried EI, van Borkulo CD, Epskamp S, Schoevers RA, Tuerlinckx F, Borsboom D. Measuring depression over time… Or not? Lack of unidimensionality and longitudinal measurement invariance in four common rating scales of depression. Psychological Assessment. 2016;28(11):1354–67. pmid:26821198
  10. 10. Bech P, Csillag C, Hellström L, Fleck MPdA. The time has come to stop rotations for the identification of structures in the Hamilton Depression Scale (HAM-D17). Revista Brasileira de Psiquiatria. 2013;35:360–3. pmid:24402210
  11. 11. Olden M, Rosenfeld B, Pessin H, Breitbart W. Measuring depression at the end of life. Assessment. 2009;16(1):43–54. pmid:18676960.
  12. 12. Moritz S, Meier B, Hand I, Schick M, Jahn H. Dimensional structure of the Hamilton Depression Rating Scale in patients with obsessive–compulsive disorder. Psychiatry Research. 2004;125(2):171–80. pmid:15006440
  13. 13. Cole JC, Motivala SJ, Dang J, Lucko A, Lang N, Levin MJ, et al. Structural validation of the Hamilton Depression Rating Scale. Journal of Psychopathology and Behavioral Assessment. 2004;26(4):241–54.
  14. 14. Dunlop BW, Cole SP, Nemeroff CB, Mayberg HS, Craighead WE. Differential change on depressive symptom factors with antidepressant medication and cognitive behavior therapy for major depressive disorder. Journal of affective disorders. 2018;229:111–9. Epub 2017/12/27. pmid:29306690.
  15. 15. Guo B, Kaylor-Hughes C, Garland A, Nixon N, Sweeney T, Simpson S, et al. Factor structure and longitudinal measurement invariance of PHQ-9 for specialist mental health care patients with persistent major depressive disorder: Exploratory Structural Equation Modelling. Journal of affective disorders. 2017;219:1–8. pmid:28501679
  16. 16. Rush AJ, Wisniewski SR, Zisook S, Fava M, Sung SC, Haley CL, et al. Is prior course of illness relevant to acute or longer-term outcomes in depressed out-patients? A STAR*D report. Psychological Medicine. 2012;.42(6):1131–49. 2012-11782-002. pmid:22008447
  17. 17. Bennabi D, Aouizerate B, El-Hage W, Doumy O, Moliere F, Courtet P, et al. Risk factors for treatment resistance in unipolar depression: A systematic review. Journal of Affective Disorders. 2015;171(0):137–41. pmid:25305428
  18. 18. Morriss R, Garland A, Nixon N, Guo B, James M, Kaylor-Hughes C, et al. Efficacy and cost-effectiveness of a specialist depression service versus usual specialist mental health care to manage persistent depression: a randomised controlled trial. The Lancet Psychiatry. 2016;3(9):821–31. pmid:27498098
  19. 19. Arens AK, Morin AJS. Improved representation of the self-perception profile for children through bifactor Exploratory Structural Equation Modeling. American Educational Research Journal. 2017;54(1):59–87.
  20. 20. Kroenke K, Spitzer RL, Williams JBW. The PHQ-9: the validity of a brief depression severity measure. Journal of General Internal Medicine. 2001;16:606–13. pmid:11556941
  21. 21. Asparouhov T, Muthén B. Exploratory Structural Equation Modeling. Structural Equation Modeling: A Multidisciplinary Journal. 2009;16(3):397–438.
  22. 22. Marsh HW, Morin AJS, Parker PD, Kaur G. Exploratory Structural Equation Modeling: An integration of the best features of exploratory and confirmatory factor analysis. Annual Review of Clinical Psychology. 2014;10(1):85–110. pmid:24313568.
  23. 23. Morin AJS, Arens AK, Marsh HW. A Bifactor Exploratory Structural Equation Modeling framework for the identification of distinct sources of construct-relevant psychometric multidimensionality. Structural Equation Modeling: A Multidisciplinary Journal. 2016;23(1):116–39.
  24. 24. Guay F, Morin AJS, Litalien D, Valois P, Vallerand RJ. Application of Exploratory Structural Equation Modeling to evaluate the Academic Motivation Scale. The Journal of Experimental Education. 2015;83(1):51–82.
  25. 25. Morin AJS, Arens AK, Tran A, Caci H. Exploring sources of construct-relevant multidimensionality in psychiatric measurement: A tutorial and illustration using the Composite Scale of Morningness. International Journal of Methods in Psychiatric Research. 2016;25(4):277–88. pmid:26265387
  26. 26. Chen FF, West SG, Sousa K. A comparison of bifactor and second-order models of quality of life. Multivariate Behavioral Research. 2006;41:189–255. pmid:26782910
  27. 27. Reise SP, Moore TM, Haviland MG. Bifactor models and rotations: Exploring the extent to which multidimensional data yield univocal scale scores. Journal of Personality Assessment. 2010;92(6):544–59. pmid:20954056
  28. 28. Patrick CJ, Hicks BM, Nichol PE, Krueger RF. A Bi-factor approach to modelling the structure of the psychopathy checklist-revised. Journal of Personality Disorders. 2007;21(2):118–41. PMC2242629. pmid:17492917
  29. 29. Simms LJ, Grös DF, Watson D, O'Hara MW. Parsing the general and specific components of depression and anxiety with bifactor modeling. Depression and Anxiety. 2008;25(7):E34–E46. pmid:18027844
  30. 30. NICE. Depression in adults: treatment and management (update). In: NICE, editor. online: National Guideline Alliance; 2020.
  31. 31. First MB, Gibbon M. User's Guide for the Structured Clinical Interview for DSM-IV Axis II Personality Disorders: SCID-II American Psychiatric Pub; 1997.
  32. 32. American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders. 4th ed. Washington, D.C.: American Psychiatric Association; 1994.
  33. 33. Morriss R, Marttunnen S, Garland A, Nixon N, McDonald R, Sweeney T, et al. Randomised controlled trial of the clinical and cost effectiveness of a specialist team for managing refractory unipolar depressive disorder. BMC Psychiatry. 2010;10(1):100. pmid:21114826
  34. 34. Vandenberg RJ, Lance CE. A review and synthesis of the measurement invariance literature: suggestions, practices, and recommendations for organizational research. Organizational Research Methods. 2000;3:4–70.
  35. 35. Graham JW. Adding missing-data relevant variables to FIML-based structural equation models. Structural Equation Modeling: A Multidisciplinary Journal. 2003;10:80–100.
  36. 36. Enders CK, Bandalos DL. The relative performance of Full Information Maximum Likelihood estimation for missing data in structural equation models. Structural Equation Modeling: A Multidisciplinary Journal. 2001;8(3):430–57.
  37. 37. Muthén BO, Muthén LK. Mplus User's Guide. Los Angeles, CA 90066: Muthén & Muthén; 2017 2017.
  38. 38. Wen Z, Hau KT, Marsh HW. Structural equation model testing: Cutoff criteria for goodness of fit indices and chi-square test. Acta Psychologica Sinica. 2004;36(2):186–94.
  39. 39. Marsh HW, Lüdtke O, Muthén B, Asparouhov T, Morin AJS, Trautwein U, et al. A new look at the big five factor structure through exploratory structural equation modeling. Psychological Assessment. 2010;22(3):471–91. pmid:20822261
  40. 40. Marsh HW, Muthén B, Asparouhov T, Lüdtke O, Robitzsch A, Morin AJS, et al. Exploratory Structural Equation Modeling, integrating CFA and EFA: Application to students' evaluations of university teaching. Structural Equation Modeling: A Multidisciplinary Journal. 2009;16(3):439–76.
  41. 41. Cheung GW, Rensvold RB. Evaluating goodness-of-fit indexes for testing measurement invariance. Structural Equation Modeling: A Multidisciplinary Journal. 2002;9:233–55.
  42. 42. Bech P, Allerup P, Gram LF, Reisby N, Rosenberg R, Jacobsen O, et al. The Hamilton Depression Scale: Evaluation of objectivity using logistic models. Acta Psychiatrica Scandinavica. 1981;63(3):290–9. pmid:7015793
  43. 43. Licht RW, Qvitzau S, Allerup P, Bech P. Validation of the Bech-Rafaelsen Melancholia Scale and the Hamilton Depression Scale in patients with major depression: is the total score a valid measure of illness severity? Acta Psychiatrica Scandinavica. 2005;111(2):144–9. pmid:15667434.
  44. 44. Gibbons RD, Clark DC, Kupfer DJ. Exactly what does the Hamilton Depression Rating Scale measure? Journal of Psychiatric Research. 1993;27(3):259–73. Epub 1993/07/01. pmid:8295158.
  45. 45. Bech P. Rating scales in depression: Limitations and pitfalls. Dialogues in clinical neuroscience. 2006;8(2):207–15. pmid:16889106.
  46. 46. Coplan JD, Aaronson CJ, Panthangi V, Kim Y. Treating comorbid anxiety and depression: Psychosocial and pharmacological approaches. World journal of psychiatry. 2015;5(4):366–78. pmid:26740928.
  47. 47. Wiersma JE, van Oppen P, van Schaik DJ, van der Does AJ, Beekman AT, Penninx BW. Psychological characteristics of chronic depression: a longitudinal cohort study. J Clin Psychiatry. 2011;72(3):288–94. pmid:21450151.
  48. 48. Mundfrom DJ, Shaw DG, Ke TL. Minimum sample size recommendations for conducting factor analyses. International Journal of Testing. 2005;5(2):159–68.