The validity and responsiveness of the ICECAP-A capability-well-being measure in women with irritative lower urinary tract symptoms

Goranitis, Ilias; Coast, Joanna; Al-Janabi, Hareth; Latthe, Pallavi; Roberts, Tracy E.

doi:10.1007/s11136-015-1225-y

Open access
Published: 11 January 2016

Volume 25, pages 2063–2075 (2016)
Quality of Life Research

Abstract

Purpose

A desire to incorporate broader aspects of well-being in health economic evaluations has led to the development of the ICEpop CAPability measure for Adults (ICECAP-A). The ICECAP-A draws upon Amartya Sen’s capability approach and conceptualises well-being as the capability to achieve Stability, Attachment, Autonomy, Achievement, and Enjoyment. The aim of this study was to assess the psychometric performance of the ICECAP-A in a context where patient outcomes can extend beyond health-related quality of life.

Methods

Longitudinal data were collected for 478 women with symptoms of urinary frequency and urgency, with or without incontinence. Women were recruited across 22 hospitals in the UK and had a mean age of 55 (SD 14). The psychometric performance of the measure was evaluated in relation to the EuroQol Five-Dimension Questionnaire (EQ-5D-3L) and the International Consultation on Incontinence Questionnaire for Overactive Bladder (ICIQ-OAB) and involved an assessment of acceptability, construct validity, and responsiveness using parametric and nonparametric methods.

Results

ICECAP-A showed good convergence with the ICIQ-OAB with 20 out of 22 expected patterns of relationship confirmed. Findings suggested that the ICECAP-A has better discriminative properties than EQ-5D-3L and as good as those of the ICIQ-OAB, confirming expected associations with clinical and demographic factors. The ICECAP-A was more responsive than EQ-5D-3L and ICIQ-OAB to deteriorations of clinical symptoms. Improvements in symptoms were not valued as highly as deteriorations by either ICECAP-A or EQ-5D-3L.

Conclusions

The ICECAP-A is a valid and responsive measure capturing broad emotional and practical impacts of urinary symptoms on women’s well-being and could be considered for use in economic evaluations in this context.

Introduction

Consideration of health-related quality of life (HrQoL) is an integral component of healthcare decision-making in many systems of the developed world. HrQoL, however, may offer limited scope when interventions result in wider personal well-being gains [1–4] or in external effects on groups other than the patient [5, 6]. One appropriate framework for conceptualising these broader well-being impacts for health policy purposes is the capability approach [7, 8]. The capability approach was developed by Amartya Sen as a basis for assessing well-being in terms of what people do and are (functioning) and particularly, what people are able to do and be (capability) in their lives [9]. While a number of capability measures have been developed [10–14], the ICEpop CAPability (ICECAP) measures (see Notes) are distinct as they provide a generic measure of capability-well-being for use in the economic evaluation of health and social care interventions.

The ICECAP measure for the general adult population (ICECAP-A) has recently been developed [12] and conceptualises well-being as the capability of an individual to achieve the valuable functionings of Stability, Attachment, Autonomy, Achievement, and Enjoyment, with health potentially being a direct determinant of functioning. Previous validation work on the ICECAP-A has suggested that the attributes of the measure can comprehensively capture quality of life [15] and that the measure is able to identify expected differences in capability-well-being in a general population sample [16]. In terms of responsiveness, small changes in capability-well-being were evident as a result of changes in physical and psychological health after a knee pain intervention [17].

However, no evidence for the psychometric properties of the ICECAP-A exists in a clinical context where there are likely to be impacts on well-being more broadly than those captured by conventional HrQoL measures. This paper explores the acceptability, construct validity, and responsiveness of the ICECAP-A in relation to the three-level EuroQol Five-Dimension Questionnaire (EQ-5D-3L) [18] and the International Consultation on Incontinence Questionnaire for Overactive Bladder (ICIQ-OAB) [19] in women with irritative lower urinary tract symptoms (LUTS) involving urinary frequency and urgency, with or without incontinence. The impact of these symptoms on HrQoL is well established [20, 21], but broader well-being issues may arise as a result of shame, embarrassment, discomfort, and lack of confidence [22]. It is, therefore, possible that such effects may be missed by HrQoL measures, but picked up by measures of broader capability-well-being.

Methods

Data source

The paper relied on data from the largest observational study undertaken to estimate the accuracy and cost-effectiveness of bladder ultrasound scan in the diagnosis of detrusor overactivity [23]. Detrusor overactivity is the involuntary contraction of the detrusor muscle observed during the filling phase of urodynamic studies and is perceived to be one of the main causes of LUTS. The study was carried out in 22 hospitals across the UK, and women were recruited if they presented with increased frequency of urination and mild to severe urgency, with or without urinary incontinence. Exclusion criteria were pregnancy or being up to 6 weeks post-partum, stress-predominant mixed incontinence, continuous medical treatment (such as antimuscarinics) for more than 6 months, and surgical treatment or urodynamic studies for a bladder condition during the past 6 months. Women in the study had a transvaginal bladder ultrasound scan (index test) followed by urodynamic studies (reference test). Women were initially treated conservatively. All women provided written informed consent and were followed up for a year.

Outcome measures

The outcome measures used in the analysis were the ICECAP-A, EQ-5D-3L, and ICIQ-OAB. All three were administered at baseline (prior to diagnostic testing) and at 6-month follow-up; the latter two were additionally administered at the 12-month follow-up. More information about the different measures is provided below.

ICEpop CAPability measure for adults (ICECAP-A)

The ICECAP-A is a generic and preference-based measure of capability-well-being [12]. It comprises five conceptual attributes (Stability, Attachment, Autonomy, Achievement, and Enjoyment) with each having four response options that range from full capability to no capability. Individual responses to the five attributes can subsequently be translated into a capability index score using a UK population value set obtained using the best–worst scaling method [24]. The capability index scores range from 0 to 1, indicating no capability and full capability, respectively.

EuroQol Five-Dimension Questionnaire (EQ-5D-3L)

The EQ-5D-3L is a generic and preference-based measure of HrQoL [18], comprising five conceptual attributes (Mobility, Self-care, Usual activities, Pain and discomfort, and Anxiety and depression). Each attribute has three response options ranging from no problems to severe problems. Responses to the EQ-5D-3L are used to derive a health index score based on country-specific value sets, which represent general population preferences for the different health states. In this study, health index scores were calculated using the UK value set obtained based on the time trade-off method [25]. The scores range from −0.594 to 1, depending on whether severe problems or no problems are reported across the five dimensions of the instrument. On this scale, the values of 0 and 1 represent death and full health, respectively, while values lower than 0 represent health states considered to be worse than death.
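The scoring rule described above can be sketched as follows. This is a minimal illustration, not the study's Stata code: the decrements are those commonly reported for the UK time trade-off tariff and should be treated as an assumption to be checked against the published value set [25]; the dimension keys and the function name `eq5d3l_index` are hypothetical.

```python
# Decrements commonly reported for the UK time trade-off tariff.
# Assumption: verify every number against the published value set [25].
CONSTANT = 0.081  # subtracted once if any dimension is above level 1
N3 = 0.269        # subtracted once if any dimension is at level 3
DECREMENTS = {
    "mobility": {1: 0.0, 2: 0.069, 3: 0.314},
    "self_care": {1: 0.0, 2: 0.104, 3: 0.214},
    "usual_activities": {1: 0.0, 2: 0.036, 3: 0.094},
    "pain_discomfort": {1: 0.0, 2: 0.123, 3: 0.386},
    "anxiety_depression": {1: 0.0, 2: 0.071, 3: 0.236},
}


def eq5d3l_index(levels):
    """Return the index score for a dict mapping each dimension to 1, 2 or 3."""
    if set(levels) != set(DECREMENTS):
        raise ValueError("expected exactly the five EQ-5D-3L dimensions")
    score = 1.0
    for dimension, level in levels.items():
        score -= DECREMENTS[dimension][level]  # KeyError for an invalid level
    if any(level > 1 for level in levels.values()):
        score -= CONSTANT
    if any(level == 3 for level in levels.values()):
        score -= N3
    return round(score, 3)


full_health = {name: 1 for name in DECREMENTS}
worst_state = {name: 3 for name in DECREMENTS}
print(eq5d3l_index(full_health), eq5d3l_index(worst_state))
```

Under these assumed decrements the two extreme profiles reproduce the range quoted in the text, from 1 (no problems on any dimension) down to −0.594 (severe problems on all five).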

International Consultation on Incontinence Questionnaire for Overactive Bladder (ICIQ-OAB)

The ICIQ-OAB is a urinary incontinence-specific measure of quality of life [19]. This measure asks four questions, each having five response options. The questions relate to: (a) the frequency of urination during the day, (b) the frequency of nocturia, (c) the frequency of having to rush to the toilet for urination, and (d) the frequency of leaking before getting to the toilet. Responses to these questions are scored from 0 to 4, whereby a higher score reflects increased frequency (severity) of symptoms. A total ICIQ-OAB score is derived by adding the scores from all responses and thus can range from 0 to 16. Each of the four questions has a second part intended to measure, on an 11-point (0–10) Likert scale, the level of ‘bother’ from the different symptoms. Although responses to these second parts are not included in the scoring of the instrument, they are helpful in determining patients’ priorities for treatment or in monitoring changes over time.
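As a minimal sketch of the summation just described, the total score could be computed as below. The item names and the function `iciq_oab_total` are hypothetical, and returning no score when an item is missing is an assumption made here for illustration (the text only states that the four item scores are added).

```python
# Hypothetical labels for the four frequency questions (a) to (d).
ICIQ_OAB_ITEMS = ("daytime_frequency", "nocturia", "urgency", "urge_leakage")


def iciq_oab_total(responses):
    """Add the four item scores (each 0-4) to give a total between 0 and 16.

    Returns None when any item is missing; this complete-case handling is an
    assumption of the sketch, not a documented scoring rule.
    """
    total = 0
    for item in ICIQ_OAB_ITEMS:
        value = responses.get(item)
        if value is None:
            return None
        if value not in range(5):
            raise ValueError(f"{item} must be scored 0-4, got {value!r}")
        total += value
    return total


example = {"daytime_frequency": 2, "nocturia": 1, "urgency": 3, "urge_leakage": 0}
print(iciq_oab_total(example))
```

The 'bother' ratings are deliberately left out of the function, since they do not contribute to the total score.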

Psychometric analysis

The sample size was determined by the main study [23], which aimed to recruit at least 600 women after loss to follow-up. The psychometric properties of the ICECAP-A were assessed in relation to the EQ-5D-3L and ICIQ-OAB and involved explorations of acceptability, construct validity, and responsiveness. Analyses for this research were based upon women who responded at both baseline and 6-month follow-up, allowing for the same sample to be used in all analyses. No data imputation was performed, and all analyses were carried out in Stata version 12MP.

Acceptability

Acceptability is a term used to reflect the perceived relevance of an outcome measure to the respondents in certain clinical contexts. Generic outcome measures, such as the ICECAP-A and EQ-5D-3L, are developed for application in all clinical contexts, and, therefore, demonstrating high levels of acceptability is an important quality. The acceptability of the ICECAP-A was approximated through the completion rates at baseline and 6-month follow-up [26], with rates above 95 % indicating high levels of acceptability [27].
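The completion-rate criterion can be illustrated with a short sketch. The function names are hypothetical, missing answers are represented as `None` by assumption, and the example responses are made up rather than taken from the study.

```python
def completion_rate(responses):
    """Percentage of respondents with a non-missing answer to one attribute."""
    if not responses:
        raise ValueError("no respondents")
    answered = sum(answer is not None for answer in responses)
    return 100.0 * answered / len(responses)


def is_highly_acceptable(responses, threshold=95.0):
    """Apply the 'above 95 %' rule used in the text to one attribute."""
    return completion_rate(responses) > threshold


# Made-up answers to a single attribute; None marks a skipped question.
answers = [4, 3, None, 2]
print(completion_rate(answers), is_highly_acceptable(answers))
```

In practice the rate would be computed separately for each attribute and each time point, as in the Results.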

Validity

Construct validity relates to the degree that relationships between a measure and other factors confirm a priori expected patterns of relationship and comprises both convergent and discriminative (known group) validity [28]. Convergent validity assesses the extent of correlation between instruments intended to measure similar or overlapping constructs [28]. The convergence between the ICECAP-A, EQ-5D-3L, and ICIQ-OAB index scores was explored using Pearson’s correlation coefficients. Spearman rank correlation coefficients were used for the convergence across dimension scores and between index and dimension scores. Correlations were considered strong if the coefficient was above 0.5, moderate if the coefficient was between 0.3 and 0.5, and weak if the coefficient was below 0.3 [29]. Given that the EQ-5D-3L attributes are scored from no problems (lowest level) to severe problems (highest level), and the ICECAP-A attributes from no capability (lowest level) to full capability (highest level), the scoring of the EQ-5D-3L dimensions was reversed for the purposes of this analysis in order to allow for a more intuitive interpretation of findings.
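The thresholds and the reversal of the EQ-5D-3L levels can be sketched as below. The function names are hypothetical. Two choices are assumptions of this sketch rather than statements from the paper: the absolute value of the coefficient is classified, and the boundary values 0.3 and 0.5 are treated as moderate.

```python
def correlation_strength(coefficient):
    """Label a correlation coefficient as strong, moderate, or weak."""
    size = abs(coefficient)  # assumption: direction is ignored when labelling
    if size > 0.5:
        return "strong"
    if size >= 0.3:
        return "moderate"
    return "weak"


def reverse_eq5d_level(level):
    """Map EQ-5D-3L levels 1, 2, 3 to 3, 2, 1 so that higher means better."""
    if level not in (1, 2, 3):
        raise ValueError(f"EQ-5D-3L level must be 1, 2 or 3, got {level!r}")
    return 4 - level


print(correlation_strength(0.62), correlation_strength(-0.35), reverse_eq5d_level(3))
```

After reversal, a positive correlation between an EQ-5D-3L dimension and an ICECAP-A attribute means that fewer health problems go together with higher capability.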

Discriminative or known-group validity assesses the extent to which instruments are able to distinguish between dissimilar constructs [28], namely constructs differing in a trait likely to be associated with women’s quality of life. The constructs used in the analysis related to age, body mass index (BMI), presence of detrusor overactivity, previous urinary surgery, and presence of prolapse or voiding dysfunction. The four questions included in the ICIQ-OAB, which indicate how bothersome the frequencies of the different urinary symptoms are to women, and which are not considered as part of the scoring process of the ICIQ-OAB, were also used to construct known groups. To test whether the mean index scores of the three measures differed between known groups, a univariate analysis using one-way ANOVA and a Kruskal–Wallis H test was undertaken. To account for potential confounding problems associated with univariate analyses, a multivariate regression analysis was additionally carried out using age, BMI, past surgery, presence of detrusor overactivity, advanced prolapse, and voiding dysfunction as covariates.

Responsiveness

Given that a fundamental principle underpinning healthcare interventions is the improvement of health and well-being, it is important that instruments are also valid in a longitudinal context. In the assessment of responsiveness, the different measures are compared for patient groups expected to have experienced a change in health and well-being based on an external criterion (anchor) [26]. Three analyses were undertaken to explore the responsiveness of the ICECAP-A using different anchors of potential clinical change.

In the first analysis, changes in the scores of the three outcome measures were assessed based on changes in the mean self-reported ‘bother’ across individual urinary symptoms in the ICIQ-OAB [30]. In this analysis, responsiveness was assessed for the overall sample and for specific subgroups (those with the same, decreased, and increased level of ‘bother’). In the second analysis, changes in the scores of the ICECAP-A and EQ-5D-3L were assessed relative to changes in the actual ICIQ-OAB score and thus based on changes in the frequency of urinary symptoms. This analysis explored changes in capability and health index scores for those whose ICIQ-OAB score decreased (symptoms less frequent), increased (symptoms more frequent), or remained the same. In the third analysis, changes in the scores of the three measures were assessed based on whether women felt that symptoms were ‘improved’, ‘deteriorated’, or ‘without change’ on a retrospective transition question.

In the absence of a gold-standard measure of HrQoL and well-being, responsiveness was evaluated using the standardised response mean (SRM) effect size statistic, calculated as the ratio of the mean change between baseline and follow-up index scores to the standard deviation of the change scores [26, 31]. Alternative methods for assessing responsiveness, such as receiver operating characteristic (ROC) curve analysis, which requires a gold-standard anchor, were not explored, as none of the anchors of this study can be considered an appropriate reference standard of a valued change of clinical symptoms by the general public, which is inherent in the valuation of preference-based outcome measures. Paired t tests and Wilcoxon signed-rank tests were also carried out to identify significant changes in scores. The values 0.2, 0.5, and 0.8 were used as thresholds for small, moderate, and large SRM statistics [32]. Floor and ceiling effects were calculated as the proportion of women selecting the response options indicating the lowest (floor effect) or highest (ceiling effect) level of quality of life across all attributes of each questionnaire.
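A minimal sketch of the SRM and of the ceiling-effect calculation follows, using only the Python standard library. All function names and example values are hypothetical. Using the sample standard deviation (n − 1 denominator), labelling values below 0.2 as "trivial", and treating each threshold as inclusive are assumptions of this sketch.

```python
from statistics import mean, stdev


def standardised_response_mean(baseline, follow_up):
    """Mean of the paired change scores divided by their standard deviation."""
    if len(baseline) != len(follow_up):
        raise ValueError("baseline and follow-up must be paired")
    changes = [after - before for before, after in zip(baseline, follow_up)]
    if len(changes) < 2:
        raise ValueError("at least two paired observations are needed")
    # stdev() is the sample standard deviation (assumption); it is zero, and
    # the division fails, when every respondent changes by the same amount.
    return mean(changes) / stdev(changes)


def srm_magnitude(srm):
    """Label an SRM using the 0.2, 0.5 and 0.8 thresholds."""
    size = abs(srm)
    if size >= 0.8:
        return "large"
    if size >= 0.5:
        return "moderate"
    if size >= 0.2:
        return "small"
    return "trivial"


def ceiling_effect(profiles, best_level):
    """Percentage of respondents at the best level on every attribute."""
    at_ceiling = sum(all(level == best_level for level in p) for p in profiles)
    return 100.0 * at_ceiling / len(profiles)


# Made-up index scores for four respondents at baseline and follow-up.
srm = standardised_response_mean([0.8, 0.7, 0.9, 0.6], [0.7, 0.7, 0.8, 0.4])
print(round(srm, 2), srm_magnitude(srm))
```

A negative SRM indicates a mean deterioration in the index score; the floor effect can be obtained from `ceiling_effect` by passing the worst response level instead of the best one.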

Hypothetical constructs

Good measurement validation practices require an a priori statement of hypotheses on the expected relationship between the theoretical concepts explored [33, 34]. Therefore, hypothetical constructs were developed independently by each author in the light of available evidence and personal judgment before seeing any of the results. These are available in ‘Appendices 1 and 2’. The two overarching expectations were that the ICECAP-A would show better convergence with the condition-specific measure than the EQ-5D-3L and that the ICECAP-A would be more sensitive in identifying differences and changes in the level of ‘bother’ from urinary symptoms.

Results

The primary study recruited 687 women with lower urinary tract symptoms. Responses to at least one of the outcome measures were provided by 655 (95.3 %) women at baseline and 478 (69.6 %) at the 6-month follow-up period. The results presented in this section are based on women who responded to at least one of the outcome measures at both baseline and 6-month follow-up (n = 478). Women had a mean age of 55 (SD 14) and a mean weight of approximately 77 kg (SD 18), with 198 (41.4 %) women being classified as obese based on their BMI. Detrusor overactivity was diagnosed in 44.8 % of women; most women had no evidence of prolapse (74.2 %) and no voiding difficulties (56.4 %). A large proportion of women (73.2 %) reported high levels of ‘bother’ from urinary symptoms, and most (82.4 %) had no previous urinary surgery. More information about the sample characteristics is provided in Table 1.

Table 1 Sample characteristics (N = 478)

Acceptability

Missing data for the ICECAP-A attributes ranged between 1.3 % (Autonomy) and 1.9 % (Enjoyment) at baseline, and between 3.8 % (Achievement) and 4.6 % (Attachment) at 6-month follow-up. For the EQ-5D-3L, missing data ranged between 0.6 % (Mobility and Self-care) and 0.8 % (Pain and discomfort and Anxiety and depression) at baseline, and between 3.3 % (Self-care and Anxiety and depression) and 4 % (Pain and discomfort) at 6-month follow-up. For the ICIQ-OAB, 0–1.9 % of values were missing at baseline and 0–1.3 % at 6-month follow-up. In all instances, completion rates were greater than 95 %, indicating a high level of acceptability.

Construct validity

The convergence between the three outcome measures is given in Table 2. A strong correlation was found between the capability and health index scores, and all attributes of the EQ-5D-3L were found to have a moderate to strong correlation with the ICECAP-A index score. All correlations between the ICECAP-A and EQ-5D-3L were statistically significant at the 1 % level, apart from correlations between the ICECAP-A attribute of Attachment and the EQ-5D-3L attributes of Mobility, Usual activities, and Pain and discomfort. For the latter two, however, correlations were statistically significant at the 5 % level.

Table 2 Convergent validity between the ICECAP-A, EQ-5D-3L, and ICIQ-OAB (n = 478)

Correlations between the ICECAP-A index score and ICIQ-OAB, although being slightly higher than those between the EQ-5D-3L index score and ICIQ-OAB (apart from the case of frequency of nocturia), were of similar strength. From the 17 hypothesised associations between the ICECAP-A attributes and ICIQ-OAB (Appendix 1), only the correlations between the frequency of urination during the day and the attributes of Stability and Autonomy were not statistically significant. In addition to the hypothesised correlations, other significant correlations were found. Attachment was significantly correlated at the 5 % level with the ICIQ-OAB score and the frequency of leaking before urination. Finally, frequency of nocturia was found to have a significant correlation with Autonomy (5 % level of significance), Achievement and Enjoyment (1 % level of significance). All correlations were in the expected direction (Appendix 1).

The results on the discriminative validity of the different outcome measures are presented in Table 3. According to the a priori hypotheses (Appendix 2), the ICECAP-A was expected to be able to discriminate among the categories of BMI, detrusor overactivity, and the different variables related to self-reported levels of ‘bother’ from urinary symptoms. There were significant differences in both ICECAP-A and EQ-5D-3L scores among the categories of BMI. The presence of detrusor overactivity was significantly associated with lower levels of capability-well-being (at the 5 % level), but only in the univariate analysis. Significantly lower levels of HrQoL (at the 1 % level) were also evident for those with detrusor overactivity. Statistically significant differences in capability-well-being were evident between those with high and low levels of ‘bother’ from the different urinary symptoms, apart from the symptom of urgency. These differences were also captured by the ICIQ-OAB, but not by the EQ-5D-3L, which only identified significant differences in HrQoL (at the 5 % level) for the urinary frequency symptom, and only in the univariate analysis.

Table 3 Discriminative (known group) validity of the ICECAP-A, EQ-5D-3L, and ICIQ-OAB (n = 478)

Responsiveness

The responsiveness of the three measures for all women and by self-reported change in the level of ‘bother’ is given in Table 4. There were no floor effects evident for the three measures. There was some evidence of ceiling effect for the EQ-5D-3L, with 16 % of women at baseline and 21 % at 6-month follow-up reporting full health. Approximately 12 % of women reported full capability at the two time periods. Across the three responsiveness analyses, the ICECAP-A appeared to be more responsive than the EQ-5D-3L, but with effect sizes being trivial to small. More specifically, for women with the same and, particularly, increased level of ‘bother’, the ICECAP-A was found to be more responsive in comparison with the EQ-5D-3L and ICIQ-OAB, with effect sizes being around 0.3 (Table 4). Even when changes in the ICECAP-A score were assessed based on changes in the frequency of symptoms (Table 5) or based on women’s self-perceived change of symptoms (Table 6), the ICECAP-A was the only measure capturing statistically significant deteriorations in clinical outcomes.

Table 4 Responsiveness of the ICECAP-A, EQ-5D-3L, and ICIQ-OAB by self-reported change in symptoms’ bother

Table 5 Responsiveness of the ICECAP-A and EQ-5D-3L by change in symptoms’ frequency (i.e. ICIQ-OAB score)

Table 6 Responsiveness of the ICECAP-A, EQ-5D-3L, and ICIQ-OAB by self-perceived change of symptoms

Discussion

This paper explored the psychometric properties of the ICECAP-A in relation to the EQ-5D-3L and ICIQ-OAB in a sample of women with lower urinary tract symptoms. This was the first study assessing the construct validity of the ICECAP-A in a clinical group, and the first assessing its responsiveness in a clinical area where symptoms are likely to affect an individual’s quality of life, or well-being, in a much broader sense than conceptualised by conventional health status measures.

The results provided supporting evidence for the acceptability, construct validity, and responsiveness of the ICECAP-A in this context. The ICECAP-A showed high levels of acceptability, with completion rates being above 95 %. In terms of construct validity, a strong correlation was found between the ICECAP-A and EQ-5D-3L index scores and with the EQ-5D-3L attribute of Anxiety and depression. Out of the 36 correlations explored between the two measures, only the correlation between the attributes of Attachment and Mobility was not statistically significant, while from the remaining correlations, 33 (94.3 %) were statistically significant at the 1 % level. Similarly, out of the 22 hypothesised correlations between the ICECAP-A and ICIQ-OAB, 20 (90.9 %) appeared to be statistically significant, with 15 (75 %) of them being significant at the 1 % level.

In terms of discriminative validity, the ICECAP-A was found to have better discriminative properties than EQ-5D-3L and as good as those of the condition-specific questionnaire (ICIQ-OAB), as it was able to detect significant differences in capability-well-being, not only among the BMI categories, and according to the presence or not of detrusor overactivity, but also between the different levels of ‘bother’ from urinary symptoms. In the light of mixed evidence for the association between age and quality of life in this clinical group (see Appendix 2), no significant difference in capability-well-being was hypothesised between age groups. Even though age is expected to inhibit capability and health, this study found no significant differences in terms of health status (EQ-5D-3L) and capability-well-being (ICECAP-A) between those above and below the age of 65. These findings are in line with previous validation work on the ICECAP-A in a general population sample [16] and are potentially attributable to the fact that urinary symptoms might disproportionately affect those employed or more socially engaged, diluting the age effect. The absence of such information did not enable these covariates to be controlled for in the analysis.

The responsiveness analyses explored changes in the ICECAP-A index score in response to changes in the level of ‘bother’ and frequency as well as in response to self-perceived change of urinary symptoms. The results indicated that the ICECAP-A was more responsive to a deterioration of women’s symptoms compared with the EQ-5D-3L in all responsiveness analyses and also compared with the ICIQ-OAB when ‘bother’ and self-perceived change of symptoms were used as anchors. Thus, deteriorations in clinical outcomes appeared to be ‘valued’ more highly than improvements by the ICECAP-A, in line with previous evidence [17], even though this could be due to the baseline distribution of scores.

The study benefited from a relatively large sample size and the use of longitudinal data, which enabled a thorough assessment of both construct validity and responsiveness. In addition, given that the assumption of normality underpinning parametric tests is often violated in quality of life data, nonparametric tests were also included in the analysis. Although evidence exists in support of parametric tests even in violations of the normality assumption [35], the results obtained from the two tests were sometimes contradictory.

Nevertheless, there are a number of caveats worth highlighting in the interpretation of the study’s findings. First, in the absence of a gold-standard measure of well-being, the psychometric properties of the ICECAP-A could only be investigated against hypothetically developed constructs and imperfect anchors of clinical change. Second, the primary study was designed to test the accuracy and cost-effectiveness of a diagnostic strategy, rather than the clinical effectiveness of an intervention. Because of limitations in the primary data, it is uncertain whether there were other health or well-being impacts, such as an unrelated adverse health event, that a woman might have experienced that could have influenced the generic health or well-being measures of this study. Finally, the primary study targeted only women with symptoms of urinary urgency and frequency, with or without urinary incontinence, and thus, findings are restricted to the specific sample used. Strengths and limitations associated with the primary study, from which the data were drawn, can be found in the full Health Technology Assessment report [23].

There are potentially several reasons explaining the good psychometric performance of the ICECAP-A in this clinical group. First, the ICECAP-A comprises conceptual attributes that capture a broader evaluative space that extends beyond HrQoL to the capability to function in terms of Stability, Attachment, Autonomy, Achievement and Enjoyment. This allows for more extensive practical and emotional implications from urinary symptoms to be captured. Intuitively, it might be expected that, in this clinical group, symptoms of urgency or incontinence would be significantly correlated with the EQ-5D-3L attribute of Anxiety and depression [20, 36, 37]. However, this was not evident in this study. While the EQ-5D-3L attribute of Usual activities might capture some broader practical implications of urinary symptoms, the emotional ones appear to be largely missed. This also possibly explains why in this study the EQ-5D-3L was not able to distinguish between different levels of ‘bother’ from urinary symptoms, a finding that confirms previous validation work which found no association between symptom severity and the EQ-5D-3L index score and attributes [38].

Second, the ICECAP-A has more response options than the EQ-5D-3L, which in turn may allow for a greater degree of sensitivity and smaller floor and ceiling effects. In this study, 16 and 21 % of women reported full health at baseline and 6-month follow-up, respectively, whereas approximately 12 % of women reported full capability at the two time-points. Of course, this issue might be ameliorated with the development of the new five-level EQ-5D (EQ-5D-5L) [39]. Finally, another driver of the good performance of the ICECAP-A is the lower statistical dispersion observed in the results, which subsequently made the different statistics more favourable compared to the EQ-5D-3L, even when absolute changes were of similar or smaller magnitude. This might be a consequence of the wider scale of values generated by the EQ-5D-3L, which can range from −0.594 to 1 rather than from 0 to 1 as for the ICECAP-A. This wider scale, however, allows for larger changes to be seen, especially when interventions are aimed at those with low levels of health.

More research is required in order to establish the psychometric performance of the ICECAP-A. Comparisons with other capability measures (e.g. ASCOT [40] or OxCap-MH [11]) or other measures of HrQoL (e.g. EQ-5D-5L [39] or SF-6D [41, 42]), and in different settings are required to shed further light on its measurement properties. Given that recent recommendations for the evaluation of social care interventions, published by the National Institute for Health and Care Excellence (NICE) in the UK, suggest a parallel use of an ICECAP measure when capability benefits are relevant [43], further research is required to establish the validity and responsiveness of the ICECAP-A in different social care contexts. Finally, given the limited empirical evidence for the validity and responsiveness of the measure in the evaluation of physical health problems, further research is required to establish the sensitivity of the measure to capture differences and changes in physical health status.

In conclusion, the findings of this study have provided strong evidence for the construct validity and responsiveness of the ICECAP-A and support its use in the economic evaluation of interventions for urinary symptoms in women. Using the ICECAP-A in this context will allow for a more holistic assessment of women’s experience of urinary symptoms and treatment outcomes.

Notes

ICEPOP was a UK MRC-funded Health Services Research Collaboration programme on Investigating Choice Experiments for Preferences of Older People; it was the research programme in which the first ICECAP measure was developed.

References

Coast, J. (2014). Strategies for the economic evaluation of end-of-life care: Making a case for the capability approach. Expert Review of Pharmacoeconomics and Outcomes Research, 14(4), 473–482.
Makai, P., Brouwer, W. B., Koopmanschap, M. A., Stolk, E. A., & Nieboer, A. P. (2014). Quality of life instruments for economic evaluations in health and social care for older people: A systematic review. Social Science and Medicine, 102, 83–93.
Article PubMed Google Scholar
Chalkidou, K., Culyer, A., Naidoo, B., & Littlejohns, P. (2008). Cost-effective public health guidance: Asking questions from the decision-maker’s viewpoint. Health Economics, 17(3), 441–448.
Article PubMed Google Scholar
Chisholm, D., Healey, A., & Knapp, M. (1997). QALYs and mental health care. Social Psychiatry and Psychiatric Epidemiology, 32(2), 68–75.
Article CAS PubMed Google Scholar
Al-Janabi, H., Coast, J., & Flynn, T. N. (2008). What do people value when they provide unpaid care for an older person? A meta-ethnography with interview follow-up. Social Science and Medicine, 67(1), 111–121.
Article PubMed Google Scholar
Al-Janabi, H., Flynn, T. N., & Coast, J. (2011). QALYs and carers. Pharmacoeconomics, 29(12), 1015–1023.
Article PubMed Google Scholar
Lorgelly, P. K., Lawson, K. D., Fenwick, E. A., & Briggs, A. H. (2010). Outcome measurement in economic evaluations of public health interventions: A role for the capability approach? International Journal of Environmental Research and Public Health, 7(5), 2274–2289.
Article PubMed PubMed Central Google Scholar
Coast, J., Smith, R., & Lorgelly, P. (2008). Should the capability approach be applied in health economics? Health Economics, 17(6), 667–670.
Article PubMed Google Scholar
Sen, A. (1993). Capability and well-being. In M. Nussbaum & A. Sen (Eds.), The quality of life. Oxford: Oxford University Press.
Google Scholar
Malley, J., Towers, A.-M., Netten, A. P., Brazier, J. E., Forder, J. E., & Flynn, T. (2012). An assessment of the construct validity of the ASCOT measure of social care-related quality of life with older people. Health Qual Life Outcomes, 10(21), 1477–7525.
Simon, J., Anand, P., Gray, A., Rugkåsa, J., Yeeles, K., & Burns, T. (2013). Operationalising the capability approach for outcome measurement in mental health research. Social Science and Medicine, 98, 187–196.
Article PubMed Google Scholar
Al-Janabi, H., Flynn, T. N., & Coast, J. (2012). Development of a self-report measure of capability wellbeing for adults: The ICECAP-A. Quality of Life Research, 21(1), 167–176.
Article PubMed Google Scholar
Coast, J., Flynn, T. N., Natarajan, L., Sproston, K., Lewis, J., Louviere, J. J., & Peters, T. J. (2008). Valuing the ICECAP capability index for older people. Social Science and Medicine, 67(5), 874–882.
Article PubMed Google Scholar
Sutton, E. J., & Coast, J. (2014). Development of a supportive care measure for economic evaluation of end-of-life care using qualitative methods. Palliative Medicine, 28(2), 151–157.
Article PubMed Google Scholar
Keeley, T., Al-Janabi, H., Lorgelly, P., & Coast, J. (2013). A qualitative assessment of the content validity of the ICECAP-A and EQ-5D-5L and their appropriateness for use in health research. PloS One, 8(12), e85287.
Article PubMed PubMed Central Google Scholar
Al-Janabi, H., Peters, T. J., Brazier, J., Bryan, S., Flynn, T. N., Clemens, S., et al. (2013). An investigation of the construct validity of the ICECAP-A capability measure. Quality of Life Research, 22(7), 1831–1840.
Article PubMed Google Scholar
Keeley, T., Al-Janabi, H., Nicholls, E., Foster, N., Jowett, S., & Coast, J. (2015). A longitudinal assessment of the responsiveness of the ICECAP-A in a randomised controlled trial of a knee pain intervention. Quality of Life Research. doi:10.1007/s11136-015-0980-0.
PubMed PubMed Central Google Scholar
Brooks, R., & EuroQol Group. (1996). EuroQol: The current state of play. Health Policy, 37(1), 53–72.
Article CAS PubMed Google Scholar
Avery, K., Donovan, J., Peters, T. J., Shaw, C., Gotoh, M., & Abrams, P. (2004). ICIQ: A brief and robust measure for evaluating the symptoms and impact of urinary incontinence. Neurourology and Urodynamics, 23(4), 322–330.
Article PubMed Google Scholar
Coyne, K. S., Wein, A. J., Tubaro, A., Sexton, C. C., Thompson, C. L., Kopp, Z. S., & Aiyer, L. P. (2009). The burden of lower urinary tract symptoms: Evaluating the effect of LUTS on health-related quality of life, anxiety and depression: EpiLUTS. BJU International, 103(s3), 4–11.
Article PubMed Google Scholar
Tincello, D., Sculpher, M., Tunn, R., Quail, D., Van Der Vaart, H., Falconer, C., et al. (2010). Patient Characteristics Impacting Health State Index Scores, Measured by the EQ-5D of Females with Stress Urinary Incontinence Symptoms. Value in Health, 13(1), 112–118.
Article PubMed Google Scholar
Digesu, G. A., Khullar, V., Cardozo, L., & Salvatore, S. (2003). Overactive bladder symptoms: Do we need urodynamics? Neurourology and Urodynamics, 22(2), 105–108.
Article PubMed Google Scholar
Rachaneni, S., McCooty, S., Middleton, L., Brookes, V., Daniels, J., Coomarasamy, A., et al. (2015). Accuracy and economic evaluation of bladder ultrasound in the diagnosis of detrusor overactivity: A study to evaluate if ultrasound can reduce the need for urodynamics. NIHR Health Technology Assessment (in press).
Flynn, T. N., Huynh, E., Peters, T. J., Al-Janabi, H., Clemens, S., Moody, A., & Coast, J. (2015). Scoring the ICECAP-A capability instrument. Estimation of a UK general population tariff. Health Economics, 24, 258–269.
Article PubMed Google Scholar
Dolan, P. (1997). Modeling valuations for EuroQol health states. Medical Care, 35(11), 1095–1108.
Article CAS PubMed Google Scholar
Brazier, J., & Deverill, M. (1999). A checklist for judging preference-based measures of health related quality of life: Learning from psychometrics. Health Economics, 8(1), 41–51.
Article CAS PubMed Google Scholar
Nunnally, J. C., & Bernstein, I. (1994). Psychometric theory (3rd ed.). New York: McGraw Hill.
Google Scholar
Streiner, D. L., & Norman, G. R. (2003). Health measurement scales: A practical guide to their development and use. New York: Oxford University Press.
Google Scholar
Cohen, J. (1988). Set correlation and contingency tables. Applied Psychological Measurement, 12(4), 425–434.
Article Google Scholar
Souto, S. C., Reis, L. O., Palma, T., Palma, P., & Denardi, F. (2014). Prospective and randomized comparison of electrical stimulation of the posterior tibial nerve versus oxybutynin versus their combination for treatment of women with overactive bladder syndrome. World Journal of Urology, 32(1), 179–184.
Article CAS PubMed Google Scholar
Brazier, J., Ratcliffe, J., Salomon, J., & Tsuchiya, A. (2007). Measuring and valuing health benefits for economic evaluation. New York: Oxford University Press.
Google Scholar
Cohen, J. (1988). Statistical power analysis for the behavioral sciences. Hillsdale, NJ: Erlbaum Associates.
Google Scholar
Cronbach, L. J., & Meehl, P. E. (1955). Construct validity in psychological tests. Psychological Bulletin, 52(4), 281.
Article CAS PubMed Google Scholar
Kane, M. T. (2001). Current concerns in validity theory. Journal of Educational Measurement, 38(4), 319–342.
Article Google Scholar
Schmider, E., Ziegler, M., Danay, E., Beyer, L., & Bühner, M. (2010). Is it really robust? Reinvestigating the robustness of ANOVA against violations of the normal distribution assumption. Methodology: European Journal of Research Methods for the Behavioral and Social Sciences, 6(4), 147.
Article Google Scholar
Milsom, I., Kaplan, S. A., Coyne, K. S., Sexton, C. C., & Kopp, Z. S. (2012). Effect of bothersome overactive bladder symptoms on health-related quality of life, anxiety, depression, and treatment seeking in the United States: Results from EpiLUTS. Urology, 80(1), 90–96.
Article PubMed Google Scholar
Perry, S., McGrother, C. W., & Turner, K. (2006). An investigation of the relationship between anxiety and depression and urge incontinence in women: Development of a psychological model. British Journal Of Health Psychology, 11(3), 463–482.
Article PubMed Google Scholar
Haywood, K. L., Garratt, A. M., Lall, R., Smith, J. F., & Lamb, S. E. (2008). EuroQol EQ-5D and condition-specific measures of health outcome in women with urinary incontinence: Reliability, validity and responsiveness. Quality of Life Research, 17(3), 475–483.
Article PubMed Google Scholar
Herdman, M., Gudex, C., Lloyd, A., Janssen, M., Kind, P., Parkin, D., et al. (2011). Development and preliminary testing of the new five-level version of EQ-5D (EQ-5D-5L). Quality of Life Research, 20(10), 1727–1736.
Article CAS PubMed PubMed Central Google Scholar
Netten, A., Burge, P., Malley, J., Potoglou, D., Towers, A.-M., Brazier, J., et al. (2012). Outcomes of social care for adults: Developing a preference-weighted measure. Health Technology Assessment, 16(16), 1–166.
Article CAS PubMed Google Scholar
Brazier, J., Roberts, J., & Deverill, M. (2002). The estimation of a preference-based measure of health from the SF-36. Journal of Health Economics, 21(2), 271–292.
Article PubMed Google Scholar
Brazier, J. E., & Roberts, J. (2004). The estimation of a preference-based measure of health from the SF-12. Medical Care, 42(9), 851–859.
Article PubMed Google Scholar
National Institute for Health and Care Excellence (NICE). (2013). Guide to The Methods of Technology Appraisal 2013. London: NICE.
Coyne, K., Revicki, D., Hunt, T., Corey, R., Stewart, W., Bentkover, J., et al. (2002). Psychometric validation of an overactive bladder symptom and health-related quality of life questionnaire: The OAB-q. Quality of Life Research, 11(6), 563–574.
Article CAS PubMed Google Scholar
Coyne, K. S., Payne, C., Bhattacharyya, S. K., Revicki, D. A., Thompson, C., Corey, R., & Hunt, T. L. (2004). The impact of urinary urgency and frequency on health-related quality of life in overactive bladder: Results from a national community survey. Value in Health, 7(4), 455–463.
Article PubMed Google Scholar
Coyne, K., Zhou, Z., Bhattacharyya, S., Thompson, C., Dhawan, R., & Versi, E. (2003). The prevalence of nocturia and its effect on health-related quality of life and sleep in a community sample in the USA. BJU International, 92(9), 948–954.
Article CAS PubMed Google Scholar
Tikkinen, K. A., Johnson, T. M., Tammela, T. L., Sintonen, H., Haukka, J., Huhtala, H., & Auvinen, A. (2010). Nocturia frequency, bother, and quality of life: How often is too often? A population-based study in Finland. European Urology, 57(3), 488–498.
Article PubMed Google Scholar
Donovan, J., Kay, H., Peters, T., Abrams, P., Coast, J., Matos-Ferreira, A., et al. (1997). Using the ICSQoL to measure the impact of lower urinary tract symptoms on quality of life: Evidence from the ICS–‘BPH’study. British Journal of Urology, 80(5), 712–721.
Article CAS PubMed Google Scholar
Barentsen, J. A., Visser, E., Hofstetter, H., Maris, A. M., Dekker, J. H., & de Bock, G. H. (2012). Severity, not type, is the main predictor of decreased quality of life in elderly women with urinary incontinence: A population-based study as part of a randomized controlled trial in primary care. Health Qual Life Outcomes, 10(1), 153.
Article PubMed PubMed Central Google Scholar
Pinto, A. M., Kuppermann, M., Nakagawa, S., Vittinghoff, E., Wing, R. R., Kusek, J. W., et al. (2011). Comparison and correlates of three preference-based health-related quality-of-life measures among overweight and obese women with urinary incontinence. Quality of Life Research, 20(10), 1655–1662.
Article PubMed PubMed Central Google Scholar
Subak, L. L., Whitcomb, E., Shen, H., Saxton, J., Vittinghoff, E., & Brown, J. S. (2005). Weight loss: A novel and effective treatment for urinary incontinence. The Journal of Urology, 174(1), 190–195.
Article PubMed PubMed Central Google Scholar
Coyne, K., Zhou, Z., Thompson, C., & Versi, E. (2003). The impact on health-related quality of life of stress, urge and mixed urinary incontinence. BJU International, 92(7), 731–735.
Article CAS PubMed Google Scholar
Kelleher, C., Cardozo, L., Khullar, V., & Salvatore, S. (1997). A new questionnaire to assess the quality of life of urinary incontinent women. BJOG: An International Journal of Obstetrics and Gynaecology, 104(12), 1374–1379.
Article CAS Google Scholar
Davis, S., & Wailoo, A. (2013). A review of the psychometric performance of the EQ-5D in people with urinary incontinence. Health Qual Life Outcomes, 11, 20.
Article PubMed PubMed Central Google Scholar
Tincello, D., Owen, R., Slack, M., & Abrams, K. (2013). Validation of the Patient Global Impression scales for use in detrusor overactivity: Secondary analysis of the RELAX study. BJOG: An International Journal of Obstetrics and Gynaecology, 120(2), 212–216.
Article CAS Google Scholar

Acknowledgments

The main phase of the accuracy and cost-effectiveness studies was funded by the National Institute for Health Research (NIHR) Health Technology Assessment Programme (Grant Reference Number 09/22/122). The views and opinions expressed are those of the authors and not necessarily those of the NHS, the NIHR or the Department of Health. The authors thank the members of the Trial Steering Committee and Data Monitoring Committee Prof. D. Tincello, J. Perks, Dr P. Chein, Prof. J. Cook and Dr K. Ward, the rest of the co-investigators of the study, and the trial management team including Dr S. Rachaneni, S. McCooty, L. Middleton, J. Daniels, Prof. A. Coomarasamy, and Prof. J. Deeks.

Funding

This work represents independent research funded by the National Institute for Health Research (NIHR) Health Technology Assessment Programme (Grant Reference Number 09/22/122).

Author information

Authors and Affiliations

Health Economics Unit, Institute of Applied Health Research, Public Health Building, University of Birmingham, Birmingham, B15 2TT, UK
Ilias Goranitis, Hareth Al-Janabi & Tracy E. Roberts
School of Social and Community Medicine, University of Bristol, Bristol, UK
Joanna Coast
School of Clinical and Experimental Medicine, University of Birmingham, Birmingham, UK
Pallavi Latthe
Birmingham Women’s NHS Foundation Trust, Birmingham, UK
Pallavi Latthe

Corresponding author

Correspondence to Tracy E. Roberts.

Ethics declarations

Conflict of interest

H.A. and J.C. were involved in developing the ICECAP-A capability index measure.

Ethical approval

Ethical approval was received from the Nottingham Research Ethics Committee (Reference: 10/H0408/57). All procedures performed involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki Declaration and its later amendments or comparable ethical standards.

Informed consent

Informed consent was obtained from all individual participants included in the study.

Appendices

Appendix 1

Correlations between the ICECAP-A and ICIQ-OAB that were expected by the authors to be significant at the 5 % level (✓) or not (✗) based on available evidence from the literature and their personal opinion before the statistical analysis.

ICIQ-OAB	ICECAP-A capability index score	Stability	Attachment	Autonomy	Achievement	Enjoyment
ICIQ-OAB score	✓	✓	✗	✓	✓	✓
Frequency of urination (day)	✓	✓	✗	✓	✓	✓
Frequency of urination (night)	✓	✓	✗	✗	✗	✗
Frequency of rush for urination	✓	✓	✗	✓	✓	✓
Frequency of leaking before urination	✓	✓	✗	✓	✓	✓

Correlations were expected to be negative and in the weak range.

Evidence upon which correlations were hypothesised

Frequency of urination tends to impact on social function, general and mental health and often results in sleep problems [44, 45].
Urgency and nocturia tend to have a significant impact on quality of life dimensions, such as physical functioning, pain, general health, vitality, social functioning, physical and emotional role, mental health and sleep [44–47].
Urinary incontinence affects daily life activities, limits behaviour, and also has a psychosocial impact [38, 48].
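
A hypothesis of this kind (a negative, weak, statistically significant correlation) can be checked against observed data with a rank correlation. The sketch below is illustrative only: the data are made up, the 0.3 cut-off for "weak" is an assumed convention rather than a value stated in this appendix, and the permutation p-value is one nonparametric option, not necessarily the test used in the study.

```python
import random
from math import sqrt

def average_ranks(values):
    """Rank values 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions are 0-based, ranks 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson(x, y):
    """Pearson correlation; undefined (division by zero) if x or y is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def spearman(x, y):
    """Spearman's rho: Pearson correlation of the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))

def permutation_p_value(x, y, n_perm=2000, seed=0):
    """Two-sided p-value from randomly permuting y (Monte Carlo estimate)."""
    rng = random.Random(seed)
    observed = abs(spearman(x, y))
    shuffled = list(y)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if abs(spearman(x, shuffled)) >= observed - 1e-12:
            hits += 1
    return (hits + 1) / (n_perm + 1)

def matches_hypothesis(rho, weak_upper=0.3):
    """True if rho is negative and below the assumed 'weak' cut-off in size."""
    return -weak_upper < rho < 0

if __name__ == "__main__":
    # Hypothetical scores for ten respondents (not study data).
    symptom_score = [2, 5, 7, 9, 11, 4, 8, 12, 6, 10]
    capability_index = [0.95, 0.90, 0.80, 0.85, 0.60, 0.92, 0.75, 0.70, 0.88, 0.65]
    rho = spearman(symptom_score, capability_index)
    p = permutation_p_value(symptom_score, capability_index)
    print(f"rho={rho:.3f}, p={p:.4f}, as hypothesised: {matches_hypothesis(rho)}")
```

Each ✓ cell in the table would correspond to a negative rho with p below 0.05, and each ✗ cell to a p-value at or above 0.05.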

Appendix 2

Associations between the ICECAP-A index score and different indicators that were expected by the authors to be significant at the 5 % level (✓) or not (✗) based on available evidence from the literature and their personal opinion before the statistical analysis.

Variables	Expected association	Evidence upon which associations were hypothesised
Age (<65, ≥65)	✗	Evidence for the relationship between age and quality of life among people with symptoms of OAB is contradictory [21, 49, 50]
BMI (normal, overweight, obese)	✓	Evidence for BMI suggests a significant association with quality of life measured with disease-specific and general measures of HrQoL [21, 49–51]
Clinical diagnosis (overactive bladder, mixed incontinence, stress incontinence)	✗	The type of clinical diagnosis among individuals with symptoms of OAB is not a significant determinant of quality of life [49, 52]
Detrusor overactivity (no, yes)	✓	Quality of life appears to be impaired among those with a urodynamically verified detrusor overactivity [53]
Surgery (no, yes)	✗	Evidence for the relationship between quality of life and previous urinary surgery, presence of prolapse or existence of voiding difficulties is scarce and contradictory [20, 45, 50]
Advance prolapse (no, yes)	✗	As for surgery
Voiding difficulty (no, yes)	✗	As for surgery
Bother: frequency of urination (day) (≤5, >5)	✓	There is robust evidence that OAB symptom severity significantly impacts on quality of life and can be captured by both generic measures, like the EQ-5D, and disease-specific measures [44, 49, 54, 55]. For the EQ-5D, however, there is also evidence that severity is not significantly associated with HrQoL [38]
Bother: frequency of urination (night) (≤5, >5)	✓	As for bother, frequency of urination (day)
Bother: frequency of rush (≤5, >5)	✓	As for bother, frequency of urination (day)
Bother: frequency of leaking (≤5, >5)	✓	As for bother, frequency of urination (day)
Total bother of symptoms (low, moderate, high)	✓	As for bother, frequency of urination (day)
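
For a two-level indicator such as detrusor overactivity (no, yes), an expected association can be examined by comparing index scores between the two groups. The sketch below uses a Mann-Whitney U test with a normal approximation as one common nonparametric choice. This appendix does not state which test was applied, the data are invented, and three-level indicators such as BMI would need a different test (for example Kruskal-Wallis).

```python
from math import erf, sqrt

def mann_whitney_u(group_a, group_b):
    """Return (U for group_a, two-sided p-value via the normal approximation).

    Ties receive average ranks and the variance is tie-corrected. No
    continuity correction is applied. Undefined if every value is identical.
    """
    pooled = list(group_a) + list(group_b)
    n1, n2 = len(group_a), len(group_b)
    n = n1 + n2
    order = sorted(range(n), key=lambda i: pooled[i])
    ranks = [0.0] * n
    tie_term = 0.0
    i = 0
    while i < n:
        j = i
        # Extend j to the end of the run of tied values.
        while j + 1 < n and pooled[order[j + 1]] == pooled[order[i]]:
            j += 1
        t = j - i + 1                 # size of this tie group
        tie_term += t ** 3 - t
        for k in range(i, j + 1):
            ranks[order[k]] = (i + j) / 2 + 1
        i = j + 1
    rank_sum_a = sum(ranks[:n1])
    u_a = rank_sum_a - n1 * (n1 + 1) / 2
    mean_u = n1 * n2 / 2
    var_u = n1 * n2 / 12 * ((n + 1) - tie_term / (n * (n - 1)))
    z = (u_a - mean_u) / sqrt(var_u)
    p_two_sided = 1 - erf(abs(z) / sqrt(2))  # equals 2 * (1 - Phi(|z|))
    return u_a, p_two_sided

if __name__ == "__main__":
    # Hypothetical capability index scores (not study data).
    no_overactivity = [0.92, 0.88, 0.95, 0.81, 0.90, 0.86]
    overactivity = [0.74, 0.83, 0.69, 0.80, 0.77, 0.85]
    u, p = mann_whitney_u(no_overactivity, overactivity)
    print(f"U={u}, p={p:.4f}, significant at 5%: {p < 0.05}")
```

With small groups an exact test is usually preferable to the normal approximation; the approximation is used here only to keep the sketch dependency-free.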

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

About this article

Cite this article

Goranitis, I., Coast, J., Al-Janabi, H. et al. The validity and responsiveness of the ICECAP-A capability-well-being measure in women with irritative lower urinary tract symptoms. Qual Life Res 25, 2063–2075 (2016). https://doi.org/10.1007/s11136-015-1225-y

Accepted: 24 December 2015
Published: 11 January 2016
Issue Date: August 2016
DOI: https://doi.org/10.1007/s11136-015-1225-y
