Calculating age-adjusted cancer survival estimates when age-specific data are sparse: an empirical evaluation of various methods

Gondos, A; Parkin, D M; Chokunonga, E; Brenner, H

doi:10.1038/sj.bjc.6602976


Epidemiology
Open access
Published: 24 January 2006


British Journal of Cancer volume 94, pages 450–454 (2006)


This article has been updated

Abstract

We evaluated empirically the performance of various methods of calculating age-adjusted survival estimates when age-specific data are sparse. We have illustrated that a recently proposed alternative method of age adjustment involving the use of balanced age groups or age truncation may be useful for enhancing calculability and reliability of adjusted survival estimates.

Main

Age adjustment is widely used in studies comparing survival of different cancer patient populations. Most commonly, the comparison is made by calculating a weighted average of age-specific survival estimates, using weights reflecting the age distribution of some defined standard population. The practice of age adjustment has, however, varied widely, mainly with respect to the definition, number and width of the age groups, and the methods used to overcome difficulties when the age-specific survival cannot be calculated. Such situations are not uncommon in comparative survival studies, particularly when data for rare cancers or data from registries covering smaller populations are involved. Examples of the practice of age adjustment of cancer survival are given in Table 1.
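As a minimal sketch of the traditional approach described above, the weighted average can be written as follows. The function name and the numbers are invented for illustration and are not taken from the study; a missing age-specific estimate is represented by None, which is an assumption made here to mirror the situation in which the adjusted estimate cannot be calculated.

```python
from typing import Optional, Sequence


def traditional_age_adjusted(
    age_specific_survival: Sequence[Optional[float]],
    standard_weights: Sequence[float],
) -> Optional[float]:
    """Weighted average of age-specific survival estimates.

    Returns None when any age-specific estimate is missing (None),
    because the weighted average is then undefined.
    """
    if len(age_specific_survival) != len(standard_weights):
        raise ValueError("one weight is needed per age group")
    if any(s is None for s in age_specific_survival):
        return None
    total = sum(standard_weights)
    weighted = sum(s * w for s, w in zip(age_specific_survival, standard_weights))
    return weighted / total


# Invented numbers for illustration only (not results from the paper).
weights = [0.2, 0.3, 0.5]
adjusted = traditional_age_adjusted([0.60, 0.50, 0.40], weights)
sparse = traditional_age_adjusted([0.60, None, 0.40], weights)
print(adjusted, sparse)
```

With one age group lacking an estimate, the sketch returns no adjusted value at all, which is the failure mode that motivates the schemes evaluated in this paper.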

Table 1 Examples of age adjustment of population-based cancer survival estimates in the recent literature


Recently, an alternative method was proposed for age adjustment of survival estimates (Brenner et al, 2004), in which weights are assigned to the patients before one carries out the survival analysis. Weights are determined as the percentage of patients in the age group the patient belongs to in the standard population divided by the corresponding percentage in the study population. Survival analysis is carried out using the weighted individual data, without the need to calculate age-specific survival estimates (Brenner et al, 2004).
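The weighting step of the alternative method might be sketched as below. This is an illustration under stated assumptions, not the adperiodh macro: the function name, the group labels and the shares are hypothetical, and the subsequent weighted survival analysis is not shown.

```python
from collections import Counter
from typing import Dict, Hashable, List, Sequence


def individual_weights(
    patient_groups: Sequence[Hashable],
    standard_share: Dict[Hashable, float],
) -> List[float]:
    """Weight per patient: share of the patient's age group in the standard
    population divided by the share of that group in the study population."""
    n = len(patient_groups)
    counts = Counter(patient_groups)
    # A standard age group without any study patients cannot receive a weight.
    empty = [g for g, share in standard_share.items() if share > 0 and counts[g] == 0]
    if empty:
        raise ValueError(f"no patients in age group(s): {empty}")
    return [standard_share[g] / (counts[g] / n) for g in patient_groups]


# Hypothetical example: three younger patients, one older patient,
# and a standard population split evenly between the two groups.
groups = ["young", "young", "young", "old"]
standard = {"young": 0.5, "old": 0.5}
w = individual_weights(groups, standard)
print(w)
```

When every standard age group contains at least one patient and the standard shares sum to 1, the weights sum to the number of patients, so the weighted data set has the age structure of the standard population at the original sample size.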

In this paper, we empirically evaluate and compare the performance of different options of age adjustment methods in situations when age-specific data are sparse.

Materials and methods

Data set

Data from the Zimbabwe National Cancer Registry (ZNCR) (Chokunonga et al, 2000; Parkin et al, 2003) were used. The registry, established in 1985, covers the population of the Zimbabwean capital, Harare. In terms of operational circumstances, which have been described in detail elsewhere, the registry may be considered as a typical example for an urban developing country cancer registry with appropriate data quality outcomes (Sankaranarayanan et al, 1998; Parkin et al, 2003; Chokunonga et al, 2004). Survival results for a large number of cancer sites were recently published elsewhere (Gondos et al, 2004).

We assessed various options of age adjustment of 5-year survival estimates among patients diagnosed with five different types of cancers in 1993–1997 and followed up until 31 December 1999: skin melanomas, breast, cervical and prostate cancer and lymphomas. Breast and cervical cancers were included because they were represented by relatively large samples. Prostate cancer, for which only 3-year survival could be calculated due to the lack of patients with a 5-year follow-up time, was included because of the unusual age distribution (high proportions of older patients). Lymphomas were included because of the uniquely wide age range of the patients, and skin melanomas were selected as an example of an analysis with a very small sample.

Calculation of age-adjusted survival

The site-specific World Standard Cancer Patient Populations (WSCPP) were used as standard populations (Black and Bashir, 1998). Adjustment of the survival estimates was carried out according to the traditional method, and the alternative method recently proposed by Brenner et al (2004), using the age categorisation schemes described below. Throughout this paper, relative rather than absolute survival estimates are presented. The relative survival estimates were calculated according to Hakulinen's method (Hakulinen, 1982), using the WHO life tables for Zimbabwe (WHO, 2001). The calculations were carried out using the SAS macros periodh (Brenner et al, 2002) and adperiodh (Brenner et al, 2004).

Scheme 1: Age adjustment with fixed age group width

First, for each cancer site, we classified the patients by 5-, 10-, 15- and 30-year age groups. With each of these classifications, both the youngest and the oldest age groups were selected so that they actually included patients, and the age of the youngest/oldest patient determined the first/last age group (eg, if the youngest patient was 28, the first 5/10/15/30-year age group was 25–29/20–29/15–29/0–29, respectively).
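The construction of fixed-width groups can be sketched as follows, assuming that group boundaries fall on multiples of the group width counted from age 0, which is consistent with the example given above. The function name and the ages are hypothetical, and a closed upper boundary for the oldest group is an assumption of this sketch.

```python
from typing import List, Sequence, Tuple


def fixed_width_groups(ages: Sequence[int], width: int) -> List[Tuple[int, int]]:
    """Age groups of equal width spanning the youngest to the oldest patient.

    Boundaries are multiples of `width`, so the first and last groups are
    the ones that actually contain the youngest and oldest patients.
    """
    first_lower = (min(ages) // width) * width
    last_lower = (max(ages) // width) * width
    return [(lo, lo + width - 1) for lo in range(first_lower, last_lower + 1, width)]


# Hypothetical ages; the youngest patient is 28, as in the example in the text.
ages = [28, 41, 67]
for width in (5, 10, 15, 30):
    print(width, fixed_width_groups(ages, width))
```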

Scheme 2: Collapsing the youngest and oldest age groups

If needed, that is, if the age adjustment failed, we applied modifications to the boundaries of the youngest and the oldest age groups to enhance calculability: the boundaries of these age groups were modified so that the age-specific survival in these (youngest and oldest) age groups could be calculated. Age groups in between were left unchanged, except for the 30-year age groups, where the shifting of the first or last age group affected the middle age group as well.
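One possible reading of this collapsing step is sketched below. It merges the youngest (or oldest) group with its neighbour until a calculability check passes. The check itself is an assumption: the paper does not state a numerical criterion, so a hypothetical minimum patient count is used purely for illustration, and the group counts are invented.

```python
from typing import Callable, List, Tuple

Group = Tuple[int, int, int]  # (lower age, upper age, number of patients)


def collapse_edges(groups: List[Group], calculable: Callable[[Group], bool]) -> List[Group]:
    """Merge the youngest and oldest groups inward until both are calculable."""
    groups = list(groups)
    # Widen the youngest group by absorbing its older neighbour.
    while len(groups) > 1 and not calculable(groups[0]):
        (lo, _, n1), (_, hi, n2) = groups[0], groups[1]
        groups[:2] = [(lo, hi, n1 + n2)]
    # Widen the oldest group by absorbing its younger neighbour.
    while len(groups) > 1 and not calculable(groups[-1]):
        (lo, _, n1), (_, hi, n2) = groups[-2], groups[-1]
        groups[-2:] = [(lo, hi, n1 + n2)]
    return groups


# Hypothetical criterion (at least 5 patients) and invented counts.
def enough(group: Group) -> bool:
    return group[2] >= 5


start = [(25, 29, 2), (30, 34, 6), (35, 39, 40), (40, 44, 35), (45, 49, 3)]
collapsed = collapse_edges(start, enough)
print(collapsed)
```

In practice the check would be whether the age-specific survival estimate for the group can actually be computed, rather than a fixed patient count.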

Scheme 3: Balanced age groups

Here, we reorganised the age groups in such a way that the number of observations in the age groups would be approximately evenly distributed. The number of age groups was varied between 3 and 5. The boundaries of these ‘balanced’ age groups were aligned to the nearest of those of the original 5-year age groups.
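A minimal sketch of how such balanced boundaries could be derived is given below. It takes empirical quantiles of the patients' ages and moves each cut point to the nearest multiple of 5 years. This is one plausible procedure, not necessarily the one used in the study; the function name and the ages are hypothetical.

```python
from bisect import bisect_right
from collections import Counter
from typing import List, Sequence


def balanced_boundaries(ages: Sequence[int], n_groups: int, step: int = 5) -> List[int]:
    """Lower boundaries of age groups 2..n_groups, aligned to multiples of `step`."""
    ordered = sorted(ages)
    cuts: List[int] = []
    for i in range(1, n_groups):
        quantile_age = ordered[(i * len(ordered)) // n_groups]
        # Integer rounding to the nearest multiple of `step`.
        cut = step * ((quantile_age + step // 2) // step)
        if cut not in cuts:  # sparse data can make two cut points coincide
            cuts.append(cut)
    return cuts


# Invented ages for illustration only.
ages = [22, 27, 31, 33, 36, 38, 41, 44, 47, 52, 58, 66]
cuts = balanced_boundaries(ages, 3)
counts = Counter(bisect_right(cuts, a) for a in ages)
print(cuts, sorted(counts.items()))
```

With these invented ages the three groups (below 35, 35 to 44, 45 and over) each contain four patients; with real data the counts are only approximately equal because the boundaries are forced onto the 5-year grid.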

Calculation of truncated survival

Age-specific survival estimates are often unreliable for older age groups in data from developing countries. Often, as is the case in our study, the standard population gives more weight to the oldest age group than does the study population. In these cases, the adjusted survival estimate can easily become unreliable, as the adjustment assigns a large weight to an unreliable age-specific survival estimate. We therefore repeated all calculations with a truncated age range (0–74 years), following the practice of the largest comparative survival study from developing countries to date (Sankaranarayanan et al, 1998).
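The truncation step could be sketched as follows. Patients aged 75 or over are excluded, and the standard population is restricted to the same age range. Rescaling the remaining standard weights so that they sum to 1 is an assumption of this sketch, as are the function names and the invented weights.

```python
from typing import Dict, List, Sequence, Tuple

AgeGroup = Tuple[int, int]  # (lower age, upper age)


def truncate_patients(ages: Sequence[int], max_age: int = 74) -> List[int]:
    """Keep only patients within the truncated age range."""
    return [a for a in ages if a <= max_age]


def truncate_standard(
    standard: Dict[AgeGroup, float], max_age: int = 74
) -> Dict[AgeGroup, float]:
    """Drop standard age groups above max_age and rescale the rest to sum to 1."""
    kept = {g: w for g, w in standard.items() if g[1] <= max_age}
    total = sum(kept.values())
    if total == 0:
        raise ValueError("no standard age group lies within the truncated range")
    return {g: w / total for g, w in kept.items()}


# Invented standard weights for illustration only (not the WSCPP).
standard = {(0, 44): 0.2, (45, 59): 0.3, (60, 74): 0.3, (75, 99): 0.2}
truncated = truncate_standard(standard)
print(truncated, truncate_patients([30, 74, 75, 80]))
```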

Results

Table 2 shows the numbers of patients by age group in the Zimbabwean cancer populations, illustrates the differences between the age distributions of the study and the standard populations, and indicates the age groups for which the 5-year age-specific survival estimate could not be calculated. The WSCPP include a much higher proportion of patients in the oldest 2–4 age groups than the Zimbabwean patient populations.

Table 2 Age distributions of cancer patient populations in Zimbabwe and of the WSCPP, and the calculability of age-specific 5-year survival estimates, by 5-year age groups


Table 3a provides survival estimates adjusted by the traditional and the alternative method, with all ages included, according to the different schemes we applied. With the traditional method, the fixed age group classifications often failed, due to a failure in calculating 5-year age-specific survival estimates. With collapsed or balanced age groups, the traditional age adjustment became feasible in most cases. The alternative method was feasible even with most fixed age group categorisations, except for the 5-year categories for skin melanomas and lymphomas, where one age group was empty and therefore the weight to be assigned to the patients in the age group could not be calculated. With both the traditional and the alternative method, the application of different age groups resulted in different adjusted survival estimates for all cancer types studied. With the use of balanced age groups, variation was strongly reduced.

Table 3 (a) The 5-year relative survival (in %) of Zimbabwean cancer patients, adjusted to WSCPP and (b) summary of calculating truncated survival estimates: differences in the crude estimates, and ranges of the adjusted survival estimates for all ages and truncated cancer patient populations


Table 3b summarises and compares the results obtained by calculating adjusted survival estimates with all ages involved and with truncated cancer patient populations. The truncation did not alter the crude survival estimates substantially: the differences between the crude estimates for all ages and for the truncated age range were between 0.6 and 3.6 percentage points. However, the variation in the adjusted survival estimates among the different categorisation schemes was strongly reduced for all cancer sites.

Discussion

With the traditional method, the calculation of 5-year age-specific survival estimates often failed in age groups with only a few patients. Failures could mostly be overcome by the application of different age categorisation schemes, that is, by collapsing or balancing the age groups. When using the alternative method (Brenner et al, 2004), calculability was generally very good, even with the fixed age group categorisation schemes.

However, with both methods, the different age group classifications produced age-adjusted estimates with a rather large variability, mainly because of the assignment of large weights for the older age groups in which data were sparse in the ZNCR. These variations could be effectively reduced using balanced age groups and by restricting the analysis to a truncated age range up to 74 years.

There is no theoretically best practice with regard to the number of age groups, their width and the boundaries of the individual age group classifications. For practical purposes, however, adjusted estimates should be reasonably consistent, no matter what age classifications are used. Limitations in data quality frequently impair the reliability of age-specific data among older patients, particularly in the case of patient populations from developing countries (Sankaranarayanan et al, 1998). In such cases, the calculation of truncated adjusted survival may provide estimates of improved reliability and comparability. On the other hand, truncation means that the survival experience of older patients is neglected, which would be justified only if the proportion of these patients is very small. The use of balanced age groups is not affected by this limitation and may be preferred if the exclusion of older patients is of concern.

In looking at the results, the following limitations should be kept in mind. Our empirical evaluation is based on five cancer sites from one cancer registry only. The cancer sites were chosen to represent various sample sizes, age distributions, and higher and lower survival patterns, and therefore reflect a variety of scenarios encountered in comparative analyses of survival. While the problems of sparseness of data and discrepancy between the age distribution of the study population and the standard population were probably more extreme than in most other practical applications, such extreme data situations may facilitate the demonstration of the implications of various analysis strategies under the above conditions. We did not include standard errors of survival estimates obtained with the various methods. As recently demonstrated elsewhere (Brenner and Hakulinen, 2005), the alternative method often provides estimates with a smaller standard error.

In summary, our results illustrate that the enhanced calculability of age-adjusted survival estimates offered by the alternative method may be relevant in practice. Nevertheless, the unreliability of estimates when data within age groups are sparse may remain a concern for both the traditional and the alternative method. In such situations, the use of balanced age groups or the calculation of truncated age-adjusted survival estimates may be useful analytical options.

Change history

16 November 2011
This paper was modified 12 months after initial publication to switch to Creative Commons licence terms, as noted at publication

References

Aareleid T, Sant M, Hédelin G and the EUROCARE Working Group (1998) Improved survival for patients with testicular cancer in Europe since 1978. Eur J Cancer 34: 2236–2240
Black RJ, Bashir SA (1998) World standard cancer patient populations: a resource for comparative analysis of survival data. In: Cancer Survival in Developing Countries, IARC Scientific Publications No. 145, Sankaranarayanan R, Black RJ, Parkin DM (eds), pp 9–11. Lyon: IARC
Brenner H, Arndt V, Gefeller O, Hakulinen T (2004) An alternative approach to age adjustment of cancer survival rates. Eur J Cancer 40: 2317–2322
Brenner H, Hakulinen T (2005) Age adjustment of cancer survival rates: methods, point estimates and standard errors. Br J Cancer 93: 372–375
Brenner H, Hakulinen T, Gefeller O (2002) Computational realization of period analysis for monitoring cancer patient survival. Epidemiology 13: 611–612
Capocaccia R, Gatta G, Roazzi P, Carrani E, Santaquilani M, De Angelis R, Tavilla A, EUROCARE Working Group (2003) The EUROCARE-3 database: methodology of data collection, standardization, quality control and statistical analysis. Ann Oncol 14 (Suppl 5): v14–v27
Chokunonga E, Levy L, Bassett MT, Mauchaza B, Thomas DB, Parkin DM (2000) Cancer incidence in the African population of Harare, Zimbabwe: second results from the cancer registry 1993–1995. Int J Cancer 85: 54–59
Chokunonga E, Ramanakumar AV, Nyakabau AM, Borok MZ, Chirenje ZM, Sankila R, Parkin DM (2004) Survival of cervix cancer patients in Harare, Zimbabwe, 1995–97. Int J Cancer 109: 274–277
Dickman PW, Hakulinen T, Luostarinen T, Pukkala E, Sankila E, Sonderman B, Teppo L (1999) Survival of cancer patients in Finland 1955–1994. Acta Oncol 38 (Suppl 12): 1–103
Gondos A, Chokunonga E, Brenner H, Parkin DM, Sankila R, Borok MZ, Chirenje ZM, Nyakabau AM, Bassett MT (2004) Cancer survival in a southern African urban population. Int J Cancer 112: 860–864
Hakulinen T (1982) Cancer survival corrected for heterogeneity in patient withdrawal. Biometrics 38: 933–942
Parkin DM, Ferlay J, Hamdi-Chérif M, Sitas F, Thomas JO, Wabinga H, Whelan SL (eds) (2003) Cancer in Africa: Epidemiology and Prevention, IARC Scientific Publications No. 153. Lyon: IARC
Sankaranarayanan R, Black RJ, Parkin DM (eds) (1998) Cancer Survival in Developing Countries, IARC Scientific Publications No. 145. Lyon: IARC
Sant M, Capocaccia R, Coleman MP, Berrino F, Gatta G, Micheli A, Verdecchia A, Faivre J, Hakulinen T, Coebergh JW, Martinez-Garcia C, Forman D, Zappone A and the EUROCARE Working Group (2001) Cancer survival increases in Europe, but international differences remain wide. Eur J Cancer 37: 1659–1667
Sant M, van der Sanden G, Capocaccia R and the EUROCARE Working Group (1998) Survival rates for primary malignant brain tumours in Europe. Eur J Cancer 34: 2241–2247
Wang H, Chia KS, Du WB, Lee J, Sankaranarayanan R, Sankila R, Sng I, Seow A, Lee HP (2003) Population-based survival for cervical cancer in Singapore, 1968–1992. Am J Obstet Gynecol 188: 324–329
WHO (2001) Life tables for 191 countries for 2000. World Health Organization. Data available from: http://www3.who.int/whosis/life/life_tables/life_tables.cfm


Acknowledgements

This work was supported in part by the German Research Foundation (Deutsche Forschungsgemeinschaft, Graduiertenkolleg 793).

Author information

Authors and Affiliations

Department of Epidemiology, German Centre for Research on Ageing, Bergheimer Str. 20, Heidelberg, 69115, Germany
A Gondos & H Brenner
Unit of Descriptive Epidemiology, International Agency for Research on Cancer, Lyon, France
D M Parkin
Nuffield Department of Clinical Medicine, Clinical Trial Service Unit and Epidemiological Studies Unit, University of Oxford, Oxford, UK
D M Parkin
Zimbabwe National Cancer Registry, Harare, Zimbabwe
E Chokunonga

Corresponding author

Correspondence to H Brenner.

Rights and permissions

From twelve months after its original publication, this work is licensed under the Creative Commons Attribution-NonCommercial-Share Alike 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/


About this article

Cite this article

Gondos, A., Parkin, D., Chokunonga, E. et al. Calculating age-adjusted cancer survival estimates when age-specific data are sparse: an empirical evaluation of various methods. Br J Cancer 94, 450–454 (2006). https://doi.org/10.1038/sj.bjc.6602976


Received: 28 October 2005
Revised: 21 December 2005
Accepted: 06 January 2006
Published: 24 January 2006
Issue Date: 13 February 2006
DOI: https://doi.org/10.1038/sj.bjc.6602976
