By 2050, the proportion of the global population aged ≥ 60 yr will more than double, reaching 2.1 billion people.1 In high-income countries, this older cohort already accounts for 30–50% of all surgical procedures, a proportion which is increasing.2,3,4 As a result, the syndrome of increased vulnerability to external stressors known as frailty has emerged as a major area of importance. In older surgical patients, the prevalence of frailty is as high as 40–50%.5 Preoperative frailty also doubles the risk of surgical complications, and leads to increased mortality and length of stay, posing unique challenges for anesthesiologists, surgeons, and health services.6,7 There are two “paradigms” of frailty: the phenotypic model (based on five criteria: exhaustion, weight loss, weakness, and decreased physical activity),8 and the deficit model (which conceptualizes frailty as an accumulation of “health deficits”, or impairments across the spectrum of health).9 The phenotypic model is attractive in its simplicity; however, the deficit model is likely a more comprehensive and multidimensional representation of frailty.

Given this evidence of increased harm, preoperative frailty screening is necessary. Although recommended by the 2010 United Kingdom National Confidential Enquiry into Patient Outcome and Death (NCEPOD) in older surgical patients (which stated that “an agreed means of assessing frailty in the perioperative period should be developed and included in risk assessment”), this is not yet commonplace.10 The challenges are complex and logistical. Comprehensive frailty measurement instruments, such as the 70-item frailty index, take a long time, so are impractical prior to anesthesia.9 Other frailty measurement scales require time, training, and functional testing of patients, constraining widespread uptake.11 Numerous prior studies have investigated more simple frailty instruments in surgical patients to find the optimal scale in the perioperative context. For example, a 2016 North American study of 415 orthopedic patients measured the five-item frailty phenotype scale, showing only moderate concordance with the reference frailty index (weighted kappa = 0.42; 95% confidence interval [CI], 0.36 to 0.49).12 A promising, simplified instrument is the nine-point Clinical Frailty Scale (CFS). This scale enables rapid frailty screening incorporating a range of areas known to be affected by frailty without the need for specific geriatric expertise or functional testing; it was reliable and accurate in other populations.9,13 It also requires considerably less time to administer compared with other multidimensional frailty measurement tools. The CFS has been examined in surgical populations, including a 2018 study of 702 patients undergoing elective non-cardiac surgery, where it was found to be predictive of death, new disability, and institutional discharge.14 Nevertheless, its application in anesthesia and surgery compared with other more validated frailty tools requires further investigation.

Accordingly, we aimed to test the CFS as a frailty screening tool in a cohort of elective and emergency surgical patients. In particular, we hypothesized that the CFS would predict frailty with excellent accuracy and be strongly correlated and in agreement with the Edmonton Frail Scale—a well-researched, well-used, and well-studied multidimensional frailty scale that is too detailed to be applied as a widespread screening tool. We secondarily hypothesized that frailty would affect the full range of health domains in surgical patients and be associated with detrimental postoperative outcomes, including increased mortality and complications.

Methods

This was a prospective, observational cohort study in the perioperative department of the Royal Melbourne Hospital, and formed a complementary, concurrent investigation to our previously published study in 160 critically ill patients.15 The protocol for this frailty research program has previously been published.16 Ethics approval for this study was issued by the Human Research and Ethics Committee of Melbourne Health (20 January 2017, HREC/16/MH/321). Patients aged ≥ 65 yr presenting for either elective or emergency surgery between February 1, 2017 and June 10, 2017 were eligible. Written consent was obtained from either the patient or their next-of-kin if the patient was incapacitated.

Patients who were non-English speaking or presenting for endoscopy procedures were excluded. A convenience sample of patients were enrolled around availability of investigators; only the first operation during the hospitalization was included. Patients were identified from the elective pre-admission anesthetic clinic, or the emergency operating room list. Perioperative data collected included age, sex, height, weight, Charlson comorbidity index, Katz index of independence in daily living,17 residential location prior to admission, P-POSSUM surgical risk score,18 and surgical subspecialty.

Frailty was defined as the state of the patient in the two weeks prior to admission to hospital (elective surgery) or prior to onset of acute illness (emergency surgery) to be consistent with past definitions19; it was preoperatively collected by one of three study investigators. Treating medical teams were blind to patient data collected for this study. Before data collection started, all assessors were trained in the measurement of the CFS by the lead investigator, who has been specifically trained in deriving the CFS score, with ongoing supervision of randomly selected patients throughout the study period. Illness severity, functional ability, and social independence were assessed using the free-text descriptions of these domains contained within the CFS tool. The Edmonton Frail Scale quantifies frailty across nine areas: general health, independence, cognition, social support, medication, nutrition, mood, continence, and function. If the “timed-up-and-go” test was not possible, a Reported Edmonton Frail Scale was conducted, which assesses three areas of reported performance (ability to perform heavy housework, walk up two floors of stairs, walk one kilometre unaided), with a total score of 18 vs 17 for the original scale.20 In the case of abnormal cognition (e.g., delirium or confusion), history from patients’ next-of-kin was obtained; this surrogate history was also used to determine frailty status using the abovementioned definitions if required. Following collection of the Edmonton frailty scale data, the CFS was also measured preoperatively, which is judgement-based around illness severity, functional ability, and social independence.9 The Edmonton frailty scores were subsequently aggregated and the total score assigned after all data were collected.

The primary outcome was agreement and correlation between CFS and Edmonton scales (as described below). Secondary outcomes included 30-day mortality, hospital length of stay, postoperative complications (acute myocardial infarction, cardiac arrest, tracheal reintubation, acute pulmonary edema, deep venous thrombosis, pulmonary embolus, stroke, wound infection, acute kidney injury, unplanned need for reoperation, unplanned admission to the intensive care unit [ICU], all the definitions of which are listed in Electronic Supplementary Material [ESM]); discharge status at postoperative day 30 (i.e,, home, assisted living facility, rehabilitation, other acute hospital, or not discharged); and residential location at six-months postoperatively (i.e., home, acute hospital, assisted-living facility). Outcomes were measured at 30 days postoperatively by investigators not blinded to study data, although the Edmonton frailty scores were aggregated after all data were collected.

Statistical analysis

Continuous data were summarized using mean (standard deviation [SD]) or median [interquartile range (IQR)] in the case of skewed data. Binary and categorical data were summarized as frequencies and percentages. We compared baseline characteristics, health outcomes, and health domains of patients with and without frailty using Chi square or Fisher’s exact tests, two-sample t tests, and Wilcoxon rank-sum tests as appropriate. Univariable and multivariable regression models were also fitted to secondary outcomes, the latter adjusting for age, sex, admission source, Charlson comorbidity index, and emergency/elective surgery. Binary variables with more than one patient were analyzed using Firth logistic regression to obtain the estimated odds ratio and 95% CI.21 The estimated median difference and 95% CI between patients with and without frailty regarding hospital length of stay (days) were obtained from bootstrapped quantile regression with 5,000 replications. Discharge location was analyzed using multinomial logistic regression, yielding the estimated relative risk ratio and 95% CI. Spearman correlation coefficient was used to measure correlation between Edmonton and CFS scales, with inter-rater agreement assessed using Cohen’s kappa between dichotomized (not frail: CFS 1–4, Edmonton 0–7; frail: CFS ≥ 5, Edmonton ≥ 8) and ordinal categories (not frail: CFS 1–3, Edmonton 0–5; vulnerable: CFS 4, Edmonton 6–7; mildly frail: CFS 5, Edmonton 8–9; moderately frail: CFS 6, Edmonton 10–11; severely frail: CFS ≥ 7, Edmonton ≥ 12). The latter were measured via quadratic weighting because higher CFS categories are increasingly important. Kappa coefficients were categorized using the scale of Landis and Koch.22 A Firth logistic regression model was fitted to frailty defined via the Edmonton scale using the ordinal CFS as a predictor to obtain the area under the receiver operating characteristic curve (AUCROC) and its exact binomial 95% CI. The AUCROC was categorized using the general guidelines of Hosmer, Lemeshow, and Sturdivant.23 Sensitivity and specificity were also calculated guided by the optimal cut-off point according to the highest Youden index. We included all enrolled patients in the analyses except in the analysis of residence location at follow-up, which was restricted to patients admitted from home. In addition, we performed separate analyses to examine differences between the whole data set and subgroups according to whether patients were admitted to the ICU or not (and hence included in our previous report).15 No adjustment for multiple testing was performed. STATA 15.1 (College Station, TX, USA) was used for statistical analyses.

Sample size

A sample size of 200 was calculated to have power > 99% (two-sided alpha of 5%) for the Spearman coefficient between the Edmonton and CFS scales to be at least strong (0.80), assuming a correlation of 0.90.24 A sample of 200 participants was also calculated to produce a 95% Clopper-Pearson binomial CI of 15 to 26% around a frailty prevalence estimate of 20%, based on a meta-analysis encompassing more than 8,000 surgical patients. This provided acceptable CIs around likely frailty prevalence for this sample size.6

Results

Two-hundred and eighteen patients were enrolled during the four-month recruitment period, 118 (54%) undergoing elective and 100 (46%) emergency surgery (Fig. 1). Forty-two patients (19%) were admitted to the ICU following surgery, and were included in the previous published analysis.15 Median [IQR] age of patients was 74 [69–80] yr, 99 (45%) of patients were female, with 195 (89%) residing at home prior to surgery. Compared with non-included eligible patients undergoing identical surgical operations in the study period, included patients were of similar age (74 [69–80] yr vs 74 [70–80] yr for non-included patients; P = 0.51), with an increased median [IQR] hospital length of stay (5 [2–13] days vs 4 [1–9] days; P = 0.005). Frailty scales were completed for all included patients; reported physical performance was used for 107 (49%) patients who were unable to perform the timed-up-and-go test. A further 28 (13%) patients had incomplete cognition (clock-drawing) tests because their conscious state had decreased. All other data were complete.

Fig. 1
figure 1

Study flow diagram

Fifty-two patients (24%; 95% CI, 18 to 30) were measured as frail using the Edmonton scale, compared with 61 (28%; 95% CI, 22 to 34) via the CFS (Fig. 2). The Spearman correlation coefficient comparing the continuous CFS and Edmonton was 0.81 (95% CI, 0.77 to 0.86), indicating a “high” correlation. The kappa coefficient comparing the dichotomized CFS and Edmonton was 0.65 (95% CI, 0.54 to 0.77), and comparing ordinal scales was 0.76 (95% CI, 0.70 to 0.81), both indicating substantial agreement. The AUCROC of the CFS when predicting frailty according to the Edmonton scale was 0.91 (95% CI, 0.86 to 0.94) indicating excellent to outstanding discrimination. A CFS cut-point of “vulnerable” (CFS = 4) captured all but one patient with frailty, as scored by the Edmonton scale, for a sensitivity of 98.1% (95% CI, 89.7 to 100) and specificity of 54.8% (95% CI, 46.9 to 62.5). A CFS cut-point of “mildly frail” (CFS = 5) had the highest Youden index for a sensitivity of 80.8% (95% CI, 67.5 to 90.4) and specificity of 88.6% (95% CI, 82.7 to 93.0). Agreement, correlation, sensitivity, and specificity of the CFS were similar when considering subgroups of patients admitted and not admitted to the ICU (ESM, eTables 1 and 6). Compared with patients without frailty, those scored as frail had higher Charlson comorbidity scores, less functional independence, less likely to be living at home, with higher P-POSSUM scores (Table 1). These associations with frailty persisted when considering the previously unreported subgroup of 176 patients not admitted to ICU (ESM, eTable 2).

Fig. 2
figure 2

Frailty status of study participants at baseline

Table 1 Baseline demographics according to Edmonton Frail and Clinical Frailty Scales

After adjusting for potential confounders, mortality at 30 days was higher in patients with frailty using the Edmonton scale (9.6% vs 1.8%; OR, 5.26; 95% CI, 1.28 to 21.62) (Table 2). Any postoperative complications were more common with frailty using the Edmonton scale (OR, 3.67; 95% CI, 1.84 to 7.30), in particular, any unplanned admission to the ICU (OR, 4.54; 95% CI, 1.83 to 11.26), and unplanned return to the operating theatre (OR, 6.20; 95% CI, 2.09 to 18.38). The median [IQR] hospital length of stay was longer in patients with frailty (9.0 [5.0—14.6] vs 3.0 [1.0—9.0] days; mean difference, 2.52; 95% CI, -0.66 to 5.69). When considered via the CFS, patients with frailty were more commonly discharged to assisted living or rehabilitation (23 [38%] frail vs 23 [15%] non-frail) compared with home (29 [48%] frail vs 118 [75%] non-frail; relative risk ratio, 2.66; 95% CI, 1.15 to 6.13). At six-month follow-up, excluding five patients who were lost to follow-up and 33 patients who had died, eight of 41 surviving patients with frailty (20%) were living in an assisted living facility compared with five of 134 surviving patients without frailty (4%), as measured via the CFS. The relative change in the odds of living in an assisted living facility at follow-up, for frail patients compared with non-frail patients (as measured via the CFS) admitted from home, was 1.74-fold (95% CI, 0.37 to 8.16).

Table 2 Secondary outcomes according to frailty, Edmonton Frail and Clinical Frailty Scales

Across the spectrum of Edmonton scale domains, patients diagnosed with frailty had significantly worse health; the magnitude of discrepancy in health status was similar whether assessed via the Edmonton and CFS scales (Table 3). Frailty was associated with significantly worse malnutrition (48% vs 19% patients; P = 0.001), medication usage (92% vs 45% of patients taking five or more medications; P < 0.001), higher functional dependence (42% vs 4% of patients; P < 0.001), and lower physical performance (47% vs 7% of patients requiring > 20 sec or unable to perform a timed-up-and-go test, P < 0.001). Patients with frailty also had considerably more hospital admissions in the year prior to surgery compared with patients without frailty (P < 0.001). When separately considering subgroups of patients admitted and not admitted to ICU, demographics were similar apart from higher median [IQR] P-POSSUM mortality risk in ICU-admitted patients (9.4% [6.2–20.1] vs 3.9% [2.5–8.1]; P < 0.001) and more emergency surgery (25 [59.5%] vs 75 [42.6%]; P = 0.048) (ESM, eTable 3); findings with frailty across domains remained, although analyses were constrained by small numbers in the ICU-admitted cohort (ESM, eTables 4 and 5).

Table 3 Analysis of individual frailty domains according to Edmonton and Clinical Frailty Scales

Discussion

In a single-centre prospective cohort study, we found strong correlation and agreement between the CFS and the Edmonton Frail Scale in a surgical population. Furthermore, the CFS showed a strong ability to discriminate between people identified as being frail or not frail based on the Edmonton scale. This study also shows that frailty in surgical patients measured by the CFS affects the full spectrum of health, including functional dependence, medication use, physical ability, and nutrition. Consistent with the larger perioperative literature, surgical patients with frailty also had considerably worse postoperative outcomes, with increased complications, higher mortality, and reduced discharge home after surgery.

It is ten years since the NCEPOD recommendation to include frailty screening in surgical risk assessment for older patients.10 Nevertheless, there remains a major gap in perioperative frailty screening. The Association of Anaesthetists of Great Britain and Ireland, the American College of Surgeons, and the American Geriatrics Society all similarly emphasize the importance of perioperative frailty measurement.25,26 The Edmonton Frail Scale has been applied in a variety of surgical cohorts, where it has been found predictive of postoperative complications, increased length of stay, and inability to be discharged home after surgery.27 It also has good inter-rater reliability, and correlates well with a comprehensive geriatric assessment.11 There are significant challenges, however, in its widespread application as a screening tool to a surgical population. Half of our cohort, for example, could not undergo the timed-up-and-go test, with a further 13% unable to undergo clock-drawing cognitive assessment because of their decreased conscious state. There is also a time and training requirement, which presents challenges in a busy preoperative assessment clinic, as well as the emergency surgical setting. Because of these challenges, the Edmonton scale has limited scope for widespread deployment in an unselected population at risk for frailty. The CFS thus presents a convenient, easy to deploy screening tool devoid of many of these constraints. The CFS has been well studied in the perioperative literature, and has been shown to be feasible. In the study by McIsaac et al., it was rated by anesthesiologists as easy to use, useful, and beneficial to patient care, and taking a mean (SD) of 44 (40) sec to administer,14 and was associated with poor outcomes in surgical patients. The AUCROC of 0.91 and high degree of correlation between the CFS and Edmonton scales found in this study supports its role as a screening tool in surgical patients. This study also implies that the threshold for frailty measured by both the CFS and Edmonton scales is similar. Nine of ten divergent patients (Edmonton frail, CFS non-frail) had a CFS score of 4 (“vulnerable”), only one point less than the frailty cut-point of 5. Similarly, 18 of 19 patients scored frail via the CFS and non-frail with the Edmonton scale—their Edmonton score was 6–7 (“vulnerable”), which was just one point less than the frailty cut-point score of 8. This indicates that the CFS adequately selects patients for further assessment with a more detailed, granular multidimensional frailty measure such as the Edmonton scale. Although the highest Youden index was seen for a CFS ≥ “mildly frail” (CFS = 5), we advocate that this screening cut-point should be a CFS ≥ 4, which we would have captured in our cohort in all but one Edmonton-scored patient with frailty. This also represents an appropriate balance between specificity and sensitivity; patients selected at this point will likely have Edmonton scores achieving or very near the “vulnerable” threshold, and will likely also benefit from formalized frailty measurement. This formalized assessment is obviously less applicable to emergency surgical patients. In this population, the finding of significantly increased mortality and postoperative complications with frailty (as with other studies) may instead prompt more attention to a shared decision-making process for frail patients.

A major finding of our study was that perioperative frailty affects a wide range of health domains, with screening able to identify particular areas in which health deficits are over-represented in frail surgical patients. This included functional dependence, medication use, physical performance, and malnutrition, and concurs with our recent study into the areas of health affected by frailty in critical illness.15 This is a significant finding, as it implies that follow-up frailty measurement following a screening process must be comprehensive, particularly given recent trends to use hospital databases to construct so-called “frailty” measures. Although these rapid scales are easy to derive, there may be a significant bias towards medical comorbidities, and comparatively little information regarding other equally important domains of frailty. For example, the “modified frailty index”, an automated frailty measure derived from coding data, which has seen numerous publications in the surgical literature, is almost entirely a comorbidity measure.28 It may therefore fail to capture the complex multidimensional state that is frailty, and risk over-simplifying this complex condition to the detriment of future research and potential development of interventions for frail surgical patients.29,30

More recent examples of better-designed electronic frailty indices exist. The “eFI”, derived and validated in primary-care setting data sets in the UK, is a 36-item frailty index that captures a wide range of health deficits.31 It is able to predict mortality, hospitalization, and nursing home admission, but has not been tested in the perioperative setting. A more recent surgical-specific index, the “perioperative frailty index” has been developed by McIsaac’s group using population-based health administrative data in Canada.32 This index comprises 30-items, and in over half-a-million surgical patients was shown to correlate with postoperative death and institutional discharge. In this new era of electronic health records, electronic frailty indices thus represent an extremely useful approach to measuring frailty, however, must be comprehensive across the spectrum of health.

The finding of specific health domains that are disproportionately affected by frailty may also help identify areas that are potentially modifiable perioperatively. For example, the emerging interest in preoperative physical training (“prehabilitation”) may potentially be able to improve physical performance prior to surgery in frail patients. Limited research has shown this potential, with ongoing studies in this area.33,34 Preoperative nutritional support may also be able to reduce the malnutrition seen in surgical patients with frailty.35,36 More research is needed to determine whether these or other interventions can translate to better postoperative outcomes with frailty. Although logistically challenging for emergency surgery, elective surgical timelines may allow such targeted preoperative optimisation. Moreover, in the postoperative period there may be benefit in addressing specific areas of health deficits for individual patients, to reduce overall operative risk.

Strengths of our study are the inclusion of a broad range of both emergency and elective surgical patients across the spectrum of surgical subspecialties, enhancing generalizability. Although other studies have examined the CFS in specific surgical cohorts, including cardiac surgical37 and general surgical patients,38 the inclusion of a wide range of surgical specialties is previously limited within the literature (an exception being McIsaac et al.’s study of 702 non-cardiac surgical patients).14 A further strength is the similarity between included and non-included patients, indicating that our study cohort is likely representative of surgical patients in our institution, as well as the completeness of study data.

A study limitation was the use of both the Edmonton and Reported Edmonton scales, with total scores possibly varying by one point. Although possibly influencing the comparability of scores obtained, we considered this preferable to missing physical performance scores for half of the cohort (107 of 218 patients), which would have made assessment of this frailty domain impossible. A further limitation was the fact our study was conducted in a single hospital, which may reduce generalizability, and also that investigators assigned both frailty scores without blinding nor randomisation of assessment order, potentially biasing measurement. Nevertheless, we consider the risk of bias to be low since CFS and Edmonton scores were assigned based on objective criteria and strict definitions regarding function, independence, and medical disease status, and total Edmonton scores were aggregated after the CFS score was assigned. Moreover, the methodology in this study (data collectors non-blinded to the comparator scale) is similar to that of other studies comparing frailty scales, including in critically ill and other patient populations.39,40 Future research could improve on our and others’ study designs by utilising blinded assessment in frailty measurement, and by having individual patient frailty measured by more than one investigator to assess inter-rater reliability and to explore potential differences in assessments according to healthcare provider background. For example, a recent study in an ICU population revealed similar CFS scores when assigned by raters from medical, research coordinator, or occupational therapy backgrounds.41 This remains unexplored in the perioperative setting. An extension of the study design to compare the CFS with a comprehensive geriatric assessment in surgical patients would also be of value, although we note the significant challenges in administering such a comprehensive tool in the perioperative setting. Forty-two (19%) of our patients were also included in our previous study examining the Edmonton and CFS in ICU patients. Sensitivity analyses, however, showed little impact of inclusion of these patients on overall findings. In particular, the screening performance of the CFS with respect to the Edmonton scale was maintained, association with frailty and baseline demographics preserved, and the finding of worse health status across frailty domains persisted regardless of inclusion or omission of these patients. Therefore, our conclusions about frailty in the perioperative context remain valid.

In conclusion, the CFS is an accurate, sensitive screening tool, with good face and content validity to measure frailty in the perioperative setting. Frailty in surgical patients affects the spectrum of health-related domains, which are important to include in candidate frailty measurement instruments. Higher risk patients should be screened for frailty prior to anesthesia with a cut-point of a CFS ≥ 4 selecting those for more comprehensive measurement. This may help identify particular health domains amenable to interventions to reduce the impact of perioperative frailty.