Introduction

The small intestine comprises more than two-thirds of the digestive tract in length and more than 90% of the absorptive surface area [1]. It is situated between the stomach and colon, which are both common sites for cancer to develop [2]; however, the small intestine rarely develops cancer, with incidence ranging between less than 0.5 per 100,000 in some parts of Africa and Asia to 4.1 in specific regions of the United States (US) in the period between 2008 and 2012 [3].

The two main histologic subtypes of small intestinal cancer (SIC) are adenocarcinoma and carcinoid tumours. According to US Surveillance, Epidemiology, and End Results (SEER) registries data, the incidence of SIC increased from 1.16 to 2.52 per 100,000 between 1975 and 2019 [4]; this trend is mainly explained by the 4.4-fold increase in carcinoid tumour incidence, but the underlying aetiological factors remain largely unknown [5]. The increase in incidence has been consistently observed in both sexes and by ethnicity [5]. In concordance, European studies have also reported increased incidence of SIC over the last few decades [6,7,8]. As a result, it could be hypothesised that the increase in incidence is partly related to changes in modifiable risk factors such as lifestyle and dietary factors [5].

In addition to the anatomic and physiologic similarities, there is further evidence of common causal pathway for adenocarcinoma of the small intestine and colorectum. Firstly, there is a geographical correlation in the incidence of both malignancies, which is attributed to the increases in risk factors associated with the “Westernisation” of diet and lifestyle [9, 10]. Furthermore, the two anatomical sites share the widely acknowledged adenoma-carcinoma sequence of events [11] and there is more than a two-fold increased risk of developing colorectal cancer (CRC) for SIC patients and greater than a three-fold increased risk of developing SIC for CRC patients [12, 13]. These findings suggest that SIC may also share the modifiable risk factors for CRC [1].

While CRC has been extensively studied, the aetiology of SIC remains largely unknown as relatively few epidemiological studies have been conducted due to sample size constraints. Considering the increasing incidence of SIC and the evidence for common causality with CRC, it is important to examine how modifiable risk factors for CRC are associated with SIC. Using the European Prospective Investigation into Cancer and Nutrition (EPIC) cohort, we investigated whether previously identified risk factors for CRC, such as smoking, alcohol, physical activity, and dietary factors such as meat, fat, dairy products, vegetables and fibre are associated with SIC incidence.

Materials and methods

Study population

EPIC is an ongoing multicentre prospective cohort study; details of the rationale, design, and data collection methods have been described previously [14, 15]. Between 1992 and 2000, 521,323 participants, mostly between 35 and 70 years of age, were recruited from 23 study centres in ten European countries (Denmark, France, Germany, Greece, Italy, Norway, Spain, Sweden, The Netherlands, and the UK). After exclusion of prevalent cancer cases (n = 29,332), participants who did not complete the questionnaires (n = 6,259), participants with extreme energy intake (top and bottom 1% based on energy intake to energy requirement ratio or daily energy intake of < 600 kcal or > 6000 kcal) (n = 9,577), and participants from Greece (n = 26,048) due to data restriction issues, our analytic cohort consisted of 450,107 persons.

At baseline, participants completed detailed questionnaires and anthropometric measurements and blood samples were taken. All participants signed an informed consent agreement, and ethical approval for the EPIC study was provided by the review boards of the International Agency for Research on Cancer (IARC) and local participating centres.

Assessment of exposures

Diet at baseline was measured with validated questionnaires to measure habitual consumption over the preceding year. Most centres adopted a self-administered quantitative dietary questionnaire of 260 food items while semi-quantitative food frequency questionnaires (FFQs) were used in Denmark, Norway, Naples (Italy), and Umeå (Sweden) and combined dietary methods were used in the UK and Malmö (Sweden). Nutrient intakes were calculated with the use of the EPIC Nutrient DataBase (ENDB), a standardised food-composition table [16, 17]. We examined red meat, processed meat, fibre and dairy as they have either been convincingly or probably associated with CRC [18]. Although the evidence for vegetable and fat intake and CRC risk is limited, we examined these two variables as they have been studied in relation to SIC before [19]. Unfortunately, complete data on whole-grain intake was not available in EPIC, and hence was not included.

A non-dietary questionnaire collected detailed information about alcohol consumption, smoking, physical activity and education at baseline. In most centres, height and body weight were measured at baseline according to standardised procedures; however, in Oxford (UK), France, and Norway, anthropometric values at baseline were self-reported [15]. A study deriving prediction equations from EPIC-Oxford subjects with both standard and self-reported measures showed self-reported measures are valid for identifying relationships in epidemiological studies [20]. A previous analysis in this cohort investigated waist circumference (WC) and body mass index (BMI) in relation to SIC, finding that WC was positively associated with SIC [21].

Assessment of outcome

Participants were followed for cancer incidence through surveillance of medical records, tumour registry linkage and active follow-up. Follow-up was based on population cancer registries in Denmark, Italy, Netherlands, Norway, Spain, Sweden, and the UK. A combination of other methods, such as health insurance records, cancer and pathology registries and active contact of participants or next of kin were used in France and Germany. Furthermore, the information provided by the participants or their next of kin was verified by physician records. Participants were at risk from their enrollment into the study until diagnosis of SIC, death, loss to follow-up or the end of follow-up (December 2013), whichever occurred first.

The tenth revision of the International Classification of Diseases (ICD-10) and the third edition of the International Classification of Disease for Oncology (ICD-O) were used to code SIC by anatomical location (ICD10: C17.0–17.9) [22, 23]. Analyses by histological subtypes included the two main subtypes of SIC: adenocarcinoma (morphology codes in the EPIC data: 8140/3, 8141/3, 8143/3, 8144/3, 8210/3, 8211/3, 8480/3, and 8481/3) and malignant carcinoid tumours (morphology codes: 8240/3, 8241/3, 8244/3, 8245/3, and 8246/3).

Statistical analysis

We examined dietary and lifestyle factors in relation to SIC, as well as the subtypes of adenocarcinoma and carcinoid tumours, using Cox proportional hazards models to estimate hazard ratios (HRs) and 95% confidence intervals (95% CIs). Age was used as the underlying timescale in all analyses. All variables had less than 5% missingness except WC, which had 23.5% missing. Those with missing data for a given continuous covariate were automatically removed from that specific model, and missing categories were created for smoking status, education and physical activity.

Univariable and multivariable HRs were reported within predefined categories or tertiles from the whole cohort, using the lowest tertile as the reference category, as well as for continuous data for dietary factors and alcohol consumption. Tests for linear trend within tertiles were performed using the median value of each tertile.

Multivariable models for lifestyle factors were stratified by age (in quintiles), sex and country to control for possible confounding effect of age, differing follow-up methods, questionnaire design, and other variations between countries. For dietary factors, univariable models were adjusted for energy using the nutrient density method while multivariable models were further stratified by age, sex and country [24].

When not the main exposure variable, the following potential confounders were examined based on previous studies on SIC and CRC: BMI (kg/m2), WC (cm), height (cm), education (none/primary school, technical/professional, secondary, and university), smoking status (never, former, current), baseline alcohol drinking (g/day), physical activity (inactive, moderately inactive, moderately active, and active, defined by the Cambridge index), intakes of fibre, meat, fish, fruit, vegetables, dairy, calcium and folate. These potential covariates were checked for each multivariable model using the “change-in-estimate” approach in which covariates are selected if their inclusion in a stepwise-manner changes the effect estimate by 10% or more [25]. The confounder selection approach yielded no significant confounding for any of the models.

In sensitivity analyses, we included WC in all multivariable models. In addition, we performed a sensitivity analysis for SIC in which we selected covariates a priori based on evidence from previous studies on SIC and CRC and included all of the following covariates in multivariable models: sex, country, education, smoking status (never, former, current), baseline alcohol drinking (g/day), physical activity (inactive, moderately inactive, moderately active, and active), BMI (kg/m2) and energy using the nutrient density method for dietary factors. A lag-analysis was conducted by excluding the 1st year of follow-up to evaluate the potential bias of reverse causality since undiagnosed disease at baseline may have led to changes in diet and lifestyle.

The proportional hazards assumption was tested based on the Schoenfeld residuals. Except for sex, for which the models were stratified by, all the variables fitted the proportionality assumption. Two-sided tests with a significance level of 0.05 were chosen, and all analyses were performed using R version 4.2.2.

Results

Basic characteristics

During an average follow-up of 14.1 years, 160 incident SIC cases were identified (66 in men and 94 in women) and they were comprised of 51 adenocarcinomas, 62 carcinoids, 19 sarcomas, 13 lymphomas, and 15 unknown histology. Adenocarcinomas were most commonly found in the duodenum (75.0% of all duodenal cancers were adenocarcinomas and only 2.5% were carcinoids) and jejunum (47.8% of all jejunal cancers were adenocarcinomas, 13.0% were carcinoids). Conversely, carcinoid tumours were mainly located in the ileum (64.6% of ileal cancers were carcinoids and 6.3% were adenocarcinomas).

The average age at study entry was 55.7 years for cases and 51.1 years for non-cases (Table 1). There was a lower percentage of women among cases (58.8%) compared to non-cases (70.8%). The distributions of baseline characteristics are given in Table 1 by sex.

Table 1 Baseline characteristics of small intestinal cancer cases and non-cases in EPIC (n = 450,107)

Lifestyle risk factors

In the univariable model, current smokers had an elevated risk of SIC compared to never-smokers (HR, 95% CI: 1.77, 1.21–2.60) but the association attenuated once it was stratified by age, sex and country (HR, 95% CI: 1.29, 0.87–1.92) (Table 2). HRs were elevated but not statistically significant for current smoking and risk of both histological subtypes of SIC. There were no associations between alcohol consumption or levels of physical activity in relation to the risk of SIC, adenocarcinoma or carcinoid tumours (Table 2).

Table 2 HRs and 95% CIs for small intestinal cancer risk in relation to lifestyle factors

Dietary risk factors

In univariable models, there was an inverse association in the highest tertile of vegetable intake for SIC overall (HR, 95% CI: 0.48, 0.32–0.71, p-trend: < 0.001) and for continuous data (HR, 95% CI: 0.67, 0.50–0.90, per 100 g/1000 kcal increase) that remained statistically significant in the multivariable model for the medium versus lowest tertile only (HR, 95% CI: 0.64, 0.44–0.94, p-trend: 0.23) (Table 3). A similar inverse association was observed in the univariable model for carcinoid tumours (HRT3vsT1, 95% CI: 0.44, 0.24–0.82, p-trend: 0.01; HRcontinous, 95% CI: 0.49, 0.29–0.83) that remained significant in the multivariable model for the medium versus lowest tertile only (HR, 95% CI: 0.46, 0.23–0.88, p-trend: 0.37; Table 3).

Table 3 HRs and 95% CIs for small intestinal cancer risk in relation to meat, dairy, and vegetable intake

Total fat was inversely associated with SIC only in the middle versus lowest tertile (univariable HRT2vsT1, 95% CI: 0.57, 0.38–0.84; multivariable HRT2vsT1, 95% CI: 0.55, 0.37–0.81; Table 4) and was consistent across both histologic subtypes (adenocarcinoma multivariable HRT2vsT1, 95% CI: 0.44, 0.22–0.91; carcinoid multivariable HRT2vsT1, 95% CI: 0.44, 0.22–0.87; Table 4). There were no clear associations by fat subtype. Similar to total fat, monounsaturated fat was inversely associated with SIC only in the middle versus lowest tertile in the multivariable model (HRT2vsT1, 95% CI: 0.63, 0.42–0.93; Table 4). Although there was an inverse association in the univariable model for polyunsaturated fat and SIC in the continous data (HR, 95% CI: 0.40, 0.18–0.90, per 10 g/1000 kcal increase), this attenuated in the multivariable model. We did not observe any associations for red meat, processed meat, dairy, fibre or saturated fat intake (Tables 3, 4).

Table 4 HRs and 95% CIs for small intestinal cancer risk in relation to fibre, fat and its sub-groups

The exclusion of the first year of follow-up eliminated 10 SIC cases (3 adenocarcinomas and 2 carcinoids) and the results were not materially different from the main findings (data not shown). In sensitivity analyses, further adjustment for WC did not change the results substantially (Supplementary Tables 1–3); however, it revealed an inverse association for processed meat and SIC adenocarcinoma in both categorical (HRT3vsT1, 95% CI: 0.41, 0.17–0.95, p-trend: 0.04) and in continous data for multivariable models (HR, 95% CI: 0.64, 0.45–0.91, per 10 g/1000 kcal increase; Supplementary Table 2). Multivariable models containing all a priori selected covariates did not materially change our results for SIC (data not shown).

Discussion

In this large cohort of European adults, we found a suggestive positive association for smoking and suggestive inverse associations for vegetables and total fat intake with SIC. However, our sample size was limited, particularly in analyses stratified by histologic subtype. In agreement with existing evidence from other cancer databases [26], the most common histological subtype in our study was carcinoid tumours (39%) followed by adenocarcinomas (32%), with adenocarcinomas occurring most frequently in the duodenum and carcinoid tumours in the ileum.

Although the literature is still limited for SIC, there is now strong evidence for the positive association between smoking and CRC [27]. In our univariable model, current smokers had an elevated risk of SIC but the association attenuated upon stratification by age, sex and country. Our findings support the meta-analysis (28) of four case–control studies [29,30,31,32] and one prospective cohort study [33] investigating small intestinal adenocarcinoma risk, which yielded a non-significant pooled risk ratio (RR) of 1.24 (95% CI: 0.71–2.17) for those in the highest versus lowest category of smoking. A previous European case–control study suggested that ever being a smoker was positively associated with carcinoid tumours in the small intestine [34]; however, a case–control [35] and a cohort study [33] did not replicate these results.

According to the IARC Monographs, there is convincing evidence to conclude that alcohol consumption is causally related to CRC risk, but the evidence is not sufficient for SIC [27]. Our study found no associations for alcohol and SIC, which is in agreement with a meta-analysis of small intestinal adenocarcinoma [28] of four case–control studies [29,30,31,32] and one cohort study [33]. Evidence is sparse for the role of alcohol consumption for carcinoid tumours, with one case–control study [29] that suggested a positive association, while two case–control studies [34, 35] and a cohort study [33] reported null results.

There is consistent evidence of a protective association between physical activity and CRC [18, 36, 37]; however, we observed no associations between physical activity and SIC overall or by subtype. The only other cohort study investigating physical activity in relation to SIC studied adenocarcinoma specifically and found no association [33].

While obesity is an established risk factor for CRC, a meta-analysis of cohort studies showed that the association is more sensitive to anthropometric indexes of abdominal obesity than to overall obesity [38]. Similarly, the results from the only prospective cohort study that investigated the role of obesity in SIC using EPIC data suggested that abdominal obesity (measured by WC), rather than overall adiposity (measured by BMI), was positively associated with SIC, specifically for adenocarcinomas [21]. However, the interpretability of results was limited due to an even smaller number of cases than observed in this current study.

While there is little evidence in the literature for an association between red meat or processed meat intake and SIC, there is a considerable amount of data suggesting a positive association for cancers of the colorectum [39], oesophagus and stomach [40, 41]. The current analysis found no significant association for red or processed meat and SIC, except for an inverse association for processed meat and adenocarcinoma that was only observed in sensitivity analyses upon further adjustment for WC. The only other cohort study that explored these dietary factors reported no associations between red or processed meats and SIC [19]. There are case–control studies [31, 32, 42] that have reported significantly increased risks of SIC with consumption of red and processed meat; however, these are subject to various biases and two were small in size [31, 32].

Evidence for the association between vegetable intake and SIC is limited to a case–control study [31], which reported a reduced risk of small intestinal adenocarcinoma for individuals with high intake of vegetables. This is the first prospective cohort study examining the association between vegetable intake and SIC risk. In our univariable models for vegetable intake, the risk of SIC and carcinoid tumours was significantly decreased across the tertiles and in continuous data but these associations attenuated in multivariable models.

Although some studies [43, 44] support a positive association between dietary fat and CRC risk, the evidence for fat and its subtypes in relation to SIC is limited to one other cohort study [19]. This previous study in the US reported a positive association between saturated fat consumption and carcinoid tumours, a suggestive elevation in risk for adenocarcinoma with polyunsaturated fat intake, but no association for monounsaturated fat. In contrast, our study yielded an inverse association for total fat in relation to SIC, adenocarcinoma and carcinoid tumour but only in the middle compared to the lowest tertile of intake. Although the inverse association was also observed in the middle category of monounsaturated fat intake and SIC overall in multivariable models, the findings by subtype in our data were not statistically significant in either group. An important potential difference between the data from the US and Europe is the likely different sources of fat. In Europe, particularly Southern Europe, significant sources of monounsaturated fat intake would include olive oil [45], whereas in the US it would include French fries, potato chips, whole milk and ground beef in adults [46].

Studies investigating the association between dietary fibre intake and CRC risk have yielded inconsistent results [43, 47, 48], but inverse associations for whole grains specifically in relation to CRC are more robust [49]. We observed no associations for fibre and SIC, which is in agreement with a previous prospective cohort study [50]. Unfortunately, we were unable to estimate intake of whole grains in EPIC.

The strengths of the present study include the large size and long follow-up of the cohort, which has allowed us to study lifestyle and dietary risk factors by histological subtypes of SIC. The collection of exposure information at baseline and comprehensive follow-up through tumour registry linkage and/or active follow-up provided valuable information about temporality, minimised recall bias and reduced selection bias. However, this study lacked time-varying information on exposures and covariates as these were only measured at baseline. Other limitations included potential measurement error due to use of dietary data from self-reported questionnaires and the relatively small number of incident cases, which restricted the power to detect associations and the interpretability of results, especially by histological subtypes. It should be noted that the findings of this study are exploratory, and they should be interpreted cautiously. Nevertheless, this study is valuable as it is one of the few prospective cohort studies investigating diet and lifestyle factors in relation to SIC and its histological subtypes.

In summary, the epidemiological evidence for dietary and lifestyle risk factors for SIC is limited to mostly case–control studies and only a handful of prospective cohort studies. Therefore, the exploration of these risk factors in EPIC provides valuable insight with its prospective design and large sample size. This study revealed suggestive inverse associations for vegetable intake with SIC and carcinoid tumour risk as well as suggestive inverse associations for total fat with SIC overall and by histological subtypes. Additional research is needed to investigate the associations with a larger number of cases, which could be achieved by pooling existing studies with relevant data in order to suggest preventive strategies for SIC.