Lifestyle predictors of colorectal cancer in European populations: a systematic review

Elly Mertens; Maria Keuchkarian; Maria Salve Vasquez; Stefanie Vandevijvere; José L Peñalvo

doi:10.1136/bmjnph-2022-000554


Systematic review


Elly Mertens¹ (http://orcid.org/0000-0003-4957-4235), Maria Keuchkarian¹,², Maria Salve Vasquez³, Stefanie Vandevijvere³ and José L Peñalvo¹,⁴

¹Unit of Non-Communicable Diseases, Department of Public Health, Institute of Tropical Medicine, Antwerp, Belgium
²Faculty of Bioscience Engineering, Ghent University, Gent, Belgium
³Department of Epidemiology and Public Health, Sciensano, Brussels, Belgium
⁴Global Health Institute, University of Antwerp, Wilrijk, Belgium

Correspondence to Dr Elly Mertens; ellymertens{at}itg.be

Abstract

Background Colorectal cancer (CRC) is the second most prevalent cancer in Europe, with one-fifth of cases attributable to unhealthy lifestyles. Risk prediction models for quantifying CRC risk and identifying high-risk groups have been developed or validated across European populations, some considering lifestyle as a predictor.

Purpose To identify lifestyle predictors considered in existing risk prediction models applicable for European populations and characterise their corresponding parameter values for an improved understanding of their relative contribution to prediction across different models.

Methods A systematic review was conducted in PubMed and Web of Science from January 2000 to August 2021. Risk prediction models were included if (1) developed and/or validated in an adult asymptomatic European population, (2) based on non-invasively measured predictors and (3) reported mean estimates and uncertainty for predictors included. To facilitate comparison, model-specific lifestyle predictors were visualised using forest plots.

Results A total of 21 risk prediction models for CRC (reported in 16 studies) were eligible, of which 11 were validated in a European adult population but developed elsewhere, mostly in the USA. All models but two reported at least one lifestyle factor as a predictor. Of the lifestyle factors, the most common predictors were body mass index (BMI) and smoking (each present in 13 models), followed by alcohol (11) and physical activity (7), while diet-related factors were considered less often, the most common being meat (9), vegetables (5) and dairy (2). The independent predictive contribution of lifestyle predictors was generally greater when they were collected in greater detail, although effect size estimates for BMI, smoking and alcohol varied noticeably.

Conclusions Early identification of high-risk groups based on lifestyle data offers the potential to encourage participation in lifestyle change and screening programmes, and hence to reduce the CRC burden. We propose that the commonly shared lifestyle predictors be used further in public health prediction modelling to improve uptake of the models.

Preventive counselling

Data availability statement

Data are available upon request.


This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.


WHAT IS ALREADY KNOWN ON THIS TOPIC

Colorectal cancer is the second most prevalent cancer in Europe with one-fifth of cases attributable to unhealthy lifestyles, hence employing lifestyle data in a risk prediction model would facilitate identification of high-risk groups or individuals that would benefit the most from participation in lifestyle change and screening programmes.
Most of the available models for colorectal cancer risk prediction have been developed in the USA, carrying intrinsic risk factors, and those available for European populations have not been comprehensively compared and evaluated.

WHAT THIS STUDY ADDS

The study provides a comprehensive summary of population-based risk prediction models of primary colorectal cancer that are applicable to European adult populations and incorporate easily available predictors, such as lifestyle data.
Beyond older age and male sex, the commonly shared, easily available predictors for colorectal cancer risk prediction were family history of (colorectal) cancer, the use of non-steroidal anti-inflammatory drugs, overweight or obesity, and lifestyle variables such as alcohol consumption, smoking and physical inactivity, while diet-related factors were considered less often.

HOW THIS STUDY MIGHT AFFECT RESEARCH, PRACTICE OR POLICY

Findings from this study will be relevant for future public health prediction modelling and support the use of lifestyle data to enhance the credibility and uptake of prediction models across different settings and populations.

Background

Colorectal cancer (CRC) was estimated to be the second most frequently diagnosed cancer (after breast cancer) and the second leading cause of cancer-related death (after lung cancer) in Europe, with nearly 520 000 new cases and 245 000 deaths in 2020, corresponding to one-eighth of the total cancer burden.1 Population-based screening has contributed substantially to reductions in this burden,2 with 20 Member States of the European Union offering screening programmes.3 In addition to the implementation of CRC screening strategies targeting the average-risk population aged 50–75 years, the gradual development of CRC (over 10–15 years) provides an opportunity for primary prevention by reducing modifiable CRC risk factors, such as excess body weight, smoking, alcohol consumption, physical inactivity and unhealthy diets. Lack of adherence to healthy lifestyle recommendations, potentially also partly due to barriers to prevention policy implementation, has been estimated to account for almost one-fifth of CRC cases in Europe.4 Early identification of high-risk groups or individuals would offer the potential for them to participate in tailored lifestyle programmes as well as existing screening programmes.

A number of risk prediction models for primary CRC have been developed and summarised in previous systematic reviews,5–8 including two identifying all published models incorporating known genetic markers.9 10 Introducing genetic information into a risk model that also includes family history and/or phenotypic variables has been shown to modestly improve discriminatory performance,11–13 though their clinical use in routine real-life settings remains uncertain, as it requires considerations on the wider financial, ethical, legal, social and health concerns, including the cost-benefit/health risk-benefit of measuring additional (genetic) risk factors among others.14 On the other hand, risk prediction models incorporating easily available predictors, such as lifestyle data, are particularly relevant to facilitate risk stratification among the general population. However, most of the available models for CRC risk prediction have been developed in the USA carrying intrinsic risk factors,15 and those available in Europe have not been comprehensively compared and evaluated.

The aim of this review is to systematically assess population-based risk prediction models of primary CRC, based on demographic and phenotypic factors, developed and/or validated for European adult populations, including an evaluation of the risk of bias in the model development and validation. In addition, this review aims to identify the lifestyle predictors considered in existing risk prediction models applicable for European populations and to characterise and compare their corresponding parameter values for an improved understanding of their relative contribution to prediction across the different models.

Materials and methods

A systematic literature review was performed in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines during all stages of the design, implementation and reporting of the systematic review.16

Search strategy

We performed an electronic literature search in PubMed and Web of Science from January 2000 to August 2021 using key words related to “colorectal cancer”, “risk”, and “model” and “prediction/assessment/estimation”. We then carried out hand searches of the citations of the retrieved systematic reviews. In addition, through hand searches, studies describing the development of models that were validated in the eligible studies but developed in non-European populations were retrieved and considered for inclusion.

Study selection

To be included in the systematic review, studies had to be published as a primary research paper in a peer-reviewed journal and describe the development and/or the validation (performance assessment) of a risk prediction model identifying groups or individuals at higher risk of CRC or advanced colorectal neoplasia. Source data had to concern European populations of asymptomatic (for cancer) adults from the general population or adults presenting at preventive CRC screening. The risk model had to be based on two or more phenotypic predictors that are readily available from individuals or from their medical records without the need for laboratory tests. Other frequent variables from patients’ consultation questionnaires, such as colon-related symptoms (rectal bleeding, change in bowel habits, diarrhoea, constipation, abdominal pain, weight loss, loss of appetite, mucus in the stool), extensive laboratory analysis and/or genetic information, such as single nucleotide polymorphisms and omics, were not considered for inclusion. Furthermore, studies were included in the quantitative analyses if estimates and uncertainty of the predictors were reported. Conference proceedings, papers in languages other than English and studies of a specific population subgroup with (multi)morbidity, as well as risk models incorporating extensive patient consultation and/or genetic information, were excluded.

Title and abstract screening, followed by a full-text review of the studies complying with the inclusion/exclusion criteria, was performed independently by two investigators. Any discrepancy during the selection of the studies was resolved by consensus and, where necessary, group discussion among all investigators.

Data extraction and synthesis

Data extraction for each paper was performed in duplicate using a standardised electronic Excel template based on the framework of the critical appraisal and data extraction for systematic reviews of prediction modelling studies checklist17 to extract information on each risk prediction model. When the same study described multiple risk prediction models or applied multiple data sources for validation, each prediction model or data source was included separately. Any discrepancy after comparing the data extracted in duplicate was resolved by consensus and, where necessary, group discussion among all investigators.

Extracted information included publication details (author, year, country, study name if applicable); study setting and population (outcome to be predicted, timeframe of prediction, source of data, sample size including total number for development and/or validation, number with outcome and number excluded); methods of model development (type of regression model, variable selection method, missing data handling); predicting variables (the number of potential predictors considered and selected, and their associated parameters, ie, exponentiated regression coefficients with a measure of uncertainty, that is, SE or 95% CI); and, if available, reported performance measures in internal or external validation for calibration (calibration plot, the ratio of expected to observed (E/O) probabilities, Hosmer-Lemeshow test) and discrimination (area under the receiver operating characteristic curve (AUROC)).
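The relationship between a regression coefficient, its SE and the exponentiated estimate with a 95% CI can be illustrated with a minimal Python sketch. It assumes a normal approximation on the log scale (z = 1.96), which is a common convention and not necessarily what each included study used; the function names are hypothetical and introduced here for illustration only.

```python
import math

Z_95 = 1.96  # assumed normal quantile for a two-sided 95% CI


def exponentiate_with_ci(beta, se, z=Z_95):
    """Turn a log-scale coefficient and its SE into (estimate, lower, upper) on the ratio scale."""
    return math.exp(beta), math.exp(beta - z * se), math.exp(beta + z * se)


def se_from_ci(lower, upper, z=Z_95):
    """Recover the log-scale SE from a reported 95% CI on the ratio scale."""
    if lower <= 0 or upper <= lower:
        raise ValueError("CI bounds must satisfy 0 < lower < upper")
    return (math.log(upper) - math.log(lower)) / (2 * z)


if __name__ == "__main__":
    # Round trip on illustrative (not extracted) values.
    estimate, lower, upper = exponentiate_with_ci(beta=0.05, se=0.015)
    print(round(estimate, 3), round(lower, 3), round(upper, 3))
    print(round(se_from_ci(lower, upper), 4))
```

Recovering the SE from a reported interval in this way is useful when a study reports only the CI, since both forms carry the same information under the stated approximation.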

Bias assessment was performed in parallel to data extraction, also in duplicate, and for both model development and validation, following the framework of the prediction model risk of bias assessment tool (PROBAST),18 which allows each study to be classified as having a high, unclear or low risk of bias for the domains of participants, predictors, outcome and analyses. No studies were excluded based on bias assessment alone.

Data analysis

Eligible studies and their prediction models were summarised in evidence tables. Furthermore, we examined whether the established lifestyle aetiological risk factors, as taken from the Continuous Update Project (CUP) steered by the World Cancer Research Fund Network,19 were employed in the different risk prediction models. After identifying those lifestyle factors with both an explanatory and a predictive character, their retrieved estimates and uncertainty were standardised to be visually compared in forest plots, stratified according to their choice of comparison: for continuous variables, per X-level increment, and for categorical variables, the contrast between the groups, using the extremes if more than two groups were available. The type of estimate varied between the studies included in our systematic review, hence conversion of ORs and HRs into a risk ratio (also known as relative risk, RR) was necessary for comparability. All non-RR point estimates were converted to RR using one of the following equations:

where p_0 and r represent the baseline risk and the incidence rate, respectively, of the outcome for the reference group; when these were not reported for the reference category of the risk factor under study, the incidence proportion or rate for the overall study population was used. Studies were omitted from the quantitative analyses when they did not report a measure of uncertainty for the predictors included in their final prediction model, or when their risk prediction model was built on estimates of RRs taken from the literature. Statistical analyses were carried out in R V.4.1.2, and a p value of <0.05 was considered statistically significant.
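As an illustration of such a conversion, the sketch below uses two widely used approximations that match the symbols p_0 and r defined above: RR = OR / (1 − p_0 + p_0 × OR) for ORs, and RR = (1 − exp(HR × ln(1 − r))) / r for HRs. That these are the exact equations applied in this review is an assumption, as is the choice to apply the same conversion to the CI bounds; the analyses themselves were run in R, and the Python function names here are hypothetical.

```python
import math


def or_to_rr(odds_ratio, p0):
    """Approximate RR from an OR given the baseline risk p0 in the reference group."""
    if not 0 <= p0 < 1:
        raise ValueError("p0 must lie in [0, 1)")
    return odds_ratio / (1 - p0 + p0 * odds_ratio)


def hr_to_rr(hazard_ratio, r):
    """Approximate RR from an HR given the incidence r in the reference group."""
    if not 0 < r < 1:
        raise ValueError("r must lie in (0, 1)")
    # Under proportional hazards, risk in the exposed group is 1 - (1 - r) ** HR.
    return (1 - math.exp(hazard_ratio * math.log(1 - r))) / r


def convert_interval(convert, estimate, lower, upper, baseline):
    """Apply the same conversion to a point estimate and both CI bounds (an approximation)."""
    return tuple(convert(value, baseline) for value in (estimate, lower, upper))


if __name__ == "__main__":
    # Illustrative inputs only, not values extracted in the review.
    print(convert_interval(or_to_rr, 1.5, 1.1, 2.0, 0.05))
    print(convert_interval(hr_to_rr, 1.5, 1.1, 2.0, 0.05))
```

For a rare outcome such as CRC, p_0 and r are small and both conversions leave the estimate close to the original OR or HR, which is why the choice of baseline matters most in high-prevalence screening settings.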

Results

The initial search yielded 2365 articles, and after removing duplicates, 1613 abstracts were screened, yielding 23 articles to be retrieved for full-text review (figure 1). After exclusion of 18 articles for various reasons (as listed in figure 1), and the additional inclusion of 12 full-text articles identified through hand searching of citations (ie, five from previously published reviews, and seven from studies reporting the validation for a European population of a prediction model developed elsewhere), a total of 17 studies were included in the present review.
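The study flow can be checked with simple arithmetic. The short Python sketch below uses only the counts stated in the text; the variable names are hypothetical, and the two derived counts (duplicates removed, records excluded at screening) are computed from those numbers and assume no other losses between stages.

```python
# Counts as reported in the text.
records_identified = 2365
abstracts_screened = 1613
full_text_reviewed = 23
full_text_excluded = 18
from_previous_reviews = 5
from_validation_studies = 7

# Derived counts (assuming no other losses between stages).
duplicates_removed = records_identified - abstracts_screened
excluded_at_screening = abstracts_screened - full_text_reviewed
added_by_hand_search = from_previous_reviews + from_validation_studies
studies_included = full_text_reviewed - full_text_excluded + added_by_hand_search

if __name__ == "__main__":
    print("duplicates removed:", duplicates_removed)
    print("excluded at title/abstract screening:", excluded_at_screening)
    print("studies included:", studies_included)
```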

Figure 1

Flowchart of studies included in the review. From: Page MJ et al.52 For more information, visit: http://www.prisma-statement.org/.

Model development studies

This review identified eight studies20–27 describing risk prediction models developed in a European population (accounting for a total of 10 different models), and eight studies28–35 describing risk prediction models developed elsewhere but validated in Europe (accounting for a total of 11 different models) (online supplemental table 1). The majority of the latter were developed in US populations (seven models).28–30 32 35

In addition to the country of origin of the model, the risk prediction models identified were differentiated by their choice of predicted outcome: either prevalent advanced neoplasia at screening20–25 or CRC incidence at 5–20 years from assessment.26–35 The former aims to classify at-risk individuals as eligible for screening (ie, screening eligibility) and the latter to identify population groups at higher risk of CRC who should benefit most from preventive programmes (ie, population-wide primary prevention). Accordingly, the data sources and the methods used for model development differed: prevalence models determining screening eligibility used logistic regression with cross-sectional data from screening records, while models predicting incidence generally used a Cox (proportional hazards) regression model with data from prospective cohorts including record linkage with cancer registries.

When assessing bias according to PROBAST, most models developed were considered to carry either an unclear or a high risk of bias for the domain of analyses (A), mainly due to inadequate handling of participants with missing data20–23 25 27 29–32 and/or applying univariate analysis for selecting predictors,20 29 31 33 as well as a lack of accounting for optimism and overfitting.20 22 23 25 30 34

Variables included in the risk prediction models

Predictors were categorised into five types: demographic, medical history (family and personal) and lifestyle (anthropometrics and lifestyle factors) (online supplemental table 2). The number of predicting variables varied widely: from two (as in the model of Taylor et al)32 to thirteen (as in the women-only model of Colditz et al).28 From the list of variables selected in the risk prediction models, age was included in all risk prediction models (16 models),20–25 27 29–35 while the other most commonly identified predictors were body mass index (BMI) (13 models),20 21 23 26 28–31 33–35 smoking (13 models),23–27 29–31 34 alcohol consumption (11 models),25–29 31 33–35 family history of colorectal/colon cancer (11 models),21 23–25 28 30 32 33 35 followed by physical activity (7 models),26–31 35 sex (8 models),20–25 34 and the use of non-steroidal anti-inflammatory drugs (NSAIDs) (4 models).25 28 30 35 Diet-related factors were selected as predictors in only a limited number of models, with the most shared being the consumption of meat (included in nine models, of which two as total meat,33 four as red meat25–28 and three as processed meat)26 27 and vegetables (five models).27 28 30 No lifestyle factors were included in two models.22 32

For a visual comparison of model-specific estimates of lifestyle predictors that are also recognised as aetiological factors, data from 12 studies representing 16 risk prediction models (half of them developed in Europe20 21 23–27 and half only validated in Europe29–35) were depicted. The predictors from the models of Usher-Smith et al26 and Colditz et al28 were excluded because they obtained RR estimates from the literature.

From the lifestyle risk factors considered convincing or probable by the CUP programme,19 the following factors were also included in the risk prediction models: BMI and waist circumference as anthropometric predictors, and as lifestyle factors physical activity, alcohol, meat (red, processed and total) and dairy consumption as well as smoking. Online supplemental table 3 presents the model-specific effect sizes as reported, as well as the RR, as displayed in the forest plots (figure 2).

Figure 2

Forest plots of standardised (RR and corresponding 95% CI) estimates of lifestyle predictors, that are also recognised as aetiological factors, across risk prediction models stratified by their choice of comparison group.* *Excluded from the plots are the models of Usher-Smith et al26 and Colditz et al28 because they obtained their relative risk estimates from the literature. (A) Anthropometrics (BMI in kg/m² and waist circumference in cm). (B) Lifestyle factors: alcohol. (C) Lifestyle factors: smoking. (D) Lifestyle factors: physical activity. (E) Lifestyle factors: diet. ACRN, advanced colorectal neoplasia; BMI, body mass index; cig, cigarettes; CPhM, Cox proportional-hazards model; d, distal colon; EUR, European population (‘in’ developed in a European population, and ‘out’ developed outside a European population); IN, incidence; LR, logistic regression; M, model developed in men; p, proximal colon; r, rectal colon; RR, relative risk; W, model developed in women.

For anthropometrics, the model-specific RR estimates for BMI ranged from 0.90 (95% CI 0.58 to 1.36) to 1.50 (1.00 to 2.10) for overweight (nine models),20 23 29 30 33 35 from 0.95 (0.58 to 1.52) to 1.93 (1.27 to 2.82) for obesity (eight models),20 23 29 30 and varied between 1.00 (0.90 to 1.10) and 1.05 (1.02 to 1.08) per one unit increment in BMI (three models).21 31 34 For waist circumference, the estimates ranged between 1.05 (1.01 to 1.09) and 1.19 (1.13 to 1.23) per 10 cm increment (three models)27 (figure 2A). These continuous RR estimates were slightly greater than those calculated by the CUP dose-response meta-analysis (online supplemental table 3A).

For the lifestyle behaviour predictors, their independent predictive contribution was generally greater when they were collected in greater detail, allowing for comparison of extremes instead of a two-level categorical variable (online supplemental table 3B). Additionally, the predictive contribution of alcohol consumption and smoking showed noticeable variation: for alcohol, estimates ranged from 1.14 (1.06 to 1.22) to 1.35 (1.08 to 1.69) when comparing high versus low (three models)27 29 and from 0.99 (0.93 to 1.06) to 1.93 (1.21 to 2.83) when comparing extremes (four models)25 33 35 (figure 2B), and for smoking from 1.16 (1.06 to 1.27) to 1.41 (1.17 to 1.70) when comparing ever versus never (four models)27 29 and from 1.06 (0.93 to 1.21) to 1.70 (1.12 to 2.33) when comparing extremes (six models)23 30 31 34 35 (figure 2C). For physical activity and diet, the model-specific effect sizes across risk prediction equations were of similar magnitude, although the number of risk model equations was limited (figures 2D and 2E).

Model validation studies

A total of 11 studies were identified describing the validation of a risk prediction model for CRC in a European population, either for models developed in Europe (8 studies20–27 validating a total of 10 models) or for models developed elsewhere (1 study36 validating a total of 11 models) (online supplemental table 4). For the models developed in a European population, four of them20–22 24 were only internally validated, that is, the development data were also used for validation, while four23 27 were validated by splitting the data into training and testing sets, and two were externally validated using a different data source.25 26 Model calibration was reported by a calibration plot displaying the observed against the predicted probabilities (5 studies)20 24 26 27 36 and/or the Hosmer-Lemeshow test (5 studies),21–25 all suggesting no evidence of significant over- or underprediction of risk. All studies, except one,21 reported the discriminating ability of the risk prediction model, as operationalised using the AUROC, that is, the c-statistic. In general, various levels of estimated discrimination were observed, with c-statistics varying between 0.58 (ref. 30) and 0.76 (ref. 24), yet this was irrespective of the origin of the model and the data sources used for validation.

According to PROBAST, most models validated were considered to carry either an unclear or a high risk of bias for the domain of ‘analyses’ (A) because of inadequate handling of participants with missing data20–23 25 27 29–32 36 and/or only considering calibration instead of both calibration and discrimination.21 External validation studies were considered to carry an unclear risk of bias for the domains of ‘predictors’ (P) and ‘outcome’ (O) where the predictor assessment and the prediction time interval, respectively, in the external data source diverged from those intended at model development.36
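For readers less familiar with these performance measures, the following minimal Python sketch shows how the E/O ratio and the c-statistic can be computed from predicted probabilities and observed binary outcomes. It is an illustration under simple assumptions (no censoring, ties counted as half-concordant) and not the code used by any of the included studies; the function names and the example data are hypothetical.

```python
def expected_observed_ratio(predicted, outcomes):
    """Calibration-in-the-large: sum of predicted risks divided by the number of observed events."""
    observed = sum(outcomes)
    if observed == 0:
        raise ValueError("at least one observed event is required")
    return sum(predicted) / observed


def c_statistic(predicted, outcomes):
    """AUROC as the share of case/non-case pairs in which the case has the higher predicted risk."""
    cases = [p for p, y in zip(predicted, outcomes) if y == 1]
    non_cases = [p for p, y in zip(predicted, outcomes) if y == 0]
    if not cases or not non_cases:
        raise ValueError("both cases and non-cases are required")
    concordant = 0.0
    for case_risk in cases:
        for non_case_risk in non_cases:
            if case_risk > non_case_risk:
                concordant += 1.0
            elif case_risk == non_case_risk:
                concordant += 0.5  # ties count as half-concordant
    return concordant / (len(cases) * len(non_cases))


if __name__ == "__main__":
    # Made-up predicted risks and outcomes, for illustration only.
    risks = [0.02, 0.05, 0.10, 0.20, 0.03, 0.08]
    events = [0, 0, 1, 1, 0, 0]
    print(expected_observed_ratio(risks, events))
    print(c_statistic(risks, events))
```

An E/O ratio near 1 indicates that the model predicts about as many events as were observed, while a c-statistic of 0.5 corresponds to chance-level discrimination and 1.0 to perfect separation of cases from non-cases.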

Discussion

This systematic review and meta-analysis summarised and quantified the evidence published over the last two decades on primary CRC risk prediction models with routinely available or easily ascertained predictors, validated for a European population. In addition to older age and male sex, other commonly shared risk factors identified in the risk prediction models reviewed were family history of (colorectal) cancer, the use of NSAIDs, overweight or obesity, and lifestyle variables such as alcohol consumption, smoking and physical inactivity. Validation studies suggested overall good calibration, as shown by calibration plot and/or Hosmer-Lemeshow test, and acceptable discrimination, as shown by c-statistics close to 0.7 for most models.

To the best of our knowledge, this work is the first to visualise the predictive value of commonly present lifestyle predictors, that are also recognised as aetiological, employed in CRC risk prediction models. Supported by these results and proven associations from aetiological studies, targeting lifestyle factors, including diet, in those at highest risk could complement CRC screening and prevention programmes as a means to reduce cancer risk and improve overall health and survival after cancer diagnosis.37 38 In particular, dietary exposures, which may play a prominent role in CRC prevention,39 are inherently challenging to assess, and are therefore rarely considered as predictors. Though predictors may be any variable associated with the outcome, causally or otherwise, considering particularly the previously identified causal factors as predictors would enhance the credibility and uptake of the model in different settings and populations.40 41 Both the models of Usher-Smith et al26 and of Colditz et al28 summarised the probable/convincing evidence of associations into a risk score using RRs available in the published literature. Establishing a claim of prediction from association studies in this way is a recognised conflation in causal research, and likewise the most frequent conflation type in prediction studies is the aetiological interpretation of prediction results, attributing causal meaning to the individual predictors.42 43

Still, several key statistical considerations were often not addressed in the existing CRC risk prediction studies, and hence they were considered to be at unclear or high risk of bias for their analyses. This concerned not only the selection of predictors (ie, univariate prior to multivariate)18 but also the handling of missing data and the corrections for optimism and overfitting. Consistent with the literature, the most commonly adopted approach for handling missing data was complete-case analysis, that is, (automatically) removing individuals with missing data on predictor or outcome variables from the analysis, in spite of its increased susceptibility to bias in the estimated model parameters and the model’s predictive performance.44 45 Instead, multivariable imputation models, that is, generating (multiple) imputation(s) conditional on observed patient characteristics, have been generally recommended to avoid bias in model development and validation44 45 as well as during model application in clinical practice,46 47 but are rarely implemented (in this review, in only four studies). Furthermore, with the increasing interest in accurate risk prediction, it is key to evaluate a model’s external validity, that is, its predictive performance outside of the development sample. While a vast majority of external validation studies were poorly reported/performed,48 49 current research recognises a potentially inferior performance of prediction models in external validation studies.49 This relates back to the need for model development studies to adjust for model overfitting and optimism in model performance by including internal validation techniques such as cross-validation or bootstrapping.49 If optimism is present, adjusting or shrinking the model predictive performance estimates and predictors in the final model may be needed, provided that an adequately large development sample with a reasonable number of events per variable is available.50 Future (CRC) risk prediction model development studies should, therefore, incorporate improved methodological quality by at least avoiding univariable selection before multivariable modelling, applying multiple imputation for missing data, and adjusting for model overfitting and optimism.

Evidence synthesis of studies assessing a model’s performance in new individuals (ie, external validation) plays a key role in interpreting the potential applicability and generalisability of a prediction model across different settings and populations.51 However, in this study, the retrieved estimates of model discrimination and calibration could not be summarised into a weighted average because external validation studies for CRC risk prediction models were limited for European populations. Nevertheless, model performance was similar for models developed inside or outside of Europe, suggesting a high generalisability of models incorporating demographic and phenotypic (eg, lifestyle) factors. In particular, models with high predictive performance across different populations and subgroups might, therefore, have good potential for future implementation in screening and clinical settings.

Conclusions

In conclusion, lifestyle factors, beyond age and sex, were identified as significant modifiable predictors in multiple risk prediction models for CRC. Early identification of high-risk groups or individuals based on lifestyle-based data would, therefore, offer the potential to encourage participation in tailored lifestyle and screening programmes and subsequently reduce the CRC burden. However, external validation of the models identified is recommended to further investigate their predictive performance across different settings and populations, aiding the selection and optimisation of the best models for use in clinical practice.

Ethics statements

Patient consent for publication

Ethics approval

Not applicable.

Acknowledgments

The authors express special thanks to Chiara Colizzi and James Cottam for their contribution to the underlying process of data retrieval.

References

1. Ferlay J, Ervik M, Lam F, et al. Global cancer observatory: cancer today. Lyon, France: International Agency for Research on Cancer; 2020. Available: https://gco.iarc.fr/today [Accessed 06 Jan 2022].
2. Cardoso R, Guo F, Heisser T, et al. Colorectal cancer incidence, mortality, and stage distribution in European countries in the colorectal cancer screening era: an international population-based study. Lancet Oncol 2021;22:1002–13. doi:10.1016/S1470-2045(21)00199-6
3. European Commission. Cancer screening in the European Union: scientific advice on improving cancer screening across the EU. Brussels: Group of Chief Scientific Advisors, 2022: 48.
4. Aleksandrova K, Pischon T, Jenab M, et al. Combined impact of healthy lifestyle factors on colorectal cancer: a large European cohort study. BMC Med 2014;12:168. doi:10.1186/s12916-014-0168-4
5. Win AK, Macinnis RJ, Hopper JL, et al. Risk prediction models for colorectal cancer: a review. Cancer Epidemiol Biomarkers Prev 2012;21:398–410. doi:10.1158/1055-9965.EPI-11-0771
6. Ma GK, Ladabaum U. Personalizing colorectal cancer screening: a systematic review of models to predict risk of colorectal neoplasia. Clin Gastroenterol Hepatol 2014;12:1624–34. doi:10.1016/j.cgh.2014.01.042
7. Usher-Smith JA, Walter FM, Emery JD, et al. Risk prediction models for colorectal cancer: a systematic review. Cancer Prev Res (Phila) 2016;9:13–26. doi:10.1158/1940-6207.CAPR-15-0274
8. Williams TGS, Cubiella J, Griffin SJ, et al. Risk prediction models for colorectal cancer in people with symptoms: a systematic review. BMC Gastroenterol 2016;16:63. doi:10.1186/s12876-016-0475-7
9. McGeoch L, Saunders CL, Griffin SJ, et al. Risk prediction models for colorectal cancer incorporating common genetic variants: a systematic review. Cancer Epidemiol Biomarkers Prev 2019;28:1580–93. doi:10.1158/1055-9965.EPI-19-0059
10. Sassano M, Mariani M, Quaranta G, et al. Polygenic risk prediction models for colorectal cancer: a systematic review. BMC Cancer 2022;22:65. doi:10.1186/s12885-021-09143-2
11. Iwasaki M, Tanaka-Mizuno S, Kuchiba A, et al. Inclusion of a genetic risk score into a validated risk prediction model for colorectal cancer in Japanese men improves performance. Cancer Prev Res (Phila) 2017;10:535–41. doi:10.1158/1940-6207.CAPR-17-0141
12. Xin J, Chu H, Ben S, et al. Evaluating the effect of multiple genetic risk score models on colorectal cancer risk prediction. Gene 2018;673:174–80. doi:10.1016/j.gene.2018.06.035
13. Weigl K, Thomsen H, Balavarca Y, et al. Genetic risk score is associated with prevalence of advanced neoplasms in a colorectal cancer screening population. Gastroenterology 2018;155:88–98. doi:10.1053/j.gastro.2018.03.030
14. Cecile A, Janssens JW, Joyner MJ. Polygenic risk scores that predict common diseases using millions of single nucleotide polymorphisms: is more, better? Clin Chem 2019;65:609–11. doi:10.1373/clinchem.2018.296103
15. Arnold M, Sierra MS, Laversanne M, et al. Global patterns and trends in colorectal cancer incidence and mortality. Gut 2017;66:683–91. doi:10.1136/gutjnl-2015-310912
16. Page MJ, McKenzie JE, Bossuyt PM, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. J Clin Epidemiol 2021;134:178–89. doi:10.1016/j.jclinepi.2021.03.001
17. Moons KGM, de Groot JAH, Bouwmeester W, et al. Critical appraisal and data extraction for systematic reviews of prediction modelling studies: the CHARMS checklist. PLOS Med 2014;11:e1001744. doi:10.1371/journal.pmed.1001744
18. Moons KGM, Wolff RF, Riley RD, et al. PROBAST: a tool to assess risk of bias and applicability of prediction model studies: explanation and elaboration. Ann Intern Med 2019;170:W1. doi:10.7326/M18-1377
19. World Cancer Research Fund/American Institute for Cancer Research. Diet, nutrition, physical activity and cancer: a global perspective. Continuous update project expert report; 2018.
20. Betés M, Muñoz-Navas MA, Duque JM, et al. Use of colonoscopy as a primary screening test for colorectal cancer in average risk people. Am J Gastroenterol 2003;98:2648–54. doi:10.1111/j.1572-0241.2003.08771.x
21. Hassan C, Pooler BD, Kim DH, et al. Computed tomographic colonography for colorectal cancer screening: risk factors for the detection of advanced neoplasia. Cancer 2013;119:2549–54. doi:10.1002/cncr.28007
22. Auge JM, Pellise M, Escudero JM, et al. Risk stratification for advanced colorectal neoplasia according to fecal hemoglobin concentration in a colorectal cancer screening program. Gastroenterology 2014;147:628–36. doi:10.1053/j.gastro.2014.06.008
23. Kaminski MF, Polkowski M, Kraszewska E, et al. A score to estimate the likelihood of detecting advanced colorectal neoplasia at colonoscopy. Gut 2014;63:1112–9. doi:10.1136/gutjnl-2013-304965
24. Stegeman I, de Wijkerslooth TR, Stoop EM, et al. Combining risk factors with faecal immunochemical test outcome for selecting CRC screenees for colonoscopy. Gut 2014;63:466–71. doi:10.1136/gutjnl-2013-305013
25. Tao S, Hoffmeister M, Brenner H. Development and validation of a scoring system to identify individuals at high risk for advanced colorectal neoplasms who should undergo colonoscopy screening. Clin Gastroenterol Hepatol 2014;12:478–85. doi:10.1016/j.cgh.2013.08.042
26. Usher-Smith JA, Sharp SJ, Luben R, et al. Development and validation of lifestyle-based models to predict incidence of the most common potentially preventable cancers. Cancer Epidemiol Biomarkers Prev 2019;28:67–75. doi:10.1158/1055-9965.EPI-18-0400
27. Aleksandrova K, Reichmann R, Kaaks R, et al. Development and validation of a lifestyle-based model for colorectal cancer risk prediction: the LiFeCRC score. BMC Med 2021;19:1. doi:10.1186/s12916-020-01826-0
28. Colditz GA, Atwood KA, Emmons K, et al. Harvard report on cancer prevention volume 4: Harvard cancer risk index. Risk Index Working Group, Harvard Center for Cancer Prevention. Cancer Causes Control 2000;11:477–88. doi:10.1023/a:1008984432272
29. Driver JA, Gaziano JM, Gelber RP, et al. Development of a risk score for colorectal cancer in men. Am J Med 2007;120:257–63. doi:10.1016/j.amjmed.2006.05.055
30. Freedman AN, Slattery ML, Ballard-Barbash R, et al. Colorectal cancer risk prediction tool for white men and women without known susceptibility. J Clin Oncol 2009;27:686–93. doi:10.1200/JCO.2008.17.4797
31. Ma E, Sasazuki S, Iwasaki M, et al. 10-year risk of colorectal cancer: development and validation of a prediction model in middle-aged Japanese men. Cancer Epidemiol 2010;34:534–41. doi:10.1016/j.canep.2010.04.021
32. Taylor DP, Stoddard GJ, Burt RW, et al. How well does family history predict who will get colorectal cancer? Implications for cancer screening and counseling. Genet Med 2011;13:385–91. doi:10.1097/GIM.0b013e3182064384
33. Shin A, Joo J, Yang H-R, et al. Risk prediction model for colorectal cancer: National Health Insurance Corporation study, Korea. PLoS One 2014;9:e88079. doi:10.1371/journal.pone.0088079
34. Steffen A, MacInnis RJ, Joshy G, et al. Development and validation of a risk score predicting risk of colorectal cancer. Cancer Epidemiol Biomarkers Prev 2014;23:2543–52. doi:10.1158/1055-9965.EPI-14-0206
35. Wells BJ, Kattan MW, Cooper GS, et al. Colorectal cancer predicted risk online (CRC-PRO) calculator using data from the multi-ethnic cohort study. J Am Board Fam Med 2014;27:42–55. doi:10.3122/jabfm.2014.01.130040
36. Smith T, Muller DC, Moons KGM, et al. Comparison of prognostic models to predict the occurrence of colorectal cancer in asymptomatic individuals: a systematic literature review and external validation in the EPIC and UK Biobank prospective cohort studies. Gut 2019;68:672–83. doi:10.1136/gutjnl-2017-315730
37. Demark-Wahnefried W, Rock CL, Patrick K, et al. Lifestyle interventions to reduce cancer risk and improve outcomes. Am Fam Physician 2008;77:1573–8.
38. Anderson AS, Mackison D, Boath C, et al. Promoting changes in diet and physical activity in breast and colorectal cancer screening settings: an unexplored opportunity for endorsing healthy behaviors. Cancer Prev Res (Phila) 2013;6:165–72. doi:10.1158/1940-6207.CAPR-12-0385
39. Veettil SK, Wong TY, Loo YS. Role of diet in colorectal cancer incidence: umbrella review of meta-analyses of prospective observational studies. JAMA Netw Open 2021;4:e2037341. doi:10.1001/jamanetworkopen.2020.37341
40. Schooling CM, Jones HE. Clarifying questions about “risk factors”: predictors versus explanation. Emerg Themes Epidemiol 2018;15:10. doi:10.1186/s12982-018-0080-z
41. van Diepen M, Ramspek CL, Jager KJ, et al. Prediction versus aetiology: common pitfalls and how to avoid them. Nephrol Dial Transplant 2017;32:ii1–5. doi:10.1093/ndt/gfw459
42. Ramspek CL, Steyerberg EW, Riley RD, et al. Prediction or causality? A scoping review of their conflation within current observational research. Eur J Epidemiol 2021;36:889–98. doi:10.1007/s10654-021-00794-w
43. Poldrack RA, Huckins G, Varoquaux G. Establishment of best practices for evidence for prediction: a review. JAMA Psychiatry 2020;77:534–40. doi:10.1001/jamapsychiatry.2019.3671
44. Sterne JAC, White IR, Carlin JB, et al. Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls. BMJ 2009;338:b2393. doi:10.1136/bmj.b2393
45. Janssen KJM, Vergouwe Y, Donders ART, et al. Dealing with missing predictor values when applying clinical prediction models. Clin Chem 2009;55:994–1001. doi:10.1373/clinchem.2008.115345
46. Nijman SWJ, Hoogland J, Groenhof TKJ, et al. Real-time imputation of missing predictor values in clinical practice. Eur Heart J Digit Health 2021;2:154–64. doi:10.1093/ehjdh/ztaa016
47. Hoogland J, van Barreveld M, Debray TPA, et al. Handling missing predictor values when validating and applying a prediction model to new patients. Stat Med 2020;39:3591–607. doi:10.1002/sim.8682
48. Collins GS, de Groot JA, Dutton S, et al. External validation of multivariable prediction models: a systematic review of methodological conduct and reporting. BMC Med Res Methodol 2014;14:40. doi:10.1186/1471-2288-14-40
49. Steyerberg EW. Validation in prediction research: the waste by data splitting. J Clin Epidemiol 2018;103:131–3. doi:10.1016/j.jclinepi.2018.07.010
50. Riley RD, Snell KIE, Martin GP, et al. Penalization and shrinkage methods produced unreliable clinical prediction models especially when sample size was small. J Clin Epidemiol 2021;132:88–96. doi:10.1016/j.jclinepi.2020.12.005
51. Debray TPA, Damen JAAG, Snell KIE, et al. A guide to systematic review and meta-analysis of prediction model performance. BMJ 2017;356:i6460. doi:10.1136/bmj.i6460
52. Page MJ, McKenzie JE, Bossuyt PM, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ 2021;372:n71. doi:10.1136/bmj.n71

Supplementary materials

Supplementary Data

This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.

Data supplement 1

Footnotes

Twitter @JosePenalvo
Contributors EM, JLP conceptualised and developed the research protocol and methodology; EM, MK, MSV developed standardised data extraction tools; EM, MK, MSV reviewed literature, selected eligible studies and performed data extraction. EM, JLP developed underlying calculation algorithms for standardised estimates and carried out statistical analyses. EM, JLP interpreted the results, drafted, reviewed and edited the manuscript. All authors reviewed and approved the final manuscript. EM acts as guarantor.
Funding Research supported by the Research Foundation of Flanders (FWO), Grant G0C2520N.
Competing interests None declared.
Provenance and peer review Not commissioned; externally peer reviewed by Dr. Emmanuel Baah, University of North Carolina System Nutrition Research Institute, USA.
Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.

