PredictCBC-2.0: a contralateral breast cancer risk prediction model developed and validated in ~ 200,000 patients
Breast Cancer Research volume 24, Article number: 69 (2022)
Prediction of contralateral breast cancer (CBC) risk is challenging due to moderate performances of the known risk factors. We aimed to improve our previous risk prediction model (PredictCBC) by updated follow-up and including additional risk factors.
We included data from 207,510 invasive breast cancer patients participating in 23 studies. In total, 8225 CBC events occurred over a median follow-up of 10.2 years. In addition to the previously included risk factors, PredictCBC-2.0 included CHEK2 c.1100delC, a 313 variant polygenic risk score (PRS-313), body mass index (BMI), and parity. Fine and Gray regression was used to fit the model. Calibration and a time-dependent area under the curve (AUC) at 5 and 10 years were assessed to determine the performance of the models. Decision curve analysis was performed to evaluate the net benefit of PredictCBC-2.0 and previous PredictCBC models.
The discrimination of PredictCBC-2.0 at 10 years was higher than PredictCBC with an AUC of 0.65 (95% prediction intervals (PI) 0.56–0.74) versus 0.63 (95%PI 0.54–0.71). PredictCBC-2.0 was well calibrated with an observed/expected ratio at 10 years of 0.92 (95%PI 0.34–2.54). Decision curve analysis for contralateral preventive mastectomy (CPM) showed the potential clinical utility of PredictCBC-2.0 between thresholds of 4 and 12% 10-year CBC risk for BRCA1/2 mutation carriers and non-carriers.
Additional genetic information beyond BRCA1/2 germline mutations improved CBC risk prediction and might help tailor clinical decision-making toward CPM or alternative preventive strategies. Identifying patients who benefit from CPM, especially in the general breast cancer population, remains challenging.
Contralateral breast cancer (CBC) is the most common second primary cancer among women diagnosed with first primary invasive breast cancer (BC). CBC accounts for approximately 40–50% of all new secondary cancers in women with first primary invasive BC and has a potentially less favorable prognosis [2,3,4,5,6]. Worries regarding CBC risk have increased the demand for contralateral preventive mastectomy (CPM) [7, 8]. However, the impact of CPM on survival is uncertain, especially in women with a low risk of developing CBC [9,10,11,12,13]. Thus, improved CBC risk prediction is important to inform decision-making on surveillance and preventive strategies. Currently, the most important factor for decision-making on CPM is the BRCA1/2 mutation status.
We previously developed and cross-validated two models using data from 132,756 invasive BC patients with a median follow-up of 8.8 years including 4672 CBC events. One model (PredictCBC-1A) included information about BRCA1/2 mutation status, and the other (PredictCBC-1B) was developed for the general breast cancer population of genetically untested women. Two other specific CBC prediction tools are currently available in the literature: the Manchester formula (part of the Manchester guidelines for CPM) and CBCrisk [15,16,17,18].
In addition to BRCA1/2 mutations, other genetic risk factors for breast cancer are also associated with CBC risk. In particular, there is substantial evidence that the CHEK2 c.1100delC variant increases the risk of developing CBC [19, 20]. In addition, polygenic risk scores (PRS) of common variants, developed for association with first breast cancer, have been shown to predict CBC in the general BC population and in BRCA1/2 mutation carriers [21,22,23,24], particularly the extensively validated 313 SNP PRS. With regard to lifestyle and reproductive factors, there is evidence that body mass index (BMI) and parity at or around the time of the first primary invasive BC diagnosis are associated with CBC risk.
Our aim was to refit PredictCBC models incorporating these additional risk factors. We used the same dataset with updated follow-up and added further studies, in particular one large study of BRCA1 and BRCA2 mutation carriers. We evaluated the potential improvement in prediction performance and utility for clinical decision-making of the updated models (PredictCBC-2.0) for both BRCA1/2 carriers and the general (non-tested) breast cancer population.
Material and methods
Study population and available data
We used the data from the same five main sources previously used for PredictCBC models to develop the PredictCBC-2.0 models, including updated follow-up information, additional patients, and invasive or in situ CBC events. Two studies were additionally included from the Breast Cancer Association Consortium (BCAC) compared to the version of the BCAC data used to develop PredictCBC-1A and PredictCBC-1B models. Most of the studies were either population- or hospital-based series, and most women were of European descent (Additional file 1: Data and patient selection and Additional file 2: Table S1 and Additional file 1: Table S2, available online). We also included patients selected from the Hereditary Breast and Ovarian cancer study in the Netherlands (HEBON), a nationwide study based on clinical genetic centers. The eligibility criteria were the same as previously: briefly, we included female patients with invasive first primary BC with no sign of distant metastases at diagnosis or prior history of any cancer (except for non-melanoma skin cancer). We included women diagnosed after 1990 so that diagnostic and treatment procedures were close to modern practice while follow-up was sufficient to study CBC incidence. In total, 207,510 women with first primary invasive BC from 23 studies were included. All studies were approved by the appropriate ethics and scientific review boards. All women provided written informed consent; or, for some Dutch cohorts as applicable, the secondary use of clinical data was in accordance with Dutch legislation and codes of conduct [28, 29]. Information about the sample size for every data source and the total sample size after eligibility criteria are provided in Table 1. The choice of additional predictors in the analyses was based on evidence from the literature and the availability of predictors in our data sources.
In particular, evidence from the literature suggests that CHEK2 c.1100delC and the 313 SNP PRS increase the risk of developing CBC [21,22,23,24]. In addition, a systematic review of lifestyle and reproductive factors suggested that BMI and parity at or around the time of the first primary invasive BC diagnosis are associated with CBC risk. Details about sample size per study and about the factors included in the analyses, follow-up per dataset, and study design are in Additional file 2: Table S1 and Additional file 3: Table S3, available online.
Primary endpoint and follow-up
The primary endpoint in the analyses was the incidence of invasive or in situ metachronous CBC. Follow-up started 3 months after invasive first primary BC diagnosis, to exclude synchronous CBCs, and ended at the date of CBC, distant metastasis (but not a loco-regional relapse), CPM, or last date of follow-up (due to death, loss to follow-up, or end of study), whichever occurred first. For 36,553 (17.6%) women, from BCAC and HEBON, recruitment or blood sampling for DNA testing occurred more than 3 months after diagnosis of the first primary BC. For these women, follow-up started at recruitment, at the date of blood draw, or at the DNA test result (left truncation). Patients who underwent CPM during the follow-up were censored because of negligible CBC risk after a CPM. Missing data were multiply imputed by chained equations (MICE) to avoid loss of information due to case-wise deletion [31,32,33] (Additional file 1: Multiple imputation of missing values, available online).
Model development and validation
We used multivariable Fine and Gray regression models to account for death and distant metastases as competing events. Analyses were stratified by study to allow baseline hazard (sub)distributions to differ across studies. The assumption of proportional subdistribution hazards was checked graphically using Schoenfeld residuals. The resulting subdistribution hazard ratios (sHRs) and corresponding 95% confidence intervals (CI) were pooled from 5 imputed datasets using Rubin’s rules. We re-estimated the coefficients of PredictCBC-1A and PredictCBC-1B by refitting the models on the extended dataset with updated follow-up time. PredictCBC-1A, developed including information about BRCA1/2 mutation carrier status, was extended by including CHEK2 c.1100delC status, PRS-313, self-reported BMI, and self-reported parity (hereafter: PredictCBC-2.0A). CHEK2 c.1100delC and PRS-313 were derived from the BCAC database, as published previously [25, 36, 37]. We extended PredictCBC-1B, developed for genetically untested women, by incorporating self-reported BMI and parity (hereafter: PredictCBC-2.0B). Potential nonlinear relations between continuous predictors and CBC risk were investigated using restricted cubic splines with three knots.
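As an illustration of the pooling step, the following minimal sketch applies Rubin's rules to one coefficient estimated in each of five imputed datasets. The log(sHR) values and variances are hypothetical and are not taken from the study, the function name `pool_rubin` is ours, and the normal-approximation confidence interval is a simplification (Rubin's rules formally use a t reference distribution with adjusted degrees of freedom).

```python
import math

def pool_rubin(estimates, variances):
    """Pool one coefficient across m imputed datasets using Rubin's rules.

    estimates: per-imputation point estimates (e.g. log subdistribution hazard ratios)
    variances: per-imputation squared standard errors
    Returns (pooled estimate, total variance).
    """
    m = len(estimates)
    q_bar = sum(estimates) / m                                # pooled point estimate
    u_bar = sum(variances) / m                                # within-imputation variance
    b = sum((q - q_bar) ** 2 for q in estimates) / (m - 1)    # between-imputation variance
    total_var = u_bar + (1 + 1 / m) * b
    return q_bar, total_var

# Hypothetical log(sHR) estimates and variances from 5 imputed datasets.
log_shr = [0.80, 0.85, 0.78, 0.82, 0.90]
var_log_shr = [0.010, 0.011, 0.009, 0.010, 0.012]

pooled, total_var = pool_rubin(log_shr, var_log_shr)
se = math.sqrt(total_var)
shr = math.exp(pooled)
# Normal approximation for the 95% CI (a simplification, see lead-in).
ci_95 = (math.exp(pooled - 1.96 * se), math.exp(pooled + 1.96 * se))
print(f"pooled sHR = {shr:.2f}, 95% CI {ci_95[0]:.2f} to {ci_95[1]:.2f}")
```

Pooling is done on the log scale and exponentiated afterwards, so that the interval is symmetric around the log(sHR) rather than around the ratio itself.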
The validity of the model was investigated by leave-one-study-out cross-validation. In each validation cycle, all studies were analyzed except one, in which the validity of the model was evaluated. Since some BCAC studies had insufficient CBC events required for reliable validation, we used the geographic area as a unit for splitting [38,39,40]. Nineteen out of 23 studies were combined in 4 geographic areas (Additional file 1: Table S2, available online). A total of 8 units of splitting including 4 geographic areas and 4 studies were used to cross-validate the models.
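The splitting scheme can be sketched as follows. This is only an illustration of leave-one-unit-out splitting under assumed names: the field `unit` and the example labels are hypothetical, and model fitting and evaluation are left out.

```python
def leave_one_unit_out(records, unit_key="unit"):
    """Yield (held_out_unit, development_rows, validation_rows) for each splitting unit.

    A unit is either a single study or a geographic area combining several studies.
    """
    units = sorted({r[unit_key] for r in records})
    for held_out in units:
        development = [r for r in records if r[unit_key] != held_out]
        validation = [r for r in records if r[unit_key] == held_out]
        yield held_out, development, validation

# Hypothetical patient records, each already mapped to its splitting unit.
records = [
    {"id": 1, "unit": "area_A"},
    {"id": 2, "unit": "area_A"},
    {"id": 3, "unit": "area_B"},
    {"id": 4, "unit": "study_X"},
]

splits = list(leave_one_unit_out(records))
for held_out, development, validation in splits:
    # In a real analysis the model would be fitted on `development`
    # and its performance assessed on `validation`.
    print(held_out, len(development), len(validation))
```

Each unit serves exactly once as the validation set, so the number of validation cycles equals the number of units (8 in the analysis described above).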
The performance of PredictCBC-2.0 was assessed by discrimination, i.e., the ability to differentiate between patients diagnosed with CBC and those who were not, and by calibration, which measures the agreement between the actual (observed) risk and the CBC risk estimated by the prediction models (predicted). Discrimination was quantified by time-dependent areas under the ROC curve (AUCs) based on inverse probability of censoring weighting at 5 and 10 years. The AUCs were estimated using the prognostic index, which is the sum of the estimated coefficients (betas) of the PredictCBC models multiplied by the corresponding individual characteristics (i.e., predictors) included in the models. Values of AUCs close to 1 indicate good discrimination, while values close to 0.5 indicate poor discrimination. Calibration was assessed by the observed-to-expected (O/E) ratio and calibration plots at 5 and 10 years [42, 43]. An O/E ratio lower or higher than 1 indicates that average predictions are too high or low, respectively.
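To make the two quantities concrete, the sketch below computes a prognostic index and an O/E ratio. The coefficients, predictor names, and risks are hypothetical and do not come from the PredictCBC models; in a real analysis the observed risk would be a cumulative incidence estimate that accounts for censoring and competing events, which is assumed here to be given.

```python
def prognostic_index(coefficients, patient):
    """Sum of coefficient times predictor value over all predictors in the model."""
    return sum(beta * patient[name] for name, beta in coefficients.items())

def oe_ratio(observed_risk, predicted_risks):
    """Observed risk divided by the mean predicted risk at the same time horizon."""
    expected = sum(predicted_risks) / len(predicted_risks)
    return observed_risk / expected

# Hypothetical coefficients (log sHRs) and one hypothetical patient.
coefficients = {"brca1_carrier": 1.0, "age_per_10_years": -0.1, "prs313_per_sd": 0.3}
patient = {"brca1_carrier": 1, "age_per_10_years": 5.0, "prs313_per_sd": 0.5}
pi_value = prognostic_index(coefficients, patient)

# Hypothetical observed 10-year cumulative incidence and individual predictions.
ratio = oe_ratio(0.041, [0.030, 0.050, 0.055])
print(f"prognostic index = {pi_value:.2f}, O/E = {ratio:.2f}")
```

In this example the mean predicted risk (4.5%) exceeds the observed risk (4.1%), so the O/E ratio falls below 1, which corresponds to average predictions that are too high.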
To consider heterogeneity among studies, a random-effect meta-analysis was performed to provide summaries of discrimination and calibration performance. The 95% prediction intervals (PI) indicate the likely performance of the model in a new dataset. The summary performances of PredictCBC-2.0 and 1.0 models were compared to evaluate whether adding the new predictors improved the performance of CBC risk prediction. We developed and validated the risk prediction model following the Transparent Reporting of a Multivariable Prediction model for Individual Prognosis or Diagnosis (TRIPOD) statement. Analyses were done in SAS (SAS Institute Inc., Cary, NC, USA) and R (version 3.6.1).
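The following sketch shows how a random-effects summary and a 95% prediction interval can be obtained from unit-specific performance estimates. It assumes the DerSimonian-Laird estimator for the between-unit variance, which the text does not specify, and it works on whatever scale the estimates are supplied in (in practice AUCs are often pooled on the logit scale and O/E ratios on the log scale). The input values are hypothetical, and the t quantile is passed in by the caller to keep the example free of external dependencies.

```python
import math

def random_effects_summary(estimates, variances, t_crit):
    """DerSimonian-Laird random-effects summary with a prediction interval.

    t_crit: 97.5% quantile of the t distribution with k - 2 degrees of freedom,
            where k is the number of units.
    Returns (summary estimate, between-unit variance tau^2, prediction interval).
    """
    k = len(estimates)
    w = [1.0 / v for v in variances]                       # fixed-effect weights
    fixed = sum(wi * yi for wi, yi in zip(w, estimates)) / sum(w)
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, estimates))
    c = sum(w) - sum(wi ** 2 for wi in w) / sum(w)
    tau2 = max(0.0, (q - (k - 1)) / c)                     # between-unit variance
    w_star = [1.0 / (v + tau2) for v in variances]         # random-effects weights
    mu = sum(wi * yi for wi, yi in zip(w_star, estimates)) / sum(w_star)
    se_mu = math.sqrt(1.0 / sum(w_star))
    half_width = t_crit * math.sqrt(tau2 + se_mu ** 2)
    return mu, tau2, (mu - half_width, mu + half_width)

# Hypothetical AUC estimates and variances from 8 validation units.
aucs = [0.62, 0.68, 0.60, 0.71, 0.66, 0.59, 0.64, 0.69]
auc_vars = [0.0004, 0.0009, 0.0006, 0.0010, 0.0003, 0.0008, 0.0005, 0.0007]
T_975_DF6 = 2.447  # t quantile for k - 2 = 6 degrees of freedom

mu, tau2, pi_95 = random_effects_summary(aucs, auc_vars, T_975_DF6)
print(f"summary AUC = {mu:.2f}, 95% PI {pi_95[0]:.2f} to {pi_95[1]:.2f}")
```

The prediction interval adds the between-unit variance to the uncertainty of the summary estimate, which is why it is wider than a confidence interval when performance differs across units.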
The clinical utility of the prediction models was evaluated using decision curve analysis (DCA) [45, 46]. A key metric in DCA is the net benefit, which is the number of true-positive classifications (in this example: the number of CPMs in patients who would have developed a CBC) minus the weighted number of false-positive classifications (in this example: the number of unnecessary CPMs in patients who would not have developed a CBC). The false positives are weighted by a factor related to the relative harm of a missed CBC versus an unnecessary CPM. The weighting is derived from the threshold probability to develop a CBC using a fixed time horizon (e.g., CBC risk at 5 or 10 years). For example, a threshold of 10% implies that CPM in 10 patients, of whom one would develop CBC if untreated, is acceptable (thus performing 9 unnecessary CPMs). The net benefit of a prediction model is traditionally compared with the strategies of treat all or treat none. Since the use of CPM is generally only considered among BRCA1/2 mutation carriers, the decision curve analysis was reported among BRCA1/2 mutation carriers and non-carriers separately. Among patients not tested for BRCA1/2 germline mutations, we assumed that the decision for CPM is based on family history of breast cancer. The net benefits of PredictCBC-2.0A and PredictCBC-2.0B were compared with the net benefit of PredictCBC-1A and 1B, respectively, to assess the potential improvement in the clinical utility of the updated models.
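The net benefit calculation can be sketched as below. For simplicity the example treats the outcome as a fully observed binary indicator, whereas an analysis with censoring and competing events would replace the counts with cumulative incidence estimates. All risks and outcomes are hypothetical.

```python
def net_benefit(risks, events, threshold):
    """Net benefit of treating patients whose predicted risk is at or above the threshold."""
    n = len(risks)
    true_pos = sum(1 for r, e in zip(risks, events) if r >= threshold and e)
    false_pos = sum(1 for r, e in zip(risks, events) if r >= threshold and not e)
    weight = threshold / (1 - threshold)   # harm of one false positive relative to one true positive
    return true_pos / n - weight * false_pos / n

def net_benefit_treat_all(events, threshold):
    """Net benefit of the reference strategy in which every patient is treated."""
    prevalence = sum(events) / len(events)
    return prevalence - (1 - prevalence) * threshold / (1 - threshold)

# Hypothetical predicted 10-year CBC risks and observed outcomes (1 = CBC).
risks = [0.02, 0.05, 0.12, 0.20, 0.08, 0.15, 0.03, 0.11, 0.01, 0.30]
events = [0, 0, 1, 0, 0, 1, 0, 0, 0, 1]

nb_model = net_benefit(risks, events, threshold=0.10)
nb_all = net_benefit_treat_all(events, threshold=0.10)
print(f"net benefit: model = {nb_model:.3f}, treat all = {nb_all:.3f}")
```

At a 10% threshold the weight equals 1/9, matching the statement that nine unnecessary CPMs are considered acceptable for each CPM in a patient who would have developed CBC. The treat-none strategy has a net benefit of zero by definition.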
A total of 207,510 women with invasive first primary BC diagnosed between 1990 and 2017, with 8225 CBC events (6828 invasive, 1397 in situ), from 23 studies, were used for CBC risk prediction modeling (Additional file 2: Table S1, available online). Median follow-up time was 10.2 years, and CBC cumulative incidences at 5 and 10 years were 2.2% and 4.1%, respectively. Details of the studies and patient, tumor, and treatment characteristics are provided in Additional file 3: Table S3 (available online). The multivariable models with estimates for all included factors are given in Table 2.
Most of the factors were independently associated with CBC risk, including the new factors incorporated in the PredictCBC-2.0 models, i.e., BMI, parity, CHEK2 c.1100delC, and PRS-313. There was no evidence against log-linear relationships of BMI, parity, and PRS-313 with CBC risk. Nonlinearity between age at first BC diagnosis and CBC risk was accounted for with a linear spline at age 60 years. The formulae of the PredictCBC models are provided in Additional file 1: Formula to estimate the contralateral breast cancer risk using PredictCBC-2.0A and PredictCBC-2.0B (available online). To calculate the predicted CBC cumulative incidence, we used the event-free baseline probability of the Netherlands Cancer Registry (NCR), as previously.
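As a rough illustration of how a predicted cumulative incidence follows from a baseline and a prognostic index, the sketch below uses the standard Fine and Gray relation, in which the individual event-free probability equals the baseline event-free probability raised to the power exp(prognostic index). The exact formula and any centering used for PredictCBC-2.0 are given in Additional file 1 and may differ in detail; the baseline value and function names here are hypothetical.

```python
import math

def linear_spline_terms(age, knot=60.0):
    """Basis for a linear spline with one knot: the age itself and the excess above the knot."""
    return age, max(age - knot, 0.0)

def predicted_cbc_risk(baseline_event_free, prognostic_index):
    """Cumulative incidence under a Fine and Gray model.

    baseline_event_free: event-free baseline probability at the chosen time horizon
    prognostic_index:    sum of coefficients times predictor values
    """
    return 1.0 - baseline_event_free ** math.exp(prognostic_index)

# Hypothetical 10-year event-free baseline probability of 0.96.
risk_reference = predicted_cbc_risk(0.96, 0.0)            # reference patient
risk_doubled = predicted_cbc_risk(0.96, math.log(2.0))    # subdistribution hazard doubled
print(f"{risk_reference:.3f} {risk_doubled:.3f}")
```

Doubling the subdistribution hazard nearly, but not exactly, doubles the predicted risk (4.0% versus 7.8% in this example), because the relation is multiplicative on the hazard scale rather than on the risk scale.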
The AUCs of PredictCBC-2.0A were higher than those of PredictCBC-1A at both time points. At 5 years, the AUC was 0.66 (95% prediction interval (PI) 0.55–0.76) versus 0.62 (95%PI 0.51–0.74); at 10 years, it was 0.65 (95%PI 0.56–0.74) versus 0.63 (95%PI 0.54–0.71) (Figs. 1 and 2, Table 3). The AUCs for PredictCBC-2.0B and PredictCBC-1B were both 0.59 (95%PI: PredictCBC-2.0B: 0.51–0.68; PredictCBC-1B: 0.49–0.69) at 5 years and both 0.58 (95%PI 0.51–0.65) at 10 years (Figs. 1 and 2, Table 3).
The O/E ratio at 5 and 10 years across all versions of PredictCBC models ranged between 0.90 and 0.92 with similar 95%PIs (Figs. 1 and 2, Table 3). Calibration plots of PredictCBC-2.0 models are provided in Additional file 1: Figs. S1–S4 (available online).
The decision curves showed the net benefit for a range of harm–benefit thresholds at 10-year CBC risk (Fig. 3). We evaluated the potential clinical utility of PredictCBC-2.0A versus PredictCBC-1A for decision thresholds between 4 and 12% for the 10-year CBC risk among BRCA1/2 mutation carriers and non-carriers (Figs. 3 and 4, Table 4). For example, if consensus guidelines would indicate the acceptability of 1 in 10 patients for whom a CPM is recommended developing CBC, a risk threshold of 10% may be used to define high- and low-risk BRCA1/2 mutation carriers based on the absolute 10-year CBC risk prediction estimated by the models. Compared with a strategy recommending CPM to all BRCA1/2 mutation carriers, PredictCBC-1A avoids 76.9 net CPMs per 1000 patients (Table 4). An additional 50.0 CPMs may be avoided using PredictCBC-2.0A compared to PredictCBC-1A. In contrast, almost no non-BRCA1/2 mutation carriers had predictions above the 10% threshold (general BC population, Table 4); three necessary CPMs per 1000 patients would be indicated using PredictCBC-2.0A. Analyses for PredictCBC-1B and PredictCBC-2.0B at 10 years suggested a potential clinical utility between 4 and 6% 10-year CBC risk for patients with and without family history (Table 4 and Figs. 3 and 4). No remarkable improvement in net benefit was detected using PredictCBC-2.0B compared to PredictCBC-1B in decision-making regarding CPM (Table 4 and Fig. 3). Decision curves for CBC risk using PredictCBC and PredictCBC-2.0 at 5 years and the corresponding clinical utility showed similar patterns (Additional file 1: Figs. S5-S6 and Table S4, available online).
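A figure such as "net CPMs avoided per 1000 patients" is commonly derived in decision curve analysis by rescaling the difference in net benefit between a model and the treat-all strategy. The sketch below shows that conversion; it is a general DCA identity rather than a reproduction of Table 4, and the net benefit values are hypothetical.

```python
def net_interventions_avoided(nb_model, nb_treat_all, threshold, per=1000):
    """Net reduction in interventions relative to treating everyone.

    The difference in net benefit is expressed in units of true positives;
    dividing by the odds of the threshold converts it into avoided
    unnecessary interventions, here scaled per 1000 patients.
    """
    odds = threshold / (1 - threshold)
    return (nb_model - nb_treat_all) / odds * per

# Hypothetical net benefits at a 10% threshold for 10-year CBC risk.
avoided = net_interventions_avoided(nb_model=0.05, nb_treat_all=0.04, threshold=0.10)
print(f"net CPMs avoided per 1000 patients: {avoided:.1f}")
```

In this hypothetical case a net benefit gain of 0.01 at a 10% threshold corresponds to 90 fewer CPMs per 1000 patients without missing additional CBCs.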
We evaluated the potential improvement in CBC risk prediction by adding established genetic (CHEK2 c.1100delC and PRS-313) and lifestyle (BMI and parity) factors to the previous PredictCBC models and used additional follow-up information and new studies to provide more reliable estimates.
The current clinical recommendations for CPM are mostly based on the presence of a pathogenic mutation in BRCA1/2 [49, 50]. This seems a reasonable approach according to CBC risk predictions based on the PredictCBC models: few non-BRCA1/2 carriers exceed a 10% 10-year risk threshold. However, approximately 40% of BRCA1/2 mutation carriers do not reach this threshold either, suggesting that a significant proportion of BRCA1/2 carriers might be spared CPM. Additional genetic information beyond BRCA1/2 germline mutation such as the presence of the CHEK2 c.1100delC variant and PRS-313 might improve decision-making.
Currently available CBC models, such as CBCrisk and the Manchester formula, show only moderate discrimination. In addition, the Manchester formula has been shown to systematically overestimate CBC risk. The BOADICEA model, a well-known risk prediction tool to estimate the risk of developing the first primary BC, also allows the calculation of CBC risk [52,53,54,55]. Although BOADICEA includes rare pathogenic variants in moderate- and high-risk BC susceptibility genes (i.e., BRCA1, BRCA2, PALB2, ATM, CHEK2, BARD1, RAD51C, and RAD51D) and PRS-313, it does not incorporate information on the systemic treatment of the primary BC, which is an important predictor of CBC risk.
A model for the prediction of recurrence, the INFLUENCE nomogram, was developed to estimate 5-year recurrence risk as well as conditional annual risks of developing a local or regional recurrence based on first BC and treatment characteristics. A more recent version (INFLUENCE 2.0) also provides 5-year individualized predictions for second primary breast cancer based on cases older than 50 years at first cancer diagnosis from the NCR nationwide cohort, irrespective of their genetic status or testing status, using random survival forests. The model provided moderate discrimination (AUC at 5 years: 0.67; 95%CI 0.65–0.68) using internal validation. In our comparable population- and hospital-based Dutch series, EMC and NCR, the AUCs at 5 years of PredictCBC-1A were 0.69 (95%CI 0.64–0.73) and 0.66 (95%CI 0.65–0.67), and of PredictCBC-2.0A 0.71 (95%CI 0.66–0.75) and 0.68 (95%CI 0.66–0.69), respectively. Moreover, INFLUENCE 2.0 is only relevant to the general population, while PredictCBC can also be used in the clinical genetic setting. Notably, we demonstrated that decision-making about preventive strategies in clinical practice is unlikely to improve without genetic information.
Our work has some limitations. Firstly, some women included in the Dutch studies (providing specific information on family history, BRCA mutation or CPM) were also present in our selection of the NCR population, as described previously. Privacy and coding issues prevented linkage at the individual patient level, but based on the hospitals from which the studies were recruited, and the age and period criteria used, we calculated a maximum potential overlap of 9%. Secondly, important predictors such as family history, BRCA1/2 and CHEK2 c.1100delC status, and PRS-313, were only available in a subset of the women, although the multiple imputation approach should lead to consistent estimates [59,60,61]. Detailed information about family history of breast cancer would have been useful to improve CBC risk prediction, especially among patients with a mutation in BRCA1/2 or CHEK2. Nonetheless, we considerably increased the number of patients with BRCA1/2 mutation status and family history information compared to our previous publication (40,343 vs. 7704 and 53,399 vs. 30,541 patients with available BRCA mutation status and family history information, respectively), and added CHEK2 c.1100delC, which is a founder mutation present in approximately 0.5–1.6% of individuals of Northern and Eastern European descent and explains the large majority of carriers of CHEK2 protein truncating variants in these populations [19, 62]. Further validation will be required to investigate how well PredictCBC models predict risk in other populations. In particular, the model was developed in patients of European ancestry and further evaluation and adaptation will be needed to extend PredictCBC models to non-European populations, including Asia [63, 64]. Future research might also include comparisons of machine learning (ML) methods with classical statistical regression models [65, 66].
The prediction models may be further improved by including additional risk factors. In particular, rare mutations in other breast cancer susceptibility genes, such as ATM and PALB2, are also likely to be associated with an increased risk of CBC [22, 67, 68]. The discrimination provided by the PRS will also improve as more SNPs are added [69, 70]. Prediction performance might also be improved by adding breast density and other risk factors (e.g., additional lifestyle and reproductive factors such as alcohol use, age at primiparity, age at menopause) modeled dynamically in a time-dependent fashion. Finally, we wish to emphasize that adequate presentation (e.g., with online tools) of the risk estimates is crucial for effective communication about CBC risk during doctor–patient consultations [72, 73].
In conclusion, we present an updated version of a previously proposed contralateral breast cancer risk model (PredictCBC) including additional information on breast cancer genetic variants beyond BRCA1/2, lifestyle and reproductive factors. PredictCBC-2.0, available online at , is based on longer follow-up from a wide range of new European-descent population and hospital-based studies, with reasonable calibration. PredictCBC-2.0 may be used to tailor clinical decision-making toward CPM or alternative preventive strategies, especially when genetic information is available.
Availability of data and materials
The datasets analyzed during the current study are not publicly available due to the protection of participant privacy and confidentiality, and ownership of the contributing institutions, but may be made available in an anonymized form via the corresponding author on reasonable request and after approval of the involved institutions.
Abbreviations
AUC: Area under the ROC curve
BCAC: Breast Cancer Association Consortium
BMI: Body mass index
CBC: Contralateral breast cancer
CPM: Contralateral preventive mastectomy
DCA: Decision curve analysis
HEBON: The Hereditary Breast and Ovarian Cancer Research Group Netherlands
HER2: Human epidermal growth factor receptor 2
MICE: Multiple imputation by chained equations
NCR: Netherlands Cancer Registry
PRS: Polygenic risk score
sHR: Subdistribution hazard ratio
TRIPOD: Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis
Chen Y, Thompson W, Semenciw R, Mao Y. Epidemiology of contralateral breast cancer. Cancer Epidemiol Biomarkers Prev. 1999;8(10):855–61.
Gao X, Fisher SG, Emami B. Risk of second primary cancer in the contralateral breast in women treated for early-stage breast cancer: a population-based study. Int J Radiat Oncol Biol Phys. 2003;56(4):1038–45.
Curtis RE, Ron E, Hankey BF, Hoover RN. New malignancies following breast cancer. In: New malignancies among cancer survivors: SEER Cancer Registries, 1973–2000; 181–205.
Yu GP, Schantz SP, Neugut AI, Zhang ZF. Incidences and trends of second cancers in female breast cancer patients: a fixed inception cohort-based analysis (United States). Cancer Causes Control. 2006;17(4):411–20.
Soerjomataram I, Louwman WJ, Lemmens VE, de Vries E, Klokman WJ, Coebergh JW. Risks of second primary breast and urogenital cancer following female breast cancer in the south of The Netherlands, 1972–2001. Eur J Cancer. 2005;41(15):2331–7.
Schaapveld M, Visser O, Louwman WJ, Willemse PH, de Vries EG, van der Graaf WT, Otter R, Coebergh JW, van Leeuwen FE. The impact of adjuvant therapy on contralateral breast cancer risk and the prognostic significance of contralateral breast cancer: a population based study in the Netherlands. Breast Cancer Res Treat. 2008;110(1):189–97.
Tuttle TM, Habermann EB, Grund EH, Morris TJ, Virnig BA. Increasing use of contralateral prophylactic mastectomy for breast cancer patients: a trend toward more aggressive surgical treatment. J Clin Oncol. 2007;25(33):5203–9.
Narod SA. Bilateral breast cancers. Nat Rev Clin Oncol. 2014;11(3):157–66.
Metcalfe K, Gershman S, Ghadirian P, Lynch HT, Snyder C, Tung N, Kim-Sing C, Eisen A, Foulkes WD, Rosen B, et al. Contralateral mastectomy and survival after breast cancer in carriers of BRCA1 and BRCA2 mutations: retrospective analysis. BMJ. 2014;348:g226.
Xiong Z, Yang L, Deng G, Huang X, Li X, Xie X, Wang J, Shuang Z, Wang X. Patterns of occurrence and outcomes of contralateral breast cancer: analysis of SEER data. J Clin Med. 2018;7(6):133.
Wong SM, Freedman RA, Sagara Y, Aydogan F, Barry WT, Golshan M. Growing use of contralateral prophylactic mastectomy despite no improvement in long-term survival for invasive breast cancer. Ann Surg. 2017;265(3):581–9.
Murphy JA, Milner TD, O’Donoghue JM. Contralateral risk-reducing mastectomy in sporadic breast cancer. Lancet Oncol. 2013;14(7):e262-269.
Basu NN, Hodson J, Chatterjee S, Gandhi A, Wisely J, Harvey J, Highton L, Murphy J, Barnes N, Johnson R, et al. The Angelina Jolie effect: contralateral risk-reducing mastectomy trends in patients at increased risk of breast cancer. Sci Rep. 2021;11(1):2847.
Domchek SM. Risk-reducing mastectomy in BRCA1 and BRCA2 mutation carriers: a complex discussion. JAMA. 2019;321(1):27.
Giardiello D, Steyerberg EW, Hauptmann M, Adank MA, Akdeniz D, Blomqvist C, Bojesen SE, Bolla MK, Brinkhuis M, Chang-Claude J, et al. Prediction and clinical utility of a contralateral breast cancer risk model. Breast Cancer Res. 2019;21(1):144.
Basu NN, Ross GL, Evans DG, Barr L. The Manchester guidelines for contralateral risk-reducing mastectomy. World J Surg Oncol. 2015;13:237.
Chowdhury M, Euhus D, Onega T, Biswas S, Choudhary PK. A model for individualized risk prediction of contralateral breast cancer. Breast Cancer Res Treat. 2017;161(1):153–60.
Chowdhury M, Euhus D, Arun B, Umbricht C, Biswas S, Choudhary P. Validation of a personalized risk prediction model for contralateral breast cancer. Breast Cancer Res Treat. 2018;170(2):415–23.
Weischer M, Nordestgaard BG, Pharoah P, Bolla MK, Nevanlinna H, Van’t Veer LJ, Garcia-Closas M, Hopper JL, Hall P, Andrulis IL, et al. CHEK2*1100delC heterozygosity in women with breast cancer associated with early death, breast cancer-specific death, and increased risk of a second breast cancer. J Clin Oncol. 2012;30(35):4308–16.
Akdeniz D, Schmidt MK, Seynaeve CM, McCool D, Giardiello D, van den Broek AJ, Hauptmann M, Steyerberg EW, Hooning MJ. Risk factors for metachronous contralateral breast cancer: a systematic review and meta-analysis. Breast. 2019;44:1–14.
Robson ME, Reiner AS, Brooks JD, Concannon PJ, John EM, Mellemkjaer L, Bernstein L, Malone KE, Knight JA, Lynch CF, et al. Association of common genetic variants with contralateral breast cancer risk in the WECARE study. J Natl Cancer Inst. 2017. https://doi.org/10.1093/jnci/djx051.
Fanale D, Incorvaia L, Filorizzo C, Bono M, Fiorino A, Calo V, Brando C, Corsini LR, Barraco N, Badalamenti G, et al. Detection of germline mutations in a cohort of 139 patients with bilateral breast cancer by multi-gene panel testing: impact of pathogenic variants in other genes beyond BRCA1/2. Cancers (Basel). 2020;12(9):2415.
Kramer I, Hooning MJ, Mavaddat N, Hauptmann M, Keeman R, Steyerberg EW, Giardiello D, Antoniou AC, Pharoah PDP, Canisius S, et al. Breast cancer polygenic risk score and contralateral breast cancer risk. Am J Hum Genet. 2020;107(5):837–48.
Lakeman IMM, van den Broek AJ, Vos JAM, Barnes DR, Adlard J, Andrulis IL, Arason A, Arnold N, Arun BK, Balmana J, et al. The predictive ability of the 313 variant-based polygenic risk score for contralateral breast cancer risk prediction in women of European ancestry with a heterozygous BRCA1 or BRCA2 pathogenic variant. Genet Med. 2021;23:1726–37.
Mavaddat N, Michailidou K, Dennis J, Lush M, Fachal L, Lee A, Tyrer JP, Chen TH, Wang Q, Bolla MK, et al. Polygenic risk scores for prediction of breast cancer and breast cancer subtypes. Am J Hum Genet. 2019;104(1):21–34.
Akdeniz D, Klaver MM, Smith CZA, Koppert LB, Hooning MJ. The impact of lifestyle and reproductive factors on the risk of a second new primary cancer in the contralateral breast: a systematic review and meta-analysis. Cancer Causes Control. 2020;31(5):403–16.
Pijpe A, Manders P, Brohet RM, Collee JM, Verhoef S, Vasen HF, Hoogerbrugge N, van Asperen CJ, Dommering C, Ausems MG, et al. Physical activity and the risk of breast cancer in BRCA1/2 mutation carriers. Breast Cancer Res Treat. 2010;120(1):235–44.
Riegman PH, van Veen EB. Biobanking residual tissues. Hum Genet. 2011;130(3):357–68.
Foundation Federation of Dutch Medical Scientific Societies. Human tissue and medical research: code of conduct for responsible use. 2011.
van den Broek AJ, Schmidt MK, van’t Veer LJ, Oldenburg HSA, Rutgers EJ, Russell NS, Smit V, Voogd AC, Koppert LB, Siesling S, et al. Prognostic impact of breast-conserving therapy versus mastectomy of BRCA1/2 mutation carriers compared with noncarriers in a consecutive series of young breast cancer patients. Ann Surg. 2019;270(2):364–72.
Buuren S. Flexible imputation of missing data. Boca Raton: CRC Press; 2012.
Resche-Rigon M, White IR, Bartlett JW, Peters SA, Thompson SG. Group P-IS: Multiple imputation for handling systematically missing confounders in meta-analysis of individual participant data. Stat Med. 2013;32(28):4890–905.
Van Buuren S. Flexible imputation of missing data. 2nd ed. Boca Raton: Chapman and Hall/CRC; 2018.
Geskus RB. Cause-specific cumulative incidence estimation and the fine and gray model under both left truncation and right censoring. Biometrics. 2011;67(1):39–49.
Schoenfeld DA. Sample-size formula for the proportional-hazards regression model. Biometrics. 1983;39(2):499–503.
Schmidt MK, Tollenaar RA, de Kemp SR, Broeks A, Cornelisse CJ, Smit VT, Peterse JL, van Leeuwen FE, Van’t Veer LJ. Breast cancer survival and tumor characteristics in premenopausal women carrying the CHEK2*1100delC germline mutation. J Clin Oncol. 2007;25(1):64–9.
Schmidt MK, Hogervorst F, van Hien R, Cornelissen S, Broeks A, Adank MA, Meijers H, Waisfisz Q, Hollestelle A, Schutte M, et al. Age- and tumor subtype-specific breast cancer risk estimates for CHEK2*1100delC carriers. J Clin Oncol. 2016;34(23):2750–60.
Steyerberg EW, Harrell FE Jr. Prediction models need appropriate internal, internal-external, and external validation. J Clin Epidemiol. 2016;69:245–7.
Austin PC, van Klaveren D, Vergouwe Y, Nieboer D, Lee DS, Steyerberg EW. Geographic and temporal validity of prediction models: different approaches were useful to examine model performance. J Clin Epidemiol. 2016;79:76–85.
Collins GS, Ogundimu EO, Altman DG. Sample size considerations for the external validation of a multivariable prognostic model: a resampling study. Stat Med. 2016;35(2):214–26.
Blanche P, Dartigues JF, Jacqmin-Gadda H. Estimating and comparing time-dependent areas under receiver operating characteristic curves for censored event times with competing risks. Stat Med. 2013;32(30):5381–97.
Brentnall AR, Cuzick J. Risk models for breast cancer and their validation. Stat Sci. 2020;35(1):14–30.
Austin PC, Putter H, Giardiello D, van Klaveren D. Graphical calibration curves and the integrated calibration index (ICI) for competing risk models. Diagn Progn Res. 2022;6(1):2.
Collins GS, Reitsma JB, Altman DG, Moons KG. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD). Ann Intern Med. 2015;162(10):735–6.
Vickers AJ, Elkin EB. Decision curve analysis: a novel method for evaluating prediction models. Med Decis Mak. 2006;26(6):565–74.
Kerr KF, Brown MD, Zhu K, Janes H. Assessing the clinical impact of risk prediction models with decision curves: guidance for correct interpretation and appropriate use. J Clin Oncol. 2016;34(21):2534–40.
Vickers AJ, Cronin AM, Elkin EB, Gonen M. Extensions to decision curve analysis, a novel method for evaluating diagnostic tests, prediction models and molecular markers. BMC Med Inform Decis Mak. 2008;8:53.
Heemskerk-Gerritsen BA, Rookus MA, Aalfs CM, Ausems MG, Collee JM, Jansen L, Kets CM, Keymeulen KB, Koppert LB, Meijers-Heijboer HE, et al. Improved overall survival after contralateral risk-reducing mastectomy in BRCA1/2 mutation carriers with a history of unilateral breast cancer: a prospective analysis. Int J Cancer. 2015;136(3):668–77.
Balmana J, Diez O, Rubio IT, Cardoso F, ESMO Guidelines Working Group. BRCA in breast cancer: ESMO clinical practice guidelines. Ann Oncol. 2011;22(Suppl 6):31–4.
Rutgers EJT. Is prophylactic mastectomy justified in women without BRCA mutation? Breast. 2019;48(Suppl 1):S62–4.
Giardiello D, Hauptmann M, Steyerberg EW, Adank MA, Akdeniz D, Blom JC, Blomqvist C, Bojesen SE, Bolla MK, Brinkhuis M, et al. Prediction of contralateral breast cancer: external validation of risk calculators in 20 international cohorts. Breast Cancer Res Treat. 2020;181(2):423–34.
Antoniou AC, Pharoah PP, Smith P, Easton DF. The BOADICEA model of genetic susceptibility to breast and ovarian cancer. Br J Cancer. 2004;91(8):1580–90.
Antoniou AC, Cunningham AP, Peto J, Evans DG, Lalloo F, Narod SA, Risch HA, Eyfjord JE, Hopper JL, Southey MC, et al. The BOADICEA model of genetic susceptibility to breast and ovarian cancers: updates and extensions. Br J Cancer. 2008;98(8):1457–66.
Lee AJ, Cunningham AP, Tischkowitz M, Simard J, Pharoah PD, Easton DF, Antoniou AC. Incorporating truncating variants in PALB2, CHEK2, and ATM into the BOADICEA breast cancer risk model. Genet Med. 2016;18(12):1190–8.
Carver T, Hartley S, Lee A, Cunningham AP, Archer S, Babb de Villiers C, Roberts J, Ruston R, Walter FM, Tischkowitz M, et al. CanRisk Tool-A web interface for the prediction of breast and ovarian cancer risk and the likelihood of carrying genetic pathogenic variants. Cancer Epidemiol Biomarkers Prev. 2021;30(3):469–73.
Kramer I, Schaapveld M, Oldenburg HSA, Sonke GS, McCool D, van Leeuwen FE, Van de Vijver KK, Russell NS, Linn SC, Siesling S, et al. The influence of adjuvant systemic regimens on contralateral breast cancer risk and receptor subtype. J Natl Cancer Inst. 2019;111(7):709–18.
Witteveen A, Vliegen IM, Sonke GS, Klaase JM, Siesling S. Personalisation of breast cancer follow-up: a time-dependent prognostic nomogram for the estimation of annual risk of locoregional recurrence in early breast cancer patients. Breast Cancer Res Treat. 2015;152(3):627–36.
Volkel V, Hueting TA, Draeger T, van Maaren MC, de Munck L, Strobbe LJA, Sonke GS, Schmidt MK, van Hezewijk M, Groothuis-Oudshoorn CGM, et al. Improved risk estimation of locoregional recurrence, secondary contralateral tumors and distant metastases in early breast cancer: the INFLUENCE 2.0 model. Breast Cancer Res Treat. 2021;189:817–26.
Nieboer D, Vergouwe Y, Ankerst DP, Roobol MJ, Steyerberg EW. Improving prediction models with new markers: a comparison of updating strategies. BMC Med Res Methodol. 2016;16(1):128.
Madley-Dowd P, Hughes R, Tilling K, Heron J. The proportion of missing data should not be used to guide decisions on multiple imputation. J Clin Epidemiol. 2019;110:63–73.
Collins GS, Altman DG. Predicting the 10 year risk of cardiovascular disease in the United Kingdom: independent and external validation of an updated version of QRISK2. BMJ. 2012;344:e4181.
Breast Cancer Association Consortium, Dorling L, Carvalho S, Allen J, Gonzalez-Neira A, Luccarini C, Wahlstrom C, Pooley KA, Parsons MT, Fortuno C, et al. Breast cancer risk genes—association analysis in more than 113,000 women. N Engl J Med. 2021;384(5):428–39.
Ho WK, Tan MM, Mavaddat N, Tai MC, Mariapun S, Li J, Ho PJ, Dennis J, Tyrer JP, Bolla MK, et al. European polygenic risk score for prediction of breast cancer shows similar performance in Asian women. Nat Commun. 2020;11(1):3833.
Evans DG, van Veen EM, Byers H, Roberts E, Howell A, Howell SJ, Harkness EF, Brentnall A, Cuzick J, Newman WG. The importance of ethnicity: Are breast cancer polygenic risk scores ready for women who are not of White European origin? Int J Cancer. 2021;150:73–9.
Christodoulou E, Ma J, Collins GS, Steyerberg EW, Verbakel JY, Van Calster B. A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. J Clin Epidemiol. 2019;110:12–22.
Giardiello D, Antoniou AC, Mariani L, Easton DF, Steyerberg EW. Letter to the editor: a response to Ming’s study on machine learning techniques for personalized breast cancer risk prediction. Breast Cancer Res. 2020;22(1):17.
Thompson D, Easton D. The genetic epidemiology of breast cancer genes. J Mammary Gland Biol Neoplasia. 2004;9(3):221–36.
Reiner AS, Sisti J, John EM, Lynch CF, Brooks JD, Mellemkjaer L, Boice JD, Knight JA, Concannon P, Capanu M, et al. Breast cancer family history and contralateral breast cancer risk in young women: an update from the women’s environmental cancer and radiation epidemiology study. J Clin Oncol. 2018;36(15):1513–20.
Torkamani A, Wineinger NE, Topol EJ. The personal and clinical utility of polygenic risk scores. Nat Rev Genet. 2018;19(9):581–90.
Wald NJ, Old R. The illusion of polygenic disease risk prediction. Genet Med. 2019;21:1705–7.
Knight JA, Blackmore KM, Fan J, Malone KE, John EM, Lynch CF, Vachon CM, Bernstein L, Brooks JD, Reiner AS, et al. The association of mammographic density with risk of contralateral breast cancer and change in density with treatment in the WECARE study. Breast Cancer Res. 2018;20(1):23.
Van Belle V, Van Calster B. Visualizing risk prediction models. PLoS ONE. 2015;10(7):e0132614.
Bonnett LJ, Snell KIE, Collins GS, Riley RD. Guide to presenting clinical prediction models for use in clinical settings. BMJ. 2019;365:l737.
PREDICTCBC 2.0. https://www.evidencio.com/models/show/2949
We thank all individuals who took part in these studies and all researchers, clinicians, technicians, and administrative staff who have enabled this work to be carried out. ABCFS thanks Maggie Angelakos, Judi Maskiell, and Gillian Dite. ABCS thanks the Blood bank Sanquin, The Netherlands. ABCTB Investigators: Christine Clarke, Deborah Marsh, Rodney Scott, Robert Baxter, Desmond Yip, Jane Carpenter, Alison Davis, Nirmala Pathmanathan, Peter Simpson, J. Dinny Graham, Mythily Sachchithananthan. ABCS and BOSOM thank all the collaborating hospitals and pathology departments and many individuals that made this study possible; specifically, we wish to acknowledge: Annegien Broeks, Sten Cornelissen, Frans Hogervorst, Laura van ‘t Veer, Emiel Rutgers. EMC thanks J.C. Blom-Leenheer, P.J. Bos, C.M.G. Crepin, and M. van Vliet for data management. CGPS thanks staff and participants of the Copenhagen General Population Study. For the excellent technical assistance: Dorthe Uldall Andersen, Maria Birna Arnadottir, Anne Bank, Dorthe Kjeldgård Hansen. The Danish Cancer Biobank is acknowledged for providing infrastructure for the collection of blood samples for the cases. HEBCS thanks Johanna Kiiski, Taru A. Muranen, Kristiina Aittomäki, Kirsimari Aaltonen, Karl von Smitten, and Irja Erkkilä. The Hereditary Breast and Ovarian Cancer Research Group Netherlands (HEBON) consists of the following Collaborating Centers: Netherlands Cancer Institute (coordinating center), Amsterdam, NL: M.A. Rookus, F.B.L. Hogervorst, M.A. Adank, D.J. Stommel-Jenner, R. de Groot; Erasmus Medical Center, Rotterdam, NL: J.M. Collée, A.M.W. van den Ouweland, M.J. Hooning, I.A. Boere; Leiden University Medical Center, NL: C.J. van Asperen, P. Devilee, R.B. van der Luijt, T.C.T.E.F. van Cronenburg; Radboud University Nijmegen Medical Center, NL: M.R. Wevers, A.R. Mensenkamp; University Medical Center Utrecht, NL: M.G.E.M. Ausems, M.J. Koudijs; Amsterdam UMC, Univ of Amsterdam, NL: I. 
van de Beek; Amsterdam UMC, Vrije Universiteit Amsterdam, NL: J.J.P. Gille; Maastricht University Medical Center, NL: E.B. Gómez García, M.J. Blok, M. de Boer; University of Groningen, NL: L.P.V. Berger, M.J.E. Mourits, G.H. de Bock; The Netherlands Comprehensive Cancer Organisation (IKNL): J. Verloop; The nationwide network and registry of histo- and cytopathology in The Netherlands (PALGA): E.C. van den Broek. HEBON thanks the study participants and the registration teams of IKNL and PALGA for part of the data collection. KARMA thanks the Swedish Medical Research Council. LMBC thanks Gilian Peuteman, Thomas Van Brussel, Evy Vanderheyden, and Kathleen Corthouts. MARIE thanks Petra Seibold, Nadia Obi, Sabine Behrens, Ursula Eilber, and Muhabbet Celik. ORIGO thanks E. Krol-Warmerdam and J. Blom for patient accrual, administering questionnaires, and managing clinical information. The authors thank the registration team of the Netherlands Comprehensive Cancer Organisation (IKNL) for the collection of data for the Netherlands Cancer Registry as well as IKNL staff for scientific advice. PBCS thanks Louise Brinton, Mark Sherman, Neonila Szeszenia-Dabrowska, Beata Peplonska, Witold Zatonski, Pei Chao, and Michael Stagner. The ethical approval for the POSH study is MREC /00/6/69, UKCRN ID: 1137. We thank the SEARCH and EPIC teams. SKKDKFZS thanks all study participants, clinicians, family doctors, researchers, and technicians for their contributions and commitment to this study. SZBCS thanks Ewa Putresza. UBCS thanks all study participants, the ascertainment, laboratory and research informatics teams at Huntsman Cancer Institute and Intermountain Healthcare, and Justin Williams, Brandt Jones, Myke Madsen, Melissa Cessna, Stacey Knight, and Kerry Rowe for their important contributions to this study. Special thanks are due to Stefano Bottelli for his R programming support. We highly appreciate the help of Tom Hueting in translating PREDICTCBC-2.0 into an online tool.
This work is supported by the Alpe d’HuZes/Dutch Cancer Society (KWF Kankerbestrijding) project 6253. BCAC is funded by Cancer Research UK [C1287/A16563, C1287/A10118], the European Union's Horizon 2020 Research and Innovation Programme (Grant Numbers 634935 and 633784 for BRIDGES and B-CAST, respectively), and by the European Community's Seventh Framework Programme under grant agreement number 223175 (Grant Number HEALTH-F2-2009-223175) (COGS). The EU Horizon 2020 Research and Innovation Programme funding source had no role in study design, data collection, data analysis, data interpretation, or writing of the report. Additional funding for BCAC is provided via the Confluence project which is funded with intramural funds from the National Cancer Institute Intramural Research Program, National Institutes of Health. The Australian Breast Cancer Family Study (ABCFS) was supported by grant UM1 CA164920 from the National Cancer Institute (USA). The ABCFS was also supported by the National Health and Medical Research Council of Australia, the New South Wales Cancer Council, the Victorian Health Promotion Foundation (Australia), and the Victorian Breast Cancer Research Consortium. J.L.H. is a National Health and Medical Research Council (NHMRC) Senior Principal Research Fellow. M.C.S. is a NHMRC Senior Research Fellow. The ABCS study was supported by the Dutch Cancer Society [Grants NKI 2007-3839; 2009 4363]. The work of the BBCC was partly funded by ELAN-Fond of the University Hospital of Erlangen. BOSOM was supported by the Dutch Cancer Society Grant Numbers DCS-NKI 2001-2423, DCS-NKI 2007-3839, and DCS-NKI 2009-4363; the Cancer Genomics Initiative; and notary office Spier & Hazenberg for the coding procedure.
The BREast Oncology GAlician Network (BREOGAN) is funded by Acción Estratégica de Salud del Instituto de Salud Carlos III FIS PI12/02125/Cofinanciado and FEDER PI17/00918/Cofinanciado FEDER; Acción Estratégica de Salud del Instituto de Salud Carlos III FIS Intrasalud (PI13/01136); Programa Grupos Emergentes, Cancer Genetics Unit, Instituto de Investigacion Biomedica Galicia Sur. Xerencia de Xestion Integrada de Vigo-SERGAS, Instituto de Salud Carlos III, Spain; Grant 10CSA012E, Consellería de Industria Programa Sectorial de Investigación Aplicada, PEME I + D e I + D Suma del Plan Gallego de Investigación, Desarrollo e Innovación Tecnológica de la Consellería de Industria de la Xunta de Galicia, Spain; Grant EC11-192. Fomento de la Investigación Clínica Independiente, Ministerio de Sanidad, Servicios Sociales e Igualdad, Spain; and Grant FEDER-Innterconecta. Ministerio de Economia y Competitividad, Xunta de Galicia, Spain. The EMC was supported by grants from Alpe d’HuZes/Dutch Cancer Society NKI2013-6253 and from Pink Ribbon 2012.WO39.C143. The HEBCS was financially supported by the Helsinki University Hospital Research Fund, the Finnish Cancer Society, and the Sigrid Juselius Foundation. The HEBON study is supported by the Dutch Cancer Society grants NKI1998-1854, NKI2004-3088, NKI2007-3756, NKI 12535, the Netherlands Organisation of Scientific Research grant NWO 91109024, the Pink Ribbon Grants 110005 and 2014-187.WO76, the BBMRI Grant NWO 184.021.007/CP46, and the Transcan Grant JTC 2012 Cancer 12-054. Financial support for KARBAC was provided through the regional agreement on medical training and clinical research (ALF) between Stockholm County Council and Karolinska Institutet, the Swedish Cancer Society, The Gustav V Jubilee foundation and Bert von Kantzows foundation. The KARMA study was supported by Märit and Hans Rausings Initiative Against Breast Cancer. 
LMBC is supported by the ‘Stichting tegen Kanker.’ The MARIE study was supported by the Deutsche Krebshilfe e.V. [70-2892-BR I, 106332, 108253, 108419, 110826, 110828], the Hamburg Cancer Society, the German Cancer Research Center (DKFZ) and the Federal Ministry of Education and Research (BMBF) Germany [01KH0402]. MEC was supported by NIH grants CA63464, CA54281, CA098758, CA132839 and CA164973. The ORIGO study was supported by the Dutch Cancer Society (RUL 1997-1505) and the Biobanking and Biomolecular Resources Research Infrastructure (BBMRI-NL CP16). The Netherlands Cancer Registry is hosted by the Netherlands Comprehensive Cancer Organisation (IKNL) and financed by the Dutch Ministry of Health, Welfare and Sports. The PBCS was funded by Intramural Research Funds of the National Cancer Institute, Department of Health and Human Services, USA. The POSH study is funded by Cancer Research UK (grants C1275/A11699, C1275/C22524, C1275/A19187, C1275/A15956 and Breast Cancer Campaign 2010PR62, 2013PR044). SKKDKFZS is supported by the DKFZ. The SZBCS was supported by Grant PBZ_KBN_122/P05/2004 and the program of the Minister of Science and Higher Education under the name "Regional Initiative of Excellence" in 2019–2022 project number 002/RID/2018/19 amount of financing 12 000 000 PLN. UBCS was supported by funding from National Cancer Institute (NCI) grant R01 CA163353 (to N.J. Camp) and the Women’s Cancer Center at the Huntsman Cancer Institute (HCI). Data collection for UBCS was supported by the Utah Population Database, Intermountain Healthcare and the Utah Cancer Registry which is funded by the NCI's SEER Program (HHSN261201800016I), the US Centers for Disease Control and Prevention's National Program of Cancer Registries (NU58DP006320), with additional support from the University of Utah and Huntsman Cancer Foundation.
Ethics approval and consent to participate
All studies were approved by the appropriate ethics and scientific review boards. All procedures performed in studies involving human participants were in accordance with the ethical standards of international, national, and institutional research committees and with the 1964 Declaration of Helsinki and its later amendments or comparable ethical standards.
Consent for publication
Competing interests
The authors declare that they have no competing interests.
Supplementary methods, also including the following tables and figures: Table S2. List of BCAC studies (including ABCS source) with the corresponding country and geographic area. Table S4. Clinical utility of the 5-year contralateral breast cancer risk prediction models (PredictCBC-1A with PredictCBC-2.0A and PredictCBC-1B with PredictCBC-2.0B). Figure S1. Visual assessment of calibration through calibration plots in the internal–external cross-validation at 5 years for the PredictCBC-2.0A model. Figure S2. Visual assessment of calibration through calibration plots in the internal–external cross-validation at 10 years for the PredictCBC-2.0A model. Figure S3. Visual assessment of calibration through calibration plots in the internal–external cross-validation at 5 years for the PredictCBC-2.0B model. Figure S4. Visual assessment of calibration through calibration plots in the internal–external cross-validation at 10 years for the PredictCBC-2.0B model. Figure S5. Density distribution of 5-year predicted contralateral breast cancer using PredictCBC-2.0 models. Figure S6. Decision curve analysis at 5 years for the contralateral breast cancer risk models (PredictCBC and PredictCBC-2.0) including BRCA mutation information.
Description of the studies included in the analyses.
Patient and primary breast cancer characteristics per study.
Giardiello, D., Hooning, M.J., Hauptmann, M. et al. PredictCBC-2.0: a contralateral breast cancer risk prediction model developed and validated in ~ 200,000 patients. Breast Cancer Res 24, 69 (2022). https://doi.org/10.1186/s13058-022-01567-3