Novel mammographic image features differentiate between interval and screen-detected breast cancer: a case-case study
Breast Cancer Research volume 18, Article number: 100 (2016)
Interval breast cancers are often diagnosed at a more advanced stage than screen-detected cancers. Our aim was to identify features in screening mammograms of the normal breast that would differentiate between future interval cancers and screen-detected cancers, and to understand how each feature affects tumor detectability.
From a population-based cohort of invasive breast cancer cases in Stockholm-Gotland, Sweden, diagnosed from 2001 to 2008, we analyzed the contralateral mammogram at the preceding negative screening of 394 interval cancer cases and 1009 screen-detected cancers. We examined 32 different image features in digitized film mammograms, based on three alternative dense area identification methods, by a set of logistic regression models adjusted for percent density with interval cancer versus screen-detected cancer as the outcome. Features were forward-selected into a multiple logistic regression model adjusted for mammographic percent density, age, BMI and use of hormone replacement therapy. The associations of the identified features were assessed also in a sample from an independent cohort.
Two image features, ‘skewness of the intensity gradient’ and ‘eccentricity’, were associated with the risk of interval compared with screen-detected cancer. For the first feature, the per-standard deviation odds ratios were 1.32 (95 % CI: 1.12 to 1.56) and 1.21 (95 % CI: 1.04 to 1.41) in the primary and validation cohort respectively. For the second feature, they were 1.20 (95 % CI: 1.04 to 1.39) and 1.17 (95%CI: 0.98 to 1.39) respectively. The first feature was associated with the tumor size at screen detection, while the second feature was associated with the tumor size at interval detection.
We identified two novel mammographic features in screening mammograms of the normal breast that differentiated between future interval cancers and screen-detected cancers. We present a starting point for further research into features beyond percent density that might be relevant for interval cancer, and suggest ways to use this information to improve screening.
Interval cancer (IC) is a cancer that is diagnosed in the interval between two mammographic screening visits. In this study, we have defined IC as breast cancer that is detected after a negative screen, or after a positive screen with a negative workup, but before the next regular visit or end of a normal screening interval, whichever came first. The length of a normal screening interval was between 18 and 24 months depending on age and county. IC has been shown to have a more aggressive phenotype than screen-detected cancer (SC) [1–7]. Several breast cancer screening strategies have been proposed which might decrease the IC incidence, such as increasing the screening frequency  or using supplemental imaging  in addition to mammography, e.g., magnetic resonance imaging (MRI) or ultrasound. However, adding examinations strain scarce medical resources, increase cost and impose a burden on the women in the screening program. There is a need to more accurately identify women at high risk of interval breast cancer and to understand which strategy would be most adequate for decreasing the risk of interval cancer by earlier screen detection.
Until now, the only feature of mammographic images of the normal breast that has been found to be associated with IC, compared with SC, is percent mammographic density (PD), i.e., the percentage of the pixels of the total breast area that has an intensity above a certain threshold. The association between high PD and interval cancer is believed to be related to an increased risk that dense benign tissue would mask a tumor [1, 7, 10]. If a woman has a high PD the tumor often needs to be larger before it can be detected on a screening mammogram [11, 12]. Previous studies of associations between interval cancer and image features other than PD have mainly focused on identifying pre-malignant changes [13, 14].
In the present study we examine a large set of image features, an approach that is often applied in machine learning. This feature-based approach has not been used previously in IC studies of the mammographic image of the normal breast phenotype. Our aim was to identify image features, in addition to PD, of preceding negative screening mammograms that might differentiate between interval cancers and screen-detected cancers. A secondary aim was to examine which feature might help us more accurately select women for whom supplemental imaging examinations would be most valuable in order to detect tumors earlier.
We analyzed breast cancer cases in the Libro-1 population-based cohort, which consists of women in the Stockholm-Gotland region diagnosed with breast cancer from 2001 to 2008. The Libro-1 cohort has been described in detail earlier [7, 15]. All women were identified through the Stockholm-Gotland Regional Breast Cancer quality register. Invitations were mailed out in 2009, together with informed consent documents and a link to an online questionnaire. The quality and organization of the Stockholm mammography screening program has been described in detail previously by Lind et al. . For our study, only women diagnosed with incident unilateral invasive breast cancer, without any other previous cancer with the exception of non-melanoma skin cancer, without prior breast surgery, for which mode of detection was available, and who had at least one pre-diagnostic mammogram and corresponding PD measurement were included (n = 1403). Our primary study sample consisted of 394 IC cases and 1009 SC cases.
To try to validate our findings, we analyzed a second population-based cohort of breast cancer cases, called ‘CAHRES’, which has been described in detail elsewhere . It contains incident breast cancer cases diagnosed from October 1, 1993, to March 15, 1995, and reported to any of the six Swedish Regional Cancer Registries. The validation sample included 1182 breast cancer cases (281 IC and 901 SC). The same inclusion and exclusion criteria were applied as for the primary cohort.
The data collection approach has been described in more detail previously . Information about body mass index (BMI), hormone replacement therapy (HRT), and other sociodemographic, anthropometric, hormonal, and lifestyle factors were obtained through questionnaires collected during 2009. Use of HRT was classified as ‘yes’ if HRT pills had been used during the year of diagnosis, and as ‘no’ otherwise. Tumor characteristics were obtained from linkage with the Stockholm-Gotland Regional Breast Cancer quality register.
We collected mammograms by contacting local mammography units as well as the national Swedish medical image repository in Vilhelmina, Sweden. All mammograms were analog film mammograms that were digitized using an Array 2905HD Laser Film Digitizer, which covers a range of 0 to 4.7 optical densities. The density resolution was set at 12-bit dynamic range. The data collection was performed similarly for the validation cohort.
All analysis was based on digitized film mammograms. Analysis was based on the last pre-diagnostic mammographic image per woman, using the mediolateral oblique view of the breast contralateral to the future tumor. To avoid image acquisition bias related to suspected or diagnosed cancer we did not consider mammograms that were acquired later than 30 days before diagnosis. PD values were calculated by an automated ImageJ-based method developed and validated by Li et al. in 2012 . This method attempts to mimic the outcome of the gold standard area-based PD measurement method, Cumulus , and a high correlation (r = 0.884) between the two methods was demonstrated.
Our aim was to study features of the dense area in the mammograms, which were obtained by three processing steps: pre-processing of the image, dense area selection, and feature extraction. First, the images were pre-processed by automatically removing nametags and other artifacts, as well as reducing contrast intensity differences between mammograms, and finally by removing the image area corresponding to the pectoral muscle. Second, the dense area of the mammogram was automatically identified by a thresholding method. The dense area thus identified was not necessarily a single continuous area, but could consist of separate parts of the image. The primary thresholding approach was chosen to be Otsu’s method  for two reasons: it has been widely used and it was the only specified method used in the previous two studies of image features and interval cancer [14, 21]. In addition to Otsu’s method, we used two alternative thresholding methods: the percentile method and the entropy of histogram method . Otsu’s method was based on minimizing the weighted sum of within-class variances of the foreground and background pixels to establish an optimum threshold. The percentile method was based on defining the dense area as all pixels with intensity above the median, i.e., the 50th percentile. The entropy of the histogram method is based on maximizing the entropy of the histogram, which can be interpreted as maximum information transfer.
The third step in the image analysis involved the calculation of 32 different features based on the identified dense area. These were the same features as in a previous study from our group, which aimed to predict percent density from statistical features of digital mammograms (using machine learning approaches) . That study showed that a density estimate based on feature extraction from processed digital mammograms was associated with breast cancer status, with a similar amount of evidence as that found for density measures calculated using another automated method (Volpara) on raw digital images. In the above-mentioned previous study, the features were selected to represent various feature categories. The statistical image features are based on the intensity value of each pixel, and can belong to either first-order or higher-order statistics. First-order statistics are based on the histogram of all pixels; examples include median, skewness and kurtosis. Higher-order statistics take the spatial relationship between pixels into account; examples include shape features based on fitting a shape to the silhouette of the segmented area and texture features based on measurements of the difference between neighboring pixels. The features were calculated using the computer software Matlab . There were only two mammograms for which the quality did not allow feature calculations; one interval cancer and one screen-detected cancer.
Before performing any statistical tests, we transformed the feature measures using a Box-Cox transformation after which they were standardized, in order to attain near standard normal distributions. We carried out 3 × 32 tests of association – three tests (the three thresholding methods described above) for each of the 32 extracted image features (Table 2). These (Wald) tests were based on fitting logistic regression models with IC versus SC status as the outcome and using continuous PD as an adjustment variable. We performed a global test of association testing the null hypothesis that none of the features were associated with IC versus SC status by examining the number of test results that were significant at the 5 % level (global test statistic). An empirical (global) level of significance was obtained by permuting IC versus SC status over a large number of simulations (10,000), and calculating the fraction of (global) test statistic values based on permuted data that were larger than the test statistic value obtained for the non-permuted data set. This global test is similar to Wilkinson’s test  but accounts for the correlation of the features.
To identify individual features, we used a forward selection procedure (the step function in R ) based on our logistic regression model with IC versus SC status as outcome and with continuous PD, age at diagnosis, BMI and HRT as covariates. The forward selection was based on features extracted from the dense area identified by our primary thresholding approach, Otsu’s method (see ‘Image analysis’). To test for potential confounding by different length of the time between mammography and diagnosis for IC and SC, we extended the final model by adding this as a covariate. The associations between the identified features and IC versus SC status were assessed in the validation cohort. Finally, we estimated univariate linear regression models with tumor size as the outcome and each feature as the predictor. Since tumor size was left-skewed we square-root-transformed the variable prior to analysis. The same analysis was performed within the SC and the IC subgroups separately. Additionally, regression models were fitted to examine the association between each of the identified features and each risk factor listed in Table 1. All statistical tests were two-sided. Statistical analyses were carried out in R  or Stata .
We included 1403 women, 394 IC cases and 1009 SC cases, in the primary study sample (Table 1). The women with IC, compared with SC, had a significantly lower BMI, higher PD, more often had used HRT during the year before diagnosis, and more often had a positive family history of breast cancer. Compared with SC tumors, the IC tumors were larger, had more often lymph node metastasis, were of a higher Elston grade, and were more often hormone receptor negative. Corresponding patient and mammographic characteristics for the validation sample can be found in Additional file 1: Table S1.
Individual p values for the associations between IC versus SC status and each combination of feature and dense area selection method are presented in Table 2, along with a short technical description of each feature. According to a global test of association we could refute the null hypothesis that none of the image features are associated with IC versus SC status with a p value of 0.007. Using a forward selection procedure, features ‘skewness of the intensity gradient’ and ‘eccentricity’ of the dense area were included in the final model. Eccentricity is calculated by fitting an ellipse to the dense area, and then taking the ratio of the distance between the foci of the ellipse and its major axis. Examples of mammograms with high and low ‘skewness of the intensity gradient’ are presented in Fig. 1, while examples of mammograms with high and low values of ‘eccentricity’ are presented in Fig. 2.
Table 3 presents the associations of the two image features and IC versus SC status in multivariate regression modelling. In the fully adjusted model, the per-standard deviation odds ratio (OR) for ‘skewness of the intensity gradient’ was 1.32 [95 % confidence interval (95 % CI): 1.12 to 1.56] in the primary cohort and 1.21 (95 % CI: 1.04 to 1.41) in the validation cohort, while the corresponding OR for ‘eccentricity’ was 1.20 (95 % CI: 1.04 to 1.39) and 1.17 (0.98 to 1.39). In the primary cohort, the median time periods between pre-diagnostic mammogram and diagnosis were 24.6 and 15.3 months for screen-detected and interval cancers respectively. Adjusting the final regression model for this time period only slightly changed the estimated odds ratios, and did not affect the significance of our results. Estimates of the association between each of the two identified features and the risk factors in Table 1 can be found in Additional file 2: Table S2.
As shown above, a high ‘skewness of the intensity gradient’ and a high ‘eccentricity’ were both associated with an increased risk of interval cancer compared with screen-detected cancer. From fitting linear regression models with tumor size as the outcome, separately for SCs and ICs, we found significant evidence that high ‘skewness of the intensity gradient’ of the dense area is associated with an increased tumor size for SCs, but not for ICs (Table 4). A high ‘eccentricity’ of the dense area was significantly associated with a decreased tumor size for ICs, but not for SCs (Table 4). A high ‘eccentricity’ of the dense area was significantly associated with a decreased tumor size for ICs, but without association with tumor size for SCs.
We identified two novel mammographic features in screening mammograms of the normal breast that differentiated between future interval cancers and screen-detected cancers. Since we had no prior hypothesis about which features should be related to interval cancer, we examined a range of statistical and textural features used in a previous study  by our group. This set of features is not an exhaustive list of possible features to examine; many others have been used in mammographic studies previously [28, 29].
We identified two novel image features that were significantly different between interval and screen-detected breast cancers: first, the ‘skewness of the intensity gradient’, and second, the ‘eccentricity’ of the dense area in the mammogram. The two features are described in more technical detail in the Appendix. The p value for the association between certain features and IC versus SC status changed markedly depending on which thresholding method was used. This might be interpreted as either that certain features being more or less robust than others or that a specific thresholding method is better suited for calculating certain features. The two features that we identified were significant for two of three thresholding methods, and therefore considered to be quite robust. The first feature, after adjustment for percent density, was associated with a larger tumor size at screen detection, and thus related to a reduced mammographic detectability. We speculate that this feature is a reflection of the extent to which the dense area is interspersed by fatty streaks, reducing the size that a tumor must attain before being detectable at screening mammography. The second feature is related to how elongated the overall shape of the dense area is. This feature was associated with the size at interval cancer detection, most of which are detected by palpation. The mechanism why an elongated shape would promote early clinical detection is unclear. Speculatively, it might be related either to less firm normal breast tissue, or to tumors being located closer to the skin, both of which would enhance palpability. In Fig. 3, we illustrate how different combinations of high or low values of these two features may be manifested in an image.
Several breast cancer screening strategies to reduce the incidence of IC have been proposed, such as using a supplemental imaging , e.g., MRI or ultrasound, which would be most effective against masked tumors. If validated in a cohort including healthy women, the two mammographic features might increase our ability to individually tailor the breast cancer screening strategy. Supplemental imaging should potentially be directed toward women with a high percent density and a high value of our first feature ‘skewness of the intensity gradient’.
Previous studies of associations between interval breast cancer and image features include Banik et al.  who examined features potentially related to ‘architectural distortion’. The study was small, (56 interval cancer cases and 13 screen-detected cases) and did not take PD into consideration. Another study, by Tan et al. , examined the association between near-term breast cancer and the difference between image features of the left versus the right breast mammogram. Neither study included information about use of HRT, which is an established IC risk factor [7, 10]. Both studies were aimed at identifying pre-malignant changes, in principle masked tumors, while our study was aimed at features of the normal breast that would impact the risk of IC of any type: masked, fast-growing or facilitated clinical detection.
One strength of our study is that we were able to use an independent breast cancer cohort for the validation of the two features in a fully adjusted model. Another strength is that we not only examined image features but also tried alternative methods to select the dense area. We were also able to adjust for the potential risk factors PD and HRT, as well as age at mammography and BMI, with a relatively low degree of missingness. A potential weakness of our study is that BMI was measured several years after the analyzed mammographic images were acquired. This might have affected the (lack of) association between BMI and IC versus SC status in our final model. However, from previous literature there is no consistent evidence that such an association exists . A limitation of our study is that we only examined digitized film mammograms. However, the same feature algorithms have been used previously for processed digital images, but for other purposes, and would thus need to be validated for digital mammograms in relation to IC versus SC status. The exact values of the features are the result of pre-processing, dense area selection, and feature calculation. This entire process was assessed in two independent cohorts yielding comparable odds ratios. In future studies aimed at validating our results, it would be important to perform all three steps according to our method.
In conclusion, using a novel feature-based approach, we found two mammographic features of screening mammograms of the normal breast that differentiated between future interval cancers and screen-detected cancers independently of percent density. We present a starting point for further research into the utility of image features as a way to identify women at risk of interval cancer.
- 95 % CI:
95 % confidence interval
body mass index
hormone replacement therapy
magnetic resonance imaging
percent (mammographic) density
Eriksson L, Czene K, Rosenberg LU, et al. Mammographic density and survival in interval breast cancers. Breast Cancer Res. 2013;15:R48.
Kirsh VA, Chiarelli AM, Edwards SA, et al. Tumor characteristics associated with mammographic detection of breast cancer in the Ontario breast screening program. J Natl Cancer Inst. 2011;103:942–50.
Gilliland FD, Joste N, Stauber PM, et al. Biologic characteristics of interval and screen-detected breast cancers. J Natl Cancer Inst. 2000;92:743–9.
Porter PL, El-Bastawissi AY, Mandelson MT, et al. Breast tumor characteristics as predictors of mammographic detection: comparison of interval- and screen-detected cancers. J Natl Cancer Inst. 1999;91:2020–8.
Domingo L, Blanch J, Servitja S, et al. Aggressiveness features and outcomes of true interval cancers: comparison between screen-detected and symptom-detected cancers. Eur J Cancer Prev. 2013;22:21–8.
Domingo L, Sala M, Servitja S, et al. Phenotypic characterization and risk factors for interval breast cancers in a population-based breast cancer screening program in Barcelona, Spain. Cancer Causes Control. 2010;21:1155–64.
Holm J, Humphreys K, Li J, et al. Risk factors and tumor characteristics of interval cancers by mammographic density. J Clin Oncol. 2015;33:1030–7.
Gocgun Y, Banjevic D, Taghipour S, et al. Cost-effectiveness of breast cancer screening policies using simulation. Breast. 2015;24:440–8.
Morris EA. Rethinking breast cancer screening: ultra FAST breast magnetic resonance imaging. J Clin Oncol. 2014;32:2281–3.
Chiarelli AM, Kirsh VA, Klar NS, et al. Influence of patterns of hormone replacement therapy use and mammographic density on breast cancer detection. Cancer Epidemiol Biomarkers Prev. 2006;15:1856–62.
van Gils CH, Otten JD, Verbeek AL, et al. Mammographic breast density and risk of breast cancer: masking bias or causality? Eur J Epidemiol. 1998;14:315–20.
Boyd N, Guo H, Martin L, et al. Mammographic density and the risk and detection of breast cancer. N Engl J Med. 2007;356:227–36.
Bertrand K, Tamimi R, Scott C, et al. Mammographic density and risk of breast cancer by age and tumor characteristics. Breast Cancer Res. 2013;15:R104.
Banik S, Rangayyan RM, Desautels JE. Detection of architectural distortion in prior mammograms of interval-cancer cases with neural networks. Conf Proc IEEE Eng Med Biol Soc. 2009;2009:6667–70.
Cheddad A, Czene K, Shepherd JA, et al. Enhancement of mammographic density measures in breast cancer risk prediction. Cancer Epidemiol Biomarkers Prev. 2014;23:1314–23.
Lind H, Svane G, Kemetli L, et al. Breast cancer screening program in Stockholm County, Sweden - aspects of organization and quality assurance. Breast Care (Basel). 2010;5:353–7.
Magnusson C, Baron J, Persson I, et al. Body size in different periods of life and breast cancer risk in post-menopausal women. Int J Cancer. 1998;76:29–34.
Li J, Szekely L, Eriksson L, et al. High-throughput mammographic-density measurement: a tool for risk prediction of breast cancer. Breast Cancer Res. 2012;14:R114.
Byng JW, Boyd NF, Fishell E, et al. The quantitative analysis of mammographic densities. Phys Med Biol. 1994;39:1629–38.
Otsu N. A Threshold selection method from gray-level histograms. IEEE Trans Syst Man Cyb. 1979;9:62–6.
Gonzalez RCW, RE; Eddins SL. Image segmentation/thresholding, Digital image processing using MATLAB (2nd ed). Natick: Gatesmark, LLC; 2009, pp 557–7
Tan M, Zheng B, Ramalingam P, et al. Prediction of near-term breast cancer risk based on bilateral mammographic feature asymmetry. Acad Radiol. 2013;20:1542–50.
Cheddad A, Czene K, Eriksson M, et al. Area and volumetric density estimation in processed full-field digital mammograms for risk assessment of breast cancer. PLoS ONE. 2014;9:e110690.
MATLAB. Version 8.5. Natick: The MathWorks Inc; 2015.
Wilkinson B. A statistical consideration in psychological research. Psychol Bull. 1951;48:156–8.
R Development Core Team. R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2015.
StataCorp. Stata Statistical Software. College Station: StataCorp LP; 2013.
Haberle L, Wagner F, Fasching PA, et al. Characterizing mammographic images by using generic texture features. Breast Cancer Res. 2012;14:R59.
Manduca A, Carston MJ, Heine JJ, et al. Texture features from mammographic images and risk of breast cancer. Cancer Epidemiol Biomarkers Prev. 2009;18:837–45.
Krishnan K, Baglietto L, Apicella C, et al. Mammographic density and risk of breast cancer by mode of detection and tumor size: a case-control study. Breast Cancer Res. 2016;18:63.
The authors wish to gracefully acknowledge funding from the Swedish Research Council (grant no: 2014-2271 and 521-2011-3205), the Swedish Cancer Society (grant no: CAN 2013/469 and CAN 2014/472), the Cancer Society in Stockholm (grant no: 141092) and Stockholm County Council (grant no: 20150332).
Availability of supporting data
Subject to participants’ consent and legal requirements, data can be made available upon request to the primary department of the corresponding author.
FS was involved in shaping the research question, performed the statistical analyses, was responsible for interpreting the results, and drafted the manuscript. KH helped in selecting the image analysis method, in specifying the statistical methods and in drafting and revising the manuscript. AC was responsible for the image analysis including feature set selection, Matlab programming and calculating all feature values, and was involved in revising the manuscript. ST was involved in collection and assembly of data, and in revising the manuscript. EA helped in interpreting the results, and in revising the manuscript. JS helped in defining the research idea, in interpreting the results, and revising the manuscript. PH helped in collection and assembly of data, in revising the manuscript, and helped with the interpretation of data. KC was involved in developing the research idea, defining the study design, collection and assembly of data, interpreting the results, and in revising the manuscript. All authors read and approved the final manuscript.
The authors declare that they have no competing interests.
Consent for publication
Ethical approval and consent to participate
All participants in both the primary and the validation cohort had provided written informed consent, and the studies had the approval of the ethics review board at Karolinska Institutet, Stockholm, Sweden (DNR2009/254-31/4).
Appendix – technical description of identified features
Appendix – technical description of identified features
Feature F21 – ‘eccentricity’
The technical description of Feature 21 is eccentricity. Eccentricity was calculated by first fitting an ellipse according to the perimeter of the dense area based on the second moment method, and then calculating the ratio between the distance between the foci of the ellipse and its major axis. A single ellipse was fitted regardless of whether the dense area was one continuous shape or consisted of separate parts. Eccentricity is a continuous measure ranging from 0 to 1. A high eccentricity value means the ellipse is more elongated, less circular, and a low eccentricity value means the ellipse is rounder, more circular. All feature calculations were carried out in Matlab .
Feature F40 – ‘skewness of the intensity gradient’
The technical description of Feature 40 is the skewness of the intensity gradient. The intensity gradient is the change in intensity value between a certain pixel and its neighboring pixel. All pixels outside the selected dense area were defined as having zero intensity. The gradient in our analysis was based only on increases in the horizontal direction. When calculating the skewness, all non-dense area pixels are defined as having zero intensity. The largest gradient is thus achieved at the border between the dense area and the non-dense area. This gives the gradient value histogram a bimodal distribution, with the border pixels corresponding to large gradients and the interior pixels corresponding to smaller gradients. The intensity gradient is always skewed since there are more interior pixels than border pixels. The skewness is increased for a dense area having a smaller proportion of border pixels compared to interior pixels. All feature calculations were carried out in Matlab .
About this article
Cite this article
Strand, F., Humphreys, K., Cheddad, A. et al. Novel mammographic image features differentiate between interval and screen-detected breast cancer: a case-case study. Breast Cancer Res 18, 100 (2016). https://doi.org/10.1186/s13058-016-0761-x