Mammographic density, breast cancer risk and risk prediction
© BioMed Central Ltd 2007
Published: 20 December 2007
Skip to main content
© BioMed Central Ltd 2007
Published: 20 December 2007
In this review, we examine the evidence for mammographic density as an independent risk factor for breast cancer, describe the risk prediction models that have incorporated density, and discuss the current and future implications of using mammographic density in clinical practice. Mammographic density is a consistent and strong risk factor for breast cancer in several populations and across age at mammogram. Recently, this risk factor has been added to existing breast cancer risk prediction models, increasing the discriminatory accuracy with its inclusion, albeit slightly. With validation, these models may replace the existing Gail model for clinical risk assessment. However, absolute risk estimates resulting from these improved models are still limited in their ability to characterize an individual's probability of developing cancer. Promising new measures of mammographic density, including volumetric density, which can be standardized using full-field digital mammography, will likely result in a stronger risk factor and improve accuracy of risk prediction models.
Combined relative risks for breast cancer associated with different classifications of mammographic density, study designs, and study populations from meta-analysis 
RR (95% CI)
RR (95% CI)
RR (95% CI)
Wolfe parenchymal pattern
1.8 (1.4, 2.2)
1.3 (1.0, 1.5)
1.0 (0.77, 1.3)
3.1 (2.5, 3.7)
2.0 (1.3, 3.0)
1.5 (0.91, 2.4)
4.0 (2.5, 6.3)
2.4 (2.0, 3.0)
1.7 (1.0, 2.8)
Percentage mammographic density
1.8 (1.5, 2.2)
1.4 (1.1, 1.8)
2.1 (1.7, 2.6)
2.2 (1.8, 2.8)
2.9 (2.5, 3.4)
2.9 (2.3, 3.8)
4.6 (3.6, 5.9)
3.7 (2.7, 5.0)
Vacekand Geller 
Ziv et al. 
0.3 (0.2, 0.4)
2.2 (1.6, 3.0)
1.6 (0.9, 2.8)
3.0 (2.2, 4.1)
1.3 (1.1, 1.5)
2.3 (1.3, 4.3)
4.0 (2.8, 5.7)
20.1 (1.6, 2.8)
4.5 (1.9, 10.6)
The magnitude and consistency of the MD and breast cancer association place its importance as a breast cancer risk factor alongside age, the presence of atypia on a breast biopsy, or carrying a highly penetrant breast cancer susceptibility gene (for example, BRCA1 and BRCA2) , the latter two which are rare in the population and responsible for only a small proportion of breast cancer. However, until recently, MD has not been used in clinical risk prediction models or clinical decision making. The purpose of this review is to summarize the evidence and strength of MD as a risk factor, review the studies that have evaluated MD in risk prediction, and discuss the implications of incorporating this trait into clinical practice for improving breast cancer risk assessment.
Classifications of mammographic density
Computer-assisted thresholding method
Visual classification of the mammographic image into four categories based on extent and distribution of the parenchyma, including ducts, nodular, homogeneous densities, and fat.
Standardized reporting of visual assessment of mammographic findings by the American College of Radiology BI-RADS. Both breasts are used for the BI-RADS
An acetate overlay is placed over the mammogram image to outline the breast, and an expert reader traces the areas of breast density. The total breast and dense areas are measured with an outlining tool.
Mammographic films are digitized, and two threshold values are selected. The first separates the breast from background, and the second identifies the edges of the regions representing radiographically dense tissue. The numbers of pixels comprising the total breast area and those for dense area are calculated. Results can be reported as percentage density (the ratio of the dense pixels to total breast area) or absolute area of density (in pixels, square centimeters or square millimeters)
Radiologist or expert reader subjectively assigns a percentage density corresponding to the proportion of breast that is dense.
N1 – Completely fatty breast
P1 – Mainly fatty breast with prominent ducts, up to 25% density
P2 – Prominent ducts, more than 25% density
DY – No visible ducts, diffuse and extensive nodular density .
Category 1 – Almost entirely fatty (<25% dense)
Category 2 – Scattered fibroglandular densities (approximately 25%–50% dense)
Category 3 – Heterogeneously dense (approximately 51%–75% dense)
Category 4 – Extremely dense (>75% dense) .
Percentage density on a continuous scale (0%–100%).
Can be categorized for analyses (such as 0%, 1%–24%, 25%–49%, 50%–74%, 75%) .
Percentage density (0%–100%) or absolute area of density (in pixels, square centimeters or square millimeters) on continuous scale . Both measures can also be categorized.
Percentage or absolute density can be categorized.
Visually estimated directly into categories (such as 0%, <10%, 10%–<25%, 25%–<50%, 50%–<75%, >75%) .
MD is correlated with several breast cancer risk factors; the strongest associations are seen with body mass index (BMI) and age [4, 8]. Importantly, though, MD is an independent risk factor for breast cancer, illustrated in the majority of studies by its robust association with breast cancer after adjustment for these and other risk factors for breast cancer [4, 8]. In fact, recent studies of percentage MD and breast cancer illustrated that failure to adjust for BMI resulted in underestimation of the effect of MD on risk [17, 18]. Thus, the adjustments for BMI and age are important for accurately estimating the risk associated with MD.
Few studies have examined potential modifying effects of risk factors on the MD and breast cancer association. Hormone replacement therapy (HRT), especially combination therapy, consistently shows a strong positive association with MD  and should be considered in analyses of MD with risk, but studies have not seen a modification of the MD and risk association by HRT use [18, 20]. A stronger association of MD and risk has been seen among women with breast cancer in a first-degree relative [21–24]; but among carriers with a BRCA1 or BRCA2 mutation, relative risks were similar to those of non-carriers . The suggestion of stronger associations among women with high BMI  has been reported, as well as findings that the higher risk associated with low parity is stronger among women with high MD [20, 26]. No difference of association has been seen by alcohol use . To date, there is little consistent evidence that the risk associated with MD varies according to other risk factors for breast cancer.
The MD and breast cancer association is not limited to older or younger women of mammogram age. But there is currently no consensus as to whether the association is stronger among one age or menopausal group. Some studies [4, 12, 18, 28] observed stronger risk estimates among postmenopausal women (or those over age 50), whereas others [6, 29] found stronger associations in younger or premenopausal women or neither group . The recent meta-analysis suggested stronger relative risks at older ages that were limited to the 25% to 49% category (versus less than 5%) but no consistent increase across all categories . Importantly, a larger proportion of premenopausal women have dense breasts (greater than 50% dense), with estimates of 37% among premenopausal women compared with 12% among postmenopausal women. Even without significant differences in association by menopausal status, the attributable risk is much higher in younger women (26%) than in older women (7%) . This underscores the importance of MD for potential risk prediction in younger women.
Unfortunately, due to the nature of this trait's dependence on a mammogram for estimation, the significance of MD in young women below mammogram age is unknown.
In addition, MD has been seen to be associated with increased risk across several ethnic groups. Studies of Caucasians, African-Americans, and Asian-Americans [12, 14, 31] have all shown increased risk with percentage or area density. However, the magnitude of association has been weaker [14, 31] or inconsistent in the Asian and Asian-American populations [12, 20, 32], questioning the importance of this predictor in the Asian population. In fact, some have suggested that absolute area of density is a better measure of breast cancer risk than percentage density in the Asian population due to their distinct physical proportions . In general, MD assessed as the parenchymal pattern, percentage density, and absolute area of density appears to be a strong risk factor in a number of populations.
The relationship between MD and breast cancer is thought to be multifactorial, and in early studies, the main explanation was thought to be due to 'masking bias' . In breasts with extensive MD, cancers may be masked because they have the same x-ray attenuation properties as fibroglandular tissue. At an initial mammogram, then, cancers in dense breasts would often escape detection and could manifest themselves shortly thereafter. Therefore, the sole inclusion of incident cases arising shortly after a negative screening examination would erroneously give the impression of increased breast cancer risk in women with extensive MD. The MD and breast cancer association was expected to disappear with longer follow-up and repeated screening. But two large cohort studies from the 1990s [4, 28] challenged the 'masking bias' hypothesis, finding increased breast cancer risks for at least 7 to 10 years after a screening examination. This is also confirmed in the latest large-scale studies on MD and breast cancer risk [6, 13, 35]. Furthermore, although relative risks for breast cancer are higher when studying incident cases diagnosed relatively shortly after a negative examination than when studying prevalent cases, risk is still strong among prevalent cases [36–38]. Similarly, although relative risks are higher when studying interval cancers than when studying screen-detected cancers, studies of screen-detected cancers still demonstrate a strong association [6, 37]. This was recently illustrated in three nested case-control studies by Boyd and colleagues , who found that compared with women with density in less than 10% of the mammogram, women with more than 75% density had an increased risk of breast cancer (odds ratio [OR] = 4.7; 95% confidence interval [CI]: 3.0, 7.4), whether detected by screening (OR = 3.5; 95% CI: 2.0, 6.2) or detected within 12 months of a negative screening examination (OR = 17.8; 95% CI: 4.8, 65.9).
In summary, the MD and breast cancer association is robust irrespective of measurement of MD, strong in magnitude, not explained by masking bias, independent of the influence of other risk factors, and generalizable to several populations, including both premenopausal and postmenopausal women. Due to the high prevalence of increased MD in the population, this risk factor could explain a large proportion of breast cancers as well as provide additional clinical information for breast cancer risk prediction. Translating the estimates of risk corresponding to different levels of MD into a model that could be used as an assessment tool for breast cancer risk prediction is a logical consideration and is explored in the following section.
The extant literature on MD and breast cancer is based almost entirely on the OR or relative risk as the measure of association. However, these reports do not adequately describe the ability of a risk factor to discriminate between individuals who are at higher and lower risk of an outcome . To assess the predictive quality of a risk factor, such as MD, other measures of association are required. The most commonly used measure of discriminatory power is often referred to as the C-statistic. This quantity is the concordance statistic that measures the agreement between predicted and observed outcomes. For diagnostic tests, this is equivalent to the area under the receiver operating characteristic (ROC) curve. A mathematically identical measure can be computed for outcomes that have a time-to-event component. It is computed by forming all possible pairs of subjects whose outcomes are distinct (for example, one case and one control) and by tallying the number of pairs in which the subject with higher observed risk also has a higher risk predicted by the risk factor, or risk model, of interest . This C-statistic ranges from 0.5 to 1.0. Values of 0.5 indicate that the risk predictions are no better than a coin toss at discriminating a high-risk from a low-risk individual, and values of 1.0 show that the risk prediction can make a perfect discrimination.
The most commonly used breast cancer risk prediction tool is the Gail model [42, 43]. This validated model is comprised of six breast cancer risk factors, including age, age at menarche (less than 12 years, 12, 13, 14 or more), age at first live birth (nulliparous, less than 20 years, 20 to 24, 25 to 29, 30 or more), number of first-degree relatives with breast cancer (0, 1, 2 or more), number of biopsies (0, 1, 2 or more), and presence of atypical hyperplasia on a biopsy (yes or no). The initial Gail model was based on a primarily Caucasian population participating in the Breast Cancer Detection and Demonstration Project (BCDDP) . Later modifications to this model included replacing the breast cancer incidence rates from the BCDDP with estimates from the Surveillance, Epidemiology, and End Results program, allowing for risk estimates for African-American and Asian-American women . These modifications form the model that is known as the Gail model 2 and are implemented in the National Cancer Institute Breast Cancer Risk Assessment Tool . Clinically, this computerized model is used 20,000 to 30,000 times each month to calculate a prediction of the absolute risk of breast cancer . The Gail model 2 has been shown to be well calibrated, with the predicted number of breast cancers being nearly equal to the number of breast cancers observed . However, the C-statistic for the model is low and has been estimated to be equal to 0.58 in an external validation study . Because of this modest concordance between observed and predicted events, the Gail model is currently of limited practical usefulness for obtaining risk estimates for any given individual. Even so, it remains the most commonly used tool for breast cancer risk prediction.
One approach to improve the precision of existing risk prediction models is to incorporate other major risk factors. Since MD is a strong risk factor with high population-attributable risk , it is a likely candidate for addition to risk prediction models. In fact, several studies have assessed the contribution of a measure of MD to these models, including the Gail model. The first of these examined the addition of the BI-RADS measure of density to the original Gail model in a multiethnic population . In this study, the racial composition of the participants was more varied than in the original study on which the Gail model was based. The authors also built a simple model using only the BI-RADS measure, age, and participants' ethnicity which performed on a par with the original Gail model in this study set. However, important variables were not available for inclusion in the model. The most notable of these is BMI, which is known to be highly correlated with MD and whose inclusion has been shown to strengthen risk estimates of MD . Two additional studies were performed to incorporate density into risk prediction models. One of these was constructed with the intent of providing a counseling tool at the time of mammography . The model of Barlow and colleagues  was built using participants who were seen at one of seven participating centers in the National Institutes of Health Breast Cancer Surveillance Consortium and incorporated the BI-RADS density measure. The models were evaluated among premenopausal and postmenopausal women separately; besides BI-RADS density, the covariates included age, prior breast procedure, and first-degree family history of breast cancer for premenopausal women with the additional covariates of Hispanic origin (yes/no), race, BMI, age at first birth, current HRT, surgical menopause, and previous mammographic outcome for postmenopausal prediction models. Because the analyses were conducted on data combined across centers, many of the covariates of interest had a substantial degree of missing data, in particular on BMI. In addition, the model of Barlow and colleagues was developed and validated to predict 1-year risk and therefore likely overemphasizes masked cancers and could overestimate long-term risk. The third model to incorporate MD into a risk prediction tool was a direct extension of the Gail model 2  constructed on the same BCDDP populations used in the initial Gail model. Weight, rather than BMI, was available and included in the model. Unlike in the two previous studies, Chen and colleagues  employed the planimetry assessment of MD (Table 1), which provided a quantitative and more reproducible MD estimate on all mammograms. However, mammograms were not available on all subjects in the BCDDP; consequently, the number of subjects whose data went into the estimation of the relative risk model was much smaller than was available in the initial Gail model.
Although the addition of MD shows promise for improving risk prediction models, the information provided by these current measures of MD improves the discriminatory power of risk prediction models only incrementally. The application of risk prediction models including MD will likely be preferred to the existing Gail model, but even with MD, these models remain limited in their ability to provide accurate individual estimates of absolute risk. However, their use in targeting high-risk groups on a population basis for the impact on modalities of screening, intervals for surveillance, or preventive therapies or strategies remains important.
The current measurement of MD on a two-dimensional view has proven to be robust in assessing breast cancer risk. However, this assessment is not invariant to compression and projection angle. Methods are now being developed that take into account these factors and provide an estimation of the volume of the fibroglandular tissue relative to the total breast volume. These methods have been developed for digitized mammograms and work best when reliable calibration data (such as milliampere-second and peak kilovoltage) have been collected with the mammogram [50–52]. With full-field digital mammography (FFDM), calibration data are stored automatically, and recently several FFDM volumetric assessment methods have been developed and validated [53, 54]. Volumetric methods are expected to provide even stronger breast cancer risk estimates than observed until now, as the relative amount of fibroglandular tissue can be measured more precisely.
In addition, change in MD (or volumetric density) may provide improvement in risk prediction. Recent evidence illustrated that changes in BI-RADS categories within women (which coincide with substantial changes in percentage density) over an average of 3 years of follow-up were associated with risk . This association was applicable only to women with average (BI-RADS of 2) or high (BI-RADS of 3) density; women with the highest density (BI-RADS of 4) remained at greatest risk of breast cancer, and their risk did not decrease with reductions in density, even when lowering their density by three categories. Thus, multiple measures of MD could also be important for predicting a woman's risk.
Because breast cancer is a heterogeneous disease, it is likely that there are different risk factors associated with subtypes. As an example, estrogen receptor-positive/progesterone receptor-positive (ER+/PR+) cancers appear to share a different risk factor profile than ER-/PR- . To date, the limited data show MD as a risk factor in both ER+ and ER-cancers (and likewise for PR+ and PR-)  and both in situ and invasive cancers [58, 59]. But there is the possibility that MD is a stronger risk factor for cancers of certain histologic subtypes (such as lobular cancers) or grade . The ability to predict breast cancer risk may then be improved when models are used to predict specific subtypes of breast cancers.
If the risk prediction models that incorporate MD show improved discrimination between cases and controls in validation studies and across populations, they could readily be incorporated into clinical practice and replace the existing Gail model. However, the standardization of the MD measure will be important to consider prior to their widespread use. Of the three risk models that have incorporated MD to date, two have used the BI-RADS density estimate and the other, planimetry or tracing method (Table 1). The BI-RADS estimate is currently assessed by most mammography clinics and could easily be factored into the new risk models. However, the quantitative MD estimates, such as the planimetry method, require an experienced reader with ongoing training, evaluation, quality control, and standardization across sites. Fortunately, FFDM will allow for automated measures of both area and volumetric density, which could provide highly standardized measurements with appropriate calibration. However, only 15% of mammogram units in the US are FFDM , which speaks to the slow rate at which such MD measures may be routinely available for risk models.
Although there is promise for improved risk prediction with incorporation of MD, younger women who are not yet of mammogram age (usually less than 40) will not reap the benefits. These women are also those most likely to benefit from improved risk assessment and targeted early prevention strategies. With 30% of MD explained by known epidemiologic risk factors  and 30% to 60% by genetics [5, 63], a large portion of variability in MD could be understood. Thus, factors that explain MD, including both classical risk factor and genetic information, could replace the MD measure in risk models for younger women. Otherwise, evaluation of younger women by cost-effective and lower radiation alternative imaging modalities (such as a single-view mammogram) or MRI is a potential option that could provide an MD estimate for risk models in these women.
In this review, we have attempted to illustrate that MD is now firmly established as a risk factor for breast cancer. The association with breast cancer risk is not the reflection of masking bias and is independent of other breast cancer risk factors. A few recent studies have incorporated aspects of the mammogram into risk prediction models, with modest but demonstrable improvements. As yet, none of these studies has explored the potential contribution of the best quantitative measures of MD. With additional calibration and increasing use of FFDM in screening, it is possible to envision clinical adoption of MD quantification in risk prediction models. Given the evidence that the pathogenesis of breast cancer likely begins early in life, prior to the age at which mammography is recommended, additional risk prediction approaches are likely necessary to broadly affect risk reduction.
Breast Cancer Detection and Demonstration Project
Breast Imaging Reporting and Data System
body mass index
full-field digital mammography
hormone replacement therapy
receiver operating characteristic
This work was supported by National Cancer Institute grants R01 CA97396 and P50 CA116201.