Skip to main content


Genetic variation in stromal proteins decorin and lumican with breast cancer: investigations in two case-control studies



The stroma is the supportive framework of biologic tissue in the breast, consisting of various proteins such as the proteoglycans, decorin and lumican. Altered expression of decorin and lumican is associated with breast tumors. We hypothesized that genetic variation in the decorin (DCN) and lumican (LUM) genes may contribute to breast cancer.


We investigated associations of 14 common polymorphisms in the DCN and LUM genes with 798 breast cancer cases and 843 controls from Mayo Clinic, MN, USA. One polymorphism per gene with the strongest risk association in the Mayo Clinic sample was genotyped in 4,470 breast cancer cases and 4,560 controls from East Anglia, England (Studies of Epidemiology and Risk Factors in Cancer Heredity (SEARCH)).


In the Mayo Clinic sample, six polymorphisms were associated with breast cancer risk (Ptrend ≤ 0.05). The association with LUM rs2268578, evaluated further in SEARCH, was positive, although the odds ratios (OR) were weaker and not statistically significant. ORs were 1.4 (95% confidence interval [CI], 1.1 to 1.8) for heterozygotes and 2.2 (95% CI, 1.1 to 4.3; P2 df = 0.002) for homozygotes in the Mayo Clinic sample, and were 1.1 (95% CI, 0.9 to 1.2) for heterozygotes and 1.4 (95% CI, 1.0 to 2.1; P2 df = 0.13) for homozygotes in the SEARCH sample. In combined analyses, the ORs were 1.1 (95% CI, 1.0 to 1.2) for heterozygotes and 1.6 (95% CI, 1.2 to 2.3; P2 df = 0.005) for homozygotes. Positive associations for this polymorphism were observed for estrogen receptor-positive tumors in both the Mayo Clinic sample (OR for heterozygotes = 1.5, 1.1 to 1.9 and OR for homozygotes = 2.5, 1.2 to 5.3;P2 df = 0.001) and the SEARCH sample (OR for heterozygotes = 1.0, 0.9 to 1.1 and OR for homozygotes = 1.6, 1.0 to 2.5; P2 df = 0.10). In combined analyses, the ORs were 1.1 (95% CI, 0.9 to 1.2) for heterozygotes and 1.9 (95% CI, 1.3 to 2.8; P2 df = 0.001) for homozygotes.


Although LUM rs2268578 was associated with breast cancer in the Mayo Clinic study, particularly estrogen receptor-positive breast cancer, weaker and modest associations were observed in the SEARCH sample. These modest associations will require larger samples to adequately assess the importance of this polymorphism in breast cancer.


Stromal changes are well documented in breast tumors [1, 2] and in preinvasive breast lesions [2, 3], and are hypothesized to play a role in breast cancer. The stroma may lead to morphologic changes that manifest as tumors through a stromal reaction or perturbation of epithelium [4] or, conversely, may play an initial landscaping role in tumorigenesis independent of epithelial neoplastic alterations [5].

The stroma is the supportive framework of biologic tissue consisting of an extensive extracellular matrix that supports cells, separates tissues and regulates intercellular communication. The extracellular matrix is composed of different proteins: decorin and lumican are members of the small leucine-rich proteoglycan family of proteins and are involved in matrix assembly and structure, and in the control of cell proliferation [6]. Knockout mice deficient for either the decorin (DCN) gene or the lumican (LUM) gene are viable, but show skin fragility, marked reductions in tensile strength and loosely packed collagen fibers with abnormal sheath diameters [7, 8] – implicating the small leucine-rich proteoglycans as major regulators of collagen fibril assembly that probably play a role in the development of a barrier against cell penetration and infiltration of macromolecules [6].

Recent evidence supports an anti-oncogenic role for decorin. Injection of decorin protein into mammary carcinoma rodent models resulted in a marked reduction in both primary tumor growth and metastatic spread compared with animals injected with vehicle alone [9]. Low levels of decorin protein in invasive breast cancers have also been associated with larger tumor size, shortened duration to progression and poor outcome [10]. The role of lumican in carcinogenesis has been less well studied. Immunohistochemical analyses of breast tissue detected significantly higher lumican protein expression in tumors than in adjacent normal tissue, yet significantly lower decorin protein expression in tumors than in normal breast tissue [11]. Positive associations between decorin and lumican protein expression and mammographic density, a major risk factor for breast cancer, have also been observed [12]. These findings suggest that expression and/or activity of members of the small leucine-rich proteoglycan family may affect breast cancer risk.

Given the role of the stroma in breast cancer [1, 2], the occurrence of genetic alterations in the stroma of breast tumors [1315] and the putative contribution of decorin and lumican to this disease [10, 11], we hypothesized, a priori, that common genetic variation in the DCN and LUM genes is associated with breast cancer risk. We evaluated this hypothesis using data from a clinic-based case-control study of breast cancer, with follow-up in a large British case-control study.

Materials and methods

Mayo Clinic study design and population

The Mayo Clinic Breast Cancer study is an Institutional Review Board-approved, ongoing clinic-based case-control study initiated in February 2001 at Mayo Clinic, Rochester, MN, USA. The study design has been presented previously [16, 17]. Clinic attendance formed the sampling frame for Mayo Clinic cases and controls.

Consecutive cases were women aged 18 years or older with histologically confirmed primary invasive breast carcinoma who were recruited within 6 months of their date of diagnosis. Women with a history of cancer (excluding nonmelanoma skin cancer) were ineligible. Cases lived in the six-state region that defines Mayo Clinic's primary service population (Minnesota, Iowa, Wisconsin, Illinois, North Dakota and South Dakota). Although Mayo Clinic is widely perceived to be a specialty tertiary care facility, it also provides primary care for over 500,000 individuals per year.

Control individuals without prior history of cancer (other than nonmelanoma skin cancer) were frequency matched on age (5-year age category), race and six-state region of residence to cases. Controls were recruited from the outpatient practice of the Divisions of General Internal Medicine and Primary Care Internal Medicine at Mayo Clinic, where they were seen for routine medical examinations.

Written informed consent was obtained from all participants. Case participation was 69% and control participation was 71%. The present investigation genotyped Caucasian women (99% of study participants) enrolled up to 30 June 2005, representing 798 cases and 843 controls.

Both the cases and controls completed a self-administered questionnaire comprised of known or suspected breast cancer risk factors, and they provided blood samples from which genomic DNA was isolated using the Gentra AutoPure LS Purgene salting out methodology (Gentra, Minneapolis, MN, USA). Quantities of 250 ηg genomic DNA were adjusted to 50 ηg/μl before genotyping and were verified using the PicoGreen dsDNA quantitation kit (Molecular Probes, Inc., Eugene, OR, USA). The samples were bar coded to ensure accurate and reliable sample processing and storage.

SNP selection, genotyping and quality control

All SNPs in the DCN and LUM genes within 5 kb of the largest cDNA isoform (genome build 35) were selected from the Caucasian samples within the HapMap Consortium's release 21 [18]. We applied the ldSelect program [19] to bin SNPs with minor allele frequency (MAF) ≥ 0.05 and pairwise linkage disequilibrium threshold of r2 ≥ 0.80. tagSNPs were selected from these bins that met the criteria for predicted likelihood of successful genotyping using the Illumina GoldenGate Assay™ quality score metrics (Illumina Corporation, San Diego, CA, USA). We also included all putative functional SNPs (within 1 kb upstream, 5' UTR, 3' UTR or nonsynonymous) with MAF ≥ 0.05 identified in Ensembl version 34 (Ensembl, European Bioinformatics Institute/Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, UK). Eight SNPs in DCN (including six functional SNPs) and six SNPs in LUM (including three functional SNPs) were identified and examined in the Mayo Clinic study based on these two methods of selection.

The DCN and LUM SNPs were assayed at Illumina Corporation (San Diego, CA, USA) using the GoldenGate Assay™ on the Illumina BeadLab [2022] as part of a larger Mayo Clinic genetic association study. Successful genotyping was achieved for all 14 selected SNPs in the 798 cases and 843 controls. All but one of the 14 SNPs had MAF ≥ 0.05 among the Mayo controls (Table 1). Concordance between 100 duplicate samples was >99.99% for all assays.

Table 1 Genetic polymorphisms in the decorin (DCN) and lumican (LUM) genes and minor allele frequencies (MAFs) among 843 Caucasian controls, Mayo Clinic 2001 to 2005

SEARCH replication study population

Studies of Epidemiology and Risk Factors in Cancer Heredity (SEARCH), an ongoing population-based study of cases with invasive breast cancer ascertained through the Eastern Cancer Registration and Information Centre in England [23], was used as a second, independent study to evaluate significant breast cancer findings in the Mayo Clinic study. All women diagnosed after 1990 in the East Anglia region (median age = 51 years, range = 25 to 69 years) were eligible, with approximately 65% of eligible breast cancer cases enrolled. Unaffected female controls (median age = 65, range = 45 to 81 years) from the same geographic region were randomly selected from the European Prospective Investigation into Cancer and Nutrition -Norfolk component of the European Prospective Investigation into Cancer and Nutrition in East Anglia, with 41% participation. Over 98% of cases and controls were white Europeans. The SEARCH study has been used extensively to evaluate associations between breast cancer risk and SNPs in various genes [23] and as part of a genome-wide association study for breast cancer [16].

Evaluation of the Mayo Clinic findings for DCN rs3138165 and LUM rs2268578, the SNPs with the most significant associations with risk in each gene, was performed in the SEARCH study. These SNPs were selected for their strength of associations (point estimates and number of cases with two copies of the minor allele) with breast cancer risk to genotype in the SEARCH study comprising 4,470 cases and 4,560 controls with a 5' nuclease assay (Taqman®) using the ABI PRISM 7900 HT Sequence Detection System according to manufacturer's instructions (Applied Biosystems, Foster City, CA, USA). Primers and probes were supplied directly by Applied Biosystems [24] as Assays-by-Design™. Successful genotyping was achieved for 98.6% of DNA samples.

Statistical analysis

Genotypes from the Mayo Clinic cases and controls were used to estimate allele frequencies. Among control subjects only, the genotypes were compared with those expected under Hardy-Weinberg equilibrium using a Pearson goodness-of-fit test; no departures were found.

Pairwise linkage disequilibrium between SNPs was estimated with r2 values [25] using Haploview [26]. Individual SNP associations for breast cancer risk were assessed using unconditional logistic regression to estimate the odds ratios (ORs) and 95% confidence intervals (CIs). Analyses compared women with one copy and two copies of the minor allele with women with no copies using a two-degrees-of-freedom (2 df) model. We then assessed the dose-response effect of the SNP using an ordinal (log-additive) model.

Haplotype analyses were performed to identify whether the association with breast cancer risk could be informed further by phased combinations of alleles within each gene. Such combinations of alleles on a single chromosome can provide information about the possible presence of nearby breast cancer risk alleles that were not genotyped. Haplotype frequencies for each gene were estimated using all SNPs within the gene, and associations between individual haplotypes and breast cancer risk were evaluated compared with all other haplotypes combined. Haplotype frequencies for each gene were estimated using all SNPs within the gene, and a global haplotype score test of no association between haplotypes and breast cancer risk was evaluated at the gene level by the method proposed by Schaid and colleagues [27]. When the global haplotype score test suggested significance at the gene level, individual haplotype-specific associations for risk of breast cancer were compared with all other haplotypes combined.

In the SEARCH study, age-adjusted single-SNP associations were conducted using unconditional logistic regression under one-copy/two-copy and ordinal genetic models as described above.

In secondary analyses in both the Mayo Clinic and SEARCH studies, we stratified the data to evaluate the risk of breast cancer attributable to LUM rs2268578 in cases with estrogen receptor (ER)-positive and ER-negative tumors compared with control subjects because of recent reports of differential lumican protein expression by tumor ER status [10, 28].

Analyses were implemented using Haplo.stats [29], the SAS software system (version 8, 1999; SAS Institute, Cary, NC, USA) and the S-Plus software system (version 7.05, 2005; Insightful Corp., Seattle, WA, USA). Given prior hypotheses that SNPs in the DCN and LUM genes are associated with breast cancer risk, corrections for multiple testing were not performed. Two-tailed P ≤ 0.05 was considered statistically significant.


Eight tagSNPs in DCN representing 21 individual SNPs and six tagSNPs in LUM representing 15 individual SNPs were identified from the HapMap (Table 1 and Figure 1). Of eight tagSNPs genotyped in DCN, three were highly correlated (rs3138165, rs2070985 and rs13312816: pairwise r2 ≥ 0.98). This redundancy resulted from genotyping all putative functional SNPs in addition to the tagSNPs; only data from one DCN tagSNP (rs3138165) are therefore shown in subsequent analyses. Further, another DCN tagSNP (rs3138268) had MAF = 0 among control subjects and was excluded from further analyses.

Figure 1

Linkage disequilibrium plots for polymorphisms in the DCN and LUM genes. Linkage disequilibrium (LD) plots for polymorphisms in the Mayo Clinic sample, 2001 to 2005, for (a) the decorin (DCN) gene and (b) the lumican (LUM) gene. Shaded regions indicate strength of LD between pairwise combinations of polymorphisms (white, r2 = low LD; black, r2 = near-perfect LD). Numbers in squares indicate estimates of the pairwise r2. DCN SNP rs3138268 had minor allele frequency = 0 (absent in figure), and DCN SNPs rs3138165, rs2070985 and rs13312816 had pairwise r2 ≥ 0.98: only rs3138165 was included in the statistical analyses.

The Mayo Clinic cases and controls were adequately matched on age and region of residence, but differed in distribution by menopausal status, postmenopausal hormone use, age at menarche and smoking (Table 2). Covariates were evaluated for confounding in statistical models, and there was no appreciable difference in risk estimates of association when these covariates were excluded. The results for risk models are therefore presented adjusted for age and region of residence.

Table 2 Demographic, personal and lifestyle characteristics among 1,641 Caucasian breast cancer cases and controls, Mayo Clinic 2001 to 2005

Breast cancer risk analyses

In the individual SNP analyses, three SNPs in DCN and three SNPs in LUM were associated with breast cancer risk in the Mayo Clinic sample: results of the one-copy/two-copy genetic model and of the ordinal model are presented in Table 3. The ORs associated with each copy of the minor allele varied slightly for DCN rs7441 (OR = 1.3, 95% CI = 1.0 to 1.7; Ptrend = 0.05), DCN rs516115 (OR = 1.2, 95% CI = 1.0 to 1.4; Ptrend = 0.03) and DCN rs3138165 (OR = 1.3, 95% CI = 1.0 to 1.8; Ptrend = 0.03). In LUM, the associations with each copy of the minor allele were similar for LUM rs2268578 (OR = 1.4, 95% CI = 1.2 to 1.8; Ptrend = 0.0003), LUM rs10859110 (OR = 1.3, 95% CI = 1.1 to 1.5; Ptrend = 0.001) and LUM rs17018765 (OR = 1.3, 95% CI = 1.0 to 1.7; Ptrend = 0.04).

Table 3 Odds ratios (OR) and 95% confidence intervals (CI)a between polymorphisms in the decorin (DCN) and lumican (LUM) genes and breast cancer risk among 1,641 Caucasian subjectsb, Mayo Clinic 2001 to 2005

At the gene level, only LUM (P = 0.01) showed evidence of a significant association across haplotypes for breast cancer risk (Table 4). Of nine haplotypes observed in LUM, one six-SNP haplotype (haplotype 2a) – accounting for 71% of all estimated haplotypes – was associated with decreased breast cancer risk (P = 0.01), while two separate haplotypes with 6% (haplotype 2 h) and 7% (haplotype 2i) frequency were associated with increased risk (P ≤ 0.03).

Table 4 Gene-levela analysis of the decorin (DCN) and lumican (LUM) genes with breast cancer risk among 1,641 Caucasian subjectsb, Mayo Clinic 2001 to 2005

The DCN rs3138165 and LUM rs2268578 SNPs were genotyped in the SEARCH study. The association of DCN rs3138165 in the Mayo Clinic sample was not confirmed in the SEARCH study, whereas LUM rs2268578 was positively associated with breast cancer risk but the ORs attenuated and did not reach statistical significance (Table 5). Compared with women with no copies of the minor allele in LUM rs2268578, women with two copies showed the greatest risk for breast cancer in both the Mayo Clinic sample (OR = 2.2, 95% CI = 1.1 to 4.3; P2 df = 0.002) and the SEARCH study sample (OR = 1.4, 95% CI = 1.0 to 2.1; P2 df = 0.13). When the data from the two studies were pooled in age-adjusted and study-adjusted models, LUM rs2268578 was associated with increased risk among heterozygotes (OR = 1.1, 95% CI = 1.0 to 1.2) and homozygotes (OR = 1.6, 95% CI = 1.2 to 2.3; P2 df = 0.005) (Table 5). The corresponding per-minor allele risk was 1.1 (95% CI = 1.0 to 1.2; Ptrend = 0.004). These data suggest that LUM rs2268578 or a variant in strong linkage disequilibrium with rs2268578 may be a risk factor for breast cancer.

Table 5 Odds ratios (OR) and 95% confidence intervals (CI)a between polymorphisms in the decorin (DCN) and lumican (LUM) genes and breast cancer risk among 1,641 Caucasian subjects (Mayo Clinic 2001 to 2005) and 9,030 Caucasian subjects (SEARCH study 1990 2005)b

Estrogen receptor subgroup analyses

Women with two copies compared with no copies of the minor allele in LUM rs2268578 were at higher risk, compared with control individuals, of ER-positive breast cancer in both the Mayo Clinic study (OR = 2.5, 95% CI = 1.2 to 5.3; P2 df = 0.001) and the SEARCH study (OR = 1.6, 95% CI = 1.0 to 2.5; P2 df = 0.10) (Table 6). The per-minor allele risks were 1.5 (95% CI = 1.2 to 1.9; Ptrend = 0.0003) in the Mayo Clinic sample and 1.1 (95% CI = 0.9 to 1.2; Ptrend = 0.29) in the SEARCH sample. In pooled age-adjusted and study-adjusted analyses, LUM rs2268578 was associated with ER-positive tumors among heterozygotes (OR = 1.1, 95% CI = 0.9 to 1.2) and homozygotes (OR = 1.9, 95% CI = 1.3 to 2.8; P2 df = 0.001). The corresponding per-minor allele risk was 1.1 (95% CI = 1.0 to 1.3; Ptrend = 0.01).

Table 6 Odds ratios (OR) and 95% confidence intervals (CI)a between LUM SNP rs2268578 and breast cancer risk, stratified by tumor estrogen receptor status among 1,641 Caucasian subjects (Mayo Clinic 2001 to 2005) and 9,030 Caucasian subjects (SEARCH study 1990 to 2005)b

The number of ER-negative cases was small in both samples and associations with breast cancer were not evident (Table 6). These data may suggest that the associations observed from the main effects models in both studies (Table 5) are due to the findings from ER-positive tumors. Almost one-half of SEARCH study tumors and one-quarter of Mayo Clinic tumors, however, could not be classified – precluding a definitive interpretation of the results, particularly for ER-negative breast cancer.


We found a positive association between LUM rs2268578 and breast cancer risk in two large independent case-control studies, although the association in the SEARCH sample was attenuated and did not reach statistical significance. Two copies of the minor allele in LUM rs2268578 were associated with an average 60% increased risk of breast cancer compared with women with no copies, and the data suggest increased risk for ER-positive tumors. DCN rs3138165 was positively associated with breast cancer risk in the Mayo Clinic sample but the finding was not confirmed in the SEARCH sample. Haplotypes from each gene were also associated with breast cancer risk in the Mayo Clinic sample.

The weaker association of LUM rs2268578 with breast cancer risk in the SEARCH sample may be due to a lack of causal association of this SNP with breast cancer, or the results may be a more valid estimate of effect. It is not uncommon for replicated findings to report ORs for the variant homozygote that are weaker compared with the initial study's findings, a phenomenon known as winner's curse [30]. Furthermore, a small sample size can frequently result in insufficient power to detect minor contributions of one or more alleles [30] – as we observed with the results from the SEARCH sample.

In post hoc evaluation we also compared our results for LUM rs2268578 with those from the Cancer Genetic Markers of Susceptibility (CGEMS) genome-wide association study of breast cancer risk among 1,145 postmenopausal breast cancer cases and 1,142 controls of European ancestry from the Nurses' Health Study [31, 32]. When restricted to postmenopausal women, the associations with breast cancer risk were weaker but remained positive in the Mayo Clinic and SEARCH samples; however, in the CGEMS data, women with one copy (OR = 1.1) or two copies (OR = 0.9, P2 df = 0.52), compared with no copies, of the minor allele were not at risk of breast cancer despite similar MAF = 0.12 among controls. The CGEMS data did not report ER-positive or ER-negative results. At MAF = 0.12, if the true OR for homozygotes is in the range 1.1 to 1.5, then a much larger study than SEARCH or CGEMS would be needed to have sufficient power to replicate the association at either the conventional (P ≤ 0.05) or the genome-wide (P ≤ 10-7) level of significance. It should be noted that, even in the combined Mayo Clinic and SEARCH samples, there were only 108 cases and 75 controls homozygous for the minor allele. Comparable information from CGEMS is not available. It remains possible that the distribution of other exposures – for example, 60% of Mayo Clinic cases did not use postmenopausal hormones versus <30% in the Nurses' Health Study [33] – may also explain the differences in genetic associations with CGEMS.

The hypothesis that genetic variation in LUM is associated with breast cancer is based on a recent series of reports by Watson and colleagues of altered regulation of LUM in human breast tumors [10, 11]. Using in situ hybridization and western blot techniques, LUM mRNA levels were significantly higher (P < 0.0001) in stroma associated with breast carcinoma compared with stroma associated with adjacent normal tissue in the same woman [11]. In the present study, the observed risk associated with breast cancer from the LUM intronic SNP rs2268578 or from a SNP in strong linkage disequilibrium with rs2268578 may be consistent with increased protein expression of lumican in the study by Leygue and colleagues [11], if it represents a negative host response contributing to early tumor development through increased proteolysis or altered lumican deposition that precedes disorganized collagenous stroma [11]. Elucidation of the functional impact of the LUM SNP(s) is needed in order to provide insight into the effects on risk.

Associations with breast cancer risk for the DCN and LUM genes were observed with haplotype analyses in the Mayo Clinic sample. Of note, the three haplotypes in LUM that were associated with breast cancer risk had in common the G allele at position 2 (rs17714469) and a graduated change in the combination of alleles in the last three positions (rs2268578, rs10859110 and rs17018765) from GGA (haplotype 2a), associated with decreased risk, to AAA (haplotype 2h) and AAG (haplotype 2g), each associated with increased risk. It is possible that the association between breast cancer risk and these inferred haplotypes may be attributable to LUM rs2268578 or a SNP in strong linkage disequilibrium with rs2268578 because this single SNP was also associated with risk in both the Mayo Clinic and SEARCH samples. Further association testing in this chromosomal region – based on genotypes from a denser marker set – is required, however, in order to fully understand the nature of the relationship.

The increased risk of breast cancer from LUM rs2268578 may be due to the positive association with ER-positive breast tumors in both the Mayo Clinic and SEARCH samples, although the analyses were underpowered and were not based on a priori hypotheses. The association between lumican expression and ER-positive breast tumors is supported by the findings from two studies [10, 28]. Troup and colleagues [10] found that a greater number of ER-positive tumors (n = 99) compared with ER-negative tumors (n = 6) had lumican protein expression ≥ 25th percentile among 140 women with breast cancer (P = 0.002). Mackay and colleagues [28] evaluated gene microarray expression profiles in biopsies obtained from 34 women with primary ER-positive breast cancer before and after a 2-week intervention of aromatase inhibitor treatment. Among the 2,418 genes with the greatest variability in expression, the LUM gene was the most highly upregulated by a factor, on average, of 2.9-fold following aromatase inhibitor treatment, and the DCN gene was upregulated by a factor of 2.3. Collectively, these data suggest a potential mechanistic link between LUM expression and ER-positive tumors, which requires further investigation.

The strengths of the present study include the incorporation of a second, independent large sample of cases and controls to confirm initial findings. Our study populations were enrolled from defined regions, were of Caucasian ancestry and were less probably influenced by population stratification [34]. This does not necessarily, however, allow generalization of the observed association with breast cancer to other ethnic/racial populations [35]. One limitation is that the classification of tumors by ER status was not centrally reviewed and a large proportion of tumors could not be classified, hampering a strong conclusion of the association of the LUM SNP by ER status. Furthermore, it remains possible that the other variants in DCN and LUM that were not selected for genotyping in SEARCH may also influence breast cancer risk.


Although LUM rs2268578 was associated with breast cancer in the Mayo Clinic study, particularly ER-positive breast cancer, weaker and modest associations were observed in the SEARCH sample. Evaluation of this SNP in a larger study (such as the Breast Cancer Association Consortium) along with functional studies will be needed to adequately assess the importance of this SNP in breast cancer.



Cancer Genetic Markers of Susceptibility


decorin gene

2 df:

two degrees of freedom


estrogen receptor


lumican gene


minor allele frequency


Studies of Epidemiology and Risk Factors in Cancer Heredity


single nucleotide polymorphism


untranslated region.


  1. 1.

    Barcellos-Hoff MH, Ravani SA: Irradiated mammary gland stroma promotes the expression of tumorigenic potential by unirradiated epithelial cells. Cancer Res. 2000, 60: 1254-1260.

  2. 2.

    Brown LF, Guidi AJ, Schnitt SJ, Water Van De L, Iruela-Arispe ML, Yeo TK, Tognazzi K, Dvorak HF: Vascular stroma formation in carcinoma in situ, invasive carcinoma, and metastatic carcinoma of the breast. Clin Cancer Res. 1999, 5: 1041-1056.

  3. 3.

    Peyrol S, Raccurt M, Gerard F, Gleyzal C, Grimaud JA, Sommer P: Lysyl oxidase gene expression in the stromal reaction to in situ and invasive ductal breast carcinoma. Am J Pathol. 1997, 150: 497-507.

  4. 4.

    Ronnov-Jessen L, Petersen OW, Bissell MJ: Cellular changes involved in conversion of normal to malignant breast: importance of the stromal reaction. Physiol Rev. 1996, 76: 69-125.

  5. 5.

    Kinzler KW, Vogelstein B: Landscaping the cancer terrain. Science. 1998, 280: 1036-1037. 10.1126/science.280.5366.1036.

  6. 6.

    Hocking AM, Shinomura T, McQuillan DJ: Leucine-rich repeat glycoproteins of the extracellular matrix. Matrix Biol. 1998, 17: 1-19. 10.1016/S0945-053X(98)90121-4.

  7. 7.

    Danielson KG, Baribault H, Holmes DF, Graham H, Kadler KE, Iozzo RV: Targeted disruption of decorin leads to abnormal collagen fibril morphology and skin fragility. J Cell Biol. 1997, 136: 729-743. 10.1083/jcb.136.3.729.

  8. 8.

    Chakravarti S, Magnuson T, Lass JH, Jepsen KJ, LaMantia C, Carroll H: Lumican regulates collagen fibril assembly: skin fragility and corneal opacity in the absence of lumican. J Cell Biol. 1998, 141: 1277-1286. 10.1083/jcb.141.5.1277.

  9. 9.

    Reed CC, Waterhouse A, Kirby S, Kay P, Owens RT, McQuillan DJ, Iozzo RV: Decorin prevents metastatic spreading of breast cancer. Oncogene. 2005, 24: 1104-1110. 10.1038/sj.onc.1208329.

  10. 10.

    Troup S, Njue C, Kliewer EV, Parisien M, Roskelley C, Chakravarti S, Roughley PJ, Murphy LC, Watson PH: Reduced expression of the small leucine-rich proteoglycans, lumican, and decorin is associated with poor outcome in node-negative invasive breast cancer. Clin Cancer Res. 2003, 9: 207-214.

  11. 11.

    Leygue E, Snell L, Dotzlaw H, Troup S, Hiller-Hitchcock T, Murphy LC, Roughley PJ, Watson PH: Lumican and decorin are differentially expressed in human breast carcinoma. J Pathol. 2000, 192: 313-320. 10.1002/1096-9896(200011)192:3<313::AID-PATH694>3.0.CO;2-B.

  12. 12.

    Alowami S, Troup S, Al-Haddad S, Kirkpatrick I, Watson PH: Mammographic density is related to stroma and stromal proteoglycan expression. Breast Cancer Res. 2003, 5: R129-R135. 10.1186/bcr622.

  13. 13.

    Moinfar F, Man YG, Arnould L, Bratthauer GL, Ratschek M, Tavassoli FA: Concurrent and independent genetic alterations in the stromal and epithelial cells of mammary carcinoma: implications for tumorigenesis. Cancer Res. 2000, 60: 2562-2566.

  14. 14.

    Kurose K, Hoshaw-Woodard S, Adeyinka A, Lemeshow S, Watson PH, Eng C: Genetic model of multi-step breast carcinogenesis involving the epithelium and stroma: clues to tumour-microenvironment interactions. Hum Mol Genet. 2001, 10: 1907-1913. 10.1093/hmg/10.18.1907.

  15. 15.

    Kurose K, Gilley K, Matsumoto S, Watson PH, Zhou XP, Eng C: Frequent somatic mutations in PTEN and TP53 are mutually exclusive in the stroma of breast carcinomas. Nat Genet. 2002, 32: 355-357. 10.1038/ng1013.

  16. 16.

    Easton DF, Pooley KA, Dunning AM, Pharoah PD, Thompson D, Ballinger DG, Struewing JP, Morrison J, Field H, Luben R, Wareham N, Ahmed S, Healey CS, Bowman R, SEARCH collaborators, Meyer KB, Haiman CA, Kolonel LK, Henderson BE, Le Marchand L, Brennan P, Sangrajrang S, Gaborieau V, Odefrey F, Shen CY, Wu PE, Wang HC, Eccles D, Evans DG, Peto J, et al: Genome-wide association study identifies novel breast cancer susceptibility loci. Nature. 2007, 447: 1087-1093. 10.1038/nature05887.

  17. 17.

    Cox A, Dunning AM, Garcia-Closas M, Balasubramanian S, Reed MW, Pooley KA, Scollen S, Baynes C, Ponder BA, Chanock S, Lissowska J, Brinton L, Peplonska B, Southey MC, Hopper JL, McCredie MR, Giles GG, Fletcher O, Johnson N, dos Santos Silva I, Gibson L, Bojesen SE, Nordestgaard BG, Axelsson CK, Torres D, Hamann U, Justenhoven C, Brauch H, Chang-Claude J, Kropp S, et al: A common coding variant in CASP8 is associated with breast cancer risk. Nat Genet. 2007, 39: 352-358. 10.1038/ng1981.

  18. 18.

    The International HapMap Consortium: The International HapMap Project. Nature. 2003, 426: 789-796. 10.1038/nature02168.

  19. 19.

    Carlson CS, Eberle MA, Rieder MJ, Yi Q, Kruglyak L, Nickerson DA: Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium. Am J Hum Genet. 2004, 74: 106-120. 10.1086/381000.

  20. 20.

    Steemers FJ, Gunderson KL: Illumina, Inc. Pharmacogenomics. 2005, 6: 777-782. 10.2217/14622416.6.7.777.

  21. 21.

    Oliphant A, Barker DL, Stuelpnagel JR, Chee MS: BeadArray technology: enabling an accurate, cost-effective approach to high-throughput genotyping. Biotechniques. 2002, 56-61. Suppl

  22. 22.

    Fan JB, Hu SX, Craumer WC, Barker DL: BeadArray-based solutions for enabling the promise of pharmacogenomics. Biotechniques. 2005, 39: 583-588. 10.2144/000112047.

  23. 23.

    Pharoah PD, Tyrer J, Dunning AM, Easton DF, Ponder BA: Association between common variation in 120 candidate genes and breast cancer risk. PLoS Genet. 2007, 3: e42-10.1371/journal.pgen.0030042.

  24. 24.

    Applied Biosystems. []

  25. 25.

    Devlin B, Risch N: A comparison of linkage disequilibrium measures for fine-scale mapping. Genomics. 1995, 29: 311-322. 10.1006/geno.1995.9003.

  26. 26.

    Barrett JC, Fry B, Maller J, Daly MJ: Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics (Oxford, England). 2005, 21: 263-265. 10.1093/bioinformatics/bth457.

  27. 27.

    Schaid DJ, Rowland CM, Tines DE, Jacobson RM, Poland GA: Score tests for association between traits and haplotypes when linkage phase is ambiguous. Am J Hum Genet. 2002, 70: 425-434. 10.1086/338688.

  28. 28.

    Mackay AG, Urruticoechea A, Dixon JM, Dexter T, Fenwick K, Ashworth A, Drury S, Larionov A, Young O, White S, Miller WR, Evans DB, Dowsett M: Molecular response to aromatase inhibitor treatment in primary breast cancer. Breast Cancer Res. 2007, 9: R37-10.1186/bcr1732.

  29. 29.

    Schaid DJ: Mayo Clinic Research. []

  30. 30.

    Chanock SJ, Manolio T, Boehnke M, Boerwinkle E, Hunter DJ, Thomas G, Hirschhorn JN, Abecasis G, Altshuler D, Bailey-Wilson JE, Brooks LD, Cardon LR, Daly M, Donnelly P, Fraumeni JF, Freimer NB, Gerhard DS, Gunter C, Guttmacher AE, Guyer MS, Harris EL, Hoh J, Hoover R, Kong CA, Merikangas KR, Morton CC, Palmer LJ, Phimister EG, Rice JP, Roberts J, et al: Replicating genotype-phenotype associations. Nature. 2007, 447: 655-660. 10.1038/447655a.

  31. 31.

    Cancer Genetic Markers of Susceptibility. []

  32. 32.

    Hunter DJ, Kraft P, Jacobs KB, Cox DG, Yeager M, Hankinson SE, Wacholder S, Wang Z, Welch R, Hutchinson A, Wang J, Yu K, Chatterjee N, Orr N, Willett WC, Colditz GA, Ziegler RG, Berg CD, Buys SS, McCarty CA, Feigelson HS, Calle EE, Thun MJ, Hayes RB, Tucker M, Gerhard DS, Fraumeni JF, Hoover RN, Thomas G, Chanock SJ: A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer. Nat Genet. 2007, 39: 870-874. 10.1038/ng2075.

  33. 33.

    Eliassen AH, Missmer SA, Tworoger SS, Hankinson SE: Circulating 2-hydroxy- and 16α-hydroxy estrone levels and risk of breast cancer among postmenopausal women. Cancer Epidemiol Biomarkers Prev. 2008, 17: 2029-2035. 10.1158/1055-9965.EPI-08-0262.

  34. 34.

    Goode EL, Pharoah PDP, Dunning AM, Wareham NJ, Ponder BAJ, Easton D: No evidence for population substructure within the EPIC-Norfolk cohort [abstract]. AACR Meeting Abstracts. 2005, #4066: 959-

  35. 35.

    Ries LAG, Krapcho M, Mariotto A, Miller BA, Feuer EJ, Clegg L, Horner MJ, Howlader N, Eisner MP, Reichman M, Edwards BK, Eds: SEER Cancer Statistics Review, 1975–2004. Bethesda, MD: National Cancer Institute, []

Download references


The present project was supported by grants from the National Institutes of Health (NCI P50 CA116201 and NCI R01 CA122340). NCI P50 CA116201 is a Breast Specialized Program Of Research Excellence grant that provided funding for statistical analyses, interpretation of the data and writing of the manuscript. NCI R01 CA122340 also provided funding for the genotyping, statistical analyses, interpretation of the data and writing of the manuscript. SEARCH is funded through a program grant from Cancer Research UK. PDPP is a Cancer Research UK Senior Clinical Research Fellow. The decision to submit the manuscript was independent of the funding source.

Author information

Correspondence to Celine M Vachon.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

LEK drafted the manuscript. LEK, FJC and CMV revised the manuscript for intellectual content. CMV conceived the study hypothesis and selected the genes. FJC, VSP, ELG, DNR, XW, JRC and CMV designed the Mayo Clinic study and acquired data. ZSF, RAV, VSP and CGS performed the statistical analyses. FJC and CMV were responsible for funding for the Mayo Clinic Study. AMD, PDPP and DFE were responsible for study design and funding for, and SA for genotyping in, the SEARCH study. All authors contributed to data interpretation, and read and approved the final manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Rights and permissions

Reprints and Permissions

About this article


  • Breast Cancer
  • Breast Cancer Risk
  • Mayo Clinic
  • Breast Cancer Association Consortium
  • Aromatase Inhibitor Treatment