- Research article
- Open Access
Embryonic mammary signature subsets are activated in Brca1-/- and basal-like breast cancers
Breast Cancer Research volume 15, Article number: R25 (2013)
Cancer is often suggested to result from development gone awry. Links between normal embryonic development and cancer biology have been postulated, but no defined genetic basis has been established. We recently published the first transcriptomic analysis of embryonic mammary cell populations. Embryonic mammary epithelial cells are an immature progenitor cell population, lacking differentiation markers, which is reflected in their very distinct genetic profiles when compared with those of their postnatal descendents.
We defined an embryonic mammary epithelial signature that incorporates the most highly expressed genes from embryonic mammary epithelium when compared with the postnatal mammary epithelial cells. We looked for activation of the embryonic mammary epithelial signature in mouse mammary tumors that formed in mice in which Brca1 had been conditionally deleted from the mammary epithelium and in human breast cancers to determine whether any genetic links exist between embryonic mammary cells and breast cancers.
Small subsets of the embryonic mammary epithelial signature were consistently activated in mouse Brca1-/- tumors and human basal-like breast cancers, which encoded predominantly transcriptional regulators, cell-cycle, and actin cytoskeleton components. Other embryonic gene subsets were found activated in non-basal-like tumor subtypes and repressed in basal-like tumors, including regulators of neuronal differentiation, transcription, and cell biosynthesis. Several embryonic genes showed significant upregulation in estrogen receptor (ER)-negative, progesterone receptor (PR)-negative, and/or grade 3 breast cancers. Among them, the transcription factor, SOX11, a progenitor cell and lineage regulator of nonmammary cell types, is found highly expressed in some Brca1-/- mammary tumors. By using RNA interference to silence SOX11 expression in breast cancer cells, we found evidence that SOX11 regulates breast cancer cell proliferation and cell survival.
Specific subsets of embryonic mammary genes, rather than the entire embryonic development transcriptomic program, are activated in tumorigenesis. Genes involved in embryonic mammary development are consistently upregulated in some breast cancers and warrant further investigation, potentially in drug-discovery research endeavors.
The notion that some cancers may arise because of the reactivation of embryonic developmental programs was first proposed in the 19th century. Among the proponents of this idea was Rudolf Virchow, who recognized elements of embryonic development in cancers. Virchow coined the term "teratoma" to describe tumors containing differentiated elements of the three embryonic germ layers and also suggested that cancers arise from embryo-like cells . Lobstein and Cohnheim  also noted similarities between embryogenesis and the biology of cancer cells and put forward the hypothesis that tumorigenesis recapitulates aspects of development . During organ formation, cells proliferate, migrate, and invade into adjacent tissues to produce highly organized tissues, and these same cellular processes are used during carcinogenesis, which results in the formation of relatively organized populations of abnormal cells, which comprise tumors. Therefore, it has been suggested that some tumors arise from reactivation of embryonic developmental programs in postnatal tissues.
Two of the most common breast cancer-driver mutations, which confer clonal selective advantage on cancer cells and are causally implicated in oncogenesis, are found in GATA3 and TBX3, which are genes that have been shown to be required for embryonic mammary development [3–5]. Many other signaling pathways have also been implicated in both embryonic mammary morphogenesis and carcinogenesis, providing support for the contention that neoplastic and immature tissues share important similarities and that organ development and primary tumor formation are likely to be underpinned by common mechanisms . Newly identified cancer stem cells in skin, gut, and brain are very similar to healthy stem cells responsible for growing and renewing tissue in the body, highlighting the need for further understanding of the normal mammary progenitor cells and their potential links to cancer, as tumors may develop from progenitor-like cells from diverse stages of cellular differentiation [7–9].
Recently we completed a transcriptomic analysis of embryonic mouse mammary primordial cells, the first such study of separated embryonic mammary epithelial and mammary mesenchymal cell populations . These two cell populations interact in a complex, reciprocal manner as the mammary primordium forms during embryogenesis. Recent data from cell-lineage tracing studies suggest that embryonic mammary cells are the only cell populations that are truly multipotent in vivo . Embryonic mammary epithelial cells are an immature cell population, lacking differentiation markers, which is reflected in their very distinct genetic profiles when compared with those of their postnatal descendents .
In this study, we explored the hypothesis that reactivation of embryonic developmental programs in mature breast cells promotes tumor formation. We defined an embryonic mammary signature to incorporate the most highly expressed genes from the embryonic epithelium during organ formation when compared with the postnatal mammary epithelial cells and compared them with gene signatures of breast cancers. We found reactivation of small modules of embryonic mammary epithelial genes within mouse Brca1-/- tumors and human basal-like/triple-negative breast cancers. Many embryonic genes are activated across breast cancer datasets, and several are linked to clinical parameters, including hormone-receptor expression, subtype, and grade. We found that embryonic mammary signature activation in breast cancer samples is predictive of breast cancer patient outcome, suggesting clinical relevance. Our studies therefore provide new insights into the association of embryonic signature activation with clinical features of some breast cancers.
Materials and methods
Transcriptome analysis on normal mammary populations and tumor RNA profiled with Affymetrix 430 2.0 mouse gene-expression chips was as described [10, 12, 13]. The microarray data are available in ArrayExpress with accession numbers E-TABM-1099, E-TABM-683, E-TABM-684, and E-TABM-997. Raw Affymetrix.CEL files were normalized and summarized by robust multiarray analysis (RMA) by using the Affy package from BioConductor . Probe sets were used for a multiclass Significance Analysis of Microarrays (SAM) by using a local false-discovery rate of 5% to determine whether their mean expression was different across the three mammary epithelial cell (MEC) subpopulations and three embryonic mammary populations described [10, 13]. Probes are considered embryonic-enriched when they have a mean relative abundance of 10-fold or more when compared with the postnatal mammary epithelial samples.
With 799 probe sets shown to distinguish robustly between embryonic mammary epithelium and postnatal mammary cells, normal and tumor samples were clustered by using a Ward algorithm based on Pearson correlation distance. Human orthologues for 689 genes encoded by the 799-probe set were used to cluster human breast cancers in three datasets [15–17] based on Ward clustering with correlation distance. Breast cancer subtypes in the Natrajan  and NKI295  datasets were as defined by the research version of PAM50 classification ; PAM50 from Parker et al.  was used to describe subtypes in the UNC337 dataset . The 70-gene prognosis signature was used to classify tumors into poor or good prognosis on the basis of their risk of developing distant metastases within 5 years [15, 19].
We tested for presence of clusters and observed hierarchic clustering with two clusters to be the most suitable for our dataset. The agglomerative method of Ward hierarchic clustering, as implemented in the R-package pvclust , was used for subsequent analysis. Parameters were set to 10,000 bootstrap replicates, with relative sample sizes set from 0.5 to 1.4, incrementing in steps of 0.1 to determine AU (approximately unbiased) P values. Hypergeometric statistical analysis was used to demonstrate that enrichment of embryonic gene activation in mouse tumor and breast cancer datasets was significant.
We used proliferation signatures defined by Ben-Porath et al.  to designate tumor-associated embryonic genes as proliferative or not. Two additional proliferation signatures, defined, by Desmedt et al.  and Ghazoui et al. , provided a list of additional genes to exclude. For Spearman correlation, a cut-off was used to exclude all genes with an absolute correlation > 0.5 with proliferation genes.
From the embryonic and postnatal mammary gene signatures, centroids were defined for 37 genes comprising the nonproliferative embryonic gene signature. Centroid correlation was performed with the NKI295 dataset by using Spearman correlation. The nearest centroid was recorded for every sample, and those with correlation of < 0.1 were assigned to no correlation, whereas those with a correlation ≥ 0.1 were classified as "embryonic." Kaplan-Meier analysis and multivariate Cox proportional hazard regression analysis were carried out with the R survival package. The nonproliferative embryonic gene signature and tumor annotations were tested in models containing various combinations of tumor size, differentiation status, lymph node positivity, ER status, and 70-gene signature, as indicated.
Pathway and network analysis
Pathway analysis was performed by using functional annotation cluster analysis by using DAVID . An interaction network was generated within ROCK by using genes of interest and visualized by using ROCKscape . Initially, only interactions between selected genes were allowed; this was then extended by allowing one joining gene between two selected genes to form interactions where the genes were not interacting in the first phase.
Statistical analysis of embryonic mammary genes in tumors
For expression fold-change, genes were submitted in the ROCK resource  to identify significant changes in expression between specific groups of tumors. Only studies in which samples were run on the same chip and normalized in the same manner were included. An average fold-change of twofold or more (up or down) was considered a significant fold-change. Results were also verified by using (SAM) analysis tool in ROCK to determine significant changes of expression in subtypes, tumor type, and grade classification. Molecular subtypes were defined by PAM50 .
For survival curves, genes with significant expression changes were subjected to Kaplan-Meier plot survival calculation within the ROCK resource. Significant impact on survival was assumed if the χ2 P value was < 0.05, or its associated log2 rank P value was < 0.05.
All animal work was carried out under UK Home Office project and personal licenses after local ethical approval from The Institute of Cancer Research Ethics Committee and in accordance with local and national guidelines. Embryonic day 12.5 (E12.5) mammary primordia were manually microdissected, and tissue separations were performed as previously described .
Quantitative real-time polymerase chain reaction
Total RNA was extracted from purified populations of two to three independent biologic replicates by using Qiagen RNeasy Micro Plus kit (Qiagen, Hilden, Germany). cDNA synthesis of RNA was carried out by using Quantitect Reverse Transcription kit (Qiagen, Hilden, Germany) and run with TaqMan Array Assay-on-Demand probes (Applied Biosystems, Life Technologies Corporation, Carlsbad, CA, USA). Results were analyzed by using the Δ-ΔCt method normalized to Actb. Total RNA from tumor and mammary samples were reverse transcribed and linearly amplified by using the Ovation Amplification System V2 kit (NuGEN Technologies, San Carlos, CA, USA), as described previously, before Quantitative real-time polymerase chain reaction (qRT-PCR) analysis . The expressions of SOX11 in BT474 and BT549 cells were analyzed with qRT-PCR by using TaqMan Gene Expression Assay for SOX11, Hs00846583_s1 (Applied Biosystems, Life Technologies Corporation, Carlsbad, CA, USA) combined with FAM and normalized against β-actin, Hs99999903_m1, combined with VIC.
Immunohistochemistry and whole-mount immunofluorescence
Methods were as previously described [10, 26]. Antibodies are listed in Additional file 1A; Sox11 guinea pig antiserum is described ; and the specificity of this antibody was previously demonstrated . Transverse cryosections from the forelimb region of Sox11-/- embryos were used to demonstrate the specificity of the SOX11 mouse monoclonal antibody MRQ-58 from Cell Marque (Rocklin, CA, USA) in mouse tissue. Negative controls were performed for all antibodies by the omission of primary antibody. Expression at other sites (embryonic brain or skin) was used for positive controls. Representative micrographs of controls are shown in Additional file 1B, C.
SOX11 knockdown in breast cancer cells
BT474 and BT549 cells were transfected with 80 pmol of each SOX11 siRNA (siGENOME SMARTpool and four individual siRNAs), control nontargeting siRNA or Cyclophilin control siRNA (Thermo Scientific, Waltham, MA, USA) by using Lipofectamine 2000 (Invitrogen, Life Technologies Corporation, Carlsbad, CA, USA) in Opti-MEM (Gibco, Life Technologies Corporation, Carlsbad, CA, USA) media according to the manufacturer's instructions for 6 hours in a six-well plate, and then incubated with DMEM supplemented with 10% fetal bovine serum.
BT474 cells were lysed with RIPA buffer 72 hours after transfection and subjected to immunoblotting, as previously described . SOX11 expression was detected by using a rabbit monoclonal antibody (Epitomics, Burlingame, CA, USA, clone EPR8192); caspase-3 (R&D Systems, Minneapolis, MN, USA) and cleaved caspase-3 (Cell Signaling Technology, Danvers, MA, USA) were detected by using mouse monoclonal and rabbit polyclonal antibodies.
The 1 × 106 BT549 cells were transfected with 3 μg of either pCMV6-AC-GFP plasmid containing the sequence for a fusion protein between SOX11 and GFP (RG220681, Origene Rockville, MD, USA) or a control plasmid containing GFP, pIRES2-EGFP (Clontech, Mountain View, CA, USA), by nucleoporation by using the Amaxa Cell Line Nucleofector kit V (Lonza, Basel, Switzerland) with the T-024 program. The transfection efficiency was evaluated with flow cytometry.
At 48 hours after transfection, 3,000 BT474 cells or 1,000 BT549 cells were plated per well of a 96-well plate. Cell-growth rates were assessed 24, 48, and 72 hours later by incubating for 2 hours with PrestoBlue Cell Viability Reagent (Life Technologies, Carlsbad, CA, USA). The absorbance obtained at each time point was normalized to the absorbance at 0 hours. Statistical significance was determined by using a two-way ANOVA test followed by a Bonferroni post hoc test. The results at 72 hours are presented as the percentage of growth relative to the population transfected with the nontargeting siRNA. Statistical significance was determined by using a 1-way ANOVA test followed by a Bonferroni post hoc test.
Cell populations were trypsinized 48 hours after transfection with siRNAs and fixed in 70% ethanol overnight. After a 1-hour incubation with RNase A at 37°C, the cells were stained with 7AAD (eBioscience, San Diego, CA, USA) before they were subjected to FACs analysis by using a BD LSR II flow cytometer and analyzed with the FACSDiva software. Statistical significance was determined by using a one-way ANOVA test followed by a Bonferroni post hoc test.
Embryonic mammary epithelial cells are estrogen receptor (ER)-, progesterone receptor (PR)-, and express low levels////of Erbb2
Midgestation embryonic mammary bud epithelial (MBE) cells are ER-, PR- and express low to moderate levels of Erbb2 (Figure 1). Many MBE cells express high levels of basal keratins (Krt5, Krt14), Egfr, and all express p63 (Figure 1). MBE cells exhibit marker profiles similar to those used to describe the defining features of triple-negative and basal-like breast cancers and may use similar signaling pathways and networks to underpin key biologic properties of similar cell types found enriched within both populations.
Subsets of the embryonic mammary signatures are activated in Brca1-/- mouse tumors
We defined an embryonic mammary signature based on expression profiles of genes found highly expressed within midgestation (E12.5-stage) embryonic epithelium compared with postnatal mammary epithelial cells described in Additional file 2[10, 13]. This signature is distinct from the fetal mammary stem cell signature recently defined by Spike et al. , which profiled subpopulations of late-gestation (E18.5-stage) mammary cells. Only 12 genes (1.4%) are shared between the two embryonic signatures, which are both defined by enriched expression in embryonic versus postnatal mammary cell populations (see Additional file 2).
Next, we interrogated the embryonic-enriched mammary epithelial signature expression in mammary tumors that formed in mouse strains in which Brca1 had been deleted in either mammary epithelial luminal progenitors (Blg-Cre Brca1f/f p53+/-) or in basal cells, including basal stem cells (K14-Cre Brca1f/f p53+/-) , to determine whether the embryonic signature is activated in a validated mouse model of triple-negative breast cancer . Small subsets of the embryonic epithelial signature (123 of 689 genes (18%)) were activated in Brca1-/- mouse tumors when the embryonic epithelial signature was used for hierarchic cluster analysis (Figure 2A, B and Additional file 3).
Subsets of the embryonic mammary signatures are activated in human breast cancers
Because only subsets of the embryonic mammary signature, and not the entire developmental program, appear activated in mouse tumors, we sought to define the genes shared between the embryonic signature and breast cancers across multiple datasets. We reasoned that this strategy should result in the identification of embryonic mammary genes consistently activated in breast cancers that are not normally highly expressed by postnatal mammary epithelial cells.
First, we compared the embryonic mammary epithelial signature with those of human breast cancers by using expression arrays from a dataset of 48 grade III ductal carcinomas that were microdissected so that at least 90% of the sample contained tumor cells . The embryonic and tumor datasets profiled microdissected tissues and reflected gene signatures present in highly purified epithelial cell populations isolated from intact tissues. One cluster of 30 embryonic mammary epithelial genes, enriched for regulators of transcription and actin cytoskeleton organization (see Additional File 4), was found to be activated predominantly in ER-negative breast cancers, including all 13 basal-like tumors, all five HER2-positive tumors, and four (13%) of 30 Luminal B tumors (Figure 3A, B). Another small basal-like tumor-associated subset was composed of genes encoding cell-cycle and microtubule cytoskeleton components, suggesting significant overlap with proliferation signatures, a general hallmark of poor-prognosis breast cancers  (see Additional file 4). The embryonic mammary epithelium displays a relatively low proliferation index at E12.5, but Ki67+ epithelial cells can be detected at this stage (Figure 3C). Three other subsets of the embryonic mammary epithelial signature are activated in many non-basal-like tumor types (Figure 3B). One cluster activated predominantly in luminal tumors and repressed in most basal-like tumors consists of genes regulating neuron-projection development (Additional file 4). Two other clusters are activated in some luminal and HER2+ tumors and are enriched for genes involved in embryonic appendage morphogenesis, ossification, regionalization, negative regulation of macromolecule synthesis, and wound response (Additional file 4). The stability of the gene clusters was assessed with pvClust (see Additional file 5). Of 57 genes activated in basal-like breast cancers, 55 are found in one of the two major clusters, which have robustness indices larger than 95%. Network analysis suggests complex genetic regulatory potential, and interacting associations exist between the proteins encoded by embryonic genes found activated and repressed in breast cancers (Figure 3D).
We also compared the embryonic mammary epithelial signature with two additional breast cancer datasets, the UNC337 dataset [17, 33] and NKI295 dataset . Distinct subsets of the embryonic mammary epithelial signature were shown to be activated in breast cancers; many were similar to those observed in the Natrajan dataset (see Additional files 6, 7, and 8). Five genes are found activated in the mouse Brca1-/- tumor dataset and the three breast cancer datasets, predominantly in basal-like cancers; statistical analysis indicated significant enrichment of these genes (see Additional file 9). These included two transcription factors, Bcl11a and Sox11, and three other genes: B3gnt5, Ptdss1, and Tpx2. Fifty-seven genes activated in at least two of four tumor datasets displayed enrichment of cell-cycle components (Additional file 3). When 18 proliferation/cell-cycle-associated genes (from signatures described ) were removed, 39 remaining genes showed enrichment for embryonic morphogenesis, suggesting that tumor-associated genes mediate proliferation and processes associated with embryonic development in basal-like cancers (Additional file 9). Fifty genes found activated predominantly in non-basal-like types of breast cancers were enriched for neuronal projection/differentiation and ossification, suggesting potential links to regulation of cellular processes regulating bone and nerve development in other breast cancer subtypes (Additional file 10).
Many embryonic mammary signature components, including ASPM, CDCA2, and KIF20A, are highly correlated with established proliferation genes, such as KIF23 (58%) and TPX2 (69%) in the Natrajan dataset and with TOP2A (39%), MKI67 (36%), and Ki67 protein expression (28%) in the Ghazoui et al. dataset  (Additional file 10). We defined a 37-gene nonproliferative embryonic mammary signature by excluding two genes found present within two additional published proliferation signatures [22, 23] (see Additional file 11). When used in hierarchic cluster analysis, this gene list resulted in robust clustering of basal-like and non-basal-like cancers in the Natrajan dataset. In addition, in the UNC337 and NKI295 datasets, stable basal-like clusters were observed (see Additional file 12). Different single-sample predictors (SSPs) were used to classify the breast cancer subtypes in the original publications. Given the differences in the classification of breast cancers into the molecular subtypes by means of SSPs [18, 34], we retrieved the research version of the PAM50 classification for the Natrajan dataset  and NKI295 dataset  from  and PAM50 classification of the UNC337 dataset from . Expression levels of the embryonic gene signature were shown to be highest in basal-like breast cancers compared with the other breast cancer subtypes (Figure 3E). Enrichment for the 37-gene nonproliferative embryonic signature was correlated with reduced-distance metastasis-free survival, larger tumor size, and the 70-gene signature used for prognostication of breast cancer patients [15, 19] in the NKI295 dataset (Figure 3F; Additional file 13).
Given that many cancer cells undergo some degree of epithelial-mesenchymal transition (EMT), we also defined an embryonic mammary mesenchymal signature based on expression profiles of genes found highly expressed within embryonic mammary mesenchymal tissue compared with postnatal mammary cells (see Additional file 14). We found that a large percentage (62%) of the mesenchymal genes are components of the embryonic mammary epithelial signature, consistent with these epithelial cells undergoing morphogenesis and harboring some inherent mesenchymal-like traits. Of the overlapping mesenchymal genes, 25 were found in the 37-gene tumor-associated embryonic epithelial signature, and could be considered candidate regulators of EMT in breast cancers (see Additional file 15).
We next defined a tumor-associated mesenchymal signature. We used the criterion of genes found to be activated in basal-like cancers of at least two of four datasets, and we removed genes that overlapped with the epithelial signature. The final embryonic mesenchymal signature would represent transcriptomic features unique to the embryonic stroma. Several of these strictly mesenchymal signature components (TGFBI, TWIST2, ZEB2) have established links to EMT [35–37]. Enrichment for the 172-gene mammary mesenchymal signature was correlated with large tumor size and the 70-gene prognostic signature [15, 19] in the NKI295 dataset (Additional file 15). No significant association with overall survival was observed in patients whose breast cancers showed activation of the embryonic mesenchymal signature (see Additional file 16).
BCL11A, SOX11, and TPX2 showed consistent upregulation at an average of twofold or greater in ER- breast cancers across datasets (Figure 4A, B; Additional file 17) [16, 38–43]. SOX11 and TPX2 showed consistent upregulation of twofold or greater in PR- breast cancers across datasets (Figure 4C; Additional file 17) [16, 39, 41–46]. SOX11 levels were consistently twofold higher in HER2+ versus HER2- samples across datasets [16, 40, 42, 47–49] (Figure 4D and Additional file 18). SOX11 levels were higher in basal-like and HER2+ breast cancers compared with other subtypes (Figure 5E). BCL11A levels were consistently higher in basal-like breast cancers compared with other subtypes (Figure 4E). Both SOX11 and TPX2 showed a trend of increased expression levels with increasing tumor grade, whereas BCL11A did not (Figure 4F; Additional file 19). B3GNT5 levels tended to be higher in both ER-negative and PR-negative tumors. No significant association of PTDSS1 with ER- , PR-, HER2- status, or histologic grade was found.
Several of the 52 genes found highly expressed in at least two tumor datasets showed consistent trends in expression within tumor subtypes. UCHL1 is generally found expressed at higher levels in basal-like tumors than the other breast cancer subtypes (Figure 4E). Many cell-cycle-associated genes (ASPM, CENPE, FAM60A, TPX2, TRIP13, KIF11, KIF20A) were expressed at the highest levels in basal-like tumors followed by HER2+, LumB, Normal, and LumA (Figure 4E). Similar trends for the cell-cycle-associated genes (ASPM, CENPE, TPX2, TRIP13, KIF11, KIF20A) were observed with their distribution in different-grade tumors, with higher expression levels observed as tumor grade increased (Figure 4F). Patients with breast cancers expressing higher levels of SOX11 showed worse overall survival than did those with tumors expressing lower levels (Figure 4G). A trend exists for reduced distant metastasis-free survival in patients with breast cancers expressing higher levels of SOX11, but is not statistically significant.
Tumor-associated embryonic mammary transcriptional regulators are expressed in invasive Brca1-/- mammary tumor cells
We analyzed expression of four embryonic mammary signature components that encode transcription factors in normal mammary tissues and tumors. Bcl11a and Sox11 were expressed at approximately 20-fold and 100-fold greater levels, respectively, in the embryonic mammary epithelium when compared with postnatal mammary epithelial cell (MEC) populations when assayed with qRT-PCR (Figure 5A). Expression was also detected in RNA isolated from Brca1-/- mouse mammary tumors: Bcl11a was detected in seven of eight tumors, and Sox11 was detected in two of eight tumors (see Additional file 20). Grhl3 and Prox1 were expressed at 10-fold or more in the embryonic mammary epithelium when compared with postnatal MEC expression levels (Figure 5A) and were expressed in some Brca1-/- tumors when profiled by qRT-PCR (Additional file 20). Sox11 expression is predominantly observed in epidermal cells of the E12.5-stage mammary bud (Figure 5B). Weak expression of Sox11 is detected in postnatal MECs (Figure 5C). Nuclear Sox11 expression is observed in two of eight Brca1-/- tumors, with highest levels of expression observed at the tumor-invasion front adjacent to normal tissue (Figure 5D through 5G). We conclude that several signature components identified in our cancer dataset analysis are highly embryonic enriched, expressed at sites of active tissue remodeling in vivo during embryonic mammary development and in many Brca1-/- mammary tumors.
SOX11 knockdown and overexpression in breast cancer cells
We carried out loss-of-function assays to study further the role of SOX11 in BT474 and BT549 invasive breast cancer cells, which express relatively high (BT474) and low levels (BT549) of SOX11 (see Additional file 21). The results indicated that SOX11 knockdown significantly impaired the viability and proliferation of both cell types (Figure 6A-C and Additional file 21). BT549 cells transiently transfected with pCMV6-AC-SOX11-GFP exhibited higher proliferation rates than did BT549 cells transiently transfected with a control GFP-expressing plasmid (Additional file 21). SOX11 knockdown in BT474 cells increased levels of cleaved caspase-3, a marker for apoptosis (Figure 6D). A significant reduction in cells in G2/M phase was observed in cells transfected with both the SOX11 SMARTpool and siRNA16, but not with the siRNA15, which exhibits the largest change in cell viability and largest increase in cleaved caspase-3 levels on SOX11 knockdown (Figure 6B through 6E).
Embryonic mammary epithelium represents the least differentiated mammary cells. Tumor-associated embryonic mammary epithelial gene activation may therefore reflect tumors containing a large proportion of less-differentiated cells. Differentiation status, as defined by histologic grade, is a clinically relevant aspect of breast tumors . Undifferentiated tumors generally have a much worse prognosis than do more-differentiated tumors . A small component of the embryonic-specific mammary signature appears activated in mouse Brca1-/- tumors and in approximately 80% of human basal-like breast cancers in the datasets we examined. It is unclear whether they express these programs for the same reasons or if their expression in basal-like/triple-negative breast cancers is due to genetic aberrations they harbor.
Many of the most common breast cancer driver mutations, which confer survival advantage to breast cancer cells and are implicated in causing cancers to form, are found in genes that are also highly expressed by prenatal breast cells. We have established this by comparing gene-expression profiles of embryonic mammary tissues  with recent mutational analyses obtained through deep sequencing of breast cancers . Aspects of embryonic genetic programs with relevance to cancer have also been suggested because "embryonic stem cell-like" (ESC) signatures are found activated in many cancers, including aggressive breast cancers . However, most of these signatures show a very strong correlation with levels of proliferation-related genes [32, 51, 52]. Although we observe correlation with proliferation for embryonic mammary signature components after removing proliferation-associated genes, we still observed clustering into basal-like and non-basal breast cancers, suggesting that the embryonic gene activation is also mediating other cellular processes.
Four transcription factors (Bcl11a, Grhl3, Prox1, Sox11) activated in Brca1-/- mouse tumors and basal-like human breast cancers across multiple datasets were chosen for validation studies, and all were confirmed to be embryonic-enriched and highly expressed by some tumors. All four genes have links to progenitor-cell regulation. GRHL3 collaborates with Trithorax group members to activate the epidermal progenitor differentiation program . Prox1 has been identified as a suppressor of hematopoietic stem cell activity , primary mediator of lymphangiogenesis , and promotes maintenance of intermediate neural progenitors during adult neurogenesis . BCL11A is expressed in lymphohematopoietic cells, controls the development of B- and T-lymphocytes, and is a common site of retroviral integration in myeloid leukemia [57, 58]. Two somatic mutations in BCL11A have been reported in breast cancer . Sox11, a high-mobility-group transcription factor, has a widespread role for in tissue remodeling in multiple organs  and regulates neurogenesis [61, 62]. Activated SOX11 expression has been described in Wilms tumor , a classic example of an embryonic tumor, often characterized by retention of embryonic cellular structures within the tumor-bearing kidney . SOX11 plays pivotal roles in lymphoblastic neoplasms, mantle cell lymphoma, and Burkitt lymphoma . Both BCL11A and SOX11 belong to the top 20 transcriptional regulators that correlate with the core ES signature found activated in aggressive breast cancers .
Antibody staining found Sox11 highly expressed at the invasion front in some Brca1-/- mammary tumors. SOX11 has been identified as mesenchymal stem cell (MSC) characteristic gene  and potential biomarker for early progenitor human MSCs . Knockdown of SOX11 suppressed the self-renewal capacity and differentiation potential of multiple MSC lines  and MSCs isolated from bone marrow aspirates . In mice, Sox11 is required for proliferation of the sympathetic ganglia during early developmental stages . We found that silencing of SOX11 in breast cancer cells led to an increased expression of the apoptotic marker, cleaved caspase-3. SOX11-deficient cell populations that showed moderate decreases in viability also exhibited moderate increases in cleaved caspase-3 levels and decreased percentages of cells in G2/M phase, whereas no change in the G2/M percentage was observed in the least viable SOX11-deficient cell population that displayed the highest increase in cleaved caspase-3 levels. These results suggest that a more efficient SOX11 knockdown could lead to more rapid apoptosis when compared with cells with moderately reduced SOX11 levels, which may possibly undergo prolonged cell-cycle arrest before subsequent apoptosis. A number of studies have found that Sox11 is also required for survival of neural cells and mesenchymal progenitor cells [67–69]. We found that silencing of SOX11 in breast cancer cells reduces cell survival and cell viability, and SOX11 overexpression leads to increased proliferation rates, suggesting that SOX11 could have a similar function in regulation of proliferation and survival in several types of cells. High levels of SOX11 expression are associated with poor overall survival in breast cancer patients, but its function in breast epithelial cells is not clear and remains to be further investigated.
A study by Spike et al. , found evidence of molecular similarity of subpopulations of E18.5-stage mammary cells to breast cancers. In that study , cells from mammary primordia were separated into subpopulations based on expression of cell-surface markers to enrich for stem cells. The signature used in our study represents the entire embryonic mammary epithelial organogenetic program, because it is derived from gene-expression profiles of intact epithelial tissues. Lineage-tracing studies have shown that embryonic mammary bud epithelial cells labeled at midgestation (E12.5-stage) onward give rise to both basal and luminal lineages [70, 71]. Therefore, our embryonic signature will include progenitor/stem cells as well as other cells within their native microenvironment. Tumors are composed of multiple cell types, and some behavioral features are similar to organotypic growth . Distinctions in both the developmental stages (E12.5 versus E18.5 stage) and biologic features of the cell populations (tissues versus fractionated subpopulations of dissociated cells) that were profiled in the two studies are likely to account for the limited overlap between the signatures. Only one gene (Bcl11a) from the 37-gene signature defined here is shared with one of the tumor-associated subsets defined in the study by Spike et al. .
Our results reveal a small number of genes associated with embryonic mammary development and human basal-like breast cancers. Although this lends support to the notion that reactivation of components of the mammary organogenetic program has detrimental effects in postnatal MECs, our results suggest that only a small fraction of the early (E12.5-stage) embryonic mammary developmental program is highly expressed or reactivated in breast tumors. A substantial component of the tumor-associated embryonic epithelial signature comprises genes regulating cell proliferation. This is somewhat unexpected, because the embryonic mammary epithelium exhibits a low proliferation index [73, 74]. However, several cell-cycle-associated genes are associated with its signature, and may regulate proliferation of particular progenitor cells as the immature mammary cell population expands. One tumor-associated embryonic gene, ASPM, regulates symmetric versus asymmetric cell divisions in progenitor cells and also regulates WNT signaling in the developing brain [75, 76].
We documented activation of embryonic genes in mammary tumors from mice in which Brca1-/- was inactivated in either luminal progenitor cells or basal cells . These observations suggest that it is loss of Brca1 and not the cell of origin that may be dictating the embryonic gene signature expression. These Brca1-/- mice were Tp53+/-; hence, it is possible that loss of p53 function is also contributing to the observed embryonic gene activation in the Brca1-/- tumors. p53 has been shown to regulate polarity of cell division in mammary stem cells, and loss of p53 appears to promote symmetric divisions of cancer stem cells, contributing to tumor growth .
A limited subset of the early mammary developmental program is likely to have a role in promoting tumorigenesis, but its association with some human breast tumors and patient outcome warrants further investigation. We have identified a small network of embryonic genes that are found highly expressed in a subset of basal-like breast cancers and are candidate regulators of cancer cells. These results provide support for the notion that overactivation of small particular aspects of the embryonic mammary genetic program could play a key role in regulating detrimental cellular behaviour, such as tissue remodeling, invasive growth, and/or progenitor cell expansion. Expression of particular embryonic mammary markers within tumor cells may reflect reactivation of genetic programs that influence the behavior of immature cell types present within the breast and may elicit cell behavior associated with embryonic cells, such as a less-differentiated, highly plastic state. Tumor-associated embryonic mammary markers may have value to be exploited as they could represent a novel means to describe and categorize the biologic state of tumor cell populations for use in breast cancer classification as well as potential drug targets.
embryonic stem cell
mammary bud epithelium
mammary epithelial cell
mesenchymal stem cell
David H: Rudolf Virchow and modern aspects of tumor pathology. Pathol Res Pract. 1988, 183: 356-364. 10.1016/S0344-0338(88)80138-9.
Rather J: The Genesis of Cancer: a Study in the History of Ideas. 1978, Baltimore, MD: Johns Hopkins University Press
Stephens PJ, Tarpey PS, Davies H, Van Loo P, Greenman C, Wedge DC, Nik-Zainal S, Martin S, Varela I, Bignell GR, Yates LR, Papaemmanuil E, Beare D, Butler A, Cheverton A, Gamble J, Hinton J, Jia M, Jayakumar A, Jones D, Latimer C, Lau KW, McLaren S, McBride DJ, Menzies A, Mudie L, Raine K, Rad R, Chapman MS, Teague J, et al: The landscape of cancer genes and mutational processes in breast cancer. Nature. 2012, 486: 400-404.
Asselin-Labat ML, Sutherland KD, Barker H, Thomas R, Shackleton M, Forrest NC, Hartley L, Robb L, Grosveld FG, van der Wees J, Lindeman GJ, Visvader JE: Gata-3 is an essential regulator of mammary-gland morphogenesis and luminal-cell differentiation. Nat Cell Biol. 2007, 9: 201-209. 10.1038/ncb1530.
Davenport TG, Jerome-Majewska LA, Papaioannou VE: Mammary gland, limb and yolk sac defects in mice lacking Tbx3, the gene mutated in human ulnar mammary syndrome. Development. 2003, 130: 2263-2273. 10.1242/dev.00431.
Howard B, Ashworth A: Signalling pathways implicated in early mammary gland morphogenesis and breast cancer. PLoS Genet. 2006, 2: e112-10.1371/journal.pgen.0020112.
Driessens G, Beck B, Caauwe A, Simons BD, Blanpain C: Defining the mode of tumour growth by clonal analysis. Nature. 2012, 488: 527-530. 10.1038/nature11344.
Schepers AG, Snippert HJ, Stange DE, van den Born M, van Es JH, van de Wetering M, Clevers H: Lineage tracing reveals Lgr5+ stem cell activity in mouse intestinal adenomas. Science. 2012, 337: 730-735. 10.1126/science.1224676.
Chen J, Li Y, Yu TS, McKay RM, Burns DK, Kernie SG, Parada LF: A restricted cell population propagates glioblastoma growth after chemotherapy. Nature. 2012, 488: 522-526. 10.1038/nature11287.
Wansbury O, Mackay A, Kogata N, Mitsopoulos C, Kendrick H, Davidson K, Ruhrberg C, Reis-Filho JS, Smalley MJ, Zvelebil M, Howard BA: Transcriptome analysis of embryonic mammary cells reveals insights into mammary lineage establishment. Breast Cancer Res. 2011, 13: R79-10.1186/bcr2928.
Van Keymeulen A, Rocha AS, Ousset M, Beck B, Bouvencourt G, Rock J, Sharma N, Dekoninck S, Blanpain C: Distinct stem cells contribute to mammary gland development and maintenance. Nature. 2011, 479: 189-193. 10.1038/nature10573.
Molyneux G, Geyer FC, Magnay FA, McCarthy A, Kendrick H, Natrajan R, Mackay A, Grigoriadis A, Tutt A, Ashworth A, Reis-Filho JS, Smalley MJ: BRCA1 basal-like breast cancers originate from luminal epithelial progenitors and not from basal stem cells. Cell Stem Cell. 2010, 7: 403-417. 10.1016/j.stem.2010.07.010.
Kendrick H, Regan JL, Magnay FA, Grigoriadis A, Mitsopoulos C, Zvelebil M, Smalley MJ: Transcriptome analysis of mammary epithelial subpopulations identifies novel determinants of lineage commitment and cell fate. BMC Genomics. 2008, 9: 591-10.1186/1471-2164-9-591.
Gentleman RC, Carey VJ, Bates DM, Bolstad B, Dettling M, Dudoit S, Ellis B, Gautier L, Ge Y, Gentry J, Hornik K, Hothorn T, Huber W, Iacus S, Irizarry R, Leisch F, Li C, Maechler M, Rossini AJ, Sawitzki G, Smith C, Smyth G, Tierney L, Yang JY, Zhang J: Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 2004, 5: R80-10.1186/gb-2004-5-10-r80.
van de Vijver MJ, He YD, van't Veer LJ, Dai H, Hart AA, Voskuil DW, Schreiber GJ, Peterse JL, Roberts C, Marton MJ, Parrish M, Atsma D, Witteveen A, Glas A, Delahaye L, van der Velde T, Bartelink H, Rodenhuis S, Rutgers ET, Friend SH, Bernards R: A gene-expression signature as a predictor of survival in breast cancer. N Engl J Med. 2002, 347: 1999-2009. 10.1056/NEJMoa021967.
Natrajan R, Weigelt B, Mackay A, Geyer FC, Grigoriadis A, Tan DS, Jones C, Lord CJ, Vatcheva R, Rodriguez-Pinilla SM, Palacios J, Ashworth A, Reis-Filho JS: An integrative genomic and transcriptomic analysis reveals molecular pathways and networks regulated by copy number aberrations in basal-like, HER2 and luminal cancers. Breast Cancer Res Treat. 2010, 121: 575-589. 10.1007/s10549-009-0501-3.
Parker JS, Mullins M, Cheang MC, Leung S, Voduc D, Vickery T, Davies S, Fauron C, He X, Hu Z, Quackenbush JF, Stijleman IJ, Palazzo J, Marron JS, Nobel AB, Mardis E, Nielsen TO, Ellis MJ, Perou CM, Bernard PS: Supervised risk predictor of breast cancer based on intrinsic subtypes. J Clin Oncol. 2009, 27: 1160-1167. 10.1200/JCO.2008.18.1370.
Weigelt B, Mackay A, A'Hern R, Natrajan R, Tan DS, Dowsett M, Ashworth A, Reis-Filho JS: Breast cancer molecular profiling with single sample predictors: a retrospective analysis. Lancet Oncol. 2010, 11: 339-349. 10.1016/S1470-2045(10)70008-5.
van 't Veer LJ, Dai H, van de Vijver MJ, He YD, Hart AA, Mao M, Peterse HL, van der Kooy K, Marton MJ, Witteveen AT, Schreiber GJ, Kerkhoven RM, Roberts C, Linsley PS, Bernards R, Friend SH: Gene expression profiling predicts clinical outcome of breast cancer. Nature. 2002, 415: 530-536. 10.1038/415530a.
Suzuki R, Shimodaira H: Pvclust: an R package for assessing the uncertainty in hierarchical clustering. Bioinformatics. 2006, 22: 1540-1542. 10.1093/bioinformatics/btl117.
Ben-Porath I, Thomson MW, Carey VJ, Ge R, Bell GW, Regev A, Weinberg RA: An embryonic stem cell-like gene expression signature in poorly differentiated aggressive human tumors. Nat Genet. 2008, 40: 499-507. 10.1038/ng.127.
Desmedt C, Haibe-Kains B, Wirapati P, Buyse M, Larsimont D, Bontempi G, Delorenzi M, Piccart M, Sotiriou C: Biological processes associated with breast cancer clinical outcome depend on the molecular subtypes. Clin Cancer Res. 2008, 14: 5158-5165. 10.1158/1078-0432.CCR-07-4756.
Ghazoui Z, Buffa FM, Dunbier AK, Anderson H, Dexter T, Detre S, Salter J, Smith IE, Harris AL, Dowsett M: Close and stable relationship between proliferation and a hypoxia metagene in aromatase inhibitor-treated ER-positive breast cancer. Clin Cancer Res. 2011, 17: 3005-3012. 10.1158/1078-0432.CCR-10-1704.
Dennis G, Sherman BT, Hosack DA, Yang J, Gao W, Lane HC, Lempicki RA: DAVID: database for annotation, visualization, and integrated discovery. Genome Biol. 2003, 4: P3-10.1186/gb-2003-4-5-p3.
Sims D, Bursteinas B, Gao Q, Jain E, MacKay A, Mitsopoulos C, Zvelebil M: ROCK: a breast cancer functional genomics resource. Breast Cancer Res Treat. 2010, 124: 567-572. 10.1007/s10549-010-0945-5.
Panchal H, Wansbury O, Howard BA: Embryonic mammary anlagen analysis using immunolabelling of whole mounts. Methods Mol Biol. 585: 261-270.
Hoser M, Potzner MR, Koch JM, Bosl MR, Wegner M, Sock E: Sox12 deletion in the mouse reveals nonreciprocal redundancy with the related Sox4 and Sox11 transcription factors. Mol Cell Biol. 2008, 28: 4675-4687. 10.1128/MCB.00338-08.
Potzner MR, Tsarovina K, Binder E, Penzo-Mendez A, Lefebvre V, Rohrer H, Wegner M, Sock E: Sequential requirement of Sox4 and Sox11 during development of the sympathetic nervous system. Development. 2010, 137: 775-784. 10.1242/dev.042101.
Oliemuller E, Pelaez R, Garasa S, Pajares MJ, Agorreta J, Pio R, Montuenga LM, Teijeira A, Llanos S, Rouzaut A: Phosphorylated tubulin adaptor protein CRMP-2 as prognostic marker and candidate therapeutic target for NSCLC. Int J Cancer. 2012, 132: 1986-1995.
Spike BT, Engle DD, Lin JC, Cheung SK, La J, Wahl GM: A mammary stem cell population identified and characterized in late embryogenesis reveals similarities to human breast cancer. Cell Stem Cell. 2012, 10: 183-197. 10.1016/j.stem.2011.12.018.
McCarthy A, Savage K, Gabriel A, Naceur C, Reis-Filho JS, Ashworth A: A mouse model of basal-like breast carcinoma with metaplastic elements. J Pathol. 2007, 211: 389-398. 10.1002/path.2124.
Reis-Filho JS, Pusztai L: Gene expression profiling in breast cancer: classification, prognostication, and prediction. Lancet. 2011, 378: 1812-1823. 10.1016/S0140-6736(11)61539-0.
Prat A, Parker JS, Karginova O, Fan C, Livasy C, Herschkowitz JI, He X, Perou CM: Phenotypic and molecular characterization of the claudin-low intrinsic subtype of breast cancer. Breast Cancer Res. 2010, 12: R68-10.1186/bcr2635.
Haibe-Kains B, Desmedt C, Loi S, Culhane AC, Bontempi G, Quackenbush J, Sotiriou C: A three-gene model to robustly identify breast cancer molecular subtypes. J Natl Cancer Inst. 2012, 104: 311-325. 10.1093/jnci/djr545.
Ansieau S, Bastid J, Doreau A, Morel AP, Bouchet BP, Thomas C, Fauvet F, Puisieux I, Doglioni C, Piccinin S, Maestro R, Voeltzel T, Selmi A, Valsesia-Wittmann S, Caron de Fromentel C, Puisieux A: Induction of EMT by twist proteins as a collateral effect of tumor-promoting inactivation of premature senescence. Cancer Cell. 2008, 14: 79-89. 10.1016/j.ccr.2008.06.005.
Miettinen PJ, Ebner R, Lopez AR, Derynck R: TGF-beta induced transdifferentiation of mammary epithelial cells to mesenchymal cells: involvement of type I receptors. J Cell Biol. 1994, 127: 2021-2036. 10.1083/jcb.127.6.2021.
Vandewalle C, Comijn J, De Craene B, Vermassen P, Bruyneel E, Andersen H, Tulchinsky E, Van Roy F, Berx G: SIP1/ZEB2 induces EMT by repressing genes of different epithelial cell-cell junctions. Nucleic Acids Res. 2005, 33: 6566-6578. 10.1093/nar/gki965.
Desmedt C, Piette F, Loi S, Wang Y, Lallemand F, Haibe-Kains B, Viale G, Delorenzi M, Zhang Y, d'Assignies MS, Bergh J, Lidereau R, Ellis P, Harris AL, Klijn JG, Foekens JA, Cardoso F, Piccart MJ, Buyse M, Sotiriou C, TRANSBIG Consortium: Strong time dependence of the 76-gene prognostic signature for node-negative breast cancer patients in the TRANSBIG multicenter independent validation series. Clin Cancer Res. 2007, 13: 3207-3214. 10.1158/1078-0432.CCR-06-2765.
Farmer P, Bonnefoi H, Becette V, Tubiana-Hulin M, Fumoleau P, Larsimont D, Macgrogan G, Bergh J, Cameron D, Goldstein D, Duss S, Nicoulaz AL, Brisken C, Fiche M, Delorenzi M, Iggo R: Identification of molecular apocrine breast tumours by microarray analysis. Oncogene. 2005, 24: 4660-4671. 10.1038/sj.onc.1208561.
Lu X, Wang ZC, Iglehart JD, Zhang X, Richardson AL: Predicting features of breast cancer with gene expression patterns. Breast Cancer Res Treat. 2008, 108: 191-201. 10.1007/s10549-007-9596-6.
Miller LD, Smeds J, George J, Vega VB, Vergara L, Ploner A, Pawitan Y, Hall P, Klaar S, Liu ET, Bergh J: An expression signature for p53 status in human breast cancer predicts mutation status, transcriptional effects, and patient survival. Proc Natl Acad Sci USA. 2005, 102: 13550-13555. 10.1073/pnas.0506230102.
Popovici V, Chen W, Gallas BG, Hatzis C, Shi W, Samuelson FW, Nikolsky Y, Tsyganova M, Ishkin A, Nikolskaya T, Hess KR, Valero V, Booser D, Delorenzi M, Hortobagyi GN, Shi L, Symmans WF, Pusztai L: Effect of training-sample size and classification difficulty on the accuracy of genomic predictors. Breast Cancer Res. 2010, 12: R5-10.1186/bcr2468.
Chin K, DeVries S, Fridlyand J, Spellman PT, Roydasgupta R, Kuo WL, Lapuk A, Neve RM, Qian Z, Ryder T, Chen F, Feiler H, Tokuyasu T, Kingsley C, Dairkee S, Meng Z, Chew K, Pinkel D, Jain A, Ljung BM, Esserman L, Albertson DG, Waldman FM, Gray JW: Genomic and transcriptional aberrations linked to breast cancer pathophysiologies. Cancer Cell. 2006, 10: 529-541. 10.1016/j.ccr.2006.10.009.
Pawitan Y, Bjöhle J, Amler L, Borg AL, Egyhazi S, Hall P, Han X, Holmberg L, Huang F, Klaar S, Liu ET, Miller L, Nordgren H, Ploner A, Sandelin K, Shaw PM, Smeds J, Skoog L, Wedrén S, Bergh J: Gene expression profiling spares early breast cancer patients from adjuvant therapy: derived and validated in two population-based cohorts. Breast Cancer Res. 2005, 7: R953-964. 10.1186/bcr1325.
Richardson AL, Wang ZC, De Nicolo A, Lu X, Brown M, Miron A, Liao X, Iglehart JD, Livingston DM, Ganesan S: X chromosomal abnormalities in basal-like human breast cancer. Cancer Cell. 2006, 9: 121-132. 10.1016/j.ccr.2006.01.013.
Minn AJ, Gupta GP, Siegel PM, Bos PD, Shu W, Giri DD, Viale A, Olshen AB, Gerald WL, Massague J: Genes that mediate breast cancer metastasis to lung. Nature. 2005, 436: 518-524. 10.1038/nature03799.
Boersma BJ, Reimers M, Yi M, Ludwig JA, Luke BT, Stephens RM, Yfantis HG, Lee DH, Weinstein JN, Ambs S: A stromal gene signature associated with inflammatory breast cancer. Int J Cancer. 2008, 122: 1324-1332.
Finak G, Bertos N, Pepin F, Sadekova S, Souleimanova M, Zhao H, Chen H, Omeroglu G, Meterissian S, Omeroglu A, Hallett M, Park M: Stromal gene expression predicts clinical outcome in breast cancer. Nat Med. 2008, 14: 518-527. 10.1038/nm1764.
Hess KR, Anderson K, Symmans WF, Valero V, Ibrahim N, Mejia JA, Booser D, Theriault RL, Buzdar AU, Dempsey PJ, Rouzier R, Sneige N, Ross JS, Vidaurre T, Gómez HL, Hortobagyi GN, Pusztai L: Pharmacogenomic predictor of sensitivity to preoperative chemotherapy with paclitaxel and fluorouracil, doxorubicin, and cyclophosphamide in breast cancer. J Clin Oncol. 2006, 24: 4236-4244. 10.1200/JCO.2006.05.6861.
Rakha EA, Reis-Filho JS, Baehner F, Dabbs DJ, Decker T, Eusebi V, Fox SB, Ichihara S, Jacquemier J, Lakhani SR, Palacios J, Richardson AL, Schnitt SJ, Schmitt FC, Tan PH, Tse GM, Badve S, Ellis IO: Breast cancer prognostic classification in the molecular era: the role of histological grade. Breast Cancer Res. 2010, 12: 207-
Wirapati P, Sotiriou C, Kunkel S, Farmer P, Pradervand S, Haibe-Kains B, Desmedt C, Ignatiadis M, Sengstag T, Schütz F, Goldstein DR, Piccart M, Delorenzi M: Meta-analysis of gene expression profiles in breast cancer: toward a unified understanding of breast cancer subtyping and prognosis signatures. Breast Cancer Res. 2008, 10: R65-10.1186/bcr2124.
Sotiriou C, Pusztai L: Gene-expression signatures in breast cancer. N Engl J Med. 2009, 360: 790-800. 10.1056/NEJMra0801289.
Hopkin AS, Gordon W, Klein RH, Espitia F, Daily K, Zeller M, Baldi P, Andersen B: GRHL3/GET1 and Trithorax group members collaborate to activate the epidermal progenitor differentiation program. PLoS Genet. 2012, 8: e1002829-10.1371/journal.pgen.1002829.
Hope KJ, Cellot S, Ting SB, MacRae T, Mayotte N, Iscove NN, Sauvageau G: An RNAi screen identifies Msi2 and Prox1 as having opposite roles in the regulation of hematopoietic stem cell activity. Cell Stem Cell. 2010, 7: 101-113. 10.1016/j.stem.2010.06.007.
Wigle JT, Oliver G: Prox1 function is required for the development of the murine lymphatic system. Cell. 1999, 98: 769-778. 10.1016/S0092-8674(00)81511-1.
Lavado A, Lagutin OV, Chow LM, Baker SJ, Oliver G: Prox1 is required for granule cell maturation and intermediate progenitor maintenance during brain neurogenesis. PLoS Biol. 2010, 8: e1000460-10.1371/journal.pbio.1000460.
Nakamura T, Yamazaki Y, Saiki Y, Moriyama M, Largaespada DA, Jenkins NA, Copeland NG: Evi9 encodes a novel zinc finger protein that physically interacts with BCL6, a known human B-cell proto-oncogene product. Mol Cell Biol. 2000, 20: 3178-3186. 10.1128/MCB.20.9.3178-3186.2000.
Liu P, Keller JR, Ortiz M, Tessarollo L, Rachel RA, Nakamura T, Jenkins NA, Copeland NG: Bcl11a is essential for normal lymphoid development. Nat Immunol. 2003, 4: 525-532. 10.1038/ni925.
Wood LD, Parsons DW, Jones S, Lin J, Sjöblom T, Leary RJ, Shen D, Boca SM, Barber T, Ptak J, Silliman N, Szabo S, Dezso Z, Ustyanksky V, Nikolskaya T, Nikolsky Y, Karchin R, Wilson PA, Kaminker JS, Zhang Z, Croshaw R, Willis J, Dawson D, Shipitsin M, Willson JK, Sukumar S, Polyak K, Park BH, Pethiyagoda CL, Pant PV, et al: The genomic landscapes of human breast and colorectal cancers. Science. 2007, 318: 1108-1113. 10.1126/science.1145720.
Sock E, Rettig SD, Enderich J, Bosl MR, Tamm ER, Wegner M: Gene targeting reveals a widespread role for the high-mobility-group transcription factor Sox11 in tissue remodeling. Mol Cell Biol. 2004, 24: 6635-6644. 10.1128/MCB.24.15.6635-6644.2004.
Haslinger A, Schwarz TJ, Covic M, Chichung Lie D: Expression of Sox11 in adult neurogenic niches suggests a stage-specific role in adult neurogenesis. Eur J Neurosci. 2009, 29: 2103-2114.
Bergsland M, Ramskold D, Zaouter C, Klum S, Sandberg R, Muhr J: Sequentially acting Sox transcription factors in neural lineage development. Genes Dev. 2011, 25: 2453-2464. 10.1101/gad.176008.111.
Aiden AP, Rivera MN, Rheinbay E, Ku M, Coffman EJ, Truong TT, Vargas SO, Lander ES, Haber DA, Bernstein BE: Wilms tumor chromatin profiles highlight stem cell properties and a renal developmental network. Cell Stem Cell. 2010, 6: 591-602. 10.1016/j.stem.2010.03.016.
Dictor M, Ek S, Sundberg M, Warenholt J, Gyorgy C, Sernbo S, Gustavsson E, Abu-Alsoud W, Wadstrom T, Borrebaeck C: Strong lymphoid nuclear expression of SOX11 transcription factor defines lymphoblastic neoplasms, mantle cell lymphoma and Burkitt's lymphoma. Haematologica. 2009, 94: 1563-1568. 10.3324/haematol.2009.008474.
Kubo H, Shimizu M, Taya Y, Kawamoto T, Michida M, Kaneko E, Igarashi A, Nishimura M, Segoshi K, Shimazu Y, Tsuji K, Aoba T, Kato Y: Identification of mesenchymal stem cell (MSC)-transcription factors by microarray and knockdown analyses, and signature molecule-marked MSC in bone marrow by immunohistochemistry. Genes Cells. 2009, 14: 407-424. 10.1111/j.1365-2443.2009.01281.x.
Larson BL, Ylostalo J, Lee RH, Gregory C, Prockop DJ: Sox11 is expressed in early progenitor human multipotent stromal cells and decreases with extensive expansion of the cells. Tissue Eng Part A. 2010, 16: 3385-3394. 10.1089/ten.tea.2010.0085.
Jankowski MP, Cornuet PK, McIlwrath S, Koerber HR, Albers KM: SRY-box containing gene 11 (Sox11) transcription factor is required for neuron survival and neurite growth. Neuroscience. 2006, 143: 501-514. 10.1016/j.neuroscience.2006.09.010.
Bhattaram P, Penzo-Mendez A, Sock E, Colmenares C, Kaneko KJ, Vassilev A, Depamphilis ML, Wegner M, Lefebvre V: Organogenesis relies on SoxC transcription factors for the survival of neural and mesenchymal progenitors. Nat Commun. 2010, 1: 9-
Thein DC, Thalhammer JM, Hartwig AC, Crenshaw EB, Lefebvre V, Wegner M, Sock E: The closely related transcription factors Sox4 and Sox11 function as survival factors during spinal cord development. J Neurochem. 2010, 115: 131-141. 10.1111/j.1471-4159.2010.06910.x.
Van Keymeulen A, Rocha AS, Ousset M, Beck B, Bouvencourt G, Rock J, Sharma N, Dekoninck S, Blanpain C: Distinct stem cells contribute to mammary gland development and maintenance. Nature. 2011, 479: 189-193. 10.1038/nature10573.
van Amerongen R, Bowman AN, Nusse R: Developmental stage. Cell Stem Cell. 2012, 11: 387-400. 10.1016/j.stem.2012.05.023.
Egeblad M, Nakasone ES, Werb Z: Tumors as organs: complex tissues that interface with the entire organism. Dev Cell. 2010, 18: 884-901. 10.1016/j.devcel.2010.05.012.
Balinsky B: On the developmental processes in mammary glands and other epidermal structures. Trans R Soc Edinburgh. 1950, 62: 1-31.
Balinsky BI: On the prenatal growth of the mammary gland rudiment in the mouse. J Anat. 1950, 84: 227-235.
Fish JL, Kosodo Y, Enard W, Paabo S, Huttner WB: Aspm specifically maintains symmetric proliferative divisions of neuroepithelial cells. Proc Natl Acad Sci USA. 2006, 103: 10438-10443. 10.1073/pnas.0604066103.
Buchman JJ, Durak O, Tsai LH: ASPM regulates Wnt signaling pathway activity in the developing brain. Genes Dev. 2011, 25: 1909-1914. 10.1101/gad.16830211.
Cicalese A, Bonizzi G, Pasi CE, Faretta M, Ronzoni S, Giulini B, Brisken C, Minucci S, Di Fiore PP, Pelicci PG: The tumor suppressor p53 regulates polarity of self-renewing divisions in mammary stem cells. Cell. 2009, 138: 1083-1095. 10.1016/j.cell.2009.06.048.
Dy P, Penzo-Mendez A, Wang H, Pedraza CE, Macklin WB, Lefebvre V: The three SoxC proteins, Sox4, Sox11, and Sox12, exhibit overlapping expression patterns and molecular properties. Nucleic Acids Res. 2008, 36: 3101-3117. 10.1093/nar/gkn162.
We thank the Breakthrough Histopathology Facility for their assistance and Dr Elisabeth Sock for providing the Sox11 antibody raised in guinea pig and Sox11-/- tissue sections. This work was funded by Breakthrough Breast Cancer. We acknowledge NHS funding to the NIHR Biomedical Research Centre.
The authors declare that they have no competing interests.
BAH conceived of and designed the study and wrote the manuscript. MZ, QG, and AM carried out analyses. MJS and JSR-F provided guidance and samples and participated in the preparation of the manuscript. EO, OW, and HK performed the experimental work. All authors read and approved the manuscript for publication.
Electronic supplementary material
Additional file 1: . (A) Table gives details of antibodies used in this study. (B) Positive control for Sox11 (guinea-pig antiserum) staining of E12.5-stage forebrain. (C) No primary antibody control for Sox11 (guinea-pig antiserum) staining of E12.5-stage mammary primordium. Scale bar, 50 μm. (PDF 3 MB)
Additional file 2: . Table of embryonic mammary epithelial signature based on the expression profiles of genes found highly expressed (10-fold or greater) within mammary bud epithelial cells when compared with postnatal mammary epithelial cells and functional annotation clustering of the embryonic mammary epithelial gene signature. Genes shared by this signature and the "uniquely fetal mammary stem cell" signature defined by Spike et al.  are indicated. (XLS 690 KB)
Additional file 3: Brca1-/- tumors. Table shows embryonic genes found activated in mouse Brca1-/- tumors and functional-annotation clustering. Functional-analysis clustering lists the category of gene set (for example, CC, cellular location; BP, biologic process; MF, molecular function); term (that is, specific gene ontology (GO) with GO number); count (number of genes enriching term); % (percentage of total of genes that belong to category enriched by analyzed gene set); P value (that is, enrichment of gene set); genes (list of genes enriching gene set by Affymetrix ID); Bonferroni; Benjamini, and FDR (false discovery rate) for functional annotation clustering of genes expressed in tumor-associated gene modules defined by cluster analysis. (XLS 50 KB)
Additional file 4: +, or luminal breast cancer subtypes in Natrajan data set. Functional-analysis clustering lists the category of gene set (CC, cellular location; BP, biologic process; MF, molecular function); term (specific gene ontology (GO) with GO number); count (number of genes enriching term); % (percentage of total of genes that belong to category enriched by analyzed gene set); P value (enrichment of gene set); genes (list of genes enriching gene set by Affymetrix ID); Bonferroni; Benjamini, and FDR (false discovery rate) for functional-annotation clustering of genes expressed in tumor-associated gene modules defined by cluster analysis. (XLS 61 KB)
Cluster-stability analysis of the hierarchic clustering of the embryonic mammary signature in breast cancer datasets by using the R-package pvclust
Additional file 5: . Figure shows stability analysis with Approximately Unbiased (AU) P value (shown in green) larger than 95% highlighted by rectangles and strongly supported by data. (A) Cluster-stability analysis of the hierarchic clustering of the embryonic mammary signature in the Natrajan breast cancer samples. Of the 57 basal-like genes, 55 are in the left cluster, and the two major clusters are significantly different. (B) Cluster-stability analysis of the hierarchic clustering of the embryonic mammary signature in the UNC337 breast cancer samples. (C) Cluster-stability analysis of the hierarchic clustering of the embryonic mammary signature in the NKI295 breast cancer samples. (PDF 300 KB)
Similar embryonic epithelial mammary signature subsets are activated across multiple human breast cancer datasets
Additional file 6: . (A, B) Five embryonic gene clusters activated in UNC337 dataset by using unsupervised hierarchic clustering and functional annotation. Tumor subtypes were defined by PAM50, as described . (C, D) Four embryonic gene clusters activated in NKI295 dataset by using unsupervised hierarchic clustering and functional annotation. Subtypes were as defined by the research version of PAM50 classification . The 70-gene prognosis signature was used to classify tumors as to whether tumors are likely to predictive of a short interval to distant metastases (poor) or not (good) [15, 19]. (TIFF 2 MB)
Additional file 7: +, luminal or normal breast cancer subtypes in UNC337 data set. Functional-analysis clustering lists the category of gene set (CC, cellular location; BP, biologic process; MF, molecular function); term (specific gene ontology (GO) with GO number); count (number of genes enriching term); % (percentage of total of genes that belong to category enriched by analyzed gene set); P value (enrichment of gene set); genes (list of genes enriching gene set by Affymetrix ID); Bonferroni; Benjamini, and FDR (false discovery rate) for functional annotation clustering of genes expressed in tumor-associated gene modules defined by cluster analysis. (XLS 56 KB)
Additional file 8: +, luminal, or normal breast cancer subtypes in NKI295 data set. Functional-analysis clustering lists the category of gene set (CC, cellular location; BP, biologic process; MF, molecular function); term (specific gene ontology (GO) with GO number); count (number of genes enriching term); % (percentage of total of genes that belong to category enriched by analyzed gene set); P value (enrichment of gene set); genes (list of genes enriching gene set by Affymetrix ID); Bonferroni; Benjamini, and FDR (false discovery rate) for functional annotation clustering of genes expressed in tumor-associated gene modules defined by cluster analysis. (XLS 96 KB)
Additional file 9: Embryonic genes activated in mouse Brca1-/- tumors and basal-like breast cancers, in at least two of four datasets examined and their functional annotation and hypergeometric statistical analysis. (XLS 98 KB)
Additional file 10: Embryonic genes activated in non-basal-like breast cancer subtypes in at least two of three breast cancer datasets examined and their functional annotation. (XLS 74 KB)
Correlation tests of embryonic genes with proliferation genes used to define nonproliferative embryonic mammary gene signature with proliferation genes,
Additional file 11: t tests of average expression of embryonic genes in tumor subtypes with different SSPs and centroid analysis of nonproliferative gene signature in NKI295 dataset. (A) Pearson correlation of KIF23 with embryonic gene signature in Natrajan dataset. (B) Pearson correlation of TPX2 with embryonic gene signature in Natrajan dataset. (C) Pearson correlation of TOP2A with embryonic gene signature in FAIMos dataset. (D) Pearson correlation of MKI67 with embryonic gene signature in FAIMos dataset. (E) Spearman correlation of Ki67 protein expression with embryonic gene signature in FAIMos dataset. (F) Nonproliferative embryonic mammary gene signature. (XLS 59 KB)
Cluster-stability analysis of the hierarchic clustering of the embryonic mammary signature in breast cancer datasets by using the R-package pvclust
Additional file 12: . (A) Cluster-stability analysis; 55 of the 57 basal-like genes are in the left cluster, and the two major clusters are significantly different. (B) Cluster-stability analysis of the hierarchic clustering of the nonproliferative embryonic mammary signature in the UNC337 breast cancer samples. (C) Cluster-stability analysis of the hierarchic clustering of the nonproliferative embryonic mammary signature in the NKI295 breast cancer samples. (PDF 279 KB)
Additional file 13: t tests of average expression of embryonic genes in tumor subtypes by using different SSPs and centroid analysis of nonproliferative gene signature. (A) Summary of array-expression data for the 37-gene embryonic mammary epithelial signature used to define centroids. (B) Median gene expression of 37 genes comprising embryonic mammary epithelial gene signature. (C) Definition of centroids for embryonic mammary epithelial signature. (D) Centroid correlation with NKI295 dataset. (E) Multivariate Cox Proportional Hazard Regression analysis. (XLS 221 KB)
Additional file 14: Embryonic mammary mesenchymal signature and functional annotation clustering. Table of embryonic mammary epithelial signature based on the expression profiles of genes found highly expressed (10-fold or greater) within mammary mesenchymal cells when compared with postnatal mammary epithelial cells and functional annotation-cluster analysis of the embryonic mammary mesenchymal gene signature. (XLS 818 KB)
Additional file 15: Embryonic mesenchymal genes activated in mouse Brca1-/- tumors and basal-like breast cancers. These are from at least two of four datasets examined and Multivariate Cox Proportional Hazard Regression analysis of 172-gene uniquely mesenchymal signature. (XLS 256 KB)
Additional file 16: . (A) Kaplan-Meier analysis shows a trend toward reduced overall survival in patients with tumors with activation of embryonic mesenchymal signature (172 genes) in the van de Vijver dataset  (χ2 P value = 0.066, log-rank P value = 0.07368). (B) Box plots showing the average expression levels of the mesenchymal 172-gene signature in the breast cancer subtypes classified by using PAM50 SSP on the NKI295 dataset. (TIFF 176 KB)
Fold-change expression levels of core network components activated across independent tumor datasets
Additional file 17: . The average in ER- versus ER+ breast cancers; PR- versus PR+ breast cancers; and HER2- versus HER+ breast cancers, including associated P values. (XLS 26 KB)
Additional file 18: Expression of core network of tumor-associated embryonic genes in HER2+ versus HER2- breast cancers in six datasets. (TIFF 576 KB)
Additional file 19: Significance analysis of microarray analysis of the expression of core network of tumor-associated embryonic genes according to tumor grade and tumor subtype. (PDF 948 KB)
Additional file 20: Brca1-/- tumors. (A) qRT-PCR analysis of four tumor-associated transcription factors in Brca1-/- mouse mammary tumors. (B) IHC showing SOX11 expression (Cell Marque MRQ-58) within embryonic mammary primordium. (C) No primary antibody control for SOX11 (Cell Marque MRQ-58). (D) IHC showing low level of SOX11 expression (Cell Marque MRQ-58) within 10-week-old postnatal mammary gland. (E) Positive control showing SOX11 expression (Cell Marque MRQ-58) in E12.5-stage forebrain. (F) Control showing SOX11 expression (Cell Marque MRQ-58) in E16.5-stage Sox11-/- spinal cord. (G through J) IHC showing SOX11 expression (Cell Marque MRQ-58) in some, but not all, Brca1-/- tumors. Scale bar, 50 μm. (JPEG 5 MB)
Additional file 21: SOX11 knockdown on cell viability of breast cancer cells. ( A) qRT-PCR analysis of SOX11 levels in BT549 cells transfected with either SOX11 SMARTpool or control siRNAs. (B) SOX11 expression in BT474 cells compared with BT549 cells by immunoblotting. (C) Immunoblotting of lysates from cells transiently transfected with either SOX4 or SOX11 expression vectors (Origene) show that SOX11 antibody (Epitomics) does not detect SOX4. SOX4 shares a high degree of identity both in the HMG box domain and in the C-terminal region and is of a similar molecular mass to SOX11 (60 versus 59 kDa), in agreement with previously published data . (D) BT549 cell number represented as measured by PrestoBlue cell viability reagent after transfection with SOX11 or nontargeting siRNAs at daily intervals. Values represent means ± SD for three different experiments. (E) Change in percentage of viable cells was assessed by using PrestoBlue cell-viability assay of BT549 cells 72 hours after transfection with SOX11 siRNAs compared with control siRNA. Values represent mean ± SD for three different experiments. *P < 0.05, and ***P < 0.001 compared with the control. (F). Absorbance of BT549 cells transfected with either SOX11-GFP or control GFP-expressing plasmid was assessed by using PrestoBlue cell-viability assay at daily intervals. Values represent mean ± SEM for three independent experiments; *P < 0.05, compared with the control. The transfection efficiency was about 24% for the SOX11-GFP-expressing plasmid. (JPEG 875 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
About this article
Cite this article
Zvelebil, M., Oliemuller, E., Gao, Q. et al. Embryonic mammary signature subsets are activated in Brca1-/- and basal-like breast cancers. Breast Cancer Res 15, R25 (2013). https://doi.org/10.1186/bcr3403
- Breast Cancer
- SOX11 Expression
- Breast Cancer Dataset
- Embryonic Gene
- Embryonic Signature