New concepts in breast cancer genomics and genetics
Breast Cancer Research volume 16, Article number: 460 (2014)
Massively parallel DNA and RNA sequencing approaches have generated data on thousands of breast cancer genomes. In this review, we consider progress largely from the perspective of new concepts and hypotheses raised so far. These include challenges to the multistep model of breast carcinogenesis and the discovery of new defects in DNA repair through sequence analysis. Issues for functional genomics include the development of strategies to differentiate between mutations that are likely to drive carcinogenesis and bystander background mutations, as well as the importance of mechanistic studies that examine the role of mutations in genes with roles in splicing, histone methylation, and long non-coding RNA function. The application of genome-annotated patient-derived breast cancer xenografts as a potentially more reliable preclinical model is also discussed. Finally, we address the challenge of extracting medical value from genomic data. A weakness of many datasets is inadequate clinical annotation, which hampers the establishment of links between the mutation spectra and the efficacy of drugs or disease phenotypes. Tools such as dGene and the DGIdb are being developed to identify possible druggable mutations, but these programs are a work in progress since extensive molecular pharmacology is required to develop successful ‘genome-forward’ clinical trials. Examples are emerging, however, including targeting HER2 in HER2 mutant breast cancer and mutant ESR1 in ESR1 endocrine refractory luminal-type breast cancer. Finally, the integration of DNA- and RNA-based sequencing studies with mass spectrometry-based peptide sequencing and an unbiased determination of post-translational modifications promises a more complete view of the biochemistry of breast cancer cells and points toward a new discovery horizon in our understanding of the pathophysiology of this complex disease.
A decade after the first version of the human genome was published, annotation efforts continue, bringing us to the 19th revision, which is the current research standard. Analysis of protein-coding genes and their regulatory sequences is nearing completion, but these functions are served by only a small fraction of the genome. The rest is more functional than once thought, encoding, for example, many non-protein coding RNA genes with emerging regulatory and catalytic roles in cellular physiology and cancer. Furthermore, mass spectrometry-based peptide sequencing is rapidly maturing, promoting studies that provide an unbiased analysis of information flowing from DNA to mRNA to protein to post-translational modification without the need for probes or antibodies at the individual gene or protein level. Finally, deregulation of histone function and DNA methylation is readily evident in many tumor types and is a further consideration in cancer pathogenesis. Nevertheless, there is a growing chasm between our understanding of the breast cancer genome and our ability to translate these insights into improved patient outcomes. In this review, we present some of the most recent findings in the genomics field, from the biological discoveries emanating from genome sequencing studies to the clinical implications of those findings and finally to the future areas of potential research in the field.
Recent biologically relevant findings in the genomics field
Significantly mutated genes versus background mutations in breast cancer
Sequencing of DNA and RNA from tumors by using massively parallel sequencing with a capture or other sequence selection approach (exomes or candidate genes) or an unbiased ‘whole genome’ approach has become a standard research tool now that the technology has been extensively commercialized. One objective of cancer sequencing studies is to identify genes that have undergone somatic mutations that contribute to malignant transformation. Genes that accumulate somatic mutations at a higher than stochastic rate are referred to as ‘significantly mutated genes’ (SMGs) and are considered likely drivers of malignant progression. In breast cancer, there is a dramatic difference in the SMG list between luminal-type breast cancer and basal-like breast cancer. In The Cancer Genome Atlas (TCGA) breast cancer data, at least 20 SMGs were observed in luminal-type A, eight in luminal-type B, but only three in basal-like breast cancer (Table 1). This is not because luminal breast cancer genomes are more complex than those of basal-like breast cancer; in fact, the opposite is true. Basal-like breast cancer genomes are often so complex that it has proven difficult to identify the causal events by using mutation recurrence statistics. Furthermore, structural rearrangements (large-scale chromosomal deletions, amplifications, inversions, and translocations) are likely to play a particularly critical role in basal-like breast cancer, and the complete delineation of these events requires whole genome sequencing, which is technically demanding and expensive.
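The idea of a gene being mutated "at a higher than stochastic rate" can be made concrete with a simple recurrence test. The following is a minimal sketch, not the method used by TCGA or any published pipeline: it assumes a single uniform background mutation rate per base and asks how surprising the observed number of mutated tumors is under a binomial model. The function names, the example gene length, and the background rate are all hypothetical, and real SMG callers additionally model gene-specific covariates and correct for multiple testing.

```python
from math import comb


def gene_mutation_probability(gene_length_bp, background_rate_per_bp):
    """Chance that one tumor carries at least one background mutation in a gene.

    Assumes every base mutates independently at the same background rate,
    which is a deliberate simplification.
    """
    return 1.0 - (1.0 - background_rate_per_bp) ** gene_length_bp


def binomial_upper_tail(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p), summed directly.

    Suitable for cohorts of up to roughly a thousand tumors; larger cohorts
    would need a numerically more careful implementation.
    """
    return sum(comb(n, i) * p**i * (1.0 - p) ** (n - i) for i in range(k, n + 1))


def smg_p_value(mutated_tumors, total_tumors, gene_length_bp, background_rate_per_bp):
    """P-value for seeing this many mutated tumors by background mutation alone."""
    p = gene_mutation_probability(gene_length_bp, background_rate_per_bp)
    return binomial_upper_tail(mutated_tumors, total_tumors, p)


# Hypothetical cohort: 500 tumors, a 3,000 bp coding region,
# and an assumed background rate of one mutation per million bases.
recurrent = smg_p_value(40, 500, 3000, 1e-6)
sporadic = smg_p_value(3, 500, 3000, 1e-6)
print(f"40 of 500 tumors mutated: p = {recurrent:.3g}")
print(f" 3 of 500 tumors mutated: p = {sporadic:.3g}")
```

Under these assumptions, a gene mutated in a handful of tumors is compatible with background mutation, whereas a gene mutated in dozens is not. The sketch also shows why highly rearranged genomes are hard to analyze this way: when the background rate is high and variable, the recurrence signal of any single gene is diluted.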
Detection of SMGs is complicated by the presence of a large number of likely irrelevant mutations referred to as ‘background mutations’. These occur not only in genes irrelevant to transformation but even within the SMGs themselves; that is, a missense mutation in a large tumor suppressor gene cannot be assumed to be always inactivating or to cause dysfunction in the encoded protein. Mutant allele expression determined by RNA sequencing (RNA seq) is one starting point for distinguishing biologically relevant mutations in SMGs from irrelevant ones. Many mutations detected at the DNA level are not expressed at the RNA level and thus, at least from the gain-of-function perspective, are unlikely to be major players in the carcinogenesis process. Although challenges remain in functionally establishing many of the SMGs as drivers of carcinogenesis, some progress has been made. RNA seq is widely used for the nomination and validation of expressed fusion genes and was recently used to define an endocrine therapy resistance-associated ESR1 translocation. Ultimately, functional studies are critical for resolving the role of mutations in certain SMGs versus background mutations, but the large number of mutations requiring annotation makes this an extreme challenge if it is done in an unbiased way. An alternative approach is to be selective and initially study those associated with a therapeutic hypothesis. Another priority consists of the SMGs themselves, as the biology served by many of these, particularly those involved in mechanisms such as histone methylation, splicing, transcription, and long non-coding (lnc) RNA function, is unclear. For example, whole genome analysis revealed clustered mutations in MALAT1, suggesting a gain-of-function role for this poorly understood and abundant lncRNA in breast cancer.
The functions of luminal SMGs have particularly striking similarities to drivers in hematopoietic malignancies, a link also emphasized by a recent study on the role of estradiol in hematopoiesis. A particularly vexing problem is the functional resolution of mutated genes that drive pathogenesis in just a few patients or even in only one patient. A significant number of cases of luminal-type breast cancer in the TCGA analysis did not harbor a single SMG, suggesting that current genomic approaches would potentially benefit from additional refinement.
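The use of mutant allele expression as a first-pass filter, described earlier in this section, can be sketched as follows. This is an illustrative example only: it assumes that RNA seq read counts supporting the reference and mutant alleles are already available for each DNA-level mutation, and the thresholds (`min_alt_reads`, `min_fraction`) and identifiers are hypothetical rather than taken from any study cited here.

```python
def is_expressed(ref_reads, alt_reads, min_alt_reads=3, min_fraction=0.05):
    """Decide whether the mutant allele is detectably expressed in RNA seq data.

    Both thresholds are placeholders; appropriate values depend on
    sequencing depth and error rate.
    """
    total = ref_reads + alt_reads
    if total == 0:
        # No RNA coverage at this position: expression cannot be shown.
        return False
    return alt_reads >= min_alt_reads and alt_reads / total >= min_fraction


def expressed_mutations(dna_mutations, rna_counts, **thresholds):
    """Keep DNA-level mutations whose mutant allele is seen at the RNA level.

    dna_mutations: iterable of mutation identifiers
    rna_counts: dict mapping identifier -> (ref_reads, alt_reads)
    """
    kept = []
    for mutation in dna_mutations:
        ref_reads, alt_reads = rna_counts.get(mutation, (0, 0))
        if is_expressed(ref_reads, alt_reads, **thresholds):
            kept.append(mutation)
    return kept


# Hypothetical identifiers and read counts.
dna_calls = ["mutation_1", "mutation_2", "mutation_3"]
rna = {"mutation_1": (50, 20), "mutation_2": (80, 0)}
print(expressed_mutations(dna_calls, rna))
```

A filter like this addresses only the gain-of-function perspective noted above. A mutation that abolishes expression of a tumor suppressor would be discarded by it, so the absence of mutant reads is not evidence of irrelevance.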
The genomic structure of breast cancer reveals underlying DNA repair defects
Aside from the focus on the identification of individual genes that are repetitively disrupted in breast cancer, a more broad-based analysis of breast cancer genome structures has led to a paradigm shift in the way we view pathogenesis. The standard multistep model of carcinogenesis postulates that mutations accumulate gradually, one at a time, in a process of Darwinian selection in which individual mutant-bearing clones effectively compete with normal cells and other clones within the tumor through the acquisition of the ability to transform, invade, metastasize, and evade drug treatment. However, it was recently demonstrated that multiple mutations can arise over a very short period. In this process, termed chromothripsis, multiple chromosomal breaks that occur during a single catastrophic cell division event are (rarely) viably repaired, reshuffling the genome in a way that rapidly triggers transformation through simultaneous oncogene amplifications and tumor suppressor gene deletions in the vicinity of the multiple translocations that ensue (Figure 1). The reported frequency of chromothripsis in breast cancer varies from 2% to 11.06%. Since chromothripsis and interval breast cancer are both marked by the suddenness of their appearance, we hypothesize that chromothripsis might explain the development of rapidly progressing, so-called ‘interval’, breast cancers that arise suddenly between screening visits. For this class of tumors, screening could never be effective as the time span of tumor development is too short. The genomic structure of interval breast cancers should be pursued aggressively as these tumors carry a high mortality burden. As more patients are enrolled in clinical trials that include longitudinal genome sequencing of tumor samples, this hypothesis will be tested in the near future.
In another conceptual breakthrough, investigators at the Sanger Institute demonstrated that there are more than 20 different patterns of somatic mutation in cancer based on copy number aberrations and nucleotide substitution patterns, with a subset of these recurrently observed in breast cancer (APOBEC, BRCA1/2, Signature B). Overexpression of cytidine deaminase APOBEC family members, in particular, has come into sharp focus. Clustered mutations characteristic of APOBEC activity have been particularly observed in and around chromosomal breakpoints, suggesting that single-stranded DNA generated during aberrant DNA repair is a substrate for APOBEC enzymatic activity. Differences in DNA repair defects explain the striking finding that some breast cancers display many more mutations than others. Thus, even in the absence of a known SMG, it is possible to classify breast cancers on the basis of DNA repair defects and this could be clinically relevant. For example, clinical assays in development aim to identify tumors with defects in homologous recombination, which sensitize tumors to cytotoxic chemotherapy.
Intra-tumor heterogeneity in breast cancer
Chromothripsis, multistep progression, and defects in DNA repair combine to produce astonishing levels of both intra-tumoral and inter-tumoral heterogeneity in breast cancer. This complexity is an obvious explanation for the difficulty in curing breast cancer, particularly when advanced. As the tumor progresses and disseminates, the repertoire of biological possibilities encoded within billions of malignant cells, each subtly genetically different, means that resistance to targeted or more traditional cytotoxic therapy is almost inevitable. There is still not enough genomic data from multiple cancer samples from the same patient to track somatic mutation patterns from the primary through to metastatic disease and subsequent drug resistance. Longitudinal studies of this type, however, have been conducted successfully in individual cases. In 2009, Shah and colleagues described the mutational evolution of a lobular breast carcinoma by using next-generation sequencing. Out of the 32 somatic, protein-coding mutations present in the metastasis, 19 could not be detected in the primary, five were prevalent in the primary, and six were present in the primary with a lower frequency. The Washington University group investigated the progression of a breast cancer to the brain at the whole genome level and found that the primary tumor and metastasis harbored approximately 48 somatic, protein-coding mutations. The metastatic sample contained few de novo mutations, but some shared mutations showed higher variant allele frequencies and a few showed much lower ones, supporting a ‘clonal remodeling’ hypothesis for metastatic spread. At the single cell level of the tumor, various techniques have been used to directly visualize and quantify chromosomal aberrations, including duplications, deletions, and other distinctive chromosomal rearrangements. These studies show that breast cancers routinely exhibit genetic heterogeneity at preferred loci.
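The comparison of primary and metastatic samples described above amounts to sorting each mutation by how its variant allele frequency (VAF) changes between the two samples. The sketch below illustrates one way to do this; the category names, the detection limit, and the fold-change cutoff are assumptions chosen for illustration and are not the criteria used in the studies cited.

```python
from collections import Counter


def classify_mutation(primary_vaf, metastasis_vaf, detection_limit=0.01, fold_change=2.0):
    """Assign a mutation to a category based on its VAF in two samples.

    detection_limit and fold_change are hypothetical thresholds.
    """
    in_primary = primary_vaf >= detection_limit
    in_metastasis = metastasis_vaf >= detection_limit
    if not in_primary and not in_metastasis:
        return "not detected"
    if not in_primary:
        return "metastasis only"
    if not in_metastasis:
        return "primary only"
    ratio = metastasis_vaf / primary_vaf
    if ratio >= fold_change:
        return "enriched in metastasis"
    if ratio <= 1.0 / fold_change:
        return "depleted in metastasis"
    return "stable"


def summarize(vaf_pairs, **thresholds):
    """Count mutations per category for a list of (primary_vaf, metastasis_vaf)."""
    return Counter(classify_mutation(p, m, **thresholds) for p, m in vaf_pairs)


# Hypothetical VAF pairs for five mutations.
example = [(0.00, 0.30), (0.10, 0.40), (0.40, 0.10), (0.30, 0.35), (0.25, 0.00)]
for category, count in sorted(summarize(example).items()):
    print(f"{category}: {count}")
```

In a pattern consistent with clonal remodeling, most mutations would fall in the shared categories (enriched, depleted, or stable) and few in "metastasis only". A large "metastasis only" group would instead point to substantial evolution after dissemination or to a subclone below the detection limit in the primary.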
Evidence for marked tumor heterogeneity can be found in studies of other cancer types. For example, in a study of a renal cancer with metastasis to the lung and in the chest wall, sequencing of the metastases and nine different areas within the primary tumor found that only a third of mutations were common to all samples. Based on these data, we can infer that heterogeneity and different subclones develop within the primary tumor, not all of which have the same metastatic potential. Metastases can develop early or late in each cancer’s evolutionary history and are products of ongoing clonal evolution, which can be slow or very rapid. The ability to sequence individual cancer cells will further illuminate this issue, although the complexity of the data analysis remains a considerable challenge.
Clinical implications of genomic discoveries
Clinical translation of massively parallel sequencing of DNA in breast cancer
Cancer sequencing with return of data to the patient and physician is being piloted through ‘genomic tumor boards’. However, the complexity of the breast cancer genome has slowed progress, as has the relative paucity of obvious drug-mutation matches. Unlike in melanoma and non-small cell lung cancer, drug therapy matched to the presence of a somatic mutation has yet to be robustly established as a standard approach in breast cancer. A number of strategies to increase the productivity and ‘translatability’ of DNA, RNA, and peptide sequencing studies in breast cancer should be considered. The initial set of sequencing-based studies in breast cancer revealed that this is one of the most heterogeneous forms of cancer, with the four commonly accepted subtypes (luminal-type A, luminal-type B, HER2-enriched, and basal-like) displaying distinct somatic mutation, gene copy, and epigenetic profiles. Within the next few years, tens of thousands of primary breast cancers will likely be sequenced but often through clinical sequencing programs without a current systematic and broad-based plan to integrate the data with clinical endpoints. These studies risk following the course of the TCGA breast cancer study. While a technical tour de force, TCGA was largely a cross-platform genome-cataloging exercise and not a systematic clinical research study addressing a particular problem in oncology. Thus, it will not be possible to link the TCGA data to important clinical phenotypes such as drug response. Since polypharmacy is the rule in breast cancer treatment, establishing a link between mutational events and the efficacy of individual drugs is impossible unless a dedicated study is conducted. The neoadjuvant treatment setting allows ethical treatment plans with single agents as well as the acquisition of serial samples to assess the effect of treatment on breast cancer somatic genomes, another subject in its infancy in breast cancer.
Thus, a systematic approach linking high-quality sample acquisition, uniform neoadjuvant therapy regimens, and integrated ‘omics’ should be a high priority for clinical investigators. An example is provided by an integrated analysis of whole genome, exome-based somatic mutation detection, gene-expression, and gene copy profiles that identified molecular correlates of aromatase inhibitor-resistant proliferation by using samples from a neoadjuvant study. Mutations in TP53 were associated with endocrine therapy resistance and poor-prognosis luminal-type B features, mutations in the stress kinase MAP3K1 with low proliferation and luminal-type A features, and mutations in GATA3 with increased responsiveness to aromatase inhibition. A current research focus is to confirm these findings and to conduct additional studies with large sample sizes to link other breast cancer SMGs to clinical outcomes.
The druggable breast cancer genome
A major obstacle to the translation of newly defined genetic alterations into clinical benefit for patients lies in the identification of biologically relevant druggable aberrations that can be used as therapeutic targets. To address this goal, programs such as dGene and DGIdb have been developed. The dGene program is an updated version of the druggable genome concept introduced in 2002 by Hopkins and Groom. The druggable genome refers to a subset of genes that are known or predicted to interact with drugs. The software stratifies mutations from any database containing gene symbols into 10 different gene classes that are both potentially druggable and clinically relevant to cancer biology. An annotation and filtering tool is used to prioritize mutations for consideration. The analysis of a recent breast cancer genomic study highlights the potential utility of this approach. From a total of 2,622 single-nucleotide variants identified in the neoadjuvant aromatase inhibitor study discussed above, dGene identified 368 mutations as occurring in 255 druggable genes. When filtered for recurrence, that number was narrowed to 37 potentially druggable mutated genes present in at least two patients (Table 2). Despite its utility, dGene does not provide information on the type of mutation or guarantee clinical pertinence of mutations associated with any specific gene. This underscores the critical need to functionally test these and other genomic results.
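The two-stage filtering described above (restrict to druggable gene classes, then require recurrence in at least two patients) can be sketched in a few lines. This is not the dGene implementation: the function name, the input format, and the class labels below are hypothetical, and the example records are invented purely to show the logic.

```python
from collections import defaultdict


def recurrent_druggable_genes(mutations, druggable_classes, min_patients=2):
    """Return druggable genes mutated in at least `min_patients` distinct patients.

    mutations: iterable of (patient_id, gene_symbol) records
    druggable_classes: dict mapping gene_symbol -> class label (illustrative labels)
    Result maps gene_symbol -> (class label, number of distinct patients).
    """
    patients_by_gene = defaultdict(set)
    for patient_id, gene in mutations:
        if gene in druggable_classes:
            # A set ensures that several mutations in one patient count once.
            patients_by_gene[gene].add(patient_id)
    return {
        gene: (druggable_classes[gene], len(patients))
        for gene, patients in patients_by_gene.items()
        if len(patients) >= min_patients
    }


# Invented records: class labels are placeholders, not dGene's own categories.
classes = {"PIK3CA": "kinase", "MAP3K1": "kinase"}
records = [
    ("patient_1", "PIK3CA"),
    ("patient_1", "PIK3CA"),  # second mutation in the same patient
    ("patient_2", "PIK3CA"),
    ("patient_2", "MAP3K1"),
    ("patient_3", "TP53"),    # not in the illustrative druggable set
]
print(recurrent_druggable_genes(records, classes))
```

Counting distinct patients rather than mutations matters here, because a single hypermutated tumor could otherwise make a gene appear recurrent. As the text notes, passing such a filter says nothing about mutation type or clinical pertinence.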
A similar tool is DGIdb. The concept behind the DGIdb is to classify gene mutations into two classes: genes that are known to have drug interactions and genes that are potentially druggable according to their gene category. DGIdb was developed by integrating data from 13 different sources and contains over 14,000 drug-gene interactions. It also includes 6,761 genes that belong to one or more of 39 potentially druggable gene categories. The utility of DGIdb was demonstrated by analyzing a cohort of 1,273 patients who were included in whole-genome or exome sequencing studies. The software identified 6 of 31 genes (AKT1, CDH1, LRP2, PIK3CA, RYR2, and TP53) that were recurrently mutated in at least 2.5% of patients and also have known drug-gene interactions. With the addition of the top 1% of recurring mutations, the number of genes increased to 315. Six sources - DrugBank, MyCancerGenome, the Pharmacogenetics Knowledge Base (PharmGKB), Trends in the Exploitation of Novel Drug Targets (TEND), Targeted Agents in Lung Cancer (TALC), and Therapeutic Target Database (TTD) - were interrogated by DGIdb to identify a total of 354 possible druggable gene interactions among the 315 genes. There was limited overlap between the sources, and only one drug-gene interaction was present in all six sources simultaneously (Figure 2a). The nature and extent of curation as well as the overall methodologies employed by each source differ, which explains the limited overlap. Some of the 315 genes are in potentially druggable categories (dGene), and others represent opportunities for drug discovery (Figure 2b).
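The overlap analysis described above can be illustrated with plain set operations. The sketch below does not query DGIdb or use its interface; it assumes the drug-gene interactions reported by each source have already been collected into sets, and all source, drug, and gene names are placeholders.

```python
from collections import Counter


def source_support(interactions_by_source):
    """Count how many sources report each (drug, gene) interaction."""
    support = Counter()
    for interactions in interactions_by_source.values():
        # set() guards against a source listing the same pair twice.
        support.update(set(interactions))
    return support


def shared_by_all(interactions_by_source):
    """Return the interactions reported by every source."""
    n_sources = len(interactions_by_source)
    support = source_support(interactions_by_source)
    return {pair for pair, count in support.items() if count == n_sources}


# Placeholder data standing in for curated interaction sources.
sources = {
    "source_A": {("drug_1", "GENE_X"), ("drug_2", "GENE_Y")},
    "source_B": {("drug_1", "GENE_X")},
    "source_C": {("drug_1", "GENE_X"), ("drug_3", "GENE_Z")},
}
print("Reported by all sources:", shared_by_all(sources))
print("Reported by one source only:",
      sorted(pair for pair, count in source_support(sources).items() if count == 1))
```

Interactions supported by a single source are not necessarily wrong, since each source applies different curation criteria, but the support count gives a simple way to rank candidates before functional follow-up.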
This analysis serves to emphasize that these druggable genome approaches remain unvalidated by clinical trials and the pre-existing pharmacopeia is obviously inadequate, although ‘drug repurposing’ - the concept of redirecting US Food and Drug Administration-approved drugs to new secondary indications - is clearly an opportunity. Thus, in their current form, these computational approaches are mostly hypothesis-generating tools that are intended to accelerate medical research, not tools for clinical action (at least not yet). The next logical step after using such tools is to design functional studies to test the related drugs and find a more reliable answer as to whether such mutations are drivers of carcinogenesis or just background mutations.
HER2 and ESR1 mutations as examples of novel druggable targets
The utility of detailed preclinical work on potentially druggable genes is nicely illustrated by the study of HER2 mutations in breast cancer. Data from eight breast cancer genome-sequencing studies identified 25 patients with HER2 somatic mutations without HER2 amplification. Thirteen HER2 mutations were functionally characterized by using in vitro kinase assays, protein structure analysis, cell culture, and xenograft experiments. The results showed that the investigational drug neratinib, an irreversible HER2 inhibitor, rather than lapatinib, an approved HER2 kinase inhibitor, was a better approach for clinical studies since some of the recurrent mutations were naturally lapatinib-resistant. This is a result that simple software matching drugs to somatic mutations would not have revealed. Currently, patients with advanced HER2 mutation-positive tumors are being enrolled into a single-agent study of neratinib (NCT01670877). Point mutations in the estradiol-binding domain of the estrogen receptor gene (ESR1) are emerging as a potent cause of acquired endocrine therapy resistance. Although there are no drugs that specifically target these mutations, alternative endocrine therapies may be effective in this setting, and this possibility will soon be addressed in clinical trials.
Patient-derived xenografts as genomic models for breast cancer
A major criticism of standard cell lines as a model for human breast cancer is that they are essentially disconnected from the individuals from whom they were derived. Without knowledge of the progenitor tumor genome as a reference point or of the clinical characteristics of the patient who donated the tissue, it is uncertain what the cell lines actually model from an individual patient perspective and to what degree genetic drift has occurred after prolonged in vitro culture. These limitations likely contribute to the poor predictive utility of cell line panels in drug development. An alternative preclinical model for drug optimization and target validation is the patient-derived xenograft (PDX) approach. Detailed information covering the continuum from specimen acquisition to development of patient-derived xenografts has been presented and reviewed elsewhere. In brief, a biopsy-sized sample of primary or metastatic tumor is transferred directly into an immunodeficient mouse by orthotopic or subcutaneous implantation. Once tumor engraftment has occurred, RNA and DNA sequencing or chip-based analysis is employed to compare the patient tumor to the PDX. PDXs maintain fidelity to the patient tumor based on molecular subtypes, mutational spectrum, copy number variations, gene expression profiles, and histopathology. PDX models faithfully recapitulate the intra-tumor heterogeneity and response to chemotherapy. This close resemblance between the PDX and the patient tumor makes the PDX a suitable predictive preclinical model. The deployment of PDXs therefore can be considered a ‘test bed’ for personalized precision medicine in which genome-forward hypotheses can be assessed preclinically. However, despite the great promise and utility of PDXs, there are some drawbacks that need to be resolved to ensure wider adoption and improved utility.
The limitations are the higher comparative cost, high level of technical expertise needed, the lack of an immune system, the effect of differences between the mouse and human microenvironment, and the degree of genetic drift and how this affects conclusions regarding biological and pharmacological findings.
Even with the mentioned limitations, the PDX model has great utility in breast cancer research. Through the genome sequencing of different PDX lines, Li and colleagues identified new ESR1 point mutations and translocations. These gene mutations and the ESR1-YAP1 gene fusion were further investigated through functional studies that directly implicated them in resistance to treatment. Not coincidentally, the patients from whom these PDXs were derived presented with endocrine treatment resistance during their course of treatment.
Future areas of research
Proteomics as the next step in the annotation of the breast cancer genome
A fundamental problem in the study of cancer genomics at the level of DNA and RNA is that conclusions regarding pathway activation are indirect since proteins, not nucleic acids, execute these functions. Thus, when signaling and biology are discussed, it is through inference from signal transduction databases built on experiments that may or may not have been conducted in the relevant biological context and that may or may not be correct. Informatics approaches generate hypotheses, not conclusions. The reverse phase protein array (RPPA) is one answer to the problem of efficiently tracking protein levels and phosphorylation events. Here, protein extracts from many tumors are spotted onto slides and probed with highly quality-controlled antibodies. Unfortunately, the generation of RPPA-quality antibodies is technically challenging; in particular, the number of phosphosite-specific antibodies is very limited. Therefore, mass spectrometry is being developed to examine the protein biochemistry of the cancer cells in less biased ways by direct protein sequencing and mass analysis to determine post-translational modifications. Next-generation proteomic technologies are poised to provide deep information on tumor proteomes and on post-translational modifications of all types. When combined with genomic data, proteomics may enable a deeper understanding of complex mechanisms that regulate gene function and dysfunction in cancer. These objectives are being realized by the National Cancer Institute Clinical Proteomic Tumor Analysis Consortium, which is applying standardized proteome analysis platforms to analyze tumor tissues from the TCGA program as well as unique cell and xenograft models and other tissue collections, all of which are accompanied by rich genomic datasets.
The expansion of knowledge in genomics is already having a profound effect on breast cancer research and increasingly on treatment. It is clear, however, that genome-sequencing studies have still not been adequately designed to address specific questions in breast cancer oncology. Such design is essential to translate the comprehensive catalog of recurrent mutations in breast cancer into a functionally and pharmacologically annotated treatment road map. Through the sequencing of tumors at different time points, we will be able to identify cellular pathways and targets for drug development and use this information for the development of clinically testable hypotheses. Integrated approaches that not only account for DNA and RNA aberrations but also document protein function and biochemistry are clearly the next technical horizon.
All the authors made substantial contributions to the conception and design of this article, participated in drafting the article or revising it critically for important intellectual content, and gave final approval of the version submitted.
Abbreviations
AKT1: v-akt murine thymoma viral oncogene homolog 1
APOBEC: Apolipoprotein B mRNA editing enzyme, catalytic polypeptide-like
BRCA1: Breast cancer 1, early onset
BRCA2: Breast cancer 2, early onset
CDH1: Cadherin 1, type 1, E-cadherin (epithelial)
ESR1: Estrogen receptor 1
GATA3: GATA binding protein 3
HER2: Human epidermal growth factor receptor 2
lnc: Long non-coding
LRP2: Low density lipoprotein receptor-related protein 2
MALAT1: Metastasis associated lung adenocarcinoma transcript 1
MAP3K1: Mitogen-activated protein kinase kinase kinase 1, E3 ubiquitin protein ligase
PIK3CA: Phosphatidylinositol-4,5-bisphosphate 3-kinase, catalytic subunit alpha
RNA seq: RNA sequencing
RPPA: Reverse phase protein array
RYR2: Ryanodine receptor 2 (cardiac)
SMG: Significantly mutated gene
TCGA: The Cancer Genome Atlas
TP53: Tumor protein p53
Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, Stange-Thomann N, et al: Initial sequencing and analysis of the human genome. Nature. 2001, 409: 860-921. 10.1038/35057062.
The ENCODE (ENCyclopedia Of DNA Elements) Project. Science. 2004, 306: 636-640. 10.1126/science.1105136.
Ellis MJ, Perou CM: The genomic landscape of breast cancer as a therapeutic roadmap. Cancer Discov. 2013, 3: 27-34. 10.1158/2159-8290.CD-12-0462.
Tsai HC, Baylin SB: Cancer epigenetics: linking basic biology to clinical medicine. Cell Res. 2011, 21: 502-517. 10.1038/cr.2011.24.
Mardis ER: Genome sequencing and cancer. Curr Opin Genet Dev. 2012, 22: 245-250. 10.1016/j.gde.2012.03.005.
Koboldt DC, Steinberg KM, Larson DE, Wilson RK, Mardis ER: The next-generation sequencing revolution and its impact on genomics. Cell. 2013, 155: 27-38. 10.1016/j.cell.2013.09.006.
Mutz KO, Heilkenbrinker A, Lönne M, Walter JG, Stahl F: Transcriptome analysis using next-generation sequencing. Curr Opin Biotechnol. 2013, 24: 22-30. 10.1016/j.copbio.2012.09.004.
Ding L, Ellis MJ, Li S, Larson DE, Chen K, Wallis JW, Harris CC, McLellan MD, Fulton RS, Fulton LL, Abbott RM, Hoog J, Dooling DJ, Koboldt DC, Schmidt H, Kalicki J, Zhang Q, Chen L, Lin L, Wendl MC, McMichael JF, Magrini VJ, Cook L, McGrath SD, Vickery TL, Appelbaum E, Deschryver K, Davies S, Guintoli T, Lin L, et al: Genome remodelling in a basal-like breast cancer metastasis and xenograft. Nature. 2010, 464: 999-1005. 10.1038/nature08989.
Vogelstein B, Papadopoulos N, Velculescu VE, Zhou S, Diaz LA, Kinzler KW: Cancer genome landscapes. Science. 2013, 339: 1546-1558. 10.1126/science.1235122.
Kalari S, Pfeifer GP: Identification of driver and passenger DNA methylation in cancer by epigenomic analysis. Adv Genet. 2010, 70: 277-308. 10.1016/B978-0-12-380866-0.60010-1.
Bignell GR, Greenman CD, Davies H, Butler AP, Edkins S, Andrews JM, Buck G, Chen L, Beare D, Latimer C, Widaa S, Hinton J, Fahey C, Fu B, Swamy S, Dalgliesh GL, Teh BT, Deloukas P, Yang F, Campbell PJ, Futreal PA, Stratton MR: Signatures of mutation and selection in the cancer genome. Nature. 2010, 463: 893-898. 10.1038/nature08768.
Li S, Shen D, Shao J, Crowder R, Liu W, Prat A, He X, Liu S, Hoog J, Lu C, Ding L, Griffith OL, Miller C, Larson D, Fulton RS, Harrison M, Mooney T, McMichael JF, Luo J, Tao Y, Goncalves R, Schlosberg C, Hiken JF, Saied L, Sanchez C, Giuntoli T, Bumb C, Cooper C, Kitchens RT, Lin A, et al: Endocrine-therapy-resistant ESR1 variants revealed by genomic characterization of breast-cancer-derived xenografts. Cell Rep. 2013, 4: 1116-1130. 10.1016/j.celrep.2013.08.022.
Liu ET: Functional genomics of cancer. Curr Opin Genet Dev. 2008, 18: 251-256. 10.1016/j.gde.2008.07.014.
Ellis MJ, Ding L, Shen D, Luo J, Suman VJ, Wallis JW, Van Tine BA, Hoog J, Goiffon RJ, Goldstein TC, Ng S, Lin L, Crowder R, Snider J, Ballman K, Weber J, Chen K, Koboldt DC, Kandoth C, Schierding WS, McMichael JF, Miller CA, Lu C, Harris CC, McLellan MD, Wendl MC, DeSchryver K, Allred DC, Esserman L, Unzeitig G, et al: Whole-genome analysis informs breast cancer response to aromatase inhibition. Nature. 2012, 486: 353-360.
Nakada D, Oguro H, Levi BP, Ryan N, Kitano A, Saitoh Y, Takeichi M, Wendt GR, Morrison SJ: Oestrogen increases haematopoietic stem-cell self-renewal in females and during pregnancy. Nature. 2014, 505: 555-558. 10.1038/nature12932.
Cancer Genome Atlas Network: Comprehensive molecular portraits of human breast tumours. Nature. 2012, 490: 61-70. 10.1038/nature11453.
Nowell PC: The clonal evolution of tumor cell populations. Science. 1976, 194: 23-28. 10.1126/science.959840.
Stephens PJ, Greenman CD, Fu B, Yang F, Bignell GR, Mudie LJ, Pleasance ED, Lau KW, Beare D, Stebbings LA, McLaren S, Lin ML, McBride DJ, Varela I, Nik-Zainal S, Leroy C, Jia M, Menzies A, Butler AP, Teague JW, Quail MA, Burton J, Swerdlow H, Carter NP, Morsberger LA, Iacobuzio-Donahue C, Follows GA, Green AR, Flanagan AM, Stratton MR, et al: Massive genomic rearrangement acquired in a single catastrophic event during cancer development. Cell. 2011, 144: 27-40. 10.1016/j.cell.2010.11.055.
Cai H, Kumar N, Bagheri HC, von Mering C, Robinson MD, Baudis M: Chromothripsis-like patterns are recurring but heterogeneously distributed features in a survey of 22,347 cancer genome screens. BMC Genomics. 2014, 15: 82. 10.1186/1471-2164-15-82.
Nik-Zainal S, Alexandrov LB, Wedge DC, Van Loo P, Greenman CD, Raine K, Jones D, Hinton J, Marshall J, Stebbings LA, Menzies A, Martin S, Leung K, Chen L, Leroy C, Ramakrishna M, Rance R, Lau KW, Mudie LJ, Varela I, McBride DJ, Bignell GR, Cooke SL, Shlien A, Gamble J, Whitmore I, Maddison M, Tarpey PS, Davies HR, Papaemmanuil E, et al: Mutational processes molding the genomes of 21 breast cancers. Cell. 2012, 149: 979-993. 10.1016/j.cell.2012.04.024.
Roberts SA, Lawrence MS, Klimczak LJ, Grimm SA, Fargo D, Stojanov P, Kiezun A, Kryukov GV, Carter SL, Saksena G, Harris S, Shah RR, Resnick MA, Getz G, Gordenin DA: An APOBEC cytidine deaminase mutagenesis pattern is widespread in human cancers. Nat Genet. 2013, 45: 970-976. 10.1038/ng.2702.
Greenman C, Stephens P, Smith R, Dalgliesh GL, Hunter C, Bignell G, Davies H, Teague J, Butler A, Stevens C, Edkins S, O'Meara S, Vastrik I, Schmidt EE, Avis T, Barthorpe S, Bhamra G, Buck G, Choudhury B, Clements J, Cole J, Dicks E, Forbes S, Gray K, Halliday K, Harrison R, Hills K, Hinton J, Jenkinson A, Jones D, et al: Patterns of somatic mutation in human cancer genomes. Nature. 2007, 446: 153-158. 10.1038/nature05610.
Yap TA, Sandhu SK, Carden CP, de Bono JS: Poly(ADP-ribose) polymerase (PARP) inhibitors: exploiting a synthetic lethal strategy in the clinic. CA Cancer J Clin. 2011, 61: 31-49. 10.3322/caac.20095.
Shah SP, Morin RD, Khattra J, Prentice L, Pugh T, Burleigh A, Delaney A, Gelmon K, Guliany R, Senz J, Steidl C, Holt RA, Jones S, Sun M, Leung G, Moore R, Severson T, Taylor GA, Teschendorff AE, Tse K, Turashvili G, Varhol R, Warren RL, Watson P, Zhao Y, Caldas C, Huntsman D, Hirst M, Marra MA, Aparicio S: Mutational evolution in a lobular breast tumour profiled at single nucleotide resolution. Nature. 2009, 461: 809-813. 10.1038/nature08489.
Roka S, Fiegl M, Zojer N, Filipits M, Schuster R, Steiner B, Jakesz R, Huber H, Drach J: Aneuploidy of chromosome 8 as detected by interphase fluorescence in situ hybridization is a recurrent finding in primary and metastatic breast cancer. Breast Cancer Res Treat. 1998, 48: 125-133. 10.1023/A:1005937305102.
Park SY, Lee HE, Li H, Shipitsin M, Gelman R, Polyak K: Heterogeneity for stem cell-related markers according to tumor subtype and histologic stage in breast cancer. Clin Cancer Res. 2010, 16: 876-887. 10.1158/1078-0432.CCR-09-1532.
Farabegoli F, Santini D, Ceccarelli C, Taffurelli M, Marrano D, Baldini N: Clone heterogeneity in diploid and aneuploid breast carcinomas as detected by FISH. Cytometry. 2001, 46: 50-56. 10.1002/1097-0320(20010215)46:1<50::AID-CYTO1037>3.0.CO;2-T.
Teixeira MR, Pandis N, Bardi G, Andersen JA, Heim S: Karyotypic comparisons of multiple tumorous and macroscopically normal surrounding tissue samples from patients with breast cancer. Cancer Res. 1996, 56: 855-859.
Teixeira MR, Pandis N, Bardi G, Andersen JA, Mitelman F, Heim S: Clonal heterogeneity in breast cancer: karyotypic comparisons of multiple intra- and extra-tumorous samples from 3 patients. Int J Cancer. 1995, 63: 63-68. 10.1002/ijc.2910630113.
Gerlinger M, Rowan AJ, Horswell S, Larkin J, Endesfelder D, Gronroos E, Martinez P, Matthews N, Stewart A, Tarpey P, Varela I, Phillimore B, Begum S, McDonald NQ, Butler A, Jones D, Raine K, Latimer C, Santos CR, Nohadani M, Eklund AC, Spencer-Dene B, Clark G, Pickering L, Stamp G, Gore M, Szallasi Z, Downward J, Futreal PA, Swanton C: Intratumor heterogeneity and branched evolution revealed by multiregion sequencing. N Engl J Med. 2012, 366: 883-892. 10.1056/NEJMoa1113205.
Navin N, Kendall J, Troge J, Andrews P, Rodgers L, McIndoo J, Cook K, Stepansky A, Levy D, Esposito D, Muthuswamy L, Krasnitz A, McCombie WR, Hicks J, Wigler M: Tumour evolution inferred by single-cell sequencing. Nature. 2011, 472: 90-94. 10.1038/nature09807.
Roychowdhury S, Iyer MK, Robinson DR, Lonigro RJ, Wu YM, Cao X, Kalyana-Sundaram S, Sam L, Balbin OA, Quist MJ, Barrette T, Everett J, Siddiqui J, Kunju LP, Navone N, Araujo JC, Troncoso P, Logothetis CJ, Innis JW, Smith DC, Lao CD, Kim SY, Roberts JS, Gruber SB, Pienta KJ, Talpaz M, Chinnaiyan AM: Personalized oncology through integrative high-throughput sequencing: a pilot study. Sci Transl Med. 2011, 3: 111ra121. 10.1126/scitranslmed.3003161.
Simon R, Roychowdhury S: Implementing personalized cancer genomics in clinical trials. Nat Rev Drug Discov. 2013, 12: 358-369. 10.1038/nrd3979.
Natrajan R, Wilkerson P: From integrative genomics to therapeutic targets. Cancer Res. 2013, 73: 3483-3488. 10.1158/0008-5472.CAN-12-4717.
Kumar RD, Chang LW, Ellis MJ, Bose R: Prioritizing potentially druggable mutations with dGene: an annotation tool for cancer genome sequencing data. PLoS One. 2013, 8: e67980. 10.1371/journal.pone.0067980.
Griffith M, Griffith OL, Coffman AC, Weible JV, McMichael JF, Spies NC, Koval J, Das I, Callaway MB, Eldred JM, Miller CA, Subramanian J, Govindan R, Kumar RD, Bose R, Ding L, Walker JR, Larson DE, Dooling DJ, Smith SM, Ley TJ, Mardis ER, Wilson RK: DGIdb: mining the druggable genome. Nat Methods. 2013, 10: 1209-1210. 10.1038/nmeth.2689.
Hopkins AL, Groom CR: The druggable genome. Nat Rev Drug Discov. 2002, 1: 727-730. 10.1038/nrd892.
Banerji S, Cibulskis K, Rangel-Escareno C, Brown KK, Carter SL, Frederick AM, Lawrence MS, Sivachenko AY, Sougnez C, Zou L, Cortes ML, Fernandez-Lopez JC, Peng S, Ardlie KG, Auclair D, Bautista-Piña V, Duke F, Francis J, Jung J, Maffuz-Aziz A, Onofrio RC, Parkin M, Pho NH, Quintanar-Jurado V, Ramos AH, Rebollar-Vega R, Rodriguez-Cuevas S, Romero-Cordoba SL, Schumacher SE, Stransky N, et al: Sequence analysis of mutations and translocations across breast cancer subtypes. Nature. 2012, 486: 405-409. 10.1038/nature11154.
Kan Z, Jaiswal BS, Stinson J, Janakiraman V, Bhatt D, Stern HM, Yue P, Haverty PM, Bourgon R, Zheng J, Moorhead M, Chaudhuri S, Tomsho LP, Peters BA, Pujara K, Cordes S, Davis DP, Carlton VE, Yuan W, Li L, Wang W, Eigenbrot C, Kaminker JS, Eberhard DA, Waring P, Schuster SC, Modrusan Z, Zhang Z, Stokoe D, de Sauvage FJ, et al: Diverse somatic mutation patterns and pathway alterations in human cancers. Nature. 2010, 466: 869-873. 10.1038/nature09208.
Shah SP, Roth A, Goya R, Oloumi A, Ha G, Zhao Y, Turashvili G, Ding J, Tse K, Haffari G, Bashashati A, Prentice LM, Khattra J, Burleigh A, Yap D, Bernard V, McPherson A, Shumansky K, Crisan A, Giuliany R, Heravi-Moussavi A, Rosner J, Lai D, Birol I, Varhol R, Tam A, Dhalla N, Zeng T, Ma K, Chan SK, et al: The clonal and mutational evolution spectrum of primary triple-negative breast cancers. Nature. 2012, 486: 395-399.
Stephens PJ, Tarpey PS, Davies H, Van Loo P, Greenman C, Wedge DC, Nik-Zainal S, Martin S, Varela I, Bignell GR, Yates LR, Papaemmanuil E, Beare D, Butler A, Cheverton A, Gamble J, Hinton J, Jia M, Jayakumar A, Jones D, Latimer C, Lau KW, McLaren S, McBride DJ, Menzies A, Mudie L, Raine K, Rad R, Chapman MS, Teague J, et al: The landscape of cancer genes and mutational processes in breast cancer. Nature. 2012, 486: 400-404.
Lee JW, Soung YH, Seo SH, Kim SY, Park CH, Wang YP, Park K, Nam SW, Park WS, Kim SH, Lee JY, Yoo NJ, Lee SH: Somatic mutations of ERBB2 kinase domain in gastric, colorectal, and breast carcinomas. Clin Cancer Res. 2006, 12: 57-61. 10.1158/1078-0432.CCR-05-0976.
Bose R, Kavuri SM, Searleman AC, Shen W, Shen D, Koboldt DC, Monsey J, Goel N, Aronson AB, Li S, Ma CX, Ding L, Mardis ER, Ellis MJ: Activating HER2 mutations in HER2 gene amplification negative breast cancer. Cancer Discov. 2013, 3: 224-237. 10.1158/2159-8290.CD-12-0349.
Toy W, Shen Y, Won H, Green B, Sakr RA, Will M, Li Z, Gala K, Fanning S, King TA, Hudis C, Chen D, Taran T, Hortobagyi G, Greene G, Berger M, Baselga J, Chandarlapaty S: ESR1 ligand-binding domain mutations in hormone-resistant breast cancer. Nat Genet. 2013, 45: 1439-1445. 10.1038/ng.2822.
Robinson DR, Wu YM, Vats P, Su F, Lonigro RJ, Cao X, Kalyana-Sundaram S, Wang R, Ning Y, Hodges L, Gursky A, Siddiqui J, Tomlins SA, Roychowdhury S, Pienta KJ, Kim SY, Roberts JS, Rae JM, Van Poznak CH, Hayes DF, Chugh R, Kunju LP, Talpaz M, Schott AF, Chinnaiyan AM: Activating ESR1 mutations in hormone-resistant metastatic breast cancer. Nat Genet. 2013, 45: 1446-1451. 10.1038/ng.2823.
Ellis LM, Fidler IJ: Finding the tumor copycat. Therapy fails, patients don't. Nat Med. 2010, 16: 974-975. 10.1038/nm0910-974.
Johnson JI, Decker S, Zaharevitz D, Rubinstein LV, Venditti JM, Schepartz S, Kalyandrug S, Christian M, Arbuck S, Hollingshead M, Sausville EA: Relationships between drug activity in NCI preclinical in vitro and in vivo models and early clinical trials. Br J Cancer. 2001, 84: 1424-1431. 10.1054/bjoc.2001.1796.
Voskoglou-Nomikos T, Pater JL, Seymour L: Clinical predictive value of the in vitro cell line, human xenograft, and mouse allograft preclinical cancer models. Clin Cancer Res. 2003, 9: 4227-4239.
Jin K, Teng L, Shen Y, He K, Xu Z, Li G: Patient-derived human tumour tissue xenografts in immunodeficient mice: a systematic review. Clin Transl Oncol. 2010, 12: 473-480. 10.1007/s12094-010-0540-6.
Morton CL, Houghton PJ: Establishment of human tumor xenografts in immunodeficient mice. Nat Protoc. 2007, 2: 247-250. 10.1038/nprot.2007.25.
Rubio-Viqueira B, Hidalgo M: Direct in vivo xenograft tumor model for predicting chemotherapeutic drug response in cancer patients. Clin Pharmacol Ther. 2009, 85: 217-221. 10.1038/clpt.2008.200.
Sausville EA, Burger AM: Contributions of human tumor xenografts to anticancer drug development. Cancer Res. 2006, 66: 3351-3354; discussion 3354. 10.1158/0008-5472.CAN-05-3627.
DeRose YS, Wang G, Lin YC, Bernard PS, Buys SS, Ebbert MT, Factor R, Matsen C, Milash BA, Nelson E, Neumayer L, Randall RL, Stijleman IJ, Welm BE, Welm AL: Tumor grafts derived from women with breast cancer authentically reflect tumor pathology, growth, metastasis and disease outcomes. Nat Med. 2011, 17: 1514-1520. 10.1038/nm.2454.
McEvoy J, Ulyanov A, Brennan R, Wu G, Pounds S, Zhang J, Dyer MA: Analysis of MDM2 and MDM4 single nucleotide polymorphisms, mRNA splicing and protein expression in retinoblastoma. PLoS One. 2012, 7: e42739. 10.1371/journal.pone.0042739.
Reyal F, Guyader C, Decraene C, Lucchesi C, Auger N, Assayag F, De Plater L, Gentien D, Poupon MF, Cottu P, De Cremoux P, Gestraud P, Vincent-Salomon A, Fontaine JJ, Roman-Roman S, Delattre O, Decaudin D, Marangoni E: Molecular profiling of patient-derived breast cancer xenografts. Breast Cancer Res. 2012, 14: R11. 10.1186/bcr3095.
Zhao X, Liu Z, Yu L, Zhang Y, Baxter P, Voicu H, Gurusiddappa S, Luan J, Su JM, Leung HC, Li XN: Global gene expression profiling confirms the molecular fidelity of primary tumor-based orthotopic xenograft mouse models of medulloblastoma. Neuro Oncol. 2012, 14: 574-583. 10.1093/neuonc/nos061.
Ng S, Collisson EA, Sokolov A, Goldstein T, Gonzalez-Perez A, Lopez-Bigas N, Benz C, Haussler D, Stuart JM: PARADIGM-SHIFT predicts the function of mutations in multiple cancers using pathway impact analysis. Bioinformatics. 2012, 28: i640-i646. 10.1093/bioinformatics/bts402.
Vaske CJ, Benz SC, Sanborn JZ, Earl D, Szeto C, Zhu J, Haussler D, Stuart JM: Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM. Bioinformatics. 2010, 26: i237-i245. 10.1093/bioinformatics/btq182.
Tabchy A, Hennessy BT, Gonzalez-Angulo AM, Bernstam FM, Lu Y, Mills GB: Quantitative proteomic analysis in breast cancer. Drugs Today (Barc). 2011, 47: 169-182. 10.1358/dot.2011.47.2.1576695.
Ellis MJ, Gillette M, Carr SA, Paulovich AG, Smith RD, Rodland KK, Townsend RR, Kinsinger C, Mesri M, Rodriguez H, Liebler DC: Connecting genomic alterations to cancer biology with proteomics: the NCI Clinical Proteomic Tumor Analysis Consortium. Cancer Discov. 2013, 3: 1108-1112. 10.1158/2159-8290.CD-13-0219.
RG was supported by a grant from the AVON Foundation for Women. WAW was supported by Washington University School of Medicine, Graduate School of Arts & Sciences/CGFP Fund 94028C. JL was supported by a Siteman Cancer Center Support Grant (P30CA091842). MJE was supported by Susan G. Komen for the Cure (PG12220321), the Clinical Proteomic Tumor Analysis Consortium (U24 CA160035 and R01 CA095614), the AVON Foundation for Women, a Siteman Cancer Center Support Grant (P30CA091842), and the Breast Cancer Research Foundation.
MJE declares patent and royalty income from BioClassifier LLC (St Louis, MO, USA) through a license on the PAM50 patents to Nanostring (Seattle, WA, USA) for the intrinsic subtype test ‘Prosigna’. The other authors declare that they have no competing interests.
Cite this article
Goncalves, R., Warner, W.A., Luo, J. et al. New concepts in breast cancer genomics and genetics. Breast Cancer Res 16, 460 (2014). https://doi.org/10.1186/s13058-014-0460-4
- Breast Cancer
- Somatic Mutation
- Reverse Phase Protein Array
- Endocrine Therapy Resistance
- ESR1 Mutation