
The effect of the stromal component of breast tumours on prediction of clinical outcome using gene expression microarray analysis

Introduction



The aim of this study was to examine the effect of the cellular composition of biopsies on the error rates of multigene predictors of response of breast tumours to neoadjuvant adriamycin and cyclophosphamide (AC) chemotherapy.

Materials and methods

Core biopsies were taken from primary breast tumours of 43 patients prior to AC, and subsequent clinical response was recorded. Post-chemotherapy (day 21) samples were available for 16 of these patients. Frozen sections of each core were used to estimate the proportion of invasive cancer and other tissue components at three levels. Transcriptional profiling was performed using a cDNA array containing 4,600 elements.

Results


Twenty-three (53%) patients demonstrated a 'good' and 20 (47%) a 'poor' clinical response. The percentage invasive tumour in core biopsies collected from these patients varied markedly. Despite this, agglomerative clustering of sample expression profiles showed that almost all biopsies from the same tumour aggregated as nearest neighbours. SAM (significance analysis of microarrays) regression analysis identified 144 genes which distinguished high- and low-percentage invasive tumour biopsies at a false discovery rate of not more than 5%. The misclassification error of prediction of clinical response using microarray data from pre-treatment biopsies (on leave-one-out cross-validation) was 28%. When prediction was performed on subsets of samples which were more homogeneous in their proportions of malignant and stromal cells, the misclassification error was considerably lower (8%–13%, p < 0.05 on permutation).

Conclusion


The non-tumour content of breast cancer samples has a significant effect on gene expression profiles. Consideration of this factor improves accuracy of response prediction by expression array profiling. Future gene expression array prediction studies should be planned taking this into account.

Introduction


Breast tumours are routinely subclassified according to microscopic morphology, immunohistochemical staining, and stage. On the basis of this clinical information and patient age, an estimate of prognosis may be derived [1, 2]. Most clinicians make recommendations regarding the need for adjuvant chemotherapy on the basis of this estimate.

However, breast cancer is a heterogeneous disease, and differences in prognosis and response in distinct molecular subgroups need to be taken into account. Improvement in the accuracy of prediction of prognosis without systemic treatment or with endocrine treatment alone would allow avoidance of non-beneficial chemotherapy in a significant proportion of women [3]. Additionally, experience with neoadjuvant chemotherapy has demonstrated resistance in a significant proportion of primary breast tumours [4]. These patients derive no downstaging benefits from neoadjuvant chemotherapy. Furthermore, chemosensitivity in the neoadjuvant setting is associated with superior long-term survival (relative to chemoresistance) [5] and therefore may represent a marker of survival benefit from chemotherapy. Identification of a chemoresistant profile would allow further tailoring of treatment by enabling selection of tumours unlikely to respond and therefore unlikely to derive a survival benefit.

Several studies have demonstrated that gene expression microarray profiling may be useful in improving prediction of prognosis [6–9] and treatment response [10–16]. These studies employed non-dissected surgical [7–9], core-cut biopsy [12], or FNA (fine needle aspiration) samples [10]. However, breast tumours are non-homogeneous in nature. They include inflammatory and vascular elements but most significantly (by proportion) connective tissue components [17]. The proportions of these components vary according to tumour type and sample type and also across a single tumour [17]. In studies involving surgical samples, those used for profiling can be selected as those with the highest proportional malignant cell content. In studies involving biopsies, this is not possible and the researcher is required to set an arbitrary minimum percentage tumour limit.

The impact on expression profile of variation in the proportion of tumour cells and the nature of the non-tumour components have been largely unexplored. In this paper, we examine the effect of percentage tumour content on expression profile within a study designed to derive an expression profile predictive of response to adriamycin and cyclophosphamide (AC) neoadjuvant chemotherapy. We also consider methods for improvement of molecular profile-based prediction of response to primary chemotherapy by classification of samples according to cellular makeup or by the incorporation of sample tumour content information into the predictor.

Materials and methods

Patients and samples

Patients were recruited and treated at the Royal Marsden Hospital (RMH), London, UK. Eligible patients were those undergoing neoadjuvant AC chemotherapy treatment at doses of 60 and 600 mg/m², respectively, three times a week, for a clinically measurable breast tumour. The study was approved by the RMH Clinical Research and Ethics Committees (study number 1947), and written consent was obtained in all cases. Patients had been allocated neoadjuvant treatment for one of several standard indications, including locally advanced or inflammatory breast cancer, high tumour-to-breast size ratio, and tumours located close to the nipple.

Diagnosis was confirmed histologically by core-cut biopsy. All patients on hormone replacement therapy at diagnosis were advised to discontinue this treatment. Patients who demonstrated at least a partial clinical response received six cycles of treatment prior to local treatment. Patients in whom there was no or only marginal response after three or four cycles proceeded directly to local treatment or were commenced on alternative systemic treatment (docetaxel).

Clinical size of tumour (largest diameter and a diameter perpendicular to this) was recorded prior to commencement and at completion of treatment. Clinical response was categorised as follows: no palpable abnormality after treatment, complete clinical response (cCR); more than 50% reduction in the product of the bidimensional measurements, partial response (PR); less than 50% reduction in the product of bidimensional measurements was recorded as no change (NC); and residual ill-defined thickening after a good response, minimal residual disease (MRD). Those cases in which there was no residual invasive carcinoma at surgery were classified as a complete pathological response (pCR). Good responders were defined as pCR, cCR, or MRD; poor responders were defined as PR or NC. These categories were chosen on the basis of our previous study, which showed that patients with 'good' response had superior overall survival relative to those with 'poor' response [18]. A proportion of patients undergoing a complete clinical and radiological (on ultrasound) response received radiation only as local treatment. Therefore, some of the cCRs may represent undocumented pCRs.
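The response categories above can be expressed as a small decision rule. The following Python sketch is illustrative only and is not the procedure used in the study: the function and parameter names are hypothetical, the order in which the criteria are checked is an assumption, and a reduction of exactly 50% (which the definitions leave open) is assigned to NC here.

```python
GOOD_CATEGORIES = {"pCR", "cCR", "MRD"}
POOR_CATEGORIES = {"PR", "NC"}


def classify_response(pre, post, palpable=True, ill_defined_thickening=False,
                      residual_invasive_at_surgery=None):
    """Assign a response category from bidimensional measurements.

    pre, post: (largest diameter, perpendicular diameter) before and after
    treatment. residual_invasive_at_surgery is None when no surgical
    pathology is available (for example, radiation-only local treatment).
    """
    # Assumption: pathological findings take precedence when available.
    if residual_invasive_at_surgery is False:
        return "pCR"
    if not palpable:
        return "cCR"
    if ill_defined_thickening:
        return "MRD"
    pre_product = pre[0] * pre[1]
    post_product = post[0] * post[1]
    reduction = 1.0 - post_product / pre_product
    # Assumption: exactly 50% reduction is treated as NC.
    return "PR" if reduction > 0.5 else "NC"


def response_group(category):
    """Collapse a response category into the 'good'/'poor' grouping."""
    if category in GOOD_CATEGORIES:
        return "good"
    if category in POOR_CATEGORIES:
        return "poor"
    raise ValueError("unknown response category: %r" % (category,))
```

For example, a tumour shrinking from 4 x 3 cm to 2 x 2 cm has a 67% reduction in the bidimensional product and is categorised as PR, which falls in the 'poor' group under the definitions above.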

Research 14-gauge core biopsies were collected prior to commencing treatment and snap-frozen in liquid nitrogen. When consented to, a repeat sample was taken at 21 days after the first cycle of chemotherapy. All samples were thereafter coded using a study number as an identifier. Frozen cores were embedded in OCT (optimum cutting temperature embedding compound) and sectioned at -20°C in a cryostat. Sections (5 μm in thickness) were taken for haematoxylin and eosin staining to assess histological character superficially from the core as soon as 'full-face' was reached. The percentage of cells comprising invasive malignant disease and non-malignant components (that is, in situ disease, inflammatory infiltrate, non-malignant ductal/lobular structures, and fibroblastic involvement) were recorded by consensus between two breast pathologists. For patients in whom multiple biopsies were available, the biopsy with the highest invasive content was used for microarray analysis. Biopsies with less than 20% invasive cancer content were excluded from the study.

RNA extraction and amplification

Cores were extracted from OCT as described by Ellis and colleagues [19] and pulverised using a pestle and mortar on a bed of dry ice and subsequently in 1 ml of Trizol reagent (Invitrogen, Carlsbad, CA, USA) with a 'Polytron' homogenizer. Standard 'Trizol' RNA extraction was carried out without use of a carrier according to the manufacturer's instructions. Samples not giving distinct 18S and 28S peaks on an Agilent Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA) trace were excluded from the study. Multiple cores from the same patient were handled separately for RNA extraction. A single round of T7 linear RNA amplification was carried out using the RiboAmp kit (Arcturus, Sunnyvale, CA, USA) with a starting amount of 1 μg when available or 50% of the available RNA.

Reference RNA was generated from a pool of RNAs extracted from 20 independent breast cancer surgical samples.

cDNA array hybridisation

Microarray analyses used in-house (Breakthrough Breast Cancer Research Centre, London, UK) human arrays spotted with DNA derived from 4,600 IMAGE cDNA clones in duplicate. This set of genes represents a subset of the 5,808 Cancer Research UK (London, UK) gene set that was designed to include a high proportion of genes documented as being involved in carcinogenesis or tumour biology. To improve 'coverage' of genes involved in breast cancer, a list of discriminatory genes cited in important microarray studies on clinical breast cancer samples at time of production was compiled [7, 20–22], and the array was supplemented with these. One half to two micrograms of sample RNA and a matched amount of reference RNA were labeled using the Powerscript Labeling kit (Clontech, Mountain View, CA, USA) in combination with Amersham Cy dyes (GE Healthcare, Little Chalfont, Buckinghamshire, UK). A single dye swap experiment was performed for each clinical sample. Slides were scanned using a GenePix 4000B (Axon Instruments Inc., Union City, CA, USA) scanner and GenePix version 4.0 software.

Data analysis

Most of the data analyses were carried out with the S-plus statistical software package (Insightful, Seattle, USA) and purpose-written scripts (T Dexter, Breakthrough, UK). Raw expression values were transformed to Log2 ratios (sample/reference). The loess function [23] was used to remove biases due to the spot position and spot intensity. Flagged spots were treated as missing values. Log ratio values from duplicate spots and hybridisations were averaged. Genes with consistently low intensity and those that exhibited little variation across samples were removed from the analysis. After the above pre-processing, 1,286 genes remained for the prediction analysis. Samples were clustered both by complete linkage and flexible beta (beta = -0.5) agglomerative algorithms with (1 - correlation) as a distance measure [24, 25]. The correlations were estimated using Spearman's rank method.
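As an illustration of the distance measure described above, the following Python sketch computes a log2 (sample/reference) ratio and the (1 - correlation) distance using Spearman's rank correlation. It is a minimal sketch using only the standard library, not the S-plus scripts used in the study; the function names are hypothetical, ties are assumed to receive average ranks, and the loess normalisation and gene filtering steps are not reproduced.

```python
import math


def log2_ratio(sample, reference):
    """Log2 ratio of sample to reference intensity (both must be > 0)."""
    return math.log2(sample / reference)


def ranks(values):
    """1-based ranks; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation; undefined (division by zero) for constant input."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def spearman_distance(profile_a, profile_b):
    """(1 - Spearman correlation) between two expression profiles."""
    return 1.0 - pearson(ranks(profile_a), ranks(profile_b))
```

Under this measure, two profiles with identical gene rankings are at distance 0 and two with exactly reversed rankings are at distance 2; a matrix of such pairwise distances is the kind of input an agglomerative clustering routine would take.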

The nearest neighbour class prediction algorithm (Euclidean distance) was used for all classifications because of its simplicity and good performance on microarray data as reported by Dudoit and Fridlyand [26]. We elected to use seven nearest neighbours throughout to give more stable error estimates and greater robustness than would result with smaller neighbourhoods. The weighted Kolmogorov-Smirnov statistic was used to rank genes for discriminatory information [27]. Each predictor was built starting with the highest ranked two genes from the training set, and then genes were added to the predictor in decreasing rank order until the error rates no longer decreased.

Owing to the limited number of samples, the misclassification error was estimated by leave-one-out cross-validation (LOOCV). In this approach, the class (that is, response) of each sample was predicted in turn, using the other samples as the training set. To avoid selection bias, the genes that were used as predictors were re-selected for each of these leave-one-out classifications [28]. To estimate the probability of the misclassification error arising by chance, a permutation p value was determined as suggested by Radmacher and colleagues [28]. In this procedure, the LOOCV estimate was determined 1,000 times for permuted class labels; the fraction of these that gave an equal or lower error estimate than that with true labels is taken as the p value. We refer to the latter as a 'label permutation'.
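The cross-validation and 'label permutation' procedure described above can be sketched as follows. This is a minimal Python illustration under stated assumptions, not the study's implementation: all function names are hypothetical, classes are coded as 1 (good) and 0 (poor), each class is assumed to keep at least one sample in every training set, and genes are ranked by the absolute difference in class means as a simple stand-in for the weighted Kolmogorov-Smirnov statistic used in the study. The key point it shows is that genes are re-selected inside each leave-one-out fold.

```python
import math
import random


def rank_genes(X, y):
    """Gene indices ordered by absolute difference in class means.

    Stand-in ranking only; the study used a weighted Kolmogorov-Smirnov
    statistic.
    """
    scores = []
    for g in range(len(X[0])):
        good = [row[g] for row, label in zip(X, y) if label == 1]
        poor = [row[g] for row, label in zip(X, y) if label == 0]
        scores.append(abs(sum(good) / len(good) - sum(poor) / len(poor)))
    return sorted(range(len(scores)), key=lambda g: -scores[g])


def knn_predict(train_X, train_y, x, genes, k=7):
    """Majority vote among the k nearest training samples (Euclidean)."""
    dists = sorted(
        (math.sqrt(sum((row[g] - x[g]) ** 2 for g in genes)), label)
        for row, label in zip(train_X, train_y)
    )
    nearest = dists[:k]
    votes = sum(label for _, label in nearest)
    return 1 if 2 * votes > len(nearest) else 0


def loocv_error(X, y, n_genes=3, k=7):
    """Leave-one-out error with genes re-selected in every fold."""
    errors = 0
    for i in range(len(X)):
        train_X = X[:i] + X[i + 1:]
        train_y = y[:i] + y[i + 1:]
        genes = rank_genes(train_X, train_y)[:n_genes]
        if knn_predict(train_X, train_y, X[i], genes, k) != y[i]:
            errors += 1
    return errors / len(X)


def label_permutation_p(X, y, n_perm=1000, seed=0, **kwargs):
    """Fraction of label permutations with LOOCV error <= the observed one."""
    rng = random.Random(seed)
    observed = loocv_error(X, y, **kwargs)
    hits = 0
    for _ in range(n_perm):
        shuffled = list(y)
        rng.shuffle(shuffled)
        if loocv_error(X, shuffled, **kwargs) <= observed:
            hits += 1
    return hits / n_perm
```

Selecting genes once on the full data set and then cross-validating would let the held-out sample influence the gene list, which is the selection bias that per-fold re-selection avoids.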

To assess the significance of lower misclassification error estimates for selected subsets of the samples, we needed to control for the smaller sample sizes. To this end, we defined a 'subset permutation' p value as the fraction of random subsets (1,000 matched for size and class proportions) of the full set of samples that gave rise to LOOCV error estimates equal to or lower than the selected subset. The correct class labels were used for subset permutation. The minimum error for each permutation across the top 2 to 10 ranked genes was used to calculate the p values for both of the above types of permutation to avoid bias. Although this is an arbitrary range, the minimum error rates in all the permutations had increased by a cutoff of 10. Significance analysis of microarrays (SAM) analyses were performed using standard software [29]. Pre-treatment samples only were used for prediction analyses.
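The 'subset permutation' idea can be sketched in the same spirit. In the Python illustration below, the names are hypothetical and `error_fn` is an assumed callable that returns an error estimate for a list of sample indices (in the study this would be the minimum LOOCV error across the top 2 to 10 ranked genes). Random subsets are matched to the selected subset for size and class proportions, and the true class labels are kept.

```python
import random


def matched_random_subset(labels, subset, rng):
    """Random sample indices with the same per-class counts as `subset`."""
    drawn = []
    for cls in sorted(set(labels)):
        needed = sum(1 for i in subset if labels[i] == cls)
        pool = [i for i, label in enumerate(labels) if label == cls]
        drawn.extend(rng.sample(pool, needed))
    return sorted(drawn)


def subset_permutation_p(labels, subset, error_fn, n_perm=1000, seed=0):
    """Fraction of matched random subsets with error <= the selected subset."""
    rng = random.Random(seed)
    observed = error_fn(subset)
    hits = 0
    for _ in range(n_perm):
        candidate = matched_random_subset(labels, subset, rng)
        if error_fn(candidate) <= observed:
            hits += 1
    return hits / n_perm
```

Because every random subset has the same size and class balance as the selected one, a small p value here cannot be attributed simply to the subset being smaller than the full sample set.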


Results

Patient and tumour characteristics

RNA of adequate amount and quality was available from 43 tumours before treatment. Of these, 23 (53%) demonstrated a 'good' and 20 (47%) a 'poor' clinical response. For 16 of these tumours, a paired 21-day sample was also available. These 'on treatment' biopsies are included in the analysis of reproducibility shown below to increase the number of paired samples but are excluded from the analysis of response prediction. The good responses comprised 16 (37%) that underwent a cCR, three (7%) that exhibited ill-defined thickening (MRD) at the end of treatment, and four (9%) that underwent a pCR (all cCR or MRD). The patient and tumour characteristics are shown in Table 1. The only feature differing between good and poor responders was tumour size, with pre-treatment size greater in poor compared with good responders (Mann-Whitney, p = 0.03). Pre-treatment size did not relate to expression profile (data not shown).

Table 1 Patient, tumour, and pre-treatment biopsy characteristics (43 samples used in prediction analysis)

Biopsy characteristics

A total of 147 cores were sectioned in the course of this study, including 104 that were sectioned at three levels (levels approximately 50 μm apart). Of these 104 cores, only 16 cores showed more than 10% absolute variation in invasive tumour content across all three sections and only one core showed more than 20% variation. Only four cores were found to have tumour at some levels and none at others; all four contained not more than 15% invasive tumour at the lowest level and were therefore excluded from the study. This suggests that the histological composition did not vary widely over the width of the core. The histological result from the lowest section for a given biopsy was taken as that most representative of the remaining biopsy used for profiling and hence the level on which percentage invasive tumour was assessed. The distribution of percentage invasive content of core (in cases of multiple cores, that core used for prediction analysis) did not relate to response as assessed by Wilcoxon rank sum test (p = 0.3) or t test (p = 0.27).

Percentage invasive content in core biopsies included in the prediction study varied from 20% to 95% (median 50%). For most core biopsies, the majority of the non-malignant tissue consisted of connective tissue. Inflammatory infiltrate was also scored on the lowest section as nil, mild, moderate, or severe. Of all 147 core biopsies, only six (4%) were scored as having a 'severe' inflammatory infiltrate at one or more levels.

Basic validation of array data

To assess consistency of the expression profile between repeated biopsies, the 43 pre-treatment and 16 paired post-treatment samples were clustered (Additional file 1). The post-treatment samples were included in this validation study only to enhance numbers and were not used in subsequent predictive analyses. All but one pair (b223A and B) of duplicate biopsies taken at the same time point relative to treatment (in total, five pre-treatment and five post-treatment pairs) clustered as nearest neighbours despite some variation in percentage tumour between the pairs (Additional file 3). Of 16 pre/post-treatment pairs, 14 clustered with samples from the same tumour (Additional file 1). To validate our class prediction methodology, a supervised analysis was undertaken to obtain a gene list for prediction of oestrogen receptor (ER) status in pre-treatment samples (ER-α was excluded from the analysis). ER status was correctly assigned in 40 of 42 cases on LOOCV. Discriminatory genes are listed in the supplementary information (Additional file 4).

Effect of percentage tumour on expression profile

Before performing prediction analysis, an exploratory analysis was undertaken to assess the impact of variation in the histological content of biopsies on the expression profile. A SAM regression analysis was performed to establish whether it was possible to identify genes that correlated with the proportion of tumour cells in the 43 pre-treatment core biopsies. One hundred forty-four genes were significant at a false discovery rate of not more than 5%. Table 2 shows the most correlated genes from this analysis. Positive values for 'score' (bold) indicate positive correlation with high percentage tumours, and negative values (underlined) indicate negative correlation with proportion of tumour cells in the samples. More genes correlated positively with the stromal content (n = 128) than with the tumour content (n = 16), possibly reflecting the greater molecular heterogeneity of tumour types across the samples than that of their associated stromas. This is also reflected in the higher false discovery rates for 'tumour-associated genes'.

Table 2 SAM (significance analysis of microarrays) analysis of genes correlated to percentage tumour content.

Response prediction using pre-treatment biopsies

Prediction analysis was initially undertaken on the full set of 43 biopsies. The optimum misclassification error estimate (LOOCV) for the whole sample set was 28% using a three-gene predictor (Figure 1). To explore the effect of biopsy tumour content (and its associated influence on the expression profile) on error of response prediction, we selected three overlapping subsets of tumour samples which were more homogeneous in terms of percentage tumour content: (a) ≥ 50% (50%–95%) invasive tumour (25 samples), (b) ≤ 50% (20%–50%) invasive tumour (24 samples), and (c) 35%–60% invasive tumour (24 samples).

Figure 1

Leave-one-out cross-validation (LOOCV) error rates for response prediction. Variation in the LOOCV error estimate with the number of genes used in the predictor. Misclassification rates are plotted for the whole data set (n = 43) and subsets according to percentage tumour content.

The subset consisting of 35%–60% represented a group centered around the median sample in terms of percentage tumour content and of a size similar to or the same as the other two groups.

The minimum error of classification for each of the subsets (8%–13%) was lower than that for the superset of all samples (28%) (Figure 1), suggesting that homogeneity of tumour content rather than tumour content per se might be an important factor for response prediction. However, the comparison of the subset error estimates with that for all samples is not controlled for the different sample sizes involved. To address this, we determined a 'subset permutation' p value for each of the subsets (see Materials and methods). We also estimated the probability that the subset errors arose by chance (a 'label permutation' as discussed in Materials and methods). Error estimates and corresponding p values for the 'subset permutation' and the 'label permutation' support the hypothesis that homogeneity of biopsy tumour content improves response prediction with the nearest neighbour algorithm (Table 3). The identity of the genes used in the prediction for each of these subgroups is presented in Table 4. The mean differential expression between good and poor responders for these genes is given in the supplementary information (Additional file 5).

Table 3 Misclassification error estimates (leave-one-out cross-validation) for response prediction.
Table 4 Response prediction gene lists.

We found, in common with other studies [20, 22, 30–32], that ER-positive and ER-negative tumours were very distinct in molecular terms (Additional file 4), which may confound response prediction. Therefore, response prediction was performed on ER-positive samples alone (16 good, 13 poor responders). Error rates, however, were high (31% with 8 genes, 31% with 25 genes). Within this subset, the effect of heterogeneity of tumour content on prediction appeared to be more pronounced (error rates: 31% (all samples, n = 29) versus 4.5% (≤ 60% tumour, n = 22)).

Incorporation of histological information into the predictor

To explore further the effect of biopsy composition on prediction error, we attempted to create a single predictor for all samples by adding information about tumour content to the predictor as an extra dimension or 'histology gene'. The impact of variation in biopsy tumour content on prediction error was thought to be due partly to the fact that genes that discriminate between good and poor responders in high-tumour-content biopsies are often poor discriminators in low-tumour-content biopsies (and vice versa) as is evident from Table 4. To overcome this complication, 10 genes were selected that showed 100% support in the subset permutations (Table 4). By adding the proportion of tumour content to these 10 genes as an 11th dimension, we aimed to move the samples apart in 'prediction space' such that differences in tumour content also contributed to the distances between samples. Thus, samples with similar expression profiles become nearest neighbours of similar proportion of tumour content as well. In the nearest neighbour predictor, like is matched with like in terms of proportion of tumour content as well as expression profile.

It was necessary to rescale the histology gene's contribution to the distance between samples because the standard deviation of most genes lay between 0.5 and 1.5 (log2 ratio scale). Therefore, the percentage tumour figures were standardised by subtracting the average and dividing by the standard deviation. The 'histology gene' or 11th dimension was then rescaled from 0 to 4 standard deviations, and the effect on cross-validation error determined (Figure 2). After an initial drop in error, as the scale approached 1.0, the error rose beyond the initial error as the influence of the histology gene became too dominant. To test whether the effect of the histology gene was specific to the particular set of 10 genes used above, we tested the effect of adding the histology gene to 1,000 predictors, each containing a random permutation of between 8 and 16 genes drawn from the top 20 genes ranked by a combined score (Additional file 6 a-c). In 91% of these, the addition of the histology gene resulted in a lower error; in 49%, the error drop was greater than 10%.
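The construction of the 'histology gene' can be illustrated with a short Python sketch. It is a sketch under stated assumptions rather than the study's code: the function names are hypothetical, the sample standard deviation is assumed for standardisation (the text does not say which form was used), and `scale` is the multiplier varied between 0 and 4.

```python
import math
import statistics


def standardise(values):
    """Subtract the mean and divide by the (sample) standard deviation."""
    mean = statistics.mean(values)
    sd = statistics.stdev(values)
    return [(v - mean) / sd for v in values]


def add_histology_gene(expression, percent_tumour, scale):
    """Append scaled, standardised tumour content as an extra dimension.

    expression: one list of log2 ratios per sample (the predictor genes).
    percent_tumour: percentage invasive tumour for the same samples.
    """
    z_scores = standardise(percent_tumour)
    return [row + [scale * z] for row, z in zip(expression, z_scores)]


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
```

With `scale` set to 0 the extra dimension contributes nothing to the distance between samples, and as `scale` grows the nearest neighbours are increasingly determined by tumour content rather than by expression, which is consistent with the error rising again at larger scales.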

Figure 2

Variation in error rate according to scale of 'histology gene'. An 11th ('histology') gene was added to a 10-gene predictor as an extra dimension. This gene was rescaled from 0 to 4 standard deviations, and the effect on cross-validation error calculated.

Discussion


In this study, we set out to explore the possibility that the transcriptional profile(s) of breast tumours relates to sensitivity to neoadjuvant chemotherapy. Such a profile might be useful in understanding the molecular mechanisms determining response or resistance and would provide the basis for a predictor of chemotherapy response.

However, the biopsy material used in this study had a complex cellular composition. In planning this study, consideration had been given to two approaches for ensuring that the percentage invasive tumour within biopsies was 'sufficient' and homogeneous. Cell selection, either by gross dissection or laser capture microdissection [33], allows enrichment of the invasive tumour component. We chose, in common with others [6, 7, 11, 13, 14, 31], to set a minimum threshold for 'percentage malignant cells' within a given biopsy. The median percentage invasive tumour for the core biopsy samples was 50%, and the range of figures that were included in the study was 20% to 95%. In many biopsies, the dominant non-tumour component, connective tissue, was admixed with epithelial components, making enrichment of the malignant compartment difficult using gross dissection alone.

We hypothesised that this 'contamination' of biopsies by significant and variable amounts of non-tumour components might confound tumour classification. Indeed, at least 10% of the genes (144 genes) remaining after preprocessing of the data were found to correlate with cellular composition using a SAM regression analysis. Genes relatively overexpressed in low-percentage tumours included established stromal-related genes (Table 2) (for example, Collagen (type XV, alpha-1) and Cadherin 5 (type 2, VE-cadherin)). The exact source (histological compartment) of production of a given RNA could be further confirmed by FISH (fluorescent in situ hybridisation) or by comparing gene expression in microdissected stromal and tumour compartments.

However, paired biopsies, despite differences in proportional non-tumour content (Additional file 3), co-aggregated on cluster analysis (Additional file 1), suggesting a dominant 'tumour profile' despite variation in the proportion of stroma. Furthermore, prediction of ER status was not confounded by marked variation in percentage tumour content. This is in keeping with a number of studies that have shown that strong differential expression of a relatively high proportion of genes correlates with ER status [6, 7, 20, 30–32] which would be expected to result in domination of the expression profile despite variation in the contribution by the non-tumour component. Furthermore, it has not been demonstrated that the ER expression signature is derived entirely from tumour cells. However, variation in the proportion of stroma may be sufficient to mask more subtle aspects of the tumour expression signature.

We found that the error rate for response prediction for the whole sample set was poor (28%) but was improved by increasing the homogeneity of cellular composition by subsetting on the basis of histological composition (error rate, 8%–13%) (Table 3). The misclassification rates for the subsets were determined with LOOCV and therefore represent high variance estimates, and the sample numbers are modest. We have used permutation analysis as support for the error estimates; ultimately, however, validation with an independent data set would address these issues.

We did not find any evidence to suggest that highly stromal biopsies result in higher prediction error. However, stromal-tumour content appears to affect the selection of genes that are used in the predictor for each histological subset, resulting in different but overlapping lists of predictive genes (Table 4). Some genes discriminated response in the 'high percentage' and not the 'low percentage' samples (for example, PBEF1). This may simply be a dose effect whereby discriminatory tumour-associated genes are no longer differential in 'low percentage samples' due to low signal. Alternatively, discriminatory genes that are expressed in both tumour and non-tumour compartments may lose discriminatory potential in tumours with a significant stromal contribution to the molecular signature. Genes that are discriminatory in low but not high 'percentage samples' (for example, SOD1) could be expressed only at the tumour-stroma interface in stromal and/or tumour cells. Certainly, breast tumour-induced changes in stromal expression have been previously documented [34]. Furthermore, it has been shown that tumour gene upregulation can occur specifically at the tumour-stroma interface [35, 36]. Finally, it is also likely that the volume and configuration of stromal tissue within a tumour are a reflection of the tumour molecular subtype.

Thus, this analysis resulted in three distinct response-prediction gene lists that partially overlapped. PBEF, reported in one study to act as an inhibitor of apoptosis in neutrophils [37], appeared in the predictor for response for 'high percentage' tumours and has also been reported as a component of a three-gene predictor of AC sensitivity by another group [13]. In both studies, expression of this gene was higher in the resistant tumours than in sensitive tumours. SOD1 was found to be relatively overexpressed in resistant tumours ('low' and 'mid percentage') in keeping with a proposed role in the neutralisation of free radicals, one means by which anthracyclines are thought to inflict cellular damage [38]. NDRG1, upregulated in poor responders ('high percentage'), is induced by hypoxia and may reduce p53 expression [39]. Hypoxia may be a marker of poor vascularisation of tumours and therefore possible limitation of drug access.

Although here we have defined histological subsets, we have shown that it might be feasible to build a predictor that extracts information about the cellular composition of the biopsy from expression data. We found that adding standardised percentage tumour values as a 'histology gene' (Figure 2) to multigene predictors (Table 4) reduced the error rate significantly, supporting the idea that it may be possible to devise a predictor that operates regardless of biopsy composition. This would avoid the need for microdissection to enrich for malignant cells. Furthermore, if stromal or stromal-interface gene expression does carry discriminatory information, then microdissection would result in loss of predictive information.

Several groups have attempted to define a multigene predictor of chemoresponsiveness [10–15], using either clinical or pathological definitions of response. The reported error rates in these studies, as assessed on an independent data set or by LOOCV, range from 5% to 30%. Comparison of gene lists and error rates across these studies and with ours is hampered by the fact that the treatment regimens, response definitions, and microarray platforms differ and that the histological composition of the samples in most studies was not presented.

A number of samples used in our study have also been profiled using an Affymetrix platform (Santa Clara, CA, USA) [40]. In this independent analysis, 12 samples from tumours displaying either cCR or pCR and six samples from tumours with residual tumour greater than 70% were used to define a classifier (samples with less than 40% tumour were excluded from the study). The error rate of prediction on LOOCV was 33% (p = 0.4 on permutation).

Therefore, to date, the reported error rates associated with expression array prediction of response, certainly for anthracycline combination chemotherapy, remain too high for clinical utility. This may be due in part to the fact that breast cancer is an extremely complex and heterogeneous disease that operates multiple mechanisms of chemotherapeutic response and resistance which are not consistent across different subtypes, particularly in the case of combination regimens incorporating agents that act by multiple mechanisms. Such complexities mandate that study designs be optimised in terms of biopsy quality and sample size.

Conclusion


The response prediction using all pre-treatment biopsies was modestly effective. However, the percentage of invasive cancer cells within a sample influenced the expression profile. Response prediction on subsets of samples more homogeneous in terms of cellular composition was associated with lower error rates. We believe that it is essential that consideration be given to biopsy composition in planning future studies of this type using methods such as those discussed above. Larger studies are required to establish whether optimal accuracy of response prediction may be achieved by the development of profiles specific to immunohistochemical breast cancer subtypes.



AC = adriamycin and cyclophosphamide
cCR = complete clinical response
ER = oestrogen receptor
LOOCV = leave-one-out cross-validation
MRD = minimal residual disease
NC = no change
OCT = optimum cutting temperature embedding compound
pCR = complete pathological response
PR = partial response
RMH = Royal Marsden Hospital
SAM = significance analysis of microarrays


  1. Galea MH, Blamey RW, Elston CE, Ellis IO: The Nottingham Prognostic Index in primary breast cancer. Breast Cancer Res Treat. 1992, 22: 207-219. 10.1007/BF01840834.

  2. Olivotto IA, Bajdik CD, Ravdin PM, Speers CH, Coldman AJ, Norris BD, Davis GJ, Chia SK, Gelmon KA: Population-based validation of the prognostic model ADJUVANT! for early breast cancer. J Clin Oncol. 2005, 23: 2716-2725. 10.1200/JCO.2005.06.178.

  3. Cleator S, Ashworth A: Molecular profiling of breast cancer: clinical implications. Br J Cancer. 2004, 90: 1120-1124. 10.1038/sj.bjc.6601667.

  4. Fisher B, Brown A, Mamounas E, Wieand S, Robidoux A, Margolese RG, Cruz AB, Fisher ER, Wickerham DL, Wolmark N, et al: Effect of preoperative chemotherapy on local-regional disease in women with operable breast cancer: findings from National Surgical Adjuvant Breast and Bowel Project B-18. J Clin Oncol. 1997, 15: 2483-2493.

  5. Wolmark N, Wang J, Mamounas E, Bryant J, Fisher B: Preoperative chemotherapy in patients with operable breast cancer: nine-year results from National Surgical Adjuvant Breast and Bowel Project B-18. J Natl Cancer Inst Monogr. 2001, 96-102.

  6. Sotiriou C, Neo SY, McShane LM, Korn EL, Long PM, Jazaeri A, Martiat P, Fox SB, Harris AL, Liu ET: Breast cancer classification and prognosis based on gene expression profiles from a population-based study. Proc Natl Acad Sci USA. 2003, 100: 10393-10398. 10.1073/pnas.1732912100.

  7. van't Veer LJ, Dai H, van de Vijver MJ, He YD, Hart AA, Mao M, Peterse HL, van der Kooy K, Marton MJ, Witteveen AT, et al: Gene expression profiling predicts clinical outcome of breast cancer. Nature. 2002, 415: 530-536. 10.1038/415530a.

  8. van de Vijver MJ, He YD, van't Veer LJ, Dai H, Hart AA, Voskuil DW, Schreiber GJ, Peterse JL, Roberts C, Marton MJ, et al: A gene-expression signature as a predictor of survival in breast cancer. N Engl J Med. 2002, 347: 1999-2009. 10.1056/NEJMoa021967.

  9. Wang Y, Klijn JG, Zhang Y, Sieuwerts AM, Look MP, Yang F, Talantov D, Timmermans M, Meijer-van Gelder ME, Yu J, et al: Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer. Lancet. 2005, 365: 671-679.

  10. Ayers M, Symmans WF, Stec J, Damokosh AI, Clark E, Hess K, Lecocke M, Metivier J, Booser D, Ibrahim N, et al: Gene expression profiles predict complete pathologic response to neoadjuvant paclitaxel and fluorouracil, doxorubicin, and cyclophosphamide chemotherapy in breast cancer. J Clin Oncol. 2004, 22: 2284-2293. 10.1200/JCO.2004.05.166.

  11. Bertucci F, Finetti P, Rougemont J, Charafe-Jauffret E, Nasser V, Loriod B, Camerlo J, Tagett R, Tarpin C, Houvenaeghel G, et al: Gene expression profiling for molecular characterization of inflammatory breast cancer and prediction of response to chemotherapy. Cancer Res. 2004, 64: 8558-8565. 10.1158/0008-5472.CAN-04-2696.

  12. Chang JC, Wooten EC, Tsimelzon A, Hilsenbeck SG, Gutierrez MC, Elledge R, Mohsin S, Osborne CK, Chamness GC, Allred DC, O'Connell P: Gene expression profiling for the prediction of therapeutic response to docetaxel in patients with breast cancer. Lancet. 2003, 362: 362-369. 10.1016/S0140-6736(03)14023-8.

  13. Folgueira MA, Carraro DM, Brentani H, Patrao DF, Barbosa EM, Netto MM, Caldeira JR, Katayama ML, Soares FA, Oliveira CT, et al: Gene expression profile associated with response to doxorubicin-based therapy in breast cancer. Clin Cancer Res. 2005, 11: 7434-7443. 10.1158/1078-0432.CCR-04-0548.

  14. Hannemann J, Oosterkamp HM, Bosch CA, Velds A, Wessels LF, Loo C, Rutgers EJ, Rodenhuis S, van de Vijver MJ: Changes in gene expression associated with response to neoadjuvant chemotherapy in breast cancer. J Clin Oncol. 2005, 23: 3331-3342. 10.1200/JCO.2005.09.077.

  15. Iwao-Koizumi K, Matoba R, Ueno N, Kim SJ, Ando A, Miyoshi Y, Maeda E, Noguchi S, Kato K: Prediction of docetaxel response in human breast cancer by gene expression profiling. J Clin Oncol. 2005, 23: 422-431. 10.1200/JCO.2005.09.078.

  16. Modlich O, Prisack HB, Munnes M, Audretsch W, Bojar H: Predictors of primary breast cancers responsiveness to preoperative epirubicin/cyclophosphamide-based chemotherapy: translation of microarray data into clinically useful predictive signatures. J Transl Med. 2005, 3: 32. 10.1186/1479-5876-3-32.

  17. WHO: World Health Organization Classification of Tumours. Pathology and Genetics of Tumours of the Breast and Genital Organs. 2003, Lyon: IARC Press

  18. Cleator S, Makris A, Ashley SE, Lal R, Powles TJ: Good clinical response of breast cancers to neoadjuvant chemoendocrine treatment is associated with improved overall survival. Ann Oncol. 2005, 16: 267-272. 10.1093/annonc/mdi049.

  19. Ellis M, Davis N, Coop A, Liu M, Schumaker L, Lee RY, Srikanchana R, Russell CG, Singh B, Miller WR, et al: Development and validation of a method for using breast core needle biopsies for gene expression microarray analyses. Clin Cancer Res. 2002, 8: 1155-1166.

  20. Gruvberger S, Ringner M, Chen Y, Panavally S, Saal LH, Borg A, Ferno M, Peterson C, Meltzer PS: Estrogen receptor status in breast cancer is associated with remarkably distinct gene expression patterns. Cancer Res. 2001, 61: 5979-5984.

  21. Sotiriou C, Powles TJ, Dowsett M, Jazaeri AA, Feldman AL, Assersohn L, Gadisetti C, Libutti SK, Liu ET: Gene expression profiles derived from fine needle aspiration correlate with response to systemic chemotherapy in breast cancer. Breast Cancer Res. 2002, 4: R3. 10.1186/bcr433.

  22. West M, Blanchette C, Dressman H, Huang E, Ishida S, Spang R, Zuzan H, Olson JA, Marks JR, Nevins JR: Predicting the clinical status of human breast cancer by using gene expression profiles. Proc Natl Acad Sci USA. 2001, 98: 11462-11467. 10.1073/pnas.201162998.

  23. Dudoit S: Statistical methods for identifying differentially expressed genes in replicated cDNA microarray experiments. Statistica Sinica. 2002, 12: 111-139.

  24. Everitt BS: Cluster Analysis. 1993, London: Arnold

  25. Lance G, Williams W: A general theory of classificatory sorting strategies: Hierarchical systems. Computer Journal. 1967, 9: 373-380.

  26. Dudoit S, Fridlyand J: Classification in microarray experiments. Statistical Analysis of Gene Expression Data. Edited by: Speed T. 2003, Boca Raton: Chapman and Hall/CRC

  27. Wilcox R: Introduction to Robust Estimation and Hypothesis Testing. 1997, San Diego: Academic Press

  28. Radmacher MD, McShane LM, Simon R: A paradigm for class prediction using gene expression profiles. J Comput Biol. 2002, 9: 505-511. 10.1089/106652702760138592.

  29. Tusher VG, Tibshirani R, Chu G: Significance analysis of microarrays applied to the ionizing radiation response. Proc Natl Acad Sci USA. 2001, 98: 5116-5121. 10.1073/pnas.091062498.

  30. Sorlie T, Perou CM, Tibshirani R, Aas T, Geisler S, Johnsen H, Hastie T, Eisen MB, van de Rijn M, Jeffrey SS, et al: Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications. Proc Natl Acad Sci USA. 2001, 98: 10869-10874. 10.1073/pnas.191367098.

  31. Sorlie T, Tibshirani R, Parker J, Hastie T, Marron JS, Nobel A, Deng S, Johnsen H, Pesich R, Geisler S, et al: Repeated observation of breast tumor subtypes in independent gene expression data sets. Proc Natl Acad Sci USA. 2003, 100: 8418-8423. 10.1073/pnas.0932692100.

  32. Perou CM, Sorlie T, Eisen MB, van de Rijn M, Jeffrey SS, Rees CA, Pollack JR, Ross DT, Johnsen H, Akslen LA, et al: Molecular portraits of human breast tumours. Nature. 2000, 406: 747-752. 10.1038/35021093.

  33. Sgroi DC, Teng S, Robinson G, LeVangie R, Hudson JR, Elkahloun AG: In vivo gene expression profile analysis of human breast cancer progression. Cancer Res. 1999, 59: 5656-5661.

  34. Iacobuzio-Donahue CA, Argani P, Hempen PM, Jones J, Kern SE: The desmoplastic response to infiltrating breast carcinoma: gene expression at the site of primary invasion and implications for comparisons between tumor types. Cancer Res. 2002, 62: 5351-5357.

  35. Lindsay CK, Thorgeirsson UP, Tsuda H, Hirohashi S: Expression of tissue inhibitor of metalloproteinase-1 and type IV collagenase/gelatinase messenger RNAs in human breast cancer. Hum Pathol. 1997, 28: 359-366. 10.1016/S0046-8177(97)90136-2.

  36. Zhang F, Riley J, Gant TW: Intrinsic multidrug class 1 and 2 gene expression and localization in rat and human mammary tumors. Lab Invest. 1996, 75: 413-426.

  37. Jia SH, Li Y, Parodo J, Kapus A, Fan L, Rotstein OD, Marshall JC: Pre-B cell colony-enhancing factor inhibits neutrophil apoptosis in experimental inflammation and clinical sepsis. J Clin Invest. 2004, 113: 1318-1327. 10.1172/JCI200419930.

  38. Gewirtz DA: A critical evaluation of the mechanisms of action proposed for the antitumor effects of the anthracycline antibiotics adriamycin and daunorubicin. Biochem Pharmacol. 1999, 57: 727-741. 10.1016/S0006-2952(98)00307-4.

  39. Chen B, Nelson DM, Sadovsky Y: N-myc down-regulated gene 1 modulates the response of term human trophoblasts to hypoxic injury. J Biol Chem. 2006, 281: 2764-2772. 10.1074/jbc.M507330200.

  40. Cleator S, Tsimelzon A, Ashworth A, Dowsett M, Dexter T, Powles T, Hilsenbeck S, Wong H, Osborne CK, O'Connell P, Chang JC: Gene expression patterns for doxorubicin (Adriamycin) and cyclophosphamide (Cytoxan) (AC) response and resistance. Breast Cancer Res Treat. 2006, 95: 229-233. 10.1007/s10549-005-9009-7.


We thank Kerry Fenwick, Peter Kerr, Sunil Lakhani, and Marjan Iravani (all of whom were funded by the Breakthrough Breast Cancer Research Centre) and Margaret Hill and Geraldine Walsh (who were funded by RMH). The study and the manuscript preparation were funded by the Breakthrough Breast Cancer Research Centre. Neither of the above funding bodies contributed to the study design or collection, analysis, and interpretation of results. Likewise, there was no involvement by any funding body in the writing of the manuscript or the decision to submit it.

Author information


Corresponding author

Correspondence to Susan J Cleator.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

SC was involved in design of study, acquisition of data, analysis and interpretation of results, and drafting the manuscript and preparing the final version (Breakthrough Breast Cancer Research Centre). TP participated in data acquisition and the conception and design of the study (RMH). TD was involved in analysis and interpretation of results, and drafting the manuscript and preparing the final version (Breakthrough Breast Cancer Research Centre). LF contributed to data acquisition (Breakthrough Breast Cancer Research Centre). AM performed data acquisition and analysis (Breakthrough Breast Cancer Research Centre). IS participated in data acquisition (RMH). HV performed data analysis (Breakthrough Breast Cancer Research Centre). AA participated in the design of study, interpretation of results, and preparing the final version of the manuscript (Breakthrough Breast Cancer Research Centre). MD was involved in the conception and design of the study, interpretation of results, and preparing the final version of the manuscript (RMH). All authors read and approved the final manuscript.

Electronic supplementary material


Additional file 1: A PDF file containing a dendrogram of flexible beta clustering with Spearman rank correlation on all core biopsy samples taken from 43 patients. (PDF 195 KB)

Additional file 2: A Word document containing the legend for Additional file 1. (DOC 19 KB)

Additional file 3: A Word document containing a table that shows the characteristics of paired (repeat) core biopsy samples. (DOC 36 KB)

Additional file 4: A Word document containing a table that shows the oestrogen receptor (ER) predictive genes. (DOC 68 KB)

Additional file 5: A Word document containing a table that shows response prediction gene lists with expression ratios. (DOC 78 KB)

Additional file 6: A PDF file showing the permutation analysis used to assess the generalisability of the reduction in error rate observed on addition of the 'histology gene'. (PDF 189 KB)

Additional file 7: A Word document containing the legend for Additional file 6. (DOC 20 KB)

About this article

Cite this article

Cleator, S.J., Powles, T.J., Dexter, T. et al. The effect of the stromal component of breast tumours on prediction of clinical outcome using gene expression microarray analysis. Breast Cancer Res 8, R32 (2006).


  • Tumour Content
  • Core Biopsy
  • Minimal Residual Disease
  • Poor Responder
  • Response Prediction