- Research article
- Open Access
Quantitative single-cell analysis of immunofluorescence protein multiplex images illustrates biomarker spatial heterogeneity within breast cancer subtypes
Breast Cancer Research volume 23, Article number: 114 (2021)
The extent of cellular heterogeneity in breast cancer could have potential impact on diagnosis and long-term outcome. However, pathology evaluation is limited to biomarker immunohistochemical staining and morphology of the bulk cancer. Inter-cellular heterogeneity of biomarkers is not usually assessed. As an initial evaluation of the extent of breast cancer cellular heterogeneity, we conducted quantitative and spatial imaging of Estrogen Receptor (ER), Progesterone Receptor (PR), Epidermal Growth Factor Receptor-2 (HER2), Ki67, TP53, CDKN1A (P21/WAF1), CDKN2A (P16INK4A), CD8 and CD20 of a tissue microarray (TMA) representing subtypes defined by St. Gallen surrogate classification.
Quantitative, single cell-based imaging was conducted using an Immunofluorescence protein multiplexing platform (MxIF) to study protein co-expression signatures and their spatial localization patterns. The range of MxIF intensity values of each protein marker was compared to the respective IHC score for the TMA core. Extent of heterogeneity in spatial neighborhoods was analyzed using co-occurrence matrix and Diversity Index measures.
On the 101 cores from 59 cases studied, diverse expression levels and distributions were observed in MxIF measures of ER and PR among the hormonal receptor-positive tumor cores. As expected, Luminal A-like cancers exhibit higher proportions of cell groups that co-express ER and PR, while Luminal B-like (HER2-negative) cancers were composed of ER+, PR- groups. Proliferating cells defined by Ki67 positivity were mainly found in groups with PR-negative cells. Triple-Negative Breast Cancer (TNBC) exhibited the highest proliferative fraction and incidence of abnormal P53 and P16 expression. Among the tumors exhibiting P53 overexpression by immunohistochemistry, a group of TNBC was found with much higher MxIF-measured P53 signal intensity compared to HER2+, Luminal B-like and other TNBC cases. Densities of CD8 and CD20 cells were highest in HER2+ cancers. Spatial analysis demonstrated variability in heterogeneity in cellular neighborhoods in the cancer and the tumor microenvironment.
Protein marker multiplexing and quantitative image analysis demonstrated marked heterogeneity in protein co-expression signatures and cellular arrangement within each breast cancer subtype. These refined descriptors of biomarker expressions and spatial patterns could be valuable in the development of more informative tools to guide diagnosis and treatment.
Breast Cancer is a highly heterogeneous disease. Methods to classify different clinicopathologic types of breast cancers range from evaluation of the immunohistochemical (IHC) staining of hormonal and protein biomarkers (Estrogen receptor ER, Progesterone receptor PR and Epidermal Growth Factor Receptor HER2/neu), etc., to molecular profiling and intrinsic subtyping based on RNA expression [1,2,3] and integrated transcriptomics with copy number aberrations (IntClust) [4,5,6,7] in bulk populations of cancer cells. Molecularly defined intrinsic subtypes classify breast cancers into Luminal A, Luminal B, HER2-enriched and basal-like cancers.
While these classification systems are clinically valuable for prognosis and treatment planning, the definitions of ER+/HER2-cancers as specified at the 2013 meeting of the St. Gallen International Breast Cancer Conference are quite broad  and display a spectrum of “intermediate” luminals (those between Luminal A-like and Luminal B-like) . Discordance as high as 18% was found between clinical classification of hormonal receptor-positive and molecular subtyping of luminal cancers, with HR+ cancers often assigned to HER2-enriched and TNBC molecular subtypes, while clinically diagnosed HER2+ and TNBC included a fraction of Luminal cancers by PAM50 assessment [3, 10,11,12,13,14,15,16,17]. Detailed analysis of ER+ cancers with long-term outcome data has also identified integrative molecular subtypes that are associated with increased risk of long-term recurrence . These findings suggest that additional specificity within this spectrum may be helpful in improving the characterization of this subtype. Heterogeneity was also observed in HER2+ and TNBC with impact on the response to treatment [15, 19,20,21,22,23]. The presence of inherent intra- and inter-tumoral heterogeneity could reduce effectiveness of treatment, possibly resulting in residual viable cancer cells which could contribute to resistance to treatment or recurrence [24,25,26]. Defining the cellular and molecular heterogeneities at the single cell level with spatial context will provide a more comprehensive understanding of breast cancer biology. Importantly, correlations between the molecular definitions of subtype with assays used in the clinical setting, namely morphology and IHC scoring, will help to translate these enhanced measures into functional tools for the clinical laboratory.
The purpose of this study is to assess heterogeneity within each clinical subtype of breast cancer in the context of protein marker expression signatures of single cells. Quantitative evaluation of the physical arrangement of cell subgroups is also illustrated to demonstrate the application of protein multiplexing in the investigation of spatial heterogeneities in breast cancer. In order to obtain a quantitative, single cell-based analysis of heterogeneity within breast cancer subtypes, we studied the co-expression patterns of breast biomarkers in individual cells of the cancer epithelium, the distribution of each subgroup and their spatial arrangement. Sequential staining of the same tissue section with a series of fluorescent-labeled antibodies, combined with digital imaging and bleaching of the fluorophores at the end of each staining round, was conducted using an immunofluorescence protein multiplexing platform MxIF . In this study, we evaluated breast biomarkers ER, PR, HER2 and Ki67, together with TP53, CDKN2A (P16INK4a), cyclin-dependent kinase (CDK) inhibitor CDKN1A (P21/WAF1) in individual cancer cells of a breast tissue microarray (TMA) comprising invasive tumors. Subgroups of cells based on ER, PR, HER2 and Ki67 MxIF positivity were identified, and their abundances and spatial distributions were studied and compared between the pathologic IHC-based subtypes. Co-expression patterns of TP53 and P16 were also studied. In addition, CD8 and CD20 lymphocytes in the tumor microenvironment were quantified and their spatial localization was analyzed.
Methods and materials
Breast cancer tissue microarray (TMA)
The formalin-fixed, paraffin-embedded (FFPE) breast cancer TMA used in this work was obtained from Pantomics (CA, USA). It includes 75 cases of benign tissue and invasive breast cancer with two independent cores (1.1 mm diameter) taken from different regions of the same specimen (referred as paired cores in this study). Only cores with invasive cancers were analyzed.
Serial sections of the TMA were stained with protein marker in immunohistochemistry either using the Ventana autostainer (Roche), or manually, following manufacturers’ protocols. The following antibodies were used: Estrogen Receptor (SP1, Ventana Medical Systems), Progesterone Receptor (1E2, Ventana Medical Systems), HER2 (4B5, Ventana Medical Systems), Ki67 (MIB-1, Ventana Medical Systems), P53 (DO-7, Dako), P16 (CINtec, Ventana Medical Systems), P21 (1:200, Cell Signaling Cat#2947). ER, PR, HER2 and P21 IHC were scored by study pathologist (K.L.) with an evaluation of staining intensity and percentage of positive cells . HER2 scoring was conducted according to the recommendations from 2013 ASCO/CAP . HER2 2+ (equivocal) cases by IHC were further evaluated with fluorescent in situ hybridization (FISH) (S. N-M). Ki67 IHC was presented as percentages of positive cells per core (scored by K.L.). P53 and P16 IHC were reviewed by study pathologist (S. N-M.) and categorized as normal, null or over-expressed for P53 and normal or over-expressed for P16.
Classification of intrinsic subtypes based on IHC surrogate staining
Based on scoring of ER, PR, HER2 and Ki67 IHC of serial sections of the TMA, cores were classified as Luminal A-like [LumA], Luminal B-like (HER2-negative) [LumB], Luminal B-like (HER2-positive) [LumB, HER2 +], HER2+ (non-luminal) [HER2 +] and TNBC [TNBC] based on criteria described in Goldhirsch et. al. . Of the TMA cores studied, 24 (14 cases) were LumA cores, 36 (21 cases) were LumB cores. One case has one core classified as LumA, and the second core classified as LumB. Six cores (4 cases) were LumB, HER2+, 13 cores (9 cases) were from HER2+ cancers, and 22 cores (12 cases) were from TNBC.
Antibody selection, conjugation, calibration and validation of MxIF
Commercially available antibodies for each protein marker of interest were first tested using standard IHC with appropriate positive and negative control tissues. Antibodies that showed specific staining and intensity comparable to IHC were selected and conjugated to Cy3 or Cy5 fluorophores following established protocols . Each fluorophore-conjugated antibody was optimized and validated using control tissues. The staining patterns, specificities and intensities of conjugated antibodies were compared to the optimized IHC staining of the same tissue and evaluated by study pathologists (K.L, S.N-M). For breast biomarkers of ER, PR, HER2 and Ki67, we used clinically annotated breast cancer tissues for antibody validation. P53 staining was validated with a colon cancer case demonstrating P53 overexpression, while P21 and P16 were validated with skin and colon tissues and on high-grade serous ovarian cancer cases, respectively. A range of concentrations were tested for each fluorophore-conjugated antibody in the MxIF staining protocol and the parameters that resulted in staining intensities similar to those with IHC as performed in the clinical immunohistochemistry lab were selected.
Protein multiplexing was conducted on the Immunofluorescence Multiplexing MxIF platform (GE Research (GER), Niskayuna, NY, USA) . Pairs of fluorophore-conjugated (Cy3 and Cy5) antibodies were sequentially applied onto a single tissue section of the TMA, followed by image acquisition and photo-induced chemical bleaching to inactivate optical signals from antibodies (U.S. patent 7,741,045) . The order of antibody staining is listed in Table 1. In the first round of staining, unconjugated antibodies for HER2 and P16 were applied in primary incubation, followed by secondary staining with fluorophore conjugates of Cy3 or Cy5, respectively. All other antibodies were directly conjugated to fluorophores. After completion of MxIF staining rounds, quality of the staining of each protein marker was visually evaluated. TMA cores showing non-specific staining or having high background, over-exposed areas, or exhibiting tissue damage were excluded from analysis. Antibodies for the cyclooxygenase-II enzyme (COX-2) were included in the panel and data will be presented in a future report.
Image analysis and statistical methods
Image registration and single cell segmentation were performed with the software package, Layers, developed by GER for the MxIF platform. The DAPI signal and those from antibodies of ribosomal S6 and Na+-K+-ATPase were used for segmentation of the cell nuclear, cytoplasmic and membranous compartments, respectively, as well as the boundary of individual cells. Layers uses one channel to highlight the nuclear component and a second to represent intracellular membranes, extracellular fibers and other tissue structures, to produce a “virtual H&E” (vH&E) image. Following cell segmentation, measurements of cell size, morphology and protein marker signals were collected together with the coordinates of cell centroids. Pre-processing of data was conducted to remove cells that did not meet quality control requirements (i.e. cells that were lost or damaged at some point in the processing procedure). Pixel size in our study was 0.293 μm.
Inclusion of cancer cells for analysis
For the study of the cancer population, a set of cell size parameters (> 90 pixels in perimeter, > 400 pixels in cell area and > 10 pixels of nuclear area) were applied in gating and included for analysis. To enrich for cancer cells in our analysis, single cell data from Luminal A/B and HER2+ cores were gated with Pan-Cytokeratin (PCK26) level (> = 1500 raw signal intensity) in order to select for PCK+ epithelial cancer cells. Regions with benign ducts were manually annotated by study pathologist (K.L.) on the virtual H&E images using the Image viewer Sedeen (Pathcore Inc, Toronto, ON). A python script (http://www.python.org) was developed to include/exclude single cell data from annotated regions. In this case, single cell data corresponding to the annotated benign ducts were excluded from analysis. For TNBC, PCK staining is generally weak and prevented gating for PCK+ epithelial cells. Subsequently, cancer cells were manually annotated using Sedeen and the python script was applied to select for cells in the annotated regions.
Data processing and thresholding
Protein expression levels for each marker were extracted from their reported subcellular compartment localizations as the fluorescent marker pixel values (ER-nuclear, PR-nuclear, HER2-membranous, Ki67-nuclear, P53-nuclear and cytosol, P21-nuclear, P16-nuclear, PCK-membranous). A min–max normalization to a range of 0–15 was performed on the values for each marker individually. For thresholding analysis of ER, PR and HER2, selection of empirical cut-points for marker positivity (weak vs strong staining for ER) were determined by comparing the MxIF values of cells from cores that were scored as negative, weak, moderate or strong staining intensities on IHC. For HER2, MxIF values were compared to cores scored as 0, 1+, 3+ in IHC as cases with equivocal (2+) scoring were further tested with FISH. Five cores in each scoring class were studied (3 cores only for PR weak). For Ki67, the cut-point was determined by comparing MxIF distributions from representative cores with a range of IHC scores. For P53, P16 and P21, cut-points were determined at > 0, 2 and 1 of normalized MxIF levels, respectively, after comparison with IHC scores.
Immune cells analysis
All segmented cells (including cancer and stromal cells) were included for analysis. CD8-positive and CD20-positive cells were quantified based on a threshold setting of normalized MxIF signals at 0.2.
Spatial arrangement of single cells
A MATLAB script was developed to map and label individual cells with their protein marker classifications using their cell centroid coordinates. Neighborhood analysis was conducted by calculating the frequency of each neighboring cell type within a 30 μm (100 pixel) or 100 μm (341 pixel) radius from the centroid of the central cell. Heterogeneity was evaluated by calculating the Shannon Equitability index (EH) (Shannon Diversity Index divided by maximum diversity defined as [Ln (Number of Species)]) . It assumes a value between 0 and 1, with 1 representing the condition of all species present in the neighborhood and at the same level of abundance.
Significant difference between each pair of cores taken from the same cancer specimen was determined with the compare.2.vectors function (afex package in R) with the EPH and Ki67 classifications (16 classes) for all cells in each TMA core presented as a vector. p < 0.01 for both Wilcoxon test (coin::Wilcoxon) and permutation test were used to determine significant difference. Significant differences between the percentages of cells in each EPH group among IHC subtypes, and between EH of EPH groups were evaluated with ANOVA test and Tukey’s HSD. All statistical analysis were performed in R (R Statistics).
MxIF intensities of biomarkers in single cells showed distributions comparable to IHC scoring of tissue core
A total of 225,086 single cells from 101 TMA cores (59 cases) were included after quality evaluation and gating parameters had been applied (see Additional file 1). From the 101 TMA cores studied, 82 were paired cores from 41 cases. In order to determine if MxIF measurements of breast biomarkers reflect IHC results, we compared the distribution of normalized ER and PR MxIF signals from individual cells between cores that were scored by IHC as negative, weak, moderately or strongly stained for the majority of the cells (Fig. 1A–D). For HER2, the distributions of normalized MxIF signals from individual cells were compared between cores that were scored in HER2 IHC as negative, 1+ or 3+ (2 cases that were scored as 2+/equivocal in IHC were further tested by FISH with one case re-classified to 1+ and one to 3 +)(Fig. 1E, F). We found that ER MxIF signal intensities were relatively low in cores scored as weak or moderate by IHC and that there was significant overlap in the range of MxIF values in those cores (Fig. 1B). Some cells in cores that were scored as strong in IHC also exhibited zero or low MxIF signal intensities, but the majority of cells showed medium to high MxIF staining intensities. For PR, MxIF signal intensities for negative and weakly stained cores again showed overlapping distributions, with cells from moderate and strongly stained IHC cores showing higher intensities and more dispersed range (Fig. 1D). MxIF HER2 staining showed discrete distributions between neg (0), 1+ and 3+ IHC-scored cores (Fig. 1F).
Based on the distributions of normalized MxIF intensities for each protein marker, threshold levels were determined for binary classification of cells. As the maximum MxIF level of cells from weak ER IHC cores (IHC score = Low) was 0.36 (Fig. 1B), a threshold of 0.4 was chosen to segregate cells with negative or weak staining from the moderate and strongly positive cells. The cut-point to classify PR positive and negative cells was set at 1.6 as the range of normalized MxIF cell intensities from IHC PR-negative cores extended from 0 to 1.6 (Fig. 1D). The HER2-positive cut-point was set at 5.0, the maximum signal intensity from cells in HER2 1+ cores (Fig. 1F). Comparison of representative cores with various levels of Ki67 IHC positivity prompted the selection of a cut-point value at 0.2 to include the weakly positive cells (Fig. 1G, H).
In addition to ER, PR, HER2 and Ki67, we also used MxIF to study the expressions of P53, P21 and P16 on the TMA. Pathologists evaluate P53 and P16 IHC staining in cancers by reviewing the overall staining intensity and pattern across the lesion and classify tumors as normal (heterogeneous staining across the lesion), null or overexpression for P53 and normal or overexpression for P16 [33, 34]. We studied the distribution of MxIF signal intensity levels of each marker in cores that showed various IHC staining patterns (Fig. 2). P53 MxIF signals in cores that were scored by IHC as null ranged from 0–0.5 (Fig. 2A, B). P53 normal cores expressed a majority of cells with P53 MxIF at 0, with some cells expressing levels between 0 and 1, whereas some cells in cores with overexpression (O/E) by IHC exhibited much higher levels of MxIF signal intensity (99th percentile at 6.77). In a comparison of P16 MxIF levels between IHC normal or O/E cores, cells from O/E cores tended to exhibit higher measured intensity than normal cores (Fig. 2C, D). P21 expression in IHC was scored in terms of percentage of positive cells and average staining intensity. P21 MxIF values from four cases with varying IHC scores were compared (Fig. 2E, F). MxIF values from the P21 negative core remained low at 0 to 1. As the percentages of IHC positive cells and their staining intensities increased, a corresponding increasing trend in MxIF signal intensity was observed. These data suggest that MxIF signal distributions could be used to reflect the mutational status of P53 and P16 or P21 expression patterns determined by IHC evaluation.
MxIF co-expression pattern of ER, PR, HER2 and Ki67 recapitulates IHC-surrogate subtypes and identifies heterogeneity within each subtype
Cut-points for ER, PR and HER2 were applied to MxIF measured levels in each cell. As a result, 8 classes, which we refer to as EPH groups (for ER, PR and HER2) emerged from the data and were color-labelled accordingly (Fig. 3A). Each group was further stratified as being Ki67+ or Ki67−. The abundance of EPH-classified cells in each TMA core was measured (Fig. 3B-C and Additional file 2: Table S1). Cores that were taken from the same cancer case were arranged together in pairs (Fig. 3B). Paired cores from all but one case were classified into the same IHC-surrogate subtype and generally showed similar distributions of EPH groups. That one case showed discordant classification, where one core was classified as Luminal A-like (LumA) (spot_032) and the second core as Luminal B-like (HER2-negative) (LumB) (spot_059). Luminal cancers were composed of varying proportions of cells from HER2-negative EPH groups (Groups 1, 3, 5 and 7) (Fig. 3B, C). LumB cancers harbored cells from PR-negative Groups 1 and 5, while LumA cancers also comprised PR+ Groups 3 and 7 cells. Only a few LumB, HER2+ (Luminal B-like, HER2+) cores were studied, and these mainly presented with cells from ER-weak, PR-negative EPH Group 1 and HER2+ Group 2. One particular LumB, HER2+ core (spot_117, Fig. 3B) showed composition of not only Groups 1 and 2, but also cells from ER-weak, PR-positive, HER2 ± Groups 3 and 4. Most of the cells in HER2+ (non-luminal) cases belonged to Groups 1 and 2 only (Fig. 3C). TNBC cases were predominately comprised of Group 1 cells (Fig. 3C).
We did not observe any strong association of Ki67 positivity to a single EPH group, but higher proliferative fractions were observed in Luminal B, HER2+ and TNBC cores, coinciding with EPH Groups 1, 2 and 5 (Fig. 3C), all of which are negative for PR.
Intra-tumoral heterogeneity was assessed by comparing the EPH composition and Ki67 positivity of single cells between cores taken from the same specimen. There were 41 cases with two cores studied. 6 out of 10 cases (60%) of LumA breast cancer showed significant statistical difference between cores, and 60% (9 out of 15 cases) of LumB cases showed significant differences. 1 out of 2 cases of LumB, HER2+, and all 4 HER2+ cases showed significant differences. As expected, only 20% (2 out of 10 cases) of TNBC showed significant differences between cores from the same specimen as most TNBC are composed of EPH Group 1 cells (Ki67+ or −) only.
Co-expression patterns of Ki67, P53, P16 and P21 in IHC-surrogate subtypes reveals protein marker heterogeneity in TNBC
Next, we analyzed the co-expression pattern of Ki67, P53, P21 and P16 in each core. Cores were arranged according to their IHC-surrogate subtypes (same order as Fig. 3B) to determine if specific signatures are more prevalent (Fig. 4). The normalized MxIF levels of P53, P16 and P21 per cell, as well as the positive fractions of Ki67, P53, P16 and P21 in each core are shown. Mutational status of P53 and P16 based on IHC evaluation are also shown for each case. We observed that proliferative fractions of MxIF-measured Ki67+ cells were high in some LumB, most HER2+ and most TNBC cases, with the latter exhibiting the highest levels. P53 and P16 aberrations were found most frequently in TNBC by IHC, either as null or as overexpression, as well as in HER2+ and a small fraction of LumB breast cancers. However, levels of P53 MxIF signal intensities were observed to be much higher in some TNBC tumors, compared to HER2+ or LumB cases which also exhibited P53 overexpression patterns in IHC. A large fraction of TNBC cases was also found to exhibit P16 overexpression by IHC or by high MxIF levels which coincided with IHC-scored P53 aberrations. On the other hand, two LumB cases which showed overexpression of P16 demonstrated normal P53 by IHC. P21 expression levels did not appear to associate with P53 or P16 expression patterns, yet a slightly higher proportion of LumB breast cancers, as compared to the other subtypes, demonstrated higher levels of P21 and P21-positive fractions measured by MxIF. The association between MxIF-measured P21 level and EPH Group classification was assessed and no significant dependence was observed (F-value = 0.003, p-value = 0.955).
Neighborhood analysis to assess heterogeneity in cellular arrangement
The spatial arrangement of cells with different protein co-expression signatures was studied. In Fig. 5 and Additional file 2: Fig. S1, the centroid of each cell, depicted by a color representing its EPH grouping was mapped back to its cellular location on the tissue section. Images from TMA cores of three LumB cases, one of which was also HER2+, are shown in Fig. 5. Spot135, a LumB (HER2-) tumor, illustrated a relatively homogeneous composition and consisted of predominately ERm/s EPH5 cells (Fig. 5A, D). Spot056, also a LumB (HER2-), appeared to be more heterogeneous in the composition of EPH groups and is composed of a majority of cells in EPH Groups 1 and 5, and a small number of cells in PR+ EPH Groups 3 and 7 (Fig. 5B, E). The LumB, HER2+ core (spot117) shown here was highly heterogeneous and composed of cells from all groups except for Group 5 (Fig. 5C, F). Here, cells from the two predominant groups (EPH3 and 4) appeared to form spatial clusters within their own group.
Quantitative analysis of heterogeneity in cellular arrangement with Shannon Equitability Index (EH) is illustrated in Fig. 5D–F. EH was calculated for each (central) cell for the indicated size of neighborhood (30 and 100 μm in radii) and presented as the mean for each EPH cell type. In spot135, EH was found to be significantly higher for central EPH1 cells in both 30 and 100 μm neighborhoods (p < 0.01) (Fig. 5D), suggesting that EPH Group 1 cells tend to have more heterogeneous mix of cells in their proximity compared to other EPH groups in this cancer. In the other LumB case (spot056), EH measured for EPH5 central cells was significantly higher than EPH Groups 1 and 3 in the 30 μm neighborhood (p < 0.01) (Fig. 5E). In the LumB, HER2+ case examined (spot117), EH evaluated for central cells of both EPH Groups 2 and 3 was significantly higher than for EPH1 and 4 (p < 0.01), suggesting that EPH2 and 3 cells are more likely to be present in a heterogeneous cellular environment compared to other cellular groups in this tumor (Fig. 5F).
To determine if the neighborhood environment for certain EPH Group cells demonstrate particular patterns, we studied the cumulative neighboring cell types of central EPH cell type in 5 LumA and 5 LumB cores (Fig. 5G, H). For LumA cores (Fig. 5G), each EPH group that was present appeared to have a higher likelihood to being in close proximity to cells of the same EPH group. For LumB cores (Fig. 5H), only small numbers of EPH3 and 7 cells were present, while EPH1 and 5 cells tended to be most likely to cluster with their own group.
Quantification of immune cells and their spatial organization
In addition to analysis of breast cancer cells, densities of CD8 T-lymphocytes and CD20 B-lymphocytes in the tumor microenvironment were quantified. We found higher levels of both CD8 and CD20 lymphocytes in the HER2+ cancers compared to the other IHC subtypes (Fig. 6A). Next we attempted to evaluate the spatial localization patterns of these lymphocytes using co-occurrence matrices. Four representative cases with similar densities of CD8 (3.5–8.6%) and CD20 (3.3–8.2%) are shown in Fig. 6B. The localization maps indicate that the lymphocytes in spots 035 and 097 have more spatially scattered, tumor-infiltrating patterns, whereas in spots 012 and 060 lymphocytes tended to be arranged in close proximity, particularly for the CD20 B-lymphocytes. These observations were also apparent from the quantification of percentage of neighboring cells for each central CD8 or CD20 cell. While the majority of neighboring cells for CD8 and CD20 were PCK+ cancer cells in spots 035 and 097, about half of neighboring cells for CD20 were also CD20 cells in spots 012 and 060. These findings suggest that cancers that showed similar levels of immune densities could exhibit heterogeneous spatial patterns.
Breast cancers are traditionally classified based on morphology including histologic grade and the expressions of ER, PR and HER2 following established guidelines. The St. Gallen International Expert Consensus on the primary therapy of early breast cancers have established the use of IHC staining to define surrogates for molecularly defined intrinsic subtypes . In this proof of principle study, we have reported the use of protein marker multiplexing with MxIF to examine protein biomarker co-expression patterns of single cells within each IHC-surrogate subtype. Comparison of the distribution of MxIF intensity levels of each marker and the corresponding IHC scoring illustrated that MxIF-measured expression correlates well overall with IHC classifications. When individual cells were characterized by their MxIF-measured levels of ER, PR and HER2, they segregated into groups to which we refer here as EPH, and demonstrated marked heterogeneity between cores within the Luminal A/B-like St. Gallen IHC subtypes. Heterogeneity was also observed in TNBC cancers with varying P53 and P16 levels. P53 protein levels measured with MxIF were much higher in some TNBC cancers, compared to HER2+ or luminal B-like cases which also harbored P53 overexpression identified with IHC. Our study illustrates that heterogeneity within each St. Gallen IHC-surrogate subtype could be further explained by quantitative analysis of co-expression signatures of key biomarkers. Addition of these refined descriptors could assist in the characterization of individual breast cancer to improve management and long-term prognostication of the disease.
When single cells were grouped according to the co-expression signatures of ER, PR and HER2, we found that significant levels of variation exist within ER+ cancers, especially in the proportion and staining intensities of ER and PR. Currently, depending on the patient’s menopausal status, endocrine therapy is recommended for hormone receptor-positive, HER2-negative cancers, defined as ER/PR > 1%. Recent recommendations from the American Society of Clinical Oncology/College of American Pathologists (ASCO/CAP) for ER and PR test reporting include the introduction of a new category of “ER Low Positive” in light of limited data on the benefit of endocrine therapy for cancers with 1% to 10% of ER+ cells . Improved characterization of cancers with low to intermediate range of hormonal receptor expression will help to identify subgroups of cases with poor prognosis which may require more aggressive treatment or monitoring. Hormonal receptor-positive, HER2-positive breast cancers, which constitute 5% of all breast cancers, also exhibit high degrees of heterogeneity in ER and HER2 expression which has been shown to be associated with their response to neoadjuvant chemotherapy and HER2-targeted therapy . Our analysis included only a few Luminal B-like, HER2+ cancers. Interestingly, we found as reported by others [37, 38] that cells co-expressing high levels of both ER and HER2 are rare, while co-expression of PR and HER2 was detected in a small fraction of cells. Ki67 positivity appeared to co-localize with EPH groups that are PR-negative. This observation aligns with previous findings which suggest that the loss of, or low values of PR expression is an adverse prognostic factor [39, 40].
Spatial arrangement of cells was evaluated quantitatively by measuring the frequency of each cell type in a user-defined distance from the central cell type. Heterogeneity in cellular neighborhood was also quantified by calculating the Shannon Equitability index for each central cell. Although only a few cases were presented here, certain EPH groups appear to be localized in neighborhoods that were less heterogeneous. Since our limited dataset lacks outcome information of the cases studied, the clinical significance of these findings is not clear at this point. Nevertheless we believe that this tool for quantifying cellular arrangements could be useful when combined with other molecular profiling assays, particularly in the study of mutational evolution. Cells harboring potential driver mutations will exhibit higher proliferative capacity, resulting in these cells being closely arranged in cellular homogeneous neighborhoods. Such cellular neighborhood analysis could also be used to predict functional interaction between cell groups, such as the relationships between infiltrating immune subsets or between immune and cancer cells. We demonstrated that between cancers that have similar levels of CD8 and CD20 lymphocytes, some cases showed a more scattered infiltration pattern, while in others the lymphocytes were arranged in close proximity. However, the limited size of the TMA cores studied here prevented us from properly assessing immune densities and localization patterns over the entire lesion as would be done in the pathology lab . Nevertheless, we believe these methods, when applied to whole tissue sections, could be useful for evaluating response to immunotherapy, where the type, density and localization of immune subsets have shown to predict long-term outcomes and response to therapy [41,42,43,44,45,46,47].
Our current analysis was based on gating for cancer cells on the basis of larger sizes compared to normal epithelial cells in perimeter and area in pixels. Selection of cancer cells was further enriched after thresholding of PCK26 level in luminal and HER2+ cancers. TNBC cells express weak PCK26, preventing the use of a threshold for gating. Therefore, cancer cells were manually annotated in the TNBC cancers. Nevertheless, “contamination” with non-cancer cells in our study population was inevitable and these could be tumor-infiltrating lymphocytes, fibroblasts or vascular cells. Another technical challenge is the low sensitivity in calibrating the MxIF measurement in distinguishing ER weakly positive cells from ER negative cells. A more granular calibration study, with the goal of improving the correlation of MxIF measurements to IHC histoscores in breast cancer is being conducted to develop methods that could translate multiplex-defined signatures to clinical subtypes.
Recent advances in high-dimensional proteomics technologies have allowed the study of breast cancer with hundreds or even thousands of protein markers and revealed the presence of a myriad of cellular phenotypes in the cancer and the tumor microenvironment that may further define each PAM50 intrinsic molecular subtype [48,49,50,51,52]. Although our analysis here is limited to seven markers, our comparison with IHC-surrogate subtypes and IHC staining patterns illustrate how the MxIF co-expression signatures could be translated to phenotypes identified in the clinical setting. We believe that a detailed quantitative assessment of protein markers co-expression on single cells, in the context of intensity grading and spatial arrangement, could improve the precision in stratifying the disease for treatment assignment and monitoring. To this goal, we are applying the methods reported here to study whole tissue sections from a cohort of whole-mount processed breast lumpectomies and correlating with clinical outcome information. A combined evaluation with molecular analyses such as targeted mutational or RNA profiling, will augment our protein signature and clustering analysis. The ability to study spatial arrangement of cell groups, particularly between cancer cells and cells in the stroma and the heterogeneity of such arrangement could provide valuable insight into improved management of breast cancer.
We report on a proof-of-concept study where single-cell image analysis of protein multiplexing is used to assess heterogeneity within IHC-defined breast cancer subtypes. Biomarker expression signatures evaluated on single cells, revealed heterogeneous composition and spatial arrangement of subgroups. Furthermore, cancers with the same IHC score of overexpression of P53 and/or P16 exhibited a range of expression levels when measured in individual cells. These findings motivate a more refined stratification of breast cancers based on the expression levels of protein biomarkers and evaluation of correlation with outcomes. This information may improve the characterization of breast cancers and predicting their responsiveness to therapy.
Availability of data and materials
The dataset consisting of raw output of the MxIF study is included as an additional file to this submission—“Cheung_BrCaMxIF_RawDataAllCells.csv”. It will be placed in a public repository before this article is published.
Analysis of variance
American Society of Clinical Oncology/College of American Pathologists
- EH :
Shannon Equitability Index
Fluorescent in situ hybridisation
Epidermal Growth Factor Receptor 2
Luminal A-like subtype
Luminal B-like (HER2-negative) subtype
- LumB, HER2+:
Luminal B-like (HER2-positive) subtype
HER2-positive (non-luminal) subtype
Moderately or strongly stained
Multiplex in Immunofluorescence
Triple-negative breast cancer
- Tukey’s HSD:
Tukey’s honestly significant difference
Virtual Hematoxylin & Eosin
Koboldt DC, Fulton RS, McLellan MD, Schmidt H, Kalicki-Veizer J, McMichael JF, et al. Comprehensive molecular portraits of human breast tumours. Nature. 2012;490(7418):61–70. https://doi.org/10.1038/nature11412.
Perou CM, Sørlie T, Eisen MB, van de Rijn M, Jeffrey SS, Rees CA, et al. Molecular portraits of human breast tumours. Nature. 2000;406(6797):747–52.
Cheang MCU, Martin M, Nielsen TO, Prat A, Voduc D, Rodriguez-Lescure A, et al. Defining breast cancer intrinsic subtypes by quantitative receptor expression. Oncologist. 2015;20(5):474–82.
Ali HR, Rueda OM, Chin S-F, Curtis C, Dunning MJ, Aparicio SA, et al. Genome-driven integrated classification of breast cancer validated in over 7,500 samples. Genome Biol. 2014;15(8):431.
Curtis C, Shah SP, Chin S-F, Turashvili G, Rueda OM, Dunning MJ, et al. The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups. Nature. 2012;486(7403):346–52.
Russnes HG, Lingjærde OC, Børresen-Dale A-L, Caldas C, Nord S, Caldas C, et al. Breast cancer molecular stratification- from intrinsic subtypes to integrative clusters. Am J Pathol. 2017;111:326–38.
Dawson S-J, Rueda OM, Aparicio S, Caldas C. A new genome-driven integrated classification of breast cancer and its implications. EMBO J. 2013;32(5):617–28. https://doi.org/10.1038/emboj.2013.19.
Goldhirsch A, Winer EP, Coates AS, Gelber RD, Piccart-Gebhart M, Thürlimann B, et al. Personalizing the treatment of women with early breast cancer: highlights of the St Gallen International Expert Consensus on the Primary Therapy of Early Breast Cancer 2013. 2013
Curigliano G, Burstein HJ, Winer EP, Gnant M, Dubsky P, Loibl S, et al. De-escalating and escalating treatments for early-stage breast cancer: The St. Gallen International Expert Consensus Conference on the Primary Therapy of Early Breast Cancer 2017. Ann Oncol. 2017;28(8):1700–12.
Bastien RR, Rodríguez-Lescure Á, Ebbert MT, Prat A, Munárriz B, Rowe L, et al. PAM50 Breast cancer subtyping by RT-qPCR and concordance with standard clinical molecular markers. BMC Med Genomics. 2012;5(1):44. https://doi.org/10.1186/1755-8794-5-44.
Fernandez-Martinez A, Pascual T, Perrone G, Morales S, de la Haba J, González-Rivera M, et al. Limitations in predicting PAM50 intrinsic subtype and risk of relapse score with Ki67 in estrogen receptor-positive HER2- negative breast cancer. Oncotarget. 2017;8(13):21930–7.
Liu MC, Pitcher BN, Mardis ER, Davies SR, Friedman PN, Snider JE, et al. PAM50 gene signatures and breast cancer prognosis with adjuvant anthracycline-and taxane-based chemotherapy: Correlative analysis of C9741 (alliance). Npj Breast Cancer. 2016;2(1):10.
Prat A, Cheang MCU, Galván P, Nuciforo P, Paré L, Adamo B, et al. Prognostic value of intrinsic subtypes in hormone receptor-positive metastatic breast cancer treated with letrozole with or without lapatinib. JAMA Oncol. 2016;2(10):1287–94.
Prat A, Fan C, Fernández A, Hoadley KA, Martinello R, Vidal M, et al. Response and survival of breast cancer intrinsic subtypes following multi-agent neoadjuvant chemotherapy. BMC Med. 2015;13(1):10.
Pogue-Geile KL, Song N, Jeong JH, Gavin PG, Kim SR, Blackmon NL, et al. Intrinsic subtypes, PIK3CA mutation, and the degree of benefit from adjuvant trastuzumab in the NSABP B-31 trial. J Clin Oncol. 2015;33(12):1340–7.
Nielsen TO, Parker JS, Leung S, Voduc D, Ebbert M, Vickery T, et al. A comparison of PAM50 intrinsic subtyping with immunohistochemistry and clinical prognostic factors in tamoxifen-treated estrogen receptor-positive breast cancer. Clin Cancer Res. 2010;16(21):5222–32. https://doi.org/10.1158/1078-0432.CCR-10-1282.
Szymiczek A, Lone A, Akbari MR. Molecular intrinsic versus clinical subtyping in breast cancer: a comprehensive review. Clinical genetics. New York: Blackwell Publishing Ltd; 2020.
Rueda OM, Sammut SJ, Seoane JA, Chin SF, Caswell-Jin JL, Callari M, et al. Dynamics of breast-cancer relapse reveal late-recurring ER-positive genomic subgroups. Nature. 2019;567(7748):399–404.
Burstein MD, Tsimelzon A, Poage GM, Covington KR, Contreras A, Fuqua SAW, et al. Comprehensive genomic analysis identifies novel subtypes and targets of triple-negative breast cancer. Clin Cancer Res. 2015;21(7):1688–98.
Echavarria I, Lopez-Tarruella S, Picornell A, García-Saenz JA, Jerez Y, Hoadley K, et al. Pathological response in a triple-negative breast cancer cohort treated with neoadjuvant carboplatin and docetaxel according to Lehmann’s refined classification. Clin Cancer Res. 2018;24(8):1845–52.
Gingras I, Gebhart G, De Azambuja E, Piccart-Gebhart M. HER2-positive breast cancer is lost in translation: time for patient-centered research. Vol. 14, Nature Reviews Clinical Oncology. Nature Publishing Group; 2017. p. 669–81.
Karaayvaz M, Cristea S, Gillespie SM, Patel AP, Mylvaganam R, Luo CC, et al. Unravelling subclonal heterogeneity and aggressive disease states in TNBC through single-cell RNA-seq. Nat Commun. 2018;9(1):10.
Lehmann BD, Jovanović B, Chen X, Estrada MV, Johnson KN, Shyr Y, et al. Refinement of triple-negative breast cancer molecular subtypes: Implications for neoadjuvant chemotherapy selection. PLoS ONE. 2016;11(6):10.
Carey LA, Berry DA, Cirrincione CT, Barry WT, Pitcher BN, Harris LN, et al. Molecular heterogeneity and response to neoadjuvant human epidermal growth factor receptor 2 targeting in CALGB 40601, a randomized phase III trial of paclitaxel plus trastuzumab with or without lapatinib. J Clin Oncol. 2016;34(6):542–9.
Whitworth P, Beitsch P, Mislowsky A, Pellicane JV, Nash C, Murray M, et al. Chemosensitivity and endocrine sensitivity in clinical luminal breast cancer patients in the prospective neoadjuvant breast registry symphony trial (NBRST) predicted by molecular subtyping. Ann Surg Oncol. 2017;24(3):669–75.
Lambertini M, Campbell C, Gelber RD, Viale G, McCullough A, Hilbers F, Korde LA, Werner O, Chumsri S, Jackisch C, Wolff AC, Vaz-Luis I, Ferreira AR, Prat A, Moreno-Aspitia A, Piccart M, Loi S, de Azambuja E. Dissecting the effect of hormone receptor status in patients with HER2-positive early breast cancer: exploratory analysis from the ALTTO (BIG 2–06) randomized clinical trial. Breast Cancer Res Treat. 2019;177:103–14. https://doi.org/10.1007/s10549-019-05284-y.
Gerdes MJ, Sevinsky CJ, Sood A, Adak S, Bello MO, Bordwell A, et al. Highly multiplexed single-cell analysis of formalin-fixed, paraffin-embedded cancer tissue. Proc Natl Acad Sci. 2013;110(29):11982–7. https://doi.org/10.1073/pnas.1300136110.
Fitzgibbons PL, Murphy DA, Hammond MEH, Allred DC, Valenstein PN. Recommendations for validating estrogen and progesterone receptor immunohistochemistry assays. Arch Pathol Lab Med. 2010;134(6):930–5.
Wolff AC, Hammond MEH, Hicks DG, Dowsett M, McShane LM, Allison KH, et al. Recommendations for human epidermal growth factor receptor 2 testing in breast cancer: American Society of Clinical Oncology/College of American Pathologists clinical practice guideline update. J Clin Oncol. 2013;31(31):3997–4013.
Goldhirsch A, Winer EP, Coates AS, Gelber RD, Piccart-Gebhart M, Thürlimann B, et al. Personalizing the treatment of women with early breast cancer: highlights of the St Gallen International Expert Consensus on the Primary Therapy of Early Breast Cancer 2013. Ann Oncol. 2013;24(9):2206–23.
Gerdes MJ, Sood A, Montalto MC, Can A, Ginty F, Bresnahan MA, Filkins RJ PZ. Sequential analysis of biological samplese. 2010. p. US7741045.
Magurran AE. Ecological diversity and its measurement. Ecological diversity and its measurement. Dordrecht: Springer Netherlands; 1988.
Ali Hashmi A, Naz S, Hashmi SK, Hussain ZF, Irfan M, Khan EY, et al. Prognostic significance of p16 & p53 immunohistochemical expression in triple negative breast cancer 11 Medical and Health Sciences 1112 Oncology and Carcinogenesis. BMC Clin Pathol. 2018;18(1):10.
Yemelyanova A, Vang R, Kshirsagar M, Lu D, Marks MA, Shih IM, et al. Immunohistochemical staining patterns of p53 can serve as a surrogate marker for TP53 mutations in ovarian carcinoma: an immunohistochemical and nucleotide sequencing analysis. Mod Pathol. 2011;24(9):1248–53.
Allison KH, Hammond MEH, Dowsett M, McKernin SE, Carey LA, Fitzgibbons PL, et al. Estrogen and progesterone receptor testing in breast cancer: American society of clinical oncology/college of American pathologists guideline update. Arch Pathol Lab Med. 2020;144(5):545–63.
Rye IH, Trinh A, Sætersdal AB, Nebdal D, Lingjærde OC, Almendro V, et al. Intratumor heterogeneity defines treatment-resistant HER2+ breast tumors. Mol Oncol. 2018;12(11):1838–55.
Chen C, Peng J, Xia H, Wu Q, Zeng L, Xu H, et al. Quantum-dot-based immunofluorescent imaging of HER2 and ER provides new insights into breast cancer heterogeneity. Nanotechnology. 2010;21(9):10.
Giltnane JM, Moeder CB, Camp RL, Rimm DL. Quantitative multiplexed analysis of ErbB family coexpression for primary breast cancer prognosis in a large retrospective cohort. Cancer. 2009;115(11):2400–9.
Chen S, Huang L, Chen CM, Shao ZM. Progesterone receptor loss identifies luminal-type local advanced breast cancer with poor survival in patients who fail to achieve a pathological complete response to neoadjuvant chemotherapy. Oncotarget. 2015;6(20):18174–82.
Prat A, Cheang MCU, Martín M, Parker JS, Carrasco E, Caballero R, et al. Prognostic significance of progesterone receptor-positive tumor cells within immunohistochemically defined luminal a breast cancer. J Clin Oncol. 2013;31(2):203–9.
Salgado R, Denkert C, Demaria S, Sirtaine N, Klauschen F, Pruneri G, et al. The evaluation of tumor-infiltrating lymphocytes (TILs) in breast cancer: recommendations by an International TILs Working Group 2014. Ann Oncol. 2015;26(2):259–71.
Denkert C, Von Minckwitz G, Brase JC, Sinn BV, Gade S, Kronenwett R, et al. Tumor-infiltrating lymphocytes and response to neoadjuvant chemotherapy with or without carboplatin in human epidermal growth factor receptor 2-positive and triple-negative primary breast cancers. J Clin Oncol. 2015;33(9):983–91.
Heppner BI, Untch M, Denkert C, Pfitzner BM, Lederer B, Schmitt W, et al. Tumor-infiltrating lymphocytes: a predictive and prognostic biomarker in neoadjuvant-treated HER2-positive breast cancer. Clin Cancer Res. 2016;22(23):5747–54.
Heindl A, Sestak I, Naidoo K, Cuzick J, Dowsett M, Yuan Y. Relevance of spatial heterogeneity of immune infiltration for predicting risk of recurrence after endocrine therapy of ER+ breast cancer. J Natl Cancer Inst. 2017;110(2):10.
Karn T, Denkert C, Weber KE, Holtrich U, Hanusch C, Sinn BV, et al. Tumor mutational burden and immune infiltration as independent predictors of response to neoadjuvant immune checkpoint inhibition in early TNBC in GeparNuevo. Ann Oncol. 2020;31(9):1216–22.
Maley CC, Koelble K, Natrajan R, Aktipis A, Yuan Y. An ecological measure of immune-cancer colocalization as a prognostic factor for breast cancer. Breast Cancer Res. 2015;17(1):131. https://doi.org/10.1186/s13058-015-0638-4.
Zhang AW, McPherson A, Milne K, Kroeger DR, Hamilton PT, Miranda A, et al. Interfaces of malignant and immunologic clonal dynamics in ovarian cancer. Cell. 2018;173(7):1755-1769.e22.
Angelo M, Bendall SC, Finck R, Hale MB, Hitzman C, Borowsky AD, et al. Multiplexed ion beam imaging of human breast tumors. Nat Med. 2014;20(4):436–42.
Johansson HJ, Socciarelli F, Vacanti NM, Haugen MH, Zhu Y, Siavelis I, Fernandez-Woodbridge A, Aure MR, Sennblad B, Vesterlund M, Branca RM, Orre LM, Huss M, Fredlund E, Beraki E, Garred Ø, Boekel J, Sauer T, Zhao W, Nord S, Hoglander EK, Jans DC, Brismar H, Haukaas TH, Bathen TF, Schlichting E, Naume B, Luders T, Borgen E, Kristensen VN, Russnes HG, Lingjærde OC, Mills GB, Sahlberg KK, Børresen-Dale A-L, Lehtiö J. Breast cancer quantitative proteome and proteogenomic landscape. Nat Commun. 2019;10(1):1–14. https://doi.org/10.1038/s41467-019-09018-y.
McCart Reed AE, Bennett J, Kutasovic JR, Kalaw E, Ferguson K, Yeong J, et al. Digital spatial profiling application in breast cancer: a user’s perspective. Virchows Arch. 2020;477(6):885–90.
Ali HR, Jackson HW, Zanotelli VRT, Danenberg E, Fischer JR, Bardwell H, et al. Imaging mass cytometry and multiplatform genomics define the phenogenomic landscape of breast cancer. Nat Cancer. 2020;1(2):163–75. https://doi.org/10.1038/s43018-020-0026-6.
Jackson HW, Fischer JR, Zanotelli VRT, Ali HR, Mechera R, Soysal SD, et al. The single-cell pathology landscape of breast cancer. Nature. 2020;578(7796):615–20.
We thank Dr. Lincoln Stein at the Ontario Institute for Cancer Research (OICR) and Dr. Robert Kerbel (Sunnybrook Research Institute) for their critical review and input, Yulia Yerofeyeva for her managerial assistance of this project, Gord Mawdsley for his assistance on equipment maintenance, Theo Cleland for developing script for extracting cell data from region selection, and other members at the Biomarker Imaging Research Lab for their input and support.
This work is supported by funding from the Canadian Cancer Society Research Institute, and by the Ontario Institute for Cancer Research.
Ethics approval and consent to participate
Not applicable. The microarray studied in this report was commercially purchased. There is no identifiable patient information and the source of the specimens was not provided.
Consent for publication
The authors declare that they have no competing interests—A.M.C., D.W., K.L., M.M., T. H., S. N-M., A.M., M. J. Y. This author is an employee of General Electric Research and has financial competing interest to declare—F. G.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Cheung, A.MY., Wang, D., Liu, K. et al. Quantitative single-cell analysis of immunofluorescence protein multiplex images illustrates biomarker spatial heterogeneity within breast cancer subtypes. Breast Cancer Res 23, 114 (2021). https://doi.org/10.1186/s13058-021-01475-y
- Breast cancer subtypes
- Spatial arrangement
- Protein multiplexing
- Immunofluorescence imaging
- Single cell imaging