Skip to main content
  • Research article
  • Open access
  • Published:

Radiomic signatures with contrast-enhanced magnetic resonance imaging for the assessment of breast cancer receptor status and molecular subtypes: initial results

Abstract

Background

To evaluate the diagnostic performance of radiomic signatures extracted from contrast-enhanced magnetic resonance imaging (CE-MRI) for the assessment of breast cancer receptor status and molecular subtypes.

Methods

One hundred and forty-three patients with biopsy-proven breast cancer who underwent CE-MRI at 3 T were included in this IRB-approved HIPAA-compliant retrospective study. The training dataset comprised 91 patients (luminal A, n = 49; luminal B, n = 8; HER2-enriched, n = 11; triple negative, n = 23), while the validation dataset comprised 52 patients from a second institution (luminal A, n = 17; luminal B, n = 17; triple negative, n = 18). Radiomic analysis of manually segmented tumors included calculation of features derived from the first-order histogram (HIS), co-occurrence matrix (COM), run-length matrix (RLM), absolute gradient (GRA), autoregressive model (ARM), discrete Haar wavelet transform (WAV), and lesion geometry (GEO). Fisher, probability of error and average correlation (POE + ACC), and mutual information coefficients were used for feature selection. Linear discriminant analysis followed by k-nearest neighbor classification (with leave-one-out cross-validation) was used for pairwise radiomic-based separation of receptor status and molecular subtypes. Histopathology served as the standard of reference.

Results

In the training dataset, radiomic signatures yielded the following accuracies > 80%: luminal B vs. luminal A, 84.2% (mainly based on COM features); luminal B vs. triple negative, 83.9% (mainly based on GEO features); luminal B vs. all others, 89% (mainly based on COM features); and HER2-enriched vs. all others, 81.3% (mainly based on COM features). Radiomic signatures were successfully validated in the separate validation dataset for luminal A vs. luminal B (79.4%) and luminal B vs. triple negative (77.1%).

Conclusions

In this preliminary study, radiomic signatures with CE-MRI enable the assessment of breast cancer receptor status and molecular subtypes with high diagnostic accuracy. These results need to be confirmed in future larger studies.

Background

Breast cancer is a heterogeneous disease with varying clinical presentations, subtypes, and treatment responses [1,2,3,4,5,6,7]. Although traditional prognostic and predictive factors such as tumor size, grade, histopathologic type, and immunohistochemical receptor status are well established, it has become evident that these traditional classifications cannot fully capture the heterogeneity of breast cancer so that the classical approach of stratifying patients into treatment groups based on phenotypic biomarkers may be insufficient in some patients. In this era of precision medicine, treatments are selected based on genetic tumor characteristics. In breast cancer, gene expression profiling has revealed four main intrinsic molecular subtypes that show pervasive differences in their gene expression patterns: luminal A, luminal B, human epidermal growth factor receptor 2 (HER2)-enriched, and triple negative [8,9,10]. These intrinsic molecular subtypes have different phenotypic presentations, prognosis, treatment responses, and recurrence-free and disease-specific survival leading to molecular subtype-based recommendations for systemic therapy [1,2,3, 11, 12].

Assessment of molecular subtypes is currently based on either gene expression profiling or immunohistochemical (IHC) surrogates from invasive tissue sampling [10, 13, 14]. It must be noted that this approach has limitations. A single biopsy of a potentially heterogenous tumor can capture only a snapshot of the tumor tissue that is subject to selection bias and may not be completely representative of the genetic, epigenetic, and/or phenotypic alterations of the entire tumor. In addition, cancer biology is subject to change over time as well with treatment from a stem-like, therapy-resistant and a differentiated drug-sensitive phenotype in a process linked to epithelial-mesenchymal transition [15]. Therefore, there is a strong argument for the development of alternative means to derive tumor characteristics that are prognostic indicators, i.e., receptor status and molecular subtypes, from the tumor in its entirety and to spatio-longitudinal monitor tumor biology changes during treatment. This is a unique opportunity for medical imaging, and in this context, contrast-enhanced magnetic resonance imaging (CE-MRI) coupled with radiomic analyses has yielded initial encouraging results. Prior studies have investigated radiomic signatures in the breast, but the generalization of these results is limited due to utilization of different MRI protocols and scanners [11, 16,17,18], results may be suboptimal due to inclusion of only a few radiomic features [11, 17,18,19,20], and only certain subgroups of breast cancers have been investigated [21].

We hypothesized that through the extraction of radiomic signatures from standardized CE-MRI data, an accurate assessment of molecular subtypes and receptor status of breast cancers will be provided. Thus, the aim of our study was to evaluate the diagnostic performance of CE-MRI coupled with radiomic analysis for the differentiation of breast cancers of different receptor status and molecular subtype.

Methods

This retrospective study was compliant with Health Insurance Portability and Accountability Act guidelines and approved by the Institutional Review Board with a waiver of written informed consent.

Patients

A prospectively populated research database was searched for patients who underwent state-of-the-art multiparametric MRI of the breast at our institution between January 2011 and January 2013 and who fulfilled the following inclusion criteria: histopathologically verified breast cancer, lesions ≥ 1 cm on CE-MRI to reduce the influence of partial volume effect on texture analysis [22]; 18 years or older; not pregnant; and not breastfeeding. One hundred and seventeen consecutive patients matched our search criteria. These patients underwent MRI for pre-treatment staging of their biopsy-proven breast cancer. Twenty-six were excluded for pathology results demonstrating types of cancer other than invasive ductal carcinoma or invasive lobular carcinoma, prior treatment, and poor image quality. Thus, 91 patients were included in the study.

MR imaging

All patients were examined with a 3 T system (Discovery MR750; GE Healthcare, Milwaukee, WI) with the body coil as the transmitter and a dedicated phased-array breast coil as the receiver. All patients underwent a state-of-the-art MRI protocol [23] including T2-weighted imaging, diffusion-weighted imaging, and CE-MRI. CE-MRI data was used for radiomic analysis.

CE-MRI (3D T1-weighted gradient echo VIBRANT sequence; TR/TE 4.3/2.1 ms; flip angle 10o; matrix size 320 × 192; FOV 30 cm; voxel size 1.8–2 mm3) with and without fat suppression. CE-MRI images were acquired before and at three points at 60-s intervals after the injection of a standard dose (0.1 mmol/kg body weight) of gadopentetate dimeglumine (Magnevist; Bayer HealthCare, Hanover, NJ, USA). Additional axial CE-MRI images (voxel size 0.7 mm3, acquisition time 1.5 min) were also collected. Contrast agent was injected in an antecubital vein as a bolus using a power injector at 4 m/s, followed by a saline flush. Parallel imaging using the array spatial sensitivity encoding technique (ASSET) was applied during the acquisition of DW and CE-MR images. The details of the full MRI sequence protocol are provided in Additional file 1.

Radiomic analysis

For radiomic analysis, the first post-contrast T1-weighted sequence was used. T1 was chosen over T2 and T3 for radiomic analysis, as at this time point, malignant lesions show peak enhancement, and tumor heterogeneity/subtle lesion morphologic features are best discernable on visual assessment. A semi-automatic approach for feature extraction was chosen, using the publicly available software package MaZda 4.6 (http://www.eletel.p.lodz.pl/programy/mazda/), which was developed within the COST (European Cooperation in The Field of Scientific and Technical Research) projects B11 and B21. In all patients, the slice with the largest transaxial lesion diameter on CE-MRI was chosen and two radiologists with 13 and 4 years of experience in consensus drew a single two-dimensional region of interest around the whole tumor (Fig. 1). Adequate distance was kept from the surrounding anatomic structures and biopsy markers. Gray-level normalization of each region of interest was performed, limiting the dynamics to μ ± 3σ (μ, gray-level mean; σ, gray-level standard deviation) in order to reduce the influence of contrast and brightness variations which might otherwise affect radiomic feature quantification [24]. Radiomic analysis of the manually segmented lesions included calculation of features derived from first-order histogram (HIS), co-occurrence matrix (COM), run-length matrix (RLM), absolute gradient (GRA), autoregressive model (ARM), discrete Haar wavelet transform (WAV), and lesion geometry (GEO) (Fig. 2) [25,26,27]. All features that MaZda was capable of calculating were included. The numbers of calculated features per feature class are as follows: HIS, n = 9; COM, n = 220; RLM, n = 20; GRA, n = 5; ARM, n = 5; WAV, n = 20; GEO, n = 73 (total number of features per lesion, n = 352). The full list of radiomic features used in this study (including their abbreviations) can be accessed at http://www.eletel.p.lodz.pl/programy/mazda/download/FeaturerList.pdf. For instance, COM features are based on pairs of pixels/voxels and, hence, provide information on lesion heterogeneity. RLM parameters are calculated for four directions and represent the number of times there is a run of pixels having a certain gray level. Meanwhile, ARM is based on a local interaction between pixels, where the intensity of one pixel is assumed to be a weighted sum of neighboring intensities. In contrast to all other feature groups, HIS features represent statistical descriptors of lesion signal intensities. The total time for image post-processing including lesion segmentation and radiomic analysis was approximately 2–3 min per lesion.

Fig. 1
figure 1

Manual region of interest placement for radiomic analysis in a 56-year-old patient with a HER2-enriched invasive ductal carcinoma in the right breast

Fig. 2
figure 2

Original CE-MRI images and corresponding color-coded sum entropy feature map as overlay of the tumor area of triple-negative (TN) and HER2-enriched (HER2) breast cancer. TN shows a clearly lower sum entropy than HER2

Statistical analysis

Feature selection is a method to reduce the large number of features obtained to the most relevant parameter set and is based on the choice of certain features according to given mathematical criteria. In this study, Fisher (ratio of between-class to within-class variance), minimization of probability of classification error and average correlation (POE + ACC), and mutual information coefficients were used for feature selection. Fisher did not take correlation between features into account, while the other two selection methods—POE + ACC and mutual information coefficients—however, did consider the interrelationships between features and aimed at reducing data redundancy. Feature selection was performed once across the entire training set, prior to linear discriminant analysis (LDA) and cross-validation. LDA was performed separately for each comparison to reduce the number of features used to distinguish between two subtypes/receptor status. LDA followed by k-nearest neighbor classification (with leave-one-out cross-validation) was used for pairwise radiomic-based separation of subtypes/receptor status, i.e., the model was trained with the use of all patients (n) − 1, excluding information from the held-out patient, and tested on the remaining patient. The training-testing process was then repeated n times, with n depending on the number of samples included in each comparison (e.g., luminal A vs. luminal B). LDA produces new feature vectors, which are called “most discriminating features” (MDF) and are optimized for maximum between-class scatter and minimum within-class scatter [28]. For a given dataset, the number of retained MDFs is equal to the number of eigenvalues exceeding 97% of the sum of all of them. The relevant equations can be accessed in the MaZda documentation on page 58, which can be accessed at http://www.eletel.p.lodz.pl/programy/mazda/download/mazda_manual.pdf. Since LDA is prone to overfitting due to “data piling,” where the feature space is large relative to the number of samples (URL: https://www.optimization-online.org/DB_FILE/2002/07/513.pdf), the number of features for each pairwise classification was dependent on the number of samples: we used one feature for every ten samples. This strategy—feature selection followed by feature reduction—was used to reduce the dimensionality of texture feature information as much as possible, as previously recommended [27]. The k-nearest neighbor classifier assigns a point in data space to a class to which its k-nearest neighbors belong [29]. Based on the MDF, the k-nearest neighbor classifier is used to measure the percentage of misclassified data vectors by comparing them with the true class affiliations.

For visualization of radiomic differences between breast lesions with different receptor status, feature maps were constructed which represent the distributions of single radiomic features within each image [27]. These feature maps were used solely for illustrative purposes.

External validation cohort

To validate part of our results, some of the experiments were repeated using data from a second institution (approved by the local Institutional Review Board with a waiver of written informed consent). A research database was searched for patients who underwent state-of-the-art multiparametric MRI of the breast between January and September 2014 and who met the same inclusion criteria as described above. Fifty-two patients were included in the validation cohort. MR imaging parameters used for the validation cohort have been described previously [30]. Lesions were segmented using MaZda, and the same feature sets as selected before were applied to differentiate between molecular subtypes/receptor status. For validation, only those pairwise comparisons that yielded an accuracy of at least 80% were chosen.

Histopathology

The tumor sampling results were reviewed for tumor histology, tumor and nuclear grade, and IHC status. A central review by reference pathologists for breast cancer is the standard in our institution. Evaluation of the IHC status included estrogen receptor, progesterone receptor, and HER2. Estrogen receptor or progesterone receptor-positive tumors (> 1% staining) were classified as hormone receptor positive. In all patients, the final histolopathological analysis from surgical tumor specimen was used as the reference standard. Tumor subtypes were classified as luminal A for hormone receptor positive and HER2 negative, luminal B for hormone receptor positive and HER2 positive, HER2-enriched for hormone receptor negative and HER2 positive, and triple negative for hormone receptor and HER2 negative [1, 11, 31]. Patients with equivocal HER2 status were evaluated using fluorescence in situ hybridization to detect gene amplification.

Results

Mean lesion size of the 91 treatment-naïve, biopsy-proven breast cancers in 91 patients (mean age, 48 ± 9.7 years, range, 27–68 years) was 3.5 ± 2.3 cm (range, 1–16.6 cm). There were 70 mass lesions and 21 non-mass enhancing lesions. Hormone receptor positivity was observed in 57 patients (62.6%). Forty-nine breast cancers were luminal A (53.8%), eight were luminal B (8.8%), 11 were HER2-enriched (12.1%), and 23 were triple negative (25.3%).

Results of group-wise radiomic-based classifications including numbers of features are listed in Tables 1 and 2. Detailed results of all radiomic feature-based cancer classifications including all selection algorithms and respective MDFs are given in Additional file 1: Table S1. With the exception of HER2-enriched vs. luminal A (MDF = 2), the previously selected texture features were reduced to a single MDF. Selected feature sets for classifications with accuracies > 80% are given in Table 3, while feature sets for all pairwise classifications and selection algorithms are provided in Additional file 2: Table S2. Classification accuracies ≥ 80% were considered as clinically relevant and were achieved for molecular subtypes and receptor status as listed below.

Table 1 Results of group-wise radiomic feature-based cancer classifications for molecular breast cancer subtypes (training dataset)
Table 2 Results of group-wise radiomic feature-based cancer classifications for hormone receptor status (training dataset)
Table 3 Selected feature sets for pairwise classifications with accuracies ≥ 80%

Molecular breast cancer subtypes

Best results in terms of accuracies were achieved for the differentiation of luminal B cancers from other groups: luminal B vs. luminal A, 84.2% (mainly based on COM features selected by mutual information coefficients); luminal B vs. triple negative, 83.9% (mainly based on GEO features selected by Fisher coefficients); and luminal B vs. all others, 89% (mainly based on COM features selected by POE + ACC). In summary, luminal B cancers seemed to carry distinct radiomic signatures that enable their separation from breast cancers with other features (Tables 1 and 3).

The separation of HER2-enriched cancers was successful, with the following accuracy: HER2-enriched vs. all others, 81.3% (mainly based on COM features selected by mutual information coefficients) (Tables 1 and 3). In summary, HER2-enriched cancers showed radiomic signatures that enable their separation from other breast cancers.

Receptor status

With regard to cancer hormone receptor status, pairwise discrimination of hormone receptor-positive tumors from other groups yielded accuracies that were below 80% (see Table 2).

External validation cohort

Of the 52 treatment-naïve, biopsy-proven breast cancers included from a second institution to validate our results, 17 were luminal A, 17 luminal B, and 18 triple negative. This allowed us to repeat some of the experiments that had yielded satisfactory results with accuracies above 80% when using our own data. Validation of these feature sets provided satisfactory results, with accuracies of 79.4% for luminal A vs. luminal B (original training dataset, 82.5%) and 77.1% for luminal B vs. triple negative (training dataset, 83.9%). Detailed results for group-wise classifications using the validation dataset can be found in Table 4, while rates of misclassified cases for pairwise comparisons (training and validation dataset) are displayed in Table 5.

Table 4 Results of group-wise radiomic feature-based cancer classifications (validation dataset)
Table 5 Rates of misclassified cases for pairwise comparisons (training and validation dataset)

Discussion

In this study, we evaluated the diagnostic performance of CE-MRI coupled with radiomic analysis for the non-invasive differentiation of breast cancers with different receptor status and molecular subtypes. We hypothesized that microstructural differences between breast cancers of different receptor status, and hence molecular subtypes, would lead to different, larger-scale gray-level patterns in CE-MR images which can be assessed quantitatively by radiomic analysis. Our results demonstrate that radiomic features derived from CE-MRI data enable the determination of receptor status and molecular subtypes with a high accuracy. CE-MRI radiomic signatures may therefore provide valuable prognostic indicators based on the entire tumor and thus may be used to monitor spatio-longitudinal tumor biology changes during treatment.

In our study, the best results were achieved for the differentiation of luminal A vs. luminal B (accuracy 84.2%), luminal B vs. triple negative (83.9%), luminal B vs. all others (89%), and HER2-enriched vs. all other cancers (81.3%). In these four pairwise comparisons, especially COM and GEO features seemed to be of importance, which highlight the relevance of structural heterogeneity on the one hand and lesion shape on the other hand. External validation of part of our results yielded satisfactory results for the differentiation of luminal A vs. luminal B (79.4%) and luminal B vs. triple negative (77.1%). Our results indicate that, since tumor biology may change in course of treatment, radiomic features may possibly be able to capture these changes noninvasively, provided that our results are verified in future studies that use larger datasets and a prospective design.

Li et al. achieved excellent results for radiomic-based separation between estrogen receptor-positive and estrogen receptor-negative cancers with an area under the curve of 0.89 [17]. Given the fact that in clinical decision-making, a distinction of positivity of estrogen receptor and progesterone receptor is not relevant, we considered either estrogen receptor or progesterone receptor positivity as hormone receptor positive and did not analyze these receptors separately. In our study, in contrast to the differentiation of luminal A/B cancers, the differentiation of hormone receptor-positive and hormone receptor-negative cancers had limited success with an accuracy of 68.1%. However, we showed excellent results for the differentiation of luminal B vs. all other subtypes (89%) and luminal A vs. B cancers (84.2%). In addition to being hormone receptor positive, luminal B cancers are also characterized by other biologic features such as higher proliferation rates and/or HER2 positivity, and it seems that this can be captured through distinctive radiomic features. All hormone receptor-positive cancers are treated with endocrine therapy alone or with chemotherapy, while luminal B cancers in contrast to luminal A cancers derive benefit from additional cytotoxic and targeted treatment. In this context, the ability to identify and non-invasively spatio-longitudinally monitor these breast cancers during treatment is particularly relevant.

In this study, we used a combination of multiple radiomic features derived from different categories (e.g., COM, RLM, and ARM), all of which capture different aspects of image texture and thus might have contributed to the excellent results. This approach differs from the majority of prior MRI radiomic studies in the field of breast cancer, in which typically very few and often COM-based radiomic features are calculated [18,19,20, 32]. Sutton et al. achieved accuracies of up to 89.2% for the differentiation of molecular subtypes but used a combination of pathology-derived features and COM radiomic features [32]. In this study, we intentionally chose to rely solely on radiomic features from CE-MRI and to dispense with features derived from invasive tissue sampling. Holli-Helenius aimed to differentiate between luminal A and B subtypes in a small collective of 27 patients, using COM features for radiomic analysis [33]. With areas under the curve of 0.83–0.88, these results were very similar to those in our own study (84.2% accuracy); in accordance to Holli-Helenius, COM features were of importance for the differentiation between those two groups. Notably, the results of Holli-Helenius mainly relied on features extracted from pre-contrast MR images, which were not included in our study. Nevertheless, our results generally confirm that luminal A and B cancers can be differentiated by MRI-based radiomic features with a high level of confidence.

Wang et al. developed a model to exclusively distinguish triple-negative cancer from other subtypes and achieved an area under the curve of 0.88 using COM features extracted from both the tumor and healthy breast parenchyma [21]. These results agree with our findings where we demonstrated a differentiation of triple negative from luminal B with an accuracy of 83.9% when employing radiomic features from the tumor itself but not from the healthy breast tissue. The good differentiation between triple-negative and luminal B cancers may be explained by the fact that the geometric features (Table 3) used for this task capture the typically round/circumscribed shape of triple negative and the irregular/spiculated shape of luminal B cancers. In addition, Fan et al. aimed to develop a model to distinguish between molecular breast cancer subtypes based on CE-MRI data of the tumor and healthy breast parenchyma [19]. They used COM and GEO radiomic features, dynamic enhancement features, and clinical parameters and achieved good to excellent discrimination of luminal A, luminal B, HER2-enriched, and triple-negative cancers from all other tumors. Subtype discrimination in this study was partially less successful than our study (e.g., for luminal B cancers) even though we did not incorporate clinical parameters, and in part more successful (e.g., for triple-negative cancers). While it must be noted that Fan et al. used a training and a validation dataset—a strategy which, especially in combination with artificial neural networks, is preferable when datasets are sufficiently large—they did not use the same set of radiomic features in the two datasets; thus, no true validation of their model was performed.

External validation of part of our results yielded satisfactory results for the differentiation of luminal A vs. luminal B (79.4%) and luminal B vs. triple negative (77.1%). These results might be attributed to the feature sets produced by Fisher coefficients in these two comparisons, because they contained only GEO features for luminal A vs. luminal B and two GEO features and one HIS feature for luminal B vs. triple negative (see Additional file 2: Table S2). Unlike texture features, GEO and HIS feature should not be sensitive to MR image acquisition parameters. As expected, due to technical differences (TR/TE, scanner model and vendor, etc.) between training and validation datasets, some of the other results were poor, indicating a partial lack of generalizability to a wider clinical setting. Notably, some were instances where class imbalance led to seemingly good results despite a lack of separability, for instance the differentiation of luminal B vs. all others (75%). In contrast to prior CE-MRI radiomic studies that used heterogeneous image datasets in terms of acquisition parameters [11, 16,17,18], the MRI protocol in this study was homogeneous, which might have contributed to the excellent results. Grimm et al. evaluated associations of molecular breast cancer subtypes with imaging characteristics that included morphologic, radiomic, and dynamic enhancement features using a semi-automatic approach for lesion segmentation (i.e., a fuzzy C-Means clustering algorithm [16]. They found that enhancement characteristics and morphological features were superior to radiomic features. However, in their study, there was a high level of heterogeneity in terms of scanning equipment and pulse sequence protocols. In particular, matrix size, which, through its association with spatial resolution (and thus, pixel/voxel size), has been shown to have a major impact on feature calculations [34] and thus could be responsible for the divergent results. Similar MRI acquisition protocol heterogeneities may also have contributed to the disappointing performance of radiomic features in the study by Mazurowski et al. that investigated associations of molecular subtypes and semi-automatically extracted CE-MRI data, including COM and GEO features [11]. In contrast to prior publications that investigated molecular subtyping with radiomic analysis, a novelty and strength of the present study is our assessment of a wealth of radiomic features derived from a homogeneous state-of-the-art MRI protocol. These prior divergent results highlight the necessity for rigorous standardization across institutions, vendors, and platforms to fully explore the potential of MRI radiomics in breast cancer, especially when sample sizes are limited. On the other hand, in truly large datasets (“big data”) in which a sufficiently large number of cases for each protocol variation is available, protocol heterogeneity could present an advantage with regard to generalizability.

We acknowledge the limitations of the present study. In this study, molecular subtypes were derived not from formal genetic testing but from IHC surrogates. It has to be acknowledged that there is variable agreement between classifications via these surrogates and formal genetic testing (41–100%). However, in clinical practice, molecular subtypes and subsequent clinical decision-making are most often based on IHC surrogates. Like the vast majority of previous radiomic studies, the conclusions that can be drawn from this study are somewhat limited due to its retrospective nature and the unequal distribution of molecular subtypes. Another drawback is (in comparison to luminal breast cancers) the smaller number of HER2-enriched and triple-negative cancers in our patient collective (Fig. 3); however, this reflects the normal distribution of molecular subtypes of breast cancer in a patient population. Due to this limitation, we decided not to divide our original study population into a training dataset and a validation dataset. Instead, we used a k-nearest neighbor classifier with leave-one-out cross-validation, a technique that does not require two separate datasets and has been used in numerous studies in the field [22, 35, 36], for all pairwise comparisons, even though n-fold cross-validation may have produced better results in the larger samples. It has to be acknowledged that as each training set used in leave-one-out cross-validation only differs by one sample, the models tend to be more highly correlated and less variability is encountered when training the model. Thus, there is a possibility of underestimating the error via leave-one-out cross-validation. As the dataset is relatively small (especially for some subgroups such as HER2-enriched and luminal B cancers), there is the possibility that some of our findings could be the product of overfitting. Nevertheless, we aimed to decrease the risk of overfitting by reducing the dimensionality of feature space to 10% of the samples out of the original ten features provided by MaZda (e.g., five features for 50 samples). Validation of the promising results of this initial study with larger patient numbers is currently ongoing. We acknowledge that the mean size of breast cancers included in this study was relatively large, which could explain the high number of triple-negative cancers. However, this would be the target population for neoadjuvant treatment where radiomic analysis may be used to monitor spatio-longitudinal tumor biology changes during treatment and thereby identify a treatment-resistant phenotype. Unlike techniques such as 18F-fluoroestradiol positron emission tomography, which directly visualizes functional estrogen receptors, radiomics cannot directly capture receptor density. However, estrogen/progesterone receptor responsiveness “fuels” certain growth patterns, which may be captured by radiomic signatures. GEO features are influenced by tumor size to some degree; however, the texture feature classes that reflect homogeneity/heterogeneity of the lesions should not be significantly influenced by tumor size. For instance, COM features are calculated with a maximum interpixel distance of n = 5; since in-plane resolution was approximately 1.26 mm, feature calculation would not have been affected in tumors of 1 cm or larger. We acknowledge that a comparative analysis of images and corresponding cross-sectional histological stains would be of special interest. However, a confident co-registration is challenging, and for tumors treated with neoadjuvant chemotherapy, treatment-naïve tumor specimens are not available. Breast cancers were delineated manually and on the slice with the largest lesion diameter only. The combination of CE-MRI with diffusion-weighted imaging data might further improve the results of radiomic analysis, as different aspects of tumor biology could be captured, i.e., heterogeneity in terms of perfusion on the one hand and diffusivity/cell density on the other hand. In addition, the integration of radiomic features from healthy breast parenchyma might also be of great interest [21].

Fig. 3
figure 3

Top: contrast-enhanced fat-saturated T1-weighted image of a 50-year-old patient with a HER2-enriched cancer in the left breast. Bottom: contrast-enhanced T1-weighted image of a 59-year-old patient with a triple-negative cancer in the right breast. Both lesions are irregularly shaped and margined, with heterogeneous contrast enhancement and central necrosis. Radiomic signatures derived from contrast-enhanced MRI (CE-MRI) accurately differentiated HER2-enriched from triple-negative breast cancer with an overall accuracy of 73.5% in our patient collective

Conclusion

In conclusion, radiomic signatures with CE-MRI may enable the assessment of breast cancer receptor status and molecular subtypes. Our initial results indicate that CE-MRI radiomic signatures may have the potential to provide prognostic indicators derived from the tumor in its entirety; theoretically, this information could be used to monitor spatio-longitudinal tumor biology changes during treatment. However, larger prospective studies and efforts in data and protocol standardization are warranted to validate these findings before definitive conclusions with regard to the value of CE-MRI radiomic features for distinguishing between breast cancer receptor status and molecular subtypes can be drawn.

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Abbreviations

ARM:

Autoregressive model

COM:

Co-occurrence matrix

CE-MRI:

Dynamic contrast-enhanced magnetic resonance imaging

GEO:

Lesion geometry

GRA:

Absolute gradient

HER2:

Human epidermal growth factor receptor 2

HIS:

First-order histogram

IHC:

Immunohistochemical

LDA:

Linear discriminant analysis

MDF:

Most discriminating feature

POE + ACC:

Probability of error and average correlation

RLM:

Run-length matrix

WAV:

Discrete Haar wavelet transform

References

  1. Carey LA, Perou CM, Livasy CA, Dressler LG, Cowan D, Conway K, et al. Race, breast cancer subtypes, and survival in the Carolina Breast Cancer Study. JAMA. 2006;295:2492–502.

    Article  CAS  Google Scholar 

  2. Huber KE, Carey LA, Wazer DE. Breast cancer molecular subtypes in patients with locally advanced disease: impact on prognosis, patterns of recurrence, and response to therapy. Semin Radiat Oncol. 2009;19:204–10.

    Article  Google Scholar 

  3. Lam SW, Jimenez CR, Boven E. Breast cancer classification by proteomic technologies: current state of knowledge. Cancer Treat Rev. 2014;40:129–38.

    Article  CAS  Google Scholar 

  4. Wirapati P, Sotiriou C, Kunkel S, Farmer P, Pradervand S, Haibe-Kains B, et al. Meta-analysis of gene expression profiles in breast cancer: toward a unified understanding of breast cancer subtyping and prognosis signatures. Breast Cancer Res. 2008;10:R65.

    Article  Google Scholar 

  5. Haynes B, Sarma A, Nangia-Makker P, Shekhar MP. Breast cancer complexity: implications of intratumoral heterogeneity in clinical management. Cancer Metastasis Rev. 2017;36:547–55.

    Article  Google Scholar 

  6. Zardavas D, Irrthum A, Swanton C, Piccart M. Clinical management of breast cancer heterogeneity. Nat Rev Clin Oncol. 2015;12:381–94.

    Article  CAS  Google Scholar 

  7. Martelotto LG, Ng CK, Piscuoglio S, Weigelt B, Reis-Filho JS. Breast cancer intra-tumor heterogeneity. Breast Cancer Res. 2014;16:210.

    Article  Google Scholar 

  8. Cancer Genome Atlas N. Comprehensive molecular portraits of human breast tumours. Nature. 2012;490:61–70.

    Article  Google Scholar 

  9. Prat A, Perou CM. Deconstructing the molecular portraits of breast cancer. Mol Oncol. 2011;5:5–23.

    Article  CAS  Google Scholar 

  10. Prat A, Pineda E, Adamo B, Galvan P, Fernandez A, Gaba L, et al. Clinical implications of the intrinsic molecular subtypes of breast cancer. Breast. 2015;24(Suppl 2):S26–35.

    Article  Google Scholar 

  11. Mazurowski MA, Zhang J, Grimm LJ, Yoon SC, Silber JI. Radiogenomic analysis of breast cancer: luminal B molecular subtype is associated with enhancement dynamics at MR imaging. Radiology. 2014;273:365–72.

    Article  Google Scholar 

  12. Tsoutsou PG, Vozenin MC, Durham AD, Bourhis J. How could breast cancer molecular features contribute to locoregional treatment decision making? Crit Rev Oncol Hematol. 2017;110:43–8.

    Article  Google Scholar 

  13. Ahn HJ, Jung SJ, Kim TH, Oh MK, Yoon HK. Differences in clinical outcomes between luminal A and B type breast cancers according to the St. Gallen Consensus 2013. J Breast Cancer. 2015;18:149–59.

    Article  Google Scholar 

  14. Senn HJ. St. Gallen consensus 2013: optimizing and personalizing primary curative therapy of breast cancer worldwide. Breast Care (Basel). 2013;8:101.

    Article  Google Scholar 

  15. Pisco AO, Huang S. Non-genetic cancer cell plasticity and therapy-induced stemness in tumour relapse: ‘What does not kill me strengthens me’. Br J Cancer. 2015;112:1725–32.

    Article  CAS  Google Scholar 

  16. Grimm LJ, Zhang J, Baker JA, Soo MS, Johnson KS, Mazurowski MA. Relationships between MRI breast imaging-reporting and data system (BI-RADS) lexicon descriptors and breast cancer molecular subtypes: internal enhancement is associated with luminal B subtype. Breast J. 2017;23:579–82.

    Article  Google Scholar 

  17. Li H, Zhu Y, Burnside ES, Huang E, Drukker K, Hoadley KA, et al. Quantitative MRI radiomics in the prediction of molecular classifications of breast cancer subtypes in the TCGA/TCIA data set. NPJ Breast Cancer. 2016;2:16012.

  18. Waugh SA, Purdie CA, Jordan LB, Vinnicombe S, Lerski RA, Martin P, et al. Magnetic resonance imaging texture analysis classification of primary breast cancer. Eur Radiol. 2016;26:322–30.

    Article  CAS  Google Scholar 

  19. Fan M, Li H, Wang S, Zheng B, Zhang J, Li L. Radiomic analysis reveals DCE-MRI features for prediction of molecular subtypes of breast cancer. PLoS One. 2017;12:e0171683.

    Article  Google Scholar 

  20. Wu J, Sun X, Wang J, Cui Y, Kato F, Shirato H, et al. Identifying relations between imaging phenotypes and molecular subtypes of breast cancer: model discovery and external validation. J Magn Reson Imaging. 2017;46:1017–27.

    Article  Google Scholar 

  21. Wang J, Kato F, Oyama-Manabe N, Li R, Cui Y, Tha KK, et al. Identifying triple-negative breast cancer using background parenchymal enhancement heterogeneity on dynamic contrast-enhanced MRI: a pilot radiomics study. PLoS One. 2015;10:e0143308.

    Article  Google Scholar 

  22. Mayerhoefer ME, Schima W, Trattnig S, Pinker K, Berger-Kulemann V, Ba-Ssalamah A. Texture-based classification of focal liver lesions on MRI at 3.0 Tesla: a feasibility study in cysts and hemangiomas. J Magn Reson Imaging. 2010;32:352–9.

    Article  Google Scholar 

  23. Thakur SB, Durando M, Milans S, Cho GY, Gennaro L, Sutton EJ, et al. Apparent diffusion coefficient in estrogen receptor-positive and lymph node-negative invasive breast cancers at 3.0T DW-MRI: a potential predictor for an oncotype Dx test recurrence score. J Magn Reson Imaging. 2018;47:401–9.

    Article  Google Scholar 

  24. Collewet G, Strzelecki M, Mariette F. Influence of MRI acquisition protocols and image intensity normalization methods on texture classification. Magn Reson Imaging. 2004;22:81–91.

    Article  CAS  Google Scholar 

  25. Hu Y, Dennis TJ. Textured image segmentation by context enhanced clustering. IEE Proc Vis Image Signal Process. 1994;141:413–21.

    Article  Google Scholar 

  26. Kashyap R, Chellappa R. Estimation and choice of neighbors in spatial-interaction models of images. IEEE Trans Inf Theory. 1983;29:60–72.

    Article  Google Scholar 

  27. Hajek M, Dezortova M, Materka A, Lerski RA. Texture analysis for magnetic resonance imaging. Prague: Med4Publishing; 2006. p. 234.

  28. Fukunaga K. Introduction to statistical pattern recognition. 2nd ed. San Diego: Academic Press Professional, Inc; 1990.

    Google Scholar 

  29. Duda R, Hart PE, Stork DG. Pattern classification. 2nd ed. United States: Wiley; 2012.

    Google Scholar 

  30. Pinker K, Moy L, Sutton EJ, Mann RM, Weber M, Thakur SB, et al. Diffusion-weighted imaging with apparent diffusion coefficient mapping for breast cancer detection as a stand-alone parameter: comparison with dynamic contrast-enhanced and multiparametric magnetic resonance imaging. Investig Radiol. 2018;53:587–95.

    Google Scholar 

  31. Hon JD, Singh B, Sahin A, Du G, Wang J, Wang VY, et al. Breast cancer molecular subtypes: from TNBC to QNBC. Am J Cancer Res. 2016;6:1864–72.

    CAS  PubMed  PubMed Central  Google Scholar 

  32. Sutton EJ, Dashevsky BZ, Oh JH, Veeraraghavan H, Apte AP, Thakur SB, et al. Breast cancer molecular subtype classifier that incorporates MRI features. J Magn Reson Imaging. 2016;44:122–9.

    Article  Google Scholar 

  33. Holli-Helenius K, Salminen A, Rinta-Kiikka I, Koskivuo I, Bruck N, Bostrom P, et al. MRI texture analysis in differentiating luminal A and luminal B breast cancer molecular subtypes - a feasibility study. BMC Med Imaging. 2017;17:69.

    Article  Google Scholar 

  34. Mayerhoefer ME, Szomolanyi P, Jirak D, Materka A, Trattnig S. Effects of MRI acquisition parameter variations and protocol heterogeneity on the results of texture analysis and pattern discrimination: an application-oriented study. Med Phys. 2009;36:1236–43.

    Article  Google Scholar 

  35. Li Z, Mao Y, Li H, Yu G, Wan H, Li B. Differentiating brain metastases from different pathological types of lung cancers using texture analysis of T1 postcontrast MR. Magn Reson Med. 2016;76:1410–9.

    Article  Google Scholar 

  36. Fruehwald-Pallamar J, Czerny C, Holzer-Fruehwald L, Nemec SF, Mueller-Mang C, Weber M, et al. Texture-based and diffusion-weighted discrimination of parotid gland lesions on MR images at 3.0 Tesla. NMR Biomed. 2013;26:1372–9.

    Article  Google Scholar 

Download references

Acknowledgements

The authors acknowledge the support in manuscript editing from Joanne Chin.

Funding

Funding was provided by the Austrian Nationalbank ‘Jubiläumsfond’ Project Nr. 16219, the 2020 - Research and Innovation Framework Programme PHC-11-2015 Nr. 667211-2 and seed grants from Siemens Austria, Novomed, and Guerbet, France. This research was also funded in part through the NIH/NCI Cancer Center Support Grant P30 CA008748, the Breast Cancer Research Foundation, and the Susan G. Komen Foundation.

Author information

Authors and Affiliations

Authors

Contributions

KP and DL contributed to the conception and design. KP, BB-D, and DL contributed to the development of the methodology. ST, DL, and MAM contributed to the acquisition of the data. KP, JVH, and BB-D contributed to the analysis and interpretation of data. KP, DL, MSJ, MAM, REO-A, and EAM contributed to the writing, review, and/or revision of the manuscript. EAM, DFM, and JVH contributed to the administrative, technical, or material support. KP, ST, and EA M contributed to the study supervision. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Katja Pinker.

Ethics declarations

Ethics approval and consent to participate

This single-institution retrospective study was compliant with the Health Insurance Portability and Accountability Act guidelines and approved by the Memorial Sloan Kettering Cancer Center Institutional Review Board with a waiver of written informed consent.

Consent for publication

Not applicable

Competing interests

Elizabeth A. Morris and Katja Pinker were in part funded by the project: Use Of ctNA To Distinguish Between Benign and Malignant BIRADS 4 Radiographic Lesions. Maxine S Jochelson has received speaker honoraria from GE Healthcare. The other authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

Table S1. Detailed results of group-wise radiomic feature-based cancer classifications for molecular breast cancer subtypes / receptor status (training dataset). The number of features used for classification (one feature per every ten sample) is written above the accuracies. The feature selection algorithm and number of most discriminating features are given in parentheses. (DOCX 24 kb)

Additional file 2:

Table S2. Selected features sets for all pairwise classifications (training dataset). Values isn parentheses represent coordinates: information about direction and interpixel distance for pixel pairs. The full list of features and their abbreviations can be accessed at http://www.eletel.p.lodz.pl/programy/mazda/download/FeaturerList.pdf (DOCX 26 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Leithner, D., Horvat, J.V., Marino, M.A. et al. Radiomic signatures with contrast-enhanced magnetic resonance imaging for the assessment of breast cancer receptor status and molecular subtypes: initial results. Breast Cancer Res 21, 106 (2019). https://doi.org/10.1186/s13058-019-1187-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13058-019-1187-z

Keywords