- Research article
- Open Access
Magnetic resonance imaging and molecular features associated with tumor-infiltrating lymphocytes in breast cancer
Breast Cancer Researchvolume 20, Article number: 101 (2018)
We sought to investigate associations between dynamic contrast-enhanced (DCE) magnetic resonance imaging (MRI) features and tumor-infiltrating lymphocytes (TILs) in breast cancer, as well as to study if MRI features are complementary to molecular markers of TILs.
In this retrospective study, we extracted 17 computational DCE-MRI features to characterize tumor and parenchyma in The Cancer Genome Atlas cohort (n = 126). The percentage of stromal TILs was evaluated on H&E-stained histological whole-tumor sections. We first evaluated associations between individual imaging features and TILs. Multiple-hypothesis testing was corrected by the Benjamini-Hochberg method using false discovery rate (FDR). Second, we implemented LASSO (least absolute shrinkage and selection operator) and linear regression nested with tenfold cross-validation to develop an imaging signature for TILs. Next, we built a composite prediction model for TILs by combining imaging signature with molecular features. Finally, we tested the prognostic significance of the TIL model in an independent cohort (I-SPY 1; n = 106).
Four imaging features were significantly associated with TILs (P < 0.05 and FDR < 0.2), including tumor volume, cluster shade of signal enhancement ratio (SER), mean SER of tumor-surrounding background parenchymal enhancement (BPE), and proportion of BPE. Among molecular and clinicopathological factors, only cytolytic score was correlated with TILs (ρ = 0.51; 95% CI, 0.36–0.63; P = 1.6E-9). An imaging signature that linearly combines five features showed correlation with TILs (ρ = 0.40; 95% CI, 0.24–0.54; P = 4.2E-6). A composite model combining the imaging signature and cytolytic score improved correlation with TILs (ρ = 0.62; 95% CI, 0.50–0.72; P = 9.7E-15). The composite model successfully distinguished low vs high, intermediate vs high, and low vs intermediate TIL groups, with AUCs of 0.94, 0.76, and 0.79, respectively. During validation (I-SPY 1), the predicted TILs from the imaging signature separated patients into two groups with distinct recurrence-free survival (RFS), with log-rank P = 0.042 among triple-negative breast cancer (TNBC). The composite model further improved stratification of patients with distinct RFS (log-rank P = 0.0008), where TNBC with no/minimal TILs had a worse prognosis.
Specific MRI features of tumor and parenchyma are associated with TILs in breast cancer, and imaging may play an important role in the evaluation of TILs by providing key complementary information in equivocal cases or situations that are prone to sampling bias.
Immunotherapy for treating patients with cancer has generated much excitement in recent years . Compared with conventional therapies, immune checkpoint blockade (ICB) such as anti-PD1 therapy has achieved durable clinical response and long-term survival benefit in a variety of cancer types [2, 3]. However, only a small proportion of patients respond to current immunotherapy, underscoring the need for predictive biomarkers to identify appropriate patients . One promising biomarker is tumor-infiltrating lymphocytes (TILs), because it is now recognized that a preexisting antitumor immunity is required for the success of ICB-based immunotherapy . In breast cancer, there is strong evidence for the prognostic and predictive value of TILs . Several large clinical trials have demonstrated that TILs are associated with pathological complete response and prognosis after chemotherapy or targeted therapies, particularly in triple-negative breast cancer (TNBC) and human epidermal growth factor receptor 2 (HER2)-positive breast cancer [7,8,9,10,11,12,13,14].
The evaluation of TILs involves visualization and measurement of lymphocytes on H&E-stained histological slides of tumor samples . Current guidelines issued by the International Immuno-Oncology Biomarker Working Group on Breast Cancer recommend that evaluation of TILs be performed in the stromal rather than intraepithelial compartments, and preferably on whole tumor sections over core biopsies . Despite numerous efforts to standardize the evaluation of TILs, this process remains laborious and subjective with inter- and intrarater variability . Moreover, evaluation of TILs in the preoperative neoadjuvant setting is problematic because of heterogeneous tumor shrinkage patterns and sampling bias in a biopsy. A more objective, consistent method to evaluate TILs in breast cancer would be extremely valuable.
Imaging allows noninvasive visualization of the entire tumor and its surrounding tissue. Recent studies have demonstrated associations between specific magnetic resonance imaging (MRI) features and pathological or molecular patterns, such as molecular subtypes [17,18,19,20,21,22] and gene expression signatures or pathways [23,24,25,26,27,28]. These data support the underlying pathophysiology of the disease being reflected on imaging at a macroscopic level, and this link may be revealed by a more detailed comprehensive image analysis.
The purpose of this study was to investigate the association between MRI features and TILs in breast cancer. We explored whether computational imaging features could be used to predict TILs. Further, we constructed a composite prediction model by integrating imaging and immune-related molecular features and validated its clinical relevance in an independent cohort.
We carried out this institutional review board-approved, Health Insurance Portability and Accountability Act (HIPAA)-compliant retrospective study in three steps (Fig. 1). First, we characterized both tumor and parenchymal enhancement patterns at dynamic contrast-enhanced (DCE) MRI and evaluated their association with TILs. Second, we built a composite model to predict TILs by integrating imaging with molecular and clinicopathological data. Third, we tested the prognostic significance of the TIL model in an independent cohort.
We analyzed two breast cancer cohorts from The Cancer Genome Atlas (TCGA) project  and the I-SPY 1 (Investigation of Serial Studies to Predict Your Therapeutic Response with Imaging And moLecular Analysis) trial . For this study, the inclusion criteria for TCGA cohort were (1) pathologically proven invasive carcinomas, (2) pretreatment DCE-MRI data available, (3) H&E-stained whole-tumor tissue sections available, and (4) tumor gene expression data from RNA-sequencing (RNA-seq) and mutational data from whole-exome sequencing available. We applied similar inclusion criteria to select patients from the I-SPY 1 cohort, except that outcomes were available, but H&E-stained slides and mutational data were not required. After selection, 126 patients from TCGA and 105 patients from I-SPY 1 were eligible for the proposed study. The detailed selection procedures are shown in Additional file 1: Figure S1. Clinical and imaging data are publicly available for both cohorts from The Cancer Imaging Archive (TCIA) (www.cancerimagingarchive.net).
Evaluation of tumor-infiltrating lymphocytes
TILs were evaluated for TCGA cohort, for which detailed biospecimen collection and processing protocols have been described elsewhere . In brief, the tumor sections were collected from surgical specimens and reviewed by a board-certified pathologist to confirm the presence of invasive carcinoma. The H&E-stained whole-slide tumor sections were digitally scanned and are available from the Cancer Digital Slide Archive (http://cancer.digitalslidearchive.net/).
Two pathologists (XT and XL, with 30 and 5 years of experience, respectively, in reading breast cancer tissue slides) evaluated TILs in consensus based on the recommendations from the International Immuno-Oncology Biomarker Working Group on Breast Cancer . Two pathologists simultaneously reviewed the digital slides of each patient from the Cancer Digital Slide Archive, and the TILs were measured as the percentage of lymphocytes and macrophages in the area of total intratumoral stromal compartments. In addition, three discrete categories are defined, with ≤ 10%, > 10% to ≤40%, and > 40% to ≤ 90% TILs indicating tumors with no/minimal, intermediate, and high lymphocyte infiltration, respectively . To assess interrater variability, we calculated the intraclass correlation coefficient (ICC) between our TIL percentage and those reported in a previous study focused on TNBC  for 15 overlapped cases in TCGA cohort.
The detailed imaging protocol for TCGA cohort has been reported elsewhere . In brief, the scans were performed between September 1999 and June 2006 at six participating centers with a 1.5-T or 3-T GE Healthcare (Milwaukee, MI, USA), Siemens (Erlangen, Germany), or Philips (Amsterdam, The Netherlands) whole-body MRI system with a standard double-breast coil. The dynamic protocol of DCE-MRI was in accordance with the American College of Radiology guidelines, which included one precontrast and two to seven postcontrast scans (with a gadolinium-based contrast agent), in either the sagittal or axial view.
The detailed imaging protocol for the I-SPY cohort was reported elsewhere [32, 33]. To match the MRI from TCGA cohort, we focused on the scans acquired before neoadjuvant chemotherapy (i.e., baseline scans). MRI was performed through a 1.5-T GE Healthcare, Siemens, or Philips system, with a dedicated breast radiofrequency coil. The DCE-MRI protocols include one precontrast scan and two postcontrast phases with one ~ 2.5 minutes and another one ~ 7.5 minutes.
Image processing and harmonization
Given the diverse imaging protocols within the multicenter TCGA data and I-SPY 1 cohorts, we developed a pipeline to normalize the imaging data before extracting quantitative feature. First, we applied the N4 bias correction to correct for shading artifacts. Next, we standardized the temporal resolution of DCE-MRI scans in TCGA and I-SPY cohorts. In particular, for each patient, we included DCE-MRI before contrast agent administration and two postcontrast scans, with one having a 2–3-minute delay and the other having an ~ 7.5-minute delay. Third, to explicitly account for heterogeneous imaging protocols, for each individual, the voxel values of DCE-MRI were normalized by the parenchyma without contrast (i.e., the average value of interquartile voxel from parenchyma before administrating contrast). Finally, the MRI scans were resized to have an isotropic voxel resolution of 1 mm to assure consistent and meaningful computation of 3D textural features.
Tumor and background parenchyma segmentation
The detailed process used for segmentation was reported elsewhere [28, 34]. Briefly, two radiologists with 14 and 11 years of experience, respectively, in breast imaging manually delineated the 3D tumor slice-by-slice and reached consensus regarding 3D tumor contours. The ipsilateral parenchyma was segmented automatically through Fuzzy C-means clustering. The 3D parenchymal segmentation was inspected by two radiologists, and they manually revised it when necessary.
MRI feature exaction
The rationale of feature extraction is to provide a comprehensive characterization of breast cancer at DCE-MRI. We initially extracted 110 computational imaging features as defined in a previous study  and removed those with linear ICCs above 0.85. For correlated features, the one that showed highest robustness with respect to tumor contour variations (manual segmentation vs automatic segmentation via Fuzzy C-means clustering) was kept, similar to previous studies [34, 35]. As a result, 17 nonredundant imaging features remained. The selected features characterize the tumoral and parenchymal phenotypes at DCE-MRI, which include five tumor morphological features, four tumor texture features, two functional tumor volume features, four background parenchymal enhancement (BPE) features, and two tumor-surrounding PE features. The mathematical formulation and the interpretation and clinical relevance [27, 28, 32, 36,37,38] of these features are elaborated in Table 1. The computation of all imaging features was implemented automatically with MATLAB software (MathWorks, Natick, MA, USA).
Molecular features related to tumor-infiltrating lymphocytes
Tumor mutation burden is an important genetic factor in mediating antitumor immunity. Tumors with a higher mutation load are associated with a higher neoantigen level and thus are more immunogenic and likely to have higher immune infiltration and more TILs . The cytolytic activity reflects local immune effector function and can indicate the presence of TILs. Indeed, cytolytic activity computed from the gene transcript levels of two critical immune cytolytic effectors , perforin (PRF1) and granzyme A (GZMA), has been shown to be closely related to immune infiltration and CD8+ T-cell activation [41, 42]. For TCGA breast cancer cohort, gene expression data from RNA-seq and mutational data from whole-exome sequencing are available in the Genomic Data Commons (https://gdc.cancer.gov/). On the basis of these data, we computed the nonsynonymous somatic mutational burden and cytolytic activity score, defined as the geometric mean of the expression of two genes: GZMA and PRF1 . Similarly for the I-SPY 1 cohort, we computed the cytolytic activity score on the basis of microarray gene expression data available from the Gene Expression Omnibus (https://www.ncbi.nlm.nih.gov/geo/; [GEO:GSE22226]) . The ComBat algorithm  was implemented to harmonize the gene expression data from TCGA and I-SPY.
Association with tumor-infiltrating lymphocytes and predictive modeling
We first evaluated the Pearson linear correlation between individual imaging features and percentage of TILs in TCGA cohort. Next, we built a predictive model for TILs by combining multiple imaging features into an imaging signature. For this purpose, we used linear regression with feature selection via LASSO (least absolute shrinkage and selection operator)  to avoid overfitting. In addition, tenfold cross-validation was applied and repeated 100 times to minimize the selection bias. The most frequently selected imaging features (> 90%) were used to fit the final model. Further, we investigated whether combining the imaging signature with immune-related molecular features (cytolytic score and somatic mutation burden) would improve prediction accuracy for TILs by fitting a composite model via multivariate linear regression.
To evaluate the prediction models, we calculated the Pearson linear correlation between pathologist-rated and estimated percentage of TILs. In addition, patients were divided into three recognized TIL categories (low, intermediate, and high immune infiltration) , and pairwise classification among the three categories was evaluated. We compared the performance of the composite model with molecular features based on cytolytic score and imaging signature. In particular, the ROC analysis and AUC were used to assess the binary prediction accuracy of the models. The threshold used to separate different prediction models was defined on the basis of Youden’s J statistics , and the corresponding sensitivity, specificity, and accuracy were reported. Finally, we tested prognostic significance of the imaging signature as well as the composite TIL model by assessing their association with recurrence-free survival (RFS) in the entire I-SPY 1 cohort as well as in clinically relevant subgroups according to the receptor status. Because the prognostic value of TILs seems to be strongest in TNBC [11, 13], we expect that the composite model would also be prognostic within the TNBC subgroup in the I-SPY 1 cohort.
In univariate analysis, to adjust for multiple statistical testing, the Benjamini-Hochberg method was used to control the false discovery rate (FDR). The Mann-Whitney U statistic was used to assess the statistical significance of binary classification of TIL categories by comparing the prediction models with a random guess with an AUC of 0.5. The DeLong test was used to determine the 95% CIs and compute P values for the comparison of ROC curves. The Cox proportional hazards model was used to build survival models. Kaplan-Meier analysis was used to estimate survival probability. The log-rank test and concordance index were used to assess prognostic performance. All statistical tests were two-sided. P value < 0.05 and FDR < 0.2 were considered to be statistically significant. Statistical analysis was performed in R (R Foundation for Statistical Computing, Vienna, Austria).
Patient characteristics and tumor-infiltrating lymphocyte evaluation
Among 1098 cases in TCGA breast cancer cohort, 126 patients were eligible for our study. A majority (n = 92, 73%) of patients had low immune infiltration (0–10% TILs) in their tumor stroma, whereas 20% (n = 25) and 7% (n = 9) of patients had intermediate and high immune infiltration, respectively. Clinicopathological characteristics of patients in each of the three TIL categories are shown in Table 2. There was high reproducibility between TILs measured by our pathologists and previously reported values with ICC of 0.80 (P = 0.002). For the I-SPY 1 cohort, 105 patients were eligible and included in this study (patient characteristics summarized in Additional file 2: Table S1).
Imaging features associated with tumor-infiltrating lymphocytes
Each of the 17 imaging features independently characterizes the cancer phenotypes, and their pairwise correlation map is shown in Additional file 1: Figure S2. Figure 2 shows the heat map of 17 imaging features for 126 patients in TCGA cohort ranked on the basis of their TILs, monotonically increasing from left to right. In the univariate analysis, 4 of 17 imaging features were significantly associated with the percentage of TILs (P < 0.05 and FDR < 0.2), as shown in Fig. 3. Among these four features, the tumor volume was positively correlated with TILs, whereas cluster shade of signal enhancement ratio (SER) map, mean SER of tumor surrounding BPE, and proportion of BPE were negatively correlated with TILs (Additional file 2: Table S2). Next, we built an imaging signature for TILs by fitting a linear model, which consisted of five imaging features: 4.4 × M1 − 3.14 × TEX2 − 2.0 × TS − BPE2 − 2.62 × BPE1 − 0.72 × BPE3 + 13.02, where M1 = tumor volume, TEX2 = cluster shade of SER map, TS-BPE2 = mean SER of tumor surrounding BPE (2 cm), BPE1 = BPE volume (percentage enhancement or PE > 20%), and BPE3 = BPE proportion (PE, > 20%). The mean and SD values of the five selected imaging features are shown in Additional file 2: Table S3. This imaging signature had a moderate linear correlation with TILs (ρ = 0.40; 95% CI, 0.24–0.54; P = 4.2E-6). Moreover, the imaging signature is able to separate three TILs categories in pairwise fashion (Fig. 4a), with prediction accuracy of 0.73, 0.71, and 0.71, respectively (Table 3). Figure 5 showed the details of three representative patients where there is good agreement between the predicted TILs from proposed imaging signatures and TIL readings by two pathologists.
Relationships between imaging, molecular signatures, and tumor-infiltrating lymphocytes
We evaluated the associations between imaging and immune-related molecular features, as well as the percentage of TILs, in TCGA cohort. In addition to the imaging signature, cytolytic score was significantly associated with TILs (ρ = 0.51; 95% CI, 0.36–0.63; P = 1.6E-9) (Fig. 4b), whereas none of the clinicopathological factors or the somatic mutation burden were correlated with TILs (Table 4). We found that five imaging features were significantly associated with mutation burden, but none was associated with cytolytic score (Fig. 3, Additional file 2: Table S4). This suggests that imaging and cytolytic score are independent and could be complementary to each other for predicting TILs. For three cases in Fig. 5, the imaging signature can provide more accurate prediction of TILs than the model of cytolytic score.
Composite model for tumor-infiltrating lymphocytes
On multivariate analysis, the imaging signature and cytolytic score remained as independent predictors of TILs (P = 0.004 and P < 0.0001, respectively) after adjusting for stage, estrogen receptor/progesterone receptor/HER2 status, and mutation burden (Table 4). We retained both significant variables and refitted a composite model for predicting TILs: 5.86 × Imaging Signature + 7.78 × Cytolytic Score + 13.0. The linear correlation between the composite model and TILs was improved (ρ = 0.62; 95% CI, 0.50–0.72; P = 9.7E-15). Detailed box plots of inferred TILs from the composite model vs the original pathologists’ readings are presented in Fig. 4c.
We tested the composite model for predicting three predefined TILs categories. As shown in Fig. 6a, cytolytic score alone could not differentiate between intermediate and high TILs (AUC, 0.63; P = 0.14). By integrating imaging signature and cytolytic score, the composite model successfully separated these two TILs groups (AUC, 0.76; P = 0.01), and the improvement was statistically significant (DeLong test P = 0.039). Similar results were observed for differentiating low vs high TILs groups (AUC, 0.88 vs 0.94) (Fig. 6b). For distinguishing low and intermediate groups, there was no significant improvement using the composite model over cytolytic score (AUC, 0.77 vs 0.79) (Additional file 1: Figure S3). In addition, we performed a detailed evaluation of the proposed composite model, as in Table 3, where the composite model’s accuracy is 0.85, 0.91, and 0.75, respectively.
Clinical validation of the composite model
The previously developed composite model was used to infer TILs based on imaging and molecular data in an independent cohort from the I-SPY 1 trial. We found that hormone receptor-negative (HR−)/HER2− or TNBC had significantly higher predicted TILs than HR+/HER2− breast cancer (P = 0.049) (Additional file 1: Figure S4). With the threshold values obtained from the training cohort, we divided the patients into three groups based on the predicted TIL values. Then, we investigated the relationship between predicted TIL groups and outcomes. In TNBC, patients without recurrence had significantly higher predicted TILs than those who developed recurrence (P = 0.024) (Fig. 7a). Within the TNBC group, distinct RFS exists between the predicted no/minimal TIL group and the predicted high/intermediate TILs group (log-rank P = 0.0008) (Fig. 8a), where the group with lower TILs had significantly worse prognosis. However, predicted TIL groups were not associated with RFS in HR+/HER2− or HER2+ breast cancer (Fig. 7b and c and Fig. 8b and c, respectively).
Additionally, to validate the imaging signature for TILs, we applied it to 44 patients with TNBC who had images publicly available in the I-SPY 1 cohort. Similarly, with the threshold value obtained from the training cohort, we classified the patients into three TIL categories. A trend similar to that for the composite mode was observed, where no/minimal TILs had a significantly worse prognosis than high/intermediate TILs regarding their RFS (log-rank P = 0.042) (Fig. 8d).
In this study, we aimed to dissect the complex tumor-immune interactions in breast cancer [5, 6] by integrating imaging, genomic, and histological data. In particular, our pilot study showed that the percentage of stromal TILs evaluated on histological tissue sections were significantly associated with specific enhancement patterns of tumor and surrounding parenchyma at DCE-MRI. Our findings are consistent with a recent study that demonstrated a link between heterogeneous enhancement of tumor-adjacent parenchyma on DCE-MRI and dysregulated tumor necrosis factor signaling pathway in breast cancer . Both studies support the role of inflammatory or immune response in breast cancer progression and its relationship to specific parenchymal enhancement patterns at DCE-MRI. Consistent with previous work, we found that TILs were also associated with cytolytic activity but not with tumor mutational burdens in TCGA breast cancer cohort . In addition to identifying associations, we further developed and evaluated prediction models for TILs that integrated imaging and genomic data. Our results show that a composite model combining an imaging signature and cytolytic score achieved augmented linear correlation with TILs compared with using either alone. The composite model showed good or excellent discriminative ability among low, intermediate, and high TIL groups, with AUCs ranging from 0.76 to 0.94.
The reliable evaluation of TILs has significance and clinical implications in breast cancer. Abundant evidence has demonstrated that TILs have strong prognostic and predictive value in specific breast cancer subtypes . For localized breast cancer, TILs have been shown to be associated with pathological complete response and prognosis after chemotherapy or targeted therapies for localized breast cancer. This suggests that our imaging and molecular signature for TILs could help select patients who will be most likely respond to and benefit from neoadjuvant chemotherapy or targeted therapies in locally advanced breast cancer. On the other hand, they may also be used in combination with established clinical and pathological criteria to identify low-risk patients with a favorable prognosis who might be spared adjuvant chemotherapy in early-stage breast cancer. However, this will require further validation in prospective clinical studies. Although there are yet no U.S. Food and Drug Administration-approved immunotherapies for breast cancer, numerous trials are ongoing with the goal of assessing the clinical activity and potential benefit of ICB . Because the success of ICB hinges on a preexisting antitumor immunity, which is manifested as TILs, TILs could serve as useful predictive biomarkers to select patients who are likely to benefit from immunotherapy.
The current gold standard for evaluating TILs is based on pathologists’ visual assessment of H&E-stained whole-tumor tissue sections. This approach is limited mainly by intra- and interrater variability . Gene expression profiles can also reflect antitumor immune response. Recently, a two-gene cytolytic score was proposed to characterize local immune infiltration and cytolytic activity in a large study across 18 tumor types in TCGA . Nonetheless, pathological or molecular evaluation of TILs may be confounded by the spatial intratumoral heterogeneity, especially in the neoadjuvant setting with core biopsies . On the other hand, imaging provides a global, unbiased picture of the entire tumor and its surrounding tissue, potentially allowing more reproducible evaluation for TILs. Our results demonstrate that, compared with cytolytic score, the imaging signature may be particularly useful in distinguishing tumors with high vs intermediate TILs. Although imaging analysis alone cannot replace pathological evaluation for TILs, our study supports imaging playing an important role in this process by providing key complementary information in equivocal cases or situations that are prone to sampling bias (e.g., in core biopsy). In our study, given the tumor contours manually delineated by radiologists, the subsequent imaging analysis was fully automatic. With the rapid advancement of machine learning in radiology , we anticipate that much of the process will be automated and that radiological interpretation bias can be minimized. One unique advantage of MRI is that it provides a global view of the whole tumor as well as its surrounding parenchyma, which overcomes the issue of sampling bias in core biopsy.
We validated the clinical relevance of our composite prediction model for TILs in an independent cohort. Consistent with previous findings , we showed that TNBC had significantly higher predicted TILs than HR+/HER2− breast cancer. So far, the strongest evidence for the prognostic value of TILs has been in TNBC, whereas its significance is more mixed in HER2+ and seems uncertain in HR+/HER2− subtypes . We confirmed that higher TILs predicted by the composite model were indeed associated with better prognosis and RFS in TNBC, but not among other subtypes. Given the relatively small number of TNBC cases in the I-SPY 1 cohort, it would be important to further validate the model in future studies with more patients.
Our study adds to the growing body of literature where a more detailed comprehensive analysis of imaging phenotypes could reveal the underlying tumor pathophysiology at the molecular or pathological level [17,18,19,20,21,22,23,24,25,26,27,28, 53,54,55]. Different from previous radiogenomic studies that focused on analyses of imaging and genomic properties of the tumor, our study focused on immune infiltration in the stroma and included imaging features of the tumor as well as its surrounding parenchymal tissue. Another distinction is that previous work aimed to find correlation (i.e., similarity) between imaging and molecular data, whereas our study demonstrates that imaging can provide independent value and complement molecular profiles for predicting TILs.
There are several limitations of this study. The images and samples in TCGA cohort were retrospectively collected, which may not be a representative patient population for breast cancer. The association findings in this study should be interpreted as hypothesis-generating, and the composite prediction model for TILs requires validation in large, ideally prospective cohorts. Owing to the limited sample size, our analysis may have been insufficiently powered to detect differences in TILs by receptor status. Future work is needed to confirm the findings in a subtype-specific manner. In addition, there are diverse imaging acquisition protocols in the multi-institutional TCGA cohort, which may have confounded our analysis. Despite our efforts to harmonize imaging data, uncertainty could remain. Finally, we focused on DCE-MRI for association with TILs. Additional imaging modalities such as T2-weighted and diffusion-weighted MRI may be incorporated in future studies.
We showed that specific tumoral and parenchymal imaging features are associated with TILs and that integration of imaging and molecular features allows for better prediction of TILs in breast cancer. These preliminary findings should be validated in additional larger studies.
Sharma P, Allison JP. The future of immune checkpoint therapy. Science. 2015;348(6230):56–61.
Borghaei H, Paz-Ares L, Horn L, Spigel DR, Steins M, Ready NE, Chow LQ, Vokes EE, Felip E, Holgado E. Nivolumab versus docetaxel in advanced nonsquamous non–small-cell lung cancer. N Engl J Med. 2015;373(17):1627–39.
Robert C, Long GV, Brady B, Dutriaux C, Maio M, Mortier L, Hassel JC, Rutkowski P, McNeil C, Kalinka-Warzocha E. Nivolumab in previously untreated melanoma without BRAF mutation. N Engl J Med. 2015;372(4):320–30.
Topalian SL, Taube JM, Anders RA, Pardoll DM. Mechanism-driven biomarkers to guide immune checkpoint blockade in cancer therapy. Nat Rev Cancer. 2016;16(5):275–87.
Fridman WH, Zitvogel L, Sautès-Fridman C, Kroemer G. The immune contexture in cancer prognosis and treatment. Nat Rev Clin Oncol. 2017;14(12):717–34.
Savas P, Salgado R, Denkert C, Sotiriou C, Darcy PK, Smyth MJ, Loi S. Clinical relevance of host immunity in breast cancer: from TILs to the clinic. Nat Rev Clin Oncol. 2016;13(4):228–41.
Denkert C, Von Minckwitz G, Brase JC, Sinn BV, Gade S, Kronenwett R, Pfitzner BM, Salat C, Loi S, Schmitt WD. Tumor-infiltrating lymphocytes and response to neoadjuvant chemotherapy with or without carboplatin in human epidermal growth factor receptor 2–positive and triple-negative primary breast cancers. J Clin Oncol. 2014;33(9):983–91.
Loi S, Sirtaine N, Piette F, Salgado R, Viale G, Van Eenoo F, Rouas G, Francis P, Crown JP, Hitre E. Prognostic and predictive value of tumor-infiltrating lymphocytes in a phase III randomized adjuvant breast cancer trial in node-positive breast cancer comparing the addition of docetaxel to doxorubicin with doxorubicin-based chemotherapy: BIG 02-98. J Clin Oncol. 2013;31(7):860–7.
Salgado R, Denkert C, Campbell C, Savas P, Nuciforo P, Aura C, de Azambuja E, Eidtmann H, Ellis CE, Baselga J. Tumor-infiltrating lymphocytes and associations with pathological complete response and event-free survival in HER2-positive early-stage breast cancer treated with lapatinib and trastuzumab: a secondary analysis of the NeoALTTO trial. JAMA Oncol. 2015;1(4):448–55.
Perez EA, Ballman KV, Tenner KS, Thompson EA, Badve SS, Bailey H, Baehner FL. Association of stromal tumor-infiltrating lymphocytes with recurrence-free survival in the N9831 adjuvant trial in patients with early-stage HER2-positive breast cancer. JAMA Oncol. 2016;2(1):56–64.
Loi S, Michiels S, Salgado R, Sirtaine N, Jose V, Fumagalli D, Kellokumpu-Lehtinen P-L, Bono P, Kataja V, Desmedt C. Tumor infiltrating lymphocytes are prognostic in triple negative breast cancer and predictive for trastuzumab benefit in early breast cancer: results from the FinHER trial. Ann Oncol. 2014;25(8):1544–50.
Dieci M, Mathieu M, Guarneri V, Conte P, Delaloge S, Andre F, Goubar A. Prognostic and predictive value of tumor-infiltrating lymphocytes in two phase III randomized adjuvant breast cancer trials. Ann Oncol. 2015;26(8):1698–704.
Adams S, Gray RJ, Demaria S, Goldstein L, Perez EA, Shulman LN, Martino S, Wang M, Jones VE, Saphner TJ. Prognostic value of tumor-infiltrating lymphocytes in triple-negative breast cancers from two phase III randomized adjuvant breast cancer trials: ECOG 2197 and ECOG 1199. J Clin Oncol. 2014;32(27):2959–66.
Wang K, Xu J, Zhang T, Xue D. Tumor-infiltrating lymphocytes in breast cancer predict the response to chemotherapy and survival outcome: a meta-analysis. Oncotarget. 2016;7(28):44288.
Luen SJ, Savas P, Fox SB, Salgado R, Loi S. Tumour-infiltrating lymphocytes and the emerging role of immunotherapy in breast cancer. Pathology. 2017;49(2):141–55.
Salgado R, Denkert C, Demaria S, Sirtaine N, Klauschen F, Pruneri G, Wienert S, Van den Eynden G, Baehner FL, Penault-Llorca F. The evaluation of tumor-infiltrating lymphocytes (TILs) in breast cancer: recommendations by an international TILs working group 2014. Ann Oncol. 2014;26(2):259–71.
Agner SC, Rosen MA, Englander S, Tomaszewski JE, Feldman MD, Zhang P, Mies C, Schnall MD, Madabhushi A. Computerized image analysis for identifying triple-negative breast cancers and differentiating them from other molecular subtypes of breast cancer on dynamic contrast-enhanced MR images: a feasibility study. Radiology. 2014;272(1):91–9.
Sutton EJ, Dashevsky BZ, Oh JH, Veeraraghavan H, Apte AP, Thakur SB, Morris EA, Deasy JO. Breast cancer molecular subtype classifier that incorporates MRI features. J Magn Reson Imaging. 2016;44(1):122–9.
Martincich L, Deantoni V, Bertotto I, Redana S, Kubatzki F, Sarotto I, Rossi V, Liotti M, Ponzone R, Aglietta M. Correlations between diffusion-weighted imaging and breast cancer biomarkers. Eur Radiol. 2012;22(7):1519–28.
Waugh S, Purdie C, Jordan L, Vinnicombe S, Lerski R, Martin P, Thompson A. Magnetic resonance imaging texture analysis classification of primary breast cancer. Eur Radiol. 2016;26(2):322–30.
Blaschke E, Abe H. MRI phenotype of breast cancer: kinetic assessment for molecular subtypes. J Magn Reson Imaging. 2015;42(4):920–4.
Wu J, Sun X, Wang J, Cui Y, Kato F, Shirato H, Ikeda DM, Li R. Identifying relations between imaging phenotypes and molecular subtypes of breast cancer: model discovery and external validation. J Magn Reson Imaging. 2017;46(4):1017–27.
Yamamoto S, Maki DD, Korn RL, Kuo MD. Radiogenomic analysis of breast cancer using MRI: a preliminary study to define the landscape. Am J Roentgenol. 2012;199(3):654–63.
Ashraf AB, Daye D, Gavenonis S, Mies C, Feldman M, Rosen M, Kontos D. Identification of intrinsic imaging phenotypes for breast cancer tumors: preliminary associations with gene expression profiles. Radiology. 2014;272(2):374–84.
Yamamoto S, Han W, Kim Y, Du LT, Jamshidi N, Huang DS, Kim JH, Kuo MD. Breast cancer: radiogenomic biomarker reveals associations among dynamic contrast-enhanced MR imaging, long noncoding RNA, and metastasis. Radiology. 2015;275(2):384–92.
Sutton EJ, Oh JH, Dashevsky BZ, Veeraraghavan H, Apte AP, Thakur SB, Deasy JO, Morris EA. Breast cancer subtype intertumor heterogeneity: MRI-based features predict results of a genomic assay. J Magn Reson Imaging. 2015;42(5):1398–406.
Li H, Zhu Y, Burnside ES, Drukker K, Hoadley KA, Fan C, Conzen SD, Whitman GJ, Sutton EJ, Net JM. MR imaging radiomics signatures for predicting the risk of breast Cancer recurrence as given by research versions of MammaPrint, Oncotype DX, and PAM50 gene assays. Radiology. 2016;281(2):382–91.
Wu J, Cui Y, Sun X, Cao G, Li B, Ikeda DM, Kurian AW, Li R. Unsupervised clustering of quantitative image phenotypes reveals breast cancer subtypes with distinct prognoses and molecular pathways. Clin Cancer Res. 2017;23(13):3334–42.
Cancer Genome Atlas Network. Comprehensive molecular portraits of human breast tumours. Nature. 2012;490(7418):61–70.
Esserman LJ, Berry DA, DeMichele A, Carey L, Davis SE, Buxton M, Hudis C, Gray JW, Perou C, Yau C. Pathologic complete response predicts recurrence-free survival more effectively by cancer subset: results from the I-SPY 1 TRIAL—CALGB 150007/150012, ACRIN 6657. J Clin Oncol. 2012;30(26):3242–9.
Lehmann BD, Jovanović B, Chen X, Estrada MV, Johnson KN, Shyr Y, Moses HL, Sanders ME, Pietenpol JA. Refinement of triple-negative breast cancer molecular subtypes: implications for neoadjuvant chemotherapy selection. PLoS One. 2016;11(6):e0157368.
Hylton NM, Gatsonis CA, Rosen MA, Lehman CD, Newitt DC, Partridge SC, Bernreuter WK, Pisano ED, Morris EA, Weatherall PT. Neoadjuvant chemotherapy for breast Cancer: functional tumor volume by MR imaging predicts recurrence-free survival—results from the ACRIN 6657/CALGB 150007 I-SPY 1 TRIAL. Radiology. 2016;279(1):44–55.
Hylton NM, Blume JD, Bernreuter WK, Pisano ED, Rosen MA, Morris EA, Weatherall PT, Lehman CD, Newstead GM, Polin S. Locally advanced breast cancer: MR imaging for prediction of response to neoadjuvant chemotherapy—results from ACRIN 6657/I-SPY TRIAL. Radiology. 2012;263(3):663–72.
Wu J, Cao G, Sun X, Lee J, Rubin DL, Napel S, Kurian AW, Daniel BL, Li R. Intratumoral spatial heterogeneity at perfusion MR imaging predicts recurrence-free survival in locally advanced breast cancer treated with neoadjuvant chemotherapy. Radiology. 2018;288(1):26–35.
Wu J, Aguilera T, Shultz D, Gudur M, Rubin DL, Loo BW Jr, Diehn M, Li R. Early-stage non–small cell lung cancer: quantitative imaging characteristics of 18F fluorodeoxyglucose PET/CT allow prediction of distant metastasis. Radiology. 2016;281(1):270–8.
Edwards SD, Lipson JA, Ikeda DM, Lee JM. Updates and revisions to the BI-RADS magnetic resonance imaging lexicon. Magn Reson Imaging Clin N Am. 2013;21(3):483–93.
Haralick RM, Shanmugam K, Dinstein IH. Textural features for image classification. IEEE Trans Syst Man Cybern. 1973;SMC-3(6):610–21.
Hattangadi J, Park C, Rembert J, Klifa C, Hwang J, Gibbs J, Hylton N. Breast stromal enhancement on MRI is associated with response to neoadjuvant chemotherapy. Am J Roentgenol. 2008;190(6):1630–6.
Chen DS, Mellman I. Oncology meets immunology: the cancer-immunity cycle. Immunity. 2013;39(1):1–10.
Rooney MS, Shukla SA, Wu CJ, Getz G, Hacohen N. Molecular and genetic properties of tumors associated with local immune cytolytic activity. Cell. 2015;160(1):48–61.
Johnson BJ, Costelloe EO, Fitzpatrick DR, Haanen JB, Schumacher TN, Brown LE, Kelso A. Single-cell perforin and granzyme expression reveals the anatomical localization of effector CD8+ T cells in influenza virus-infected mice. Proc Natl Acad Sci U S A. 2003;100(5):2657–62.
Herbst RS, Soria JC, Kowanetz M, Fine GD, Hamid O, Gordon MS, Sosman JA, McDermott DF, Powderly JD, Gettinger SN. Predictive correlates of response to the anti-PD-L1 antibody MPDL3280A in cancer patients. Nature. 2014;515(7528):563.
Esserman LJ, Berry DA, Cheang MC, Yau C, Perou CM, Carey L, DeMichele A, Gray JW, Conway-Dorsey K, Lenburg ME. Chemotherapy response and recurrence-free survival in neoadjuvant breast cancer depends on biomarker profiles: results from the I-SPY 1 TRIAL (CALGB 150007/150012; ACRIN 6657). Breast Cancer Res Treat. 2012;132(3):1049–62.
Leek JT, Johnson WE, Parker HS, Jaffe AE, Storey JD. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics. 2012;28(6):882–3.
Tibshirani R. Regression shrinkage and selection via the lasso. J R Stat Soc Ser B Stat Methodol. 1996;58(1):267–88.
Youden WJ. Index for rating diagnostic tests. Cancer. 1950;3(1):32–5.
Wu J, Li B, Sun X, Cao G, Rubin DL, Napel S, Ikeda DM, Kurian AW, Li R. Heterogeneous enhancement patterns of tumor-adjacent parenchyma at MR imaging are associated with dysregulated signaling pathways and poor survival in breast cancer. Radiology. 2017;285(2):401–13.
Luen S, Virassamy B, Savas P, Salgado R, Loi S. The genomic landscape of breast cancer and its interaction with host immunity. Breast. 2016;29:241–50.
Emens LA. Breast cancer immunotherapy: facts and hopes. Clin Cancer Res. 2018;24(3):511–20.
Khan AM, Yuan Y. Biopsy variability of lymphocytic infiltration in breast cancer subtypes and the ImmunoSkew score. Sci Rep. 2016;6:36231.
Choy G, Khalilzadeh O, Michalski M, Do S, Samir AE, Pianykh OS, Geis JR, Pandharipande PV, Brink JA, Dreyer KJ. Current applications and future impact of machine learning in radiology. Radiology. 2018;288(2):318–28.
Stanton SE, Adams S, Disis ML. Variation in the incidence and magnitude of tumor-infiltrating lymphocytes in breast cancer subtypes: a systematic review. JAMA Oncol. 2016;2(10):1354–60.
Braman NM, Etesami M, Prasanna P, Dubchuk C, Gilmore H, Tiwari P, Plecha D, Madabhushi A. Intratumoral and peritumoral radiomics for the pretreatment prediction of pathological complete response to neoadjuvant chemotherapy based on breast DCE-MRI. Breast Cancer Res. 2017;19(1):57.
Bahl M, Barzilay R, Yedidia AB, Locascio NJ, Yu L, Lehman CD. High-risk breast lesions: a machine learning model to predict pathologic upgrade and reduce unnecessary surgical excision. Radiology. 2018;286(3):810–8.
Pinker K, Chin J, Melsaether AN, Morris EA, Moy L. Precision medicine and radiogenomics in breast cancer: new approaches toward diagnosis and treatment. Radiology. 2018;287(3):732–47.
The authors thank The Cancer Imaging Archive (TCIA), the Cancer Digital Slide Archive, the Genomic Data Commons, and Gene Expression Omnibus for providing the breast cancer cases enrolled in The Cancer Genome Atlas (TCGA) study and the I-SPY 1 study.
This research was partially supported by the National Institutes of Health grants R01 CA222512, R01 CA193730, and K99 CA218667.
Availability of data and materials
Clinical and imaging data are publicly available from TCIA (www.cancerimagingarchive.net/). The H&E-stained whole-slide tumor sections are available from the Cancer Digital Slide Archive (http://cancer.digitalslidearchive.net/). For the TCGA breast cancer cohort, gene expression data derived from RNA-seq and mutational data derived from whole-exome sequencing are available in the Genomic Data Commons (https://gdc.cancer.gov/). For the I-SPY 1 cohort, the microarray gene expression data are available from the Gene Expression Omnibus (https://www.ncbi.nlm.nih.gov/geo/ [GEO:GSE22226]).
Ethics approval and consent to participate
This study was approved by the institutional review board at Stanford University (eProtocol 44870) and complies with Health Insurance Portability and Accountability Act protocols.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figure S1. Flowcharts of detailed patient selection for both TCGA and I-SPY 1 trials in the proposed study. Figure S2. Pairwise Pearson’s correlation of 17 quantitative DCE-MRI features (see definition in Table 1). Figure S3. ROC curves corresponding to mutation burden, cytolytic activity, and proposed composite model for classification of low vs intermediate TIL groups. Figure S4. Predicted TIL values for I-SPY patients based on the composite model, stratified by (a) three subtypes and (b) recurrence status. (DOCX 600 kb)
Table S1. Clinical and pathological characteristics for eligible patients in the I-SPY cohort. Table S2. Imaging features associated with tumor-infiltrating lymphocytes (TILs) with FDR < 0.2. Table S3. Mean and SD values for five quantitative imaging features. Table S4. Imaging features associated with nonsynonymous mutation burdens with FDR < 0.2. (DOCX 17 kb)