- Research
- Open access
- Published:
Prediction of pathological complete response after neoadjuvant chemotherapy for HER2-negative breast cancer patients with routine immunohistochemical markers
Breast Cancer Research volume 27, Article number: 13 (2025)
Abstract
Background
Pathological complete response (pCR) is an established surrogate marker for prognosis in patients with breast cancer (BC) after neoadjuvant chemotherapy. Individualized pCR prediction based on clinical information available at biopsy, particularly immunohistochemical (IHC) markers, may help identify patients who could benefit from preoperative chemotherapy.
Methods
Data from patients with HER2-negative BC who underwent neoadjuvant chemotherapy from 2002 to 2020 (n = 1166) were used to develop multivariable prediction models to estimate the probability of pCR (pCR-prob). The most precise model identified using cross-validation was implemented in an online calculator and a nomogram. Associations among pCR-prob, prognostic IHC3 distant recurrence and disease-free survival were studied using Cox regression and Kaplan–Meier analyses. The model’s utility was further evaluated in independent external validation cohorts.
Results
273 patients (23.4%) achieved a pCR. The most precise model had across-validated area under the curve (AUC) of 0.84, sensitivity of 0.82, and specificity of 0.71. External validation yielded AUCs between 0.75 (95% CI, 0.70–0.81) and 0.83 (95% CI, 0.78–0.87). The higher the pCR-prob, the greater the prognostic impact of pCR status (presence/absence): hazard ratios decreased from 0.55 (95% central range, 0.07–1.77) at 0% to 0.20 (0.11–0.31) at 50% pCR-prob. Combining pCR-prob and IHC3 score further improved the precision of disease-free survival prognosis.
Conclusions
A pCR prediction model for neoadjuvant therapy decision-making was established. Combining pCR and recurrence prediction allows identification of not only patients who benefit most from neoadjuvant chemotherapy, but also patients with a very unfavorable prognosis for whom alternative treatment strategies should be considered.
Background
Pathological complete response (pCR) after neoadjuvant chemotherapy is a surrogate marker for prognosis in patients with early breast cancer (BC) [1, 2]. Particularly in patients with triple-negative or HER2-positive breast cancer, a pCR is strongly associated with a favorable prognosis [1, 2]. Some hormone receptor–positive, HER2-negative patients have an excellent prognosis despite low pCR rates [1, 2]. Ideally, biomarkers can help identify patients who have a good response to chemotherapy and serve to justify such treatment, especially when the presence or absence of a pCR has a substantial impact on survival. A survey among physicians confirmed the general interest in a prediction tool for pCR[3]. The present study focuses on HER2-negative disease because, for HER2-positive patients, the indication for chemotherapy is usually established independently of biomarker values: most of these patients have been shown to benefit from combined chemotherapy with trastuzumab[4].
Although many molecular biomarkers have been shown to be associated with pCR, and some mRNA-based multigene assays have been used to predict the response to chemotherapy[5], the full potential of immunohistochemistry (IHC) may not yet have been fully explored.
IHC markers, including estrogen receptor (ER), progesterone receptor (PgR), HER2, and in some institutions Ki-67 as well, are often used for clinical decision making. Although ER, PgR, and Ki-67 assessments are usually reported as the percentage of positively stained cells, dichotomized information (positive vs negative) is mainly used in clinical trials and scientific analyses. For routine clinical use, some cutoff values have been discussed, but these cut-off points have changed in some cases. The cutoff points for ER/PgR, for example, have decreased from 10% to 1% over time [6,7,8,9,10]. These cutoff values were chosen in relation to responsiveness to antihormonal therapy but have nevertheless also been used to predict the efficacy of chemotherapy [1, 2, 11,12,13,14].
Combined information allows individualized prediction of pCR that goes beyond the single dichotomized biomarker approach currently used. The primary objective of this study was to develop various prediction tools to estimate a patient’s likelihood of pCR on the basis of clinical predictors and the IHC biomarkers ER, PgR, and Ki-67—either as assessed during routine clinical work or categorically using established or newly identified thresholds. The most precise tool is described in detail and has been validated in several independent external samples.
Methods
Primary study population
This retrospective, single-center, hospital-based observational study included 1166 patients with HER2-negative BC from the Erlangen Neoadjuvant Study Breast (ERNEST-B) study [12] who underwent neoadjuvant chemotherapy from 2002 to 2020, were ≥ 18 years of age, and had available information about their pCR status. Patients with metastases or contralateral breast cancer at primary diagnosis and patients with incomplete biomarker information were excluded. Approval for the analyses was obtained from the ethics committee of the University of Erlangen–Nuremberg. Further information is provided in Fig. 1 and the Supplement.
Data were collected prospectively in accordance with the breast center certification requirements of the German Cancer Society. Follow-up data were collected for ≤ 10 years after the primary diagnosis.
All histopathological parameters were documented from the original pathology reports. Tumor grade, tumor type, HER2 status, ER/PgR expression, and Ki-67 staining were assessed as part of clinical routine testing on formalin-fixed, paraffin-embedded tumor tissue at initial diagnosis. The staining procedures are described in the Supplement.
Study outcome
pCR was defined as a complete absence of tumor cells from the breast (ypT0) and lymph nodes (ypN0) after chemotherapy at the time of surgery.
Disease-free survival (DFS) was defined as the period from the date of diagnosis to either the earliest date of disease progression (ie, distant metastasis, local recurrence, or death from any cause) or to the last date the patient was known to be disease free within the 10-year maximum observation time.
Univariable statistical analyses
The possibly nonlinear relationship between the biomarkers ER, PgR, and Ki-67 (each continuous, 0% to 100% of positively stained cells) and pCR (yes vs no) was described using natural cubic spline functions. Optimal cutoff values for these biomarkers were calculated using the minimum p value approach. Full details are provided in the Supplement.
Developing pCR prediction models
Several logistic regression models were fitted to assess different usages of the biomarkers ER, PgR, and Ki-67 for predicting a patient’s likelihood of pCR. A logistic regression model (hereafter referred to as the basic model) was set up with established predictors for pCR: age at diagnosis (continuous), inverse body mass index (1/BMI, continuous), tumor stage (ordinal, cT1–cT4), grade (ordinal, grade 1–grade 3), lymph node status (categorical, cN0 vs cN +) and tumor type (categorical; ductal, lubular, other). Continuous predictors (age, 1/BMI) were used as cubic spline functions in which the degrees of freedom (df; 1 or 2) were determined using the Akaike information criterion (AIC). Missing predictor values were substituted by predicted expected values based on nonmissing data. All other prediction models were extensions of the basic model, including the biomarkers of interest.
Two logistic regression models were fitted with the biomarkers of interest as binary categorical predictors, one using established cutoff points (1% for ER and PgR [9], 14% for Ki-67 [15]) and the other using optimal cutoff points from the univariable analyses mentioned in the Univariable Statistical Analyses subsection. The biomarkers ER, PgR and Ki-67 were used continuously (0% to 100% of positively stained cells) as natural cubic spline functions, with 1 to 3 df in 27 (ie, 33) further regression models considering all combinations of df. To improve prediction, shrinkage of regression coefficients after estimation was applied [16].
Internal assessment of prediction models
The performance of the models in relation to calibration and discrimination was assessed using the mean squared error (MSE), receiver operating characteristic (ROC) curve, and area under the ROC curve (AUC). These measures were obtained by threefold cross-validation with 100 repetitions in order to obtain stable and realistic results [17,18,19]. In particular, all model-building steps were performed using training data, and the performance of the model was assessed using validation data that had not been used for model building. Apparent measures on the complete dataset were calculated to assess overfitting. In addition, the AIC was used as an alternative model performance measure. It was applied to the complete dataset and took overfitting into account by penalizing complex models.
The model with the smallest cross-validated MSE was considered the final model, which had the best usage of the biomarkers ER, PgR, and Ki-67 in comparison with the other usages [17, 18]. It was fitted on the complete dataset and analyzed in greater detail. An explicit formula for a patient’s predicted pCR probability (pCR-prob) was derived, implemented in an online calculator, and graphically presented as a nomogram. Model performance was also assessed using the Hosmer–Lemeshow calibration plot and χ2 test, as previously done [20]. Cross-validated sensitivities and specificities are presented. Spearman correlation coefficient ρ was calculated for pCR-prob and the IHC3 + C score, which incorporates ER, PgR, and Ki-67 (IHC3) and clinical predictors (C) and provides prognostic information on the risk of distant recurrence [21]. The IHC3 score is virtually identical to IHC4 when HER2 is negative [21].
The association between pCR-prob and the impact of pCR on the prognosis was analyzed using a Cox regression model with the following predictors: observed pCR status (yes vs no), pCR-prob obtained from the final logistic regression model as a cubic spline function with 2 df, and the interaction between the two predictors. To obtain hazard ratios for pCR (yes vs no) as a continuous function of pCR-prob, 20,000 random sample splittings were carried out in which the final model was fitted on half the data first and the Cox regression analysis was then performed on the remaining half.
Kaplan–Meier curves for DFS are shown in accordance with recently suggested pCR probability groups [3] and IHC3 + C risk classes.
External validation of the final prediction model
The observational Hannover Breast Cancer Study (HaBCS [22], n = 338, see Supplement) and two randomized clinical trials (GeparSepto [23], n = 781; GeparOcto [24], n = 269; see Supplement) were each used to validate the final prediction model that had been fitted on the primary study dataset. The discrimination ability was assessed using the AUC. Its 95% CI was estimated using 10,000 bootstrap samples. Calibration was checked using a calibration plot in addition to a simple logistic regression model with observed pCR as the outcome and the logit of pCR-prob as the only predictor (hereafter termed the calibration model). If the intercept significantly (p < 0.05) differed from 0, or if the slope significantly differed from 1, then the calibration intercept and slope were used to update the original prediction model for future application in the validation cohort [19, 25].
The final prediction model was further assessed (AUC) in subgroups according to hormone receptor status to control for heterogeneity within and across study populations.
Calculations were carried out using the R system for statistical computing (version 4.1.1; R Foundation for Statistical Computing, Vienna, Austria).
RESULTS
Patients
In the primary study cohort, 273 (23.4%) of the 1166 patients achieved a pCR. In the validation cohorts, pCR rates ranged from 17.1% to 32.7% (Supplementary Table S1). The patient characteristics grouped by pCR status are shown in Table 1, and biomarker distributions in the primary cohort are shown in Supplementary Fig. S1.
Optimal cutoff points for biomarkers
The unadjusted relationship between ER, PgR, and Ki-67 and the pCR was best described as cubic spline functions with 1, 2, and 2 df, respectively (Supplementary Fig. S2a). The df obtained from cross-validation were confirmed using the AIC. The functions were monotonic and therefore enabled the determination of cut-off points. The optimal cut-off points for ER, PgR, and Ki-67 were 40%, 9%, and 30%, respectively (Supplementary Fig. S3).
Comparison of pCR prediction models
The prediction model with linear usage of biomarkers (ER, PgR, Ki-67; 0% to 100% of positively stained cells) was most accurate (Table 2; cross-validated MSE, 0.1336). Additionally, all other models with continuous usage of the biomarkers were more accurate (0.1338–0.1347)—with decreasing accuracy correlating to increasing model complexity—than the model with established biomarker categories (0.1392) and the model with categories related to newly determined cutoff values (0.1356). These MSE results were confirmed by the AIC and AUC statistics (Table 2).
The final prediction model
The regression coefficients of the final model and a formula for calculating a patient’s pCR-prob are presented in Table 3. Using an alternative definition of pCR gave similar results (Supplementary Table S2). The online calculator and the nomogram generate pCR-prob in a user-friendly format (see Supplement). Examples of ways in which biomarkers influence pCR-prob are shown in Supplementary Fig. S2b. Most patients in the primary study population had a low pCR-prob, particularly hormone receptor–positive patients (Supplementary Fig. S4). The final model was well calibrated (p = 0.90; Hosmer–Lemeshow χ2 test; Supplementary Fig. S5a). The apparent AUC was 0.009 units larger than the cross-validated AUC (0.845 versus 0.836, Table 2), indicating a small amount of overfitting. Cross-validated subgroup-specific AUC values were similar to the overall AUC for hormone-receptor positive (0.836) and ER-positive (0.834), but not for triple-negative breast cancer (TNBC) patients (0.662, Supplementary Table S4).
The cross-validated ROC curve is shown in Supplementary Fig. S6. For cutoff points between 17 and 27%, sensitivities were between 0.80 and 0.89 at specificities of ≥ 0.65 (Supplementary Table S3). pCR-prob correlated moderately with IHC3 + C (ρ = 0.61, Supplementary Fig. S7).
A high pCR-prob not only indicated the efficacy of chemotherapy but also corresponded with the impact of pCR status on the prognosis. The greater the pCR-prob, the larger the benefit of a pCR in relation to DFS (Fig. 2). The hazard ratios for pCR presence vs absence improved from 0.55 (95% central range, 0.07–1.77) at a pCR-prob of 0% to 0.20 (0.11–0.31) at 50%.
Disease-free survival HR for patients with pCR and patients without pCR (reference) as a continuous function of predicted pCR probabilities using data from the primary study population. The lower the value on the y axis, the higher the impact of pCR status on the prognosis. The solid curve shows mean HR values obtained from 20,000 random sample splittings. Dashed curves show the corresponding pointwise 95% central range (2.5th and 97.5th percentiles) of the HR distribution. The gray vertical lines indicate the first, second, and third quartiles of the predicted pCR probability in the primary study population. HR, hazard ratio; pCR, pathological complete response
Patients with a low pCR-prob (5-year DFS rate, 0.77; 95% CI, 0.74–0.81) and patients with a high pCR-prob (0.77; 0.72–0.81) had a better prognosis than patients with an intermediate pCR-prob (0.66; 0.59–0.74, Fig. 3a). This result was examined in more detail using the IHC3 + C score (Supplementary Fig. S8): The IHC3 + C score separated patients within patient groups defined by pCR-prob (Fig. 3b–d). Patients with a low pCR-prob and a high IHC3 + C value had an extraordinarily unfavorable DFS prognosis (10-year DFS rate, 0.21; 95% CI, 0.10–0.42) in comparison with patients with a high IHC3 + C but intermediate (0.45; 0.33–0.63) or a high pCR-prob (0.59; 0.51–0.70, Fig. 3b).
Kaplan–Meier estimates for disease-free survival relative to pCR probability classes (low, 0%–10%; intermediate, 10%–30%; high, 30%–100%) and IHC3 + C classes (low, < 210, first quartile; intermediate, 210–350, interquartile range; high, ≥ 350, third quartile): a for the complete primary study cohort, b for patients with a low pCR probability, c for patients with an intermediate pCR probability, and d for patients with a high pCR probability. The curves for low IHC3 + C and intermediate pCR are not shown because the sample size was small (n = 18). No patients had a low IHC3 + C with a high pCR. IHC3 + C = estrogen receptor, progesterone receptor, Ki-67, and clinical predictors; pCR, pathological complete response
External validation of the final prediction model
The AUC of the final prediction model was 0.827 (95% CI, 0.779–0.871) in the HaBCS. Sensitivities and specificities were similar to those in the primary cohort (Supplementary Table S3). The model was well calibrated (Supplementary Fig. S5b). The intercept and the slope of the calibration model were 0.26 (95% CI, –0.14–0.67) and 1.09 (95% CI, 0.77–1.42) and did not significantly differ from 0 and 1, respectively. Hence, updating of the model was not necessary.
The AUC was between 0.754 and 0.795 in the three analyzed treatment arms of GeparSepto and GeparOcto (Table 4). pCR-prob was satisfactorily precise in the GeparOcto-ETC arm, in patients with low pCR-prob in the GeparSepto-paclitaxel arm, and in those with high pCR-prob in the GeparSepto-NAB-paclitaxel arm. It was systematically too high and too low, respectively, for the other patients in the GeparSepto arms (Supplementary Fig. S5c–e), suggesting that the model should be recalibrated before application to data from GeparSepto-like treated patients (Table 4). The online calculator and the nomogram provide updated pCR-prob for such patients.
Subgroup-specific validations are shown in Supplementary Table S4. AUC values for hormone receptor-positive patients were between 0.789 and 0.810 in the validation cohorts, and the AUC values for TNBC patients (0.614 to 0.683) were similar to the value in the primary cohort.
Discussion
This study developed and compared several prediction models for pCR after neoadjuvant chemotherapy in patients with HER2-negative BC, using clinical predictors and the molecular biomarkers ER, PgR, and Ki-67 assessed by IHC during routine clinical work. Integrating the molecular predictors as linear variables (0%–100%) yielded the best prediction of pCR. The validation of the prediction model in other populations yielded good results in patient groups with similar standard-of-care treatments. An online calculator and a nomogram are provided for user-friendly application of the prediction model.
To predict pCR, immunohistochemical markers were usually used categorically (positive vs negative), with predefined cutoff points that did not originate from pCR prediction [26, 27]. In the present study, the model with newly identified cutoff points performed better than the model with established cutoffs. However, the prediction model that did not use any cutoff points was favored overall.
With regard to prognosis, ER, PgR, and Ki-67 were also continuously incorporated into the IHC4 score in an adjuvant study on postmenopausal hormone receptor–positive patients [21]. The IHC4 score was also analyzed relative to predicting pCR after neoadjuvant chemotherapy [28, 29]. Good associations were found between IHC4 and pCR rates, but the AUC was 0.665 when IHC4 was combined with the Nottingham prognostic index [28]. Some multigene assays have also been analyzed in relation to predicting chemotherapy responsiveness, but none of these tests has been directly compared with classic IHC markers in a joint study, and the reported results do not indicate any superiority for the multigene tests [5].
pCR probability has not previously been considered in decision making for or against neoadjuvant chemotherapy. A survey among physicians with long-term experience showed that 84% would welcome the opportunity to use probability of pCR in decision making [3]. Using pCR-prob may be reasonable because the present study shows that the higher the pCR-prob, the greater the impact of pCR on the prognosis. This finding ensures not only consideration of the number of patients achieving a pCR but also the selection of a patient group with a greater prognostic benefit from achieving a pCR. We have shown that patients with high pCR-prob have a more favorable prognosis than patients with intermediate pCR-prob, which supports the view that patients with high pCR-prob can be treated with neoadjuvant chemotherapy.
A meta-analysis of randomized studies comparing neoadjuvant with adjuvant chemotherapy showed that the mortality rate in patients who do not respond to neoadjuvant chemotherapy was much higher (33.5% at 10 years) than in all patients who received adjuvant chemotherapy (22.7% at 10 years) [30]. This finding may largely reflect patient selection rather than a treatment effect [30]. Since the analyses were adjusted for clinical tumor size and nodal status, it might also indicate that some patients not only derive no benefit from neoadjuvant chemotherapy but also have prognoses that are poorer than expected. In the present study, this group of patients may correspond to patients with a low pCR-prob and a high risk of recurrence.
The most precise external prediction was obtained in a hospital-based cohort (HaBCS) that underwent preoperative chemotherapy very similar to that of the primary cohort (ie, an anthracycline-based agent followed by a taxane). The predictor showed good results in patients whose treatments were close to standard care (GeparSepto; NCT01583426; paclitaxel or nab-paclitaxel followed by anthracycline-based chemotherapy). The prediction model did not take treatments into account, and the significant treatment effect in GeparSepto might therefore have led to overestimated predictions in one arm and underestimated predictions in the other, necessitating correction by recalibration.
Differences in performance between the primary and validation cohorts are unavoidable and were expected [19]. More external validation studies that cover various geographic regions, time periods, and healthcare institutions may be able to further clarify such differences and possibly improve the prediction model. Adding other novel biomarkers, such as tumor-infiltrating lymphocytes, to the prediction model might also improve accuracy. The pCR prediction in patients with rare tumor types such as lobular breast cancer could be improved by using additional, sufficiently large specific cohorts.
The AUC values for hormone-receptor positive patients—i.e., patients for whom the use of the prediction tool is primarily intended—were around 0.80 and only slightly smaller than in the primary study population. The AUC values in TNBC patients were similar across all cohorts, however poorer (around 0.65) than in other patient groups, but still within the range of published studies. This may not be of great importance clinically as therapy decisions for chemotherapy in TNBC patients (as in HER2-positve patients) are usually made without considering further tumor-related characteristics because therapy efficacy is high and there is a lack of alternative treatment options.
The final prediction model was assessed both internally by cross-validation and externally in independent validation cohorts in accordance with the TRIPOD statement [19]. In both cases, prediction performance was measured with data not used for model building in order to obtain realistic results. Data splitting was also applied in survival analyses with actual and predicted pCR values in order to avoid biased predictions in which the upcoming event was already known. Both the AIC and cross-validation provided very similar model selection and model complexity results. It follows that using the AIC instead of cross-validation may be appropriate in complex settings. In the present study, the AIC rather than cross-validation was applied in the model development process to some extent in order to keep the number of potential models manageable.
Conclusions
This study provides a prediction model for pCR after neoadjuvant chemotherapy using clinical parameters and continuous IHC markers. The good performance of the model suggests that therapy decisions could be based on predicted pCR probabilities. Considering both the likelihood of pCR and the prognosis not only identifies patients who may benefit from neoadjuvant chemotherapy but also patients in whom the prognosis was unexpectedly unfavorable. It may be necessary to develop novel treatment strategies for these patients.
Availability of data and materials
For eligible studies, qualified researchers who provide a methodologically sound proposal and whose proposed use of data has been approved by a review committee may request access to individual patient data that underlie the results reported in this article, after de-identification (text, tables, figures, and appendices) and study protocols, beginning 9 months before and ending 36 months after article publication. Analyses to achieve the aims in the proposed proposal are eligible. Proposals may be submitted to the corresponding author up to 36 months following article publication. After 36 months, the data will be available in our university’s data warehouse but without investigator support. The online calculator developed from the final prediction model for pCR is available at https://www.pcrpredictor.org. The web page also provides an open-source application of the calculator as a downloadable file. The R code to determine the pCR probabilities is available on request from the corresponding author. No datasets were generated or analysed during the current study.
Abbreviations
- AIC:
-
Akaike information criterion
- AUC:
-
Area under the receiver operating characteristic curve
- BC:
-
Breast cancer
- BMI:
-
Body mass index
- BSA:
-
Body surface area
- CI:
-
Confidence interval
- DFS:
-
Disease-free survival
- ER:
-
Estrogen receptor
- ERNEST-B:
-
Erlangen neoadjuvant study breast
- ETC:
-
High-dose epirubicin, taxane, and cyclophosphamide
- HaBCS:
-
Hannover breast cancer study
- HER2:
-
Human epidermal growth factor receptor 2
- iddEPC:
-
Intense dose-dense epirubicin, paclitaxel, and cyclophosphamide
- IHC:
-
Immunohistochemical
- IHC3 + C:
-
IHC 3 score plus clinical predictors
- IQR:
-
Interquartile range
- MSE:
-
Mean squared error
- pCR:
-
Pathological complete response
- pCR-prob:
-
Probability of pathological complete response
- PgR:
-
Progesterone receptor
- PM[Cb]:
-
Paclitaxel plus nonpegylated liposomal doxorubicin with additional carboplatin
- ROC:
-
Receiver operating characteristic
- SD:
-
Standard deviation
- SE:
-
Standard error
References
von Minckwitz G, Untch M, Blohmer JU, Costa SD, Eidtmann H, Fasching PA, et al. Definition and impact of pathologic complete response on prognosis after neoadjuvant chemotherapy in various intrinsic breast cancer subtypes. J Clin Oncol. 2012;30(15):1796–804.
Cortazar P, Zhang L, Untch M, Mehta K, Costantino JP, Wolmark N, et al. Pathological complete response and long-term clinical benefit in breast cancer: the CTNeoBC pooled analysis. Lancet. 2014;384(9938):164–72.
Gass P, Haberle L, Erber R, Hein A, Jud SM, Lux MP, et al. Using probability for a pathological complete response (pCR) as a decision support marker for a neoadjuvant chemotherapy in HER2 negative breast cancer patients a survey among physicians. Geburtshilfe Frauenheilkd. 2018;79:707–14.
Tolaney SM, Barry WT, Dang CT, Yardley DA, Moy B, Marcom PK, et al. Adjuvant paclitaxel and trastuzumab for node-negative, HER2-positive breast cancer. N Engl J Med. 2015;372(2):134–41.
Robidoux A, McCullough D, Lau A, Stoppler M, Chao C. Use of the 21-gene Oncotype DX (R) breast recurrence score (TM) (RS) assay in the neoadjuvant treatment setting. Breast. 2017;32:S80.
Goldhirsch A, Wood WC, Gelber RD, Coates AS, Thurlimann B, Senn HJ. Meeting highlights: updated international expert consensus on the primary therapy of early breast cancer. J Clin Oncol. 2003;21(17):3357–65.
Goldhirsch A, Glick JH, Gelber RD, Coates AS, Thurlimann B, Senn HJ. Meeting highlights: international expert consensus on the primary therapy of early breast cancer 2005. Ann Oncol. 2005;16(10):1569–83.
Hammond ME, Hayes DF, Wolff AC. Clinical notice for American society of clinical oncology-college of American Pathologists guideline recommendations on ER/PgR and HER2 testing in breast cancer. J Clin Oncol Off J Am Soc Clin Oncol. 2011;29(15): e458.
Hammond ME, Hayes DF, Dowsett M, Allred DC, Hagerty KL, Badve S, et al. American society of clinical oncology/college of American pathologists guideline recommendations for immunohistochemical testing of estrogen and progesterone receptors in breast cancer. J Clin Oncol Off J Am Soc Clin Oncol. 2010;28(16):2784–95.
Goldhirsch A, Glick JH, Gelber RD, Coates AS, Senn HJ. Meeting highlights: international consensus panel on the treatment of primary breast cancer. In: 7th International conference on adjuvant therapy of primary breast cancer. J Clin Oncol. 2001;19(18):3817–27.
Goldhirsch A, Ingle JN, Gelber RD, Coates AS, Thurlimann B, Senn HJ. Thresholds for therapies: highlights of the St Gallen International Expert Consensus on the primary therapy of early breast cancer 2009. Ann Oncol Off J Eur Soc Med Oncol. 2009;20(8):1319–29.
Fasching PA, Heusinger K, Haeberle L, Niklos M, Hein A, Bayer CM, et al. Ki67, chemotherapy response, and prognosis in breast cancer patients receiving neoadjuvant treatment. BMC Cancer. 2011;11:486.
Kim KI, Lee KH, Kim TR, Chun YS, Lee TH, Park HK. Ki-67 as a predictor of response to neoadjuvant chemotherapy in breast cancer patients. J Breast Cancer. 2014;17(1):40–6.
Lips EH, Mulder L, de Ronde JJ, Mandjes IA, Koolen BB, Wessels LF, et al. Breast cancer subtyping by immunohistochemistry and histological grade outperforms breast cancer intrinsic subtypes in predicting neoadjuvant chemotherapy response. Breast Cancer Res Treat. 2013;140(1):63–71.
Cheang MC, Chia SK, Voduc D, Gao D, Leung S, Snider J, et al. Ki67 index, HER2 status, and prognosis of patients with luminal B breast cancer. J Natl Cancer Inst. 2009;101(10):736–50.
Steyerberg EW, Borsboom GJ, van Houwelingen HC, Eijkemans MJ, Habbema JD. Validation and updating of predictive logistic regression models: a study on sample size and shrinkage. Stat Med. 2004;23(16):2567–86.
Haberle L, Hack CC, Heusinger K, Wagner F, Jud SM, Uder M, et al. Using automated texture features to determine the probability for masking of a tumor on mammography, but not ultrasound. Eur J Med Res. 2017;22(1):30.
Wessels LF, Reinders MJ, Hart AA, Veenman CJ, Dai H, He YD, et al. A protocol for building and evaluating predictors of disease state based on microarray data. Bioinformatics (Oxford, England). 2005;21(19):3755–62.
Moons KG, Altman DG, Reitsma JB, Ioannidis JP, Macaskill P, Steyerberg EW, et al. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med. 2015;162(1):W1-73.
Haberle L, Fasching PA, Brehm B, Heusinger K, Jud SM, Loehberg CR, et al. Mammographic density is the main correlate of tumors detected on ultrasound but not on mammography. Int J Cancer. 2016;139(9):1967–74.
Cuzick J, Dowsett M, Pineda S, Wale C, Salter J, Quinn E, et al. Prognostic value of a combined estrogen receptor, progesterone receptor, Ki-67, and human epidermal growth factor receptor 2 immunohistochemical score and comparison with the Genomic Health recurrence score in early breast cancer. J Clin Oncol. 2011;29(32):4273–8.
Michailidou K, Lindstrom S, Dennis J, Beesley J, Hui S, Kar S, et al. Association analysis identifies 65 new breast cancer risk loci. Nature. 2017;551(7678):92–4.
Untch M, Jackisch C, Schneeweiss A, Conrad B, Aktas B, Denkert C, et al. Nab-paclitaxel versus solvent-based paclitaxel in neoadjuvant chemotherapy for early breast cancer (GeparSepto-GBG 69): a randomised, phase 3 trial. Lancet Oncol. 2016;17(3):345–56.
Schneeweiss A, Mobus V, Tesch H, Hanusch C, Denkert C, Lubbe K, et al. Intense dose-dense epirubicin, paclitaxel, cyclophosphamide versus weekly paclitaxel, liposomal doxorubicin (plus carboplatin in triple-negative breast cancer) for neoadjuvant treatment of high-risk early breast cancer (GeparOcto-GBG 84): a randomised phase III trial. Eur J Cancer. 2019;106:181–92.
Janssen KJ, Moons KG, Kalkman CJ, Grobbee DE, Vergouwe Y. Updating methods improved the performance of a clinical prediction model in new patients. J Clin Epidemiol. 2008;61(1):76–86.
Rouzier R, Coutant C, Lesieur B, Mazouni C, Incitti R, Natowicz R, et al. Direct comparison of logistic regression and recursive partitioning to predict chemotherapy response of breast cancer based on clinical pathological variables. Breast Cancer Res Treat. 2009;117(2):325–31.
Kim JY, Jeon E, Kwon S, Jung H, Joo S, Park Y, et al. Prediction of pathologic complete response to neoadjuvant chemotherapy using machine learning models in patients with breast cancer. Breast Cancer Res Treat. 2021;189(3):747–57.
Tan W, Luo W, Jia W, Liang G, Xie X, Zheng W, et al. A combination of Nottingham prognostic index and IHC4 score predicts pathological complete response of neoadjuvant chemotherapy in estrogen receptor positive breast cancer. Oncotarget. 2016;7(52):87312–22.
Sheri A, Smith IE, Hills M, Jones RL, Johnston SR, Dowsett M. Relationship between IHC4 score and response to neo-adjuvant chemotherapy in estrogen receptor-positive breast cancer. Breast Cancer Res Treat. 2017;164(2):395–400.
Early Breast Cancer Trialists Collaborative G. Long-term outcomes for neoadjuvant versus adjuvant chemotherapy in early breast cancer: meta-analysis of individual patient data from ten randomised trials. Lancet Oncol. 2018;19(1):27–39.
Acknowledgements
The authors are grateful to Dr Michael Robertson and Nucleus Global for professional medical editing services. The authors thank all clinicians and documentation assistants at participating hospitals of the regional Hannover Breast Cancer Network, specifically at the Diakovere Hannover, the Agaplesion Clinics Schaumburg, and the Sana Clinics Hameln-Pyrmont, as well as all clinicians and documentation assistants at the University Breast Center for Franconia. The contribution of Melitta Niklos to this publication was made in partial fulfillment of the requirements for obtaining the degree of Doctor of Medicine. Parts of the research published here have been used for her doctoral thesis at the Medical Faculty of Friedrich Alexander University of Erlangen–Nuremberg.
Funding
Open Access funding enabled and organized by Projekt DEAL. This work was supported by the Ellen and Artur Matull Foundation [MWB/0f – 08/07/2018] and and by the Federal Ministry of Education and Research (BMBF; project ERACoSysMed3: collaborative project: RESCUER; project grant number: 031L0240). The funders of the study had no role in study design, data collection, data analysis, data interpretation, writing of the report or the decision to submit the paper for publication.
Author information
Authors and Affiliations
Contributions
LH conducted the statistical analysis and interpretation of data. RE, RSW, VN, AH, MC, TD, and SL provided data for the analysis. BV and MN performed the data management. PG, AH, CCH, AS, MU, TWPS, and MWB recruited patients for the study. PAF and LH, with the help of HH and CG, wrote the first draft of the manuscript. PAF supervised the project. All authors revised the manuscript. All authors discussed the results and contributed to the final manuscript. LH, BV, and PAF directly accessed and verified the underlying data reported in the manuscript. All authors had full access to all the data in the study and had final responsibility for the decision to submit for publication.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
The ERNEST-B study was approved by the Ethics Committee of the Medical Faculty of Friedrich-Alexander Universität Erlangen-Nürnberg, Erlangen (application number 297_17 Bc). The HaBCS was approved by the ethics committee at Hannover Medical School (No. 6079). GeparSepto (NCT01583426) and GeparOcto (NCT02125344) were multicenter trials, approved by the ethics committees/institutional review boards and the competent authority. Patients enrolled into the GeparSepto and GeparOcto studies provided informed consent. In the ERNEST-B and HaBCS studies, patient data were collected in accordance with the breast center certification requirements of the German Cancer Society. All experiments were performed in accordance with relevant guidelines and regulations.
Consent for publication
Not applicable.
Competing interests
RE has received honoria from Roche, Eisai, and Novartis; has participated on advisory boards for Roche; and has received compensation for travel expenses from Novartis and Roche. PG has received honoraria from Novartis; financial support for symposia from Novartis, Roche, and PharmaMar; and compensation for travel expenses from discovering hands®. CCH has received honoraria from Roche. AH has received honoraria and compensation for travel expenses from BMS, MSD, Roche, AstraZeneca, AbbVie, Böhringer Ingelheim, Sysmex, Janssen, and Cepheid; has participated on advisory boards for BMS, MSD, Roche, AstraZeneca, AbbVie, Böhringer Ingelheim, Sysmex, Janssen, Cepheid, and NanoString; has received research funding from Janssen, Roche, Cepheid, and NanoString; and has provided expert testimony for NanoString. MPL has participated on advisory boards for AstraZeneca, MSD, Novartis, Pfizer, Eisai, Genomic Health, Tesaro, and Roche, and has received honoraria for lectures from MSD, Lilly, Roche, Novartis, Pfizer, Genomic Health, AstraZeneca, medac, and Eisai. PAF reports grants from Novartis, Cepheid, and BioNTech, and personal fees from Novartis, Roche, Pfizer, Celgene, Daiichi-Sankyo, TEVA, AstraZeneca, MSDohme, Myelo Therapeutics, Macrogenics, Eisai, and Puma. All remaining authors have no conflicts of interest to disclose.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Häberle, L., Erber, R., Gass, P. et al. Prediction of pathological complete response after neoadjuvant chemotherapy for HER2-negative breast cancer patients with routine immunohistochemical markers. Breast Cancer Res 27, 13 (2025). https://doi.org/10.1186/s13058-025-01960-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s13058-025-01960-8