Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Mar 10;27(8):1160-7.
doi: 10.1200/JCO.2008.18.1370. Epub 2009 Feb 9.

Supervised risk predictor of breast cancer based on intrinsic subtypes

Affiliations

Supervised risk predictor of breast cancer based on intrinsic subtypes

Joel S Parker et al. J Clin Oncol. .

Abstract

PURPOSE To improve on current standards for breast cancer prognosis and prediction of chemotherapy benefit by developing a risk model that incorporates the gene expression-based "intrinsic" subtypes luminal A, luminal B, HER2-enriched, and basal-like. METHODS A 50-gene subtype predictor was developed using microarray and quantitative reverse transcriptase polymerase chain reaction data from 189 prototype samples. Test sets from 761 patients (no systemic therapy) were evaluated for prognosis, and 133 patients were evaluated for prediction of pathologic complete response (pCR) to a taxane and anthracycline regimen.

Results: The intrinsic subtypes as discrete entities showed prognostic significance (P = 2.26E-12) and remained significant in multivariable analyses that incorporated standard parameters (estrogen receptor status, histologic grade, tumor size, and node status). A prognostic model for node-negative breast cancer was built using intrinsic subtype and clinical information. The C-index estimate for the combined model (subtype and tumor size) was a significant improvement on either the clinicopathologic model or subtype model alone. The intrinsic subtype model predicted neoadjuvant chemotherapy efficacy with a negative predictive value for pCR of 97%. CONCLUSION Diagnosis by intrinsic subtype adds significant prognostic and predictive information to standard parameters for patients with breast cancer. The prognostic properties of the continuous risk score will be of value for the management of node-negative breast cancers. The subtypes and risk score can also be used to assess the likelihood of efficacy from neoadjuvant chemotherapy.

PubMed Disclaimer

Conflict of interest statement

Authors' disclosures of potential conflicts of interest and author contributions are found at the end of this article.

Figures

Fig 1.
Fig 1.
PAM50 intrinsic subtype prognosis for relapse-free survival (RFS). (A) Outcome predictions according to the four tumor subtypes in a test set of 710 node-negative, no systemic adjuvant therapy patients. (B) Outcome by subtype in the subset of patients with estrogen receptor (ER) –positive disease from Figure 1A. (C) Outcome by subtype in patients with ER-negative disease. (D) Outcome by subtype in HER2clin-positive patients.
Fig 2.
Fig 2.
Risk of relapse (ROR) predictions using a test set of node-negative, no systemic therapy patients. (A) C-index analyses of four different Cox models using a test set of node-negative, untreated patients. (B) ROR-C (tumor size and subtype model) scores stratified by subtype. (C) Kaplan-Meier plots of the test set using cut points determined in training. (D) Analysis of the ROR-C model versus probability of survival shows a linear relationship (with the dashed lines showing the 95% CIs). ER, estrogen receptor; RFS, relapse-free survival.
Fig 3.
Fig 3.
Relationship between risk of relapse (ROR) score and paclitaxel, fluorouracil, doxorubicin, and cyclophosphamide neoadjuvant response. (A) Receiver operating characteristic curve analysis of ROR-S (subtype only model) versus pathologic complete response (pCR) in the Hess et al test set. (B) ROR-S score versus probability of pCR. PPV, positive predictive value; NPV, negative predictive value; AUC, area under the curve.
Fig A1.
Fig A1.
Hierarchical clustering and SigClust analysis of microarray data using 1,906 “intrinsic” genes and 189 samples. The SigClust algorithm statistically identifies significant/unique groups by testing the null hypothesis that a group of samples is from a single cluster, where a cluster is characterized as a multivariable normal distribution. SigClust was run at each node of the dendrogram beginning at the root and stopping when the test was no longer significant (P > .001). Statistical selection using SigClust identified nine significant groups, including the previously identified subtypes designated as luminal A (dark blue), luminal B (light blue), HER2-enriched (pink), normal-like (green), and basal-like (red).
Fig A2.
Fig A2.
Focused heatmap of Classification by Nearest Centroids (ClaNC) selected genes for each subtype. The ClaNC algorithm was optimized to select 10 genes per class for a total of 50 genes. The 10 genes for each class are shown as red/green according to their expression in a class. Black indicates that gene was not selected for the given class.
Fig A3.
Fig A3.
Heatmap of the centroid models of subtype. The centroids were constructed using the Classification by Nearest Centroids selected genes and calculated as described for the Prediction Analysis of Microarray algorithm. The expression values are shown as red/green according to their relative expression level.
Fig A4.
Fig A4.
Developing a continuous risk score based on subtypes and clinical variables. (A) A family of smoothing functions illustrates the general linear relationship between the risk of relapse (ROR) score and relapse-free survival at 5 years. (B) Significance of the slopes in the smoothed lines is plotted for each bandwidth across the range of scores. Blue indicates a significant positive slope, purple indicates nonsignificant slope, and green indicates too few data for inference.
Fig A5.
Fig A5.
Risk classification for training set of untreated patients using a combined model of intrinsic subtypes and clinical variables. (A) Risk of relapse based on the combined model with low-risk scores less than 29 (black), moderate-risk scores between 29 and 53 (green), and high-risk scores ≥ 53 (red). (B) Kaplan-Meier plot and significance of the risk score shown for 141 training set cases. LumA, luminal A; LumB, luminal B.
Fig A6.
Fig A6.
Analysis of an old-aged formalin-fixed, paraffin-embedded patient cohort. (A) The combined risk of relapse (ROR)-C model predicting high, medium, and low risk of relapse in 279 patients from the University of British Columbia. (B) Receiver operating characteristic curve analysis of quantitative reverse transcriptase polymerase chain reaction (qRT-PCR) data for ESR1 versus immunohistochemistry (IHC) data. (C) Receiver operating characteristic curve analysis for qRT-PCR data for HER2/ERBB2 versus HER2clin-positive (IHC and/or fluorescent in situ hybridization). PPV, positive predictive value; NPV, negative predictive value; AUC, area under the curve.

Republished in

  • Supervised Risk Predictor of Breast Cancer Based on Intrinsic Subtypes.
    Parker JS, Mullins M, Cheang MCU, Leung S, Voduc D, Vickery T, Davies S, Fauron C, He X, Hu Z, Quackenbush JF, Stijleman IJ, Palazzo J, Marron JS, Nobel AB, Mardis E, Nielsen TO, Ellis MJ, Perou CM, Bernard PS. Parker JS, et al. J Clin Oncol. 2023 Sep 10;41(26):4192-4199. doi: 10.1200/JCO.22.02511. J Clin Oncol. 2023. PMID: 37672882

Comment in

Similar articles

Cited by

  • Differential chromatin accessibility and transcriptional dynamics define breast cancer subtypes and their lineages.
    Iglesia MD, Jayasinghe RG, Chen S, Terekhanova NV, Herndon JM, Storrs E, Karpova A, Zhou DC, Naser Al Deen N, Shinkle AT, Lu RJ, Caravan W, Houston A, Zhao Y, Sato K, Lal P, Street C, Martins Rodrigues F, Southard-Smith AN, Targino da Costa ALN, Zhu H, Mo CK, Crowson L, Fulton RS, Wyczalkowski MA, Fronick CC, Fulton LA, Sun H, Davies SR, Appelbaum EL, Chasnoff SE, Carmody M, Brooks C, Liu R, Wendl MC, Oh C, Bender D, Cruchaga C, Harari O, Bredemeyer A, Lavine K, Bose R, Margenthaler J, Held JM, Achilefu S, Ademuyiwa F, Aft R, Ma C, Colditz GA, Ju T, Oh ST, Fitzpatrick J, Hwang ES, Shoghi KI, Chheda MG, Veis DJ, Chen F, Fields RC, Gillanders WE, Ding L. Iglesia MD, et al. Nat Cancer. 2024 Oct 30. doi: 10.1038/s43018-024-00773-6. Online ahead of print. Nat Cancer. 2024. PMID: 39478117
  • A multi-modal single-cell and spatial expression map of metastatic breast cancer biopsies across clinicopathological features.
    Klughammer J, Abravanel DL, Segerstolpe Å, Blosser TR, Goltsev Y, Cui Y, Goodwin DR, Sinha A, Ashenberg O, Slyper M, Vigneau S, Jané-Valbuena J, Alon S, Caraccio C, Chen J, Cohen O, Cullen N, DelloStritto LK, Dionne D, Files J, Frangieh A, Helvie K, Hughes ME, Inga S, Kanodia A, Lako A, MacKichan C, Mages S, Moriel N, Murray E, Napolitano S, Nguyen K, Nitzan M, Ortiz R, Patel M, Pfaff KL, Porter CBM, Rotem A, Strauss S, Strasser R, Thorner AR, Turner M, Wakiro I, Waldman J, Wu J, Gómez Tejeda Zañudo J, Zhang D, Lin NU, Tolaney SM, Winer EP, Boyden ES, Chen F, Nolan GP, Rodig SJ, Zhuang X, Rozenblatt-Rosen O, Johnson BE, Regev A, Wagle N. Klughammer J, et al. Nat Med. 2024 Oct 30. doi: 10.1038/s41591-024-03215-z. Online ahead of print. Nat Med. 2024. PMID: 39478111
  • The Influence of Microbiota on Breast Cancer: A Review.
    Neagoe CX, Ionică M, Neagoe OC, Trifa AP. Neagoe CX, et al. Cancers (Basel). 2024 Oct 13;16(20):3468. doi: 10.3390/cancers16203468. Cancers (Basel). 2024. PMID: 39456562 Free PMC article. Review.
  • Development and validation of a gene expression-based Breast Cancer Purity Score.
    Barreca M, Dugo M, Galbardi B, Győrffy B; NA-PHER2 consortium; NeoTRIP consortium; Valagussa P, Besozzi D, Viale G, Bianchini G, Gianni L, Callari M. Barreca M, et al. NPJ Precis Oncol. 2024 Oct 24;8(1):242. doi: 10.1038/s41698-024-00730-7. NPJ Precis Oncol. 2024. PMID: 39448787 Free PMC article.
  • Single-cell transcriptional atlas of human breast cancers and model systems.
    Altman JE, Olex AL, Zboril EK, Walker CJ, Boyd DC, Myrick RK, Hairr NS, Koblinski JE, Puchalapalli M, Hu B, Dozmorov MG, Chen XS, Chen Y, Perou CM, Lehmann BD, Visvader JE, Harrell JC. Altman JE, et al. Clin Transl Med. 2024 Oct;14(10):e70044. doi: 10.1002/ctm2.70044. Clin Transl Med. 2024. PMID: 39417215 Free PMC article.

References

    1. Sørlie T, Perou CM, Tibshirani R, et al. Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications. Proc Natl Acad Sci U S A. 2001;98:10869–10874. - PMC - PubMed
    1. van 't Veer LJ, Dai H, van de Vijver MJ, et al. Gene expression profiling predicts clinical outcome of breast cancer. Nature. 2002;415:530–536. - PubMed
    1. van't Veer LJ, Paik S, Hayes DF. Gene expression profiling of breast cancer: A new tumor marker. J Clin Oncol. 2005;23:1631–1635. - PubMed
    1. Paik S, Shak S, Tang G, et al. A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer. N Engl J Med. 2004;351:2817–2826. - PubMed
    1. Paik S, Tang G, Shak S, et al. Gene expression and benefit of chemotherapy in women with node-negative, estrogen receptor-positive breast cancer. J Clin Oncol. 2006;24:3726–3734. - PubMed

Publication types