Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2001 Jun;7(6):673-9.
doi: 10.1038/89044.

Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks

Affiliations

Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks

J Khan et al. Nat Med. 2001 Jun.

Abstract

The purpose of this study was to develop a method of classifying cancers to specific diagnostic categories based on their gene expression signatures using artificial neural networks (ANNs). We trained the ANNs using the small, round blue-cell tumors (SRBCTs) as a model. These cancers belong to four distinct diagnostic categories and often present diagnostic dilemmas in clinical practice. The ANNs correctly classified all samples and identified the genes most relevant to the classification. Expression of several of these genes has been reported in SRBCTs, but most have not been associated with these cancers. To test the ability of the trained ANN models to recognize SRBCTs, we analyzed additional blinded samples that were not previously used for the training procedure, and correctly classified them in all cases. This study demonstrates the potential applications of these methods for tumor diagnosis and the identification of candidate targets for therapy.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
The artificial neural network. a, Schematic illustration of the analysis process. The entire data-set of all 88 experiments was first quality filtered (1) and then the dimensionality was further reduced by principal component analysis (PCA) to 10 PCA projections (2), from the original 6567 expression values. Next, the 25 test experiments were set aside and the 63 training experiments were randomly partitioned into 3 groups (3). One of these groups was reserved for validation and the remaining 2 groups for calibration (4). ANN models were then calibrated using for each sample the 10 PCA values as input and the cancer category as output (5). For each model the calibration was optimized with 100 iterative cycles (epochs). This was repeated using each of the 3 groups for validation (6). The samples were again randomly partitioned and the entire training process repeated (7). For each selection of a validation group one model was calibrated, resulting in a total of 3750 trained models. Once the models were calibrated they were used to rank the genes according to their importance for the classification (8). The entire process (2–7) was repeated using only top ranked genes (9). The 25 test experiments were subsequently classified using all the calibrated models. b, Monitoring the calibration of the models. The average classification error per sample (using a summed square error function) is plotted during the training iterations (epochs) for both the training and the validation samples. A pair of lines, purple (training) and gray (validation), represents one model. The decrease in the classification errors with increasing epochs demonstrates the learning of the models to distinguish these cancers. The results shown are for 200 different models, each corresponding to a random partitioning of the data. All the models performed well for both training and validation as demonstrated by the parallel decrease (with increasing epochs) of the average summed square classification error per sample. In addition, there was no sign of over-training: if the models begin to learn features in the training set, which are not present in the validation set, this would result in an increase in the error for the validation at that point and the curves would no longer remain parallel. c, Minimizing the number of genes. The average number of misclassified samples for all 3750 models is plotted against increasing number of used genes. The misclassifications minimized to zero using the 96 highest ranked genes.
Fig. 2
Fig. 2
Classification and diagnosis of the samples. A sample is classified to a cancer category according to its highest committee vote (average of all ANN outputs) and placed in the corresponding plot. Plotted, for each sample, is the distance from its committee vote to the ideal vote for that diagnostic category (for example, for EWS, it is EWS = 1, RMS = NB = BL = 0). Thus a perfectly classified sample would be plotted with a distance of zero. Training samples are displayed as squares and test samples as triangles. Non-SRBCT samples are colored black. All SRBCT samples, including the 20 tests, were correctly classified. The distance corresponding to the 95th percentile for the training samples is represented by a dashed line, outside which the diagnosis of a sample is rejected. The diagnosis of all 5 non-SRBCT test samples was rejected since they lie outside their respective dashed lines. Three of the SRBCT samples (EWS-T13, TEST-10 and TEST-20) though correctly classified could not be confidently diagnosed.
Fig. 3
Fig. 3
Hierarchical clustering and multidimensional scaling analysis. The top 96 genes as ranked by the ANN models were used for the analysis. a, Multidimensional scaling analysis. Shown here are two projections of the MDS plot of the training samples. EWS are depicted as yellow circles, RMS as red, BL as blue and NB as green. The samples clustered closely according to the 4 different cancer categories. b, Hierarchical clustering of the samples and genes. Each row represents one of the 96 cDNA clones and each column a separate sample. A pseudo-colored representation of the relative red intensity is shown such that a red color indicates high expression and green color low expression, with scale shown below. On the right are the IMAGE id., gene symbol, class in which the gene is highly expressed (see Supplementary Methods), and the ANN rank. *, genes that have not been reported to be associated with these cancers. c, Enlargement of the hierarchical clustering dendrogram of the samples in b. All 63 training and the 20 test SRBCTs correctly clustered within their diagnostic categories. In both cases where two samples were derived from the same cell line, BL-C2 & C4, and NB-C2 and C7, each mapped adjacent to one another in the same cluster. The scale shows the Pearson correlation coefficient used to construct the dendrogram. The Pearson correlation cutoff was 0.54, when the samples clustered into the four diagnostic categories.

Comment in

Similar articles

Cited by

References

    1. Pizzo, P.A. Principles and practice of pediatric oncology (Lippincott-Raven, Philadelphia, 1997).
    1. Triche TJ, Askin FB. Neuroblastoma and the differential diagnosis of small-, round-, blue- cell tumors. Hum Pathol. 1983;14:569–595. - PubMed
    1. Taylor C, et al. Diagnosis of Ewing’s sarcoma and peripheral neuroectodermal tumour based on the detection of t(11;22) using fluorescence in situ hybridisation. Br J Cancer. 1993;67:128–133. - PMC - PubMed
    1. McManus AP, Gusterson BA, Pinkerton CR, Shipley JM. The molecular pathology of small round-cell tumours—relevance to diagnosis, prognosis, and classification. J Pathol. 1996;178:116–121. - PubMed
    1. Khan J, et al. Gene expression profiling of alveolar rhabdomyosarcoma with cDNA microarrays. Cancer Res. 1998;58:5009–5013. - PubMed

Publication types

MeSH terms