Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Sep 6:17:164-174.
doi: 10.1016/j.omtn.2019.05.017. Epub 2019 Jun 4.

Anticancer Drug Response Prediction in Cell Lines Using Weighted Graph Regularized Matrix Factorization

Affiliations

Anticancer Drug Response Prediction in Cell Lines Using Weighted Graph Regularized Matrix Factorization

Na-Na Guan et al. Mol Ther Nucleic Acids. .

Abstract

Precision medicine has become a novel and rising concept, which depends much on the identification of individual genomic signatures for different patients. The cancer cell lines could reflect the "omic" diversity of primary tumors, based on which many works have been carried out to study the cancer biology and drug discovery both in experimental and computational aspects. In this work, we presented a novel method to utilize weighted graph regularized matrix factorization (WGRMF) for inferring anticancer drug response in cell lines. We constructed a p-nearest neighbor graph to sparsify drug similarity matrix and cell line similarity matrix, respectively. Using the sparsified matrices in the graph regularization terms, we performed matrix factorization to generate the latent matrices for drug and cell line. The graph regularization terms including neighbor information could help to exclude the noisy ingredient and improve the prediction accuracy. The 10-fold cross-validation was implemented, and the Pearson correlation coefficient (PCC), root-mean-square error (RMSE), PCCsr, and RMSEsr averaged over all drugs were calculated to evaluate the performance of WGRMF. The results on the Genomics of Drug Sensitivity in Cancer (GDSC) dataset are 0.64 ± 0.16, 1.37 ± 0.35, 0.73 ± 0.14, and 1.71 ± 0.44 for PCC, RMSE, PCCsr, and RMSEsr in turn. And for the Cancer Cell Line Encyclopedia (CCLE) dataset, WGRMF got results of 0.72 ± 0.09, 0.56 ± 0.19, 0.79 ± 0.07, and 0.69 ± 0.19, respectively. The results showed the superiority of WGRMF compared with previous methods. Besides, based on the prediction results using the GDSC dataset, three types of case studies were carried out. The results from both cross-validation and case studies have shown the effectiveness of WGRMF on the prediction of drug response in cell lines.

Keywords: cell line; drug response; graph regularization; matrix factorization; response prediction.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The Comparison Histogram of PCCsr for the Drugs That Target Genes in PI3K Pathway The comparison histogram of PCCsr between WGRMF and SRMF under the 10-fold cross-validation for the drugs that target genes in the PI3K pathway.
Figure 2
Figure 2
The Comparison Histogram of RMSEsr for the Drugs That Target Genes in PI3K Pathway The comparison histogram of RMSEsr between WGRMF and SRMF under the 10-fold cross-validation for the drugs that target genes in the PI3K pathway.
Figure 3
Figure 3
The Results for Consistency Identification between Predicted and Existing Data for Four Drug-Gene Pairs Based on the GDSC Dataset (A) The responses of EGFR mutated and wild-type cell lines to lapatinib are shown. (B–D) EGFR mutation and erlotinib (B), CDKN2A mutation and PD-0332991 (C), and KRAS mutation and pazopanib (D). The p values obtained from the rank-sum test have been shown in each panel.
Figure 4
Figure 4
The Responses of MET-Amplified and Wild-Type Cell Lines to PHA-665752 for Predicted, Existing, and Combined Data in GDSC The sensitivity of MET amplification to PHA-665752 could be obtained through combining the newly predicted responses and the existing data. The p values obtained from rank-sum test were given for predicted, existing, and combined data, respectively.
Figure 5
Figure 5
The Reposition of PHA-665752 on NSCLC Based on the Combination of the Newly Predicted Responses and the Existing Data The figure shows that NSCLC cell lines are more sensitive to PHA-665752 based on the integrated result (p value of rank-sum test is 3.7044 × 10−2), which could not be observed based only on existing data.
Figure 6
Figure 6
The Flowchart of the WGRMF for Prediction of Drug Response in Cancer Cell Lines The flowchart of the WGRMF for prediction of drug response in cancer cell lines, based on the drug chemical structure similarity, cell line gene expression similarity, and known response data.

Similar articles

Cited by

References

    1. Collins F.S., Varmus H. A new initiative on precision medicine. N. Engl. J. Med. 2015;372:793–795. - PMC - PubMed
    1. Mirnezami R., Nicholson J., Darzi A. Preparing for precision medicine. N. Engl. J. Med. 2012;366:489–491. - PubMed
    1. Ashley E.A. The precision medicine initiative: a new national effort. JAMA. 2015;313:2119–2120. - PubMed
    1. Barretina J., Caponigro G., Stransky N., Venkatesan K., Margolin A.A., Kim S., Wilson C.J., Lehár J., Kryukov G.V., Sonkin D. The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature. 2012;483:603–607. - PMC - PubMed
    1. Heiser L.M., Sadanandam A., Kuo W.L., Benz S.C., Goldstein T.C., Ng S., Gibb W.J., Wang N.J., Ziyad S., Tong F. Subtype and pathway specific responses to anticancer compounds in breast cancer. Proc. Natl. Acad. Sci. USA. 2012;109:2724–2729. - PMC - PubMed

LinkOut - more resources