Skip to main page content
U.S. flag

An official website of the United States government

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Nov 25;15(11):e1007520.
doi: 10.1371/journal.pcbi.1007520. eCollection 2019 Nov.

A novel network control model for identifying personalized driver genes in cancer

Affiliations

A novel network control model for identifying personalized driver genes in cancer

Wei-Feng Guo et al. PLoS Comput Biol. .

Abstract

Although existing computational models have identified many common driver genes, it remains challenging to identify the personalized driver genes by using samples of an individual patient. Recently, the methods of exploiting the structure-based control principles of complex networks provide new clues for identifying minimum number of driver nodes to drive the state transition of large-scale complex networks from an initial state to the desired state. However, the structure-based network control methods cannot be directly applied to identify the personalized driver genes due to the unknown network dynamics of the personalized system. Here we proposed the personalized network control model (PNC) to identify the personalized driver genes by employing the structure-based network control principle on genetic data of individual patients. In PNC model, we firstly presented a paired single sample network construction method to construct the personalized state transition network for capturing the phenotype transitions between healthy and disease states. Then, we designed a novel structure-based network control method from the Feedback Vertex Sets-based control perspective to identify the personalized driver genes. The wide experimental results on 13 cancer datasets from The Cancer Genome Atlas firstly showed that PNC model outperforms current state-of-the-art methods, in terms of F-measures for identifying cancer driver genes enriched in the gold-standard cancer driver gene lists. Furthermore, these results showed that personalized driver genes can be explored by their network characteristics even when they are hidden factors in transcription and mutation profiles. Our PNC gives novel insights and useful tools into understanding the tumor heterogeneity in cancer. The PNC package and data resources used in this work can be freely downloaded from https://github.com/NWPU-903PR/PNC.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Overview of personalized network control model (PNC) for identifying personalized driver genes.
(a) The motivation of PNC. By integrating sample specific network theory and structure based network control theory, a small number of personalized driver genes that are altered in response to oncogene activations is detected for triggering the state transition of an individual patient from the healthy state to the disease state at expression level. The main consideration of PNC is to design methods for i) constructing personalized state transition networks to capture the phenotypic transitions between healthy and disease states / attractors, and ii) developing network control method on personalized state transition networks where we identify personalized driver genes for driving individual system in cancer from normal attractor to disease attractor through oncogene activation. (b) The main workflow of PNC. Our PNC consists of two main parts. One is a paired single sample network construction method (Paired-SSN), which we used to construct personalized state transition networks. In these networks, edges denote the significant difference of gene interactions between normal sample and tumor sample of an individual patient. For the second part of PNC, we designed a novel nonlinear structural control method (namely, NCUA) for identifying personalized driver genes. This was done to ensure that the state in the personalized state transition network would asymptotically be changed from normal state to disease state through oncogene activations.
Fig 2
Fig 2. Overview of Paired-SSN for constructing personalized state transition networks.
For a given cancer patient, TCGA.EJ.7781 in Prostate Adenocarcinoma (PRAD) cancer data, we chose the expression data of all normal samples in PRAD as the reference data and respectively constructed the co-expression network of tumor sample (white color) and normal sample (green color) with the reference data by using SSN method. Then the personalized state transition network was constructed where the nodes represent the genes and edges denote the significant difference of gene interactions between normal sample and tumor sample of an individual patient in the disease development. Here we showed the individual specific sub-networks related with driver gene TP53 which contain its first-order neighboring genes as an example in this Fig.
Fig 3
Fig 3. The significant enrichment F-scores of PNC and other methods for identifying cancer driver genes.
Fig 4
Fig 4. The performance evaluation of different single sample network construction methods and network control methods.
(a) The significant enrichment F-scores of NCUA on personalized state transition networks constructed by Paired-SSN (the first step of our PNC), SSN and LIONESS. (b)The enrichment F-scores of NCUA (the second step of our PNC) and other structure based network control methods (MMS and MDS and DFVS) on the paired SSN networks (the first step of our PNC) in the list of CCG and NCG genes. (c) The enrichment F-scores of PNC with gene interaction network used in this work (Network 1) and other references (Network 2, Network 3) and the gene interaction network from STRING data set (scores>900) (Network 4) and the gene interaction network with top 10000 high scores from STRING data set (Network 5) and our filleted reliable gene interaction network (Network 6) and our filleted unreliable gene interactions network (Network 7).
Fig 5
Fig 5. The F-measures on different cancer data sets by using PNC on gene interaction network used in this work (Network 1) and our filleted reliable gene interaction network (Network 6) and our filleted unreliable gene interactions network (Network 7).
Fig 6
Fig 6. Statistic analysis of the personalized driver genes.
(a) The fractions of high-frequency personalized driver genes (yellow, fh), medium-frequency personalized driver genes (red, fm), and low-frequency personalized driver genes (blue, fl) on 13 cancer datasets. (b) The fraction of personalized driver genes in CCG and NCG with low mutation frequency and high mutation frequency respectively on 13 kinds of cancer data.
Fig 7
Fig 7. The p-value of personalized driver genes enriched in CCG and NCG genes on 13 kinds of cancer data.
The red line denotes the significant threshold value 0.05.
Fig 8
Fig 8. Heat map of the patient frequency (different colors) of these 20 intersected pathways in 13 cancer data sets which was previous reported in other literatures.
Fig 9
Fig 9. Controllability evaluation of individual patient system by using PNC.
(a) Box plot with the distribution of the personalized controllability in 13 cancer datasets. (b) Heat map of the personalized controllability (different colors) which is as a function of the average degree and power law degree exponent.

Similar articles

Cited by

References

    1. Cheng F, Zhao J, Zhao Z. Advances in computational approaches for prioritizing driver mutations and significantly mutated genes in cancer genomes. Briefings in bioinformatics. 2015;17(4):642–656. 10.1093/bib/bbv068 - DOI - PMC - PubMed
    1. Porta-Pardo E, Kamburov A, Tamborero D, Pons T, Grases D, Valencia A, et al. Comparison of algorithms for the detection of cancer drivers at subgene resolution. Nature methods. 2017;14(8):782 10.1038/nmeth.4364 - DOI - PMC - PubMed
    1. Jang HS, Shah NM, Du AY, Dailey ZZ, Pehrsson EC, Godoy PM, et al. Transposable elements drive widespread expression of oncogenes in human cancers. Nature genetics. 2019;51(4):611–617. 10.1038/s41588-019-0373-3 - DOI - PMC - PubMed
    1. Wei X, Walia V, Lin JC, Teer JK, Prickett TD, Gartner J, et al. Exome sequencing identifies GRIN2A as frequently mutated in melanoma. Nature genetics. 2011;43(5):442–446. 10.1038/ng.810 - DOI - PMC - PubMed
    1. Tamborero D, Gonzalez-Perez A, Lopez-Bigas N. OncodriveCLUST: exploiting the positional clustering of somatic mutations to identify cancer genes. Bioinformatics. 2013;29(18):2238–2244. 10.1093/bioinformatics/btt395 - DOI - PubMed

Publication types

Grants and funding

This paper was supported by the National Natural Science Foundation of China (61873202, 61473232, 91430111, 31930022, 31771476, 81471047 and 11871456) and National Key R&D Program (2017YFA0505500), Shanghai Municipal Science and Technology Major Project (No. 2017SHZDZX01), National Key R&D Program (Special Project on Precision Medicine) (2016YFC0903400), and Natural Science Foundation of Shanghai (17ZR1446100). All the above funders played roles in the study design, data collection and analysis and preparation of the manuscript