Positive-Unlabeled Learning for inferring drug interactions based on heterogeneous attributes

doi:10.1186/s12859-017-1546-7

. 2017 Mar 1;18(1):140.

doi: 10.1186/s12859-017-1546-7.

Positive-Unlabeled Learning for inferring drug interactions based on heterogeneous attributes

Pathima Nusrath Hameed^{1

2

3}, Karin Verspoor⁴, Snezana Kusljic^{5

6}, Saman Halgamuge⁷

Affiliations

¹ Department of Mechanical Engineering, University of Melbourne, Parkville, Melbourne, 3010, Australia. nusrath@dcs.ruh.ac.lk.
² Data61, Victoria Research Lab, West Melbourne, 3003, Australia. nusrath@dcs.ruh.ac.lk.
³ Department of Computer Science, University of Ruhuna, Matara, 81000, Sri Lanka. nusrath@dcs.ruh.ac.lk.
⁴ Department of Computing and Information Systems, University of Melbourne, Parkville, Melbourne, 3010, Australia.
⁵ Department of Nursing, University of Melbourne, Parkville, Melbourne, 3010, Australia.
⁶ The Florey Institute of Neuroscience and Mental Health, University of Melbourne, Parkville, Melbourne, 3010, Australia.
⁷ Research School of Engineering, College of Engineering and Computer Science, The Australian National University, Canberra, 2601, ACT, Australia.

PMID: 28249566
PMCID: PMC5333429
DOI: 10.1186/s12859-017-1546-7

Positive-Unlabeled Learning for inferring drug interactions based on heterogeneous attributes

Pathima Nusrath Hameed et al. BMC Bioinformatics. 2017.

. 2017 Mar 1;18(1):140.

doi: 10.1186/s12859-017-1546-7.

Authors

Pathima Nusrath Hameed^{1

2

3}, Karin Verspoor⁴, Snezana Kusljic^{5

6}, Saman Halgamuge⁷

Affiliations

¹ Department of Mechanical Engineering, University of Melbourne, Parkville, Melbourne, 3010, Australia. nusrath@dcs.ruh.ac.lk.
² Data61, Victoria Research Lab, West Melbourne, 3003, Australia. nusrath@dcs.ruh.ac.lk.
³ Department of Computer Science, University of Ruhuna, Matara, 81000, Sri Lanka. nusrath@dcs.ruh.ac.lk.
⁴ Department of Computing and Information Systems, University of Melbourne, Parkville, Melbourne, 3010, Australia.
⁵ Department of Nursing, University of Melbourne, Parkville, Melbourne, 3010, Australia.
⁶ The Florey Institute of Neuroscience and Mental Health, University of Melbourne, Parkville, Melbourne, 3010, Australia.
⁷ Research School of Engineering, College of Engineering and Computer Science, The Australian National University, Canberra, 2601, ACT, Australia.

PMID: 28249566
PMCID: PMC5333429
DOI: 10.1186/s12859-017-1546-7

Abstract

Background: Investigating and understanding drug-drug interactions (DDIs) is important in improving the effectiveness of clinical care. DDIs can occur when two or more drugs are administered together. Experimentally based DDI detection methods require a large cost and time. Hence, there is a great interest in developing efficient and useful computational methods for inferring potential DDIs. Standard binary classifiers require both positives and negatives for training. In a DDI context, drug pairs that are known to interact can serve as positives for predictive methods. But, the negatives or drug pairs that have been confirmed to have no interaction are scarce. To address this lack of negatives, we introduce a Positive-Unlabeled Learning method for inferring potential DDIs.

Results: The proposed method consists of three steps: i) application of Growing Self Organizing Maps to infer negatives from the unlabeled dataset; ii) using a pairwise similarity function to quantify the overlap between individual features of drugs and iii) using support vector machine classifier for inferring DDIs. We obtained 6036 DDIs from DrugBank database. Using the proposed approach, we inferred 589 drug pairs that are likely to not interact with each other; these drug pairs are used as representative data for the negative class in binary classification for DDI prediction. Moreover, we classify the predicted DDIs as Cytochrome P450 (CYP) enzyme-Dependent and CYP-Independent interactions invoking their locations on the Growing Self Organizing Map, due to the particular importance of these enzymes in clinically significant interaction effects. Further, we provide a case study on three predicted CYP-Dependent DDIs to evaluate the clinical relevance of this study.

Conclusion: Our proposed approach showed an absolute improvement in F1-score of 14 and 38% in comparison to the method that randomly selects unlabeled data points as likely negatives, depending on the choice of similarity function. We inferred 5300 possible CYP-Dependent DDIs and 592 CYP-Independent DDIs with the highest posterior probabilities. Our discoveries can be used to improve clinical care as well as the research outcomes of drug development.

Keywords: CYP isoforms; Drug-drug interaction; Growing self organizing map (GSOM); PU learning; Pairwise drug similarity.

PubMed Disclaimer

Figures

**Fig. 1**
This diagram illustrates the main idea behind Positive-Unlabeled Learning. a Available data. b Goal

**Fig. 2**
This diagram illustrates the proposed methodology and our three main contributions for inferring DDIs, integrating Similarity Feature Representation1 (SFR1) and Similarity Feature Representation2 (SFR2)

**Fig. 3**
Pseudo-code for profiling GSOM nodes as ‘positive/negative/ambiguous’ node

**Fig. 4**
Example of deriving similarity metrics for drug association. Jaccard Index is the frequently used approach while Individual Similarity function is the proposed function

**Fig. 5**
a The average within cluster distance (AWCD) using Similarity Feature Representation 1 and (b) Number of GSOM nodes variation for Similarity Feature Representation 1

**Fig. 6**
GSOM maps for DDI data: (a) shows the GSOM map for Similarity Feature Representation 1 (SFR1) when Spread Factor=0.1 and it contains 919 nodes; (b) shows the GSOM map for Similarity Feature Representation 2 (SFR2) when Spread Factor= 10⁻¹⁵ and it contains 922 nodes. The nodes shown in *blue* are the proposed negative nodes having only unlabeled instances, the nodes shown in *grey* contains both initial positives and unlabeled instances, and the nodes shown in *red* contains only initial positives

See this image and copyright information in PMC

Cited by

Evaluation of knowledge graph embedding approaches for drug-drug interaction prediction in realistic settings.
Celebi R, Uyar H, Yasar E, Gumus O, Dikenelli O, Dumontier M. Celebi R, et al. BMC Bioinformatics. 2019 Dec 18;20(1):726. doi: 10.1186/s12859-019-3284-5. BMC Bioinformatics. 2019. PMID: 31852427 Free PMC article.
Positive-unlabelled learning of glycosylation sites in the human proteome.
Li F, Zhang Y, Purcell AW, Webb GI, Chou KC, Lithgow T, Li C, Song J. Li F, et al. BMC Bioinformatics. 2019 Mar 6;20(1):112. doi: 10.1186/s12859-019-2700-1. BMC Bioinformatics. 2019. PMID: 30841845 Free PMC article.
Drug-drug interaction prediction: databases, web servers and computational models.
Zhao Y, Yin J, Zhang L, Zhang Y, Chen X. Zhao Y, et al. Brief Bioinform. 2023 Nov 22;25(1):bbad445. doi: 10.1093/bib/bbad445. Brief Bioinform. 2023. PMID: 38113076 Free PMC article. Review.
Twenty years of bioinformatics research for protease-specific substrate and cleavage site prediction: a comprehensive revisit and benchmarking of existing methods.
Li F, Wang Y, Li C, Marquez-Lago TT, Leier A, Rawlings ND, Haffari G, Revote J, Akutsu T, Chou KC, Purcell AW, Pike RN, Webb GI, Ian Smith A, Lithgow T, Daly RJ, Whisstock JC, Song J. Li F, et al. Brief Bioinform. 2019 Nov 27;20(6):2150-2166. doi: 10.1093/bib/bby077. Brief Bioinform. 2019. PMID: 30184176 Free PMC article. Review.
Learning peptide properties with positive examples only.
Ansari M, White AD. Ansari M, et al. Digit Discov. 2024 Apr 19;3(5):977-986. doi: 10.1039/d3dd00218g. eCollection 2024 May 15. Digit Discov. 2024. PMID: 38756224 Free PMC article.

See all "Cited by" articles

References

1. Cheng F, Zhao Z. Machine learning-based prediction of drug-drug interactions by integrating drug phenotypic, therapeutic, chemical, and genomic properties. J Am Med Inform Assoc. 2014;21(e2):278–86. doi: 10.1136/amiajnl-2013-002512. - DOI - PMC - PubMed
1. Ai N, Fan X, Ekins S. In silico methods for predicting drug-drug interactions with cytochrome p-450s, transporters and beyond. Adv Drug Deliv Rev. 2015;86:46–60. doi: 10.1016/j.addr.2015.03.006. - DOI - PubMed
1. Snyder BD, Polasek TM, Doogue MP. Drug interactions: principles and practice. Aust Prescr. 2012;35(3):85–8. doi: 10.18773/austprescr.2012.037. - DOI
1. Law V, Knox C, Djoumbou Y, Jewison T, Guo AC, Liu Y, Maciejewski A, Arndt D, Wilson M, Neveu V, et al. Drugbank 4.0: shedding new light on drug metabolism. Nucleic Acids Res. 2014;42(D1):1091–097. doi: 10.1093/nar/gkt1068. - DOI - PMC - PubMed
1. DrugBank. DrugBank Stat. http://www.drugbank.ca/stats. Accessed 31 Mar 2016.

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations
Medical
- MedlinePlus Health Information

[1] Cheng F, Zhao Z. Machine learning-based prediction of drug-drug interactions by integrating drug phenotypic, therapeutic, chemical, and genomic properties. J Am Med Inform Assoc. 2014;21(e2):278–86. doi: 10.1136/amiajnl-2013-002512. - DOI - PMC - PubMed

[2] Cheng F, Zhao Z. Machine learning-based prediction of drug-drug interactions by integrating drug phenotypic, therapeutic, chemical, and genomic properties. J Am Med Inform Assoc. 2014;21(e2):278–86. doi: 10.1136/amiajnl-2013-002512. - DOI - PMC - PubMed

[3] Ai N, Fan X, Ekins S. In silico methods for predicting drug-drug interactions with cytochrome p-450s, transporters and beyond. Adv Drug Deliv Rev. 2015;86:46–60. doi: 10.1016/j.addr.2015.03.006. - DOI - PubMed

[4] Ai N, Fan X, Ekins S. In silico methods for predicting drug-drug interactions with cytochrome p-450s, transporters and beyond. Adv Drug Deliv Rev. 2015;86:46–60. doi: 10.1016/j.addr.2015.03.006. - DOI - PubMed

[5] Snyder BD, Polasek TM, Doogue MP. Drug interactions: principles and practice. Aust Prescr. 2012;35(3):85–8. doi: 10.18773/austprescr.2012.037. - DOI

[6] Snyder BD, Polasek TM, Doogue MP. Drug interactions: principles and practice. Aust Prescr. 2012;35(3):85–8. doi: 10.18773/austprescr.2012.037. - DOI

[7] Law V, Knox C, Djoumbou Y, Jewison T, Guo AC, Liu Y, Maciejewski A, Arndt D, Wilson M, Neveu V, et al. Drugbank 4.0: shedding new light on drug metabolism. Nucleic Acids Res. 2014;42(D1):1091–097. doi: 10.1093/nar/gkt1068. - DOI - PMC - PubMed

[8] Law V, Knox C, Djoumbou Y, Jewison T, Guo AC, Liu Y, Maciejewski A, Arndt D, Wilson M, Neveu V, et al. Drugbank 4.0: shedding new light on drug metabolism. Nucleic Acids Res. 2014;42(D1):1091–097. doi: 10.1093/nar/gkt1068. - DOI - PMC - PubMed

[9] DrugBank. DrugBank Stat. http://www.drugbank.ca/stats. Accessed 31 Mar 2016.

[10] DrugBank. DrugBank Stat. http://www.drugbank.ca/stats. Accessed 31 Mar 2016.

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Positive-Unlabeled Learning for inferring drug interactions based on heterogeneous attributes

Affiliations

Positive-Unlabeled Learning for inferring drug interactions based on heterogeneous attributes

Authors

Affiliations

Abstract

Figures

Similar articles

Cited by

References

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical

Abstract

Figures

Similar articles

Cited by

References

MeSH terms

Substances

Related information

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical