Using cloud computing infrastructure with CloudBioLinux, CloudMan, and Galaxy
- PMID: 22700313
- PMCID: PMC3412548
- DOI: 10.1002/0471250953.bi1109s38
Using cloud computing infrastructure with CloudBioLinux, CloudMan, and Galaxy
Abstract
Cloud computing has revolutionized availability and access to computing and storage resources, making it possible to provision a large computational infrastructure with only a few clicks in a Web browser. However, those resources are typically provided in the form of low-level infrastructure components that need to be procured and configured before use. In this unit, we demonstrate how to utilize cloud computing resources to perform open-ended bioinformatic analyses, with fully automated management of the underlying cloud infrastructure. By combining three projects, CloudBioLinux, CloudMan, and Galaxy, into a cohesive unit, we have enabled researchers to gain access to more than 100 preconfigured bioinformatics tools and gigabytes of reference genomes on top of the flexible cloud computing infrastructure. The protocol demonstrates how to set up the available infrastructure and how to use the tools via a graphical desktop interface, a parallel command-line interface, and the Web-based Galaxy interface.
Figures
Similar articles
-
Galaxy CloudMan: delivering cloud compute clusters.BMC Bioinformatics. 2010 Dec 21;11 Suppl 12(Suppl 12):S4. doi: 10.1186/1471-2105-11-S12-S4. BMC Bioinformatics. 2010. PMID: 21210983 Free PMC article.
-
Tavaxy: integrating Taverna and Galaxy workflows with cloud computing support.BMC Bioinformatics. 2012 May 4;13:77. doi: 10.1186/1471-2105-13-77. BMC Bioinformatics. 2012. PMID: 22559942 Free PMC article.
-
Laniakea@ReCaS: exploring the potential of customisable Galaxy on-demand instances as a cloud-based service.BMC Bioinformatics. 2021 Nov 8;22(Suppl 15):544. doi: 10.1186/s12859-021-04401-3. BMC Bioinformatics. 2021. PMID: 34749633 Free PMC article.
-
Bioinformatics clouds for big data manipulation.Biol Direct. 2012 Nov 28;7:43; discussion 43. doi: 10.1186/1745-6150-7-43. Biol Direct. 2012. PMID: 23190475 Free PMC article. Review.
-
Improving data workflow systems with cloud services and use of open data for bioinformatics research.Brief Bioinform. 2018 Sep 28;19(5):1035-1050. doi: 10.1093/bib/bbx039. Brief Bioinform. 2018. PMID: 28419324 Free PMC article. Review.
Cited by
-
Collaborative cloud-enabled tools allow rapid, reproducible biological insights.ISME J. 2013 Mar;7(3):461-4. doi: 10.1038/ismej.2012.123. Epub 2012 Oct 25. ISME J. 2013. PMID: 23096404 Free PMC article. No abstract available.
-
Experiences with workflows for automating data-intensive bioinformatics.Biol Direct. 2015 Aug 19;10:43. doi: 10.1186/s13062-015-0071-8. Biol Direct. 2015. PMID: 26282399 Free PMC article.
-
CloudLaunch: Discover and Deploy Cloud Applications.Future Gener Comput Syst. 2019 May;94:802-810. doi: 10.1016/j.future.2018.04.037. Epub 2018 Jun 15. Future Gener Comput Syst. 2019. PMID: 34366521 Free PMC article.
-
Bioinformatics on the cloud computing platform Azure.PLoS One. 2014 Jul 22;9(7):e102642. doi: 10.1371/journal.pone.0102642. eCollection 2014. PLoS One. 2014. PMID: 25050811 Free PMC article.
-
The Widening Gulf between Genomics Data Generation and Consumption: A Practical Guide to Big Data Transfer Technology.Bioinform Biol Insights. 2015 Sep 23;9(Suppl 1):9-19. doi: 10.4137/BBI.S28988. eCollection 2015. Bioinform Biol Insights. 2015. PMID: 26568680 Free PMC article. Review.
References
-
- Afgan E, Goecks J, Baker D, Coraor N, Nekrutenko A, Taylor J. Yang K. Galaxy - a Gateway to Tools in e-Science. Guide to e-Science: Next Generation Scientific Research and Discovery. 2011c Springer;:145–177.
-
- Posada D. jModelTest: Phylogenetic Model Averaging. Molecular Biology and Evolution. 2008;25:1253–1256. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources