The Biological Observation Matrix (BIOM) format or: how I learned to stop worrying and love the ome-ome
- PMID: 23587224
- PMCID: PMC3626512
- DOI: 10.1186/2047-217X-1-7
The Biological Observation Matrix (BIOM) format or: how I learned to stop worrying and love the ome-ome
Abstract
Background: We present the Biological Observation Matrix (BIOM, pronounced "biome") format: a JSON-based file format for representing arbitrary observation by sample contingency tables with associated sample and observation metadata. As the number of categories of comparative omics data types (collectively, the "ome-ome") grows rapidly, a general format to represent and archive this data will facilitate the interoperability of existing bioinformatics tools and future meta-analyses.
Findings: The BIOM file format is supported by an independent open-source software project (the biom-format project), which initially contains Python objects that support the use and manipulation of BIOM data in Python programs, and is intended to be an open development effort where developers can submit implementations of these objects in other programming languages.
Conclusions: The BIOM file format and the biom-format project are steps toward reducing the "bioinformatics bottleneck" that is currently being experienced in diverse areas of biological sciences, and will help us move toward the next phase of comparative omics where basic science is translated into clinical and environmental applications. The BIOM file format is currently recognized as an Earth Microbiome Project Standard, and as a Candidate Standard by the Genomic Standards Consortium.
Figures
Similar articles
-
biojs-io-biom, a BioJS component for handling data in Biological Observation Matrix (BIOM) format.F1000Res. 2016 Sep 20;5:2348. doi: 10.12688/f1000research.9618.2. eCollection 2016. F1000Res. 2016. PMID: 28105307 Free PMC article.
-
A fast and efficient python library for interfacing with the Biological Magnetic Resonance Data Bank.BMC Bioinformatics. 2017 Mar 17;18(1):175. doi: 10.1186/s12859-017-1580-5. BMC Bioinformatics. 2017. PMID: 28302053 Free PMC article.
-
OME-NGFF: a next-generation file format for expanding bioimaging data-access strategies.Nat Methods. 2021 Dec;18(12):1496-1498. doi: 10.1038/s41592-021-01326-w. Epub 2021 Nov 29. Nat Methods. 2021. PMID: 34845388 Free PMC article.
-
A Python library for FAIRer access and deposition to the Metabolomics Workbench Data Repository.Metabolomics. 2018;14(5):64. doi: 10.1007/s11306-018-1356-6. Epub 2018 Apr 20. Metabolomics. 2018. PMID: 29706851 Free PMC article.
-
Proposal for a Standard Format for Neurophysiology Data Recording and Exchange.J Clin Neurophysiol. 2016 Oct;33(5):403-413. doi: 10.1097/WNP.0000000000000257. J Clin Neurophysiol. 2016. PMID: 26808620 Free PMC article. Review.
Cited by
-
Implementing high-throughput insect barcoding in microbiome studies: impact of non-destructive DNA extraction on microbiome reconstruction.PeerJ. 2024 Sep 23;12:e18025. doi: 10.7717/peerj.18025. eCollection 2024. PeerJ. 2024. PMID: 39329134 Free PMC article.
-
Bacterial cell surface characterization by phage display coupled to high-throughput sequencing.Nat Commun. 2024 Aug 29;15(1):7502. doi: 10.1038/s41467-024-51912-7. Nat Commun. 2024. PMID: 39209859 Free PMC article.
-
Assessment of Vibrio populations in a transect of Rhizophora mangle in Punta Galeta, Panamá: culture-dependent analyses reveal biotechnological applications.Rev Biol Trop. 2023;71(1):e50983. doi: 10.15517/rev.biol.trop..v71i1.50983. Epub 2023 Aug 4. Rev Biol Trop. 2023. PMID: 39175646 Free PMC article.
-
Quantitating primer-template interactions using deconstructed PCR.PeerJ. 2024 Aug 8;12:e17787. doi: 10.7717/peerj.17787. eCollection 2024. PeerJ. 2024. PMID: 39131619 Free PMC article.
-
Microbial communities reveal niche partitioning across the slope and bottom zones of the challenger deep.Environ Microbiol Rep. 2024 Aug;16(4):e13314. doi: 10.1111/1758-2229.13314. Environ Microbiol Rep. 2024. PMID: 39086173 Free PMC article.
References
-
- Qin J, Li R, Raes J, Arumugam M, Burgdorf KS, Manichanh C, Nielsen T, Pons N, Levenez F, Yamada T, Mende DR, Li J, Xu J, Li S, Li D, Cao J, Wang B, Liang H, Zheng H, Xie Y, Tap J, Lepage P, Bertalan M, Batto JM, Hansen T, Le Paslier D, Linneberg A, Nielsen HB, Pelletier E, Renault P, Sicheritz-Ponten T, Turner K, Zhu H, Yu C, Li S, Jian M, Zhou Y, Li Y, Zhang X, Li S, Qin N, Yang H, Wang J, Brunak S, Doré J, Guarner F, Kristiansen K, Pedersen O, Parkhill J, Weissenbach J, Weissenbach J, Bork P, Ehrlich SD, Wang J. MetaHIT Consortium. A human gut microbial gene catalogue established by metagenomic sequencing. Nature. 2010;464(7285):59–65. doi: 10.1038/nature08821. - DOI - PMC - PubMed
-
- Markowitz VM, Chen IM, Palaniappan K, Chu K, Szeto E, Grechkin Y, Ratner A, Jacob B, Huang J, Williams P, Huntemann M, Anderson I, Mavromatis K, Ivanova NN, Kyrpides NC. IMG: the Integrated Microbial Genomes database and comparative analysis system. Nucleic Acids Res. 2012;40:D115–122. doi: 10.1093/nar/gkr1044. - DOI - PMC - PubMed
-
- Zakrzewski M, Goesmann A, Jaenicke S, Jünemann S, Eikmeyer F, Szczepanowski R, Al-Soud WA, Sørensen S, Pühler A, Schlüter A. Profiling of the metabolically active community from a production-scale biogas plant by means of high-throughput metatranscriptome sequencing. J Biotechnol. 2012;158(4):248–258. doi: 10.1016/j.jbiotec.2012.01.020. - DOI - PubMed
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials