Abstract
Networks are powerful tools to uncover functional roles of genes in phenotypic variation at a system-wide scale. Here, we constructed a maize network map that contains the genomic, transcriptomic, translatomic and proteomic networks across maize development. This map comprises over 2.8 million edges in more than 1,400 functional subnetworks, demonstrating an extensive network divergence of duplicated genes. We applied this map to identify factors regulating flowering time and identified 2,651 genes enriched in eight subnetworks. We validated the functions of 20 genes, including 18 with previously unknown connections to flowering time in maize. Furthermore, we uncovered a flowering pathway involving histone modification. The multi-omics integrative network map illustrates the principles of how molecular networks connect different types of genes and potential pathways to map a genome-wide functional landscape in maize, which should be applicable in a wide range of species.
This is a preview of subscription content, access via your institution
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
$29.99 / 30 days
cancel any time
Subscribe to this journal
Receive 12 print issues and online access
$209.00 per year
only $17.42 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
Data availability
The ChIA–PET data were deposited in the Sequence Read Archive (SRA) of the NCBI (https://www.ncbi.nlm.nih.gov/sra) under BioProject accession no. PRJNA541043. The raw transcriptome data for 26 tissues or stages were deposited in the Gene Expression Omnibus (GEO) of the NCBI (https://www.ncbi.nlm.nih.gov/geo/) under GEO accession no. GSE199932. The remaining raw transcriptome data for the five other tissues or stages, which were generated in our previous study54, are available at the NCBI under BioProject accession no. PRJNA637713 (SRR13808023–SRR13808026, SRR13808028–SRR13808033). The raw translatome data for 16 tissues or stages were deposited in the GEO under accession no. GSE199932; the raw translatomic data for the five other tissues or stages are available at the SRA under accession no. PRJNA637713 (SRR13808051–SRR13808052, SRR13808055–SRR13808059, SRR13808061–SRR13808063). The raw interactome data were deposited in GEO under accession no. GSE199932; protein interactions were submitted to the IMEx (http://www.imexconsortium.org) consortium through IntAct82 and assigned the identifier IM-29553.
Code availability
We deposited the raw data matrix, codes and bioinformatics analyses in GitHub (https://github.com/hanlinqian/IntegrativeNetworkMap). Codes are also available at Zenodo (https://doi.org/10.5281/zenodo.7263543)83. A user-friendly website (http://minteractome.ncpgr.cn/) was developed to host all the expression and network information. This is publicly available for the replication of the whole study.
References
Eisenstein, M. Big data: the power of petabytes. Nature 527, S2–S4 (2015).
Trewavas, A. A brief history of systems biology: ‘Every object that biology studies is a system of systems’. Francois Jacob (1974). Plant Cell 18, 2420–2430 (2006).
Dixon, S. J., Costanzo, M., Baryshnikova, A., Andrews, B. & Boone, C. Systematic mapping of genetic interaction networks. Annu. Rev. Genet. 43, 601–625 (2009).
Braun, P. Evidence for network evolution in an Arabidopsis interactome map. Science 333, 601–607 (2011).
Mergner, J. et al. Mass-spectrometry-based draft of the Arabidopsis proteome. Nature 579, 409–414 (2020).
Altmann, M. et al. Extensive signal integration by the phytohormone protein network. Nature 583, 271–276 (2020).
McWhite, C. D. et al. A pan-plant protein complex map reveals deep conservation and novel assemblies. Cell 181, 460–474 (2020).
Zander, M. et al. Integrated multi-omics framework of the plant response to jasmonic acid. Nat. Plants 6, 290–302 (2020).
Luck, K. et al. A reference map of the human binary protein interactome. Nature 580, 402–408 (2020).
Wang, H., Cimen, E., Singh, N. & Buckler, E. Deep learning for plant genomics and crop improvement. Curr. Opin. Plant Biol. 54, 34–41 (2020).
Yao, V. et al. An integrative tissue-network approach to identify and test human disease genes. Nat. Biotechnol. 36, 1091–1099 (2018).
Sartor, R. C., Noshay, J., Springer, N. M. & Briggs, S. P. Identification of the expressome by machine learning on omics data. Proc. Natl Acad. Sci. USA 116, 18119–18125 (2019).
Wu, L. et al. Using interactome big data to crack genetic mysteries and enhance future crop breeding. Mol. Plant 14, 77–94 (2021).
Wallace, J. G., Larsson, S. J. & Buckler, E. S. Entering the second century of maize quantitative genetics. Heredity 112, 30–38 (2014).
Walley, J. W. et al. Integration of omic networks in a developmental atlas of maize. Science 353, 814–818 (2016).
Peng, Y. et al. Chromatin interaction maps reveal genetic regulation for quantitative traits in maize. Nat. Commun. 10, 2632 (2019).
Yang, F. et al. Development and application of a recombination-based library versus library high- throughput yeast two-hybrid (RLL-Y2H) screening system. Nucleic Acids Res. 46, e17 (2018).
Zhu, G. et al. PPIM: a protein–protein interaction database for maize. Plant Physiol. 170, 618–626 (2016).
Wang, H., Studer, A. J., Zhao, Q., Meeley, R. & Doebley, J. F. Evidence that the origin of naked kernels during maize domestication was caused by a single amino acid substitution in tga1. Genetics 200, 965–974 (2015).
Freeling, M. Bias in plant gene content following different sorts of duplication: tandem, whole-genome, segmental, or by transposition. Annu. Rev. Plant Biol. 60, 433–453 (2009).
Qiao, X. et al. Gene duplication and evolution in recurring polyploidization–diploidization cycles in plants. Genome Biol. 20, 38 (2019).
Prince, V. E. & Pickett, F. B. Splitting pairs: the diverging fates of duplicated genes. Nat. Rev. Genet. 3, 827–837 (2002).
Schnable, J. C., Springer, N. M. & Freeling, M. Differentiation of the maize subgenomes by genome dominance and both ancient and ongoing gene loss. Proc. Natl Acad. Sci. USA 108, 4069–4074 (2011).
Li, L. et al. Co-expression network analysis of duplicate genes in maize (Zea mays L.) reveals no subgenome bias. BMC Genomics 17, 875 (2016).
Jeong, H., Mason, S. P., Barabási, A. L. & Oltvai, Z. N. Lethality and centrality in protein networks. Nature 411, 41–42 (2001).
Doebley, J., Stec, A. & Hubbard, L. The evolution of apical dominance in maize. Nature 386, 485–488 (1997).
Whipple, C. J. et al. grassy tillers1 promotes apical dominance in maize and responds to shade signals in the grasses. Proc. Natl Acad. Sci. USA 108, E506–E512 (2011).
Dong, Z. et al. Ideal crop plant architecture is mediated by tassels replace upper ears1, a BTB/POZ ankyrin repeat gene directly targeted by TEOSINTE BRANCHED1. Proc. Natl Acad. Sci. USA 114, E8656–E8664 (2017).
Gallavotti, A., Yang, Y., Schmidt, R. J. & Jackson, D. The relationship between auxin transport and maize branching. Plant Physiol. 147, 1913–1923 (2008).
Gälweiler, L. et al. Regulation of polar auxin transport by AtPIN1 in Arabidopsis vascular tissue. Science 282, 2226–2230 (1998).
Skirpan, A., Wu, X. & McSteen, P. Genetic and physical interaction suggest that BARREN STALK1 is a target of BARREN INFLORESCENCE2 in maize inflorescence development. Plant J. 55, 787–797 (2008).
Wang, Q., Kohlen, W., Rossmann, S., Vernoux, T. & Theres, K. Auxin depletion from the leaf axil conditions competence for axillary meristem formation in Arabidopsis and tomato. Plant Cell 26, 2068–2079 (2014).
Whipple, C. J. et al. A conserved mechanism of bract suppression in the grass family. Plant Cell 22, 565–578 (2010).
Yao, H. et al. The barren stalk2 gene is required for axillary meristem development in maize. Mol. Plant 12, 374–389 (2019).
Hake, S. Identification of cup-shaped cotyledon: new ways to think about organ initiation. Plant Cell 31, 1202–1203 (2019).
Feng, F. et al. OPAQUE11 is a central hub of the regulatory network for maize endosperm development and nutrient metabolism. Plant Cell 30, 375–396 (2018).
Mach, J. Clarifying the opaque: identification of direct targets of maize OPAQUE2. Plant Cell 27, 484 (2015).
Wang, W. et al. The Zea mays mutants opaque2 and opaque 16 disclose lysine change in waxy maize as revealed by RNA-Seq. Sci. Rep. 9, 12265 (2019).
Zhang, Z., Zheng, X., Yang, J., Messing, J. & Wu, Y. Maize endosperm-specific transcription factors O2 and PBF network the regulation of protein and starch synthesis. Proc. Natl Acad. Sci. USA 113, 10842–10847 (2016).
Buckler, E. S. et al. The genetic architecture of maize flowering time. Science 325, 714–718 (2009).
van Heerwaarden, J., Hufford, M. B. & Ross-Ibarra, J. Historical genomics of North American maize. Proc. Natl Acad. Sci. USA 109, 12420–12425 (2012).
Kuleshov, N. N. World’s diversity of phenotypes of maize. Agron. J. 25, 688–700 (1933).
Liu, H.-J. et al. High-throughput CRISPR/Cas9 mutagenesis streamlines trait gene identification in maize. Plant Cell 32, 1397–1413 (2020).
Dong, Z. et al. A gene regulatory network model for floral transition of the shoot apex in maize and its dynamic modeling. PLoS ONE 7, e43450 (2012).
Matsushika, A., Imamura, A., Yamashino, T. & Mizuno, T. Aberrant expression of the light-inducible and circadian-regulated APRR9 gene belonging to the circadian-associated APRR1/TOC1 quintet results in the phenotype of early flowering in Arabidopsis thaliana. Plant Cell Physiol. 43, 833–843 (2002).
Lusser, A., Kölle, D. & Loidl, P. Histone acetylation: lessons from the plant kingdom. Trends Plant Sci. 6, 59–65 (2001).
Xiao, J. et al. Requirement of histone acetyltransferases HAM1 and HAM2 for epigenetic modification of FLC in regulating flowering in Arabidopsis. J. Plant Physiol. 170, 444–451 (2013).
Shin, J.-H. & Chekanova, J. A. Arabidopsis RRP6L1 and RRP6L2 function in FLOWERING LOCUS C silencing via regulation of antisense RNA synthesis. PLoS Genet. 10, e1004612 (2014).
Gu, X. et al. Arabidopsis FLC clade members form flowering-repressor complexes coordinating responses to endogenous and environmental cues. Nat. Commun. 4, 1947 (2013).
Ecker, J. R. et al. Genomics: ENCODE explained. Nature 489, 52–55 (2012).
Rodriguez-Leal, D. et al. Evolution of buffering in a genetic circuit controlling plant stem cell proliferation. Nat. Genet. 51, 786–792 (2019).
Ma, C., Xin, M., Feldmann, K. A. & Wang, X. Machine learning-based differential network analysis: a study of stress-responsive transcriptomes in Arabidopsis. Plant Cell 26, 520–537 (2014).
Schaefer, R. J. et al. Integrating coexpression networks with GWAS to prioritize causal genes in maize. Plant Cell 30, 2922–2942 (2018).
Zhu, W. et al. Large-scale translatome profiling annotates the functional genome and reveals the key role of genic 3′ untranslated regions in translatomic variation in plants. Plant Commun. 2, 100181 (2021).
Cock, P. J. A., Fields, C. J., Goto, N., Heuer, M. L. & Rice, P. M. The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants. Nucleic Acids Res. 38, 1767–1771 (2010).
Kim, D., Langmead, B. & Salzberg, S. L. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods 12, 357–360 (2015).
Kovaka, S. et al. Transcriptome assembly from long-read RNA-seq alignments with StringTie2. Genome Biol. 20, 278 (2019).
Li, L. et al. Genome-wide discovery and characterization of maize long non-coding RNAs. Genome Biol. 15, R40 (2014).
Frith, M. C. et al. The abundance of short proteins in the mammalian proteome. PLoS Genet. 2, e52 (2006).
Kang, Y.-J. et al. CPC2: a fast and accurate coding potential calculator based on sequence intrinsic features. Nucleic Acids Res. 45, W12–W16 (2017).
Wang, K. et al. MapSplice: accurate mapping of RNA-seq reads for splice junction discovery. Nucleic Acids Res. 38, e178 (2010).
Johnson, M. et al. NCBI BLAST: a better web interface. Nucleic Acids Res. 36, W5–W9 (2008).
Trapnell, C., Pachter, L. & Salzberg, S. L. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics 25, 1105–1111 (2009).
Xu, H. et al. FastUniq: a fast de novo duplicates removal tool for paired short reads. PLoS ONE 7, e52249 (2012).
Zhang, X.-O. et al. Diverse alternative back-splicing and alternative splicing landscape of circular RNAs. Genome Res. 26, 1277–1287 (2016).
Wang, J., Yao, W., Zhu, D., Xie, W. & Zhang, Q. Genetic basis of sRNA quantitative variation analyzed using an experimental population derived from an elite rice hybrid. eLife 4, e03913 (2015).
Wang, B. et al. Unveiling the complexity of the maize transcriptome by single-molecule long-read sequencing. Nat. Commun. 7, 11708 (2016).
Jiao, Y. et al. Improved maize reference genome with single-molecule technologies. Nature 546, 524–527 (2017).
Crisp, P. A. et al. Variation and inheritance of small RNAs in maize inbreds and F1 hybrids. Plant Physiol. 182, 318–331 (2020).
Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 10, R25 (2009).
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Langmead, B. Aligning short sequencing reads with Bowtie. Curr. Protoc. Bioinformatics 32, 11.7.1–11.7.14 (2010).
Zhu, Y. Y., Machleder, E. M., Chenchik, A., Li, R. & Siebert, P. D. Reverse transcriptase template switching: a SMART approach for full-length cDNA library construction. Biotechniques 30, 892–897 (2001).
Bogdanova, E. A., Shagin, D. A. & Lukyanov, S. A. Normalization of full-length enriched cDNA. Mol. Biosyst. 4, 205–212 (2008).
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 17, 10–12 (2011).
Langfelder, P. & Horvath, S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics 9, 559 (2008).
Fuxman Bass, J. I. et al. Using networks to measure similarity between genes: association index selection. Nat. Methods 10, 1169–1176 (2013).
Lu, X. et al. Gene-indexed mutations in maize. Mol. Plant 11, 496–504 (2018).
Swigonová, Z. et al. Close split of sorghum and maize genome progenitors. Genome Res. 14, 1916–1923 (2004).
Wei, F. et al. Physical and genetic structure of the maize genome reflects its complex evolutionary history. PLoS Genet. 3, e123 (2007).
Salse, J. et al. Identification and characterization of shared duplications between rice and wheat provide new insight into grass genome evolution. Plant Cell 20, 11–24 (2008).
Orchard, S. et al. The MIntAct project—IntAct as a common curation platform for 11 molecular interaction databases. Nucleic Acids Res. 42, D358–D363 (2014).
Han, L. et al. IntegrativeNetworkMap: first release of IntegrativeNetworkMap code. Zenodo https://doi.org/10.5281/zenodo.7263543 (2022).
Acknowledgements
This research was supported by the National Natural Science Foundation of China (nos. 92035302, 91735305 and 31922068), the Hainan Yazhou Bay Seed Lab (no. B21HJ8102), the HZAU-AGIS Cooperation Fund (no. SZYJY2021006), the major program of Hubei Hongshan Laboratory (no. 2021hszd008), the National Key Research and Development Program of China (no. 2016YFD0100800), the Huazhong Agricultural University Scientific & Technological Self-innovation Foundation (no. 2021ZKPY001) (all to L.L.) and the Chinese Academy of Agricultural Sciences Innovation Project (no. CAAS-ZDRW202004) (to H.Z.). We also thank the China National GeneBank and the high-performance computing platform at the National Key Laboratory of Crop Genetic Improvement at Huazhong Agricultural University.
Author information
Authors and Affiliations
Contributions
L.L., F.Y. and J.Y. designed and supervised the study. W.S.Z., J.Q., M.J., P.T., W.C.Z., H.Z. and Y.S. collected the raw data. L.H., W.S.Z., J.Q., P.T., H.Z., Y.S., J.-W.F., G.C., B.F., R.L., X.G.L., Z.X., J.L., Z. Luo, S.C., D.X.D., Q.J., J.X.L., Z. Li, Y.L., X.J., Y.P., X.Y., C.Z., Y.Y., Z.T., H.C., W.F.L., L.-L.C., Q.L., F.Y. and L.L. performed the data analysis. L.L., F.Y., J.Y. and L.H. wrote the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Genetics thanks Pascal Falter-Braun, Silin Zhong and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 Landscape of functional elements in maize.
a, Data collection. The numbers in the cells of the matrix indicate the number of tissues collected for each of the four omics of high-throughput data. b, Distribution of different elements in transcriptome across the maize genome. Chr, chromosome. c, Proportion of different elements in transcriptome across the maize genome. d, Numbers of different elements identified across 31 tissues and stages. From top to bottom each bar represents fusion transcripts, long non-coding RNA, protein-coding transcripts, circRNAs and sRNA clusters in each tissue or stage, respectively.
Extended Data Fig. 2 Expression-level variation across 31 different stages/tissues for all different types of transcripts.
a, Sampling of 31 stages/tissues spanning the maize whole life period. b, Transcriptome landscape of all 31 different stages/tissues for all different types of transcript isoforms. c, Stage/tissue-specific expression of protein-coding (mRNA), ribosome-imprinted transcripts (Ribo-Seq RNA), long noncoding (lncRNA), fusion, circular and small RNA.
Extended Data Fig. 3 High-throughput yeast two-hybrid analysis for the identification of protein–protein interactions.
a, The high-throughput yeast two-hybrid pipeline composed three major steps: yeast cDNA library construction, yeast mating and sequencing, and data analysis (see Methods for details). b, Number of PPIs of different confidence levels between ZmPPIs and PPIM high-confidence PPIs. HC, high confidence; MC, middle confidence; LC, low confidence (N = 1000 repeats for Random; Numbers of Random of HC = 12.505 ± 3.592; Numbers of Random of LC = 44.537 ± 6.655; Numbers of Observed of HC = 176; Numbers of Observed of LC = 372). c, Validation rates of ZmPPIs by BiFC. d, Number of saturated PPIs for each library across different numbers of matings. Saturation value is the predicted value of the specific library.
Extended Data Fig. 4 High quality transcriptomic and translatomic data across different replicates.
a, Correlation coefficients of two replicates at the transcriptome level across different tissues/stages. b, Correlation coefficients of two replicates at the translational level across different tissues/stages. c, Good 3-nt periodicity of Ribo-seq data in our study. d, Overlap for co-expression and co-translation networks constructed by two replicates.
Extended Data Fig. 5 Five regulating divergence patterns of duplicate types in different omic layers.
a–d, Enrichment and depletion of five divergence patterns in Proximal (a), Tandem (b), Transposed (c) and Dispersed (d) in transcriptome, translatome and proteome three layers. P values were calculated by Chi-square test.
Extended Data Fig. 6
Mutagenesis resulting in the zmalog1, zmalog2, zmnam1 and zmnam2 mutants.
Extended Data Fig. 7 Reconstructions of the regulating networks for well-known pathways in maize.
a, Starch synthesis pathway in maize. b, Meristem developmental pathway. P value was calculated from the frequency in 1000 simulations.
Extended Data Fig. 8 AUC and F1 values from evaluations using five classical algorithms and the expression matrix from the transcriptome (co-expression) and translatome (co-translation), network attributes from the interactome, and integrative omics integrating the attributes of each omics dataset.
Box middle line represented the median, box edges were the 25th and 75th percentiles, and further outliers were marked individually.
Extended Data Fig. 9 Predicted pathway associated with Flowering time in maize.
a, Percentage of predicted flowering time genes supported by other, related evidence for effects on flowering time. Homolog represents genes that are homologs to Arabidopsis FT genes identified in the FLOR-ID flowering time database (http://www.phytosystems.ulg.ac.be/florid/). SNPs-GWAS refers to genes with FT association signals identified in maize by Liu et al.43. Others denotes genes for which there is no known functional evidence for an effect on flowering time. b, Eight molecular pathways were predicted to be associated with flowering time in maize.
Extended Data Fig. 10 Detailed flowchart of the whole study.
The accompanied intermediate data and bioinformatics scripts were deposited in GitHub (https://github.com/hanlinqian/IntegrativeNetworkMap).
Supplementary information
Supplementary Information
Supplementary Figs. 1–34, Results and Methods.
Supplementary Tables 1–16
Supplementary Table 1. Summary of raw data collected in our study. Supplementary Table 2. Tissue information of PPI mating libraries. Supplementary Table 3. Detailed information of PPIs detected across different libraries. Supplementary Table 4. PPIs validated by bimolecular fluorescence complementation (BiFC). Supplementary Table 5. Domains detected in PPIs. Supplementary Table 6. Summary of edges and modules detected in the multi-omics functional map. Supplementary Table 7. Validation rate by BiFC of different PPI datasets. Supplementary Table 8. Detailed information of functional genes involved in kernel development. Supplementary Table 9. Genes involved in the starch synthesis-related pathway. Supplementary Table 10. Genes involved in the meristem development-related pathway. Supplementary Table 11. Detailed information of well-known FT genes. Supplementary Table 12. Information of predicted FT genes. Supplementary Table 13. Detailed summary of field phenotypic tests for the mutants and WT counterparts of predicted FT genes. Supplementary Table 14. Detailed information of 20 validated FT genes. Supplementary Table 15. Detailed information of the primers/sequences used in PCR amplification, sequencing and mapping in the RLL-Y2H assay. Supplementary Table 16. Primer sequences in the CRISPR knockout experiments.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Han, L., Zhong, W., Qian, J. et al. A multi-omics integrative network map of maize. Nat Genet 55, 144–153 (2023). https://doi.org/10.1038/s41588-022-01262-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41588-022-01262-1
This article is cited by
-
A dynamic regulome of shoot-apical-meristem-related homeobox transcription factors modulates plant architecture in maize
Genome Biology (2024)
-
Prediction of plant complex traits via integration of multi-omics data
Nature Communications (2024)
-
Factors specifying sex determination in maize
Plant Reproduction (2024)
-
A translatome-transcriptome multi-omics gene regulatory network reveals the complicated functional landscape of maize
Genome Biology (2023)
-
QTG-Miner aids rapid dissection of the genetic base of tassel branch number in maize
Nature Communications (2023)