Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Sep 4;14(9):jkae139.
doi: 10.1093/g3journal/jkae139.

Chromosome-scale genome assembly and annotation of the tetraploid potato cultivar Diacol Capiro adapted to the Andean region

Affiliations

Chromosome-scale genome assembly and annotation of the tetraploid potato cultivar Diacol Capiro adapted to the Andean region

Paula H Reyes-Herrera et al. G3 (Bethesda). .

Abstract

Potato (Solanum tuberosum) is an essential crop for food security and is ranked as the third most important crop worldwide for human consumption. The Diacol Capiro cultivar holds the dominant position in Colombian cultivation, primarily catering to the food processing industry. This highly heterozygous, autotetraploid cultivar belongs to the Andigenum group and it stands out for its adaptation to a wide variety of environments spanning altitudes from 1,800 to 3,200 meters above sea level. Here, a chromosome-scale assembly, referred to as DC, is presented for this cultivar. The assembly was generated by combining circular consensus sequencing with proximity ligation Hi-C for the scaffolding and represents 2.369 Gb with 48 pseudochromosomes covering 2,091 Gb and an anchor rate of 88.26%. The reference genome metrics, including an N50 of 50.5 Mb, a BUSCO (Benchmarking Universal Single-Copy Orthologue) score of 99.38%, and an Long Terminal Repeat Assembly Index score of 13.53, collectively signal the achieved high assembly quality. A comprehensive annotation yielded a total of 154,114 genes, and the associated BUSCO score of 95.78% for the annotated sequences attests to their completeness. The number of predicted NLR (Nucleotide-Binding and Leucine-Rich-Repeat genes) was 2107 with a large representation of NBARC (for nucleotide binding domain shared by Apaf-1, certain R gene products, and CED-4) containing domains (99.85%). Further comparative analysis of the proposed annotation-based assembly with high-quality known potato genomes, showed a similar genome metrics with differences in total gene numbers related to the ploidy status. The genome assembly and annotation of DC presented in this study represent a valuable asset for comprehending potato genetics. This resource aids in targeted breeding initiatives and contributes to the creation of enhanced, resilient, and more productive potato varieties, particularly beneficial for countries in Latin America.

Keywords: Solanum tuberosum; Andigenum group; genome assembly; tetraploid assembly.

PubMed Disclaimer

Conflict of interest statement

Conflicts of interest The author(s) declare no conflicts of interest.

Figures

Fig. 1.
Fig. 1.
Tubers from cultivar Diacol Capiro.
Fig. 2.
Fig. 2.
Boxplots displaying the range of sizes of the four largest scaffolds for each of the 12 chromosomes (Chr) using different assembly strategies (st0–st7). The DC assembly is also included. In the legend, the strategies are separated based on the reference genome used at the pruning step. The horizontal lines above indicate the size of each chromosome in the reference genomes DMv6.1 (green) and RH89-039-16 with two haplotypes (blue and red).
Fig. 3.
Fig. 3.
Pipeline for the selection of pseudo-chromosomes using the results from three strategies. (i) Selection of the four largest scaffolds (A, B, C, and D) for each chromosome (Chri) for the strategies st2, st3, and st6. (ii) Alignment with Minimap2 among the scaffolds of each strategy and comparison to identify similar groups and different scaffolds. (iii) Identification of the representative scaffolds, between similar scaffolds, based on BUSCO score. For this example Ast2, Cst2, and Dst2 had the highest BUSCO score. (iv) Building hybrid strategies combining representative scaffolds and the different group, each row is an alternative combination of scaffolds to represent the Chri. (v) Comparisons of hybrid strategies based on the BUSCO score, median alignment rate to DMv6.1 and contiguity based on dot plot with DMv6.1. (vi) Final selection of best candidates as pseudo-chromosomes for Chri, in this case Ast2Ast3Cst2Dst2 is a combination of scaffolds from st2 and st3.
Fig. 4.
Fig. 4.
Dotplot with the alignment of DC (Diacol Capiro) assembly against the DMv6.1 chromosome-scale assembly. Sets of 4 homologous chromosomes aligned to each DMv6.1 chromosome are observed.
Fig. 5.
Fig. 5.
Circos plot for the DC assembly. a) 48 pseudo-chromosomes. b) LAI over 100 kb sliding windows. c) Reads from CENH3-ChIP-seq (Gong et al. 2012) aligned to the DC assembly, with chromosomes divided into 100 kb windows. The resultant peaks detected in this analysis are indicative of potential centromeric regions within the genome. d) Histogram for genes using a window size of 5 Mb. e) Histogram for LTRs with a window size of 5 Mb.
Fig. 6.
Fig. 6.
Proportions of predicted NLR (Nucleotide-binding Leucine-rich Repeat) genes across six distinct potato genome assemblies. The representation of various combinations of domains within NLR genes is visualized using different colors (see legend). Percentages below 1% were not included in the figure to prevent visual overlaps and maintain clarity.

Similar articles

References

    1. Andrade-Piedra J, Torres L. 2011. Inventario de tecnologías e información para el Cultivo de papa en ecuador. Information at https://cipotato.org/papaenecuador/2017/10/12/24-diacol-capiro/.
    1. Arendsee Z, Li J, Singh U, Seetharam A, Dorman K, Wurtele ES. 2019. phylostratr: a framework for phylostratigraphy. Bioinformatics. 35:3617–3627. doi:10.1093/bioinformatics/btz171. - DOI - PubMed
    1. Aversano R, Contaldi F, Ercolano MR, Grosso V, Iorizzo M, Tatino F, Xumerle L, Dal Molin A, Avanzato C, Ferrarini A, et al. 2015. The solanum commersonii genome sequence provides insights into adaptation to stress conditions and genome evolution of wild potato relatives. Plant Cell. 27:954–968. doi:10.1105/tpc.114.135954. - DOI - PMC - PubMed
    1. Ballvora A, Ercolano MR, Weiss J, Meksem K, Bormann CA, Oberhagemann P, Salamini F, Gebhardt C. 2002. The R1 gene for potato resistance to late blight (Phytophthora infestans) belongs to the leucine zipper/NBS/LRR class of plant resistance genes. Plant J. 30:361–371. doi:10.1046/j.1365-313X.2001.01292.x. - DOI - PubMed
    1. Bao Z, Li C, Li G, Wang P, Peng Z, Cheng L, Li H, Zhang Z, Li Y, Huang W, et al. 2022. Genome architecture and tetrasomic inheritance of autotetraploid potato. Mol Plant. 15:1211–1226. doi:10.1016/j.molp.2022.06.009. - DOI - PubMed

LinkOut - more resources