Abstract
Key message
We provided a study on homeologous gene evolution of homeologous genes by comparing Brassica genomes.
Abstract
Polyploidy has played fundamental roles during the evolution of plants. Following polyploidization, many duplicated genes are diversified or lost in a process termed diploidization. Understanding the retention and diversification of homeologs after polyploidization will help elucidate the process of diploidization. Here, we investigated the evolution of homeologous genes in Brassica genomes and observed similarly asymmetrical gene retention among different functional groups and consistent retention after recurrent polyploidizations. In the comparative analysis of Brassica diploid genomes, we found that preferentially retained genes show different patterns on sequence and expression divergence: genes with the function of ‘biosynthetic process’ and ‘transport’ were under much stronger purifying selection, while transcriptional regulatory genes diverged much faster than other genes. Duplicate pairs of the former two functional groups show conserved high expression patterns, while most of transcriptional regulatory genes are simultaneously lowly expressed. Furthermore, homeologs in diploids and allotetraploids showed similar loss and retention patterns: duplicates in progenitor genomes were more likely to be retained and accumulated fewer substitutions. However, transcriptional regulation is also enriched in the genes that do not have any non-synonymous mutations in the Brassica allotetraploids, indicating that some of these genes were under strong purifying selection. Overall, our study provided insight into the evolution of homeologs genes during diploidization process.
Similar content being viewed by others
References
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:3389–3402. https://doi.org/10.1093/nar/25.17.3389
Amborella Genome P (2013) The Amborella genome and the evolution of. flowering plants Science 342:1241089. https://doi.org/10.1126/science.1241089
Arabidopsis Genome I (2000) Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408:796–815. https://doi.org/10.1038/35048692
Babu MM, Luscombe NM, Aravind L, Gerstein M, Teichmann SA (2004) Structure and evolution of transcriptional regulatory networks. Curr Opin Struct Biol 14:283–291. https://doi.org/10.1016/j.sbi.2004.05.004
Birchler JA, Veitia RA (2007) The gene balance hypothesis: from classical genetics to modern genomics. Plant Cell 19:395–402. https://doi.org/10.1105/tpc.106.049338
Birchler JA, Veitia RA (2012) Gene balance hypothesis: connecting issues of dosage sensitivity across biological disciplines. Proc Natl Acad Sci USA 109:14746–14753. https://doi.org/10.1073/pnas.1207726109
Blanc G, Wolfe KH (2004) Functional divergence of duplicated genes formed by polyploidy during Arabidopsis evolution. Plant Cell 16:1679–1691. https://doi.org/10.1105/tpc.021410
Cai C et al (2017) Brassica rapa genome 2.0: a reference upgrade through sequence re-assembly and gene re-annotation. Mol Plant 10:649–651. https://doi.org/10.1016/j.molp.2016.11.008
Chalhoub B et al (2014) Early allopolyploid evolution in the post-Neolithic Brassica napus oilseed genome. Science 345:950–953. https://doi.org/10.1126/science.1253435
Cheng F, Wu J, Cai X, Liang J, Freeling M, Wang X (2018) Gene retention, fractionation and subgenome differences in polyploid plants. Nat Plants 4:258–268. https://doi.org/10.1038/s41477-018-0136-7
Conant GC, Birchler JA, Pires JC (2014) Dosage, duplication, and diploidization: clarifying the interplay of multiple models for duplicate gene evolution over time. Curr Opin Plant Biol 19:91–98. https://doi.org/10.1016/j.pbi.2014.05.008
De Smet R, Adams KL, Vandepoele K, Van Montagu MC, Maere S, Van de Peer Y (2013) Convergent gene loss following gene and genome duplications creates single-copy families in flowering plants. Proc Natl Acad Sci USA 110:2898–2903. https://doi.org/10.1073/pnas.1300127110
Edgar RC (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32:1792–1797. https://doi.org/10.1093/nar/gkh340
Ermolaeva MD, Wu M, Eisen JA, Salzberg SL (2003) The age of the Arabidopsis thaliana genome duplication. Plant Mol Biol 51:859–866. https://doi.org/10.1023/a:1023001130337
Freeling M (2009) Bias in plant gene content following different sorts of duplication: tandem, whole-genome, segmental, or by transposition. Annu Rev Plant Biol 60:433–453. https://doi.org/10.1146/annurev.arplant.043008.092122
Freeling M, Thomas BC (2006) Gene-balanced duplications, like tetraploidy, provide predictable drive to increase morphological complexity. Genom Res 16:805–814. https://doi.org/10.1101/gr.3681406
Hu TT et al (2011) The Arabidopsis lyrata genome sequence and the basis of rapid genome size change. Nat Genet 43:476–481. https://doi.org/10.1038/ng.807
Jiang WK, Liu YL, Xia EH, Gao LZ (2013) Prevalent role of gene features in determining evolutionary fates of whole-genome duplication duplicated genes in flowering plants. Plant Physiol 161:1844–1861. https://doi.org/10.1104/pp.112.200147
Jiao Y et al (2011) Ancestral polyploidy in seed plants and angiosperms. Nature 473:97–100. https://doi.org/10.1038/nature09916
Kaltenegger E, Leng S, Heyl A (2018) The effects of repeated whole genome duplication events on the evolution of cytokinin signaling pathway. BMC Evol Biol 18:76. https://doi.org/10.1186/s12862-018-1153-x
Lamesch P et al (2012) The Arabidopsis Information Resource (TAIR): improved gene annotation and new tools. Nucleic Acids Res 40:D1202–D1210. https://doi.org/10.1093/nar/gkr1090
Li H et al (2009) The sequence alignment/map format and SAMtools. Bioinformatics 25:2078–2079. https://doi.org/10.1093/bioinformatics/btp352
Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25:1754–1760. https://doi.org/10.1093/bioinformatics/btp324
Li Z, Defoort J, Tasdighian S, Maere S, Van de Peer Y, De Smet R (2016) Gene duplicability of core genes is highly consistent across all Angiosperms. Plant Cell 28:326–344. https://doi.org/10.1105/tpc.15.00877
Li M, Wang R, Liang Z, Wu X, Wang J (2019) Genome-wide identification and analysis of the EIN3/EIL gene family in allotetraploid Brassica napus reveal its potential advantages during polyploidization. BMC Plant Biol 19:110. https://doi.org/10.1186/s12870-019-1716-z
Liu S et al (2014) The Brassica oleracea genome reveals the asymmetrical evolution of polyploid genomes. Nat Commun 5:3930. https://doi.org/10.1038/ncomms4930
Lysak MA, Koch MA, Pecinka A, Schubert I (2005) Chromosome triplication found across the tribe Brassiceae. Genom Res 15:516–525. https://doi.org/10.1101/gr.3531105
Lysak MA, Mandakova T, Schranz ME (2016) Comparative paleogenomics of crucifers: ancestral genomic blocks revisited. Curr Opin Plant Biol 30:108–115. https://doi.org/10.1016/j.pbi.2016.02.001
Ma XF, Gustafson JP (2005) Genome evolution of allopolyploids: a process of cytological and genetic diploidization. Cytogenet Genom Res 109:236–249. https://doi.org/10.1159/000082406
Maere S, De Bodt S, Raes J, Casneuf T, Van Montagu M, Kuiper M, Van de Peer Y (2005) Modeling gene and genome duplications in eukaryotes. Proc Natl Acad Sci USA 102:5454–5459. https://doi.org/10.1073/pnas.0501102102
Maere S, Heymans K, Kuiper M (2005) BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks. Bioinformatics 21:3448–3449. https://doi.org/10.1093/bioinformatics/bti551
Nagaharu U (1935) Genome analysis in Brassica with special reference to the experimental formation of B. napus and peculiar mode of fertilization Japan. J Bot 7:389–452
Paterson AH, Chapman BA, Kissinger JC, Bowers JE, Feltus FA, Estill JC (2006) Many gene and domain families have convergent fates following independent whole-genome duplication events in Arabidopsis, Oryza, Saccharomyces and Tetraodon . Trends Genet 22:597–602. https://doi.org/10.1016/j.tig.2006.09.003
Proost S, Fostier J, De Witte D, Dhoedt B, Demeester P, Van de Peer Y, Vandepoele K (2012) i-ADHoRe 3.0–fast and sensitive detection of genomic homology in extremely large data sets. Nucleic Acids Res 40:e11. https://doi.org/10.1093/nar/gkr955
Ramirez-Gonzalez RH et al (2018) The transcriptional landscape of polyploid wheat. Science. https://doi.org/10.1126/science.aar6089
Ramsey J, Schemske DW (2002) Neopolyploidy in flowering plants. Annu Rev Ecol Syst 33:51
Renny-Byfield S, Wendel JF (2014) Doubling down on genomes: polyploidy and crop plants. Am J Bot 101:1711–1725. https://doi.org/10.3732/ajb.1400119
Schranz ME, Lysak MA, Mitchell-Olds T (2006) The ABC’s of comparative genomics in the Brassicaceae: building blocks of crucifer genomes. Trends Plant Sci 11:535–542. https://doi.org/10.1016/j.tplants.2006.09.002
Shi T et al (2020) Distinct expression and methylation patterns for genes with different fates following a single whole-genome duplication in flowering plants. Mol Biol Evol 37:2394–2413. https://doi.org/10.1093/molbev/msaa105
Smith RD, Kinser TJ, Smith GDC, Puzey JR (2019) A likelihood ratio test for changes in homeolog expression bias. BMC Bioinform 20:149. https://doi.org/10.1186/s12859-019-2709-5
Soltis DE, Soltis PS (1995) The dynamic nature of polyploid genomes. Proc Natl Acad Sci USA 92:8089–8091. https://doi.org/10.1073/pnas.92.18.8089
Soltis PS, Marchant DB, Van de Peer Y, Soltis DE (2015) Polyploidy and genome evolution in plants. Curr Opin Genet Dev 35:119–125. https://doi.org/10.1016/j.gde.2015.11.003
Song H, Gao H, Liu J, Tian P, Nan Z (2017) Comprehensive analysis of correlations among codon usage bias, gene expression, and substitution rate in Arachis duranensis and Arachis ipaensis orthologs. Sci Rep 7:14853. https://doi.org/10.1038/s41598-017-13981-1
Song MJ, Potter BI, Doyle JJ, Coate JE (2020) Gene balance predicts transcriptional responses immediately following ploidy change in Arabidopsis thaliana. Plant Cell 32:1434–1448. https://doi.org/10.1105/tpc.19.00832
Stamatakis A (2014) RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30:1312–1313. https://doi.org/10.1093/bioinformatics/btu033
Su G, Morris JH, Demchak B, Bader GD (2014) Biological network exploration with Cytoscape 3. Curr Protoc Bioinform 47:8. https://doi.org/10.1002/0471250953.bi0813s47
Tasdighian S, Van Bel M, Li Z, Van de Peer Y, Carretero-Paulet L, Maere S (2017) Reciprocally retained genes in the angiosperm lineage show the hallmarks of dosage balance sensitivity. Plant Cell 29:2766–2785. https://doi.org/10.1105/tpc.17.00313
Van de Peer Y, Maere S, Meyer A (2009) The evolutionary significance of ancient genome duplications. Nat Rev Genet 10:725–732. https://doi.org/10.1038/nrg2600
Van de Peer Y, Mizrachi E, Marchal K (2017) The evolutionary significance of polyploidy. Nat Rev Genet 18:411–424. https://doi.org/10.1038/nrg.2017.26
Wang X et al (2011) The genome of the mesopolyploid crop species Brassica rapa. Nat Genet 43:1035–1039. https://doi.org/10.1038/ng.919
Wang J et al (2018) An overlooked paleotetraploidization in Cucurbitaceae. Mol Biol Evol 35:16–26. https://doi.org/10.1093/molbev/msx242
Wang W et al (2019) Chromosome level comparative analysis of Brassica genomes. Plant Mol Biol 99:237–249. https://doi.org/10.1007/s11103-018-0814-x
Wang D, Zhang Y, Zhang Z, Zhu J, Yu J (2010) KaKs_Calculator 2.0: a toolkit incorporating gamma-series methods and sliding window strategies. Genom Proteom Bioinform 8:77–80. https://doi.org/10.1016/S1672-0229(10)60008-3
Wang P, Moore BM, Panchy NL, Meng F, Lehti-Shiu MD, Shiu SH (2018) Factors influencing gene family size variation among related species in a plant family, Solanaceae. Genom Biol Evol 10:2596–2613. https://doi.org/10.1093/gbe/evy193
Wendel JF (2000) Genome evolution in polyploids. Plant Mol Biol 42:225–249
Wolfe KH (2001) Yesterday’s polyploids and the mystery of diploidization. Nat Rev Genet 2:333–341. https://doi.org/10.1038/35072009
Yang J et al (2016) The genome sequence of allopolyploid Brassica juncea and analysis of differential homoeolog gene expression influencing selection. Nat Genet 48:1225–1232. https://doi.org/10.1038/ng.3657
Yang Z et al (2017) Genome-wide analysis of WOX genes in upland cotton and their expression pattern under different stresses. BMC Plant Biol 17:113. https://doi.org/10.1186/s12870-017-1065-8
Acknowledgements
This work was supported by the National Natural Science Foundation of China (Grant Nos. 31970241, 31370258).
Author information
Authors and Affiliations
Contributions
JW and WW designed the study. HZ and WW conducted functional annotation of the genes and family clustering together with genes from related species. JX performed transcriptome analysis. W.W. performed syntenic alignment analysis and constructed the subgenomes. HZ and WW performed the analysis on homeologous genes in Brassica diploids and allotetraploids. HZ and WW wrote this manuscript. JW and WW revised the manuscript. All authors have read and commented on the manuscript.
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Research involving human and/or animal participants
This article does not contain any studies with human participants or animals performed by any of the authors.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Zhang, H., Xie, J., Wang, W. et al. Comparison of Brassica Genomes reveals asymmetrical gene retention between functional groups of genes in recurrent polyploidizations. Plant Mol Biol 106, 193–206 (2021). https://doi.org/10.1007/s11103-021-01137-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11103-021-01137-9