Improved chromosome-level genome assembly and annotation of the seagrass, <i>Zostera marina</i> (eelgrass)

Xiao Ma; Jeanine L. Olsen; Thorsten B.H. Reusch; Gabriele Procaccini; Dave Kudrna; Melissa Williams; Jane Grimwood; Shanmugam Rajasekar; Jerry Jenkins; Jeremy Schmutz; Yves Van de Peer

doi:10.12688/f1000research.38156.1

Home Browse Improved chromosome-level genome assembly and annotation of the seagrass,...

ALL Metrics

-

Views

-

Downloads

Get PDF

Get XML

Export

▬

✚

Research Article

Improved chromosome-level genome assembly and annotation of the seagrass, Zostera marina (eelgrass)

[version 1; peer review: 2 approved]

Xiao Ma¹, Jeanine L. Olsen², Thorsten B.H. Reusch³, [...] Gabriele Procaccini⁴, Dave Kudrna⁵, Melissa Williams⁶, Jane Grimwood⁶, Shanmugam Rajasekar⁷, Jerry Jenkins⁶, Jeremy Schmutz^5,6, Yves Van de Peer ^1,8,9

Xiao Ma¹, Jeanine L. Olsen², [...] Thorsten B.H. Reusch³, Gabriele Procaccini⁴, Dave Kudrna⁵, Melissa Williams⁶, Jane Grimwood⁶, Shanmugam Rajasekar⁷, Jerry Jenkins⁶, Jeremy Schmutz^5,6, Yves Van de Peer ^1,8,9

PUBLISHED 15 Apr 2021

Author details Author details

¹ Department of Plant Biotechnology and Bioinformatics, Ghent University - VIB, Ghent, 9052, Belgium
² Groningen Institute of Evolutionary Life Sciences, Groningen, 9747 AG, The Netherlands
³ GEOMAR Helmholtz Centre for Ocean Research Kiel, Marine Evolutionary Ecology, Kiel, 24105, Germany
⁴ Department of Integrative Marine Ecology, Stazione Zoologica Anton Dohrn, Napoli, 80123, Italy
⁵ Department of Energy Joint Genome Institute, Lawrence Berkeley National Lab, Berkeley, CA, USA
⁶ HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
⁷ Arizona Genomics Institute, School of Plant Sciences, University of Arizona Tucson, Tucson, AZ, 85721, USA
⁸ Department of Biochemistry, Genetics and Microbiology, University of Pretoria, Pretoria, South Africa
⁹ College of Horticulture, Nanjing Agricultural University, Nanjing, 210014, China

Xiao Ma
Roles: Data Curation, Formal Analysis, Methodology, Software, Visualization

Jeanine L. Olsen
Roles: Conceptualization, Funding Acquisition, Project Administration, Resources, Writing – Original Draft Preparation, Writing – Review & Editing

Thorsten B.H. Reusch
Roles: Conceptualization, Funding Acquisition, Resources

Gabriele Procaccini
Roles: Conceptualization, Resources

Dave Kudrna
Roles: Resources

Melissa Williams
Roles: Resources

Jane Grimwood
Roles: Project Administration, Resources, Software, Validation

Shanmugam Rajasekar
Roles: Resources

Jerry Jenkins
Roles: Data Curation, Methodology, Resources, Software, Writing – Original Draft Preparation

Jeremy Schmutz
Roles: Data Curation, Methodology, Resources, Software

Yves Van de Peer
Roles: Conceptualization, Funding Acquisition, Project Administration, Resources, Supervision, Visualization, Writing – Review & Editing

OPEN PEER REVIEW

REVIEWER STATUS

This article is included in the Plant Science gateway.

This article is included in the Genomics and Genetics gateway.

Abstract

Background: Seagrasses (Alismatales) are the only fully marine angiosperms. Zostera marina (eelgrass) plays a crucial role in the functioning of coastal marine ecosystems and global carbon sequestration. It is the most widely studied seagrass and has become a marine model system for exploring adaptation under rapid climate change. The original draft genome (v.1.0) of the seagrass Z. marina (L.) was based on a combination of Illumina mate-pair libraries and fosmid-ends. A total of 25.55 Gb of Illumina and 0.14 Gb of Sanger sequence was obtained representing 47.7× genomic coverage. The assembly resulted in ~2000 unordered scaffolds (L50 of 486 Kb), a final genome assembly size of 203MB, 20,450 protein coding genes and 63% TE content. Here, we present an upgraded chromosome-scale genome assembly and compare v.1.0 and the new v.3.1, reconfirming previous results from Olsen et al. (2016), as well as pointing out new findings.
Methods: The same high molecular weight DNA used in the original sequencing of the Finnish clone was used. A high-quality reference genome was assembled with the MECAT assembly pipeline combining PacBio long-read sequencing and Hi-C scaffolding.
Results: In total, 75.97 Gb PacBio data was produced. The final assembly comprises six pseudo-chromosomes and 304 unanchored scaffolds with a total length of 260.5Mb and an N50 of 34.6 MB, showing high contiguity and few gaps (~0.5%). 21,483 protein-encoding genes are annotated in this assembly, of which 20,665 (96.2%) obtained at least one functional assignment based on similarity to known proteins.
Conclusions: As an important marine angiosperm, the improved Z. marina genome assembly will further assist evolutionary, ecological, and comparative genomics at the chromosome level. The new genome assembly will further our understanding into the structural and physiological adaptations from land to marine life.

Keywords

Seagrass, Zostera marina, eelgrass, chromosome-scale genome assembly, annotation

Corresponding author: Yves Van de Peer

Competing interests: No competing interests were disclosed.

Grant information: The work conducted by the U.S. Department of Energy Joint Genome Institute was supported by the Office of Science of the U.S. Department of Energy under Contract No DE-AC02-05CH11231 to Lawrence Berkeley National Laboratory.

This work was supported by the DOE-Joint Genome Institute, Berkeley, CA, USA, Community Sequencing Program 2019, Grant Nr. 504341-Marine Angiosperm Genomes Initiative (MAGI) to YVdP, JLO, TBHR and GP.

Copyright: © 2021 Ma X et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Ma X, Olsen JL, Reusch TBH et al. Improved chromosome-level genome assembly and annotation of the seagrass, Zostera marina (eelgrass) [version 1; peer review: 2 approved]. F1000Research 2021, 10:289 (https://doi.org/10.12688/f1000research.38156.1) First published: 15 Apr 2021, 10:289 (https://doi.org/10.12688/f1000research.38156.1) Latest published: 15 Apr 2021, 10:289 (https://doi.org/10.12688/f1000research.38156.1)

Introduction

Seagrasses are a polyphyletic assemblage of early-diverging monocotyledoneous plants belonging to the Alismatales (Les, Cleland, and Waycott 1997; Du and Wang 2016); they are not true grasses (Poaceae). Several clades of seagrasses arose independently from freshwater sister taxa 3-4 times between the Paleocene and late Eocene (~65-34 mya) and are the only fully marine, flowering plants (~14 genera and ~65 species) (Chase et al. 2016). They occur in predominantly soft-sediment, marine coastal environments worldwide (Green, Short, and Frederick 2003) and as engineering species provide the foundation of three-dimensional habitats that are among the most productive and biodiverse (Costanza et al. 1997; Duffy et al. 2015). Seagrass meadows provide numerous ecosystem services, e.g., provisioning of fish and invertebrates, retention of nutrients (Larkum, Orth, and Duarte 2006) and carbon sequestration (Fourqurean et al. 2012). Unfortunately, they are also under threat related to human impacts (Waycott et al. 2009) that fundamentally change coastal system dynamics (Duffy et al. 2015) and make restoration extremely difficult (van Katwijk et al. 2016). Effective marine conservation strategies require integrative research perspectives between ecology and genomics (Hillebrand, Jacob, and Leslie 2020) because ecological and evolutionary change can and do occur on the same time scales (Carroll et al. 2007), e.g., genetic polymorphisms underlying critical traits or the role of genetic diversity at selectively relevant sites for population resilience.

Zostera marina (eelgrass) is a marine model species with >3000 papers covering a wide variety of ecological, evolutionary, conservation and biotech topics. Its unique, circumglobal, warm temperate to Arctic distribution has allowed it to withstand numerous cycles of rapid climate change during the Pliocene glacial and interglacial periods (Olsen et al. 2004), empirically demonstrating its capacity to adapt, acclimate and recover (Duarte et al. 2018), e.g., to temperature (Franssen et al. 2011; Jueterbock et al. 2016; Jueterbock et al. 2020), salinity gradients/osmoregulation (Shivaraj et al. 2017), ocean acidification (Zimmerman 2020) and potential pathogens (Brakel et al. 2014; Guan, Saha, and Weinberger 2020; Zang et al. 2020). Further, clonal populations of Z. marina can persist for hundreds to thousands of years ((Thorsten et al. 1999) for Z. marina, (Arnaud-Haond et al. 2012) for Posidonia oceanica)), yet have found ways to adapt through periods of rapid climate selection via genotypic plasticity, fostered by somatic mutation (Yu et al. 2020) and epigenetic modification of the methylome (DuBois, Williams, and Stachowicz 2020; Jueterbock et al. 2020). Microbiome interaction studies are being conducted in parallel with eelgrass resequencing, e.g., bacterial (Cucio et al. 2016; Fahimipour et al. 2017; Wang, Tomas, and Mueller 2020; Eisen et al. 2017) and fungal (Ettinger, Voerman, et al. 2017; Ettinger, Williams, et al. 2017; Ettinger and Eisen 2019, 2020; Ettinger, Vann, and Eisen 2020) to inform restoration strategies as well as meta-organismal function. Bioengineered salinity tolerance is also of interest (Wani et al. 2020).

The new assembly of the Z. marina reference genome will further advance studies in the aforementioned areas, as well as comparative analyses of genome structure and evolution, as new reference genomes for representatives of the other three seagrass lineages (i.e., Posidonia oceanica - Posidoniaceace, Cymodocea nodosa - Cymodoceaceae and Thalassia testudinum - Hydrocharitaceae) come online in the near future along with Zostera mueller (Lee et al. 2016) and Zostera japonica (unpublished).

Methods

Sequencing strategy

We used an aliquot of the same DNA that served as the basis for Z. marina v.1.0 genome. We sequenced the Z. marina genome using a whole genome shotgun sequencing strategy and standard sequencing protocols. Sequencing reads were collected using Illumina and PacBio platforms at the HudsonAlpha Institute for Biotechnology in Huntsville, Alabama, USA. Illumina reads were sequenced using the Illumina HiSeq-2500 platform and the PacBio reads were sequenced using the SEQUEL II platform. One 400 bp insert 2×150 Illumina fragment library (162.7× coverage), and one 2×150 Hi-C Illumina library were constructed using Dovetail Hi-C kit and sequenced to 581.1× coverage. Prior to assembly, Illumina fragment reads were screened for PhiX contamination. Reads composed of >95% simple sequence were removed. Illumina reads <50 bp after trimming for adapter and quality (q<20) were removed. The final read set consists of 280,181,449 reads for a total of 156.4× of high-quality Illumina bases. For the PacBio sequencing, a standard PacBio long read library was constructed and a total of 8 PB chemistry 3.0 chips (10 hours movie time) were sequenced on a Sequel 1, a sequence yield of 75.97 Gb, with a total coverage of 189.93x.

Genome assembly and construction of pseudomolecule chromosomes

The current v.3.1 assembly was generated by assembling the 5,615,408 PacBio reads (189.93x sequence coverage) using the MECAT assembler (Xiao et al. 2017) and subsequently polished using ARROW (Chin et al. 2013).

Misjoins in the assembly were identified using Hi-C data as part of the JUICER pipeline (Durand et al. 2016). No misjoins were identified in the polished assembly. The contigs were then oriented, ordered, and joined together using HI-C data using the JUICER pipeline. A total of 89 joins were applied to the assembled contigs to form the final set of six chromosomes. Each chromosome join is padded with 10,000 Ns. Significant telomeric sequence was identified using the (TTAGGG)_n repeat, and care was taken to make sure that it was properly oriented in the production assembly. The remaining scaffolds were screened against bacterial proteins, organelle sequences, GenBank nr and removed if found to be a contaminant.

Finally, homozygous SNPs and INDELs were corrected in the released consensus sequence using 40x of Illumina reads (2x150, 400bp insert) by aligning the reads using bwa mem (Li 2013) and identifying homozygous SNPs and INDELs with the GATK’s UnifiedGenotyper tool (McKenna et al. 2010). A total of 1,876 homozygous SNPs and 64,447 homozygous INDELs were corrected in the release.

Annotation of repetitive elements and noncoding RNAs

RepeatModeler v2.0 was used to build a custom repeat library for the genome assembly of Z. marina v.3.1. Subsequently, RepeatMasker v4.1 was used to discover and classify repeats based on the custom repeat libraries from RepeatModeler v2.0. Transfer RNAs (tRNA) were predicted by tRNAscan-SE v1.31 (Chan and Lowe 2019) with default parameters. We also identified noncoding RNAs, such as microRNAs (miRNAs), small nuclear RNAs (snRNAs) and ribosomal RNAs (rRNAs) by comparing with known noncoding RNA libraries (Rfam v14.2), using the cmscan program of Infernal v1.1.2 (Nawrocki and Eddy 2013). In addition, novel miRNA entries from the Z. marina v.1.0 assembly were aligned to hard-masking Z. marina v.3.1 using SeqMap (Jiang and Wong 2008) with no mismatches. We extracted ~ 110 bp upstream and downstream sequences surrounding every aligned locus and discarded the miRNAs candidates located within protein coding sequences or repetitive elements (“NNNNNNNNNNN”). The stem-loop structure and the minimum free energy (MFE) were predicted for each region using the RNAfold program of the ViennaRNA v 2.1.1 (Lorenz et al. 2011) with default settings. Finally, the results based on Rfam and Z.marina v.1.0 were combined into a non-redundant prediction of miRNAs.

Gene prediction and functional annotation

Genome annotation was performed using a combination of ab initio prediction, homology searches and RNA-aided alignment. Augustus (Stanke, Tzvetkova, and Morgenstern 2006) was used for ab initio gene prediction using model training based on protein structures and RNA-seq data from Z. marina v.1.0 (Olsen et al. 2016). For homology-based predictions, the protein sequences of Z. marina v.1.0 and Oryza sativa were downloaded from PLAZA (https://bioinformatics.psb.ugent.be/plaza/) and aligned to Z. marina v.3.1 using TBLASTN with different e-values (Z. marina v.1.0 with e-value ≤ 1e-10 and O. sativa with e-value ≤ 1e-5). Next, regions were mapped by these query sequences to define gene models using Exonerate (Slater and Birney 2005). For RNA-aided annotation, we downloaded 23 libraries of Z. marina v.1.0 from NCBI (BioProject PRJNA280336). Firstly, we joined the paired-end reads using clc_assembly_cell to generate almost 2/3 of joined reads and 1/3 of un-joined reads. Then, we aligned the joined and un-joined RNA-seq data to Z. marina v.3.1 using HISAT2 (Kim, Langmead, and Salzberg 2015) with the parameters “--max-intronlen 50000” and assembled into potential transcripts using StringTie v2.1.1 (Kovaka et al. 2019). Further, TransDecoder v5.0.2 was used to predict open reading frames (ORFs) within the assembled transcripts. Finally, gene models obtained from all three approaches were integrated as the final non-redundant gene set using EVidenceModeler (v.1.1.1) (Haas et al. 2008). Specifically, a set of 1,460 bad gene models identified by the wgd software (Zwaenepoel and Van de Peer 2019) was manually curated using the genome browser GenomeView on the ORCAE platform (https://bioinformatics.psb.ugent.be/orcae/) and the gene annotation results were evaluated by BUSCO hits. Putative gene function was determined using InterProScan (Jones et al. 2014) with the different databases, including PFAM, Gene3D, PANTHER, CDD, SUPERFAMILY, ProSite, and GO. Meanwhile, functional annotation of predicted genes was also obtained by aligning the protein sequences against the sequences in public protein databases (Uniprot/SwissProt database) using BLASTP with an e-value cut-off of 1×10-5.

Results and discussion

Genome size and assembly

A single genotype (or clone) of Z. marina from the northern Baltic Sea, Finnish Archipelago Sea had been subjected to whole-genome assembly using Sanger and Illumina sequencing (referred to as Z. marina v.1.0) (Olsen et al. 2016). Since PacBio technology can deliver longer reads, necessary to improve assembly contiguity and obtain a nearly complete, reference genome, we re-sequenced the inbred, Finnish clone, leading to the final v.3.1 release, which contains 260.5 Mb of sequence, consists of 432 contigs with a contig N50 of 7.0 Mb and a total of 87.6% of assembled bases into six pseudo-chromosomes (2n = 12). Interestingly, during the assembly of the genome using Hi-C, it was noted that there was a seventh “chromosome” (scaffold_7 in this release) with a length of 8.68Mb, consisting of mainly repetitive DNA and a possible Nucleolus Organizing Region (NOR). Completeness of the euchromatic portion of v.3.1 assembly was assessed using 20,450 annotated genes from Z. marina v.1.0. The screened alignments indicate that 20,342 (99.47%) of the previously annotated version 1.0 genes aligned to the v.3.1 release. Of the remaining genes, 50 (0.24%) aligned at <50% coverage, while 58 (0.29%) were not found in the v.1.0 release. This shows a high degree of consistency between the two versions. However, version 3.1 presents much higher contiguity and fewer gaps compared to the previously published Z. marina v.1.0 (Table 1).

Table 1. Comparison of genome assemblies for Zostera marina v1.0 (Olsen et al., 2016) and v3.1 (current study).

Parameters	Zostera marina v1.0	Zostera marina v3.1
Scaffolds length (Mb)	203	260.5
Scaffolds number	2,228	310
Scaffolds N50 (Mb)	0.486	34.6
Contigs length (Mb)	191	259.3
Contigs number	12,583	432
Contigs N50 (Mb)	0.08	7
Largest length (bp)	2,654,544	42,612,672

Repetitive elements and noncoding RNAs

We used ab initio approaches to identify and annotate repetitive sequences, which accounted for 67.12 % of Z. marina v.3.1. 41.72 % of these TEs were long terminal repeat (LTR) elements (Table 2). Screening the Z. marina v.3.1 genome against the Rfam v14.2 database using Infernal identified 546 tRNAs, 376 rRNAs, 93 miRNAs, and 134 snRNAs (Table 3). In addition to the 93 known conserved miRNAs identified from Rfam v14.2, we also identified 23 novel miRNAs candidates compared to 19 novel miRNAs candidates in Z. marina v.1.0, resulting in a total of 116 miRNAs (Table 4).

Table 2. Annotation of repeat elements.

		Zostera marina v.3.1
Repeat types		Length (bp)	P (%)
DNA		14,881,010	5.71
LINE		10,538,203	4.05
SINE		228,565	0.09
LTR	Gypsy	83,650,636	32.11
	Copia	24,717,775	9.49
	Pao	309,173	0.12
Other		4,041,971	1.55
Unknown		36,466,934	14.00
Total		174,834,267	67.12

Table 3. Known miRNAs identified in the genome of Zostera marina (v1.0 and v3.1).

	Number of loci
miRNA family	Zostera marina v1.0	Zostera marina v3.1
miR1388	0	11
miR156	5	7
miR159	1	4
miR160	3	3
miR164	3	4
miR166	4	5
miR167	2	2
miR168	1	1
miR169	2	6
miR171	3	13
miR172	1	3
miR390	2	2
miR393	2	2
miR396	2	4
miR398	0	1
miR399	2	5
miR528	1	5
miR4414	1	8
miR5658	1	7
Total	36	93

Table 4. Novel miRNA candidates in Zostera marina (v1.0 and v3.1).

	Number of loci
Novel miRNA family	Zostera marina v1.0	Zostera marina v3.1
zmr-miR001	2	2
zmr-miR002	1	0
zmr-miR003	1	1
zmr-miR004	1	1
zmr-miR005	1	1
zmr-miR006	1	3
zmr-miR007	1	1
zmr-miR008	1	0
zmr-miR009	1	3
zmr-miR010	1	1
zmr-miR011	1	1
zmr-miR012	1	1
zmr-miR013	3	7
zmr-miR014	1	1
zmr-miR015	2	0
Total	19	23

Protein-coding genes

Through a combination of ab initio prediction, homology searches and RNA-aided evidence, we annotated 21,533 gene models after masking repeat elements. After manually checking most gene models and improving 1395 incorrect gene models on the ORCAE platform and adding 30 new genes based on RNA-seq evidence, the final annotation produced 21,483 gene models, 91.8% of which (19,739 genes) are supported by transcriptome data from leaves, roots and flowers. On average, protein-coding genes in Z. marina v.3.1 are 3,300 bp long and contain 4.99 exons, values that are very similar to those of Z. marina v.1.0. Notably, intron lengths greatly improved after manual curation (Table 5). BUSCO assessment of the current gene set shows that the current annotation includes 95.7% complete genes in the embryophyte database10. 93.2% of the BUSCO genes were single copy while 2.5% of these BUSCO genes were found in duplicate. 0.5% of the BUSCO genes were fragmented and 3.8% was missing, which could be due to some specific pathways missing in Z. marina, compared to land plants (Olsen et al., 2016) (Table 6). BUSCO assessment in Z. marina v.3.1 shows more complete genes and fewer fragmented genes than for Z. marina v.1.0 (Figure 1). 20,665 genes (96.2%) obtained at least one functional assignment based on similarity to known proteins in the databases. Pfam domain information could be added to 15,716 (73.2%) predicted genes, and 12,406 (57.7%) predicted genes could be assigned a GO term (Table 7).

Figure 1. Busco assessment results for Z. marina genomes v1.0 and v3.1.

Table 5. Genome annotation statistics.

Statistics	Zosma v1.0	Zosma v3.1	Zosma v3.1_before_curation
Protein coding genes	20,450	21,483	21,533
Mean gene length, bp	3,301	3,300	3,202
Mean CDS length, bp	1,177	1,225	1,207
Mean exon per gene	5.20	5.00	4.9
Mean exon length, bp	226	245	245
Mean intron length, bp	443	710	510

Table 6. BUSCO completeness assessment of protein coding sequences in Z. marina version 3.1.

Total number of BUSCO core genes queried	1614
Number of core genes detected
Complete	1544 (95.7%)
Complete + Partial	1552 (96.2%)
Number of missing core genes	62 (3.8%)
Scores in BUSCO format*	C:95.7% [S:93.2%, D:2.5%], F:0.5%, M:3.8%, n:1614

Table 7. Functional annotation.

Database	Count	Percentage
InterPro	17,073	79.5
PFAM	15,716	73.2
Gene3D	12,597	58.6
PANTHER	18,220	84.8
CDD	6,818	31.7
SUPERFAMILY	12,078	56.2
ProSite	8,969	41.7
GO	12,406	57.7
BLASTP	18,243	84.9
Swiss-Prot	14,374	66.9
Total annotation	20,665	96.2
Total unigene	21,483	100

Conclusions

Here, we report a high-quality, chromosome-scale genome assembly of Z. marina using a combination of single-molecule real-time sequencing and Hi-C scaffolding. Although a draft genome sequence for Z. marina has been available for more than five years (Olsen et al. 2016), a chromosome-scale assembly and well-annotated reference genome is an important step to further advance our understanding with respect to its metabolism, evolution and adaptation. As we expected, there is a discrepancy in genome size between an Illumina-derived assembly and a PACBIO long read-derived assembly, which is mainly due to the more accurate coverage of repetitive content. Nevertheless, the genome size of 259 Mb still characterizes Z. marina as relatively compact monocot genome, and it is also the smallest genome among the seagrasses where genome size estimates exist (JGI pilot analyses). Also, telomere and centromere regions are generally captured more fully. As a first high quality, reference genome of an important marine angiosperm, v.3.1 of the genome of Z. marina will provide a great resource for further comparative and evolutionary studies.

Data availability

Underlying data

NCBI BioProject: Zostera marina Genome sequencing and assembly. Accession number: PRJNA701932; https://identifiers.org/ncbi/bioproject:PRJNA701932.

The genome assembly and annotation for Z. marina v.3.1 is also available from https://data.jgi.doe.gov/refine-download/phytozome?organism=Zmarina and through the ORCAE platform (https://bioinformatics.psb.ugent.be/gdb/zostera/).

Acknowledgements

We thank Shengqiang Shu from the Joint Genome Institute for validating the annotation.

References

Arnaud-Haond S, Duarte CM, Diaz-Almela E, et al.: Implications of extreme life span in clonal organisms: millenary clones in meadows of the threatened seagrass Posidonia oceanica. PLoS One. 2012; 7: e30454. PubMed Abstract | Publisher Full Text | Free Full Text
Brakel J, Werner FJ, Tams V, et al.: Current European Labyrinthula zosterae are not virulent and modulate seagrass (Zostera marina) defense gene expression. PLoS One. 2014; 9: e92448. PubMed Abstract | Publisher Full Text | Free Full Text
Carroll SP, Hendry AP, Reznick DN, et al.: Evolution on ecological time-scales. Functional Ecology. 2007; 21: 387–393. Publisher Full Text
Chan PP, Lowe TM: tRNAscan-SE: Searching for tRNA Genes in Genomic Sequences. Methods Mol Biol. 2019; 1962: 1–14. PubMed Abstract | Publisher Full Text | Free Full Text
Chase MW, Christenhusz MJM, Fay MF, et al.: An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG IV Botanical Journal of the Linnean Society. 2016; 181: 1–20. Publisher Full Text
Chin CS, Alexander DH, Marks P, et al.: Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat Methods. 2013; 10: 563–9. PubMed Abstract | Publisher Full Text
Costanza R, d'Arge A, de Groot R, et al.: The value of the world’s ecosystem services and natural capital. Nature. 1997; 387: 253–60.
Cucio C, Engelen AH, Costa R, et al.: Rhizosphere Microbiomes of European + Seagrasses Are Selected by the Plant, But Are Not Species Specific. Front Microbiol. 2016; 7: 440. PubMed Abstract | Publisher Full Text | Free Full Text
Du Z-Y, Wang Q-F: Phylogenetic tree of vascular plants reveals the origins of aquatic angiosperms. Journal of Systematics and Evolution. 2016; 54: 342–48. Publisher Full Text
Duarte B, Martins I, Rosa R, et al.: Climate Change Impacts on Seagrass Meadows and Macroalgal Forests: An Integrative Perspective on Acclimation and Adaptation Potential. Front Mar Sci. 2018; 5. Publisher Full Text
DuBois K, Williams SL, Stachowicz JJ: Previous exposure mediates the response of eelgrass to future warming via clonal transgenerational plasticity. Ecology. 2020; 101: e03169. PubMed Abstract | Publisher Full Text
Duffy JE, Reynolds PL, Bostrom C, et al.: Biodiversity mediates top-down control in eelgrass ecosystems: a global comparative-experimental approach. Ecol Lett. 2015; 18: 696–705.
Durand NC, Shamim MS, Machol I, et al.: Juicer Provides a One-Click System for Analyzing Loop-Resolution Hi-C Experiments. Cell Syst. 2016; 3: 95–8. PubMed Abstract | Publisher Full Text | Free Full Text
Eisen JA, Stachowicz JJ, Olsen JL, et al.: Population and evolutionary genomics of host-microbiome interactions in Zostera marina and other seagrasses. In: Berkeley, CA.: Community Sequencing Program, DOE-Joint Genome Institut; 2017.
Ettinger CL, Eisen JA: Characterization of the Mycobiome of the Seagrass, Zostera marina, Reveals Putative Associations With Marine Chytrids. Front Microbiol. 2019; 10: 2476. PubMed Abstract | Publisher Full Text | Free Full Text
Ettinger CL, Eisen JA: Fungi, bacteria and oomycota opportunistically isolated from the seagrass, Zostera marina. PLoS One. 2020; 15: e0236135. PubMed Abstract | Publisher Full Text | Free Full Text
Ettinger CL, Voerman SE, Lang JM, et al.: Microbial communities in sediment from Zostera marina patches, but not the Z. marina leaf or root microbiomes, vary in relation to distance from patch edge. PeerJ. 2017a; 5: e3246. PubMed Abstract | Publisher Full Text | Free Full Text
Ettinger CL, Williams SL, Abbott JM, et al.: Microbiome succession during ammonification in eelgrass bed sediments. PeerJ. 2017b; 5: e3674. PubMed Abstract | Publisher Full Text | Free Full Text
Ettinger CL, Vann LE, Eisen JA: Global diversity and biogeography of the Zostera marina mycobiome.2020. Publisher Full Text
Fahimipour AK, Kardish MR, Lang JM, et al.: Global-Scale Structure of the Eelgrass Microbiome. Appl Environ Microbiol. 2017; 83. PubMed Abstract | Publisher Full Text | Free Full Text
Fourqurean JW, Duarte CM, Kennedy H, et al.: Seagrass ecosystems as a globally significant carbon stock. Nature Geoscience. 2012; 5: 505–09. Publisher Full Text
Franssen SU, Gu J, Bergmann N, et al.: Transcriptomic resilience to global warming in the seagrass Zostera marina, a marine foundation species. Proc Natl Acad Sci U S A. 2011; 108: 19276–81. PubMed Abstract | Publisher Full Text | Free Full Text
Green PE, Short FT, Frederick T: World atlas of seagrasses (Univ of California Press).2003.
Guan C, Saha M, Weinberger F: Simulated Heatwaves Lead to Upregulated Chemical Defense of a Marine Foundation Macrophyte Against Microbial Colonizers. Front Mar Sci. 2020: 7. Publisher Full Text
Haas BJ, Salzberg SL, Zhu W, et al.: Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biol. 2008; 9: R7. PubMed Abstract | Publisher Full Text | Free Full Text
Hillebrand H, Jacob U, Leslie HM: Integrative research perspectives on marine conservation. Philos Trans R Soc Lond B Biol Sci. 2020; 375: 20190444. PubMed Abstract | Publisher Full Text | Free Full Text
Jiang H, Wong WH: SeqMap: mapping massive amount of oligonucleotides to the genome. Bioinformatics. 2008; 24: 2395–6. PubMed Abstract | Publisher Full Text | Free Full Text
Jones P, Binns D, Chang HY, et al.: InterProScan 5: genome-scale protein function classification. Bioinformatics. 2014; 30: 1236–40. PubMed Abstract | Publisher Full Text | Free Full Text
Jueterbock A, Bostrom C, Coyer JA, et al.: The Seagrass Methylome Is Associated With Variation in Photosynthetic Performance Among Clonal Shoots. Front Plant Sci. 2020; 11: 571646. PubMed Abstract | Publisher Full Text | Free Full Text
Jueterbock A, Franssen SU, Bergmann N, et al.: Phylogeographic differentiation versus transcriptomic adaptation to warm temperatures in Zostera marina, a globally important seagrass. Mol Ecol. 2016; 25: 5396–411. PubMed Abstract | Publisher Full Text
Kim D, Langmead B, Salzberg SL: HISAT: a fast spliced aligner with low memory requirements. Nat Methods. 2015; 12: 357–60. PubMed Abstract | Publisher Full Text | Free Full Text
Kovaka S, Zimin AV, Pertea GM, et al.: Transcriptome assembly from long-read RNA-seq alignments with StringTie2. Genome Biol. 2019; 20: 278. PubMed Abstract | Publisher Full Text | Free Full Text
Larkum AW, Orth RJ, Duarte CM: Seagrasses (Springer). 2006.
Lee H, Golicz AA, Bayer PE, et al.: The Genome of a Southern Hemisphere Seagrass Species (Zostera muelleri). Plant Physiol. 2016; 172: 272–83. PubMed Abstract | Publisher Full Text | Free Full Text
Les DH, Cleland MA, Waycott M: Phylogenetic Studies in Alismatidae, II: Evolution of Marine Angiosperms (Seagrasses) and Hydrophily. Systematic Botany. 1997; 22. Publisher Full Text
Li H: Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM.2013.
Lorenz R, Bernhart SH, Honer Zu Siederdissen C, et al.: ViennaRNA Package 2.0. Algorithms Mol Biol. 2011; 6: 26. PubMed Abstract | Publisher Full Text | Free Full Text
McKenna A, Hanna M, Banks E, et al.: The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010; 20: 1297–303. PubMed Abstract | Publisher Full Text | Free Full Text
Nawrocki EP, Eddy SR: Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics. 2013; 29: 2933–5. PubMed Abstract | Publisher Full Text | Free Full Text
Olsen JL, Rouze P, Verhelst B, et al.: The genome of the seagrass Zostera marina reveals angiosperm adaptation to the sea. Nature. 2016; 530: 331–5. PubMed Abstract | Publisher Full Text
Olsen JL, Stam WT, Coyer JA, et al.: North Atlantic phylogeography and large-scale population differentiation of the seagrass Zostera marina L. Mol Ecol. 2004; 13: 1923–41. PubMed Abstract | Publisher Full Text
Shivaraj SM, Deshmukh R, Bhat JA, et al.: Understanding Aquaporin Transport System in Eelgrass (Zostera marina L.), an Aquatic Plant Species. Front Plant Sci. 2017; 8: 1334. PubMed Abstract | Publisher Full Text | Free Full Text
Slater GS, Birney E: Automated generation of heuristics for biological sequence comparison. BMC Bioinformatics. 2005; 6: 31. PubMed Abstract | Publisher Full Text | Free Full Text
Stanke M, Tzvetkova A, Morgenstern B: AUGUSTUS at EGASP: using EST, protein and genomic alignments for improved gene prediction in the human genome. Genome Biol. 2006; 7Suppl 1: S11 1–8. PubMed Abstract | Publisher Full Text | Free Full Text
Thorsten BHR, Reusch H, Christoffer B, et al.: An ancient eelgrass clone in the Baltic. Mar Ecol Progress Series. 1999; 183: 301–04. Publisher Full Text
van Katwijk MM, Thorhaug A, Marbà N, et al.: Global analysis of seagrass restoration: the importance of large-scale planting. J Appl. Ecol. 2016; 53: 567–78. Publisher Full Text
Wang L, Tomas F, Mueller RS: Nutrient enrichment increases size of Zostera marina shoots and enriches for sulfur and nitrogen cycling bacteria in root-associated microbiomes. FEMS Microbiol Ecol. 2020; 96. PubMed Abstract | Publisher Full Text
Wani SH, Kumar V, Khare T, et al.: Engineering salinity tolerance in plants: progress and prospects. Planta. 2020; 251: 76. PubMed Abstract | Publisher Full Text
Waycott M, Duarte CM, Carruthers TJ, et al.: Accelerating loss of seagrasses across the globe threatens coastal ecosystems. Proc Natl Acad Sci U S A. 2009; 106: 12377–81. PubMed Abstract | Publisher Full Text | Free Full Text
Xiao CL, Chen Y, Xie SQ, et al.: MECAT: fast mapping, error correction, and de novo assembly for single-molecule sequencing reads. Nat Methods. 2017; 14: 1072–74. PubMed Abstract | Publisher Full Text
Yu L, Boström C, Franzenburg S, et al.: Somatic genetic drift and multilevel selection in a clonal seagrass. Nat Ecol Evol. 2020; 4: 952–962. PubMed Abstract | Publisher Full Text
Zang Y, Chen J, Li R, et al.: Genome-wide analysis of the superoxide dismutase (SOD) gene family in Zostera marina and expression profile analysis under temperature stress. PeerJ. 2020; 8: e9063. PubMed Abstract | Publisher Full Text | Free Full Text
Zimmerman RC: Scaling up: Predicting the Impacts of Climate Change on Seagrass Ecosystems. Estuaries and Coasts. 2020. Publisher Full Text
Zwaenepoel A, Van de Peer Y: wgd-simple command line tools for the analysis of ancient whole-genome duplications. Bioinformatics. 2019; 35: 2153–55. PubMed Abstract | Publisher Full Text | Free Full Text

Comments on this article Comments (0)

Version 1

VERSION 1 PUBLISHED 15 Apr 2021

Author details Author details

¹ Department of Plant Biotechnology and Bioinformatics, Ghent University - VIB, Ghent, 9052, Belgium
² Groningen Institute of Evolutionary Life Sciences, Groningen, 9747 AG, The Netherlands
³ GEOMAR Helmholtz Centre for Ocean Research Kiel, Marine Evolutionary Ecology, Kiel, 24105, Germany
⁴ Department of Integrative Marine Ecology, Stazione Zoologica Anton Dohrn, Napoli, 80123, Italy
⁵ Department of Energy Joint Genome Institute, Lawrence Berkeley National Lab, Berkeley, CA, USA
⁶ HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
⁷ Arizona Genomics Institute, School of Plant Sciences, University of Arizona Tucson, Tucson, AZ, 85721, USA
⁸ Department of Biochemistry, Genetics and Microbiology, University of Pretoria, Pretoria, South Africa
⁹ College of Horticulture, Nanjing Agricultural University, Nanjing, 210014, China

Xiao Ma
Roles: Data Curation, Formal Analysis, Methodology, Software, Visualization

Jeanine L. Olsen
Roles: Conceptualization, Funding Acquisition, Project Administration, Resources, Writing – Original Draft Preparation, Writing – Review & Editing

Thorsten B.H. Reusch
Roles: Conceptualization, Funding Acquisition, Resources

Gabriele Procaccini
Roles: Conceptualization, Resources

Dave Kudrna
Roles: Resources

Melissa Williams
Roles: Resources

Jane Grimwood
Roles: Project Administration, Resources, Software, Validation

Shanmugam Rajasekar
Roles: Resources

Jerry Jenkins
Roles: Data Curation, Methodology, Resources, Software, Writing – Original Draft Preparation

Jeremy Schmutz
Roles: Data Curation, Methodology, Resources, Software

Yves Van de Peer
Roles: Conceptualization, Funding Acquisition, Project Administration, Resources, Supervision, Visualization, Writing – Review & Editing

Competing interests

No competing interests were disclosed.

Grant information

The work conducted by the U.S. Department of Energy Joint Genome Institute was supported by the Office of Science of the U.S. Department of Energy under Contract No DE-AC02-05CH11231 to Lawrence Berkeley National Laboratory.

This work was supported by the DOE-Joint Genome Institute, Berkeley, CA, USA, Community Sequencing Program 2019, Grant Nr. 504341-Marine Angiosperm Genomes Initiative (MAGI) to YVdP, JLO, TBHR and GP.

Article Versions (1)

version 1

Published: 15 Apr 2021, 10:289

https://doi.org/10.12688/f1000research.38156.1

Copyright

© 2021 Ma X et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Download

Export To

metrics

	Views	Downloads
F1000Research	-	-
PubMed Central Data from PMC are received and updated monthly.	-	-

Citations

0

SEE MORE DETAILS

CITE

how to cite this article

Ma X, Olsen JL, Reusch TBH et al. Improved chromosome-level genome assembly and annotation of the seagrass, Zostera marina (eelgrass) [version 1; peer review: 2 approved] F1000Research 2021, 10:289 (https://doi.org/10.12688/f1000research.38156.1)

NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.

track

receive updates on this article

Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?

Key to Reviewer Statuses VIEW HIDE

ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested

Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.

Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions

Version 1

VERSION 1

PUBLISHED 15 Apr 2021

Views

10

Reviewer Report 29 Sep 2021

Vratislav Peska, Department of Cell Biology and Radiobiology, The Czech Academy of Sciences, Institute of Biophysics, Brno, Czech Republic

Approved

https://doi.org/10.5256/f1000research.41219.r89904

The manuscript "Improved chromosome-level genome assembly and annotation of the seagrass, Zostera marina (eelgrass)" by Ma, Olsen, Reusch et al. is a well-written article about a unique genome of marine angiosperm plant Z. marina. The results convincingly present improving the ... Continue reading

The manuscript "Improved chromosome-level genome assembly and annotation of the seagrass, Zostera marina (eelgrass)" by Ma, Olsen, Reusch et al. is a well-written article about a unique genome of marine angiosperm plant Z. marina. The results convincingly present improving the genome assembly using PacBio and Hi-C strategies. The improvement is obvious, among others, from an increased number of coding genes by 1000 in comparison with the first assembly and almost removing fragmented hits in the BUSCO analysis results.

I must say that I do not have any serious concerns about the methodology, results, and conclusions. The release of an improved genome is a great opportunity of the research community to have a powerful tool in genome-dependent studies of this extraordinary plant.

My minor notes:

In the text, "come online in the near future along with Zostera mueller" probably should be "muelleri".
The sentence, "The remaining scaffolds were screened against bacterial proteins, organelle sequences, GenBank nr and removed if found to be a contaminant." would need rewording to be clear what "GenBank nr" means. Does it mean anything that is not obviously of Zostera species origin?
I suggest using the "BWA-MEM algorithm" instead of "bwa mem".
I do not understand what the (“NNNNNNNNNNN”) stands for.
I think a short explanation of the assembly numbering would be helpful for readers. Was the Zosma2.0 published and is available somewhere? The JGI link shows v2.2 and v3.1. Actually v3.1 contains data named Zmarina_668_v2.0.fa.gz. The ugent link shows only V2. In summary, the numbering is confusing and a bit more intuitive naming would be nice.
In Peska et al., (2020)¹, we showed there are two telomerase RNA genes in Z. marina. I wonder if assembly v3.1 confirms them as a set of separate genes or two alleles of the same gene? We also showed decompacted mitotic chromosome at the rDNA locus which might be a fragile site. Does it correspond to the observed "extra" pseudochromosome in the v3.1?

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Yes
If applicable, is the statistical analysis and its interpretation appropriate?

Yes
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Yes

References

1. Peska V, Mátl M, Mandáková T, Vitales D, et al.: Human-like telomeres in Zostera marina reveal a mode of transition from the plant to the human telomeric sequences. Journal of Experimental Botany. 2020; 71 (19): 5786-5793 Publisher Full Text

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Telomere biology, genomics, sequencing

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

CITE

Report a concern

Respond or Comment

Views

17

Reviewer Report 16 Jun 2021

Jarkko Salojärvi, School of Biological Sciences, Nanyang Technological University, Singapore, Singapore

Approved

https://doi.org/10.5256/f1000research.41219.r83351

The authors present a new assembly of eelgrass based on long read sequencing and Hi-C chromosome conformation capture technology. Altogether this is a well-prepared improvement on the existing genome assembly. In addition to improving the contiguity, also the manual curation ... Continue reading

The authors present a new assembly of eelgrass based on long read sequencing and Hi-C chromosome conformation capture technology. Altogether this is a well-prepared improvement on the existing genome assembly. In addition to improving the contiguity, also the manual curation of 1,460 gene models is very much welcome.

For comparison purposes the authors could still produce a dot plot of the whole genome alignments of the old vs new version.

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Yes
If applicable, is the statistical analysis and its interpretation appropriate?

Not applicable
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Yes

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Genomics, population genetics, bioinformatics

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

CITE

Report a concern

Respond or Comment

Comments on this article Comments (0)

Version 1

VERSION 1 PUBLISHED 15 Apr 2021

Open Peer Review

Reviewer Status

Reviewer Reports

	Invited Reviewers
	1	2
Version 1 15 Apr 21	read	read

Jarkko Salojärvi, Nanyang Technological University, Singapore, Singapore
Vratislav Peska, The Czech Academy of Sciences, Institute of Biophysics, Brno, Czech Republic

Comments on this article

All Comments(0)

Add a comment

Sign up for content alerts

Browse by related subjects

Back to all reports

Reviewer Report

10 Views

29 Sep 2021 | for Version 1

Vratislav Peska, Department of Cell Biology and Radiobiology, The Czech Academy of Sciences, Institute of Biophysics, Brno, Czech Republic

10 Views Cite this report Responses(0)

Approved

The manuscript "Improved chromosome-level genome assembly and annotation of the seagrass, Zostera marina (eelgrass)" by Ma, Olsen, Reusch et al. is a well-written article about a unique genome of marine angiosperm plant Z. marina. The results convincingly present improving the genome assembly using PacBio and Hi-C strategies. The improvement is obvious, among others, from an increased number of coding genes by 1000 in comparison with the first assembly and almost removing fragmented hits in the BUSCO analysis results.

I must say that I do not have any serious concerns about the methodology, results, and conclusions. The release of an improved genome is a great opportunity of the research community to have a powerful tool in genome-dependent studies of this extraordinary plant.

My minor notes:

In the text, "come online in the near future along with Zostera mueller" probably should be "muelleri".
The sentence, "The remaining scaffolds were screened against bacterial proteins, organelle sequences, GenBank nr and removed if found to be a contaminant." would need rewording to be clear what "GenBank nr" means. Does it mean anything that is not obviously of Zostera species origin?
I suggest using the "BWA-MEM algorithm" instead of "bwa mem".
I do not understand what the (“NNNNNNNNNNN”) stands for.
I think a short explanation of the assembly numbering would be helpful for readers. Was the Zosma2.0 published and is available somewhere? The JGI link shows v2.2 and v3.1. Actually v3.1 contains data named Zmarina_668_v2.0.fa.gz. The ugent link shows only V2. In summary, the numbering is confusing and a bit more intuitive naming would be nice.
In Peska et al., (2020)¹, we showed there are two telomerase RNA genes in Z. marina. I wonder if assembly v3.1 confirms them as a set of separate genes or two alleles of the same gene? We also showed decompacted mitotic chromosome at the rDNA locus which might be a fragile site. Does it correspond to the observed "extra" pseudochromosome in the v3.1?

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Yes
If applicable, is the statistical analysis and its interpretation appropriate?

Yes
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Yes

References

1. Peska V, Mátl M, Mandáková T, Vitales D, et al.: Human-like telomeres in Zostera marina reveal a mode of transition from the plant to the human telomeric sequences. Journal of Experimental Botany. 2020; 71 (19): 5786-5793 Publisher Full Text

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Telomere biology, genomics, sequencing

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (0)

Back to all reports

Reviewer Report

17 Views

16 Jun 2021 | for Version 1

Jarkko Salojärvi, School of Biological Sciences, Nanyang Technological University, Singapore, Singapore

17 Views Cite this report Responses(0)

Approved

The authors present a new assembly of eelgrass based on long read sequencing and Hi-C chromosome conformation capture technology. Altogether this is a well-prepared improvement on the existing genome assembly. In addition to improving the contiguity, also the manual curation of 1,460 gene models is very much welcome.

For comparison purposes the authors could still produce a dot plot of the whole genome alignments of the old vs new version.

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Yes
If applicable, is the statistical analysis and its interpretation appropriate?

Not applicable
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Yes

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Genomics, population genetics, bioinformatics

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (0)

[1] Arnaud-Haond S, Duarte CM, Diaz-Almela E, et al.: Implications of extreme life span in clonal organisms: millenary clones in meadows of the threatened seagrass Posidonia oceanica. PLoS One. 2012; 7: e30454. PubMed Abstract | Publisher Full Text | Free Full Text

[2] Brakel J, Werner FJ, Tams V, et al.: Current European Labyrinthula zosterae are not virulent and modulate seagrass (Zostera marina) defense gene expression. PLoS One. 2014; 9: e92448. PubMed Abstract | Publisher Full Text | Free Full Text

[3] Carroll SP, Hendry AP, Reznick DN, et al.: Evolution on ecological time-scales. Functional Ecology. 2007; 21: 387–393. Publisher Full Text

[4] Chan PP, Lowe TM: tRNAscan-SE: Searching for tRNA Genes in Genomic Sequences. Methods Mol Biol. 2019; 1962: 1–14. PubMed Abstract | Publisher Full Text | Free Full Text

[5] Chase MW, Christenhusz MJM, Fay MF, et al.: An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG IV Botanical Journal of the Linnean Society. 2016; 181: 1–20. Publisher Full Text

[6] Chin CS, Alexander DH, Marks P, et al.: Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat Methods. 2013; 10: 563–9. PubMed Abstract | Publisher Full Text

[7] Costanza R, d'Arge A, de Groot R, et al.: The value of the world’s ecosystem services and natural capital. Nature. 1997; 387: 253–60.

[8] Cucio C, Engelen AH, Costa R, et al.: Rhizosphere Microbiomes of European + Seagrasses Are Selected by the Plant, But Are Not Species Specific. Front Microbiol. 2016; 7: 440. PubMed Abstract | Publisher Full Text | Free Full Text

[9] Du Z-Y, Wang Q-F: Phylogenetic tree of vascular plants reveals the origins of aquatic angiosperms. Journal of Systematics and Evolution. 2016; 54: 342–48. Publisher Full Text

[10] Duarte B, Martins I, Rosa R, et al.: Climate Change Impacts on Seagrass Meadows and Macroalgal Forests: An Integrative Perspective on Acclimation and Adaptation Potential. Front Mar Sci. 2018; 5. Publisher Full Text

[11] DuBois K, Williams SL, Stachowicz JJ: Previous exposure mediates the response of eelgrass to future warming via clonal transgenerational plasticity. Ecology. 2020; 101: e03169. PubMed Abstract | Publisher Full Text

[12] Duffy JE, Reynolds PL, Bostrom C, et al.: Biodiversity mediates top-down control in eelgrass ecosystems: a global comparative-experimental approach. Ecol Lett. 2015; 18: 696–705.

[13] Durand NC, Shamim MS, Machol I, et al.: Juicer Provides a One-Click System for Analyzing Loop-Resolution Hi-C Experiments. Cell Syst. 2016; 3: 95–8. PubMed Abstract | Publisher Full Text | Free Full Text

[14] Eisen JA, Stachowicz JJ, Olsen JL, et al.: Population and evolutionary genomics of host-microbiome interactions in Zostera marina and other seagrasses. In: Berkeley, CA.: Community Sequencing Program, DOE-Joint Genome Institut; 2017.

[15] Ettinger CL, Eisen JA: Characterization of the Mycobiome of the Seagrass, Zostera marina, Reveals Putative Associations With Marine Chytrids. Front Microbiol. 2019; 10: 2476. PubMed Abstract | Publisher Full Text | Free Full Text

[16] Ettinger CL, Eisen JA: Fungi, bacteria and oomycota opportunistically isolated from the seagrass, Zostera marina. PLoS One. 2020; 15: e0236135. PubMed Abstract | Publisher Full Text | Free Full Text

[17] Ettinger CL, Voerman SE, Lang JM, et al.: Microbial communities in sediment from Zostera marina patches, but not the Z. marina leaf or root microbiomes, vary in relation to distance from patch edge. PeerJ. 2017a; 5: e3246. PubMed Abstract | Publisher Full Text | Free Full Text

[18] Ettinger CL, Williams SL, Abbott JM, et al.: Microbiome succession during ammonification in eelgrass bed sediments. PeerJ. 2017b; 5: e3674. PubMed Abstract | Publisher Full Text | Free Full Text

[19] Ettinger CL, Vann LE, Eisen JA: Global diversity and biogeography of the Zostera marina mycobiome.2020. Publisher Full Text

[20] Fahimipour AK, Kardish MR, Lang JM, et al.: Global-Scale Structure of the Eelgrass Microbiome. Appl Environ Microbiol. 2017; 83. PubMed Abstract | Publisher Full Text | Free Full Text

[21] Fourqurean JW, Duarte CM, Kennedy H, et al.: Seagrass ecosystems as a globally significant carbon stock. Nature Geoscience. 2012; 5: 505–09. Publisher Full Text

[22] Franssen SU, Gu J, Bergmann N, et al.: Transcriptomic resilience to global warming in the seagrass Zostera marina, a marine foundation species. Proc Natl Acad Sci U S A. 2011; 108: 19276–81. PubMed Abstract | Publisher Full Text | Free Full Text

[23] Green PE, Short FT, Frederick T: World atlas of seagrasses (Univ of California Press).2003.

[24] Guan C, Saha M, Weinberger F: Simulated Heatwaves Lead to Upregulated Chemical Defense of a Marine Foundation Macrophyte Against Microbial Colonizers. Front Mar Sci. 2020: 7. Publisher Full Text

[25] Haas BJ, Salzberg SL, Zhu W, et al.: Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biol. 2008; 9: R7. PubMed Abstract | Publisher Full Text | Free Full Text

[26] Hillebrand H, Jacob U, Leslie HM: Integrative research perspectives on marine conservation. Philos Trans R Soc Lond B Biol Sci. 2020; 375: 20190444. PubMed Abstract | Publisher Full Text | Free Full Text

[27] Jiang H, Wong WH: SeqMap: mapping massive amount of oligonucleotides to the genome. Bioinformatics. 2008; 24: 2395–6. PubMed Abstract | Publisher Full Text | Free Full Text

[28] Jones P, Binns D, Chang HY, et al.: InterProScan 5: genome-scale protein function classification. Bioinformatics. 2014; 30: 1236–40. PubMed Abstract | Publisher Full Text | Free Full Text

[29] Jueterbock A, Bostrom C, Coyer JA, et al.: The Seagrass Methylome Is Associated With Variation in Photosynthetic Performance Among Clonal Shoots. Front Plant Sci. 2020; 11: 571646. PubMed Abstract | Publisher Full Text | Free Full Text

[30] Jueterbock A, Franssen SU, Bergmann N, et al.: Phylogeographic differentiation versus transcriptomic adaptation to warm temperatures in Zostera marina, a globally important seagrass. Mol Ecol. 2016; 25: 5396–411. PubMed Abstract | Publisher Full Text

[31] Kim D, Langmead B, Salzberg SL: HISAT: a fast spliced aligner with low memory requirements. Nat Methods. 2015; 12: 357–60. PubMed Abstract | Publisher Full Text | Free Full Text

[32] Kovaka S, Zimin AV, Pertea GM, et al.: Transcriptome assembly from long-read RNA-seq alignments with StringTie2. Genome Biol. 2019; 20: 278. PubMed Abstract | Publisher Full Text | Free Full Text

[33] Larkum AW, Orth RJ, Duarte CM: Seagrasses (Springer). 2006.

[34] Lee H, Golicz AA, Bayer PE, et al.: The Genome of a Southern Hemisphere Seagrass Species (Zostera muelleri). Plant Physiol. 2016; 172: 272–83. PubMed Abstract | Publisher Full Text | Free Full Text

[35] Les DH, Cleland MA, Waycott M: Phylogenetic Studies in Alismatidae, II: Evolution of Marine Angiosperms (Seagrasses) and Hydrophily. Systematic Botany. 1997; 22. Publisher Full Text

[36] Li H: Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM.2013.

[37] Lorenz R, Bernhart SH, Honer Zu Siederdissen C, et al.: ViennaRNA Package 2.0. Algorithms Mol Biol. 2011; 6: 26. PubMed Abstract | Publisher Full Text | Free Full Text

[38] McKenna A, Hanna M, Banks E, et al.: The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010; 20: 1297–303. PubMed Abstract | Publisher Full Text | Free Full Text

[39] Nawrocki EP, Eddy SR: Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics. 2013; 29: 2933–5. PubMed Abstract | Publisher Full Text | Free Full Text

[40] Olsen JL, Rouze P, Verhelst B, et al.: The genome of the seagrass Zostera marina reveals angiosperm adaptation to the sea. Nature. 2016; 530: 331–5. PubMed Abstract | Publisher Full Text

[41] Olsen JL, Stam WT, Coyer JA, et al.: North Atlantic phylogeography and large-scale population differentiation of the seagrass Zostera marina L. Mol Ecol. 2004; 13: 1923–41. PubMed Abstract | Publisher Full Text

[42] Shivaraj SM, Deshmukh R, Bhat JA, et al.: Understanding Aquaporin Transport System in Eelgrass (Zostera marina L.), an Aquatic Plant Species. Front Plant Sci. 2017; 8: 1334. PubMed Abstract | Publisher Full Text | Free Full Text

[43] Slater GS, Birney E: Automated generation of heuristics for biological sequence comparison. BMC Bioinformatics. 2005; 6: 31. PubMed Abstract | Publisher Full Text | Free Full Text

[44] Stanke M, Tzvetkova A, Morgenstern B: AUGUSTUS at EGASP: using EST, protein and genomic alignments for improved gene prediction in the human genome. Genome Biol. 2006; 7Suppl 1: S11 1–8. PubMed Abstract | Publisher Full Text | Free Full Text

[45] Thorsten BHR, Reusch H, Christoffer B, et al.: An ancient eelgrass clone in the Baltic. Mar Ecol Progress Series. 1999; 183: 301–04. Publisher Full Text

[46] van Katwijk MM, Thorhaug A, Marbà N, et al.: Global analysis of seagrass restoration: the importance of large-scale planting. J Appl. Ecol. 2016; 53: 567–78. Publisher Full Text

[47] Wang L, Tomas F, Mueller RS: Nutrient enrichment increases size of Zostera marina shoots and enriches for sulfur and nitrogen cycling bacteria in root-associated microbiomes. FEMS Microbiol Ecol. 2020; 96. PubMed Abstract | Publisher Full Text

[48] Wani SH, Kumar V, Khare T, et al.: Engineering salinity tolerance in plants: progress and prospects. Planta. 2020; 251: 76. PubMed Abstract | Publisher Full Text

[49] Waycott M, Duarte CM, Carruthers TJ, et al.: Accelerating loss of seagrasses across the globe threatens coastal ecosystems. Proc Natl Acad Sci U S A. 2009; 106: 12377–81. PubMed Abstract | Publisher Full Text | Free Full Text

[50] Xiao CL, Chen Y, Xie SQ, et al.: MECAT: fast mapping, error correction, and de novo assembly for single-molecule sequencing reads. Nat Methods. 2017; 14: 1072–74. PubMed Abstract | Publisher Full Text

[51] Yu L, Boström C, Franzenburg S, et al.: Somatic genetic drift and multilevel selection in a clonal seagrass. Nat Ecol Evol. 2020; 4: 952–962. PubMed Abstract | Publisher Full Text

[52] Zang Y, Chen J, Li R, et al.: Genome-wide analysis of the superoxide dismutase (SOD) gene family in Zostera marina and expression profile analysis under temperature stress. PeerJ. 2020; 8: e9063. PubMed Abstract | Publisher Full Text | Free Full Text

[53] Zimmerman RC: Scaling up: Predicting the Impacts of Climate Change on Seagrass Ecosystems. Estuaries and Coasts. 2020. Publisher Full Text

[54] Zwaenepoel A, Van de Peer Y: wgd-simple command line tools for the analysis of ancient whole-genome duplications. Bioinformatics. 2019; 35: 2153–55. PubMed Abstract | Publisher Full Text | Free Full Text

Improved chromosome-level genome assembly and annotation of the seagrass, Zostera marina (eelgrass)

Abstract

Keywords

Introduction

Methods

Sequencing strategy

Genome assembly and construction of pseudomolecule chromosomes

Annotation of repetitive elements and noncoding RNAs

Gene prediction and functional annotation

Results and discussion

Genome size and assembly

Table 1. Comparison of genome assemblies for Zostera marina v1.0 (Olsen et al., 2016) and v3.1 (current study).

Repetitive elements and noncoding RNAs

Table 2. Annotation of repeat elements.

Table 3. Known miRNAs identified in the genome of Zostera marina (v1.0 and v3.1).

Table 4. Novel miRNA candidates in Zostera marina (v1.0 and v3.1).

Protein-coding genes

Figure 1. Busco assessment results for Z. marina genomes v1.0 and v3.1.

Table 5. Genome annotation statistics.

Table 6. BUSCO completeness assessment of protein coding sequences in Z. marina version 3.1.

Table 7. Functional annotation.

Conclusions

Data availability

Underlying data

Acknowledgements

References

Comments on this article Comments (0)

Open Peer Review

Comments on this article Comments (0)

Open Peer Review

Reviewer Status

Reviewer Reports

Comments on this article

Browse by related subjects

Competing Interests Policy

Stay Updated