Neuropeptide evolution: Chelicerate neurohormone and neuropeptide genes may reflect one or more whole genome duplications

https://doi.org/10.1016/j.ygcen.2015.11.019Get rights and content

Highlights

  • Many neuropeptide and neuropeptide GPCR genes have paralogs in Chelicerates.

  • These paralogs could be due to whole genome duplications.

  • CCRFamide is a novel Arthropod neuropeptide gene.

Abstract

Four genomes and two transcriptomes from six Chelicerate species were analyzed for the presence of neuropeptide and neurohormone precursors and their GPCRs. The genome from the spider Stegodyphus mimosarum yielded 87 neuropeptide precursors and 120 neuropeptide GPCRs. Many neuropeptide transcripts were also found in the transcriptomes of three other spiders, Latrodectus hesperus, Parasteatoda tepidariorum and Acanthoscurria geniculata. For the scorpion Mesobuthus martensii the numbers are 79 and 93 respectively. The very small genome of the house dust mite, Dermatophagoides farinae, on the other hand contains a much smaller number of such genes. A few new putative Arthropod neuropeptide genes were discovered. Thus, both spiders and the scorpion have an achatin gene and in spiders there are two different genes encoding myosuppressin-like peptides while spiders also have two genes encoding novel LGamides. Another finding is the presence of trissin in spiders and scorpions, while neuropeptide genes that seem to be orthologs of Lottia LFRYamide and Platynereis CCRFamide were also found. Such genes were also found in various insect species, but seem to be lacking from the Holometabola. The Chelicerate neuropeptide and neuropeptide GPCR genes often have paralogs. As the large majority of these are probably not due to local gene duplications, is plausible that they reflect the effects of one or more ancient whole genome duplications.

Introduction

Neuropeptides and neurohormones regulate and/or modulate many biological processes from basal physiological functions such as carbohydrate metabolism and water balance to cognitive functions. It is possible that they were among the first chemical messengers used by the nervous and endocrine systems to establish communication between different cells of a single organism. In the last two decades a large number of genomes have been sequenced which allow in theory to identify all the neuropeptide genes they contain. In practice, this is often more difficult as neuropeptide sequences are small and relatively variable and are usually only recognized when they either code for homologs of previously identified neuropeptides or a number of very similar peptide sequences separated by putative convertase cleavage sites. Nevertheless, it seems likely that most neuropeptide genes can be identified in a given sequenced Arthropod genome.

The large majority of neuropeptides act through G-protein coupled receptors (GPCRs) which are readily identified from genome sequences for the presence of their seven transmembrane regions that are well conserved. Thus, by analyzing simultaneously both neuropeptide and their putative receptors in its genome it is possible to get a fairly complete view of the neuropeptidome of a species. Using such methods it was previously shown that the neuropeptidomes of insects, annelids and mollusks are remarkably similar and share a large number of neuropeptide genes (Veenstra, 2010, Veenstra, 2011, Stewart et al., 2014).

As expected, neuropeptides and their GPCRs identified from the spider mite Tetranychus urticae, the first Chelicerate for which a complete genome was sequenced (Grbić et al., 2011), are most similar to those of insects (Veenstra et al., 2012). However, some of its neuropeptide genes were previously only known from mollusks. One of these, elevenin, has subsequently also been found in insects (Tanaka et al., 2014, Veenstra, 2014). Since then draft genomes have been published for four other Chelicerates, those of the African social velvet spider, Stegodyphus mimosarum, the Brazilian white-knee tarantula, Acanthoscurria geniculata (Sanggaard et al., 2014), the scorpion Mesobuthus martensii (Cao et al., 2013), and the house dust mite Dermatophagoides farinae (Chan et al., 2015). At the same time spider transcriptomes have become available for Latrodectus hesperus and Parasteatoda tepidariorum (Clarke et al., 2014, Posnien et al., 2014). It thus seemed interesting to take another look at Chelicerate neuropeptides to complete the picture of Arthropod neuropeptides and their evolution. Christie (2015) has already published a number of spider neuropeptide precursors based on the Latrodectus transcriptome, but, as shown here, the actual number of neuropeptide precursors in this transcriptome is considerably larger and analyzing whole genomes reveals additional neuropeptide genes that are not represented in the assembled transcriptomes. While this work was in progress the genome of the Myriapod Strigamia maritima was also published (Chipman et al., 2014). I included this genome in the analysis as the phylogenetic position of the Myriapods falls between insects and Chelicerates (Fig. 1). I felt it would be interesting to have Protostomian outgroups in the phylogenetic analysis of GPCRs, as it was expected that the more recently shared ancestry might lead to better resolved phylogenetic trees. I, therefore, also predicted the neuropeptide GPCRs from the genome of the mollusk Lottia gigantea (Simakov et al., 2013), a species from which a large number of neuropeptide genes has previously been described (Veenstra, 2010, Roch et al., 2011, Mirabeau and Joly, 2013). More recently a large number of GPCRs from the annelid Platynereis dumerilii were published, some of which were deorphanized (Bauknecht and Jékely, 2015). Given the interest of this data set together with the previously published list of putative neuropeptide precursors from this species (Conzelmann et al., 2013) those GPCR sequences were also included in the analysis. Apart from discovering a few novel putative Arthropod neuropeptide genes, the more interesting findings are that many spider and scorpion genes encoding neuropeptides and their receptors have been duplicated, perhaps the result of one or more whole genome duplications.

Section snippets

Materials and methods

Local BLAST (Altschul et al., 1997, Camacho et al., 2009) was used to analyze the published transcriptome of Latrodectus as well as the Stegodyphus, Mesobuthus, Strigamia and Dermatophagoides genomes (all obtained from NCBI) as well as the Parasteatoda transcriptome (downloaded from http://asgard.rc.fas.harvard.edu/download.html). The Acanthoscurria genome was analyzed directly at NCBI using the web interface and contigs that might contain neuropeptide genes or parts thereof were downloaded for

Results and discussion

The predicted neuropeptide and neurohormone precursors and their putative processing into active peptides are listed in Supplementary Tables 1–7. A comparison of neuropeptide genes found in various Arthropod genomes is presented in Table 1. Of the two spider genomes that were analyzed, the one from the tarantula A. geniculata is considerably larger, has a 30 times higher heterozygosity and a lower coverage than the velvet spider, S. mimosarum. It is, therefore, not surprising that the quality

Acknowledgments

I thank all those who generated all the data that I used here as well as those that wrote the programs that I needed to do so, nam nihil proprium est. Constructive comments from two reviewers are also gratefully acknowledged, as is institutional funding from the CNRS.

References (105)

  • F. Hauser et al.

    Evolution of the AKH/corazonin/ACP/GnRH receptor superfamily and their ligands in the Protostomia

    Gen. Comp. Endocrinol.

    (2014)
  • F. Hauser et al.

    A review of neurohormone GPCRs present in the fruitfly Drosophila melanogaster and the honey bee Apis mellifera

    Prog. Neurobiol.

    (2006)
  • F. Hauser et al.

    A genome-wide inventory of neurohormone GPCRs in the red flour beetle Tribolium castaneum

    Front. Neuroendocrinol.

    (2008)
  • T. Homma et al.

    G protein-coupled receptor for diapause hormone, an inducer of Bombyx embryonic diapause

    Biochem. Biophys. Res. Commun.

    (2006)
  • T. Ida et al.

    Identification of the endogenous cysteine-rich peptide trissin, a ligand for an orphan G protein-coupled receptor in Drosophila

    Biochem. Biophys. Res. Commun.

    (2011)
  • A. Iversen et al.

    Molecular cloning and functional expression of a Drosophila receptor for the neuropeptides capa-1 and -2

    Biochem. Biophys. Res. Commun.

    (2002)
  • H. Jiang et al.

    Alternatively spliced orcokinin isoforms and their functions in Tribolium castaneum

    Insect Biochem. Mol. Biol.

    (2015)
  • S.H. Jung et al.

    Identification of a novel insect neuropeptide, CNMa and its receptor

    FEBS Lett.

    (2014)
  • Y. Kamatani et al.

    Achatin-I, an endogenous neuroexcitatory tetrapeptide from Achatina fulica Férussac containing a D-amino acid residue

    Biochem. Biophys. Res. Commun.

    (1989)
  • A. Krogh et al.

    Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes

    J. Mol. Biol.

    (2001)
  • G.A. Linneweber et al.

    Neuronal control of metabolism through nutrient-dependent modulation of tracheal branching

    Cell

    (2014)
  • C. Nagai et al.

    Identification and characterization of receptors for ion transport peptide (ITP) and ITP-like (ITPL) in the silkworm Bombyx mori

    J. Biol. Chem.

    (2014)
  • D.R. Nässel et al.

    Insulin/IGF signaling and its regulation in Drosophila

    Gen. Comp. Endocrinol.

    (2015)
  • J.P. Paluzzi et al.

    Identification, spatial expression analysis and functional characterization of a pyrokinin-1 receptor in the Chagas’ disease vector, Rhodnius prolixus

    Mol. Cell. Endocrinol.

    (2012)
  • Y. Park et al.

    Molecular cloning and biological activity of ecdysis-triggering hormones in Drosophila melanogaster

    FEBS Lett.

    (1999)
  • N. Pascual et al.

    Orcokinins in insects and other invertebrates

    Insect Biochem. Mol. Biol.

    (2004)
  • C. Rémy et al.

    Anatomical organization of two vasopressin–neurophysin-like neurosecretory cells throughout the central nervous system of the migratory locust

    Gen. Comp. Endocrinol.

    (1980)
  • S. Richards et al.

    Best practices in insect genome sequencing: what works and what doesn’t

    Curr. Opinion Insect Sci.

    (2015)
  • G.J. Roch et al.

    Evolution of GnRH: diving deeper

    Gen. Comp. Endocrinol.

    (2011)
  • L. Roller et al.

    The unique evolution of neuropeptide genes in the silkworm Bombyx mori

    Insect Biochem. Mol. Biol.

    (2008)
  • C. Rosenkilde et al.

    Molecular cloning, functional expression, and gene silencing of two Drosophila receptors for the Drosophila neuropeptide pyrokinin-2

    Biochem. Biophys. Res. Commun.

    (2003)
  • L.1. Šimo et al.

    Neuropeptidergic control of the hindgut in the black-legged tick Ixodes scapularis

    Int. J. Parasitol.

    (2014)
  • J. Stangier et al.

    Orcokinin: a novelmyotropic peptide from the nervous system of the crayfish, Orconectes limosus

    Peptides

    (1992)
  • M. Sterkel et al.

    OKB, a novel family of brain-gut neuropeptides from insects

    Insect Biochem. Mol. Biol.

    (2012)
  • Y. Tanaka et al.

    Transcriptome analysis of neuropeptides and G-protein coupled receptors (GPCRs) for neuropeptides in the brown planthopper Nilaparvata lugens

    Peptides

    (2014)
  • J.A. Veenstra

    Isolation and structure of corazonin, a cardioactive peptide from the American cockroach

    FEBS Lett.

    (1989)
  • J.A. Veenstra

    Allatostatin C and its paralog allatostatin double C: the Arthropod somatostatins

    Insect Biochem. Mol. Biol.

    (2009)
  • J.A. Veenstra

    Does corazonin signal nutritional stress in insects?

    Insect Biochem. Mol. Biol.

    (2009)
  • J.A. Veenstra

    Neurohormones and neuropeptides encoded by the genome of Lottia gigantea, with reference to other mollusks and insects

    Gen. Comp. Endocrinol.

    (2010)
  • J.A. Veenstra

    Neuropeptide evolution: neurohormones and neuropeptides predicted from the genomes of Capitella teleta and Helobdella robusta

    Gen. Comp. Endocrinol.

    (2011)
  • S.F. Altschul et al.

    Gapped BLAST and PSI-BLAST: a new generation of protein database search programs

    Nucl. Acids Res.

    (1997)
  • D. Araujo et al.

    Sex chromosomes and meiosis in spiders: a review

  • A.R. Beaumont et al.

    Ploidy manipulation in molluscan shellfish: a review

    J. Shellfish Res.

    (1991)
  • J.D. Bendtsen et al.

    Improved prediction of signal peptides: signalP 3.0

    J. Mol. Biol.

    (2004)
  • B.W. Bissinger et al.

    Synganglion transcriptome and developmental global gene expression in adult females of the American dog tick, Dermacentor variabilis (Acari: Ixodidae)

    Insect Mol. Biol.

    (2011)
  • I. Braasch et al.

    The endothelin system: evolution of vertebrate-specific ligand–receptor interactions by three rounds of genome duplication

    Mol. Biol. Evol.

    (2009)
  • T.H. Bullock

    Revisiting the concept of identifiable neurons

    Brain Behav. Evol.

    (2000)
  • C. Camacho et al.

    BLAST+: architecture and applications

    BMC Bioinformatics

    (2009)
  • Z. Cao et al.

    The genome of Mesobuthus martensii reveals a unique adaptation model of Arthropods

    Nat. Commun.

    (2013)
  • G. Cazzamali et al.

    Molecular cloning and functional expression of the first insect FMRFamide receptor

    Proc. Natl. Acad. Sci. USA

    (2002)
  • Cited by (36)

    • Allatostatin-CC

      2021, Handbook of Hormones: Comparative Endocrinology for Basic and Clinical Research
    • Neuropeptide and microRNA regulators of juvenile hormone production

      2020, General and Comparative Endocrinology
      Citation Excerpt :

      These paralogs named AST double-C (AST-CC) and AST triple-C (AST-CCC) have conserved primary structures (Table 2). All three forms were found in chelicerates, decapod crustaceans and selected insects such as L. migratoria and the sawfly Athalia rosae (Veenstra, 2016a, 2016b). In other insects, two forms of AST-C have been found either AST-C and AST-CC or AST-CC and AST-CCC.

    • The TRH-ortholog EFLamide in the migratory locust

      2020, Insect Biochemistry and Molecular Biology
    View all citing articles on Scopus
    View full text