Neuropeptide evolution: Chelicerate neurohormone and neuropeptide genes may reflect one or more whole genome duplications
Introduction
Neuropeptides and neurohormones regulate and/or modulate many biological processes from basal physiological functions such as carbohydrate metabolism and water balance to cognitive functions. It is possible that they were among the first chemical messengers used by the nervous and endocrine systems to establish communication between different cells of a single organism. In the last two decades a large number of genomes have been sequenced which allow in theory to identify all the neuropeptide genes they contain. In practice, this is often more difficult as neuropeptide sequences are small and relatively variable and are usually only recognized when they either code for homologs of previously identified neuropeptides or a number of very similar peptide sequences separated by putative convertase cleavage sites. Nevertheless, it seems likely that most neuropeptide genes can be identified in a given sequenced Arthropod genome.
The large majority of neuropeptides act through G-protein coupled receptors (GPCRs) which are readily identified from genome sequences for the presence of their seven transmembrane regions that are well conserved. Thus, by analyzing simultaneously both neuropeptide and their putative receptors in its genome it is possible to get a fairly complete view of the neuropeptidome of a species. Using such methods it was previously shown that the neuropeptidomes of insects, annelids and mollusks are remarkably similar and share a large number of neuropeptide genes (Veenstra, 2010, Veenstra, 2011, Stewart et al., 2014).
As expected, neuropeptides and their GPCRs identified from the spider mite Tetranychus urticae, the first Chelicerate for which a complete genome was sequenced (Grbić et al., 2011), are most similar to those of insects (Veenstra et al., 2012). However, some of its neuropeptide genes were previously only known from mollusks. One of these, elevenin, has subsequently also been found in insects (Tanaka et al., 2014, Veenstra, 2014). Since then draft genomes have been published for four other Chelicerates, those of the African social velvet spider, Stegodyphus mimosarum, the Brazilian white-knee tarantula, Acanthoscurria geniculata (Sanggaard et al., 2014), the scorpion Mesobuthus martensii (Cao et al., 2013), and the house dust mite Dermatophagoides farinae (Chan et al., 2015). At the same time spider transcriptomes have become available for Latrodectus hesperus and Parasteatoda tepidariorum (Clarke et al., 2014, Posnien et al., 2014). It thus seemed interesting to take another look at Chelicerate neuropeptides to complete the picture of Arthropod neuropeptides and their evolution. Christie (2015) has already published a number of spider neuropeptide precursors based on the Latrodectus transcriptome, but, as shown here, the actual number of neuropeptide precursors in this transcriptome is considerably larger and analyzing whole genomes reveals additional neuropeptide genes that are not represented in the assembled transcriptomes. While this work was in progress the genome of the Myriapod Strigamia maritima was also published (Chipman et al., 2014). I included this genome in the analysis as the phylogenetic position of the Myriapods falls between insects and Chelicerates (Fig. 1). I felt it would be interesting to have Protostomian outgroups in the phylogenetic analysis of GPCRs, as it was expected that the more recently shared ancestry might lead to better resolved phylogenetic trees. I, therefore, also predicted the neuropeptide GPCRs from the genome of the mollusk Lottia gigantea (Simakov et al., 2013), a species from which a large number of neuropeptide genes has previously been described (Veenstra, 2010, Roch et al., 2011, Mirabeau and Joly, 2013). More recently a large number of GPCRs from the annelid Platynereis dumerilii were published, some of which were deorphanized (Bauknecht and Jékely, 2015). Given the interest of this data set together with the previously published list of putative neuropeptide precursors from this species (Conzelmann et al., 2013) those GPCR sequences were also included in the analysis. Apart from discovering a few novel putative Arthropod neuropeptide genes, the more interesting findings are that many spider and scorpion genes encoding neuropeptides and their receptors have been duplicated, perhaps the result of one or more whole genome duplications.
Section snippets
Materials and methods
Local BLAST (Altschul et al., 1997, Camacho et al., 2009) was used to analyze the published transcriptome of Latrodectus as well as the Stegodyphus, Mesobuthus, Strigamia and Dermatophagoides genomes (all obtained from NCBI) as well as the Parasteatoda transcriptome (downloaded from http://asgard.rc.fas.harvard.edu/download.html). The Acanthoscurria genome was analyzed directly at NCBI using the web interface and contigs that might contain neuropeptide genes or parts thereof were downloaded for
Results and discussion
The predicted neuropeptide and neurohormone precursors and their putative processing into active peptides are listed in Supplementary Tables 1–7. A comparison of neuropeptide genes found in various Arthropod genomes is presented in Table 1. Of the two spider genomes that were analyzed, the one from the tarantula A. geniculata is considerably larger, has a 30 times higher heterozygosity and a lower coverage than the velvet spider, S. mimosarum. It is, therefore, not surprising that the quality
Acknowledgments
I thank all those who generated all the data that I used here as well as those that wrote the programs that I needed to do so, nam nihil proprium est. Constructive comments from two reviewers are also gratefully acknowledged, as is institutional funding from the CNRS.
References (105)
- et al.
Characterisation and tissue distribution of the PISCF allatostatin receptor in the red flour beetle Tribolium castaneum
Insect Biochem. Mol. Biol.
(2013) - et al.
Large-scale combinatorial deorphanization of platynereis neuropeptide GPCRs
Cell Rep.
(2015) - et al.
New insights into the evolution of vertebrate CRH (corticotropin-releasing hormone) and invertebrate DH44 (diuretic hormone 44) receptors in metazoans
Gen. Comp. Endocrinol.
(2014) - et al.
The Drosophila gene CG9918 codes for a pyrokinin-1 receptor
Biochem. Biophys. Res. Commun.
(2005) - et al.
The draft genome, transcriptome, and microbiome of Dermatophagoides farinae reveal a broad spectrum of dust mite allergens
J. Allergy Clin. Immunol.
(2015) In silico characterization of the neuropeptidome of the Western black widow spider Latrodectus hesperus
Gen. Comp. Endocrinol.
(2015)- et al.
Genome-wide analysis of homeobox genes from Mesobuthus martensii reveals Hox gene duplication in scorpions
Insect Biochem. Mol. Biol.
(2015) - et al.
Neuropeptide signaling sequences identified by pyrosequencing of the American dog tick synganglion transcriptome during blood feeding and reproduction
Insect Biochem. Mol. Biol.
(2010) - et al.
The shrimp hyperglycemic hormone-like neuropeptide is encoded by multiple copies of genes arranged in a cluster
FEBS Lett.
(1998) - et al.
Discovery of a novel insect neuropeptide signaling system closely related to the insect adipokinetic hormone and corazonin hormonal systems
J. Biol. Chem.
(2010)
Evolution of the AKH/corazonin/ACP/GnRH receptor superfamily and their ligands in the Protostomia
Gen. Comp. Endocrinol.
A review of neurohormone GPCRs present in the fruitfly Drosophila melanogaster and the honey bee Apis mellifera
Prog. Neurobiol.
A genome-wide inventory of neurohormone GPCRs in the red flour beetle Tribolium castaneum
Front. Neuroendocrinol.
G protein-coupled receptor for diapause hormone, an inducer of Bombyx embryonic diapause
Biochem. Biophys. Res. Commun.
Identification of the endogenous cysteine-rich peptide trissin, a ligand for an orphan G protein-coupled receptor in Drosophila
Biochem. Biophys. Res. Commun.
Molecular cloning and functional expression of a Drosophila receptor for the neuropeptides capa-1 and -2
Biochem. Biophys. Res. Commun.
Alternatively spliced orcokinin isoforms and their functions in Tribolium castaneum
Insect Biochem. Mol. Biol.
Identification of a novel insect neuropeptide, CNMa and its receptor
FEBS Lett.
Achatin-I, an endogenous neuroexcitatory tetrapeptide from Achatina fulica Férussac containing a D-amino acid residue
Biochem. Biophys. Res. Commun.
Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes
J. Mol. Biol.
Neuronal control of metabolism through nutrient-dependent modulation of tracheal branching
Cell
Identification and characterization of receptors for ion transport peptide (ITP) and ITP-like (ITPL) in the silkworm Bombyx mori
J. Biol. Chem.
Insulin/IGF signaling and its regulation in Drosophila
Gen. Comp. Endocrinol.
Identification, spatial expression analysis and functional characterization of a pyrokinin-1 receptor in the Chagas’ disease vector, Rhodnius prolixus
Mol. Cell. Endocrinol.
Molecular cloning and biological activity of ecdysis-triggering hormones in Drosophila melanogaster
FEBS Lett.
Orcokinins in insects and other invertebrates
Insect Biochem. Mol. Biol.
Anatomical organization of two vasopressin–neurophysin-like neurosecretory cells throughout the central nervous system of the migratory locust
Gen. Comp. Endocrinol.
Best practices in insect genome sequencing: what works and what doesn’t
Curr. Opinion Insect Sci.
Evolution of GnRH: diving deeper
Gen. Comp. Endocrinol.
The unique evolution of neuropeptide genes in the silkworm Bombyx mori
Insect Biochem. Mol. Biol.
Molecular cloning, functional expression, and gene silencing of two Drosophila receptors for the Drosophila neuropeptide pyrokinin-2
Biochem. Biophys. Res. Commun.
Neuropeptidergic control of the hindgut in the black-legged tick Ixodes scapularis
Int. J. Parasitol.
Orcokinin: a novelmyotropic peptide from the nervous system of the crayfish, Orconectes limosus
Peptides
OKB, a novel family of brain-gut neuropeptides from insects
Insect Biochem. Mol. Biol.
Transcriptome analysis of neuropeptides and G-protein coupled receptors (GPCRs) for neuropeptides in the brown planthopper Nilaparvata lugens
Peptides
Isolation and structure of corazonin, a cardioactive peptide from the American cockroach
FEBS Lett.
Allatostatin C and its paralog allatostatin double C: the Arthropod somatostatins
Insect Biochem. Mol. Biol.
Does corazonin signal nutritional stress in insects?
Insect Biochem. Mol. Biol.
Neurohormones and neuropeptides encoded by the genome of Lottia gigantea, with reference to other mollusks and insects
Gen. Comp. Endocrinol.
Neuropeptide evolution: neurohormones and neuropeptides predicted from the genomes of Capitella teleta and Helobdella robusta
Gen. Comp. Endocrinol.
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
Nucl. Acids Res.
Sex chromosomes and meiosis in spiders: a review
Ploidy manipulation in molluscan shellfish: a review
J. Shellfish Res.
Improved prediction of signal peptides: signalP 3.0
J. Mol. Biol.
Synganglion transcriptome and developmental global gene expression in adult females of the American dog tick, Dermacentor variabilis (Acari: Ixodidae)
Insect Mol. Biol.
The endothelin system: evolution of vertebrate-specific ligand–receptor interactions by three rounds of genome duplication
Mol. Biol. Evol.
Revisiting the concept of identifiable neurons
Brain Behav. Evol.
BLAST+: architecture and applications
BMC Bioinformatics
The genome of Mesobuthus martensii reveals a unique adaptation model of Arthropods
Nat. Commun.
Molecular cloning and functional expression of the first insect FMRFamide receptor
Proc. Natl. Acad. Sci. USA
Cited by (36)
Neuropeptides in Rhipicephalus microplus and other hard ticks
2022, Ticks and Tick-borne DiseasesBombyx neuropeptide G protein-coupled receptor A14 and A15 are two functional G protein-coupled receptors for CCHamide neuropeptides
2021, Insect Biochemistry and Molecular BiologyAllatostatin-CC
2021, Handbook of Hormones: Comparative Endocrinology for Basic and Clinical ResearchNeuropeptide and microRNA regulators of juvenile hormone production
2020, General and Comparative EndocrinologyCitation Excerpt :These paralogs named AST double-C (AST-CC) and AST triple-C (AST-CCC) have conserved primary structures (Table 2). All three forms were found in chelicerates, decapod crustaceans and selected insects such as L. migratoria and the sawfly Athalia rosae (Veenstra, 2016a, 2016b). In other insects, two forms of AST-C have been found either AST-C and AST-CC or AST-CC and AST-CCC.
The TRH-ortholog EFLamide in the migratory locust
2020, Insect Biochemistry and Molecular Biology