Abstract
Campylobacter hyointestinalis is an emerging pathogen currently divided in two subspecies: C. hyointestinalis subsp. lawsonii which is predominantly recovered from pigs, and C. hyointestinalis subsp. hyointestinalis which can be found in a much wider range of mammalian hosts. Despite C. hyointestinalis being reported as an emerging pathogen, its evolutionary and host-associated diversification patterns are still vastly unexplored. For this reason, we generated whole-genome sequences of 13 C. hyointestinalis subsp. hyointestinalis strains and performed a comprehensive comparative analysis including publicly available C. hyointestinalis subsp. hyointestinalis and C. hyointestinalis subsp. lawsonii genomes, to gain insight into the genomic variation of these differentially-adapted subspecies. Both subspecies are distinct phylogenetic lineages which present an apparent barrier to homologous recombination, suggesting genetic isolation. This is further supported by accessory gene patterns that recapitulate the core genome phylogeny. Additionally, C. hyointestinalis subsp. hyointestinalis presents a bigger and more diverse accessory genome, which probably reflects its capacity to colonize different mammalian hosts unlike C. hyointestinalis subsp. lawsonii that is presumably host-restricted. This greater plasticity in the accessory genome of C. hyointestinalis subsp. hyointestinalis correlates to a higher incidence of genome-wide recombination events, that may be the underlying mechanism driving its diversification. Concordantly, both subspecies present distinct patterns of gene families involved in genome plasticity and DNA repair like CRISPR-associated proteins and restriction-modification systems. Together, our results provide an overview of the genetic mechanisms shaping the genomes of C. hyointestinalis subspecies, contributing to understand the biology of Campylobacter species that are increasingly recognized as emerging pathogens.
Similar content being viewed by others
Introduction
The genus Campylobacter consists in a diverse group of bacteria currently classified into 32 species and 13 subspecies. Among them, C. jejuni and C. coli have drawn most of the attention because they are leading causes of human gastroenteritis worldwide1. However, the recent application of whole-genome sequencing to study bacterial populations has increased the clinical awareness of campylobacteriosis and highlighted the importance of other neglected Campylobacter species, like C. fetus2,3,4,5, as causative agents of human and animal infections. Among them, C. hyointestinalis is an emerging pathogen that was first isolated from swine with proliferative enteritis6 and has since been sporadically recovered from human infections, but also found as a commensal in a wide variety of wild, farm and domestic mammals (including cattle, pigs, dogs, hamsters, deer and sheep)7.
C. hyointestinalis is currently divided in two subspecies based on genetic and phenotypic traits8,9. While C. hyointestinalis subsp. hyointestinalis has a broad host range, C. hyointestinalis subsp. lawsonii has been predominantly recovered from pigs. Some pioneering studies at both genetic and protein levels have suggested that C. hyointestinalis harbors even further intraspecific diversity10,11,12 which could facilitate its adaptation to diverse hosts and environments. However, these observations remain to be assessed at higher resolution due to the lack of available genomic data for both subspecies. So, the evolutionary forces driving their genetic and ecological distinctions have not been explored at the whole-genome level.
Here, we performed whole-genome sequencing of 13 C. hyointestinalis subsp. hyointestinalis strains isolated from healthy cattle and from a natural watercourse, that were collected from farms located around Sherbrooke, Québec, Canada. By incorporating this information to the available genomes of both subspecies, we performed a pangenome analysis to elucidate the main sources of molecular diversity in both subspecies and the probable genetic mechanisms and functional characteristics that distinguish the presumably host-restricted C. hyointestinalis subsp. lawsonii from the generalist C. hyointestinalis subsp. hyointestinalis. Our work provides the first comparative analysis of both C. hyointestinalis subspecies at the pangenome level and will guide future efforts to understand the patterns of host-associated evolution in emerging Campylobacter pathogens.
Results
By whole-genome sequencing 13 C. hyointestinalis subsp. hyointestinalis strains, we enlarged by 45% the current collection of genomes available for C. hyointestinalis. Then, by recovering 29 additional genomes of C. hyointestinalis subsp. hyointestinalis (n = 19) and C. hyointestinalis subsp. lawsonii (n = 10) from public databases, we built a genomic dataset consisting of 42 genomes (Table 1). These genomes represent strains isolated between 1985 and 2016 from five different hosts in six different countries. This dataset was used to apply comparative pangenomic, phylogenetic and ecological approaches to uncover the main sources of genetic variability between C. hyointestinalis subspecies.
Genetic diversity of C. hyointestinalis subsp. hyointestinalis strains sequenced in this study
To determine the degree of genetic variability among the new C. hyointestinalis subsp. hyointestinalis genomes generated from strains isolated in Canada, we used the currently available multilocus sequence typing (MLST) scheme for C. hyointestinalis. This analysis revealed that 7 out of 13 (54%) genomes presented new sequence types (STs). Out of them, three new STs (strains 006A-0063, 006A-0178 and 006A-0196) were product of new combinations of previously described alleles. The remaining novel STs were product of previously unknown alleles for genes tkt, aspA, glnA and pgm. Remarkably, not a single C. hyointestinalis subsp. hyointestinalis genome sequenced in this study harbored the same MLST genotype (Table S1).
C. hyointestinalis subspecies are genetically isolated lineages
To gain insight into the population structure of C. hyointestinalis we reconstructed the species clonal phylogeny starting from a core genome alignment that consisted in 1,320,272 positions (representing 66% of the longest genome). After removing recombinations only 81,000 positions (representing 6% of the original core genome alignment) remained in the clonal frame. The resulting clonal phylogeny showed a highly structured topology with both subspecies completely separated in two distinct lineages with clear differences in host distribution (Fig. 1A,B). This was in line with a mean Average Nucleotide Identity (ANI)13 of ~ 95% separating C. hyointestinalis subsp. hyointestinalis from C. hyointestinalis subsp. lawsonii (Fig. 1C). Evidence supporting the genetic isolation of both subspecies also came from exploring genome-wide recombination patterns, which revealed a barrier to homologous recombination between C. hyointestinalis subsp. hyointestinalis from C. hyointestinalis subsp. lawsonii (with the exception of C. hyointestinalis subsp. hyointestinalis strains S1499c and 006A-0180 that have recombined with C. hyointestinalis subsp. lawsonii strains) (Fig. 1D). Furthermore, C. hyointestinalis subsp. hyointestinalis seems to be much more recombinogenic than C. hyointestinalis subsp. lawsonii, as evidenced by a significantly higher proportion of their genomes contained within recombinant regions (Fig. 1E).
Accessory genes discriminate both C. hyointestinalis subspecies
To gain further insight into the genomic evolution of C. hyointestinalis subspecies we reconstructed its pangenome. A total of 4317 gene clusters were identified out of which 3040 (70%) were accessory genes (Table S2). The accessory genome median size was 580 (IQR = 174) and 538 (IQR = 74) for C. hyointestinalis subsp. hyointestinalis and C. hyointestinalis subsp. lawsonii, respectively. Figure 2A shows a slightly significant difference in the accessory genome size in favor of C. hyointestinalis subsp. hyointestinalis (p = 0.023, Mann–Whitney U test). To discard possible confounding effects due to the unbalanced number of genomes available for each subspecies, we repeated this analysis by sub-sampling C. hyointestinalis subsp. hyointestinalis genomes to the number of available C. hyointestinalis subsp. lawsonii genomes. This analysis revealed a still observable difference in the accessory genome size in favor of C. hyointestinalis subsp. hyointestinalis (Fig. S1). This tendency was also observable when calculating the diversity of accessory genes using the inverted Simpson’s index for both subspecies (p = 0.00021, Mann–Whitney U test) (Fig. 2B). Accessory gene presence/absence patterns also allowed to completely discriminate between C. hyointestinalis subsp. hyointestinalis and C. hyointestinalis subsp. lawsonii using a Principal Components Analysis (PCA), indicating that they have subspecies-specific accessory gene repertories (Fig. 2C). Indeed, 1562 accessory gene clusters were exclusively found in C. hyointestinalis subsp. hyointestinalis genomes while only 618 were specific to C. hyointestinalis subsp. lawsonii genomes.
Functional distinctions in the accessory genome of C. hyointestinalis subspecies
To evaluate possible functional aspects associated to different accessory gene patterns distinguishing C. hyointestinalis subspecies, we performed a functional classification based on the eggNOG database14. First, we found a complete separation of subspecies when using functional annotations to perform a PCA (p = 0.001, Permanova test), supporting that accessory genomes are functionally different between them (Fig. 3A). Then, we looked for functional categories that could discriminate between subspecies and we found that genes belonging to the functional category referred as “DNA replication, recombination and repair” (L) presented the most informative discriminatory patterns (Fig. 3B). Given this evidence, we studied CRISPR-associated proteins (Cas) and Restriction-Modification (R-M) systems, which are known to be involved in DNA recombination and repair. Figure 4 shows that Cas systems are more diverse and widespread in C. hyointestinalis subsp. hyointestinalis. Importantly, our analysis did not find any complete Cas system in C. hyointestinalis subsp. lawsonii genomes. In particular, CAS system type I was the most prevalent in C. hyointestinalis subsp. hyointestinalis genomes (59%) A higher number of complete R-M systems were found in C. hyointestinalis subsp. lawsonii (mean = 5), than in C. hyointestinalis subsp. hyointestinalis (mean = 2). In particular, type II and type III R-M systems had > 2 copies in 90% of C. hyointestinalis subsp. lawsonii genomes and only in 25% of C. hyointestinalis subsp. hyointestinalis genomes.
Discussion
Recently, the first analysis of multiple C. hyointestinalis strains confirmed the previously observed highly diverse nature of this bacterial species at the whole-genome level15 by comparing strains recovered from ruminant livestock in New Zealand. Despite these strains being collected in geographically close locations, they showed all different and novel MLST genotypes. Similarly, our collection of isolates from Canada also presented different MLST genotypes (most of which were also novel), suggesting that patterns of high genomic variability in C. hyointestinalis are a distinctive feature of this species in geographically distant populations.
Hitherto, main patterns of variation in C. hyointestinalis have been identified by studying C. hyointestinalis subsp. hyointestinalis genomes. This limitation prevented to compare if the observed trends were conserved between both subspecies or if evolutionary forces are differentially impacting their genomes. Accordingly, not only our work increased the availability of C. hyointestinalis subsp. hyointestinalis genomes from a previously unsampled geographic region, but also took advantage of the recent release of novel C. hyointestinalis subsp. lawsonii genomes to perform a comparative pangenome analysis that revealed the main forces underpinning the genomic diversity now considering both subspecies.
Despite sampling bias may exist, the available data indicate that C. hyointestinalis subspecies are ecologically distinct, given that C. hyointestinalis subsp. lawsonii has been mainly recovered from pigs while C. hyointestinalis subsp. hyointestinalis is a generalist that colonizes several mammalian species. Host specialization has been observed in other Campylobacter species, such as in C. fetus lineages that preferably infect cows, humans or reptiles3,16, in phylogenetically distinct C. coli isolates from diseased humans or riparian environments17, and in global clonal complexes of C. jejuni with differential host preferences18. In most of these cases, strong lineage-specific recombination and accessory gene gain/loss patterns have been identified, concordantly to what is expected for bacterial lineages that undergo ecological isolation. For example, a barrier to homologous recombination like that observed between C. hyointestinalis subspecies has been also detected between mammal- and reptile-associated C. fetus subspecies16, and lineage-specific recombination patterns have been found in the C. jejuni clonal complex ST-403 that is unable to colonize chicken19. Interestingly, this is correlated with the presence of lineage-specific repertories of R-M systems, as well as we observed for type II and type III R-M systems in C. hyointestinalis subspecies. Moreover, other molecular mechanisms involved in genome plasticity like CRISPR/Cas systems are unevenly distributed in agricultural or non-agricultural C. jejuni/coli genomes20, indicating that these systems are differentially present in ecologically distinct niches resembling again the patterns we observed for type I Cas systems in C. hyointestinalis subspecies. However, the presence of two C. hyointestinalis subsp. hyointestinalis genomes (006A-0180 and S1499c) of cattle origin recombining with C. hyointestinalis subsp. lawsonii genomes of porcine origin, suggests that both subspecies have occupied the same niche at some point either in cattle or pigs. This had been previously observed for strain S1499c by Wilkinson et al. (2018)15, which is defined by the authors as an atypical isolate with a particularly highly diverse genome. Interestingly, strains 006A-0180 and S1499c form a divergent branch within the C. hyointestinalis subsp. hyointestinalis clade indicating they are genetically distinct. This suggests the probable existence of yet unsampled intermediate lineages between genetically isolated C. hyointestinalis subsp. hyointestinalis and C. hyointestinalis subsp. lawsonii that have kept the capacity of exchanging genetic material. Also, the available genomic dataset for C. hyointestinalis subsp. lawsonii is mostly composed by strains isolated in the same geographic region (California, United States). This may be a confounding factor given that conclusions about the genetic isolation of this subspecies could change if a more diverse dataset is included. Additionally, a more diverse set of C. hyointestinalis subsp. hyointestinalis genomes are needed to confirm this observation, in particular isolated for strains isolated from pigs. This would allow to investigate genetic relatedness between both subspecies coexisting in the same host. Alternatively, the observed recombination barrier could not be entirely related to genomic differences preventing horizontal gene transfer and recombination between subspecies, and might be explained by the physical isolation of both subspecies in different hosts.
The maintenance of lineage-specific repertories of molecular machineries that modulate genome plasticity is probably an extended mechanism in Campylobacter, considering that recombination is an important evolutionary force for the adaptation and acquisition of a host signature in well-known Campylobacter pathogens21. In general, adaptation occurs in favor of gradual host specialization, but generalism is also widely observed in nature, for example in extremely successful C. jejuni lineages that can be found in high prevalence from both agricultural sources or human infections22. A generalist phenotype can be thought as an advantage for bacteria that colonize farm animals, since it allows the subsistence in multiple mammalian species that thieve in close proximity. However, this also represents an increased risk for zoonotic transmission since these animals are usually in contact with humans. Indeed, this scenario is reflected in C. hyointestinalis subspecies, given that the generalist C. hyointestinalis subsp. hyointestinalis has been frequently isolated from human infections in contrast to the lack of reported cases of human infections with C. hyointestinalis subsp. lawsonii.
Despite our analysis uncovered the main forces shaping the intra-specific diversity of C. hyointestinalis and our results support the observed epidemiological pattern in both subspecies, the availability of comprehensive genomic datasets for most campylobacters is quite restricted23, so the integration of strain collections from different hosts, geographic regions and clinical conditions is necessary to deepening our understanding of the genomic evolution in this emerging pathogen and other neglected Campylobacter species.
Methods
Sampling and bacterial isolation
Samples were collected as described previously24. Briefly, cattle feces samples were transported in Enteric Plus medium (Meridian Bioscience Inc, Ohio, USA) and processed on the same day. About 1–2 g of each fecal sample were transferred to 25 ml of Preston selective enrichment broth (Oxoid, Nepean, Ontario, Canada) and incubated 3–4 h at 37 ºC and then transferred to 42 ºC and incubated for 48 h. After incubation, 20 μl were streaked on a Karmali plate (Oxoid) and incubated at 42 ºC for 48 h. For environmental water, 3000 ml of water were collected and transported on ice to the laboratory, held at 4 ºC and tested within 24 h. Water was filtered through a 0.45 μm pore-size membrane filter and Preston broth and Karmali plate were used as above to isolate Campylobacter.
Whole genome sequencing, available data and taxonogenomic analyses
Cells were pelleted from culture plates and phosphate-buffered saline (PBS). Genomic DNA preparation was performed using a BioRobot M48 (Qiagen). DNA was prepared and sequenced using the Illumina Hi-Seq platform with library fragment sizes of 200–300 bp. and a read length of 100 bp at the Wellcome Sanger Institute. Each sequenced genome was de novo assembled with Velvet25, SSPACE v2.026 and GapFiller v1.127 with default parameters. Resulting contigs were annotated using Prokka28. Species membership was checked by calculating the Average Nucleotide Identity (ANI) index as previously described29. Multilocus sequence typing (MLST) was performed from genomic assemblies with the available C. hyointestinalis scheme at PubMLST (https://www.pubmlst.org) using MLSTar30. Available genomic data at the time of designing this work consisted in 19 C. hyointestinalis subsp. hyointestinalis strains and 10 C. hyointestinalis subsp. lawsonii strains, that were added to the 13 C. hyointestinalis subsp. hyointestinalis sequenced in this work (Table S3) to build a final dataset of 42 genomes (Table 1).
Pangenome and recombination analyses
A multiple genome alignment was performed with the progressiveMauve algorithm31 and the final core genome alignment was defined by concatenating locally collinear blocks (LCBs) longer than 500 bp present in at least 41 out of 42 genomes (~ 98%). Recombinant regions were identified running Gubbins32 with default parameters. The pan-genome was reconstructed using Pewit (https://github.com/iferres/pewit)33. Briefly, for every genome, each annotated gene is scanned against the Pfam database34 using HMMER3 v3.1b2 hmmsearch35 and its domain architecture is recorded (presence and order). A primary set of orthologous clusters is generated by grouping genes sharing exactly the same domain architecture. Then, remaining genes without hits against the Pfam database are compared to each other at protein level using HMMER3 v3.1b2 phmmer and clustered using the MCL algorithm36. These coarse clusters are then splitted using a tree-pruning algorithm which allows to discriminate between orthologous and paralogous genes. This algorithm automatically aligns each gene cluster and builds a Neighbor-Joining tree and iteratively refines coarse clusters and detects paralogous using each gene-tree to i) detect nodes whose descendants all belong to the same genome and ii) split the tree in many subtrees as necessary to achieve the minimum set of subtrees with just one tip per genome. Finally, singletons (genes occurring in a single genome) generated in the previous steps are refined by trying to reallocate them to previously generated clusters by comparing each singleton against cluster-specific HMMs using HMMER3 v3.1b2 hmmsearch35. Accessory genes were defined as those gene clusters occurring in less than 98% of the genomes (41/42). Ecological distances over accessory gene patterns like Jaccard index (to measure diversity between strains) or Shannon index (to measure intra-genomic diversity) and Principal Component Analysis (PCA) were calculated with the base or vegan37 packages in R v3.6.0.
Analysis of specific gene families and functional categories
Several specific gene families of interest were recovered and analyzed form C. hyointestinalis genomes. CRISPR-associated protein (CAS) gene clusters were identified and classified using CRISPRCasFinder38. To recover R-M systems we followed the methodology described in Oliveira et al. (2016)39. This approach identifies systems by searching genes encoding MTase and REase components from the REBASE database40 using Blast + blastp41 with an identity > 80% and query coverage > 80% as inclusion thresholds. Complete R-M systems of each type were considered if MTase and REase components were less than four genes apart each other. Functional categories were assigned to annotated genes using the egg NOG database14 and the eggNOG-mapper tool42.
References
Kaakoush, N. O., Castaño-Rodríguez, N., Mitchell, H. M. & Man, S. M. Global epidemiology of Campylobacter infection. Clin. Microbiol. Rev. 28, 687–720 (2015).
Fitzgerald, C. et al. Campylobacter fetus subsp. testudinum subsp. nov., isolated from humans and reptiles. Int. J. Syst. Evolut. Microbiol. 64, 2944–2948 (2014).
Iraola, G. et al. Distinct Campylobacter fetus lineages adapted as livestock pathogens and human pathobionts in the intestinal microbiota. Nat. Commun. 8, 1367 (2017).
Iraola, G. et al. A rural worker infected with a bovine-prevalent genotype of Campylobacter fetus subsp. fetus supports zoonotic transmission and inconsistency of MLST and whole-genome typing. Eur. J. Clin. Microbiol. Infect. Dis. 34, 1593–1596 (2015).
Costa, D. et al. Polyclonal Campylobacter fetus infections among unrelated patients, Montevideo, Uruguay, 2013–2018. Clin. Infect. Dis. 70, 1236–1239 (2020).
Gebhart, C. et al. Campylobacter hyointestinalis (new species) isolated from swine with lesions of proliferative ileitis. Am. J. Vet. Res. 44, 361–367 (1983).
Man, S. M. The clinical importance of emerging Campylobacter species. Nat. Rev. Gastroenterol. Hepatol. 8, 669–685 (2011).
Miller, W. G. et al. Multilocus sequence typing methods for the emerging Campylobacter Species C. hyointestinalis, C. lanienae, C. sputorum, C. concisus, and C. curvus. Front. Cell. Infect. Microbiol. 2, 45 (2012).
On, S. L. et al. Campylobacter hyointestinalis subsp. lawsonii subsp. nov., isolated from the porcine stomach, and an emended description of Campylobacter hyointestinalis. Int. J. Syst. Evolut. Microbiol. 45, 767–774 (1995).
Harrington, C. S. & On, S. L. W. Extensive 16S rRNA gene sequence diversity in Campylobacter hyointestinalis strains: Taxonomic and applied implications. Int. J. Syst. Evol. Microbiol. 49, 1171–1175 (1999).
Salama, S. M. et al. Pulsed-field gel electrophoresis for epidemiologic studies of Campylobacter hyointestinalis isolates. J. Clin. Microbiol. 30, 1982–1984 (1992).
On, S. L. W. et al. Identification and intra-specific heterogeneity of Campylobacter hyointestinalis based on numerical analysis of electrophoretic protein profiles. Syst. Appl. Microbiol. 16, 37–46 (1993).
Konstantinidis, K. T. & Tiedje, J. M. Genomic insights that advance the species definition for prokaryotes. PNAS 102, 2567–2572 (2005).
Huerta-Cepas, J. et al. eggNOG 5.0: A hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res. 47, D309–D314 (2019).
Wilkinson, D. A. et al. Updating the genomic taxonomy and epidemiology of Campylobacter hyointestinalis. Sci. Rep. 8, 1–12 (2018).
Gilbert, M. J. et al. Comparative genomics of Campylobacter fetus from reptiles and mammals reveals divergent evolution in host-associated lineages. Genome Biol. Evolut. 8, 2006–2019 (2016).
Sheppard, S. K. et al. Progressive genome-wide introgression in agricultural Campylobacter coli. Mol. Ecol. 22, 1051–1064 (2013).
Sheppard, S. K. et al. Cryptic ecology among host generalist Campylobacter jejuni in domestic animals. Mol. Ecol. 23, 2442–2451 (2014).
Morley, L. et al. Gene loss and lineage-specific restriction-modification systems associated with niche differentiation in the Campylobacter jejuni sequence type 403 clonal complex. Appl. Environ. Microbiol. 81, 3641–3647 (2015).
Pearson, B. M. et al. Differential distribution of type II CRISPR-Cas systems in agricultural and nonagricultural Campylobacter coli and Campylobacter jejuni isolates correlates with lack of shared environments. Genome Biol. Evolut. 7, 2663–2679 (2015).
Sheppard, S. K. & Maiden, M. C. J. The evolution of Campylobacter jejuni and Campylobacter coli. Cold Spring Harbor Perspect. Biol. 7, a018119 (2015).
Dearlove, B. L. et al. Rapid host switching in generalist Campylobacter strains erodes the signal for tracing human infections. ISME J. 10, 721–729 (2016).
Costa, D. & Iraola, G. Pathogenomics of emerging Campylobacter species. Clin. Microbiol 32, e00072-e118 (2019).
Lévesque, S. et al. Campylobacteriosis in urban versus rural areas: A case-case study integrated with molecular typing to validate risk factors and to attribute sources of infection. PLoS ONE 8, e83731 (2013).
Zerbino, D. R. & Birney, E. Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 18, 821–829 (2008).
Boetzer, M. et al. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics 27, 578–579 (2011).
Boetzer, M. & Pirovano, W. Toward almost closed genomes with GapFiller. Genome Biol. 13, R56 (2012).
Seemann, T. Prokka: Rapid prokaryotic genome annotation. Bioinformatics 30, 2068–2069 (2014).
Piccirillo, A. et al. Campylobacter geochelonis sp. nov. isolated from the western Hermann’s tortoise (Testudo hermanni hermanni). Int. J. Syst. Evolut. Microbiol. 66, 3468–3476 (2016).
Ferrés, I. & Iraola, G. MLSTar: Automatic multilocus sequence typing of bacterial genomes in R. PeerJ 6, e5098 (2018).
Darling, A. E. et al. progressiveMauve: Multiple genome alignment with gene gain, loss and rearrangement. PLoS ONE 5, e11147 (2010).
Croucher, N. J. et al. Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins. Nucleic Acids Res. 43, e15–e15 (2015).
Thibeaux, R. et al. Deciphering the unexplored Leptospira diversity from soils uncovers genomic evolution to virulence. Microb. Genomics 4, e000144 (2018).
Finn, R. D. et al. The Pfam protein families database: towards a more sustainable future. Nucleic Acids Res. 44, D279–D285 (2016).
Eddy, S. R. Accelerated profile HMM searches. PLoS Comput. Biol. 7, e1002195 (2011).
Enright, A. J. et al. An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res. 30, 1575–1584 (2002).
Dixon, P. VEGAN, a package of R functions for community ecology. J. Veg. Sci. 14, 927–930 (2003).
Covin, D. et al. CRISPRCasFinder, an update of CRISPRFinder, includes a portable version, enhanced performance and integrates search of Cas proteins. Nucleic Acids Res. 46, W246–W251 (2018).
Oliveria, P. H. et al. Regulation of genetic flux between bacteria by restriction-modification systems. PNAS 113, 5658–5663 (2016).
Roberts, R. J. et al. REBASE—A database for DNA restriction and modification: enzymes, genes and genomes. Nucleic Acids Res. 43, D298–D299 (2015).
Camacho, C. et al. BLAST+: Architecture and applications. BMC Bioinform. 10, 421 (2009).
Huerta-Cepas, J. et al. Fast genome-wide functional annotation through orthology assignment by eggNOG-mapper. Mol. Biol. Evol. 34, 2115–2122 (2017).
Miller, W. G., et al. Complete genome sequences of Campylobacter hyointestinalis subsp. hyointestinalis strain LMG 9260 and C. hyointestinalis subsp. lawsonii strain LMG 15993. Genome Announc. 4, e00665–16 (2016).
Bian, X. et al. Draft genome sequences of nine Campylobacter hyointestinalis subsp. lawsonii strains. Microbiol. Resour. Announc. 7, e0101618 (2018).
Acknowledgements
We acknowledge the Pathogen Informatics and Sequencing groups at the Wellcome Trust Sanger Institute for technical support. We also thank to Mark Stares and Hilary Browne at the Host-Microbiota Interactions Laboratory, Wellcome Trust Sanger Institute, for their technical support. G.I. and D.C. are supported by the Agencia Nacional de Investigación e Innovación (ANII, Uruguay) grant FSSA_X_2014_1_105252. This work received partial financial support from Fondo de Convergencia Estructural del Mercosur (FOCEM) grant COF 03/11, the Wellcome Trust grant number 098051 and the Medical Research Council UK grant number PF451.
Author information
Authors and Affiliations
Contributions
G.I. conceived the idea and designed the experiments. G.I., D.C., I.F., P.F., S.L. and N.K. performed the experiments and analyzed the data. S.L. collected and provided samples and T.D.L. contributed to data analysis and interpretation. G.I. and D.C. wrote the paper with suggestions from all authors. All authors approved the manuscript prior to submission.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Costa, D., Lévesque, S., Kumar, N. et al. Pangenome analysis reveals genetic isolation in Campylobacter hyointestinalis subspecies adapted to different mammalian hosts. Sci Rep 11, 3431 (2021). https://doi.org/10.1038/s41598-021-82993-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-021-82993-9
This article is cited by
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.