Abstract
The adaptive immune system uses V genes for antigen recognition. However, the evolutionary diversification and selection processes within and across species and orders remain poorly understood. Here, we studied the amino acid (AA) sequences obtained from the translated in-frame V exons of immunoglobulins (IG) and T cell receptors (TR) from 16 primate species whose genomes have been sequenced. Multi-species comparative analysis supports the hypothesis that V genes in the IG loci undergo birth/death processes, thereby permitting rapid adaptability over evolutionary time. We also show that multiple cladistic groupings exist in the TRA (35 clades) and TRB (25 clades) V gene loci and that each primate species typically contributes at least one V gene to each of these clades. The results demonstrate that IG V genes and TR V genes have quite different evolutionary pathways; multiple duplications can explain the IG loci results, while coevolutionary pressures can explain the phylogenetic results of the TR V gene loci. Our results suggest that there exist evolutionary relationships between V gene clades in the TRA and TRB loci. Due to the long-standing preservation of these clades, such genes may have specific and necessary roles for the viability of a species.
Similar content being viewed by others
References
Berman JE, Mellis SJ, Pollock R, Smith CL, Suh H, Heinke B, Kowal C, Surti U, Chess L, Cantor CR et al (1988) Content and organization of the human Ig VH locus: definition of three new VH families and linkage to the Ig CH locus. EMBO J 7(3):727–738
Brack C, Hirama M, Lenhard-Schuller R, Tonegawa S (1978) A complete immunoglobulin gene is created by somatic recombination. Cell 15(1):1–14
Breiman L (2001) Random forests. Mach Learn 45(1):4–32
Bylander T (2002) Estimating generalization error on two-class datasets using out-of-bag estimates. J Mach Learn 48(1–3):287–297
Davis MM, Bjorkman PJ (1988) T-cell antigen receptor genes and T-cell recognition. Nature 334(6181):395–402. doi:10.1038/334395a0
Deng L, Langley RJ, Wang Q, Topalian SL, Mariuzza RA (2012) Structural insights into the editing of germ-line-encoded interactions between T-cell receptor and MHC class II by Valpha CDR3. Proc Natl Acad Sci U S A 109(37):14960–14965. doi:10.1073/pnas.1207186109
Fitch WM (2000) Homology a personal view on some of the problems. Trends Genet 16(5):227–231
Flajnik MF, Kasahara M (2010) Origin and evolution of the adaptive immune system: genetic events and selective pressures. Nat Rev Genet 11(1):47–59. doi:10.1038/nrg2703
Gambon Deza F, Sanchez Espinel C, Magadan Mompo S (2009) The immunoglobulin heavy chain locus in the reptile Anolis carolinensis. Mol Immunol 46(8–9):1679–1687. doi:10.1016/j.molimm.2009.02.019
Gambon-Deza F, Sanchez-Espinel C, Magadan-Mompo S (2010) Presence of an unique IgT on the IGH locus in three-spined stickleback fish (Gasterosteus aculeatus) and the very recent generation of a repertoire of VH genes. Dev Comp Immunol 34(2):114–122. doi:10.1016/j.dci.2009.08.011
Ghaffari SH, Lobb CJ (1991) Heavy chain variable region gene families evolved early in phylogeny. Ig complexity in fish. J Immunol 146(3):1037–1046
Giudicelli V, Lefranc MP (1999) Ontology for immunogenetics: the IMGT-ONTOLOGY. Bioinformatics 15(12):1047–1054
Giudicelli V, Lefranc MP (2012) IMGT-ONTOLOGY 2012. Front Genet 3:79. doi:10.3389/fgene.2012.00079
Giudicelli V, Chaume D, Lefranc MP (2005) IMGT/GENE-DB: a comprehensive database for human and mouse immunoglobulin and T cell receptor genes. Nucleic Acids Res 33(Database issue):D256–261. doi:10.1093/nar/gki010
Gouy M, Guindon S, Gascuel O (2010) SeaView version 4: a multiplatform graphical user interface for sequence alignment and phylogenetic tree building. Mol Biol Evol 27(2):221–224. doi:10.1093/molbev/msp259
Guo Y, Bao Y, Wang H, Hu X, Zhao Z, Li N, Zhao Y (2011) A preliminary analysis of the immunoglobulin genes in the African elephant (Loxodonta africana). PLoS One 6(2):e16889. doi:10.1371/journal.pone.0016889
Hassanin A, Golub R, Lewis SM, Wu GE (2000) Evolution of the recombination signal sequences in the Ig heavy-chain variable region locus of mammals. Proc Natl Acad Sci U S A 97(21):11415–11420. doi:10.1073/pnas.97.21.11415
Hughes AL (1994) The evolution of functionally novel proteins after gene duplication. Proc Biol Sci R Soc 256(1346):119–124. doi:10.1098/rspb.1994.0058
Janeway CA Jr (1992) The immune system evolved to discriminate infectious nonself from noninfectious self. Immunol Today 13(1):11–16. doi:10.1016/0167-5699(92)90198-g
Janeway CA, Jr., Travers P, Walport M, Shlomchik MJ (2005) Immunobiology Garland Science
Kirkham PM, Mortari F, Newton JA, Schroeder HW Jr (1992) Immunoglobulin VH clan and family identity predicts variable domain structure and may influence antigen binding. EMBO J 11(2):603–609
Lefranc MP (2001) Nomenclature of the human immunoglobulin heavy (IGH) genes. Exp Clin Immunogenet 18(2):100–116
Lefranc MP (2011a) From IMGT-ONTOLOGY DESCRIPTION axiom to IMGT standardized labels: for immunoglobulin (IG) and T cell receptor (TR) sequences and structures. Cold Spring Harb Protoc 2011(6):614–626. doi:10.1101/pdb.ip83
Lefranc MP (2011b) IMGT, the international ImMunoGeneTics information system. Cold Spring Harb Protoc 2011(6):595–603. doi:10.1101/pdb.top115
Lefranc MP (2014) Immunoglobulin and T cell receptor genes: IMGT ((R)) and the birth and rise of immunoinformatics. Front Immunol 5:22. doi:10.3389/fimmu.2014.00022
Lefranc MP, Lefranc G (2001a) The Immunoglobulin Factsbook. Academic, San Diego
Lefranc MP, Lefranc G (2001b) The T cell receptor FactsBook. Academic, San Diego
Lefranc MP, Pommie C, Ruiz M, Giudicelli V, Foulquier E, Truong L, Thouvenin-Contet V, Lefranc G (2003) IMGT unique numbering for immunoglobulin and T cell receptor variable domains and Ig superfamily V-like domains. Dev Comp Immunol 27(1):55–77
Lefranc MP, Giudicelli V, Ginestoux C, Jabado-Michaloud J, Folch G, Bellahcene F, Wu Y, Gemrot E, Brochet X, Lane J, Regnier L, Ehrenmann F, Lefranc G, Duroux P (2009) IMGT, the international ImMunoGeneTics information system. Nucleic Acids Res 37(Database issue):D1006–1012. doi:10.1093/nar/gkn838
Liu B, Wang X, Chen Q, Dong Q, Lan X (2012) Using amino acid physicochemical distance transformation for fast protein remote homology detection. PLoS One 7(9):e46633. doi:10.1371/journal.pone.0046633
Marrack P, Scott-Browne JP, Dai S, Gapin L, Kappler JW (2008) Evolutionarily conserved amino acids that control TCR-MHC interaction. Annu Rev Immunol 26:171–203. doi:10.1146/annurev.immunol.26.021607.090421
Miller RD, Grabe H, Rosenberg GH (1998) V (H) repertoire of a marsupial (Monodelphis domestica). J Immunol 160(1):259–265
Niku M, Liljavirta J, Durkin K, Schroderus E, Iivanainen A (2012) The bovine genomic DNA sequence data reveal three IGHV subgroups, only one of which is functionally expressed. Dev Comp Immunol 37(3–4):457–461. doi:10.1016/j.dci.2012.02.006
Olivieri D, Faro J, von Haeften B, Sanchez-Espinel C, Gambon-Deza F (2013) An automated algorithm for extracting functional immunologic V-genes from genomes in jawed vertebrates. Immunogenetics 65(9):691–702. doi:10.1007/s00251-013-0715-8
Olivieri DN, von Haeften B, Sanchez-Espinel C, Faro J, Gambon-Deza F (2014) Genomic V exons from whole genome shotgun data in reptiles. Immunogenetics 66(7–8):479–492. doi:10.1007/s00251-014-0784-3
Patterson C (1988) Homology in classical and molecular biology. Mol Biol Evol 5(6):603–625
Perelman P, Johnson WE, Roos C, Seuanez HN, Horvath JE, Moreira MA, Kessing B, Pontius J, Roelke M, Rumpler Y, Schneider MP, Silva A, O’Brien SJ, Pecon-Slattery J (2011) A molecular phylogeny of living primates. PLoS Genet 7(3):e1001342. doi:10.1371/journal.pgen.1001342
Price MN, Dehal PS, Arkin AP (2010) FastTree 2—approximately maximum-likelihood trees for large alignments. PLoS One 5(3):e9490. doi:10.1371/journal.pone.0009490
Sievers F, Higgins DG (2014) Clustal Omega, accurate alignment of very large numbers of sequences. Methods Mol Biol 1079:105–116. doi:10.1007/978-1-62703-646-7_6
Suarez E, Magadan S, Sanjuan I, Valladares M, Molina A, Gambon F, Diaz-Espada F, Gonzalez-Fernandez A (2006) Rearrangement of only one human IGHV gene is sufficient to generate a wide repertoire of antigen specific antibody responses in transgenic mice. Mol Immunol 43(11):1827–1835. doi:10.1016/j.molimm.2005.10.015
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S (2011) MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol 28(10):2731–2739. doi:10.1093/molbev/msr121
Tonegawa S (1983) Somatic generation of antibody diversity. Nature 302(5909):575–581
Waterhouse AM, Procter JB, Martin DM, Clamp M, Barton GJ (2009) Jalview Version 2—a multiple sequence alignment editor and analysis workbench. Bioinformatics 25(9):1189–1191. doi:10.1093/bioinformatics/btp033
Zhang C (2012) Ensemble machine learning methods and applications. Springer, New York
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Olivieri, D.N., Gambón-Deza, F. V genes in primates from whole genome sequencing data. Immunogenetics 67, 211–228 (2015). https://doi.org/10.1007/s00251-015-0830-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00251-015-0830-9