Integrated Mapping, Chromosomal Sequencing and Sequence Analysis of Cryptosporidium parvum

  1. Alan T. Bankier1,
  2. Helen F. Spriggs1,
  3. Berthold Fartmann2,
  4. Bernard A. Konfortov1,
  5. Martin Madera1,
  6. Christine Vogel1,
  7. Sarah A. Teichmann1,
  8. Al Ivens3, and
  9. Paul H. Dear1,4
  1. 1 Medical Research Council (MRC) Laboratory of Molecular Biology, Cambridge CB 2 2QH, UK
  2. 2 MWG Biotech, D-85560 Ebersberg, Germany
  3. 3 The Wellcome Trust Sanger Institute, The Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK

Abstract

The apicomplexan Cryptosporidium parvum is one of the most prevalent protozoan parasites of humans. We report the physical mapping of the genome of the Iowa isolate, sequencing and analysis of chromosome 6, and ∼0.9 Mbp of sequence sampled from the remainder of the genome. To construct a robust physical map, we devised a novel and general strategy, enabling accurate placement of clones regardless of clone artefacts. Analysis reveals a compact genome, unusually rich in membrane proteins. As in Plasmodium falciparum, the mean size of the predicted proteins is larger than that in other sequenced eukaryotes. We find several predicted proteins of interest as potential therapeutic targets, including one exhibiting similarity to the chloroquine resistance protein of Plasmodium. Coding sequence analysis argues against the conventional phylogenetic position of Cryptosporidium and supports an earlier suggestion that this genus arose from an early branching within the Apicomplexa. In agreement with this, we find no significant synteny and surprisingly little protein similarity with Plasmodium. Finally, we find two unusual and abundant repeats throughout the genome. Among sequenced genomes, one motif is abundant only in C. parvum, whereas the other is shared with (but has previously gone unnoticed in) all known genomes of the Coccidia and Haemosporida. These motifs appear to be unique in their structure, distribution and sequences.

Footnotes

  • [Supplemental material is available online at www.genome.org. The sequence data from this study have been submitted to EMBL. The sequence of Chromosome 6 appears under accession number BX526834; the end-sequences of the PAC clones appear under accession numbers AJ561222–AJ563278.]

    Article published online before print in July 2003.

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1555203.

  • 4 Corresponding author. E-MAIL phd{at}mrc-lmb.cam.ac.uk; FAX 44 1223 412178.

    • Accepted May 19, 2003.
    • Received February 28, 2003.
| Table of Contents

Preprint Server