The ANISEED database: Digital representation, formalization, and elucidation of a chordate developmental program

  1. Patrick Lemaire1,23
  1. 1 Institut de Biologie du Développement de Marseille-Luminy, UMR6216 CNRS, F-13288 Marseille, Cedex 9, France;
  2. 2 Department of Biology, Faculty of Science and Engineering, Konan University, Higashinada, Kobe 658-8501, Japan;
  3. 3 Equipe INRA U1126 “Morphogenèse du système nerveux des Chordés” UPR 3294, CNRS, Institut de Neurosciences A. Fessard, F-91198 Gif-sur-Yvette, France;
  4. 4 University of California, Berkeley, Department of Molecular & Cell Biology, Berkeley, California 94720-3200, USA;
  5. 5 CRFB, Génopôle Marseille-Nice, Université de la Méditerranée, F-13288 Marseille, Cedex 9, France;
  6. 6 UMR7009, CNRS/UPMC, Station Zoologique, Observatoire Océanologique, F-06230 Villefranche-sur-mer, France;
  7. 7 Wellcome Trust/Cancer Research UK Gurdon Institute, The Henry Wellcome Building of Cancer and Developmental Biology, University of Cambridge, Cambridge CB2 1QN, United Kingdom;
  8. 8 Institute of Socio-Arts and Sciences, University of Tokushima, Tokushima 770-8502, Japan;
  9. 9 Department of Bioscience and Informatics, Faculty of Science and Technology, Keio University, Yokohama, 223-8522 Japan;
  10. 10 Department of Applied Science, Kochi University, Kochi 780-8520, Japan;
  11. 11 Department of Zoology, Graduate School of Science, Kyoto University, Kyoto 606-8502, Japan;
  12. 12 Institute for Bioinformatics Research and Development, Japan Science and Technology Agency, Tokyo 102-0081, Japan
    • 14 Present addresses: Institut de Génétique et de Biologie Moléculaire et Cellulaire, 1 rue Laurent Fries BP 10142 F-67404 Illkirch France;

    • 15 European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK;

    • 16 MGCB, University of Chicago, 920 E. 58th St. Chicago, IL 60637, USA;

    • 17 EMBL–Heidelberg, Meyerhofstraße 1, D-69117 Heidelberg, Germany;

    • 18 Australian Regenerative Medicine Institute, Monash University Clayton Campus, 1 Wellington Road, Clayton VIC 3168, Australia;

    • 19 Programme Cartes d'Identité des Tumeurs (CIT), Ligue Nationale Contre le Cancer, 14 rue Corvisart, 75013, Paris, France;

    • 20 NYU, Center for Genomics and Systems Biology, Department of Biology, 1009 Silver Center, 100 Washington Square East, New York, NY 10003-6688;

    • 21 Division of Systems Biology, MRC National Institute for Medical Research, The Ridgeway, London, NW7 1AA, UK;

    • 22 Marine Genomics Unit, Okinawa Institute of Science and Technology Promotion Corporation, Uruma, Okinawa 904-2234, Japan.

    1. 13 These authors contributed equally to this work.

    Abstract

    Developmental biology aims to understand how the dynamics of embryonic shapes and organ functions are encoded in linear DNA molecules. Thanks to recent progress in genomics and imaging technologies, systemic approaches are now used in parallel with small-scale studies to establish links between genomic information and phenotypes, often described at the subcellular level. Current model organism databases, however, do not integrate heterogeneous data sets at different scales into a global view of the developmental program. Here, we present a novel, generic digital system, NISEED, and its implementation, ANISEED, to ascidians, which are invertebrate chordates suitable for developmental systems biology approaches. ANISEED hosts an unprecedented combination of anatomical and molecular data on ascidian development. This includes the first detailed anatomical ontologies for these embryos, and quantitative geometrical descriptions of developing cells obtained from reconstructed three-dimensional (3D) embryos up to the gastrula stages. Fully annotated gene model sets are linked to 30,000 high-resolution spatial gene expression patterns in wild-type and experimentally manipulated conditions and to 528 experimentally validated cis-regulatory regions imported from specialized databases or extracted from 160 literature articles. This highly structured data set can be explored via a Developmental Browser, a Genome Browser, and a 3D Virtual Embryo module. We show how integration of heterogeneous data in ANISEED can provide a system-level understanding of the developmental program through the automatic inference of gene regulatory interactions, the identification of inducing signals, and the discovery and explanation of novel asymmetric divisions.

    Footnotes

    • Received March 24, 2010.
    • Accepted June 30, 2010.

    Freely available online through the Genome Research Open Access option.

    | Table of Contents
    OPEN ACCESS ARTICLE

    Preprint Server