MusMorph, a database of standardized mouse morphology data for morphometric meta-analyses

Devine, Jay; Vidal-García, Marta; Liu, Wei; Neves, Amanda; Lo Vercio, Lucas D.; Green, Rebecca M.; Richbourg, Heather A.; Marchini, Marta; Unger, Colton M.; Nickle, Audrey C.; Radford, Bethany; Young, Nathan M.; Gonzalez, Paula N.; Schuler, Robert E.; Bugacov, Alejandro; Rolian, Campbell; Percival, Christopher J.; Williams, Trevor; Niswander, Lee; Calof, Anne L.; Lander, Arthur D.; Visel, Axel; Jirik, Frank R.; Cheverud, James M.; Klein, Ophir D.; Birnbaum, Ramon Y.; Merrill, Amy E.; Ackermann, Rebecca R.; Graf, Daniel; Hemberger, Myriam; Dean, Wendy; Forkert, Nils D.; Murray, Stephen A.; Westerberg, Henrik; Marcucio, Ralph S.; Hallgrímsson, Benedikt

doi:10.1038/s41597-022-01338-x

Download PDF

Data Descriptor
Open access
Published: 25 May 2022

MusMorph, a database of standardized mouse morphology data for morphometric meta-analyses

Scientific Data volume 9, Article number: 230 (2022) Cite this article

2980 Accesses
1 Citations
6 Altmetric
Metrics details

Subjects

An Author Correction to this article was published on 28 June 2023

This article has been updated

Abstract

Complex morphological traits are the product of many genes with transient or lasting developmental effects that interact in anatomical context. Mouse models are a key resource for disentangling such effects, because they offer myriad tools for manipulating the genome in a controlled environment. Unfortunately, phenotypic data are often obtained using laboratory-specific protocols, resulting in self-contained datasets that are difficult to relate to one another for larger scale analyses. To enable meta-analyses of morphological variation, particularly in the craniofacial complex and brain, we created MusMorph, a database of standardized mouse morphology data spanning numerous genotypes and developmental stages, including E10.5, E11.5, E14.5, E15.5, E18.5, and adulthood. To standardize data collection, we implemented an atlas-based phenotyping pipeline that combines techniques from image registration, deep learning, and morphometrics. Alongside stage-specific atlases, we provide aligned micro-computed tomography images, dense anatomical landmarks, and segmentations (if available) for each specimen (N = 10,056). Our workflow is open-source to encourage transparency and reproducible data collection. The MusMorph data and scripts are available on FaceBase (www.facebase.org, https://doi.org/10.25550/3-HXMC) and GitHub (https://github.com/jaydevine/MusMorph).

Measurement(s)	mouse anatomy • brain morphology • transgenic mouse models • developmental stage • craniofacial region
Technology Type(s)	micro-computed tomography • image registration • anatomical landmarks • micro-computed tomography • gene knockout
Factor Type(s)	strain • genotype • treatment • sex • stage
Sample Characteristic - Organism	Mus musculus

Sex differences orchestrated by androgens at single-cell resolution

Article 10 April 2024

Three million images and morphological profiles of cells treated with matched chemical and genetic perturbations

Article Open access 09 April 2024

Single-cell long-read sequencing-based mapping reveals specialized splicing patterns in developing and adult mouse and human brain

Article Open access 09 April 2024

Background & Summary

Understanding how genes, development, and the environment produce variation in complex morphological traits is a core challenge in biology with evolutionary and clinical implications. Explanations for the generation of variation tend to cohere around the genotype-phenotype map concept. Genetic variation and genetic effects, like epistasis and pleiotropy, drive variation in developmental processes that act at different times and scales in anatomical context^1,2,3. Specific developmental and genetic mechanisms then operate alongside embedded mechanisms, such as nonlinearities^4,5 and gene redundancy⁶, to modulate these effects to express a phenotype^7,8,9. Despite recent insights into these phenomena, the developmental-genetic basis for morphological variation remains largely unknown, as there are likely many overlapping and coordinated mechanisms involved, each with relative contributions¹⁰. To help disentangle these mechanisms, it is important to build and integrate large phenotypic databases for model organisms^11,12,13,14. In this work, we present MusMorph, a database of standardized mouse morphology data for meta-analyses of morphological variability and variation, particularly in the craniofacial complex and brain.

The laboratory mouse is a useful model organism for studying the mechanisms of morphological variation because of the high genetic homology with humans, short gestation, and rich set of tools for manipulating the genome in a controlled environment. Unfortunately, phenotypic data are often biased by laboratory-specific data collection protocols. The International Mouse Phenotyping Consortium (IMPC, www.mousephenotype.org) was born out of a need to determine the relationship between genotype and phenotype with standardized phenotypic data. Using micro-computed tomography (µCT) and optical projection tomography, the consortium has studied the anatomy of mouse lines heterozygous or homozygous for a single gene mutation, particularly at embryonic day E9.5, E14.5-15.5, and E18.5^{15,16,17,18,19,20}. Less emphasis has been placed on µCT imaging and analysis of adults and mid-gestation (E10 to E11) mutants, where critical developmental events, like fusion of the craniofacial prominences, occur. Mouse lines with normal (non-pathological) levels of variation, such as recombinant inbred strains and outbred strains with high heterozygosity^21,22,23, have also been poorly characterized. Quantifying such variation is important, because it drives disease susceptibility and course of disease in humans.

Recently, model organism phenotyping has transitioned from manual linear measurements to fully automated computational pipelines. One common approach is voxel-based morphometry^24,25. Voxel-based morphometry is based on the analysis of deformation fields obtained via image registration. After spatially aligning images to an average atlas, the deformation fields can be quantitatively compared between groups on a voxel-wise basis to identify differences in morphology. Voxel-based morphometry remains a pillar of shape analysis, because it can localize small regions of shape change without any a priori knowledge of the anatomy, but it is prone to the multiple testing problem^26,27. Another approach is atlas-based geometric morphometrics, which instead uses registration fields to automatically derive landmarks, or Cartesian coordinate points that are homologous across samples. Geometric morphometrics is central to evolutionary biology and developmental biology, among other fields, because landmarks allow for statistically tractable quantifications of morphological variation, as well as intuitive visualizations²⁸. These advantages continue to fuel development of novel geometric morphometric pipelines and extensions^{29,30,31,32,33}. Yet large-scale morphometric analyses remain rare due to the sparsity of standardized landmark data.

Here, we introduce MusMorph, a database of standardized mouse morphology data generated with an open-source, atlas-based phenotyping pipeline that integrates techniques from image registration, deep learning, and morphometrics. We compiled the database (N = 10,056) using µCT scans of mice from a variety of strain/genotype combinations and developmental stages, including E10.5, E11.5, E14.5, E15.5, E18.5, and adulthood. Most of MusMorph is composed of head morphology data, but there are also whole-body embryo data for different integrative analyses. We provide (1) a developmental atlas for each timepoint; (2) a rigidly aligned and preprocessed µCT scan, dense anatomical landmarks, and segmentations (if available) for each specimen; (3) a set of scripts for transforming and comparing an input scan to an atlas; (4) an approach to validate the transformed landmark data and optimize it, if needed. To ensure reproducibility and data sharing, we make the data freely accessible from FaceBase³⁴ (www.facebase.org, https://doi.org/10.25550/3-HXMC)³⁵ and our code from GitHub (https://github.com/jaydevine/MusMorph). By incorporating substantial developmental and genetic variation alongside a rich set of metadata, MusMorph will enable standardized morphometric analyses of genotype-phenotypes to better understand the mechanistic basis for morphological variation.

Methods

Mice

We compiled mouse embryos and adults from numerous sources. The mouse lines for the E15.5 and E18.5 datasets were generated by the IMPC. These mice were produced and maintained on a C57BL/6N genetic background, with support from C57BL/6NJ, C57BL/6NTac or C57BL/6NCrl. More details about husbandry practices can be found at https://www.mousephenotype.org/impress. The mouse lines for the E10.5, E11.5, E14.5, and adult datasets were produced on a variety of genetic backgrounds at different institutions for studies of craniofacial variation. We hereafter refer to these lines as the Calgary mice, because they were ultimately imaged at the University of Calgary. Specific information about study protocols, such as husbandry practices and genotyping, should be gleaned from the MusMorph dataset summaries on FaceBase or the original studies themselves. Each dataset within the MusMorph project³⁵ on FaceBase represents a study or set of studies defined by a common study design that yielded similar mouse lines. Details about the experimental design were obtained from the original studies listed in the “Publication(s)” section of each dataset. In addition, we provide a supplementary comma-separated values (CSV) file (Study_Metadata.csv) in the project-wide metadata dataset³⁶ on FaceBase that lists the associated studies.

Micro-computed tomography

Sample preparation

Each IMPC embryo underwent a hydrogel stabilization protocol³⁷ to prepare for diffusible iodine-based contrast-enhanced µCT (diceCT)³⁸. This involved incubating the embryo in a hydrogel solution composed of 4% (wt) paraformaldehyde, 4% (wt/vol) acrylamide (Bio-Rad, USA), 0.05% (wt/vol) bis-acrylamide, 0.25% VA044 Initiator (Wako Chemicals, USA), 0.05% (wt/vol) saponin (Sigma-Aldrich, Germany), and phosphate-buffered saline at 4 °C for 3 days. Following incubation, the air in the specimen tube was replaced with nitrogen gas and the tube was immersed in a 37 °C water bath for 3 h. The whole embryo was then stained with a 0.025 N to 0.1 N Lugol’s iodine (I₂KI) solution (Sigma-Aldrich, Germany) for 24 h and mounted in agarose for diceCT. This approach has become a popular alternative to magnetic resonance imaging because it is faster, cheaper, and still offers remarkable contrast, allowing for high-throughput phenotyping of soft and hard tissue³⁸.

The Calgary embryos were subjected to different fixation and staining protocols. Each embryo acquired prior to 2017 was fixed in a solution of 4% (wt) paraformaldehyde and 5% (wt) glutaraldehyde. The specimen was next submerged in the CystoCon Ray II (iothalamate meglumine) contrast agent for one hour to stain external morphology. Embryos obtained after 2017 were put through a nucleic acid stabilization protocol that allows for examination of RNA in embryos scanned via µCT³⁹. Each embryo was fixed with the PAXgene Tissue FIX solution (Qiagen, PreAnalytics, cat #765312), incubated overnight (17 h + /- 1 h) at room temperature, then transferred to a solution of PAXgene Tissue STABILIZER prepared to manufacturer specification (Qiagen, PreAnalytics, cat #765512). For diceCT, each specimen was placed in a solution of PAXgene Tissue STABILIZER and 1% to 3.75% (wt/vol) Lugol’s iodine for 24 h. The head of every embryo was dissected before being mounted in either agarose or soft wax, which was covered by a microcentrifuge tube and infused with 50-100 µl of tissue stabilizer.

Each Calgary adult was set up with a standardized storage and mounting protocol. The mouse carcass was stored at −20 °C after euthanasia. Prior to the day of scanning, the mouse was retrieved and thawed overnight at 4 °C. The carcasses were then wrapped in foam and placed into a 37 mm diameter sample holder for µCT.

Imaging

The IMPC embryos were imaged at six centers, including the Baylor College of Medicine, Czech Center for Phenogenomics, MRC Harwell, Toronto Centre for Phenogenomics, The Jackson Laboratory, and University of California, Davis. A 3-D image of each iodine-stained whole embryo was acquired with a Skyscan 1172 µCT scanner (Bruker, Kontich, Belgium) at 100 kVp and 100 µA. The raw images were initially obtained with isotropic voxels but variable spatial dimensions and resolutions, ranging between 0.002 mm to 0.04 mm. Image projections were reconstructed into a digital stack using the Feldkamp algorithm⁴⁰.

The Calgary mice were imaged in the 3-D Morphometrics Center at the University of Calgary. A 3-D image of each stained embryo head was obtained with either (a) a Scanco µCT 35 scanner (Scanco Medical, Brütisellen, Switzerland) at 45 kV and 177 µA or (b) a ZEISS Xradia Versa 520 X-ray microscope (Carl Zeiss AG, Oberkochen, Germany) at 40–50 kV, 4-5 W, and 2 s exposure time. A 3-D image of each adult skull was acquired with either (a) a Scanco vivaCT 40 µCT scanner (Scanco Medical, Brütisellen, Switzerland), (b) a Scanco vivaCT 80 µCT scanner (Scanco Medical, Brütisellen, Switzerland), or (c) a Skyscan 1173 v1.6 µCT scanner (Bruker, Kontich, Belgium) at 55–80 kV and 60–145 µA. Like the IMPC data, these original images were obtained with isotropic voxels but variable spatial dimensions and resolutions. Embryo image resolutions ranged between 0.007 mm and 0.027 mm, whereas adult resolutions ranged between 0.035 mm and 0.044 mm. Image projections were reconstructed with the integrated Scanco software, the ZEISS XMReconstructor software, or the Skyscan NRecon v1.7.4.2 software.

Image preprocessing

We preprocessed each image to account for differences in image acquisition that would interfere with the atlas-based registration workflow described below (Fig. 1). The preprocessing scripts are provided in the MusMorph GitHub repository (https://github.com/jaydevine/MusMorph/tree/main/Preprocessing). In this preprocessing step, we first converted the reconstructed imaging data (.nrrd, .aim, .tiff) to the Montreal Neurological Institute (MNI) .mnc format using file conversion scripts written in Bash and Python (see AIM_to_MNC.sh, NII_to_MNC.sh, TIFF_to_MNC.sh, DCM_to_MNC.sh, and NRRD_to_MNC.py). As part of the open-source MINC library (http://bic-mni.github.io/man-pages/), the .mnc format is implemented using HDF5 (Hierarchical Data Format, version 5), which supports hierarchical data structure, internal compression, 64-bit file sizes, and other modern features⁴¹.

Staining artifacts, such as extreme intensity gradients and variable penetrance, can bias the image registration process. To minimize intensity inhomogeneities, we applied the N3 method⁴². Since many of the E15.5 images had background noise, where the stained scanning medium was indistinguishable from the anatomy, we employed a thresholding script in Bash (see Threshold.sh). This script computes a lower anatomical density threshold, masks the voxels above this bound and those in proximity via dilation, and equates all voxels outside the mask to 0. To ensure the image resolutions and dimensions were consistent with the atlas, we implemented an image resampling script in Bash (see Downsample_and_Correct.sh). We also used this script to control for differences in bit depth among scanners by including a min-max normalization, which scaled the embryo intensities between 0 and 1. Table 1 outlines the source of the image data, developmental stage, voxel dimensions, image resolutions, stage-specific sample sizes, and the presence or absence of atlas anatomical labels. Note that the E14.5 images were solely used to create another stage-specific atlas, as they are from a smaller, unpublished dataset.

Table 1 Summary of imaging data.

Full size table

Another essential step to all image registration workflows is the initialization, or a rigid alignment between an image pair. Using initialization scripts written in Bash (see Preprocessing.md) and R (Tag_Combine.R), we rigidly transformed each image to a stage-specific atlas or, if an atlas did not exist, an arbitrary but stage-specific reference image. To determine the rigid transformation matrices, we utilized a semi-automated or strictly automated approach, depending on anatomical orientation. If the mouse was scanned in a random orientation, we rendered a minimum threshold surface in MINC, then manually placed five homologous three-dimensional (3-D) landmarks at anatomical extrema (e.g., ears, nose, top of the head, and back of the head), resulting in an MNI tag point file (.tag) with landmark coordinates. Next, we concatenated the reference and arbitrary landmark matrices, and minimized their 3-D Euclidean distances via least squares. If the specimen was already roughly aligned to the reference image, we performed an automated, intensity-based rigid alignment using the full registration process outlined below (see the “Image Registration and Label Propagation” section). This intensity-based rigid alignment was also repeated for the manually aligned volumes to ensure consistency. With the rigid transformation matrices, we resampled each image into their stage-specific reference coordinate space using tri-linear interpolation.

Reference atlases

We generated a population average atlas for each stage, excluding E15.5 and adulthood, by spatially normalizing 25 µCT images of wildtype mice with a group-wise registration workflow^43,44 (Figs. 2 and 3). A nearly identical workflow was used to create the existing E15.5 and adult atlases. The atlas construction script is available in the MusMorph GitHub (https://github.com/jaydevine/MusMorph/tree/main/Processing) and is written in Python (see HiRes_Atlas.py or LoRes_Atlas.py). This script produces Bash scripts that can be executed automatically and in parallel on a compute cluster to maximize computational efficiency. Without massively parallel computing, the volumetric registrations would need to be performed sequentially, each requiring hours of computation and a large amount of memory. Before executing the workflow, the user must upload the initialized images and registration scripts to a compute cluster. In addition, the user needs to install a MINC Toolkit module onto the cluster via Docker (https://bic-mni.github.io/) or GitHub (https://github.com/BIC-MNI/minc-toolkit-v2), or define a pre-existing module, because the scripts utilize the open-source MINC software. An atlas can also be generated locally, but it will be significantly slower without massively parallel computing.

Spatial normalization involves an initial affine transformation for global alignment, followed by a deformable transformation for non-linear alignment. To account for global variation in location, orientation, and scale, we computed a series of multi-resolution (coarse to fine) affine transformations among the images by optimizing a cross-correlation objective function⁴⁵. Given that sample-wide pairwise registrations yield an improved affine template⁴⁶, or intensity average, we completed all possible (N = 25*24) pairwise affine registrations, then averaged the resulting transformation for each specimen. Using the averaged transformations, we resampled each initialized image into the affine coordinate space with tri-linear interpolation and averaged the resulting images to produce an affine template. To correct for local variation in shape, we computed a series of multi-resolution non-linear transformations with the ANIMAL (Automatic Nonlinear Image Matching and Anatomical Labelling) algorithm⁴⁷, again optimizing for cross-correlation. This iterative, four-step process involves non-linearly deforming each mouse to an evolving template at increasingly higher resolutions, with the first template being the affine average and the next three being improved versions of the non-linear average⁴⁸. The final product is a stage-specific average with excellent contrast and a high signal-to-noise ratio.

Since the goal of MusMorph was to aggregate landmark data for morphometrics, and our primary imaging data are head scans, we focused on labelling each atlas head surface with a standardized landmark configuration (Figs. 2 and 3). Specific information about the number of landmarks and their anatomical definitions can be found below in the “Data Records: Landmarks” section. To generate the landmarks, we first rendered a minimum density isosurface in MINC, which uses ITK’s marching cubes algorithm, and saved the 3-D rendering as a Stanford PLY (.ply) file. We then used 3D Slicer⁴⁹ or the MINC Toolkit to acquire a landmark configuration on each surface that provided a comprehensive representation of shape⁵⁰. For the embryos, we used 3D Slicer and the SlicerMorph extension³² to identify sparse landmarks and interpolate landmark patches of variable density in between, depending on the size of the area, resulting in dense coverage of the head. This also ensured that the semilandmark patches were equivalent, allowing for a morphospace into which all specimens may be superimposed. Note that we privilege the term equivalent over homologous here, because the developmental origin of dense tissue regions between fixed landmarks is more ambiguous. Because semilandmark patch equivalence was not a consideration for the adults, we landmarked the adult atlas in MINC using built-in display tools, again ensuring sparse and dense landmark coverage.

Shared developmental pathways lead to correlated morphological variation, or morphological integration^{51,52,53,54,55,56,57}. To enable analyses of integration, we added landmark configurations and segmentations to different regions of the adult skull atlas. Specifically, we manually segmented the adult cranium, mandible, and cranial endocast (i.e., a proxy for the brain) in MINC, then rendered these segmentations as isosurfaces before landmarking them with a dense configuration. Once again, the landmark details are described below in the “Data Records: Landmarks” section. The segmentations may be used for surface-based analyses⁵⁸, measures of size (e.g., volume or surface), or as masks to reduce the shape dimensionality of a voxel-based morphometry analysis. Unlike the adult atlas, the embryo atlases do not come with segmentations due to the scope of this work, apart from the pre-existing E15.5 atlas, which has 48 manually segmented structures (http://www.mouseimaging.ca/technologies/mouse_atlas/mouse_embryo_atlas.html).

Image registration and label propagation

We pairwise registered each image to their stage-specific atlas to obtain a composite (affine and non-linear) transformation for label propagation (Fig. 1). Like the atlas workflow described above, the registration scripts are available in the MusMorph GitHub (https://github.com/jaydevine/MusMorph/tree/main/Processing) and are written in Python (see HiRes_Pairwise.py or LoRes_Pairwise.py). The purpose once more is to produce Bash scripts en masse for massively parallel computing on a compute cluster due to the computational requirements of volumetric deformable registration and anatomical labelling. Only the initialized images and registration scripts need to be uploaded to the cluster to execute the workflow. While the pairwise registrations involved the same multi-resolution affine alignment described above, the non-linear alignment differed. Here, we implemented the geodesic SyN (Symmetric Normalization) algorithm⁵⁹, because it was previously validated for atlas-based landmarking and morphometrics of mouse models⁴⁴. The SyN registrations were optimized using cross-correlation. After registration, we used labelling scripts written in Bash and produced via Python (see Label_Propagation.py) to recover the non-linear transformations, concatenate them with the affine transformations, invert them, and propagate the atlas labels to the rigid space of each image.

Neural network shape optimization

Although top-performing registration algorithms provide an effective and generalizable way to automatically label anatomy, there are instances where outliers and problematic landmarks can alter shape representations. This is particularly true for model organisms, where mutant phenotypes may show little to no resemblance with an atlas. To demonstrate how biological signal can be restored, we implemented a supervised deep learning workflow available in the MusMorph GitHub (https://github.com/jaydevine/MusMorph/tree/main/Postprocessing), which employs scripts written in R and Julia (see GPA_and_Projection.R and Landmark_Optimization.jl)⁶⁰. Using a subset of 68 sparse adult craniofacial landmarks (N = 2,000) described in previous work^{61,62,63,64,65}, we trained a deep feedforward neural network to learn a domain-specific loss function that minimizes automated and manual shape differences. The sparse landmark numbers amenable to optimization (see Optimization_Order.csv)³⁶ are available on FaceBase. We focused on the adults because that was the only stage with a large existing set of homologous manual landmarks for training.

We tested the network predictions on a random subset (N = 500) of adult skulls described further in the “Technical Validation” section. To help others initialize the network without having to retrain it, we provide the adult network model (Calgary_Adult_Cranium_Model.bson) and weights (Calgary_Adult_Cranium_Weights.bson) in the Binary JSON (.bson) file format on GitHub. We also make available the optimized sparse shape predictions for the entire adult crania dataset (Adult_Cranium_Sparse_Landmarks.csv)³⁶. Although we focused on adults, this optimization strategy is generalizable, so other research groups with manual landmark data on any structure of the atlases may use the network architecture to improve outlier predictions.

Data Records

Specimen metadata

Each specimen in the MusMorph database³⁵ is associated with a rich set of identifiers to accommodate morphometric analyses using multiple factors and/or covariates. Alongside detailed metadata descriptions in FaceBase, we provide the specimen metadata as a supplementary CSV file (MusMorph_Metadata.csv)³⁶ for convenience and to include auxiliary fields. Table 2 enumerates the metadata and Table S1 summarizes the metadata distributions for each dataset on FaceBase.

Table 2 Summary of metadata identifiers.

Full size table

Figure 4a,b illustrates the distributions of sex, strain type, and genotype across the embryo and adult datasets. Sex is well-annotated for the E15.5, E18.5, and adult datasets, but is missing (“NA”) for many of the E10.5 and E11.5 specimens. While most of the embryo mouse models were produced on an isogenic inbred background, particularly C57BL/6N, strain diversity is a focal point of the adult datasets. Among the nine adult strain types provided, there are 98 unique background strains. The majority are recombinant inbred lines (e.g., the Collaborative Cross dataset⁶⁶), wild-derived crosses (e.g., the Hybrid dataset⁶⁷), and outbred lines (e.g., the Diversity Outbred dataset⁶⁸). We have included 459 unique genotypes for the embryo datasets, most of which derive from the IMPC dataset⁶⁹, as well as 179 genotypes for the adult datasets. A minority of specimens, including several embryos in the Ap2⁷⁰, B9d⁷¹, and Bulgy⁷² datasets as well as a few adults in the Brain-Face⁷³ dataset, have unknown genotypes (e.g., “-/-;NA” and “ + /-;NA” in double knockout designs or “NA” and “ + / + or + /-” in single knockouts) due to genotyping complications in the past. Specimens homozygous for a single gene mutation predominate the embryo datasets, whereas normal wildtype variants comprise the bulk of the adult datasets. Figure 4c shows the developmental stages represented in MusMorph. Of the 10,056 specimens processed, 40% are embryos and 60% are adults, many of which have just finished maturing around postnatal day 90. All specimens without a recorded stage (“NA”) are mature adults.

It is often desirable to compare mutants to their wildtype counterparts from the same sample because background strains vary. To preserve sample provenance where possible, specimens that are wildtype for a given mutation will have the same gene symbol as their heterozygote and homozygote littermates. For wildtype specimens without litter information, like the IMPC dataset, their genotypes are equated to background strain. Mouse strain nomenclature follows the MGI guidelines, except when the strain design is unknown and has no MGI ID (e.g., novel hybrid backcrosses). We also abbreviate genotypes for complex strain designs using MGI synonyms if available. Furthermore, while most wildtype specimens fall within the control experimental group, there are cases where they can exhibit mutant-like phenotypes and be categorized as such. One example in MusMorph is the artificial selection Longshanks dataset⁷⁴, which through many generations of artificial selection produced wildtype specimens with extreme tibia and craniofacial phenotypes^75,76.

We selected the above identifiers, because they tend to explain a significant amount of morphological variation in morphometric analyses. For instance, many structures in the mouse are sexually dimorphic, including the shape of the brain⁷⁷ and craniofacial complex⁷⁸, cortical bone size and strength⁷⁹, adipose tissue distribution⁸⁰, and feto-placental growth^81,82, to name a few. It is also known that classical laboratory strains, such as those in the Strain Comparison dataset⁸³, exhibit naturally occurring craniofacial phenotypes⁸⁴. Moreover, gene mutations can interact with a background strain via epistasis to produce different phenotypes^85,86,87, like those in the Spry dataset⁸⁸. Another key driver of variation is developmental stage, as differences in age often define a principal axis of allometric variation via correlations with size and/or shape^{89,90,91,92,93}. Given the ubiquity of allometry, these correlations can be found across most MusMorph datasets (Fig. 4d). Finally, numerous studies have reported the phenotypic outcomes of single gene mutations, environmental perturbations, and how zygosity modulates these effects^94,95,96. These identifiers have corresponding images, landmarks, segmentations, and deformation fields for morphological analyses (Fig. 4d,e).

Images

We provide the atlases and initialized images for each specimen in the MNI .mnc format. The naming convention for the atlas volumes is < Source > _ < Stage > _ < Anatomy > _Atlas .mnc. They are categorized as “Imaging Data” in the project-wide dataset³⁶ on FaceBase. The naming convention for the initialized volumes is < Biosample > .mnc, where Biosample is the name of the specimen in the metadata (see the “Specimen metadata” section). One exception is the naming convention for the subset of thresholded E15.5 images, which is < Biosample > _Thresh.mnc. These volumes are also categorized as “Imaging Data” across the MusMorph datasets on FaceBase. Each .mnc file has four key attributes: 1) a named dimension (xspace, yspace, zspace), 2) length (number of voxels on each dimension), step (resolution), and start (origin). MINC defines a voxel and world coordinate system, so one can move between them with the simple “voxeltoworld” and “worldtovoxel” MINC commands. If users want to convert between .mnc and different file formats (e.g., raw data, DICOM, NIfTI, Analyze, ECAT, TIFF, Concorde, VFF), there are a variety of other Bash commands available (http://bic-mni.github.io/man-pages/). While the raw IMPC images are freely accessible in the NRRD (.nrrd) format at https://www.mousephenotype.org/data/embryo, the raw Calgary images are available upon request in the AIM (.aim) or TIFF (.tiff) formats.

Transformations

For each pairwise registration, we recovered an inverted non-linear and composite (affine and non-linear) transformation. Given the file sizes of the non-linear deformation fields (~3 GB on average × 10,000 = 30 TB), we make the transformations available upon request. The deformation fields and composite transformations are in the MNI .mnc and .xfm formats. Each .mnc file shares the same image attributes described above with an additional named dimension called vector_dimension which describes the non-linear displacement vectors. Each .xfm file contains a header and affine transformation matrix. The naming convention for the deformation fields is < Biosample > _ANTS_nl_inverted_grid_0.mnc and < Biosample > _ANTS_nl_inverted.xfm, whereas the composite transformations are called < Biosample > _origtoANTS_nl_inverted_grid_0.mnc and < Biosample > _origtoANTS_nl_inverted.xfm. “ANTS” denotes the algorithm and “nl” stands for “non-linear”. Much like the images, the transformations for the subset of thresholded E15.5 volumes have “Thresh” appended to the < Biosample > name.

Non-linear deformation fields describe the displacements of each target image voxel to each reference image voxel⁹⁷. By calculating the Jacobian determinant J for every point $p(x,y,z)$ in the deformation field,

$$\det (J(x,y,z))=\left|\begin{array}{ccc}\frac{\partial x}{\partial x} & \frac{\partial y}{\partial x} & \frac{\partial z}{\partial x}\\ \frac{\partial x}{\partial y} & \frac{\partial y}{\partial y} & \frac{\partial z}{\partial y}\\ \frac{\partial x}{\partial z} & \frac{\partial y}{\partial z} & \frac{\partial z}{\partial z}\end{array}\right|$$

one can quantify the magnitude of morphological change at each voxel (Fig. 4e). A Jacobian determinant of 1 indicates no volume change, whereas determinants greater than 1 indicate volume expansion and determinants between 0 and 1 indicate volume shrinkage. These determinants can also be scaled and sheared with a composite transformation to examine voxel-wise differences in form. Jacobian determinants can be analyzed with voxel-wise tests, such as an ANOVA with a false-discovery rate correction, to map statistics onto the anatomy, a technique otherwise known as statistical parametric mapping (see VBM_Example.R). For example, in Fig. 4e, we use the RMINC R package (https://github.com/Mouse-Imaging-Centre/RMINC) to show significant voxel-wise changes (shrinkages) in form between Ghrhr mutants⁹⁸ and wildtype specimens, as well as voxel-wise variances in form associated with this mutation.

Landmarks

We labelled each atlas, and thus every registered mouse embryo and adult, with a standardized landmark configuration (Figs. 2 and 3). The atlas landmark files are named < Source > _ < Stage > _ < Anatomy > _Atlas_Landmarks.tag. They are stored as “Imaging Data” alongside the atlas volumes on FaceBase³⁶. The individual specimen landmark files are named < Biosample > _ < Anatomy > _Landmarks.tag and are similarly categorized as “Imaging Data” across FaceBase. The MNI.tag file format is an ASCII file which stores the coordinates of each landmark in the millimetric world space of the volume. Each .tag file has a header above an array of p landmarks (rows) in k dimensions (columns). These files can be imported into R individually or collectively as a 3-D array using the tag2array function in the custom morpho.tools.GM package⁹⁹. Alternatively, the user can employ the read.csv function in R to import a vectorized .csv file. For every developmental stage and anatomical region, we provide a landmark .csv file in the “Supplementary Files” section of the project-wide dataset on FaceBase³⁶, each of which contains a matrix of n specimens (rows) and $p\times k$ landmark coordinate dimensions (columns). Importantly, there are dense semi-landmarks and sparse fixed landmarks for local and global geometric morphometric analyses of craniofacial, endocast (brain), and mandible morphology. In Fig. 4d, for instance, we show craniofacial shape morphs along the first principal component (PC) in an adult subsample, as well as allometry regressions which relate craniofacial shape to size.

The embryo landmarks are equivalent across stages. Table S2 describes the sparse embryo landmarks and their biological definitions. Table S3 lists the embryo semi-landmark patches and their density, both of which are based on the sparse landmarks. The stage-specific semi-landmark patch files can also be found as tab-separated value (TSV) files on GitHub (https://github.com/jaydevine/MusMorph/tree/main/Postprocessing/Data/Atlases). Each embryo has 22 sparse homologous landmarks within their larger dense configuration. To perform a sparse landmark shape analysis, users may subset the first 22 rows of each 3-D array. Since there are three additional sparse landmarks for the E15.5 and E18.5 specimens, rows 23 to 25 may be included for stage-specific analyses or excluded for ontogenetic analyses.

The adult landmarks are simply equivalent within stage (i.e., all postnatal ages). Tables S4, S5, and S6 describe the sparse adult craniofacial, endocast, and mandible landmarks, respectively, as well as their biological definitions. While the adult curve semi-landmarks and surface semi-landmarks are not patch based, they can be slid and resampled using the R scripts on GitHub (see Calgary_Adult_Cranium_Sliding_Semis.R, Calgary_Adult_Mandible_Sliding_Semis.R, and Calgary_Adult_Endocast_Sliding_Semis.R) to mimic patches or any other structure. Much like the embryos, the sparse landmarks are the first 93, 12, and 19 rows of the cranium, endocast, and mandible 3-D arrays, respectively, and can be partitioned for a sparse shape analysis. If users want to generate new landmarks, such as internal landmarks or whole-body landmarks, they can use a script (see Label_Propagation.py), the inverted composite transformations (see the “Transformations” section), and a local or remote compute cluster to propagate the landmarks to an initialized image. To promote standardization, we encourage users to add new landmark subsets to the pre-existing configurations.

Segmentations

We provide segmentation labels for the E15.5 and adult atlases and specimens to support alternative morphological analyses, such as 3-D visualizations, voxel-based morphometry, volumetric size comparisons, and surface-based image processing pipelines. Other stages do not have segmentation labels due to the scope of this work. The segmentations follow the same naming conventions described above: < Source > _ < Stage > _Atlas_Segs .mnc and < Biosample > _Segs.mnc. The atlas segmentations are available as “Imaging Data” on FaceBase³⁶, as are the individual segmentation files across various MusMorph datasets. The published E15.5 atlas contains 48 whole body segmentations (http://www.mouseimaging.ca/technologies/mouse_atlas/mouse_embryo_atlas.html)⁴⁸, while the adult atlas comes with cranium, endocast, and mandible segmentations. Each label file is a .mnc volume of integers that matches the dimensionality of the image. To visualize the adult segmentations, for example, the user may load the atlas and label files together and input an integer of 1 to render the endocast, 2 for the cranium, and 3 for the mandible. As with new landmarks, there is the potential to resample new atlas segmentation labels into the initialized space of any image using the composite transformations (see the “Transformations” section) and a local or remote compute cluster (see Label_Propagation.py).

Technical Validation

Cross-correlation and root mean squared error

We computed intensity-based, pairwise registrations between each target image (I) and a reference atlas (J) by optimizing a normalized cross-correlation (NCC) similarity metric:

$$NCC\left(I,J\right)=\frac{{\sum }_{p\in \Omega }{I}_{(p)}{J}_{(p)}}{\sqrt{{\sum }_{p\in \Omega }{I}_{(p)}^{2}{\sum }_{p\in \Omega }{J}_{(p)}^{2}}}.$$

Normalized cross-correlation is calculated for all voxel positions p over a discrete domain ($p\in \Omega $). If the domain is the entire 3-D volume and $NCC\left(I,J\right)=1$, the deformed target image and reference image are perfectly aligned. To assess the quality of each registration, we recorded the normalized cross-correlation between each deformed target image and the atlas using code in the labelling scripts (see Label_Propagation.py). Unfortunately, it is difficult to know whether the final registration correlations are “good” or “bad” without relating them to the quality of the labels collected. We investigated the relationship between landmark root mean squared error and cross-correlation in the adult crania training set above to build a quality assessment model. Letting ${{\boldsymbol{x}}}_{{\boldsymbol{\ell }}}^{(I)}$ and ${\widehat{{{\boldsymbol{x}}}_{{\boldsymbol{\ell }}}}}^{(I)}$ denote the observed (manual) and predicted (automated) Euclidean vectors at landmark $\ell $ for a target image I, the root mean squared error for p landmarks is defined as

$$RMSE=\sqrt{\mathop{\sum }\limits_{\ell =1}^{p}\frac{1}{n}\mathop{\sum }\limits_{I=1}^{n}{\left({{\boldsymbol{x}}}_{{\boldsymbol{\ell }}}^{\left(I\right)}-{\widehat{{{\boldsymbol{x}}}_{{\boldsymbol{\ell }}}}}^{\left(I\right)}\right)}^{2}}.$$

After computing the root mean squared error for each specimen, we regressed these values on their corresponding cross-correlation values with linear, squared, and cubic cross-correlation terms (Fig. 5a). We found a statistically significant non-linear relationship (R² = 0.3, p < 0.001), such that cross-correlation values below 0.90 resulted in exponentially higher landmark errors. The average root mean squared error was 0.23 mm (95% CI ± 0.002 mm). This mean error is comparable to manual landmark intra-observer detection errors across the skull, which tend to be 0.25 mm or less^44,50. To verify registration quality across the rest of the database, we calculated cross-correlations for all specimens and stages. The mean cross-correlation values and their standard deviations for E10.5, E11.5, E15.5, E18.5, and adulthood were 0.94 ± 0.07, 0.96 ± 0.04, 0.93 ± 0.02, 0.93 ± 0.12, and 0.95 ± 0.02, respectively (Fig. 5a). These values are on par or higher than those reported in previous mouse registration studies¹⁰⁰ and speak to the reproducibility of this approach for analyzing variable morphology.

Covariance patterns and the mean shape

We quantified differences in covariance structure and the sample mean shape between our baseline automated landmarks, the optimized neural network landmarks, and the manual landmarks. To analyze covariance similarity, we projected the automated configurations into the manual PC space and correlated the uncentered PC scores. Figure 5b shows automated and manual correlations for the first 10 PCs (65.1% of the total variance). The average correlation within PCs for the baseline automated configurations was r = 0.6. This measure is biased downwards by lower order automated PCs, which tend to capture residual covariance of the first manual PC. The average correlation within PCs for the optimized automated configurations was r = 0.8, suggesting a restoration of signal among the major PCs.

To analyze mean shape deviations, we computed the grand mean shape for the manual landmarks and deformed it to the automated mean shapes via thin-plate spline. We then used the Morpho package¹⁰¹ in R to generate a deformation heatmap of Procrustes distances at every vertex of the deformed mesh (Fig. 5c). Procrustes distance is equivalent to the root mean squared error between two configurations in shape space. The total distance between the baseline automated mean and manual mean was 0.05, whereas the distance between the optimized automated mean and manual mean was 0.01. Visually, the baseline automated mean shape is largely indistinguishable from the manual mean shape, apart from several known problematic areas⁴⁴. First, the anterior extent of the frontonasal prominence is underestimated. Second, the shape of the foramen magnum is altered. Third, the lateral extent of the frontal bone is underestimated, likely because there are no sparse landmarks to interpolate there; however, this area is well-covered by the dense landmark configurations. Optimization successfully corrected errors at these problematic locations.

Outliers and stage-specific shape distributions

For each stage, we calculated the Procrustes distance between the mean shape and every configuration to obtain shape distributions and identify outliers (Fig. S1). We defined outlier shapes as those with a Procrustes distance above ${Q}_{3}+1.5\times IQR$, where ${Q}_{3}$ is the third quartile and $IQR$ is the interquartile range. Next, we displayed a minimum threshold isosurface of each outlier image alongside its landmarks to assess the errors. Landmark (.tag) files with clear head registration errors were removed. We observed most errant outlier landmark configurations in the E15.5 and E18.5 embryos, which underwent whole-body registrations. Since the orientation of the head relative to the body cannot be standardized in embryos, the whole-body registrations and inherent constraints of spatial normalization resulted in local registrations errors if their orientation was markedly different from the atlas.

Eliminating problematic outliers with distance distributions is a global solution but not always a local one. For example, if a landmark configuration hardly deviates from the mean on average, yet still has several landmarks with high detection errors, its distance to the mean could be small but its shape distinct. We performed a Principal Component Analysis on each stage-specific landmark dataset (Figs. S2 and S3) to identify such localized errors, assuming the first PC would capture distinctly problematic shapes. Figure. 6 shows the resulting shape distributions along PC1 for each stage. Here, we morphed a surface of the mean shape to each extreme via thin-plate spline and visualized the outputs. If the deformed surface was unusual, we displayed the image and landmarks as above, removed the errant landmark (.tag) file if necessary, and repeated this process until the prediction was correct.

Usage Notes

Why MusMorph?

The goal of MusMorph was to create a database of standardized mouse morphology data using an automated, high-throughput, and open-source phenotyping pipeline. By combining developmental atlases with a registration and deep learning framework, we constructed common coordinate systems into which various phenotypic data can be integrated. We primarily focused on acquiring morphological data, including anatomical landmarks, segmentations, and deformation fields, for the craniofacial complex and brain. However, we also generated whole body data for other integrative analyses of late-gestation embryos. To enable novel morphometric analyses of genotype-phenotype maps, we utilized mouse models with substantial developmental and genetic variation. Paired alongside other key metadata, such as strain and sex, MusMorph provides the community with a unique opportunity to disentangle the mechanistic basis for morphological variation.

While sparse landmarks are invaluable for geometric morphometrics, there are scenarios where local shape change can be poorly represented. More ambiguous anatomy, such as curves and surfaces, cannot be sufficiently captured with fixed anatomical landmarks, and semi-landmarking each specimen can be tedious and error-prone. Our standardized sparse and dense landmark datasets can enable global and local shape analyses^102,103, an area in geometric morphometrics historically overlooked. Equivalent dense landmark patches across the embryo datasets will also permit joint superimposition of multiple stages into a common shape space for increased statistical power as well as analyses of ontogeny (Fig. S4). In addition to landmarks, we make the corresponding deformation fields available on an ad hoc basis to support voxel-based meta-analyses of morphology. Despite its ubiquitous application in neuroimaging, voxel-based morphometry is rarely seen in fields that study hard tissue, such as evolutionary developmental biology, anthropology, and paleontology. These deformation fields will let one examine internal and external tissue interactions within anatomical context. Finally, we include anatomical segmentations for several stages, which can be used to restrict the dimensionality of a voxel-wise analysis, calculate the size (e.g., volume or surface area) of a structure, or perform a surface-based morphometry analysis. If users are dissatisfied with the coverage of existing landmarks and segmentations, they can modify the atlases and use the image transformations to generate new labels.

We have made the data and scripts freely available at FaceBase (www.facebase.org, https://doi.org/10.25550/3-HXMC)³⁵ and GitHub (https://github.com/jaydevine/MusMorph) to promote transparency, reproducibility, and future data aggregation. Completely open-source efforts like MusMorph are critical for standardizing phenotypic datasets. Unlike the field of genomics, which has been revolutionized through standardized sequencing and data crowdsourcing, phenomics continues to be limited by one-off, self-contained studies that cannot be related to one another. Standardized morphological datasets will allow research groups to, for instance, investigate the effects of a gene mutation alongside other mutants or wildtype strains in a common morphospace. The same can be said for other significant morphological factors and covariates, such as sex and age. Common morphospaces will further encourage multimodal data integration across the phenomic hierarchy, ranging from cellular and developmental phenotyping with light sheet microscopy¹⁰⁴ to tissue phenotyping with magnetic resonance imaging and contrast-enhanced computed tomography³⁸. Large phenotypic datasets will ultimately give us the statistical power needed to interrogate mechanisms that bias and generate morphological variation.

Sources of error and potential limitations

Staining artifacts are a drawback of contrast-enhanced computed tomography. Among the largest sources of registration error were poor contrast and background noise, particularly in the E15.5 dataset. Variable stain penetrance and inadequate contrast can underrepresent anatomy, whereas background noise can masquerade as anatomy and deceive the registration, even if the alignment is constrained with a mask. We mitigated labelling errors by registering thresholded images and by employing other preprocessing techniques, such as intensity bias correction and normalization. However, in some cases, the intensities of the scanning tube could not be distinguished from the specimen, leading to surface landmark errors. Another spatial alignment problem that was difficult to reconcile was variation in articulated anatomical positions. For example, head orientation relative to the body varied in the E15.5 and E18.5 datasets, and mandible orientation relative to the skull sometimes differed in the adult dataset. We chose to register the entire scan instead of separate segmentations, masks or cropped volumes, because a) we observed no significant differences in average registration quality, b) a single registration field is computationally more feasible to generate, store, and use downstream and c) a single atlas with a detailed set of labels is better for data standardization.

Non-linear alignment and labelling errors may occur around extreme anatomical points with high variability. To demonstrate how automated landmark error can be reduced, we implemented a neural network that minimized automated and manual craniofacial shape differences. Since the endocast, mandible, and embryo datasets do not have manual landmark training data, they cannot be optimized. However, if other investigators have training data, a network could be built to correct sparse phenotyping errors in areas of high morphological variability. Lastly, it is important to consider the computational time and memory needed for volumetric registration. To integrate new data, we strongly encourage users to parallelize their work on compute clusters.

Future development

The majority of MusMorph is composed of head data, because we had reservations about registering whole body data. Now that we have observed no significant differences in registration quality among datasets, on average, we plan to experiment with more whole-body data for embryos and adults. Another area we intend to improve is our developmental coverage. Despite sampling across most of development, we recognize that additional embryo timepoints (e.g., E9.5 and E12.5-14.5) are needed, as are higher sample sizes throughout mid-gestation and early adulthood. The developing mouse craniofacial complex, for example, undergoes immense growth during the first 30 days after birth¹⁰⁵. Early postnatal datasets will be critical for asking questions about size and ontogenetic allometry. Finally, to complement our large sample of homozygous embryo mutants, we hope to introduce more wildtype and heterozygous embryos for analyses of normal variation. Heterozygotes have not been a focus of the IMPC, so there is ample opportunity to reveal previously unrecognized embryo phenotypes with standardized MusMorph comparisons. The adult dataset, by contrast, needs to be balanced with more homozygous mutants to better understand how mutations of large effect influence morphological variance and other related phenomena, such as integration and modularity.

Data access

MusMorph is categorized as a “Project” on FaceBase. Projects can be found in the “Data Browser: Projects” tab at the top of the home page. Project data are organized hierarchically. The levels of the hierarchy in ascending order of data specificity are “Project”, “Dataset”, “Experiment”, and “Biosample”. A project contains datasets, which are sets of similar studies. Each dataset is annotated with study abstracts, experimental designs, and metadata identifiers. Datasets are composed of experiments. An experiment represents a set of similar specimens, so mice with the same genetic background, age, treatment, and mutation would constitute one experiment. Experiments contain biosamples. A biosample is an individual specimen.

After creating a free account and logging in the MusMorph data and metadata can be downloaded at any level in the project hierarchy using the “Export: BDBag” tool at the top-right of the browser. This export function uses DERIVA¹⁰⁶, the software platform that powers FaceBase, to generate a BDBag (Big Data Bag)¹⁰⁷ ZIP file. Users then need to download the file and process it via BDBag client tools, either via the command line or GUI application. Specific details about the DERIVA Client installation and the step-by-step export instructions are available here: www.facebase.org/help/exporting.

Code availability

Our code is freely available at https://github.com/jaydevine/MusMorph. The scripts describe every stage of the MusMorph data acquisition and analysis, including image preprocessing (e.g., file conversion, image resampling and intensity correction), processing (e.g., atlas generation, non-linear registration, label propagation), and postprocessing (e.g., shape optimization, morphometric analysis). We developed and implemented the code with Bash 4.4.20, R 3.6.1, Python 3.6, and Julia 1.2.0 on Ubuntu. To facilitate MusMorph software installations, reproducibility, and data aggregation, we have created a comprehensive Docker image that can be downloaded as follows: $ docker pull jaydevine/musmorph:latest. Further information about running the Docker container is available on GitHub. All code is distributed under the GNU General Public License v3.0.

Change history

28 June 2023
A Correction to this paper has been published: https://doi.org/10.1038/s41597-023-02320-x

References

Hallgrímsson, B., Mio, W., Marcucio, R. S. & Spritz, R. Let’s face it—complex traits are just not that simple. PLoS Genet 10, e1004724, https://doi.org/10.1371/journal.pgen.1004724 (2014).
Article CAS PubMed PubMed Central Google Scholar
Mitteroecker, P., Cheverud, J. M. & Pavlicev, M. Multivariate analysis of genotype-phenotype association. Genetics 202, 1345–1363, https://doi.org/10.1534/genetics.115.181339 (2016).
Article CAS PubMed PubMed Central Google Scholar
Pavlicev, M., Norgard, E. A., Fawcett, G. L. & Cheverud, J. M. Evolution of pleiotropy: epistatic interaction pattern supports a mechanistic model underlying variation in genotype-phenotype map. J. Exp. Zool. (Mol. Dev. Evol.) 316, 371–385, https://doi.org/10.1002/jez.b.21410 (2011).
Article Google Scholar
Green, R. M. et al. Developmental nonlinearity drives phenotypic robustness. Nat. Commun. 8, 1–12, https://doi.org/10.1038/s41467-017-02037-7 (2017).
Article ADS CAS Google Scholar
Young, N. M., Chong, H. J., Du, H., Hallgrímsson, B. & Marcucio, R. S. Quantitative analyses link modulation of sonic hedgehog signaling to continuous variation in facial growth and shape. Development 137, 3405–3409, https://doi.org/10.1242/dev.052340 (2010).
Article CAS PubMed PubMed Central Google Scholar
Wagner, G. P. Evolution of gene networks by gene duplications: a mathematical model and its implications on genome organization. Proc. Natl. Acad. Sci. USA 91, 4387–4391, https://doi.org/10.1073/pnas.91.10.4387 (1994).
Article ADS CAS PubMed PubMed Central MATH Google Scholar
Wagner, G. P. & Zhang, J. The pleiotropic structure of the genotype–phenotype map: the evolvability of complex organisms. Nat. Rev. Genet. 12, 204–213, https://doi.org/10.1038/nrg2949 (2011).
Article CAS PubMed Google Scholar
Rice, S. H. The evolution of canalization and the breaking of von Baer’s laws: modeling the evolution of development with epistasis. Evolution 52, 647–656, https://doi.org/10.1111/j.1558-5646.1998.tb03690.x (1998).
Article PubMed Google Scholar
Rice, S. H. Theoretical approaches to the evolution of development and genetic architecture. Ann. N.Y. Acad. Sci. 1133, 67–86, https://doi.org/10.1196/annals.1438.002 (2008).
Article ADS PubMed Google Scholar
Hallgrímsson, B. et al. The developmental-genetics of canalization. In Seminars in Cell & Developmental Biology, Vol. 88, https://doi.org/10.1016/j.semcdb.2018.05.019 (Academic Press, 2019).
Karim, K. et al. Xenbase: a genomic, epigenomic and transcriptomic model organism database. Nucleic Acids Res. 46, D861–D868, https://doi.org/10.1093/nar/gkx936 (2018).
Article CAS Google Scholar
Blake, J. A. et al. The Mouse Genome Database (MGD): the model organism database for the laboratory mouse. Nucleic Acids Res. 30, 113–115, https://doi.org/10.1093/nar/30.1.113 (2002).
Article ADS CAS PubMed PubMed Central Google Scholar
Howe et al. ZFIN, the Zebrafish Model Organism Database: increased support for mutants and transgenics. Nucleic Acids Res. 41, D854–D860, https://doi.org/10.1093/nar/gks938 (2012).
Article CAS PubMed PubMed Central Google Scholar
Wang, L., Wang, S., Li, Y., Paradesi, M. S. R. & Brown, S. J. BeetleBase: the model organism database for Tribolium castaneum. Nucleic Acids Res. 35, D476–D479, https://doi.org/10.1093/nar/gkl776 (2006).
Article PubMed PubMed Central Google Scholar
Brown, S. D. & Moore, M. W. The International Mouse Phenotyping Consortium: past and future perspectives on mouse phenotyping. Mamm. Genome 23, 632–640, https://doi.org/10.1007/s00335-012-9427-x (2012).
Article CAS PubMed PubMed Central Google Scholar
Brown, S. D. & Moore, M. W. Towards an encyclopaedia of mammalian gene function: The International Mouse Phenotyping Consortium. Dis. Model. Mech. 5, 289–292, https://doi.org/10.1242/dmm.009878 (2012).
Article CAS PubMed PubMed Central Google Scholar
Horner, N. R. et al. LAMA: automated image analysis for the developmental phenotyping of mouse embryos. Development 148, dev192955, https://doi.org/10.1242/dev.192955 (2021).
Article CAS PubMed PubMed Central Google Scholar
Koscielny, G. et al. The International Mouse Phenotyping Consortium Web Portal, a unified point of access for knockout mice and related phenotyping data. Nucleic Acids Res. 42, D802–D809, https://doi.org/10.1093/nar/gkt977 (2014).
Article CAS PubMed Google Scholar
Meehan, T. F. et al. Disease model discovery from 3,328 gene knockouts by The International Mouse Phenotyping Consortium. Nat. Genet. 49, 1231–1238, https://doi.org/10.1038/ng.3901 (2017).
Article CAS PubMed PubMed Central Google Scholar
Dickinson, M. E. et al. High-throughput discovery of novel developmental phenotypes. Nature 537, 508–514, https://doi.org/10.1038/nature19356 (2016).
Article CAS PubMed PubMed Central Google Scholar
Churchill, G. A., Gatti, D. M., Munger, S. C. & Svenson, K. L. The diversity outbred mouse population. Mamm. Genome 23, 713–718, https://doi.org/10.1007/s00335-012-9414-2 (2012).
Article PubMed PubMed Central Google Scholar
Collaborative Cross Consortium. The genome architecture of the Collaborative Cross mouse genetic reference population. Genetics 190, 389–401, https://doi.org/10.1534/genetics.111.132639 (2012).
Article CAS PubMed Central Google Scholar
Katz, D. C. et al. Facial shape and allometry quantitative trait locus intervals in the Diversity Outbred mouse are enriched for known skeletal and facial development genes. PLoS ONE 15, e023337, https://doi.org/10.1371/journal.pone.0233377 (2020).
Article CAS Google Scholar
Ashburner, J. & Friston, K. J. Voxel-based morphometry—the methods. NeuroImage 11, 805–821, https://doi.org/10.1006/nimg.2000.0582 (2000).
Article CAS PubMed Google Scholar
Fonov, V. et al. Unbiased average age-appropriate atlases for pediatric studies. NeuroImage 54, 313–327, https://doi.org/10.1016/j.neuroimage.2010.07.033 (2011).
Article PubMed Google Scholar
Ridgway, G. R. et al. Ten simple rules for reporting voxel-based morphometry studies. NeuroImage 40, 1429–1435, https://doi.org/10.1016/j.neuroimage.2008.01.003 (2008).
Article PubMed Google Scholar
Silver, M., Montana, G., Nichols, T. E. & Alzheimer’s Disease Neuroimaging Initiative. False positives in neuroimaging genetics using voxel-based morphometry data. NeuroImage 54, 992–1000, https://doi.org/10.1016/j.neuroimage.2010.08.049 (2011).
Article PubMed Google Scholar
Adams, D. C., Rohlf, F. J. & Slice, D. E. A field comes of age: geometric morphometrics in the 21st century. Hystrix 24, 7, https://doi.org/10.4404/hystrix-24.1-6283 (2013).
Article Google Scholar
Boyer, D. M. et al. A new fully automated approach for aligning and comparing shapes. Anat. Rec. 298, 249–276, https://doi.org/10.1002/ar.23084 (2015).
Article Google Scholar
Maga, A. M., Tustison, N. J. & Avants, B. B. A population level atlas of Mus musculus craniofacial skeleton and automated image‐based shape analysis. J. Anat. 231, 433–443, https://doi.org/10.1111/joa.12645 (2017).
Article ADS PubMed PubMed Central Google Scholar
Porto, A. & Voje, K. L. ML‐morph: A fast, accurate and general approach for automated detection and landmarking of biological structures in images. Methods Ecol. Evol. 11, 500–512, https://doi.org/10.1111/2041-210X.13373 (2020).
Article Google Scholar
Rolfe, S. et al. SlicerMorph: retrieve, visualize and analyze 3D morphology with open-source. Integr. Comp. Biol. 60, e269–454 (2020).
Google Scholar
Vidal‐García, M., Bandara, L. & Keogh, J. S. ShapeRotator: an R tool for standardized rigid rotations of articulated three‐dimensional structures with application for geometric morphometrics. Ecol. Evol. 8, 4669–4675, https://doi.org/10.1002/ece3.4018 (2018).
Article PubMed PubMed Central Google Scholar
Samuels, B. D. et al. FaceBase 3: analytical tools and FAIR resources for craniofacial and dental research. Development 147, dev191213, https://doi.org/10.1242/dev.191213 (2020).
Article CAS PubMed PubMed Central Google Scholar
Devine, J. et al. MusMorph, a database of standardized mouse morphology data for morphometric meta-analyses. FaceBase Consortium, https://doi.org/10.25550/3-HXMC (2021).
Devine, J. et al. Project-wide metadata, atlases, and landmarks for MusMorph. FaceBase Consortium https://doi.org/10.25550/6-2EPY (2021).
Wong, M. D., Spring, S. & Henkelman, R. M. Structural stabilization of tissue for embryo phenotyping using micro-CT with iodine staining. PLoS ONE 8, e84321, https://doi.org/10.1371/journal.pone.0084321 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Gignac, P. M. et al. Diffusible iodine-based contrast-enhanced computed tomography (diceCT): an emerging tool for rapid, high-resolution, 3-D imaging of metazoan soft tissues. J. Anat. 228, 889–909, https://doi.org/10.1111/joa.12449 (2016).
Article CAS PubMed PubMed Central Google Scholar
Green, R. M., Leach, C. L., Hoehn, N., Marcucio, R. S. & Hallgrímsson, B. Quantifying three‐dimensional morphology and RNA from individual embryos. Dev. Dynam. 246, 431–436, https://doi.org/10.1002/DVDY.24490 (2017).
Article CAS Google Scholar
Feldkamp, L. A., Davis, L. C. & Kress, J. W. Practical cone-beam algorithm. J. Opt. Soc. Am. A 1, 612–619, https://doi.org/10.1364/JOSAA.1.000612 (1984).
Article ADS Google Scholar
Vincent, R. D. et al. MINC 2.0: a flexible format for multi-modal images. Front. Neuroinform. 10, 35, https://doi.org/10.3389/fninf.2016.00035 (2016).
Article PubMed PubMed Central Google Scholar
Sled, J. G., Zijdenbos, A. P. & Evans, A. C. A nonparametric method for automatic correction of intensity nonuniformity in MRI data. IEEE T. Med. Imaging 17, 87–97, https://doi.org/10.1109/42.668698 (1998).
Article CAS Google Scholar
Friedel, M., van Eede, M. C., Pipitone, J., Chakravarty, M. M. & Lerch, J. P. Pydpiper: a flexible toolkit for constructing novel registration pipelines. Front. Neuroinform. 8, 67, https://doi.org/10.3389/fninf.2014.00067 (2014).
Article PubMed PubMed Central Google Scholar
Percival, C. J. et al. The effect of automated landmark identification on morphometric analyses. J. Anat. 234, 917–935, https://doi.org/10.1111/joa.12973 (2019).
Article PubMed PubMed Central Google Scholar
Collins, D. L., Neelin, P., Peters, T. M. & Evans, A. C. Automatic 3D intersubject registration of MR volumetric data in standardized Talairach space. J. Comput. Assist. Tomo. 18, 192–205, https://doi.org/10.1097/00004728-199403000-00005 (1994).
Article CAS Google Scholar
Lerch, J. P., Sled, J. G. & Henkelman, R. M. MRI Phenotyping of Genetically Altered Mice. In: Magnetic Resonance Neuroimaging. Methods in Molecular Biology (Methods and Protocols), Vol. 711 (eds. Modo M., Bulte, J.) https://doi.org/10.1007/978-1-61737-992-5_17 (Humana Press, 2011).
Collins, D. L. & Evans, A. C. Animal: validation and applications of nonlinear registration-based segmentation. Int. J. Pattern Recogn. 11, 1271–1294, https://doi.org/10.1142/S0218001497000597 (1997).
Article Google Scholar
Wong, M. D., Dorr, A. E., Walls, J. R., Lerch, J. P. & Henkelman, R. M. A novel 3D mouse embryo atlas based on micro-CT. Development 139, 3248–3256, https://doi.org/10.1242/dev.082016 (2012).
Article CAS PubMed PubMed Central Google Scholar
Kikinis R., Pieper S. D. & Vosburgh K. G. 3D Slicer: A Platform for Subject-Specific Image Analysis, Visualization, and Clinical Support. In: Intraoperative Imaging and Image-Guided Therapy (eds. Jolesz, F.) https://doi.org/10.1007/978-1-4614-7657-3_19. (Springer, 2014).
Percival, C. J., Green, R., Marcucio, R. S. & Hallgrímsson, B. Surface landmark quantification of embryonic mouse craniofacial morphogenesis. BMC Dev. Biol. 14, 1–12, https://doi.org/10.1186/1471-213X-14-31 (2014).
Article Google Scholar
Bastir, M. A systems-model for the morphological analysis of integration and modularity in human craniofacial evolution. J. Anthropol. Sci. 86, 19934468 (2008).
Google Scholar
Porto, A., de Oliveira, F. B., Shirai, L. T., De Conto, V. & Marroig, G. The evolution of modularity in the mammalian skull I: Morphological integration patterns and magnitudes. Evol. Biol. 36, 118–135, https://doi.org/10.1007/s11692-008-9038-3 (2009).
Article Google Scholar
Hallgrímsson, B. et al. Integration and the developmental genetics of allometry. Integr. Comp. Biol. 59, 1369–1381, https://doi.org/10.1093/icb/icz105 (2019).
Article PubMed PubMed Central Google Scholar
Richtsmeier, J. T. et al. Phenotypic integration of neurocranium and brain. J. Exp. Zool. (Mol. Dev. Evol.) 306, 360–378, https://doi.org/10.1002/jez.b.21092 (2006).
Article Google Scholar
Marchini, M. et al. Wnt signaling drives correlated changes in facial morphology and brain shape. Front. Cell Dev. Biol. 9, 694, https://doi.org/10.3389/fcell.2021.644099 (2021).
Article Google Scholar
Smith, K. K. Integration of craniofacial structures during development in mammals. Am. Zool. 36, 70–79 (1996).
Article Google Scholar
Young, N. M., Linde-Medina, M., Fondon, J. W., Hallgrímsson, B. & Marcucio, R. S. Craniofacial diversification in the domestic pigeon and the evolution of the avian skull. Nat. Ecol. Evol. 1, 1–8, https://doi.org/10.1038/s41559-017-0095 (2017).
Article Google Scholar
Toussaint, N. et al. A landmark-free morphometrics pipeline for high-resolution phenotyping: application to a mouse model of Down syndrome. Development 148, dev188631, https://doi.org/10.1242/dev.188631 (2021).
Article CAS PubMed PubMed Central Google Scholar
Avants, B. B. et al. A reproducible evaluation of ANTs similarity metric performance in brain image registration. NeuroImage 54, 2033–2044, https://doi.org/10.1016/j.neuroimage.2010.09.025 (2011).
Article PubMed Google Scholar
Devine, J. et al. A registration and deep learning approach to automated landmark detection for geometric morphometrics. Evol. Biol. 47, 246–259, https://doi.org/10.1007/s11692-020-09508-8 (2020).
Article PubMed PubMed Central Google Scholar
Attanasio, C. et al. Fine tuning of craniofacial morphology by distant-acting enhancers. Science 342, 1–20, https://doi.org/10.1126/science.1241006 (2014).
Article CAS Google Scholar
Hallgrímsson, B., Willmore, K., Dorval, C. & Cooper, D. M. L. Craniofacial variability and modularity in macaques and mice. J. Exp. Zool. Part B 302, 207–225, https://doi.org/10.1002/jez.b.21002 (2004).
Article Google Scholar
Hallgrímsson, B. et al. The Brachymorph mouse and the developmental-genetic basis for canalization and morphological integration. Evol. Dev. 8, 61–73, https://doi.org/10.1111/j.1525-142X.2006.05075.x (2006).
Article PubMed Google Scholar
Hallgrímsson, B. et al. Deciphering the Palimpsest: Studying the relationship between morphological integration and phenotypic covariation. Evol. Biol. 36, 355–376, https://doi.org/10.1007/s11692-009-9076-5 (2009).
Article PubMed PubMed Central Google Scholar
Lieberman, D. E., Hallgrímsson, B., Liu, W., Parsons, T. E. & Jamniczky, H. A. Spatial packing, cranial base angulation, and craniofacial shape variation in the mammalian skull: Testing a new model using mice. J. Anat. 212, 720–735, https://doi.org/10.1111/j.1469-7580.2008.00900.x (2008).
Article PubMed PubMed Central Google Scholar
Devine, J. et al. Collaborative Cross: A standardized mouse morphology dataset for MusMorph. FaceBase Consortium https://doi.org/10.25550/3-KB0W (2021).
Devine, J. et al. Hybrid: A standardized mouse morphology dataset for MusMorph. FaceBase Consortium https://doi.org/10.25550/3-KB32 (2021).
Devine, J. et al. Diversity Outbred: A standardized mouse morphology dataset for MusMorph. FaceBase Consortium https://doi.org/10.25550/3-KB0W (2021).
Devine, J. et al. IMPC: A standardized mouse morphology dataset for MusMorph. FaceBase Consortium https://doi.org/10.25550/3-JZA6 (2021).
Devine, J. et al. Ap2: A standardized mouse morphology dataset for MusMorph. FaceBase Consortium https://doi.org/10.25550/3-JQMG (2021).
Devine, J. et al. B9d: A standardized mouse morphology dataset for MusMorph. FaceBase Consortium https://doi.org/10.25550/3-JQMM (2021).
Devine, J. et al. Bulgy: A standardized mouse morphology dataset for MusMorph. FaceBase Consortium https://doi.org/10.25550/3-JZ9G (2021).
Devine, J. et al. Brain-Face: A standardized mouse morphology dataset for MusMorph. FaceBase Consortium https://doi.org/10.25550/3-KB3J (2021).
Devine, J. et al. Longshanks: A standardized mouse morphology dataset for MusMorph. FaceBase Consortium https://doi.org/10.25550/3-KFBE (2021).
Unger, C. M., Devine, J., Hallgrímsson, B. & Rolian, C. Selection for increased tibia length in mice alters skull shape through parallel changes in developmental mechanisms. Elife 10, e67612, https://doi.org/10.7554/eLife.67612 (2021).
Article CAS PubMed PubMed Central Google Scholar
Marchini, M. & Rolian, C. Artificial selection sheds light on developmental mechanisms of limb elongation. Evolution 72, 825–837, https://doi.org/10.1111/evo.13447 (2018).
Article PubMed Google Scholar
Spring, S., Lerch, J. P. & Henkelman, R. M. Sexual dimorphism revealed in the structure of the mouse brain using three-dimensional magnetic resonance imaging. NeuroImage 35, 1424–1433, https://doi.org/10.1016/j.neuroimage.2007.02.023 (2007).
Article PubMed Google Scholar
Gonzalez, P. N., Bernal, V. & Perez, S. I. Analysis of sexual dimorphism of craniofacial traits using geometric morphometric techniques. Int. J. Osteoarchaeol. 21, 82–91, https://doi.org/10.1002/oa.1109 (2011).
Article Google Scholar
Callewaert, F. et al. Sexual dimorphism in cortical bone size and strength but not density is determined by independent and time-specific actions of sex steroids and IGF-1: Evidence from pubertal mouse models. J. Bone Miner. Res. 25, 617–626, https://doi.org/10.1359/jbmr.090828 (2010).
Article CAS PubMed Google Scholar
Grove, K. L., Fried, S. K., Greenberg, A. S., Xiao, X. Q. & Clegg, D. J. A microarray analysis of sexual dimorphism of adipose tissues in high-fat-diet-induced obese mice. Int. J. Obesity 34, 989–1000, https://doi.org/10.1038/ijo.2010.12 (2010).
Article CAS Google Scholar
Eaton, M. et al. Complex patterns of cell growth in the placenta in normal pregnancy and as adaptations to maternal diet restriction. PLoS ONE 15, e0226735, https://doi.org/10.1371/journal.pone.0226735 (2020).
Article CAS PubMed PubMed Central Google Scholar
Gonzalez et al. Chronic protein restriction in mice impacts placental function and maternal body weight before fetal growth. PLoS ONE 11, e0152227, https://doi.org/10.1371/journal.pone.0152227 (2016).
Article CAS PubMed PubMed Central Google Scholar
Devine, J. et al. Strain Comparison: A standardized mouse morphology dataset for MusMorph. FaceBase Consortium https://doi.org/10.25550/3-JZ9J (2021).
Jamniczky, H. A. & Hallgrímsson, B. A comparison of covariance structure in wild and laboratory muroid crania. Evolution 63, 1540–1556, https://doi.org/10.1111/j.1558-5646.2009.00651.x (2009).
Article PubMed Google Scholar
Davies, A. G., Bettinger, J. C., Thiele, T. R., Judy, M. E. & McIntire, S. L. Natural variation in the npr-1 gene modifies ethanol responses of wild strains of C. elegans. Neuron 42, 731–743, https://doi.org/10.1016/j.neuron.2004.05.004 (2004).
Article CAS PubMed Google Scholar
Pavlicev, M., Norgard, E. A., Fawcett, G. L. & Cheverud, J. M. Evolution of pleiotropy: epistatic interaction pattern supports a mechanistic model underlying variation in genotype–phenotype map. J. Exp. Zool. (Mol. Dev. Evol.) 316, 371–385 (2011).
Article Google Scholar
Percival, C. J., Marangoni, P., Tapaltsyan, V., Klein, O. & Hallgrímsson, B. The interaction of genetic background and mutational effects in regulation of mouse craniofacial shape. G3—Genes Genom. Genet. 7, 1439–1450, https://doi.org/10.1534/g3.117.040659 (2017).
Article CAS Google Scholar
Devine, J. et al. Spry: A standardized mouse morphology dataset for MusMorph. FaceBase Consortium https://doi.org/10.25550/3-JZAM (2021).
Cheverud, J. M. Relationships among ontogenetic, static, and evolutionary allometry. Am. J. Phys. Anthropol. 59, 139–149 (1982).
Article CAS PubMed Google Scholar
Gonzalez, P. N., Kristensen, E., Morck, D. W., Boyd, S. & Hallgrímsson, B. Effects of growth hormone on the ontogenetic allometry of craniofacial bones. Evol. Dev. 15, 133–145, https://doi.org/10.1111/ede.12025 (2013).
Article CAS PubMed PubMed Central Google Scholar
Klingenberg, C. P. Multivariate allometry. In Advances in Morphometrics https://doi.org/10.1007/978-1-4757-9083-2_3 (Springer, 1996).
Mosimann, J. E. Size allometry: size and shape variables with characterizations of the lognormal and generalized gamma distributions. J. Am. Stat. Assoc. 65, 930–945, https://doi.org/10.1080/01621459.1970.10481136 (1970).
Article MATH Google Scholar
Jolicoeur, P. Note: the multivariate generalization of the allometry equation. Biometrics 19, 497–499, https://doi.org/10.2307/2527939 (1963).
Article Google Scholar
Richtsmeier, J. T. & Flaherty, K. Hand in glove: brain and skull in development and dysmorphogenesis. Acta. Neuropathol. 125, 469–489 (2013).
Article CAS PubMed PubMed Central Google Scholar
Klingenberg, C. P. Morphometrics and the role of the phenotype in studies of the evolution of developmental mechanisms. Gene 287, 3–10, https://doi.org/10.1016/S0378-1119(01)00867-8 (2002).
Article CAS PubMed Google Scholar
Soulé, M. E. Heterozygosity and developmental stability: another look. Evolution 33, 396–401, https://doi.org/10.2307/2407629 (1979).
Article PubMed Google Scholar
Sotiras, A., Davatzikos, C. & Paragios, N. Deformable medical image registration: A survey. IEEE T. Med. Imaging 32, 1153–1190, https://doi.org/10.1109/TMI.2013.2265603 (2013).
Article Google Scholar
Devine, J. et al. Ghrhr: A standardized mouse morphology dataset for MusMorph. FaceBase Consortium https://doi.org/10.25550/3-KB08 (2021).
Vidal-García, M. morpho.tools.GM v1.0.0: A set of R tools to help with geometric morphometric analyses of 3D data. zenodo https://doi.org/10.5281/zenodo.4673771 (2021).
Wong, M. D. et al. 4D atlas of the mouse embryo for precise morphological staging. Development 142, 3583–3591, https://doi.org/10.1242/dev.125872 (2015).
Article CAS PubMed PubMed Central Google Scholar
Schlager, S. Morpho and Rvcg–Shape analysis in R: R packages for geometric morphometrics, shape analysis and surface manipulations. In Statistical Shape and Deformation Analysis. Methods, Implementation and Applications https://doi.org/10.1016/B978-0-12-810493-4.00011-0 (Academic Press, 2017).
Claes, P. et al. Genome-wide mapping of global-to-local genetic effects on human facial shape. Nat. Genet. 50, 414–423, https://doi.org/10.1038/s41588-018-0057-4 (2018).
Article CAS PubMed PubMed Central Google Scholar
Mitteroecker, P. et al. Morphometric variation at different spatial scales: coordination and compensation in the emergence of organismal form. Syst. Biol. 69, 913–926, https://doi.org/10.1093/sysbio/syaa007 (2020).
Article PubMed PubMed Central Google Scholar
Epp, J. R. et al. Optimization of CLARITY for clearing whole-brain and other intact organs. eNeuro 2, https://doi.org/10.1523/ENEURO.0022-15.2015 (2015).
Vora, S. R., Camci, E. D. & Cox, T. C. Postnatal ontogeny of the cranial base and craniofacial skeleton in male C57BL/6J mice: A reference standard for quantitative analysis. Front. Physiol. 6, 417, https://doi.org/10.3389/fphys.2015.00417 (2016).
Article PubMed PubMed Central Google Scholar
Bugacov, A. et al. Experiences with DERIVA: An asset management platform for accelerating eScience. In: IEEE 13^th International Conference on e-Science, 79—88, https://doi.org/10.1109/eScience.2017.20 (2017).
Chard, K. et al. I’ll take that to go: Big data bags and minimal identifiers for exchange of large, complex datasets. In: IEEE International Conference on Big Data, 319—328, https://doi.org/10.1109/BigData.2016.7840618 (2016).

Download references

Acknowledgements

We thank the Advanced Research Computing team at the University of Calgary for facilitating image processing and storage on the ARC and Helix compute clusters. We also thank FaceBase and the International Mouse Phenotyping Consortium for assisting with image data storage and acquisition. Finally, we would like to acknowledge funding from a CIHR Foundation Grant 159920 to BH and RM, an NSERC Discovery Grant 238992-17 to BH, an NIH R01DE019638 to RM, BH, JC, AL, and WD, an NIH U01DE028729 to BH and OK, NIH R35-DE026602 to OK, NIH R01HL138659 to AL and AC, NIH R01 DE019843 and NIH R01 DE012728 to TW, NIH R01DE025222 to AM, NIH U54 HG006370-01 to HW, Israel Science Foundation grant 2334/21 to RB as well as funding from Alberta Innovates, and the Alberta Children’s Hospital Foundation to BH. MVG is supported by an Alberta Innovates Postdoctoral Fellowship in Health Innovation. JD.

Author information

Authors and Affiliations

Alberta Children’s Hospital Research Institute, University of Calgary, 28 Oki Dr NW, Calgary, AB, T3B 6A8, Canada
Jay Devine, Marta Vidal-García, Wei Liu, Lucas D. Lo Vercio, Marta Marchini, Bethany Radford, Frank R. Jirik, Myriam Hemberger, Wendy Dean, Nils D. Forkert & Benedikt Hallgrímsson
The McCaig Institute for Bone and Joint Health, University of Calgary, 3280 Hospital Dr NW, Calgary, AB, T2N 4Z6, Canada
Jay Devine, Marta Vidal-García, Wei Liu, Lucas D. Lo Vercio, Marta Marchini, Colton M. Unger, Campbell Rolian, Frank R. Jirik & Benedikt Hallgrímsson
Department of Cell Biology and Anatomy, Cumming School of Medicine, University of Calgary, 3330 Hospital Dr NW, Calgary, AB, T2N 4N1, Canada
Jay Devine, Marta Vidal-García, Wei Liu, Lucas D. Lo Vercio, Marta Marchini, Wendy Dean & Benedikt Hallgrímsson
Department of Biology, McMaster University, 1280 Main St W, Hamilton, ON, L8S 4L8, Canada
Amanda Neves
School of Dental Medicine, University of Pittsburgh, 3501 Terrace St, Pittsburgh, PA, 15213, USA
Rebecca M. Green
Orthopaedic Trauma Institute, ZSFG, UCSF, 2550 23rd St, San Francisco, CA, 94110, USA
Heather A. Richbourg, Nathan M. Young & Ralph S. Marcucio
Department of Biological Sciences, University of Calgary, 2500 University Dr NW, Calgary, AB, T2N 1N4, Canada
Colton M. Unger
Center for Craniofacial Molecular Biology, Department of Biomedical Sciences, Herman Ostrow School of Dentistry, University of Southern California, Los Angeles, 2250 Alcazar St, Los Angeles, CA, 90033, USA
Audrey C. Nickle & Amy E. Merrill
Department of Biochemistry and Molecular Medicine, Keck School of Medicine, University of Southern California, Los Angeles, 1975 Zonal Ave, Los Angeles, CA, 90033, USA
Audrey C. Nickle & Amy E. Merrill
Department of Biochemistry and Molecular Biology, Cumming School of Medicine, University of Calgary, 3330 Hospital Dr NW, Calgary, AB, T2N 4N1, Canada
Bethany Radford, Frank R. Jirik & Myriam Hemberger
Institute for Studies in Neuroscience and Complex Systems (ENyS) CONICET, Av. Calchaquí, 5402, Florencio Varela, Buenos Aires, Argentina
Paula N. Gonzalez
Information Sciences Institute, Viterbi School of Engineering, University of Southern California, 4676 Admiralty Way, Marina del Rey, CA, 90292, USA
Robert E. Schuler & Alejandro Bugacov
Department of Comparative Biology and Experimental Medicine, Faculty of Veterinary Medicine, University of Calgary, 3330 Hospital Dr NW, Calgary, AB, T2N 4N1, Canada
Campbell Rolian
Department of Anthropology, Stony Brook University, 100 Nicolls Rd, Stony Brook, NY, 11794, USA
Christopher J. Percival
Department of Craniofacial Biology, University of Colorado Anschutz Medical Campus, 12801 East 17th Ave, Aurora, CO, 80045, USA
Trevor Williams
Department of Molecular, Cellular and Developmental Biology, University of Colorado Boulder, Boulder, CO, 80309, USA
Lee Niswander
Department of Anatomy and Neurobiology, University of California, Irvine, Irvine, CA, 92697, USA
Anne L. Calof
Center for Complex Biological Systems, University of California, Irvine, Irvine, CA, 92697, USA
Anne L. Calof & Arthur D. Lander
Department of Developmental and Cell Biology, University of California, Irvine, Irvine, CA, 92697, USA
Arthur D. Lander
Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA
Axel Visel
U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA
Axel Visel
School of Natural Sciences, University of California, Merced, 5200 Lake Rd, Merced, CA, 95343, USA
Axel Visel
Department of Biology, Loyola University Chicago, 1032 W Sheridan Rd, Chicago, IL, 60660, USA
James M. Cheverud
Department of Orofacial Sciences and Program in Craniofacial Biology, University of California, San Francisco, 513 Parnassus Ave, San Francisco, CA, 94143, USA
Ophir D. Klein
Department of Pediatrics and Institute for Human Genetics, University of California, San Francisco, 513 Parnassus Ave, San Francisco, CA, 94143, USA
Ophir D. Klein
Department of Pediatrics, Cedars-Sinai Medical Center, 8700 Beverly Blvd, Los Angeles, CA, 90048, USA
Ophir D. Klein
Department of Life Sciences, Faculty of Natural Sciences, The Ben-Gurion University of the Negev, David Ben Gurion Blvd 1, Be’er Sheva, Israel
Ramon Y. Birnbaum
Department of Archaeology, University of Cape Town, Rondebosch, Cape Town, 7700, South Africa
Rebecca R. Ackermann
Human Evolution Research Institute, University of Cape Town, Rondebosch, Cape Town, 7700, South Africa
Rebecca R. Ackermann
School of Dentistry, Faculty of Medicine and Dentistry, University of Alberta, 116 St. and 85 Ave, Edmonton, AB, T6G 2R3, Canada
Daniel Graf
Department of Medical Genetics, Faculty of Medicine and Dentistry, University of Alberta, 116 St. and 85 Ave, Edmonton, AB, T6G 2R3, Canada
Daniel Graf
Department of Radiology, Cumming School of Medicine, University of Calgary, 3330 Hospital Dr NW, Calgary, AB, T2N 4N1, Canada
Nils D. Forkert
The Jackson Laboratory, 600 Main St, Bar Harbor, ME, 04609, USA
Stephen A. Murray
Department of Bioimaging Informatics, MRC Harwell Institute, Oxfordshire, OX11 0RD, UK
Henrik Westerberg

Authors

Jay Devine
View author publications
You can also search for this author in PubMed Google Scholar
Marta Vidal-García
View author publications
You can also search for this author in PubMed Google Scholar
Wei Liu
View author publications
You can also search for this author in PubMed Google Scholar
Amanda Neves
View author publications
You can also search for this author in PubMed Google Scholar
Lucas D. Lo Vercio
View author publications
You can also search for this author in PubMed Google Scholar
Rebecca M. Green
View author publications
You can also search for this author in PubMed Google Scholar
Heather A. Richbourg
View author publications
You can also search for this author in PubMed Google Scholar
Marta Marchini
View author publications
You can also search for this author in PubMed Google Scholar
Colton M. Unger
View author publications
You can also search for this author in PubMed Google Scholar
Audrey C. Nickle
View author publications
You can also search for this author in PubMed Google Scholar
Bethany Radford
View author publications
You can also search for this author in PubMed Google Scholar
Nathan M. Young
View author publications
You can also search for this author in PubMed Google Scholar
Paula N. Gonzalez
View author publications
You can also search for this author in PubMed Google Scholar
Robert E. Schuler
View author publications
You can also search for this author in PubMed Google Scholar
Alejandro Bugacov
View author publications
You can also search for this author in PubMed Google Scholar
Campbell Rolian
View author publications
You can also search for this author in PubMed Google Scholar
Christopher J. Percival
View author publications
You can also search for this author in PubMed Google Scholar
Trevor Williams
View author publications
You can also search for this author in PubMed Google Scholar
Lee Niswander
View author publications
You can also search for this author in PubMed Google Scholar
Anne L. Calof
View author publications
You can also search for this author in PubMed Google Scholar
Arthur D. Lander
View author publications
You can also search for this author in PubMed Google Scholar
Axel Visel
View author publications
You can also search for this author in PubMed Google Scholar
Frank R. Jirik
View author publications
You can also search for this author in PubMed Google Scholar
James M. Cheverud
View author publications
You can also search for this author in PubMed Google Scholar
Ophir D. Klein
View author publications
You can also search for this author in PubMed Google Scholar
Ramon Y. Birnbaum
View author publications
You can also search for this author in PubMed Google Scholar
Amy E. Merrill
View author publications
You can also search for this author in PubMed Google Scholar
Rebecca R. Ackermann
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Graf
View author publications
You can also search for this author in PubMed Google Scholar
Myriam Hemberger
View author publications
You can also search for this author in PubMed Google Scholar
Wendy Dean
View author publications
You can also search for this author in PubMed Google Scholar
Nils D. Forkert
View author publications
You can also search for this author in PubMed Google Scholar
Stephen A. Murray
View author publications
You can also search for this author in PubMed Google Scholar
Henrik Westerberg
View author publications
You can also search for this author in PubMed Google Scholar
Ralph S. Marcucio
View author publications
You can also search for this author in PubMed Google Scholar
Benedikt Hallgrímsson
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.D., M.V.G., and B.H.: Study design, image processing, data collection, data analysis, drafting the manuscript, and revising it critically. H.W. and R.S.M.: Study design, data collection, drafting the manuscript, and revising it critically. W.L., A.N., and L.D.L.: Image processing, data collection, and revising the manuscript critically. R.E.S and A.B.: Data upload and organization. R.M.G., H.A.R., M.M., C.M.U., A.C.N., N.M.Y., P.N.G., C.R., C.J.P., T.W., L.N., A.L.C., A.D.L, A.V., F.R.J., J.M.C., O.D.K., R.Y.B., A.E.M., R.R.A., D.G., W.D., B.R., M.H., and S.A.M.: Data collection and revising the manuscript critically. All authors gave final approval for publication.

Corresponding author

Correspondence to Benedikt Hallgrímsson.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Devine, J., Vidal-García, M., Liu, W. et al. MusMorph, a database of standardized mouse morphology data for morphometric meta-analyses. Sci Data 9, 230 (2022). https://doi.org/10.1038/s41597-022-01338-x

Download citation

Received: 15 November 2021
Accepted: 13 April 2022
Published: 25 May 2022
DOI: https://doi.org/10.1038/s41597-022-01338-x