Finding cell-specific expression patterns in the early Ciona embryo with single-cell RNA-seq

Ilsley, Garth R.; Suyama, Ritsuko; Noda, Takeshi; Satoh, Nori; Luscombe, Nicholas M.

doi:10.1038/s41598-020-61591-1

Download PDF

Article
Open access
Published: 18 March 2020

Finding cell-specific expression patterns in the early Ciona embryo with single-cell RNA-seq

Scientific Reports volume 10, Article number: 4961 (2020) Cite this article

2804 Accesses
5 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Single-cell RNA-seq has been established as a reliable and accessible technique enabling new types of analyses, such as identifying cell types and studying spatial and temporal gene expression variation and change at single-cell resolution. Recently, single-cell RNA-seq has been applied to developing embryos, which offers great potential for finding and characterising genes controlling the course of development along with their expression patterns. In this study, we applied single-cell RNA-seq to the 16-cell stage of the Ciona embryo, a marine chordate and performed a computational search for cell-specific gene expression patterns. We recovered many known expression patterns from our single-cell RNA-seq data and despite extensive previous screens, we succeeded in finding new cell-specific patterns, which we validated by in situ and single-cell qPCR.

Single-cell long-read sequencing-based mapping reveals specialized splicing patterns in developing and adult mouse and human brain

Article Open access 09 April 2024

Anoushka Joglekar, Wen Hu, … Hagen U. Tilgner

Three million images and morphological profiles of cells treated with matched chemical and genetic perturbations

Article Open access 09 April 2024

Srinivas Niranj Chandrasekaran, Beth A. Cimini, … Anne E. Carpenter

Inferring gene regulatory networks from single-cell multiome data using atlas-scale external data

Article Open access 12 April 2024

Qiuyue Yuan & Zhana Duren

Introduction

Cell types can be characterised at high resolution with single-cell RNA-seq. Apparently homogenous groups of cells can be clustered to identify novel and rare subtypes^1,2,3,4,5 and cells undergoing differentiation at different rates can be ordered and analysed^6,7,8,9. This offers great potential for studying developing embryos with their high diversity of cell types^{10,11,12,13,14,15,16,17,18,19,20}. Making sense of this diversity is a goal of developmental biology and an important first step is identifying and characterising the relatively few genes controlling the course of development and the subsets of cells they are expressed in, or in other words, their gene expression patterns.

We applied single-cell RNA-seq (scRNA-seq) to the 16-cell stage of Ciona, a simple chordate, and looked for gene expression patterns using a novel computational approach. Zygotic expression begins around the 8-cell stage^21,22 and many gene expression patterns, both maternal and zygotic, are known from comprehensive in situ screens in Ciona²³ (Fig. 1b, Supplementary Table 1). We identified many of these from our scRNA-seq data. In addition, we found unknown gene expression patterns, which we validated by in situ and single-cell qPCR.

Results

Ciona development follows an invariant lineage allowing cells to be unambiguously identified at the 16-cell stage^24,25,26, whether before sequencing as in our work (Fig. 1a) or afterwards based on expression, as in recent work of the same stage²⁷. We collected cells from four 16-cell stage embryos, each from different individuals fertilised on different days. The left and right side of the embryo at this stage are thought to be symmetrical, but to avoid any potential bilateral variation in our small sample, we collected all eight cell types from the right half of each embryo. We multiplexed the cells of each embryo (or batch) and sequenced them on Illumina MiSeq, as well as on three lanes across two Illumina HiSeq 2500 runs, leading to scRNA-seq measurements for 32 cells (Supplementary Table 2).

Our cells were sequenced with relatively high coverage (around 20 million reads per cell on Illumina HiSeq and 4.5 million per cell on MiSeq). Although we had only four replicates per cell type, we produced a reliable dataset.

Assessment of technical variability and reproducibility

Our results show limited technical variation within each batch: the expression levels in different cell types from the same embryo are well correlated for embryos 2, 3 and 4. They are, in fact, more similar to each other than the same cell types are across different individuals (Fig. 2a–b;→, Supplementary Fig. 1). Although we cannot separate out the sources of cross-embryo variation, this result is consistent with a previous report showing that maternal mRNA levels vary significantly between unfertilized eggs from different individuals²⁸. It is also worth noting that very little of the variation between embryos is from the sequencing run. This is consistent with previous results showing high correlation between expression measurements from tens of millions of reads per cell and those from lower coverage of a million or fewer reads^29,30.

This embryo batch effect is further demonstrated by a Principal Components Analysis (Fig. 2b), which shows a similar result with the cell types of embryos 2, 3 and 4 being close to each on the first two components (which explain 56% of the variance) and the cell types of embryo 1 being more spread out. Embryo 1 belonged to a healthy developmental batch (Supplementary Table 2) and we suggest that its outlier status is due to technical variability introduced during library preparation, specifically cDNA synthesis and amplification.

The close clustering of cells from the same embryo, as well as their high correlation, suggests that our technical variability is low, leading to reproducibility within each batch (or embryo). A confirmation of the reproducibility of our results is the tight distribution of genes detected across samples within embryos (Supplementary Fig. 1). Genes were considered detected when the measured count was greater than zero. These results show that slightly more genes were detected on HiSeq than MiSeq, but that the median difference for each embryo is less than 10%. This can be compared with a previous result showing a reduction of genes detected of around 39% when lowering sequence coverage to less than a million reads per cell²⁹. As before, embryo 1 showed more variability across samples than the other embryos.

Further, our data can be compared to previously published data for the 16-cell stage that was generated using gene expression microarrays from many pooled single cells²⁸. We found good agreement with our scRNA-seq expression patterns for the key genes shown in their paper (Fig. 2c).

Normalisation of counts

After producing a dataset of counts, we normalised our data. Since we were interested in gene expression differences between different cell types, we did not normalize across genes (such as by GC content or transcript length), but only for sequencing depth by dividing by the total number of reads per sample. More complex normalisation steps could be considered³¹, but this simple approach suffices for our data. It gives a natural measure of expression for each gene, namely the proportion it contributes to the total, which we assume is independent of the total number of reads.

We then made use of a suitable transformation of proportions, the arcsine of the square root (referred to here as the φ transformation), namely:

$${\varphi }_{ij}=2\,\arcsin \left(\sqrt{\frac{{k}_{ij}}{{N}_{j}}}\right)$$

where k_ij is the count for the i^th gene and j^th sample and N_j is the total number of counts for the j^th sample. The difference between φ_i of two samples can be interpreted as an effect size index for proportions, namely Cohen’s h³².

In practice, the arcsine function in the φ transformation is largely redundant because most genes are expressed at low proportions. At these low values, the arcsine transformation is close to identity, meaning that a square root transformation of proportions performs equivalently in many cases.

Finding putative gene expression patterns

We then took a simple and effective approach to find gene expression patterns. Instead of grouping cells into a single set of clusters, we clustered the gene expression pattern of each gene separately (Fig. 3). For each gene, we grouped the different cell types into two classes of ON and OFF expression. We took advantage of our known cell types and calculated the Euclidean distance between vectors of replicate measurements for each cell and performed single-linkage hierarchical clustering of these (bottom-up, agglomerative clustering of the cells). The resulting two top-level clusters determined the ON/OFF pattern of each cell type for each gene (Fig. 3).

We then ranked these results, by taking an approach that does not require parametric estimation of variation or dispersion. We calculated our cluster reliability score as the difference between the first quartile of the ON cluster and the third quartile of the OFF cluster, which we call the Transquartile Range (TQR). The TQR is larger when the difference in cluster means is larger, but it penalizes higher variation for a given difference in means.

Ranking cell-specific gene expression patterns

We applied this approach and produced a list of ranked genes and their expression patterns. We examined our results for 77 genes (Supplementary Table 1) with known in situ patterns^28,33,34,35. Our top 35 results matched the known in situ patterns (Supplementary Fig. 2a), except for KH.L152.12, which is validated below as a new pattern by in situ and qPCR (see below). However, the lower ranked results did not correspond well (Supplementary Fig. 2b).

There were a few reasons for this. For example, clustering did not produce a correct pattern for Lefty (Supplementary Fig. 2b). The cell types, A5.2 and B5.1, are normally considered part of the Lefty pattern, but they have intermediate expression in our scRNA-seq results and, in this instance, the clustering algorithm places them in the OFF cluster. For a few other genes, e.g. DPOZ (KH.C12.589) and Dlx.b, the clustering is correct, but the TQR is low. For a few other genes, no reads were mapped, e.g. Sox7/17/18, Fringe 2 and KH.C13.22, but in most cases where our scRNA-seq data does not agree with published in situ patterns, our expression measurements were low or relatively uniform across the eight cells and thus the algorithm functions correctly in attributing lower score to these results. Many of these genes are expressed maternally in the Ciona embryo and are de-adenylated during the maternal-to-zygotic transition^36,37; thus they might not be easily detected by our RNA-seq protocol, which amplifies from the poly(A) tail of mRNA. In contrast, in situ hybridisation can detect localised maternal signal in the cytoplasm, which could explain the discrepancy. It is also possible that some in situ results are false positives.

Validating cell-specific gene expression patterns

Using these known in situ results, we assessed that the top 40 ranked results from all genes were likely to be reliable and focused on these for further validation (Fig. 4). We found 12 distinct patterns in the top 40, which included all known patterns as well as three potentially new patterns (highlighted in orange in Fig. 4). Ten patterns are currently known at this stage in Ciona^{28,33,38,39,40,41,42}, and although we matched only nine of these directly (Fig. 1b), the tenth pattern has a single known case, Tfap2-r.b (AP-2-like2), which we did pick up, but without expression in A5.2. This agrees with previous observations that expression is not consistent in this cell across embryos^28,33. Further, it is in agreement with the average over many embryos as measured by microarray²⁸.

We validated one of the potentially new patterns, namely the pattern for KH.L152.12, by in situ in biological duplicates and single-cell qPCR in triplicate (Fig. 5e). This observation does not necessarily negate previously published ones for KH.L152.12, but rather highlights the value of our algorithm in identifying new patterns from scRNA-seq data comprising just four embryos. We could not validate the second pattern for KH.C1.933 that has relatively ubiquitous expression, although with apparently reduced expression in b5.4.

In addition to recovering all known patterns and validating one novel pattern, we found at least 28 genes with known in situ patterns. Of the 12 patterns we found, the pattern with expression in the B5.2 cell only was the most represented. This is also the most frequent pattern in known in situ patterns, i.e. postplasmic/PEM RNAs³⁵. The majority of our results for B5.2 are confirmed by previous in situ datasets, but we identified new B5.2-specific genes, such as KH.C13.98 and KH.C12.212, confirming their expression by in situ and single-cell qPCR (Fig. 5a).

We also validated other classes of uncharacterized genes, namely KH.S1497.1, which expresses specifically in the animal hemisphere, and KH.C11.529 on the anterior side (Fig. 5c–d). These results are particularly striking, since it was expected that no further zygotic gene expression patterns would be found at this stage in Ciona²³.

We also looked more widely in the top 60 (Fig. 4, Supplementary Fig. 3) and validated additional genes, KH.C8.450, KH.C9.289 and KH.C4.260, by single-cell qPCR and in situ hybridisation (Fig. 5a-b). The first is another example of B5.2 expression, whereas the last two genes are expressed in all cells except B5.2, a pattern known previously from Hes.a⁴³. While developing our approach, we also applied the TQR to the microarray data of this stage²⁸ and looked for reliable B5.2 cell-specific expression using both datasets. As a result, we also found and validated KH.L60.2 and KH.C14.501 (ranked 88 and 2403 in our data; see Supplementary Table 7).

In conclusion, we have recovered many known patterns, as well as patterns and genes that had not been detected previously despite extensive in situ screens. These results open up opportunities for further research into developmental patterning in Ciona. In addition, we have demonstrated that single-cell RNA-seq is a viable alternative to extensive in situ screens, offering a promising approach for finding genes with cell-specific expression in less well-studied organisms.

Methods

Study design

We isolated cells from five 16-cell stage Ciona embryos, each on a different day (Supplementary Table 2). Early ascidian embryos are thought to be bilaterally symmetrical, but to avoid any potential bilateral variation in our small sample, we collected eight cells from the right side of each embryo. The cells were collected individually in batches of eight cells from the same embryo on the same day, with sequencing libraries prepared in parallel, barcoded and then sequenced together. This means that biological variation between embryos and technical variation between batches cannot be distinguished. The advantage of this design is that it minimizes technical variation between cell types of the same embryo and controls for confounding technical and biological variation between embryos. Averaging across the cell types of different batches reduces this unwanted variation, maintaining cell-specific variation. Our results show that cells from the same embryo are more similar to each other than the same cell types are across individuals, with a similar number of genes detected per cell type (Fig. 2a-b, Supplementary Fig. 1).

Preparation of Ciona embryos

Ciona intestinalis type A, recently designated Ciona robusta^44,45, adults were obtained from Maizuru Fisheries Research Station (Kyoto University) and Misaki Marine Biological station (The University of Tokyo) under the National Bio-Resource Project for Ciona. They were maintained in an aquarium in our laboratory at Okinawa Institute of Science and Technology Graduate University under constant light (Calcitrans, Nisshin Marinetech Co., Ltd.) for three days apart from a few hours of darkness a day with feeding to induce spawning of the old eggs. After this, the Ciona were maintained under constant light to induce oocyte maturation. Eggs and sperm were obtained surgically from the gonoducts. Embryos were dechorionated after insemination using a solution of 0.07% actinase and 1.3% sodium thioglycolate. Eggs were reared to reach the 16-cell stage in Millipore-filtered seawater (MFSW) at about 18 °C. Embryos from each insemination batch were kept to check the ratio that developed into morphologically normal tailbud. We only used embryos from batches where more than 70% developed normally to tailbud (10 hours post fertilization at 18 °C) (see Supplementary Table 2 for embryo batch information).

Naming of cells

In Ciona, cells are named using the nomenclature of Conklin²⁴: the animal side is prefixed with a lowercase letter (a or b) and the vegetal with an uppercase letter (A or B); the anterior with A or a and the posterior with B or b. The initial letter is followed by a number that indicates the embryo stage since fertilization, with individual cells numbered according to their lineage. At the 16-cell stage, the animal domain corresponds to a5.3, a5.4, b5.3 and b5.4, the vegetal domain to A5.1, A5.2, B5.1 and B5.2, and postplasmic RNAs are localized to B5.2.

Isolation of single cells at the 16-cell stage

At a defined point in development of the 16-cell embryo i.e., at the stage immediately after compaction of the embryo (2.5~2.6 hours post fertilization), the embryo was transferred to 4 °C to slow its development. Each blastomere was isolated with a fine glass needle in a mannitol solution (0.77 M mannitol: MFSW, 9:1) under a stereo microscope at 4 °C regulated by a thermo plate (Tokai Hit Co., Ltd.) and its identity noted. Isolated blastomeres were picked up and transferred immediately with a mouth pipet into a lysis buffer⁴⁶ for reverse transcription.

Library preparation

We followed the single-cell library preparation method of Tang et al.^46,47 with some modification. We added ERCC spike-in RNA (Thermo Fisher scientific, 4456740, 1:80000) to each lysis buffer and applied 14 and then 9 cycles of PCR amplification after second strand synthesis. Amplified cDNA was purified with MinElute PCR Purification kit (28006, QIAGEN) and QIAquick PCR Purification Kit (28106, QIAGEN) after each PCR reaction respectively and its concentration measured with Qubit® 2.0 Fluorometer (Q32866, Life Technologies) to have more than 150 ng total yield of cDNA. The quality of the amplified cDNA and distribution of DNA fragment size were confirmed by Agilent 2100 Bioanalyzer (Agilent Technologies) with High Sensitivity DNA Kit (5067–4626, Agilent) to consist mainly of 1.0–1.5 kb fragments.

Amplified cDNAs were sheared using sonication Covaris S2 System to produce DNA of 300 bp on average. The settings were as follows: Duty cycle: 20%, Intensity: 5, Cycles per burst: 200, Power mode - Frequency sweeping, Treatment time: 90 seconds, Temperature: 12 °C.

NEB Next® ChIP-Seq Library Prep Master Mix Set for Illumina® (E6240, NEB) was applied to sheared cDNA for preparation of the library for the Illumina platform. NEBNext® Multiplex Oligos for Illumina (E7335, E7500, NEB Next Multiplex Oligos for Illumina, NEB) were combined to introduce an index and adaptor to the double-stranded DNA. After extraction of the 300 bp fraction of adaptor-ligated DNA by E-Gel Size Select 2% Agarose (G661002, Invitrogen), DNA was amplified with individual index primers using PCR with 19 cycles.

The amplified DNA fragment composition was purified with Agencourt AMPure XP twice (A63881, Beckman) and again checked by Qubit (> 60 ng of cDNA in total yield) and by Bioanalyzer to ensure that the fragment size was sharply distributed around 300 bp (on average, about 320 bp with a standard deviation of 40). The concentration of fragments with appropriate index adapters was quantified by KAPA Library Quantification Kits (KAPA Library Quantification Kits, Illumina GA/Universal, KK4825, Genetics) to ensure that the final libraries had adapters for both ends and their concentration was at least 20 pM.

Data generation and quality checking

Libraries were sequenced on Illumina’s (San Diego, CA) MiSeq benchtop sequencer and Illumina HiSeq2500. Libraries were prepared with different index primers and sequenced on MiSeq using paired 150 nt reads (No. MS-102–2002, MiSeq Reagent Kit v2) with eight multiplexed samples per run with the standard Illumina protocols. The same libraries were sequenced on an Illumina HiSeq 2500 with 150 bp paired end reads (No. PE-402-4001 and FC-402-4001, TruSeq Rapid Cluster - Paired-End and SBS Kits) with 16 multiplexed samples per lane following standard Illumina protocols. Our results from using HiSeq and MiSeq were similar (Supplementary Figs. 1, 3 and 4).

The resulting reads were aligned using Bowtie⁴⁸ version 2.2.6 to the Ciona KH genome assembly^49,50, downloaded from Ghost (http://ghost.zool.kyoto-u.ac.jp/download_kh.html). Reads were mapped using local alignment (–local), with other settings at their default. We did not trim or filter reads, but instead made use of local alignment to find the optimal match. This had the additional benefit that we did not need to split up reads to handle transcripts spanning more than one intron, as is done, for example, in TopHat⁵¹. Gene counts were calculated from the resulting alignment files using htseq-count⁵² with the non-stranded option and mode “intersection-nonempty” against the KH gene models (version 2013) downloaded from Ghost.

We assessed our samples for mapping quality. We excluded one embryo from subsequent analysis since it had oligo-dT primer sequence in more than 50% of its read pairs; the remaining four embryos had less than 1% of read pairs affected. All remaining samples mapped well to the genome (Supplementary Table 3) and a uniform number of genes were detected (about 60%), although embryo 1 had noticeably fewer detected genes for some of its cells.

Pattern discovery

Hierarchical clustering to determine candidate patterns was performed with ClusteringComponents in Mathematica 10.4 with the Agglomerate method and Euclidean distance function. This is equivalent to hclust in R with the single linkage method. The Transquartile Range (TQR) was calculated as the difference between the first quartile of the ON cluster and the third quartile of the OFF cluster. The quantile method for the TQR used linear interpolation equivalent to type 5 in the R quantile function (the hydrologist method). The resulting patterns and TQR score are listed in Supplementary Table 7.

Single-cell qPCR analysis

cDNA was reverse transcribed from all cells of one embryo per gene replicate using the same protocol we used for single-cell RNA-seq^46,47. Quantitative PCR was performed using a StepOnePlus PCR machine (Applied Biosystems) with the SYBR green method (No. RR820B, Takara). Each gene was measured with either two or three replicates, except KH.L152.12, which had four. We did not get reliable measurements for KH.S1497.1. The qPCR measures for the cell types of each embryo were scaled between 0 and 1 and then averaged for each cell type across replicates. If there was insufficient target mRNA, it was first amplified using primers covering a wider region of the target gene than those used for single-cell qPCR. Amplification of a specific product in each reaction was confirmed by determining a dissociation curve and comparing with the relevant standard plasmid to estimate the copy number in each cell type. The primers for single-cell qPCR analysis, the IDs for the cDNA clones and the resulting data are listed in Supplementary Tables 4–6.

In situ hybridisation

Whole-mount in situ hybridisation was carried out as previously described with minor modification⁵³. Dig-labeled antisense RNA probes were synthesized in vitro from cDNAs from the Ciona cDNA project⁵⁴. mRNA expression was visualized using the NBT/BCIP system (Roche, No. 11681451001) and detected on a Zeiss Axio Imager Z1 microscope using Differential Interference Contrast (DIC). The images were acquired with Axiovision SE64 release 4.9.1. Contrast and brightness were adjusted for some images using Adobe Photoshop. The IDs for the cDNA clones are shown in Supplementary Table 5.

Microarray processing

Previously published microarray data²⁸ was processed with the limma R package⁵⁵. Background was corrected using normexp and arrays were normalised with the quantile method.

Gene models and names

Gene names for the KH 2013 gene models were downloaded from Ghost (http://ghost.zool.kyoto-u.ac.jp/TF_KH.html and http://ghost.zool.kyoto-u.ac.jp/ST_KH.html) and supplemented with names from Prodon et al.³⁵.

Data availability

RNA-seq data have been deposited in the ArrayExpress database at EMBL-EBI (www.ebi.ac.uk/arrayexpress) under accession number E-MTAB-6117.

Code availability

Software is available at https://github.com/ilsley/Ciona16.

References

Björklund, Å. K. et al. The heterogeneity of human CD127+ innate lymphoid cells revealed by single-cell RNA sequencing. Nat. Immunol. 17, 451–460 (2016).
Article CAS PubMed Google Scholar
Grün, D. et al. Single-cell messenger RNA sequencing reveals rare intestinal cell types. Nat. 525, 251–255 (2015).
Article ADS CAS Google Scholar
Jaitin, D. A. et al. Massively Parallel Single-Cell RNA-Seq for Marker-Free Decomposition of Tissues into Cell Types. Sci. 343, 776–779 (2014).
Article ADS CAS Google Scholar
Kiselev, V. Y. et al. SC3: consensus clustering of single-cell RNA-seq data. Nat. Meth 14, 483–486 (2017).
Article CAS Google Scholar
Usoskin, D. et al. Unbiased classification of sensory neuron types by large-scale single-cell RNA sequencing. Nat. Neurosci. 18, 145–153 (2015).
Article CAS PubMed Google Scholar
Mojtahedi, M. et al. Cell Fate Decision as High-Dimensional Critical State Transition. PLOS Biol. 14, e2000640 (2016).
Article CAS PubMed PubMed Central Google Scholar
Olsson, A. et al. Single-cell analysis of mixed-lineage states leading to a binary cell fate choice. Nat. 537, 698–702 (2016).
Article ADS CAS Google Scholar
Richard, A. et al. Single-Cell-Based Analysis Highlights a Surge in Cell-to-Cell Molecular Variability Preceding Irreversible Commitment in a Differentiation Process. PLOS Biol. 14, e1002585 (2016).
Article CAS PubMed PubMed Central Google Scholar
Trapnell, C. et al. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat. Biotech. 32, 381–386 (2014).
Article CAS Google Scholar
Biase, F. H., Cao, X. & Zhong, S. Cell fate inclination within 2-cell and 4-cell mouse embryos revealed by single-cell RNA sequencing. Genome Res. 24, 1787–1796 (2014).
Article CAS PubMed PubMed Central Google Scholar
Cao, J. et al. Comprehensive single-cell transcriptional profiling of a multicellular organism. Sci. 357, 661–667 (2017).
Article ADS CAS Google Scholar
Deng, Q., Ramsköld, D., Reinius, B. & Sandberg, R. Single-Cell RNA-Seq Reveals Dynamic, Random Monoallelic Gene Expression in Mammalian Cells. Sci. 343, 193–196 (2014).
Article ADS CAS Google Scholar
Goolam, M. et al. Heterogeneity in Oct4 and Sox2 Targets Biases Cell Fate in 4-Cell Mouse Embryos. Cell 165, 61–74 (2016).
Article CAS PubMed PubMed Central Google Scholar
Hashimshony, T., Wagner, F., Sher, N. & Yanai, I. CEL-Seq: single-cell RNA-Seq by multiplexed linear amplification. Cell Rep. 2, 666–673 (2012).
Article CAS PubMed Google Scholar
Ibarra-Soria, X. et al. Defining murine organogenesis at single-cell resolution reveals a role for the leukotriene pathway in regulating blood progenitor formation. Nat. Cell Biol. 20, 127–134 (2018).
Article CAS PubMed PubMed Central Google Scholar
Karaiskos, N. et al. The Drosophila embryo at single-cell transcriptome resolution. Sci. 358, 194–199 (2017).
Article ADS CAS Google Scholar
Scialdone, A. et al. Resolving early mesoderm diversification through single-cell expression profiling. Nat. 535, 289–293 (2016).
Article ADS CAS Google Scholar
Tintori, S. C., Osborne Nishimura, E., Golden, P., Lieb, J. D. & Goldstein, B. A Transcriptional Lineage of the Early C. elegans Embryo. Developmental Cell 38, 430–444 (2016).
Article CAS PubMed PubMed Central Google Scholar
Xue, Z. et al. Genetic programs in human and mouse early embryos revealed by single-cell RNA sequencing. Nat. 500, 593–597 (2013).
Article ADS CAS Google Scholar
Yan, L. et al. Single-cell RNA-Seq profiling of human preimplantation embryos and embryonic stem cells. Nat. Struct. Mol. Biol. 20, 1131–1139 (2013).
Article CAS PubMed Google Scholar
Shirae-Kurabayashi, M., Matsuda, K. & Nakamura, A. Ci-Pem-1 localizes to the nucleus and represses somatic gene transcription in the germline of Ciona intestinalis embryos. Dev. 138, 2871–2881 (2011).
Article CAS Google Scholar
Lamy, C., Rothbächer, U., Caillol, D. & Lemaire, P. Ci-FoxA-a is the earliest zygotic determinant of the ascidian anterior ectoderm and directly activates Ci-sFRP1/5. Dev. 133, 2835–2844 (2006).
Article CAS Google Scholar
Satou, Y. & Imai, K. S. Gene regulatory systems that control gene expression in the Ciona embryo. Proc. Jpn. Academy, Ser. B 91, 33–51 (2015).
Article ADS CAS Google Scholar
Conklin, E. G. The organization and cell-lineage of the ascidian egg. J. Acad. Nat. Sci. Phila. 13, 1–119 (1905).
Google Scholar
Lemaire, P. Unfolding a chordate developmental program, one cell at a time: Invariant cell lineages, short-range inductions and evolutionary plasticity in ascidians. Developmental Biol. 332, 48–60 (2009).
Article CAS Google Scholar
Nishida, H. Specification of embryonic axis and mosaic development in ascidians. Dev. Dyn. 233, 1177–1193 (2005).
Article CAS PubMed Google Scholar
Treen, N., Heist, T., Wang, W. & Levine, M. Depletion of Maternal Cyclin B3 Contributes to Zygotic Genome Activation in the Ciona Embryo. Curr. Biol. 28, 1150–1156.e4 (2018).
Article CAS PubMed PubMed Central Google Scholar
Matsuoka, T., Ikeda, T., Fujimaki, K. & Satou, Y. Transcriptome dynamics in early embryos of the ascidian, Ciona intestinalis. Developmental Biol. 384, 375–385 (2013).
Article CAS Google Scholar
Pollen, A. A. et al. Low-coverage single-cell mRNA sequencing reveals cellular heterogeneity and activated signaling pathways in developing cerebral cortex. Nat. Biotech. 32, 1053–1058 (2014).
Article CAS Google Scholar
Shalek, A. K. et al. Single-cell RNA-seq reveals dynamic paracrine control of cellular variation. Nat. 510, 363–369 (2014).
Article ADS CAS Google Scholar
Vallejos, C. A., Risso, D., Scialdone, A., Dudoit, S. & Marioni, J. C. Normalizing single-cell RNA sequencing data: challenges and opportunities. Nat. Methods 14, 565–571 (2017).
Article CAS PubMed PubMed Central Google Scholar
Cohen, J. Statistical Power Analysis for the Behavioral Sciences. (Routledge, 1988).
Imai, K. S., Hino, K., Yagi, K., Satoh, N. & Satou, Y. Gene expression profiles of transcription factors and signaling molecules in the ascidian embryo: towards a comprehensive understanding of gene networks. Dev. 131, 4047–4058 (2004).
Article CAS Google Scholar
Miwata, K. et al. Systematic analysis of embryonic expression profiles of zinc finger genes in Ciona intestinalis. Developmental Biol. 292, 546–554 (2006).
Article CAS Google Scholar
Prodon, F., Yamada, L., Shirae-Kurabayashi, M., Nakamura, Y. & Sasakura, Y. Postplasmic/PEM RNAs: A class of localized maternal mRNAs with multiple roles in cell polarity and development in ascidian embryos. Dev. Dyn. 236, 1698–1715 (2007).
Article CAS PubMed Google Scholar
Tadros, W. & Lipshitz, H. D. The maternal-to-zygotic transition: a play in two acts. Dev. 136, 3033–3042 (2009).
Article CAS Google Scholar
Li, L., Zheng, P. & Dean, J. Maternal control of early mouse development. Dev. 137, 859–870 (2010).
Article CAS Google Scholar
Bertrand, V., Hudson, C., Caillol, D., Popovici, C. & Lemaire, P. Neural Tissue in Ascidian Embryos Is Induced by FGF9/16/20, Acting via a Combination of Maternal GATA and Ets Transcription Factors. Cell 115, 615–627 (2003).
Article CAS PubMed Google Scholar
Hamaguchi, M., Fujie, M., Noda, T. & Satoh, N. Microarray analysis of zygotic expression of transcription factor genes and cell signaling molecule genes in early Ciona intestinalis embryos. Development, Growth Differ. 49, 27–37 (2007).
Article CAS Google Scholar
Hudson, C. & Yasuo, H. Patterning across the ascidian neural plate by lateral Nodal signalling sources. Dev. 132, 1199–1210 (2005).
Article CAS Google Scholar
Imai, K. S., Levine, M., Satoh, N. & Satou, Y. Regulatory Blueprint for a Chordate Embryo. Sci. 312, 1183–1187 (2006).
Article ADS CAS MATH Google Scholar
Shi, W. & Levine, M. Ephrin signaling establishes asymmetric cell fates in an endomesoderm lineage of the Ciona embryo. Dev. 135, 931–940 (2008).
Article CAS Google Scholar
Satou, Y., Kawashima, T., Shoguchi, E., Nakayama, A. & Satoh, N. An Integrated Database of the Ascidian, Ciona intestinalis: Towards Functional Genomics. Zool. Sci. 22, 837–843 (2005).
Article CAS Google Scholar
Hoshino, Z. & Tokioka, T. An unusually robust Ciona from the northeastern coast of Honsyu Island, Japan. Publ. Seto Mar. Biol. Lab. 15, 275–290 (1967).
Article Google Scholar
Pennati, R. et al. Morphological Differences between Larvae of the Ciona intestinalis Species Complex: Hints for a Valid Taxonomic Definition of Distinct Species. PLOS ONE 10, e0122879 (2015).
Article CAS PubMed PubMed Central Google Scholar
Tang, F. et al. RNA-Seq analysis to capture the transcriptome landscape of a single cell. Nat. Protoc. 5, 516–535 (2010).
Article CAS PubMed Google Scholar
Tang, F. et al. mRNA-Seq whole-transcriptome analysis of a single cell. Nat. Meth 6, 377–382 (2009).
Article CAS Google Scholar
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Meth 9, 357–359 (2012).
Article CAS Google Scholar
Dehal, P. et al. The Draft Genome of Ciona intestinalis: Insights into Chordate and Vertebrate Origins. Sci. 298, 2157–2167 (2002).
Article ADS CAS Google Scholar
Satou, Y. et al. Improved genome assembly and evidence-based global gene model set for the chordate Ciona intestinalis: new insight into intron and operon populations. Genome Biol. 9, R152 (2008).
Article CAS PubMed PubMed Central Google Scholar
Kim, D. et al. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 14, R36 (2013).
Article CAS PubMed PubMed Central Google Scholar
Anders, S., Pyl, P. T. & Huber, W. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinforma. 31, 166–169 (2015).
Article CAS Google Scholar
Wada, S., Katsuyama, Y., Yasugi, S. & Saiga, H. Spatially and temporally regulated expression of the LIM class homeobox gene Hrlim suggests multiple distinct functions in development of the ascidian, Halocynthia roretzi. Mechanisms Dev. 51, 115–126 (1995).
Article CAS PubMed Google Scholar
Satou, Y. et al. A cDNA resource from the basal chordate Ciona intestinalis. Genes. 33, 153–154 (2002).
Article CAS Google Scholar
Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47–e47 (2015).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank the staff of the Maizuru Fisheries Research Station of Kyoto University and Misaki Marine Biological Station of the University of Tokyo for collecting and cultivating Ciona under the National BioResource Project (NBRP) of MEXT, Japan, and RIKEN BRC for providing Ciona EST clones through the NBRP. We thank Vladimir Benes and Dinko Pavlinic in the Genomics Core Facility at the European Molecular Biology Laboratory (EMBL) for initial advice on the library preparation protocol and the members of the OIST DNA Sequencing Section for their support in running our samples on their Illumina MiSeq and HiSeq machines. We also thank Sylvain Guillot for his technical support, Filipe Tavares-Cadete for early feedback on the method, Kenji Kobayashi for a helpful comment on the validation experiment and Yutaka Satou advising the microarray data analysis. This work was supported by core funding from OIST to the Genomics & Regulatory Systems and Marine Genomics Units.

Author information

Ritsuko Suyama
Present address: Graduate School of Frontier Biosciences, Osaka University, 1-3 Yamadaoka, Suita, Osaka, 565-0871, Japan
Takeshi Noda
Present address: Shinshu University, Matsumoto, Nagano, 390-8621, Japan
These author contributed equally: Garth R. Ilsley and Ritsuko Suyama.

Authors and Affiliations

Okinawa Institute of Science and Technology Graduate University, Onna, Okinawa, 904-0495, Japan
Garth R. Ilsley, Ritsuko Suyama, Takeshi Noda, Nori Satoh & Nicholas M. Luscombe
European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom
Garth R. Ilsley
The Francis Crick Institute, 1 Midland Road, London, NW1 1AT, UK
Nicholas M. Luscombe
UCL Genetics Institute, University College London, Gower Street, London, WC1E 6BT, UK
Nicholas M. Luscombe

Authors

Garth R. Ilsley
View author publications
You can also search for this author in PubMed Google Scholar
Ritsuko Suyama
View author publications
You can also search for this author in PubMed Google Scholar
Takeshi Noda
View author publications
You can also search for this author in PubMed Google Scholar
Nori Satoh
View author publications
You can also search for this author in PubMed Google Scholar
Nicholas M. Luscombe
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

N.M.L. and N.S. established and supervised the project. All authors contributed to the study design. R.S. and T.N. optimized the experimental protocols, collected and prepared the samples and sequencing libraries. R.S. designed and performed the in situ and qPCR analysis. G.R.I. conceived and designed the normalization, gene testing and pattern discovery method and performed the bioinformatics analysis. G.R.I., R.S. and N.M.L. wrote the paper and all authors edited and approved the final manuscript.

Corresponding author

Correspondence to Nicholas M. Luscombe.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Figures.

Supplementary Table S1.

Supplementary Table S2.

Supplementary Table S3.

Supplementary Table S4.

Supplementary Table S5.

Supplementary Table S6.

Supplementary Table S7.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ilsley, G.R., Suyama, R., Noda, T. et al. Finding cell-specific expression patterns in the early Ciona embryo with single-cell RNA-seq. Sci Rep 10, 4961 (2020). https://doi.org/10.1038/s41598-020-61591-1

Download citation

Received: 19 October 2018
Accepted: 24 February 2020
Published: 18 March 2020
DOI: https://doi.org/10.1038/s41598-020-61591-1

This article is cited by

Embryos assist morphogenesis of others through calcium and ATP signaling mechanisms in collective teratogen resistance
- Angela Tung
- Megan M. Sperry
- Michael Levin
Nature Communications (2024)
Single-cell analysis of cell fate bifurcation in the chordate Ciona
- Konner M. Winkley
- Wendy M. Reeves
- Michael T. Veeman
BMC Biology (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.