Abstract
We developed a massive-scale RNA sequencing protocol, short quantitative random RNA libraries or SQRL, to survey the complexity, dynamics and sequence content of transcriptomes in a near-complete fashion. This method generates directional, random-primed, linear cDNA libraries that are optimized for next-generation short-tag sequencing. We surveyed the poly(A)+ transcriptomes of undifferentiated mouse embryonic stem cells (ESCs) and embryoid bodies (EBs) at an unprecedented depth (10 Gb), using the Applied Biosystems SOLiD technology. These libraries capture the genomic landscape of expression, state-specific expression, single-nucleotide polymorphisms (SNPs), the transcriptional activity of repeat elements, and both known and new alternative splicing events. We investigated the impact of transcriptional complexity on current models of key signaling pathways controlling ESC pluripotency and differentiation, highlighting how SQRL can be used to characterize transcriptome content and dynamics in a quantitative and reproducible manner, and suggesting that our understanding of transcriptional complexity is far from complete.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
£169.00 per year
only £14.08 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
Accession codes
References
Boguski, M.S. & Schuler, G.D. Establishing a human transcript map. Nat. Genet. 10, 369–371 (1995).
Lee, Y. et al. The TIGR gene indices: clustering and assembling EST and known genes and integration with eukaryotic genomes. Nucleic Acids Res. 33, D71–D74 (2005).
Carninci, P. et al. The transcriptional landscape of the mammalian genome. Science 309, 1559–1563 (2005).
Kawai, J. et al. Functional annotation of a full-length mouse cDNA collection. Nature 409, 685–690 (2001).
Okazaki, Y. et al. Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs. Nature 420, 563–573 (2002).
Birney, E. et al. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 447, 799–816 (2007).
Cawley, S. et al. Unbiased mapping of transcription factor binding sites along human chromosomes 21 and 22 points to widespread regulation of noncoding RNAs. Cell 116, 499–509 (2004).
Engstrom, P.G. et al. Complex loci in human and mouse genomes. PLoS Genet. 2, 564–577 (2006).
Kapranov, P. et al. RNA maps reveal new RNA classes and a possible function for pervasive transcription. Science 316, 1484–1488 (2007).
Gardina, P.J. et al. Alternative splicing and differential gene expression in colon cancer detected by a whole genome exon array. BMC Genomics 7, 325 (2006).
Clark, T.A., Sugnet, C.W. & Ares, M. Genome-wide analysis of mRNA processing in yeast using splicing-specific microarrays. Science 296, 907–910 (2002).
Reinartz, J. et al. Massively parallel signature sequencing (MPSS) as a tool for in-depth quantitative gene expression profiling in all organisms. Brief. Funct. Genomic. Proteomic. 1, 95–104 (2002).
Velculescu, V.E. et al. Serial analysis of gene expression. Science 270, 484–487 (1995).
Shiraki, T. et al. Cap analysis gene expression for high-throughput analysis of transcriptional starting point and identification of promoter usage. Proc. Natl. Acad. Sci. USA 100, 15776–15781 (2003).
Kim, J.B. et al. Polony multiplex analysis of gene expression (PMAGE) in mouse hypertrophic cardiomyopathy. Science 316, 1481–1484 (2007).
Bruce, S.J. et al. Dynamic transcription programs during ES cell differentiation towards mesoderm in serum versus serum-free (BMP4) culture. BMC Genomics 8, 365 (2007).
Hirst, C.E. et al. Transcriptional profiling of mouse and human ES cells identifies SLAIN1, a novel stem cell gene. Dev. Biol. 293, 90–103 (2006).
Faulkner, G.J. et al. A rescue strategy for multimapping short sequence tags refines surveys of transcriptional activity by CAGE. Genomics 91, 281–288 (2008).
Karolchik, D. et al. The UCSC genome browser database: 2008 update. Nucleic Acids Res. 36 (Suppl. 1), D773–779 (2008).
Thierry-Mieg, D. & Thierry-Mieg, J. AceView: a comprehensive cDNA-supported gene and transcripts annotation. Genome Biol. 7 (Suppl. 1), S12 (2006).
Kiyosawa, H. et al. Disclosing hidden transcripts: mouse natural sense-antisense transcripts tend to be poly(A) negative and nuclear localized. Genome Res. 15, 463–474 (2005).
Jeck, W.R. et al. Extending assembly of short DNA sequences to handle error. Bioinformatics 23, 2942–2944 (2007).
Kent, W.J. BLAT—the BLAST-like alignment tool. Genome Res. 12, 656–664 (2002).
Zhou, Q. et al. A gene regulatory network in mouse embryonic stem cells. Proc. Natl. Acad. Sci. USA 104, 16438–16443 (2007).
Lister, R. et al. Highly integrated single-base resolution maps of the epigenome in Arabidopsis. Cell 133, 523–536 (2008).
Nagalakshmi, U. et al. The transcriptional landscape of the yeast genome defined by RNA sequencing. Science published online, doi:10.1126/science.1158441 (1 May 2008).
Bruce, S.J. et al. In vitro differentiation of murine embryonic stem cells toward a renal lineage. Differentiation 75, 337–349 (2007).
Matz, M. et al. Amplification of cDNA ends based on template-switching effect and step-out PCR. Nucleic Acids Res. 27, 1558–1560 (1999).
Ohtake, H. et al. Determination of the capped site sequence of mRNA based on the detection of Cap-dependent nucleotide addition using an anchor ligation method. DNA Res. 11, 305–309 (2004).
Schmidt, W.M. & Mueller, M.W. CapSelect: a highly sensitive method for 5′ CAP–dependent enrichment of full-length cDNA in PCR-mediated analysis of mRNAs. Nucleic Acids Res. 27, e31 (1999).
Shendure, J. et al. Accurate multiplex polony sequencing of an evolved bacterial genome. Science 309, 1728–1732 (2005).
Smyth, G.K. Linear models and empirical Bayes methods for assessing differential expression in microarray experiments. Stat. Appl. Genet. Mol. Biol. 3, 3 (2004).
Acknowledgements
A.R.R.F. and S.M.G. are funded by the Australian National Health and Medical Research Council. A.R.R.F. is a C.J. Martin fellow (428261), and S.M.G. is a senior research fellow. N.C. is a University of Queensland postdoctoral fellow. G.J.F. and M.K.B. are supported by Australian Postgraduate awards. We acknowledge the Australian Research Council Centre for Functional and Applied Genomics Array Facility for expression profiling, and R. Nutter, G. Weightman and L. Stubberfield for support of this initiative.
Author information
Authors and Affiliations
Contributions
N.C. created and integrated the sequence mapping and visualization pipeline, performed SOLiD sequencing bioinformatics, SNP analysis and splicing studies. A.R.R.F. conceived and pioneered the SQRL library strategy, performed preliminary genomic analysis, and developed the initial visualization methods. G.K. led the array-SQRL analyses, RT-PCR, pathway analysis and contributed to SNP analysis. B.B.A.G., M.K.B., G.K. and N.C. contributed to method design. A.L.S., G.K., S.J.B. and A.C.P. contributed to sample generation. G.K., A.R.R.F. and B.B.A.G. constructed libraries. C.C.L., S.S.R., B.B.A.G., G.B. and K.J.M. contributed to library sequencing. N.C., G.K., G.J.F., A.R.R.F., S.M.G., D.F.T., H.E.P. and J.M.M. contributed to data analysis. G.K., A.J.R., S.W., N.C. and A.L.S. contributed to experimental validation. S.M.G. supervised the work and prepared the manuscript with N.C., G.K. and A.R.R.F.
Corresponding author
Ethics declarations
Competing interests
C.C.L., S.S.R., H.E.P., J.M.M. and K.J.M. are employed by Applied Biosystems, a manufacturer of DNA sequencing instrumentation and supplies.
Supplementary information
Supplementary Text and Figures
Supplementary Figures 1–13, Supplementary Tables 1–16, Supplementary Methods (PDF 3878 kb)
Rights and permissions
About this article
Cite this article
Cloonan, N., Forrest, A., Kolle, G. et al. Stem cell transcriptome profiling via massive-scale mRNA sequencing. Nat Methods 5, 613–619 (2008). https://doi.org/10.1038/nmeth.1223
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/nmeth.1223