Abstract
Comparing gene expression profiles measured in a wide range of tissue types, at different developmental stages, or under different environmental conditions can yield valuable insights into the mechanisms of cell/tissue specification and differentiation, or identify cell/tissue-type-specific responses to environmental stimuli. Such comparisons are only valid if data from different sources are processed identically. This also applies when a novel data set is integrated into an existing collection of data sets (e.g., in-house and publicly available data). Here, I describe a complete workflow for RNA-Seq data, from the initial data processing steps to the comparison of gene expression profiles. I use publicly available data for demonstration purposes, but I also describe how to integrate your own data sets. The workflow runs on all three major operating systems (Linux, macOS, and Windows). The scripts and the tutorial can be accessed on github.com/MWSchmid/RNAseq_protocol.
© 2017 Springer Science+Business Media LLC
About this protocol
Cite this protocol
Schmid, M.W. (2017). RNA-Seq Data Analysis Protocol: Combining In-House and Publicly Available Data. In: Schmidt, A. (ed) Plant Germline Development. Methods in Molecular Biology, vol 1669. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-7286-9_24
DOI: https://doi.org/10.1007/978-1-4939-7286-9_24
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-7285-2
Online ISBN: 978-1-4939-7286-9
eBook Packages: Springer Protocols