Transcriptome profiling of a beach-adapted wild legume for dissecting novel mechanisms of salinity tolerance

Zhang, Hengyou; Zuelsdorf, Christine; Penneys, Darin; Fan, Shoujin; Kofsky, Janice; Song, Bao-Hua

doi:10.1038/sdata.2018.290

Transcriptome profiling of a beach-adapted wild legume for dissecting novel mechanisms of salinity tolerance

Data Descriptor
Open access
Published: 11 December 2018

Volume 5, article number 180290, (2018)
Cite this article

Download PDF

You have full access to this open access article

Scientific Data

Transcriptome profiling of a beach-adapted wild legume for dissecting novel mechanisms of salinity tolerance

Download PDF

Hengyou Zhang¹,
Christine Zuelsdorf¹,
Darin Penneys²,
Shoujin Fan³,
Janice Kofsky¹ &
…
Bao-Hua Song¹

2346 Accesses
6 Citations
4 Altmetric
Explore all metrics

Abstract

Strophostyles helvola is a close relative to common bean (Phaseolus vulgaris) and inhabits both coastal and non-coastal regions in North America. However, the mechanism of saline adaptation in S. helvola remains unclear. A transcriptome profiling would facilitate dissecting the underlying molecular mechanisms in salinity-adapted S. helvola. In this study, we reported the RNA-seq analyses of two genotypes (a salt-tolerant beach genotype and a salt-sensitive inland genotype) of S. helvola stressed with salt. S. helvola plants were grown in pots and treated with half lethal-guided dose of NaCl solution for 3 h, 24 h, and 7d. The plants supplied with the same amount of water were used as controls. The whole roots sampled from the three time points were equally pooled as one biological replicate, and three replicates were used for library construction and transcriptome sequencing on Illumina Hiseq 2500. The comparative analyses of root transcriptomes presented here provides a valuable resource for discovery of genes and networks involved in salt tolerance in S. helvola.

Design Type(s)	stimulus or stress design • strain comparison design • transcription profiling design
Measurement Type(s)	transcription profiling assay
Technology Type(s)	RNA sequencing
Factor Type(s)	genotype • experimental condition
Sample Characteristic(s)	Strophostyles helvola • root

Machine-accessible metadata file describing the reported data (ISA-Tab format)

Transcriptome profiling of the salt-stress response in Triticum aestivum cv. Kharchia Local

Article Open access 13 June 2016

RNA-seq Transcriptome Profiling of the Halophyte Salicornia persica in Response to Salinity

Article 11 May 2020

Shedding light on response of Triticum aestivum cv. Kharchia Local roots to long-term salinity stress through transcriptome profiling

Article 10 December 2019

Background & Summary

Soil salinity is becoming a critical environmental factor constraining plant growth and crop productivity worldwide because a majority of crops cultivated in irrigated arable land are salt sensitive. In addition, an increase of crop production is in high demand to sustain the growing human population, thus imposing a need of crop cultivation in marginal or coastal land. A sustainable and environmental-friendly alternative strategy is to develop salt-tolerant crops that can thrive in saline soils.

Next-generation sequencing of transcriptomes has been widely used to characterize the global expression patterns in various organisms under diverse conditions to facilitate gene discovery and address major questions associated with plant environmental stress responses¹. In plants, global transcriptome profiling has been performed to elucidate molecular mechanisms of salt tolerance or response, especially in salinity-adapted genotypes mainly in model systems. Recently, more and more reports on non-model species with limited genomic sequence knowledge have been published. For example, the transcriptome analyses on Ipomoea imperati, a wild relative of sweet potato tolerant to high salinity, revealed the ABA signalling pathway and two membrane transporter genes². Sonneratia alba represents one of the most salt tolerant mangrove species, and RNA-seq on it has identified salt responsive genes with signatures of natural selection³. These transcriptomic studies have allowed us to uncover gene expression mechanisms and novel genes beyond what we currently know from model species.

Strophostyles helvola (L.) Elliott, belonging to Fabaceae family, is herbaceous annual vine native to North America. S. helvola can colonize in wild places, either moist or dry conditions with preference in sandy soils, thus it is also called wild bean or sand bean. In addition, sand bean is a close relative of common bean (Phaseolus vulgaris L.), one of the most important grain legumes worldwide providing protein for human consumption and having roles in fixing atmospheric nitrogen⁴. Like other leguminous crops, cultivated common bean is a glycophyte, sensitive to salt, leading to reduced production if grown under salt conditions⁵. Thus far, molecular mechanisms of common bean’s tolerance to salinity was rarely studied^6,7, thus the underlying mechanisms remain largely unclear. As a close halophyte relative of common bean, a global investigation of molecular response of salinity-adaptive sand bean can be very helpful with revealing the underlying mechanisms toward the goal to develop salt tolerant common bean.

In this study, we presented the transcriptome analyses of two genotypes of S. helvola (halophyte vs glycophyte) stressed with over half-lethal doses of NaCl which was not previously reported. We described the detailed procedure of transcriptome profiling for NaCl-treated and non-treated root tissues of a salt tolerant Beach genotype and a salt sensitive Inland genotype, respectively, during the time-course treatment of 3 h, 24 h, and 7d. In total, 30.5 Gb of transcriptome data (fastq.gz) from twelve root tissues were generated. We also presented in detail the analytic methods of how to obtain the raw sequence, quality control, sequence alignment, and differential expression analyses. We conducted data pre-processing to indicate the high quality of our data as visualized with FastQC and robustness of our results using analyses of multidimensional scales (MDS), dendrogram clustering and expression pattern analyses. The comparative expression profiling of the two S. helvola genotypes will provide valuable resources of genomic data for studying salinity tolerance in S. helvola and others leguminous relatives.

Methods

Overview of experimental design

The whole root tissues that were treated with NaCl and mock (water) were dissected from the plants during the time course investigation at 3 h, 24 h, and 7d. The frozen tissues collected from the three time points were pooled to generate a biological replicate. Total RNA was extracted from the pooled tissues using Qiagen RNeasy mini kit (Qiagen, Valencia, CA, USA) according to the manufacture’s instruction followed by library construction and transcriptome sequencing. The quality-controlled reads were aligned to P. vulgaris reference genome, P. vulgaris_218_v1.0 (https://phytozome.jgi.doe.gov). The resulting .bam files were used for differential expression analyses using edgeR⁸. The experiment design and analysis pipeline were shown in Fig. 1.

**Figure 1: Flowchart of experimental design of this study.**

Materials and treatment

Two S. helvola genotypes were used here for comparative study. One genotype was originally from the sandy soil along the beach in east coast of North Carolina, we named it “Beach genotype” in our study; The other genotype was originally from central Missouri, and we named it “Inland genotype”.

To determine the dose of NaCl that may trigger the tolerance response to salinity, we first tested the median lethal dose (LD50, the salt concentration required to kill 50% of the plants). The beach and inland genotypes were watered every 2 days with increasing increments of 50 mM NaCl solution⁹. We found that two genotypes required different doses of NaCl to cause 50% plants die. For Inland genotype, a final dose 350-mM NaCl may cause lethal toxicity in half, while a higher dose, 600 mM of NaCl, was needed for Beach genotype. These lethal limit data were used to guide the salt treatment for the following experiment.

The seeds were germinated in a growth chamber (Percival Scientific Inc., Perry, IA, USA) and the healthy seedlings were transplanted into flats filled with soil. Briefly, seed coat was sliced to facilitate germination and placed on moist filter paper on a petri dish. 3-day post germination, the seedlings were transplanted in 3 × 6 flat (3.10′ × 3.10′ × 2.33′) (Greenhouse MegaStore, US) filled with potting mix soil. For salt treatment for RNA-seq assay, we use the dose that was slightly lower than the dose identified for the lethal limit mentioned above, thus the plants could be stressed enough but would not die. Thus, we used 200 Mm and 400 Mm of NaCl solution as a final concentration to stress Inland and Beach genotypes, respectively. For treatment, the plants were supplied with 50 mL NaCl solution per day. In parallel, plants supplied with the same amount of water were used as controls. Whole roots were sliced off from the treatment and control plants at 3 h, 24 h and 7d, respectively, after the concentration of NaCl was reached as designed and flash frozen in liquid nitrogen. All frozen tissues were stored at −80 °C for RNA extraction. Three biological replicates per collection were collected.

RNA extraction, Library construction, and RNA sequencing

For each biological replicate, we used pooled root tissue from three plants for the library construction and sequencing, with each collected at 3 h, 24 h, and 7d, respectively. Briefly, the individual roots were ground and equal amounts of ground roots from the three time points were pooled to generate one biological replicate. Thus, three biological replicates per condition were generated. In total, twelve samples for both genotypes were used for total RNA extraction. Total RNA was isolated with RNeasy Plant Mini Kit (Qiagen, Valencia, CA, USA). Purified RNA was quantified using a Quant-iT™ RiboGreen™ RNA Assay Kit (Invitrogen Carlsbad, CA, USA) and its integrity was evaluated using an Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA). One microgram of RNA samples with RNA integrity number (RIN) ≥ 7.0 (Table 1) from three independent biological replicates of each condition was used to generate cDNA libraries with insert sizes ranging from 300 to 350 bp using a TruSeq RNA Library Prep Kit from Illumina. Libraries were combined into a single pool and a 125 bp single-read sequencing run was conducted using a HiSeq 2500 instrument (Illumina, San Diego, CA, USA). Primary processing was performed on the raw reads to generate FASTQ files. RNA extraction, library construction and sequencing were performed in the Genomics Laboratory in the David H. Murdock Research Institute (Kannapolis, NC, USA).

Table 1 Statistics analyses of transcriptomes of two S. helvola genotype.

Full size table

Pre-processing of sequencing data

The quality of the raw sequence generated from transcriptome sequencing was assessed with FastQC (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/). Low quality (< 20) bases and adapter sequences were trimmed with Trimmomatic v 0.36¹⁰ with following parameters: ILLUMINACLIP: path/to/adaptor.fa:2:30:10 LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36. After filtering, the remaining reads were called “clean reads” and were re-assessed with FastQC. All the results of FastQC were merged and visualized using MultiQC (http://multiqc.info). Clean reads were aligned to common bean (Phaseolus vulgaris) reference genome Pvulgaris_218_v1.0 (https://phytozome.jgi.doe.gov) using RNA-seq aligner STAR software¹¹. The general feature format (gff3) file (Pvulgaris_218_v1.0.gene_exons.gff3) corresponding to Pvulgaris_218_v1.0 downloaded at Phytozome was used as an input for STAR. The options used for running STAR are: --runThreadN 16 --genomeDir/path/to/directory --sjdbGTFtagExonParentTranscript Pvulgaris_218_v1.0.gene_exons.gff3 --readFilesIn read.fastq.gz --readFilesCommand zcat --outFileNamePrefix Name --outSAMtype BAM Unsorted SortedByCoordinate. The STAR-resultant .bam files were used to estimate the abundance of uniquely-mapped reads using FeatureCounts¹². Difference expression analyses was conducted using EdgeR¹³. Heat maps were made using heatmap.2 function of the gplots package¹⁴.

Code availability

Codes that were used for data processing are included in the Methods and available as supplementary material (Supplementary File 1).

Data Records

The project was deposited into the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA) accession (Table 1 and Data Citation 1). The abundance count for all the samples was deposited at Gene Expression Omnibus (GEO) database (Data Citation 2).

Technical Validation

Quality control

A total of 12 RNA libraries were prepared and sequenced with the sequencing depth ranging from 41.0–55.7 million single-end reads (Table 1). We applied FastQC to determine the data quality and measured several important parameters. The assessment for the filtered data was shown in Fig. 2, and the distribution of mean quality score and per sequence quality scores indicated the high quality of filtered sequences, with scores of most sequences over 35. Over 99% of the raw reads were kept after quality control and a 75.89–79.36% of the clean reads were mapped to unique location in the common bean reference genome Pvulgaris_218_v1.0 (Table 1).

**Figure 2: Visualization of the qualities of *S. helvola* sequencing data.**

Analysis of RNA-seq data

The clean reads with single alignment on P. vulgaris reference exosome were counted (Fig. 3a) and normalized by counts per million (Fig. 3b) for differential expression analysis. Gene clustering analyses were used to examine the difference between the biological replicates. Figure 3c shows that three replicates from the same group cluster together while samples from different groups are well separated. This result was further supported by multi-dimensional scale (Fig. 3d) showing that gene expression profiles of all biological replicates can clearly separate the four groups and cluster biological replicates together with small variability per group. We further explored the expression profiles in two comparisons and visualized with mean-difference (MD) plots. As shown in Fig. 3e and f, a majority of the genes are cantered around the line of zero log(fold change), and the differentially expressed genes (DEGs) were highlighted in red with a threshold of fold change ≥ 2 and fdr ≤ 0.05. We identified 2910 DEGs in Beach genotype and have showed their expression pattern across all the samples in Fig. 3g. Consistent with Fig. 3c, replicates from each group are clustered together. DEGs showing distinct expression pattern in treated Beach genotype compared with the other three groups merit further exploration.

**Figure 3: Global assessment of transcriptome data.**

Usage Notes

The RNA-seq fastq.gz files were deposited at NCBI SRA public repository and could be downloaded using fastq-dump tool of SRA Toolkit (https://www.ncbi.nlm.nih.gov). Other than Trimmomatic, FASTX (http://hannonlab.cshl.edu/fastx_toolkit/) and cutadapt¹⁵ are also commonly used for trimming and adapter removal. The alternative aligners for RNA-seq sequence could also be used, such as TopHat2¹⁶ and HISAT2¹⁷. The reference genome of P. vulgaris, the annotation file, and gff3 file could be retrieved at Phytozome database (https://phytozome.jgi.doe.gov). For downstream differential expression analyses, Cufflinks package coupled with CummeRbund may generate transcriptome assembly, expression abundance, differential expression analyses, and visualization of analyses results. HTSeq¹⁸ could also be used as alternative of featureCounts for quantification and performed differential expression analyses with DESeq2¹⁹.

Additional information

How to cite this article: Zhang, H. et al. Transcriptome profiling of a beach-adapted wild legume for dissecting novel mechanisms of salinity tolerance. Sci. Data. 5:180290 doi:10.1038/sdata.2018.290 (2018).

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Han, Y. X., Gao, S. G., Muegge, K., Zhang, W. & Zhou, B. Advanced applications of RNA sequencing and challenges. Bioinform Biol Insig 9, 29–46 (2015).
CAS Google Scholar
Luo, Y. et al. Salt tolerance response revealed by RNA-Seq in a diploid halophytic wild relative of sweet potato. Sci Rep 7, 9624 (2017).
Article ADS Google Scholar
Chen, S. F. et al. Transcriptome sequencing of a highly salt tolerant mangrove species Sonneratia alba using Illumina platform. Mar Genom 4, 129–136 (2011).
Article Google Scholar
Schmutz, J. et al. A reference genome for common bean and genome-wide analysis of dual domestications. Nat Genet 46, 707–713 (2014).
Article CAS Google Scholar
Maas, E. V. & Hoffman, G. J. Crop salt tolerance-current assessment. J Irrig Drainage Div 103, 115–134 (1977).
Google Scholar
Hiz, M. C., Canher, B., Niron, H. & Turet, M. Transcriptome analysis of salt tolerant common bean (Phaseolus vulgaris L.) under saline conditions. PLoS One 9, e92598 (2014).
Article ADS Google Scholar
Buyuk, I. et al. Genome-wide identification of salinity responsive HSP70s in common bean. Mol Biol Rep 43, 1251–1266 (2016).
Article CAS Google Scholar
McCarthy, D. J., Chen, Y. S. & Smyth, G. K. Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic Acids Res 40, 4288–4297 (2012).
Article CAS Google Scholar
Orsini, F. et al. A comparative study of salt tolerance parameters in 11 wild relatives of Arabidopsis thaliana. J Exp Bot 61, 3787–3798 (2010).
Article CAS Google Scholar
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Article CAS Google Scholar
Dobin, A . et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Article CAS Google Scholar
Liao, Y., Smyth, G. K. & Shi, W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930 (2014).
Article CAS Google Scholar
Robinson, M. D., McCarthy, D. J. & Smyth, G. K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140 (2010).
Article CAS Google Scholar
Warnes, G. R. et al. gplots: various R programming tools for plotting data. https://cran.r-project.org/web/packages/gplots/index.html (2016).
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnetjournal 17, 10–12 (2011).
Google Scholar
Kim, D. et al. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol 14, R36 (2013).
Article Google Scholar
Kim, D., Landmead, B. & Salzberg, S. L. HISAT: a fast spliced aligner with low memory requirements. Nat Methods 12, 357–360 (2015).
Article CAS Google Scholar
Anders, S., Pyl, P. T. & Huber, W. HTSeq-a Python framework to work with high-throughput sequencing data. Bioinformatics 31, 166–169 (2015).
Article CAS Google Scholar
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15, 550 (2014).
Article Google Scholar

Data Citations

NCBI Sequence Read Archive SRP155428 (2018)
Gene Expression Omnibus GSE121143 (2018)

Download references

Acknowledgements

B.S. was supported by The National Institute of General Medical Sciences of the National Institutes of Health (Award Number: R15GM122029) and University of North Carolina (2018_FRG_SONG).

Author information

Authors and Affiliations

Department of Biological Sciences, University of North Carolina at Charlotte, Charlotte, 28223, NC, USA
Hengyou Zhang, Christine Zuelsdorf, Janice Kofsky & Bao-Hua Song
Department of Biology & Marine Biology, University of North Carolina at Willington, Willington, 28403, NC, USA
Darin Penneys
Key Laboratory of Plant Stress Research, College of Life Science, Shandong Normal University, Jinan, 250014, Shandong, China
Shoujin Fan

Authors

Hengyou Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Christine Zuelsdorf
View author publications
You can also search for this author in PubMed Google Scholar
Darin Penneys
View author publications
You can also search for this author in PubMed Google Scholar
Shoujin Fan
View author publications
You can also search for this author in PubMed Google Scholar
Janice Kofsky
View author publications
You can also search for this author in PubMed Google Scholar
Bao-Hua Song
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

B.S. conceived the project and designed the experiment. C.Z. conducted the experiment. H.Z., D.P., J.K., S.F. performed the data analyses. H.Z. and B.S. wrote the manuscript. All authors read the final manuscript.

Corresponding author

Correspondence to Bao-Hua Song.

Ethics declarations

Competing interests

The authors declare no competing interests.

ISA-Tab metadata

Supplementary information

Supplementary File 1 (TXT 2 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files made available in this article.

Reprints and permissions

About this article

Cite this article

Zhang, H., Zuelsdorf, C., Penneys, D. et al. Transcriptome profiling of a beach-adapted wild legume for dissecting novel mechanisms of salinity tolerance. Sci Data 5, 180290 (2018). https://doi.org/10.1038/sdata.2018.290

Download citation

Received: 12 September 2018
Accepted: 31 October 2018
Published: 11 December 2018
DOI: https://doi.org/10.1038/sdata.2018.290
Springer Nature Limited

This article is cited by

Integrated de novo transcriptome of Culex pipiens mosquito larvae as a resource for genetic control strategies
- Valentina Mastrantonio
- Pietro Libro
- Daniele Porretta
Scientific Data (2024)
Water requirements and crop coefficients of edible, spicy and medicinal herbs and vegetables; a review aimed at supporting plant and water management
- Luis S. Pereira
- Mariana Mota
- Paula Paredes
Irrigation Science (2024)
Transcriptomic analysis reveals candidate genes associated with salinity stress tolerance during the early vegetative stage in fababean genotype, Hassawi-2
- Muhammad Afzal
- Salem S. Alghamdi
- Muhammad Habib ur Rahman
Scientific Reports (2023)

Transcriptome profiling of a beach-adapted wild legume for dissecting novel mechanisms of salinity tolerance

Abstract

Similar content being viewed by others

Background & Summary

Methods

Overview of experimental design

Materials and treatment

RNA extraction, Library construction, and RNA sequencing

Pre-processing of sequencing data

Code availability

Data Records

Technical Validation

Quality control

Analysis of RNA-seq data

Usage Notes

Additional information

References

References

Data Citations

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

ISA-Tab metadata

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation