Grey Level Texture Features for Segmentation of Chromogenic Dye RNAscope from Breast Cancer Tissue

Davidson, Andrew; Morley-Bunker, Arthur; Wiggins, George; Walker, Logan; Harris, Gavin; Mukundan, Ramakrishnan

doi:10.1007/978-981-97-1335-6_7

Andrew Davidson³⁸,
Arthur Morley-Bunker³⁹,
George Wiggins³⁹,
Logan Walker³⁹,
Gavin Harris⁴⁰,
Ramakrishnan Mukundan³⁸ &
kConFab Investigators

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 1166))

Included in the following conference series:

International Conference on Medical Imaging and Computer-Aided Diagnosis

250 Accesses

Abstract

Chromogenic RNAscope dye and haematoxylin staining of cancer tissue facilitates diagnosis of the cancer type and subsequent treatment, and fits well into existing pathology workflows. However, manual quantification of the RNAscope transcripts (dots), which signify gene expression, is prohibitively time consuming. In addition, there is a lack of verified supporting methods for quantification and analysis. This paper investigates the usefulness of gray level texture features for automatically segmenting and classifying the positions of RNAscope transcripts from breast cancer tissue. Feature analysis showed that a small set of gray level features, including Gray Level Dependence Matrix and Neighbouring Gray Tone Difference Matrix features, were well suited for the task. The automated method performed similarly to expert annotators at identifying the positions of RNAscope transcripts, with an $F_1$-score of 0.571 compared to the expert inter-rater $F_1$-score of 0.596. These results demonstrate the potential of gray level texture features for automated quantification of RNAscope in the pathology workflow.

Access provided by Autonomous University of Puebla. Download conference paper PDF

Rapid staining and imaging of subnuclear features to differentiate between malignant and benign breast tissues at a point-of-care setting

Article 22 April 2016

AI powered quantification of nuclear morphology in cancers enables prediction of genome instability and prognosis

Article Open access 19 June 2024

A novel computational method for automatic segmentation, quantification and comparative analysis of immunohistochemically labeled tissue sections

Article Open access 15 October 2018

Keywords

1 Introduction

In 2020, there were over 2.3 million new cases of breast cancer, and 685,000 deaths caused by breast cancer [2]. The effectiveness of breast cancer treatments vary based on some qualities of the cancer, therefore, it is crucial that these qualities are classified accurately and consistently. Over the last several years the number of cancer cases has increased, and the number of cases is projected to continue increasing [2]. Together with a shortage in the number of pathologists available [10], these trends have led to an increased pathology workload.

Within the last decade whole slide imaging has become common, which is the practice of using advanced digital scanners to produce high resolution images of patient tissue [5, 6]. There is an emerging opportunity to apply image processing methods to whole slide images, to produce consistent and exhaustive quantification of features relevant to breast cancer diagnosis.

1.1 RNAscope Staining

RNAscope is an in situ hybridization assay that is used to detect the presence of certain RNA sequences in tissue [15]. It allows the expression of specific gene sequences in a tissue sample to be visually quantified as stained dots (transcripts). The density of RNAscope transcripts is an indicator of the level of gene expression. This technique is applicable to the problem of breast cancer diagnosis because the ideal treatment differs based on some genetic characteristics of the tumour tissue, such as the level of expression of the Erb-B2 Receptor Tyrosine Kinase 2 (ERBB2) gene, also referred to as HER2 status.

To fit most easily into existing pathology workflows, a chromogenic RNAscope dye can be applied to tissue alongside haematoxylin, which is a stain that gives good definition to the nuclei. A stained sample can then be scanned on a slide in normal lab conditions to produce a single image. Fluorescent RNAscope dyes are also available and can be used to produce spectral image stacks (multiple images) that are much simpler to quantify RNAscope from. However, the fluorescent dyed samples deteriorate quickly, require a dark room to scan, and are more difficult to prepare. Therefore, this paper will solely investigate the segmentation of chromogenic RNAscope.

1.2 RNAscope Segmentation

The cases that are important to accurately segment are those with low or non-existent RNAscope staining. If there is heavy RNAscope staining, the staining is readily evident and gene expression can be deduced without having to count the individual transcripts. However, there are some challenges involved with designing a robust chromogenic RNAscope segmentation method. Since the RNAscope transcripts present at varying hue, stain intensity, and shape depending on stain preparation and tissue characteristics, segmentation is not straightforward; simple colour filtering is inadequate unless the preparation is consistently impeccable. This variability can be seen in Fig. 1. There is also a lack of available data with this stain type that also has annotations for the position of each RNAscope transcript.

This paper details the development and testing of a gray level texture feature based RNAscope segmentation method, which operates on data from whole slide images. The images contain FFPE (formalin fixed paraffin embedded) breast cancer tissue that has been stained with haematoxylin and chromogenic RNAscope.

2 Related Work

Gray level texture features are useful descriptors that are commonly used to extract information from medical images [3]. Gray level texture feature extraction techniques include the gray level co-occurrence matrix [8] (GLCM), which assesses the co-occurence of gray values with a given offset; the gray level run length matrix [4] (GLRLM), which assesses runs of same valued consecutive pixels, and the gray level size zone matrix [14] (GLSZM), which assesses connected zones of same valued pixels. Further methods include the gray level dependence matrix [12] (GLDM), which assesses the frequency of near, similar valued pixels at each gray level, and the neighbouring gray tone difference matrix [1] (NGTDM), which assesses the variability of gray values from the average value of their nearby pixels.

A recent study [9] compared existing chromogenic RNAscope transcript counting methods on FFPE haematoxylin stained colorectal cancer tissues. Although it does not directly compare segmentation performance, this study indicates that currently available open source methods for RNAscope segmentation are slow due to requiring user input to function; by manually selecting the RNAscope positions or validating each RNAscope candidate. The only listed open-source method that does not require this level of user input (Trainable WEKA Segmentation) is not able to differentiate between single RNAscope transcripts and clusters of them, and is not considered fully automated due to the number of user steps required. Even the commercial methods considered (SpotStudio and Aperio RNA ISH) required some configuration to run, and did not perform better than the open source methods by most metrics assessed.

Although there are existing solutions, the methods published in academic literature focus on the clinical side of the problem and have not published accuracy metrics for their solutions. There are also few existing solutions that use haematoxylin and chromogenic RNAscope stained tissue, which is much more convenient to fit into the pathology workflow. Some commercial methods exist, but they do not have a verified accuracy, and the mechanisms by which they operate are not in the public domain.

3 Method

A texture feature-based method for RNAscope transcript segmentation was explored to assess the viability of this approach. It was implemented in Python, making use of the pyradiomics [7] library.

3.1 Data Acquisition

Fourty whole slide images were obtained from the University of Otago, Christchurch. Each of these whole slide images contain a scan of a tissue microarray of haematoxylin and brown chromogenic RNAscope stained FFPE breast cancer tissues. The tissue was scanned at 40x magnification (0.25$\upmu $ per pixel).

3.2 Expert Annotation of RNAscope Transcripts

Tissue areas with light RNAscope staining (indicated by small brown/yellow regions in the tissue) were identified for potential use as training data. The RNAscope transcripts on a total of 144 480$\,\times \,$480 pixel non-overlapping patches were annotated by a trained pathology scientist and an anatomical pathologist, with an overlap of 19 patches annotated by both. The experts annotated 113 and 50 patches respectively.

An agreement test was conducted on the 19 patches of tissue that were annotated with RNAscope transcript positions by both experts. This was done to provide a baseline evaluation metric. Since each set of expert annotations were represented by a set of coordinates, a method was needed to provide a reasonable agreement metric. Simply assessing exact matches of coordinates would lead to very low agreement. Therefore, pairs of annotations from each expert that were less than 5 pixels from each other were counted as true positives (as shown in Fig. 2). Each annotation could only be paired to one annotation from the other expert. 5 pixels was chosen as the threshold value because the visibly stained portion of an RNAscope transcript generally has a diameter of up to 5 pixels. The $F_1$-score could then be calculated to evaluate the inter-rater agreement.

The $F_1$-score, representing inter-rater agreement, was found to be 0.596, with the recall of one annotator being 0.530 and the other being 0.682. It is important to note that a relatively low score is expected for this problem because each RNAscope transcript needs to not only be correctly recognized, but also detected at nearly the same position. The type of tissue staining used also causes additional difficulties, as discussed in Sect. 1.2.

3.3 Candidate Selection

To reduce computational load, a pre-processing step is taken to reduce the number of candidate pixels. It makes use of the colour information in the image, but cannot be made very sensitive due to the inconsistent nature of chromogenic RNAscope staining alongside haematoxylin.

Firstly, colour deconvolution (as described in [11]) is used to separate the RNAscope (brown) stain colour into its own image. Then, the RNAscope image is Gaussian blurred using a 5$\,\times \,$5 pixel circular kernel. Next, it is thresholded using a histogram-based method to keep only dark areas that are likely to represent RNAscope staining. A histogram for the intensities up to 250 (out of 255) is constructed. Values above 250 are excluded to remove data from the white background (non-tissue) areas. Then, the peak value is found and used as the threshold value, as this corresponds to the base tissue intensity. The remaining regions after thresholding are of lower value (indicating darker staining) than the threshold value. If more than 50% of the tissue still remains, the threshold value is decreased to make it more selective. A threshold decrease of 8 was found to work well. The detected RNAscope regions are separated using the Suzuki contour detection algorithm [13] to find each region contour. Using the grayscale representation of the original image with the same 5$\,\times \,$5 pixel Gaussian blur applied, very dark (heavily stained) areas are found by applying a binary threshold to keep only areas with lower than 100 value. The Suzuki contour detection algorithm is again used to convert these regions into contour lists.

Both sets of contours are drawn onto a new, blank mask with width and height dimensions matching the original image. The background colour is the previously selected RNAscope threshold value. The RNAscope contours are filled with their original intensity values from the RNAscope image (which are darker than the threshold value). The dark region contours are filled with the RNAscope threshold value minus a small value. A value of 11 was chosen, but any value from 1–32 would function identically, resulting in the same candidate density of 50%. In cases where contours overlap, the lowest value is taken. This mask is used to generate a list of candidate coordinates that may be RNAscope transcripts (dots). The list of candidates is created by repeatedly adding the coordinates of the lowest remaining value on the mask to the list of candidates. A circle centred on these coordinates (with radius according to Eq. 1) is then drawn with its value set to the RNAscope threshold value, which prevents the area from being selected again. intensity in the equation refers to the mask value at the candidate location, and thresh refers to the previously selected RNAscope threshold value. This means that the radius will be 0 (producing a dot and allowing for more adjacent, clustered detections), unless the difference between the candidate location value and the RNAscope threshold value is less or equal to 32 (indicating a weaker detection). This process ends when there are no remaining values darker than the RNAscope threshold value. An example is shown in Fig. 3.

$$\begin{aligned} radius=floor(\frac{max(intensity - thresh + 64, 0)}{32}) \end{aligned}$$

(1)

3.4 Feature Extraction

A vector of texture features is extracted for each of the candidate coordinates. The image is firstly separated into three single-channel images: grayscale, haematoxylin, and RNAscope. The haematoxylin and RNAscope channels are extracted using the previously mentioned colour deconvolution method [11]. A large set of first order, GLRLM, GLSZM, GLCM, GLDM, and NGTDM features are extracted. For the features that can be assessed at different distances, distances of 1, 2, and 3 pixels are used. GLDM cut-offs of 0, 1, and 2 pixels are used. Each feature is assessed in both a 7$\,\times \,$7 pixel and an 11$\,\times \,$11 pixel window around the central pixel. Feature vectors for coordinates that are within 1 pixel of a ground-truth annotation are classified as positive, and all other coordinates as negative. The texture features are normalized. For classification, the linear support vector classifier (LSVC) was found to work well, using 1e−7 stopping tolerance, balanced class weighting, l2 penalty, C = 1.0, and 1e7 max iterations. Viewing the coefficient weights for the trained LSVC showed that very few features were significant, so a reduced feature set was designed; 1548/1572 features were removed. The remaining features are first order energy and variance, NGTDM coarseness, and GLDM Large Dependence High Gray Level Emphasis (LDHGLE). NGTDM coarseness was kept over GLSZM Large Area High Gray Level Emphasis (LAHGLE, which was weighted slightly higher) as it is simpler to compute. These are only assessed at a distance of 3 pixels, and GLDM cut-off of 2 pixels. The reduced feature set extraction process runs about 10 times faster. Each classifier was trained on 115 of the annotated patches, with the remaining 29 being used for validation.

The prediction outputs for each feature vector are used to produce segmentation maps, which are binarized with a configurable gray threshold value. The watershed transform is used to aggregate neighboring values into a set of coordinates that can be directly compared against the ground truth coordinates. The minimum pixels in each detection (area threshold) is configurable. Figure 4 shows the segmentation map and final coordinates for an example patch.

4 Results

The classifiers were evaluated against the same set of 19 patches used to assess expert inter-rater agreement. No comparison was made with other existing methods, due to their lack of output granularity and automation (as discussed in Sect. 2). No patches from the same tissue cores were present across both the training and validation datasets. Both the classifier using the full feature set and the classifier using the reduced feature set were evaluated. $F_1$-score (based on matching annotation pairs) was used as the evaluation metric for each classifier. The best scores for each classifier were very similar for both classifiers, with the reduced feature set classifier only performing slightly worse (0.571 $F_1$-score) than the classifier using the full feature set (0.572 $F_1$-score). Both classifiers performed similarly to the expert inter-rater agreement (0.596 $F_1$-score). Additional metrics (precision and recall) are also shown in Table 1.

Table 1. Multiple accuracy measurements for both classifiers

Full size table

To illustrate the segmentation characteristics and hyperparameter (gray threshold and area threshold) selection process, a surface plot showing the $F_1$-score at each gray threshold and area threshold is shown in Fig. 5. Since both classifiers produced very similar surfaces, only the plot for the reduced feature set classifier is shown. The plot displays good $F_1$-scores at most thresholds, only dropping significantly at gray thresholds higher than 150. This means that both classifiers are resilient to small changes in the RNAscope detection size on the segmentation map, and to the intensity of these detections. This also implies that the feature extraction is stable, producing coherent results for neighboring pixels.

4.1 Feature Analysis

Of all of the features extracted, few are needed to produce an accurate segmentation map. This is evidenced by the fact that reducing the feature set from 1572 to 24 features only decreased the classification $F_1$-score from 0.572 to 0.571. Figure 6 shows the relative coefficient weights of each category of features, and the weighting of each colour channel. Weighting for a feature is aggregated across all parameters (such as distance, cutoff) measured for that feature.

The feature with highest weighting is the first order energy, which is simply the magnitude of pixel values within the area. It accounts for 35.5% of the total coefficient weight magnitude. For the smaller 7$\,\times \,$7 pixel window, lower gray and RNAscope channel energy values (indicating darker staining) were positively correlated with RNAscope detection; whereas, lower haematoxylin values (indicating denser, darker tissue) were negatively correlated. For the larger 11$\,\times \,$11 pixel window, the opposite correlation (albeit weaker) occurred for each of the three channels. This makes sense as the 11$\,\times \,$11 pixel window would mostly consist of the tissue surrounding the transcript, not just the transcript itself.

The second highest weighted feature is the GLDM LDHGLE, which accounts for 28.2% of the total weight. The LDHGLE measures the occurrence of high (light) gray valued regions with homogeneous texture. This is positively correlated with RNAscope detection on the RNAscope channel at window size 11$\,\times \,$11 pixels. This positive correlation at the larger window size is likely because the immediate area surrounding an RNAscope transcript generally contains little to no RNAscope staining. The third highest weighted feature, GLSZM LAHGLE, indicates large connected areas of high gray value, and carries the same positive correlation on the RNAscope channel at window size 11$\,\times \,$11 pixels. Given that these two features are so similar, including both would be largely redundant.

The fourth highest weighted feature is the NGTDM coarseness, which accounts for 7.2% of the total weight. It assesses the average difference of gray values from their neighbors within a small (1–3 pixel) radius, or the spatial rate of change. Lower rates of change on the RNAscope and gray channels correlate with transcript detection, whereas the opposite association exists on the haematoxylin channel. This could be because a dot with smooth edges would produce low coarseness values on the RNAscope channel, and the haematoxylin channel will naturally be coarse in the areas near nuclei, where RNA is primarily found.

No GLRLM or GLCM features were highly weighted. Since GLRLM assesses horizontal runs of same-valued pixels, the noisy and two dimensional nature of slide image data would reduce its suitability. Given that GLCM assesses co-occurrence of gray values at small, fixed offsets, it does not seem well suited for detecting simple dot structures atop heterogeneous tissue.

5 Conclusion

In this paper, the viability of gray level texture features for segmentation of chromogenic dye RNAscope was investigated. A linear support vector classifier using a variety of these features was developed, and subsequently pruned to only include the most significant features. It performed similarly to two experts who annotated data for this study, achieving an $F_1$-score of 0.571 for identifying closely matching RNAscope transcript coordinates. The baseline expert $F_1$-score for the same task was 0.596. Feature analysis revealed a small set of gray level features well suited to segmentation of chromogenic dots (RNAscope) from histological tissue specimen slide images. Automated methods for chromogenic RNAscope segmentation and subsequent quantification are crucial for the usability of RNAscope in routine pathology workflows, and could greatly aid in cancer diagnosis with further development. For future work, more nuanced deep learning methods will be investigated because they could potentially better handle the tissue heterogeneity that makes this segmentation task difficult.

References

Amadasun, M., King, R.: Textural features corresponding to textural properties. IEEE Trans. Syst. Man Cybern. 19(5), 1264–1274 (1989). https://doi.org/10.1109/21.44046
Article Google Scholar
Arnold, M., et al.: Current and future burden of breast cancer: global statistics for 2020 and 2040. The Breast 66, 15–23 (2022). https://doi.org/10.1016/j.breast.2022.08.010
Article Google Scholar
Chowdhary, C.L., Acharjya, D.: Segmentation and feature extraction in medical imaging: a systematic review. Procedia Comp. Sci. 167, 26–36 (2020). International Conference on Computational Intelligence and Data Science. https://doi.org/10.1016/j.procs.2020.03.179
Galloway, M.M.: Texture analysis using gray level run lengths. Comput. Graph. Image Process. 4(2), 172–179 (1975). https://doi.org/10.1016/S0146-664X(75)80008-6
Article Google Scholar
Ghaznavi, F., Evans, A., Madabhushi, A., Feldman, M.: Digital imaging in pathology: whole-slide imaging and beyond. Annu. Rev. Pathol. 8(1), 331–359 (2013). https://doi.org/10.1146/annurev-pathol-011811-120902
Article Google Scholar
Graschew, G., Roelofs, T.A., Rakowsky, S., Schlag, P.M.: E-health and telemedicine. Int. J. Comput. Assist. Radiol. Surg. 1(1), 119–135 (2006). https://doi.org/10.1007/s11548-006-0012-1
Article Google Scholar
van Griethuysen, J.J., et al.: Computational radiomics system to decode the radiographic phenotype. Can. Res. 77(21), e104–e107 (2017). https://doi.org/10.1158/0008-5472.CAN-17-0339
Article Google Scholar
Haralick, R.M., Shanmugam, K., Dinstein, I.: Textural features for image classification. IEEE Trans. Syst. Man Cybern. SMC- 3(6), 610–621 (1973). https://doi.org/10.1109/tsmc.1973.4309314
Article Google Scholar
Morley-Bunker, A.E., et al.: RNAscope compatibility with image analysis platforms for the quantification of tissue-based colorectal cancer biomarkers in archival formalin-fixed paraffin-embedded tissue. Acta Histochemica 123(6), 151, 765 (2021). https://doi.org/10.1016/j.acthis.2021.151765
Rozario, S.Y., Sarkar, M., Farlie, M.K., Lazarus, M.D.: Responding to the healthcare workforce shortage: a scoping review exploring anatomical pathologists’ professional identities over time. Anat. Sci. Educ. 17, 351–365 (2023). https://doi.org/10.1002/ase.2260
Article Google Scholar
Ruifrok, A.C., Johnston, D.A.: Quantification of histochemical staining by color deconvolution. Anal. Quant. Cytol. Histol. 23(4), 291–299 (2001)
Google Scholar
Sun, C., Wee, W.G.: Neighboring gray level dependence matrix for texture classification. Comput. Vis. Graph. Image Process. 23(3), 341–352 (1983). https://doi.org/10.1016/0734-189X(83)90032-4
Article Google Scholar
Suzuki, S., Be, K.: Topological structural analysis of digitized binary images by border following. Comput. Vis. Graph Image Process. 30(1), 32–46 (1985). https://doi.org/10.1016/0734-189X(85)90016-7
Thibault, G., et al.: Texture indexes and gray level size zone matrix application to cell nuclei classification. In: International Conference on Pattern Recognition and Information Processing, PRIP 2009, pp. 140–145 (2009)
Google Scholar
Wang, F., et al.: RNAScope: a novel in situ RNA analysis platform for formalin-fixed, paraffin-embedded tissues. J. Mol. Diagn. JMD 14(1), 22–29 (2012). https://doi.org/10.1016/j.jmoldx.2011.08.002
Article MathSciNet Google Scholar

Download references

Acknowledgment

This study used data supplied by Associate Professor Logan Walker, Dr. Arthur Morley-Bunker, and Dr. George Wiggins from the University of Otago, Christchurch.

The RNAscope transcripts on a total of 144 image patches were annotated by Dr. Arthur Morley-Bunker (a trained pathology scientist) and Dr. Gavin Harris (an anatomical pathologist) for use in this study.

We wish to thank Heather Thorne, Eveline Niedermayr, Sharon Guo, all the kConFab research nurses and staff, the heads and staff of the Family Cancer Clinics, and the Clinical Follow Up Study (which has received funding from the NHMRC, the National Breast Cancer Foundation, Cancer Australia, and the National Institute of Health (USA)) for their contributions to this resource, and the many families who contribute to kConFab. kConFab is supported by a grant from the National Breast Cancer Foundation, and previously by the National Health and Medical Research Council (NHMRC), the Queensland Cancer Fund, the Cancer Councils of New South Wales, Victoria, Tasmania and South Australia, and the Cancer Foundation of Western Australia.

Author information

Authors and Affiliations

Department of Computer Science and Software Engineering, University of Canterbury, Christchurch, New Zealand
Andrew Davidson & Ramakrishnan Mukundan
Department of Pathology and Biomedical Science, University of Otago, Christchurch, New Zealand
Arthur Morley-Bunker, George Wiggins & Logan Walker
Canterbury Health Laboratories, Christchurch, New Zealand
Gavin Harris

Authors

Andrew Davidson
View author publications
You can also search for this author in PubMed Google Scholar
Arthur Morley-Bunker
View author publications
You can also search for this author in PubMed Google Scholar
George Wiggins
View author publications
You can also search for this author in PubMed Google Scholar
Logan Walker
View author publications
You can also search for this author in PubMed Google Scholar
Gavin Harris
View author publications
You can also search for this author in PubMed Google Scholar
Ramakrishnan Mukundan
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

kConFab Investigators

Corresponding author

Correspondence to Andrew Davidson .

Editor information

Editors and Affiliations

Department of Computer Science, Shanghai Jiao Tong University, Shanghai, China
Ruidan Su
Department of Informatics, University of Leicester, Leicester, UK
Yu-Dong Zhang
University of Manchester, Manchester, UK
Alejandro F. Frangi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Davidson, A. et al. (2024). Grey Level Texture Features for Segmentation of Chromogenic Dye RNAscope from Breast Cancer Tissue. In: Su, R., Zhang, YD., Frangi, A.F. (eds) Proceedings of 2023 International Conference on Medical Imaging and Computer-Aided Diagnosis (MICAD 2023). MICAD 2023. Lecture Notes in Electrical Engineering, vol 1166. Springer, Singapore. https://doi.org/10.1007/978-981-97-1335-6_7

Download citation

DOI: https://doi.org/10.1007/978-981-97-1335-6_7
Published: 06 March 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-1334-9
Online ISBN: 978-981-97-1335-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Grey Level Texture Features for Segmentation of Chromogenic Dye RNAscope from Breast Cancer Tissue

Abstract

Similar content being viewed by others

Rapid staining and imaging of subnuclear features to differentiate between malignant and benign breast tissues at a point-of-care setting

AI powered quantification of nuclear morphology in cancers enables prediction of genome instability and prognosis

A novel computational method for automatic segmentation, quantification and comparative analysis of immunohistochemically labeled tissue sections

Keywords