Abstract
This chapter describes a method for meta-research, based on image mining from neuroscientific publications. It extends earlier investigation to the study of a large scale data set. Using a framework for extraction and characterisation of reported fMRI images, based on their coordinates and colour profiles, we propose that significant information can be harvested automatically. The coordinates of the brain activity regions, in relation to a standard reference templates are estimated. We focus on the analysis of scientific reports of the default mode network. Both the commonalities and the differences of brain activity between control, Alzheimer and schizophrenic patients are identified.
Access provided by Autonomous University of Puebla. Download chapter PDF
Similar content being viewed by others
Keywords
1 Introduction
1.1 Meta-Analysis in Neuroscience
There is an ever increasing number of scientific publications in many research fields in general, and in neuroscience in particular. Hundreds of articles are published every month, with a considerable amount devoted to functional magnetic resonance imaging (fMRI) ([7,12]). When comparing results obtained with a particular experimental setup with those reported in the existing literature, one may validate, integrate or confront different theories. This analysis is usually performed in a rather human-intensive manner, through the use of dedicated curators, e.g. [14,15]. The development of tools able to synthesise and aggregate such large-data can then be seen as crucial.
Meta-analysis of neuroscience research would clearly benefit from direct access to their original data sets. This is often not possible, due to the unavailability of such data. Yet, albeit of poorer quality, there is a plethora of summarising information, readily available in many published reports. Its analysis is the main topic of the current manuscript. That information is encoded both in text structures, as well as in image content, providing ample scope for mining information at various levels. The extraction of relevant information is not a simple task, and constitutes a major subject of information retrieval and data mining [11].
1.2 Previous Work
As stated above, previous approaches often used a considerable amount of curator work, with researchers reading from several sources, and extracting by hand the relevant information (cf., [14]). This severely limits the range of possible analyses. It is, therefore, of significant importance that robust automated information retrieving approaches be added to the current attempts to build functional neuro-atlases. A recent, fully automated approach was proposed by [21]. Their framework combines text-mining, meta-analysis and machine-learning techniques, to generate probabilistic mappings between cognitive and neural states. One drawback of this method is that it addresses only text mining, and requires the presence of activation coordinates in the articles analysed. Those peak-coordinates and some text tags are the only representation of the activations, which results in the discarding of valuable information from the neural activity.
We see our approach as a complementary way to tackle the problem, when image information, rather than text, is automatically harvested from published data.
1.3 Default Mode Network
An open field of research with increasing interest in neuroscience is the resting state and default mode networks (RSN & DMN, respectively). These networks comprise areas such as the occipital, temporal and frontal areas of the brain. They are active when the individual is not performing any goal-oriented task, and suppressed during activity [6,17]. In spite of the great attention to those networks, scientific research of brain’s “resting state” still poses various conceptual and methodological difficulties [19]. A commonly topic of study consists in investigating the differences and commonalities in the activity of healthy brains when compared to, e.g., Alzheimer or schizophrenic brains. Specifically, how different is the composition of RSN and DMN, in healthy and pathological brains, and how do these differences influence cognitive and functional performances.
1.4 Proposed Approach
In this chapter, we propose a complementary framework to text analysis, focusing instead on image information. It relies on the automatic extraction and characterisation of image information in fMRI literature. Such information often takes the form of activation/suppression of activity in the brain, in a variety of image settings, orientations and resolutions. This framework aims to open different means to building and improving functional atlases of the human brain, based solely on the large number of images published in neuroscientific articles.
We demonstrate the feasibility and results of our method in studies of the resting state and default mode networks, and highlight three outcomes of such research. The first is the identification of common neuronal activity across all subjects. Several regions are expected to participate in the DMN structures, in spite of possible existence of any of the aforementioned diseases. The second outcome focuses on differences between the activation patterns of healthy subjects and unhealthy ones, which can be explained with information already reported in articles within the data set used. Finally, we aim at identifying also variations in activity, not reported in the literature, and which could constitute evidence for proposing new research questions.
In the following sections, we will describe the procedure used for the extraction of reported fMRI images and subsequent mapping of functional activity patterns to a common brain template. Then we demonstrate the results obtained when mining information from a collection of articles related to the DMN. Using those results, we subsequently compare brain activity in healthy, Alzheimer and schizophrenic brains. We conclude the article with some remarks about the proposed approach, its limitations and possible future work.
2 Methodology
2.1 Data
The first step of our research consisted in the construction of a database of relevant publications. With this in mind, we searched for neuroscientific publications published online, in which the topic of discussion was related to the default mode network. This search was carried out using a keyword based search, with words such as DMN, Alzheimer, fMRI, cognitive impairment, Schizophrenia and resting state.
We gathered 183 articles in pdf format, from journals such as NeuroImage, Human Brain Mapping, Brain, Magnetic Resonance Imaging, PNAS and PLOS ONE. The time-frame for these articles ranged from early 2000 to June 2013. All papers were then separated according to the specificity of the analysis carried therein (see Table 1), distinguishing between studies on healthy brains (132), Alzheimer (29) and Schizophrenia (18) research.
2.2 fMRI Activity
Consider typical images of fMRI activity, as shown in Fig. 1. In a brief glance, it is easy to identify several features of relevance, such as the kind of section of the image (axial in this case, as opposed to sagittal or coronal), various anatomical features of the section, as well as the functional activity regions or ‘blobs’ within the section. We do that by relating the image to an internal representation of our anatomical and physiological knowledge of the brain. This relation takes into account physical and geometrical properties of the underlying structural image, as well as of the superimposed blob. In addition to the activity location, other features, such as intensity, area, perimeter or shape can be used to fully characterise the activity, c.f. ([1,18]). Other non-pictorial features, such as the text in the caption, could also be used to characterise said images.
In Fig. 1, one can also see the various reporting styles illustrated, including various underlying, gray-scale, structural images, colours and formats. The leftmost image shows a typical example where a slight increase of activity when compared to the reference corresponds to dark red while a big increase is depicted in bright yellow, which is typically called the hot colour scale. On the rightmost image, the decrease of activity when compared to the reference is shown in a gradation of blue, from dark to bright, corresponding to a small to big decrease. The image on the middle shows an example where the authors only chose to report the areas of difference in activation, without giving intensity information.
2.3 Image Extraction Procedure
In Fig. 2, we show a flowchart of our framework. We start by extracting figures from the PDF files of publications, using an open source command-line utility pdfimages, running in Linux.
For each journal publication there is a pre-defined, common reporting style, but, as shown before, different authors produce their figures with different styles. They have non-homogeneous content, such as multiple image frames per figure, other plots, annotations or captions. Since we desire a clear image in order to accurately isolate the fMRI activity of interest, it is necessary to morphologically process those figures.
The first stage is the object identification. Many figures have a simple background colour, like black or white, but others have different colours, e.g., gray. Hence, the background colour needs to be detected, which is done through histogram and border analysis. The possible background colours are detected from the borders of the image, and the one with highest number of pixels is selected.
To detect different objects in a figure, and after background detection, figures are converted to black (background) and white (objects) colour. In those binary images, the white areas correspond to the smallest rectangle enclosing a object. Objects in the border of the respective figure, as well as those composed of only a few pixels are discarded. The next step is to analyse the images that are left inside the remaining objects. After extracting said images, we need to identify and extract the ones that correspond to fMRI reports. This is done using various properties, such as:
-
a minimum perimeter of the image, which we have set to 80 pixels, to allow a sufficient processing resolution;
-
a minimum and maximum number of image/background pixel ratio, between 0.1 and 97.5, to avoid non-brain images;
-
percentage of colour pixels in the image between 0 and 40 % of coloured pixels, filtering out non-fMRI images or images with activity all over the brain;
-
image aspect ratio between 0.66 and 1.6, typical of a brain image;
-
one image should occupy more than 50 % of the frame, to eliminate multiple images in the same object frame.
Regarding the last property, we repeated the object identification procedure when objects included several images, until no more images could be found.
In the example shown in Fig. 2, the object frame containing the figure colour map is discarded, due to the aspect ratio. Two of the brain images are also discarded since they don’t have colour present, therefore not being considered as originating from an fMRI study.
The following step removes undesired annotations. In Fig. 2, these correspond to coordinate axis as well as letters ‘L’ and ‘R’. This stage is done by removing all images inside the frame, except for the biggest one. Also any lines in 0 or 90 degree angles are removed, using the Hough transform [8,20] on each frame. Pixels belonging to vertical/horizontal lines that are present in more than two thirds of the height/width of the object are replaced with an average intensity of the surrounding pixels.
2.4 Volume and Section Identification
Once the activity images have been retrieved and cleaned, the type of template used in the images, i.e. volume type, and sections are identified, to estimate the three-dimensional coordinates of the activated regions. To represent the three dimensional changes in brain activity, views from three different planes are used to represent them in two dimensions. Thus we have axial sections, along the transversal plane that travels from the top of the brain to bottom; the sagittal section that travels along the median plane, from left to right; and the coronal section, along the frontal plane, that travels from front to back. To do a proper characterisation of the images, instead of focusing on the internal features of each section, the symmetry characteristics of the section shapes are used, as show in Fig. 3.
The images are again converted to binary images, thereby outlining the respective shape of the section. Simple symmetry allows for a suitable distinction between sections. One axial section is mostly symmetric about both the horizontal and vertical axis (Fig. 3a, d). The coronal section displays some symmetry only with respect to its vertical axis (Fig. 3b, e) while the sagittal section is asymmetric (Fig. 3c, f).
Most researchers map the activity changes found onto either SPM [10] or Colin [3] volume templates. Colin volumes contain higher resolution sections, when compared to SPM. Regarding the spatial separation between adjacent sections, SPM volume templates uses 2 mm, whereas that distance is 1mm for Colin templates.
To detect the volume type, one can use a complexity measure of the images. We used a Canny filter, [4], to detect the voxels corresponding to contrast edges. This is done for both template volumes, i.e. Colin and SPM, and for all the image slices from the section identified before. The volume template we select corresponds to the one with the minimal difference between the analysed image and the volume template images. This difference is calculated for the whole image and for a centred square with half the image size. We then average both values and use this as the difference measure.
To identify which slice of the template’s volume corresponds to the extracted image, we compare that image with all of the template’s slices. This comparison is performed using a combination of correlation and scale invariant feature transform (SIFT, [16,20]). If there is a slice with a correlation of more than.9 with the extracted image, then that slice is selected. Otherwise, we select the slice which obtains the smallest distance of SIFT features as the correct mapping. Once this information is found, the complete coordinate set is identified for the reported image.
2.5 Blob Information Mapping
Once the geometrical considerations of the image have been dealt with, we can now characterise in the more detail the regions reported therein.
Activity regions are generated in response to stimulation. The properties of these regions largely define the fMRI activity and hence it is crucial that an analysis of the coloured blobs is carried out. Since we assume that only activations are color coded, these regions are easily segmented based on hue information (cf. ‘blob identification’ box in Fig. 2).
As mentioned before, the reporting style of different researchers can vary. This variety of reporting methods restricts the analysis that can be performed, since the same article can contain images with different colour scales. We tried to obtain intensity information from each fMRI image by using a colour map detection procedure, through histogram analysis. Since some images showed both increased and decreased activity, this step comprised from mild human intervention, aimed at fixing some wrongly detected colour maps. This was only applied to the rare cases where the automatic histogram analysis couldn’t detect the correct colour scale, and was performed rather easily.
Using the Colin brain template as a reference to our own reporting, we mapped all blob intensity information to their respective coordinates. We sum all intensities found in the data, for each voxel. Then those intensities are normalised to a scale from 0 to 1, where one corresponds to the highest possible common activity.
This produces a three-dimensional intensity map, where each voxel displays the intensity corresponding to the average activity in the data, for the respective voxels. Since this intensity map was built using two-dimensional images, we also performed a 3D smoothing, using a Gaussian ellipsoid with dimensions corresponding to 5 % of the template size.
In our reporting, we decided to use the jet colour scale for the summarising intensity map. There, colours go from dark blue to dark red, covering also green and yellow. One big difference between our scale and typical fMRI reports is that we don’t distinguish between increase or decrease of activity, when compared to a reference, but consider any coloured report as “interesting”. Therefore the dark blue corresponds to locations with very low reporting of activity (positive or negative) while dark red is used for locations with many reports. To avoid showing all the brain in dark blue, we show only intensities for locations where the number of reported blobs is more than 10 % of the total.
By superimposing this intensity map over the template volume, we can obtain a visual summary of all the results found in the articlesFootnote 1.
3 Results
3.1 Extracted Information
Table 1 shows how many articles, figures, images and blobs were found according to the publications analysed. Note that the number of samples for the unhealthy cases is quite small when compared to the healthy brains. This bias might affect the quality of the results, but the same problem would occur to any other meta-researcher, investigating brain activity in Alzheimer/Schizophrenia, due to the smaller sample of research dealing with these cases, when compared to the healthy controls.
Regarding the accuracy of the method, a simple visual inspection shows that once the volume and type of section are identified, the section coordinates were typically accurate within 1 voxel of distance. Also, the cleaning procedure of images mentioned in Sect. 2.3 doesn’t remove all artifacts from images, e.g., when the letters are inside the brain. Nonetheless we found that leftover artifacts rarely affected activity detection and subsequent mapping.
3.2 Meta-Analysis
After the compilation of all the results and the creation of the three-dimensional activity maps, one can perform analyses on the different types of brains studied.
3.2.1 Healthy Brain Activity
Figure 4 shows the brain activity reported for healthy subjects, displayed on axial section of the standard Colin reference. The highest areas of activity are the typical subsystems that compose the DMN: the posterior cingulate/precuneous, the medial pre-frontal cortex and the inferior parietal lobes. Note that, in the majority of the reports, including Alzheimer and Schizophrenia, most subjects presented the bulk of the activity in these major areas.
3.2.2 Alzheimer vs Healthy Brain Activity
We can now focus on the comparison between healthy, Alzheimer and Schizophrenia DMN activity, for example at axial height 114 of Colin’s standard brain (see Fig. 5). According to [2], one would expect that older brains have larger areas of activity than younger ones. We can see this in the posterior cingulate and in the inferior parietal lobes for Alzheimer when compared to the healthy brain image. On the other hand, the aged brain image shows somewhat less spread activity on the frontal lobe, when compared to the other areas of DMN. This seems counter-intuitive in light of the referred work. One may say that the lack of samples could cause this phenomenon, but our results seem rather consistent for the other areas. To find out a possible reason for this discrepancy, we can search for corroborating evidence in one of the articles analysed. In Fig. 1 of [5], there is a similar decrease in activity for aged brains, compared to healthier ones, confirming our own results.
3.2.3 Schizophrenia vs Healthy Brain Activity
Another analysis that can be performed with our method relates to finding areas of the brain with different activities between unhealthy brains and healthy ones. In Fig. 6, one can see images for axial height 130, where publications dealing with healthy brains report a bigger area of activity in the posterior cingulate area (PCC), when compared with brains suffering from Alzheimer, and even more so on schizophrenic brains.
3.2.4 Overall Comparison between Healthy, Alzheimer and Schizophrenia DMN Activity
To have a better overall perspective of the reported brain activations/deactivations, we can look at the 3D images of the intensity map, as depicted in Fig. 7. In this figure, we can clearly see that the areas reported on the healthy brains correspond exactly to the ones normally expected for studies of the DMN. On the brains suffering with Alzheimer, the intensity values decrease when compared to the healthy brain, as we suggested already in Fig. 5, although the areas reported are still the same as the from normal controls. Regarding the brains with Schizophrenia, we can see an area increase in the frontal region of the brain, while several smaller foci of activation appear, e.g., near the cerebellum.
4 Discussion
We gathered more than 180 articles studying the default mode network, and analysed the images contained therein, in order to get a summarising overview of their results. Our main goal was to automatically map the results of studies reported by several researchers, onto a standard brain, and use this mapping to analyse the differences between healthy and unhealthy brains. This task would involve a tremendous amount of work and time if done by a human curator, whereas our method retrieves most information in a uniform and almost automatic manner.
The complete procedure is done in approximately 1 min per article (including human intervention if needed), while it takes 30–60 min when done by a curator, as in [15]. In that publication, the researchers went through 13 publications to obtain the information they desired. Using our method, not only it would save a considerable amount of manual work, it would enable them to find other fMRI studies related to the areas they are interested in.
Looking at the results, it seems clear that our method performs remarkably well, suggesting that it could be used to help creating a comprehensive functional brain atlas. Since we only performed a rough analysis of a particular research topic, we didn’t aim at a complete report of all brain activities that might be studied.
There are some problems with our approach, that also occur in other automatic data-mining approaches. First, by using only image information we are giving the same weight to all publications, irrespectively of the number of subjects studied. Furthermore, statistical thresholds and analysis methods vary in every publication, hence we cannot claim to make a thorough statistical analysis. Also, the number of articles dealing with the unhealthy cases is quite small when compared to the healthy brains. All these problems will affect quantitatively our analysis, although we may still draw valuable information from the data. We also expect their influence to decrease with an increasing number of analysed publications.
We showed that with a clear topic in mind, it is possible to obtain results of high relevance. As an example, we have seen that most reports on DMN, regardless of the health condition of the subjects show activity on the posterior cingulate/precuneous, the medial pre-frontal cortex and the inferior parietal lobes. On the other hand, the pre-frontal activity of Alzheimer subjects is shown to be spatially restricted. Corroborating evidence for this finding can be traced back to the original published reports. Due to the reduced sample statistics for the unhealthy brains, we can’t guarantee that there is a ‘real’ lack of activity, or just the absence of reports, but it suggests a possible area of investigation.
As stated before, there is a considerable variability in how each researcher displays their results. In the future, and to mitigate the lack of availability of original data, our method could be included in online submission systems for publication, after authors have uploaded their document. With minimal manual effort, the authors could validate the proposed summarising data, and hence improve the quality of the information gathered.
Lately there have been more and more efforts to increase data availability, either through common databases or by submitting the data at the same time as the article. Naturally, when available, this would allow for a much better analysis of the data, avoiding all the problems of detecting fMRI images or which colour scale they have. Nevertheless, these databases are still rather rare.
Despite the specificity of the method regarding fMRI images, we believe the principles behind it could be easily ported to other areas of investigation, such as weather reports or earthquake maps.
We hope to further refine our method by combining it with a text-mining approach, and test it in situations where there is either a clear agreement between different research reports, or a challenge between theories. The former is a key aspect to the construction of functional neuro-atlases, whereas the latter may lead to true findings in neuroscience.
5 Appendix–Articles Database
Notes
- 1.
The 2D projections of said summarising volumes were produced using the ITK-SNAP tool [23].
References
Bankman I (ed) (2000) Handbook of medical imaging. Academic, New York
Beason-Held LL (2011) Dementia and the default mode. Curr Alzheimer Res 8(4):361–365
Brett M, Johnsrude IS, Owen AM (2002) The problem of functional localization in the human brain. Nat Rev Neurosci 3(3):243–249.
Canny J (1986) A computational approach to edge detection. Patt Anal Mach Intell IEEE Transac PAMI 8(6):679–698
De Vogelaere F, Santens P, Achten E, Boon P, Vingerhoets G (2012) Altered default-mode network activation in mild cognitive impairment compared with healthy aging. Neuroradiology 54(11):1195–1206
Deco G, Jirsa VK, McIntosh AR (2011) Emerging concepts for the dynamical organization of resting-state activity in the brain. Nat Rev Neurosci 12(1):43–56
Derrfuss J, Mar R (2009) Lost in localization: The need for a universal coordinate database. Neuroimage 48(1):1– 7
Duda RO, Hart PE (1972) Use of the Hough transformation to detect lines and curves in pictures. Commun ACM 15(1):11–15
Esposito F, Pignataro G, Di Renzo G, Spinali A, Paccone A, Tedeschi G, Annunziato L (2010) Alcohol increases spontaneous BOLD signal fluctuations in the visual network. Neuroimage 53(2):534–43
FIL Methods Group: Statistical Parametric Mapping. http://www.fil.ion.ucl.ac.uk/spm/
Hand D, Mannila H, Smyth P (2001) Principles of data mining. MIT Press, Cambridge
Huettel SA, Song AW, McCarthy G (2008) Functional magnetic resonance imaging, 2nd ed. Sinauer, Sunderland
Johnson SC, Ries ML, Hess TM, Carlsson CM, Gleason CE, Alexander AL, Rowley HA, Asthana S, Sager MA (2007) Effect of Alzheimer disease risk on brain function during self-appraisal in healthy middle-aged adults. Arch Gen Psychiat 64(10):1163–1171
Laird AR, Lancaster JL, Fox PT (2009) Lost in localization? the focus is meta-analysis. Neuroimage 48(1):18–20
Levy DJ, Glimcher PW (2012) The root of all value: a neural common currency for choice. Curr Opin Neurobiol 22(6):1027–1038
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vision 60:91–110
Raichle ME, MacLeod AM, Snyder AZ, Powers WJ, Gusnard DA, Shulman GL (2001) A default mode of brain function. Proc Natl Acad Sci USA 98(2):676–682
Rajasekharan J, Scharfenberger U, Gonçalves N, Vigário R (2010) Image approach towards document mining in neuroscientific publications. In: IDA, pp 147–158
Snyder AZ, Raichle ME (2012) A brief history of the resting state: the Washington University perspective. NeuroImage 62(2):902–910
Szeliski R (2010) Computer vision: algorithms and applications, 1st edn. Springer, New York
Yarkoni T, Poldrack RA, Nichols TE, Van Essen DC, Wager TD (2011) Large-scale automated synthesis of human functional neuroimaging data. Nature Methods 8(8):665–670
Ylipaavalniemi J, Vigário R (2008) Analyzing consistency of independent components: an fMRI illustration. NeuroImage 39(1):169 – 180. http://dx.doi.org/10.1016/j.neuroimage.2007.08.027. http://www.sciencedirect.com/science/article/pii/S10538119070 07288
Yushkevich PA, Piven J, Cody Hazlett H, Gimpel Smith R, Ho S, Gee JC, Gerig G (2006) User-guided 3D active contour segmentation of anatomical structures: significantly improved efficiency and reliability. Neuroimage 31(3):1116–1128
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Gonçalves, N., Vranou, G., Vigário, R. (2015). Automated Image Mining in fMRI Reports: a Meta-research Study. In: Tavares, J., Natal Jorge, R. (eds) Developments in Medical Image Processing and Computational Vision. Lecture Notes in Computational Vision and Biomechanics, vol 19. Springer, Cham. https://doi.org/10.1007/978-3-319-13407-9_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-13407-9_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13406-2
Online ISBN: 978-3-319-13407-9
eBook Packages: EngineeringEngineering (R0)