Introduction

Psychosis is one of the frequently observed non-motor symptoms (NMS) of Parkinson’s disease (PD), which commonly manifests as visual hallucinations (VH) and minor hallucinatory phenomena (MH) such as sense of presence and passage (Frei and Truong 2017). The lifetime prevalence of psychosis in patients with PD is more than 50% and is associated with poor quality of life and higher healthcare resource utilization (Fredericks et al. 2017). Although numerous risk factors for VH that include older age, female gender, greater disease severity, depression, autonomic dysfunction, sleep disturbances, and cognitive impairment have been identified (Lenka et al. 2017), a comprehensive understanding of the neurobiological underpinnings of VH is hitherto lacking. Neuroimaging studies (Lenka et al. 2015), until this date, have documented structural and functional alterations in regions of the brain corresponding to the dorsal and ventral visual pathways (Goldman et al. 2014), hippocampus (Yao et al. 2016), and certain cholinergic structures such as substantia innominata (Shin et al. 2012), and pedunculopontine nucleus (Janzen et al. 2012). Based on these results, several possibilities related to the mechanism of genesis of VH in PD have been proposed. These include presence of an aberrant top-to-down visual processing unit dominating over the usual down-to-top unit and neuro-transmitter imbalance especially involving acetylcholine. In addition, alterations in the hippocampal structure and function have also been speculated to play a significant role in the emergence of VH (Lenka et al. 2017).

The hippocampus has been regarded as a site of unification of spatial and non-spatial contextual information (Goodale and Milner 1992) and is crucial in encoding and retrieval of event memories (Behrendt 2010). It has been speculated that inappropriate integration of visual information by the hippocampi could induce hallucinations instead of reflecting the reality (Olypher et al. 2006). Although the underlying mechanism remains unclear, earlier studies, especially those employing electrical stimulation of the hippocampus, have triggered complex VH in patients with epilepsy (Vignal et al. 2007). In addition, there is evidence for hippocampal involvement in schizophrenic patients with hallucinations, especially in the visual mode (Amad et al. 2014). Several lines of evidence suggest that structural and/or functional abnormalities of the hippocampi could potentially play an important role in the genesis of VH in PD. For example, autopsy studies on patients with PD and VH have revealed higher accumulation of Lewy bodies in the medial temporal lobe (Gallagher et al. 2011), while the neuroimaging studies have revealed atrophy (Ibarretxe-Bilbao et al. 2008) and altered functional connectivity of the hippocampus (Yao et al. 2016). These studies have examined the hippocampus as a single structure; however, the hippocampus is a complex structure made up of various subfields that characterize different functional aspects related to memory (Mueller et al. 2011). For instance, encoding is associated with activation in input regions in the hippocampal circuit such as the dentate gyrus (DG) and CA2-3, while recall is related to activation in output regions such as the subiculum (Eldridge et al. 2005). In fact, significant association of atrophy of subiculum with impaired recall has been demonstrated in patients with PD (Beyer et al. 2013). It is, therefore, crucial to acquire a deeper understanding of the role of hippocampus in the pathogenesis of VH, not only by exploring it as a single structure but also by gaining insights into the specific and differential changes in the volumes of various hippocampal subfields. An earlier study has demonstrated significantly lower hippocampal subfield volumes of CA2/3, CA4/DG, and subiculum in patients with VH compared to healthy controls (HC) (Pereira et al. 2013); however, the study was carried out on a smaller sample size and employed an approximate atlas from Freesurfer 5.3 (Fischl and Dale 2000), providing coarse parcellations that included only seven subfields which did not account for the hippocampal head and tail regions (Pereira et al. 2013).

This study aims to explore and underline the hippocampal subfield volume differences on a large data set of PD patients with and without VH/MH and HC through a more sophisticated atlas (Freesurfer 6.0) (Fischl and Dale 2000; Fischl 2012) that include 12 hippocampal subfields. In addition, we aim to correlate our findings to psychosis severity as well as the neuropsychological scores pertinent to hippocampal functions, and to predict the severity of psychosis from the subfield volumes using a multi-variate prediction model.

Materials and methods

Subject recruitment and clinical evaluation

This study included 51 patients with PD without psychosis (PD-NP), 42 patients with PD with VH or MH (PD-P), and 48 age, gender, and education matched healthy HC. The patients were recruited from the general Neurology outpatient clinics and Movement Disorders services of the National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, India. All the subjects were parts of a large study aimed at identifying multi-modal biomarkers for psychosis in PD (over a period of 3.5 years: 2014 Nov–2018 April). Informed consent was obtained from all participants and the Institute Ethics Committee of NIMHANS, Bangalore, approved the study. Diagnosis of PD was based as per the UK Parkinson’s disease society brain bank criteria (Hughes et al. 1992). The disease severity was evaluated using the motor section of Unified Parkinson’s Disease Rating Scale (UPDRS-III) and stage of the disease was assessed as per the modified Hoehn and Yahr (H & Y) staging system. All patients underwent semi-structured interviews (by authors: AL and SSA) to explore the presence of psychiatric symptoms and the diagnosis of PD-P was based on the National Institutes of Neurological Disorders and Stroke and National Institute of Mental Health (NINDS–NIMH) criteria for diagnosis of PD associated psychosis (Ravina et al. 2007). Severity of the psychosis was assessed using the scale proposed by Ondo et al. (2015). Details of the anti-parkinsonian medications were obtained from all the patients and total levodopa equivalent dose per day (LEDD) was calculated (Tomlinson et al. 2010). Anxiety and depression were assessed by the corresponding Hamilton rating scales (i.e. HAM-A and HAM-D). The exclusion criteria for HC included history of any neurological or psychiatric disorders or brain injuries, and family history of any neuropsychiatric disorders.

Cognitive evaluation

Cognitive functions across various domains were briefly evaluated using the Montreal Cognitive Assessment Scale (MoCA). In addition, frontal assessment battery (FAB) was used to assess the frontal executive functions (Dubois et al. 2000). As this study aimed to explore the association of the hippocampal subfield volumes with the presence of psychosis in PD, we used neuropsychological tests to assess the functions (learning and memory, visuo-spatial abilities, attention, and executive function) that are relevant to hippocampal functioning (Godsil et al. 2013; Wicking et al. 2014). Rey auditory verbal learning test (RAVLT) (Vakil and Blachstein 1993) was used to assess verbal learning and memory, complex figure test (CFT) (Shin et al. 2006) and Corsi’s block-tapping test (Berch et al. 1998) were used to assess the visuo-spatial functions, and digit span was used to assess the attention of the recruited subjects. Trail making test (TMT-B) was used to assess the executive functions (Chan et al. 2008).

Image analysis

Image acquisition

Imaging data were acquired at NIMHANS, Bangalore using a Philips Achieva® 3T MRI scanner with a 32 channel head coil. High-resolution 3D T1 TFE images were acquired with repetition time (TR) = 8.1 ms, echo time (TE) = 3.7 ms, flip angle = 8°, sense factor = 3.5, field of view (FOV) = 256 × 256 × 155 mm, voxel size = 1 × 1 × 1 mm, slice thickness = 1 mm, acquisition matrix = 256 × 256, and 165 sagittal slices.

Image processing

The image-processing pipeline from Free Surfer 6.0 was adopted to pre-process the images as well as to automatically segment the hippocampal subfields (Fischl 2012). The pre-processing steps involved bias correction, automated transformation to the Talairach reference space, intensity normalization, and removal of non-brain tissue. This was followed by segmentation of the subcortical white matter and deep gray matter nuclei, tessellation of the gray matter and white matter boundary, automated topology correction, and surface deformation to optimally place the gray/white and gray/cerebrospinal fluid boundaries. Subsequently, automated hippocampal subfield segmentation was carried out using a Bayesian inference technique that employed a probabilistic atlas of the hippocampal formation that was trained on a hybrid data set involving in-vivo ultra-high-resolution (0.1 mm) MRI and ex-vivo autopsied brain MRI from multiple subjects. Each hippocampus (left and right) was segmented into 12 subfields: CA1, CA2–3, CA4, granule cell layer of dentate gyrus (GC-DG), parasubiculum, presubiculum, subiculum, hippocampus–amygdala transition area (HATA), fimbria, molecular layer (ML-HP), hippocampal fissure (HF), and the hippocampal tail (HT). The entire hippocampus volume was considered as region 13. The description of each of these regions is provided elsewhere (Iglesias et al. 2015). Figure 1 demonstrates the automated segmentation with labeled regions in one of our subjects. Following the automated subfield extraction, volumes of each of the subfield were computed and statistical analysis was performed between the three groups. To account for different brain sizes, we normalized the subfield volumes of each subject by the total brain volume of that subject.

Fig. 1
figure 1

Hippocampal subfield segmentation on a T1 MRI image using FreeSurfer 6.0 on one of the HCs

Statistical analysis

The analysis of demographic variables (Table 1) was performed using a standard analysis of variance test (ANOVA). For the volumetric analysis, between PD-P, PD-NP, and HC, we employed multi-variate analysis of covariance (MANCOVA) model, where the test was performed between the means of two groups under study, while age and gender were used as co-variates. Group analysis was carried out between HC vs. all patients (PD-P + PD-NP), HC vs. PD-NP, HC vs. PD-P, and PD-NP vs. PD-P. While comparing PD-P and PD-NP groups in addition to age and gender, LEDD, HAM-A, and duration of disease were also used as co-variates. Multiple comparisons were accounted for by employing False Discovery Rates (FDR) with a significance level of 0.1. The probability density functions (PDF) for each of the subfield volumes were also computed and plotted for each of the groups. Performance on the cognitive assessment was correlated with various subfield volumes separately for HC, PD-NP, and PD-P groups. The scores were initially adjusted for age and gender by means of linear regressions and the resulting standard residuals were utilized in the correlations. Pearson’s r was computed and the significance of the correlation was maintained at p value < 0.01.

Table 1 Demographic and clinical characteristics of the subjects

Prediction of severity in psychosis

A multi-variate model known as support vector regression (ɛ-SVR) algorithm (Smola and Schölkopf 2004) was employed to predict the psychosis severity scores from the hippocampal subfield volumes. SVR is a supervised learning technique based on the concept of support vector machines (SVM), but generalizes the categorical classification of SVM to predict continuous variables, which in our case was psychosis severity. SVMs model the data by finding the best separating hyperplane between data samples from two different classes by maximizing the margin between these two classes and are specified by data samples on the margin boundary known as support vectors. Similarly, in SVR, which is a regression model, the margin defined by threshold ɛ is minimized to achieve a superior fit to the data (Smola and Schölkopf 2004). In our case, we employed volumes of all the subfields (26) as features and severity as the predictor variable in the SVR model. A tenfold cross validation was performed, where the data were divided into ten parts, nine out of which were employed in training the SVR and the fifth one was used for testing. This process was repeated for all the folds such that every data sample was tested. Finally, the predicted scores were plotted against the severity scores and a correlation was obtained and tested for significance.

Results

Demographics, clinical characteristics, and cognitive evaluations

The three groups were matched for age and gender. The PD groups were not significantly different in terms of age at onset of PD and the duration of PD. Of those 42 patients with PD-P, 11 had isolated VH, 27 had isolated MH, and 4 had both VH and MH. Severity of motor symptoms (assessed by UPDRS-III) and stage of PD (assessed by H&Y scale) were similar in two PD groups. Both the PD groups had significantly higher HAM-A and HAM-D score compared to HC. Comparison between PD-NP and PD-P revealed higher HAM-A scores in PD-P, whereas the two groups did not have a significantly different HAM-D score. LEDD was significantly higher in the PD-P group compared to the PD-NP group (Table 1).

The overall cognitive performance as measured by MoCA revealed no difference between the PD groups, whereas the HC had significantly higher MoCA scores compared to both PD-P and PD-NP groups. FAB score, which represents frontal executive functions, were significantly low in both PD groups compared to HC, whereas the PD-P group compared to PD-NP had a lower FAB score. The only difference observed in the Corsi block-tapping test and digit span test (forward and backward) was between HC and PD-P (better performance in HC). CFT revealed significantly poor performance in PD-P compared to both PD-NP and HC. In RAVLT (total learning, immediate recall, and delayed recall), PD-P performed poorly than both PD-NP and HC. In TMT-B, PD-P group performed poorly compared to both HC and PD-NP (Table 1).

Subfield volumetric analysis

From the statistical analysis on each of the normalized complete hippocampal volume, it was revealed that HC have significantly higher hippocampal volumes than the PD-P subgroup; however, there was no difference between HC and PD-NP subgroup as well as the complete PD cohort. Nonetheless, a trend of lower subfield volumes (uncorrected) in PD (PD-P + PD-NP) subjects was observed in left hippocampal tail and right HATA. The differences in PD-P and HC were significantly discernible in multiple regions that involved: bilateral molecular layer, GC-DG, left subiculum and hippocampal tail and right CA3, CA4, HATA (corrected p value < 0.05), where PD-P demonstrated significantly lower volumes than HC (Table 2). However, only in the left and right HF, the PD-P subgroup revealed higher volume than the HC (values displayed in Table 2). Furthermore, between controls and PD-NP, no significant differences were observed; however, an uncorrected trend in the left hippocampal tail and right HATA was revealed, similar to the complete PD vs. HC analysis. Finally, comparison between PD-P and PD-NP groups revealed an uncorrected significant difference in bilateral hippocampal fissure, where PD-P subgroup showed significantly higher volume than PD-NP. Bar charts in Fig. 2 and Table 2 summarize all the volumetric results.

Table 2 Volumes of all the hippocampal subfields (normalized for whole brain volume) of the three groups
Fig. 2
figure 2

Bar charts demonstrating the results from MANCOVA between HC and PD-P (top-left), HC and PD-NP (top-right), PD-P vs. PD-NP (bottom-left) and HC vs. all PD (bottom-right). The bars represent the normalized mean volume and the error bars are the standard deviation

To attain deeper understanding of the trend between PD-P and PD-NP, we plotted the PDFs of specific subfield volumes for all three groups (Fig. 3). It can be observed that the PD-P group had the lowest subfield volumes compared to the other two groups, while the PDFs of PD-NP group were either overlapped with the HC (left subiculum, left CA1, left GC-DG, and right and left hippocampal fissure) or lied in between HC and PD-P (left and right molecular layer, right CA3, right CA4, right GC-DG, R-HATA, and bilateral total hippocampal volume) (Fig. 3). Only the HF demonstrated an opposite trend, where the PD-P group illustrated highest volumes in comparison with PD-NP and HC, as shown in Fig. 3.

Fig. 3
figure 3

Figure displays the PDFs of the subfield volumes in the three groups (HC, PD-P, and PD-NP). The PD-P (purple) has its PDF is on the leftmost side, while PD-NP (pink) lies in the middle of PD-P and HC and demonstrates high variability compared to PD-P group

Correlation of the subfield volumes with parameters of cognition

Analysis for the correlation of the subfield volumes and the cognitive parameters was done for the PD groups (Table 3; Fig. 4). In the PD-P group—(1) the FAB score had a significant positive correlation (p < 0.01) with the volume of left fimbria and (2) the psychosis severity score had a negative correlation with the left CA3 volume. Other correlations with p value < 0.05 have been provided in Table 3 and their plots in the supplementary material (Supp. Figure 1). The key findings were the negative correlation of psychosis severity with right CA3 (p value = 0.011) and a negative correlation between CFT (copy) score and left hippocampal fissure volume (p value = 0.05).

Table 3 Summary of correlation analysis between volume of hippocampal subfields in PD-P group and scores of the neuropsychological examinations
Fig. 4
figure 4

Correlation of the volumes of hippocampal subfields with scores of neuropsychological tests and psychosis severity (results with p < 0.01)

Predicting the psychosis severity

The multi-variate SVR model demonstrated a precise prediction of psychosis severity from all the hippocampal subfield volumes together. A tenfold cross validation revealed that severity scores could be predicted accurately (p value = 0.009, when tested for correlation), with a correlation coefficient of 0.4. Figure 5 demonstrates the plot of baseline scores vs. the predicted scores.

Fig. 5
figure 5

Plot of baseline scores vs. the predicted scores obtained through support vector regression

Discussion

In this study, we compared the volumes of the hippocampal subfields of PD-P, PD-NP, and HC and correlated these volumes to psychosis severity as well as scores of representing several cognitive domains. In addition, we employed advanced SVR model to predict the psychosis severity index. Our findings not only endorsed earlier discoveries, but also provided deeper insights into the bidirectional volumetric changes that occur with psychosis and their cognitive associations. Moreover, using multi-variate regression method, we demonstrated that the trajectory of psychosis severity could be mapped using hippocampal subfield volumes. The key findings of this study included: (1) total hippocampal volume as well as widespread subfield volume reduction in the PD-P group compared to HC; (2) larger hippocampal fissure volume in PD-P in comparison with controls and PD-NP; (3) significant negative association between the CA3 volumes with psychosis severity scores, as well as significant specific associations with cognitive functions; and (4) high accuracy in predicting psychosis severity in PD-P subjects from the hippocampal subfields.

Hippocampal involvement in PD has received a great deal of attention since the identification of a wide range of non-motor symptoms in PD. Growing evidence from clinical observations and experimental studies suggests a complex hippocampal cross talk among the dopaminergic and other transmitter systems which is crucial for synaptic plasticity, adaptive memory, and motivated behavior (Calabresi et al. 2013). Earlier studies have documented degeneration of hippocampal subfield CA2-3 in autopsy studies on demented as well as non-demented patients with PD (Mattila et al. 1999; Harding and Halliday 2001) as well as via MRI-based analyses (Beyer et al. 2013; Pereira et al. 2013). Moreover, the presence of Lewy neurites has been documented in the CA-2 region in patients with PD (Braak and Braak 2000). In addition, recent studies have also identified significant impairment in the hippocampal white matter pathways in patients with PD compared to HC (Shah et al. 2017). Although the current study did not reveal any difference in the hippocampal subfield volumes between the overall PD cohort and HC as well as between PD-NP and controls, there was a trend of lower volume in right HATA and left HT (uncorrected result). A recent study on a large number of PD patients supports our finding as they did not find any significant difference between hippocampal volumes between PD and HC (Schulz et al. 2018).

One of the key results of this study was higher volume of the hippocampal fissure in PD-P group compared to both PD-NP and HC (uncorrected). Moreover, there was a significant negative correlation (at significance level of p < 0.05) between the volumes of hippocampal fissure and the score of CFT (copy), which represents visual memory and visuo-spatial functions. HF dilation, during the course of hippocampal atrophy, has been recently established from a mouse hippocampus study that demonstrated that the increase in HF volume is highly correlated with decrease in the overall hippocampal volume and can be considered as a radiological hallmark of ongoing hippocampal atrophy (Li et al. 2018). Existing studies in Alzheimer’s disease and schizophrenia have also confirmed widening of the HF in patients and have related to medial temporal lobe atrophy (Smith et al. 2003; De Bastos-Leite et al. 2006). The aforementioned evidence indicates that HF hypertrophy perhaps heralds atrophy of hippocampus. Since we did not find any difference in the remaining subfields between PD-NP and PD-P, it is possible that the PD-P group in our study in the early stage of a process leading to hippocampal atrophy and longitudinal evaluations may provide more insights into the same.

The comparison between HC and PD-P revealed significant reduction in the volumes of left subiculum, granule cell-dentate gyrus, right CA3, CA4, bilateral molecular layer, hippocampal tail, and HATA in PD-P. Subiculum has been implicated in memory and recall process and the previous studies have documented atrophy of the subiculum in PD patients with VH (Pereira et al. 2013) as well as in patients with Alzheimer’s disease (Bobinski et al. 2000; Mueller et al. 2011). The possible role of subiculum in the aforementioned cognitive processed gets support from our correlation analysis, which revealed a significant positive correlation of volume of the left subiculum with scores of AVLT (delayed recall) (sup. figure-1). Our study also demonstrated the associations between the CA3 subfield volumes (Fig. 4 and Supp. Figure 1) and psychosis severity suggesting putative role of CA3 in generating hallucinations. Although the exact mechanism by which structural alterations in CA3 result in hallucinations is not clear, aberrant connections between CA3 and CA1 have been implicated in the genesis of positive symptoms in patients with schizophrenia (Tamminga et al. 2010). It is posited that these hippocampal subfields act as a binding module for the cortical circuits comprising weakly related sensory representations. Moreover, CA3 in particular has been speculated to generate representations of space and time as a basis of conscious awareness (Behrendt 2010). Hence, the resultant sensory disintegration secondary to atrophy of the aforementioned subfields is perhaps responsible for the genesis of hallucinations. Furthermore, with regard to the notion that hyper-dopaminergic state may induce psychosis (Tost et al. 2010), it has been hypothesized that disinhibition of hippocampal subfields may favor hyper-dopaminergic states, thus resulting into psychosis (Lisman et al. 2008).

In addition, animal studies have reported that lesions in the fimbria and/or amygdala result in impaired visuo-spatial functions, and object discrimination and interestingly, visuo-spatial dysfunction, and poor object discrimination have been reported in PD patients with hallucinations (Lenka et al. 2016a). Our results corroborate this by revealing a positive correlation between right fimbria volume and FAB scores (Fig. 4) and with CFT (copy) (Supp. Figure 1). The PD-P group also illustrated atrophy of the subfield representing the GC-DG, which is an important input region of the hippocampus. Several lines of evidence indicate that the DG participates in the pre-processing of incoming information, preparing it for subsequent processing in CA3 (Jonas and Lisman 2014). Moreover, the DG is probably involved in pattern separation, transforming relatively similar input patterns into substantially different output patterns. Hence, it is possible that structural and functional alterations in the DG may precipitate hallucinations.

We observed a distinct trend between the groups via the PDF plots (Fig. 3), where the HC demonstrated highest subfield volumes, whereas the PD-NP either overlapped with HC or were located in between HCs and PD-Ps. This trend was observed only in specific subfields, as shown in Fig. 3. This trend indicates that genesis of minor or visual hallucination perhaps parallels to that of progressive hippocampal atrophy. Overall, our analysis provides novel insights into the role of hippocampus in PD-P. The core volumetric changes occurring in the hippocampus could provide a direct indication of the severity of psychosis, as shown in Fig. 5. The multi-variate model could capture the bidirectional volumetric alterations that spanned over all the hippocampal subfields to predict the severity of hallucinations. Such a model can be further tested and employed to quantify the pathology through structural changes.

In this study, we had a relatively limited but focused scope to investigate the alterations in hippocampal subfields in PD-P. The interpretation is largely limited to hippocampal function, which is one of the limitations of this study. Cross-sectional design of the study was another limitation as patients who did not have psychosis at the time of the study may develop psychosis in future. It may also be argued that the volume changes in the hippocampal subfields, which we have noticed in PD-P group compared to HC, are more representative of the cognitive dysfunction in the PD-P rather than the psychosis. However, it is known that psychosis and cognitive dysfunction usually coexist as part of PD non-motor symptoms and it is possible that both these symptoms have overlapping neural correlates (Lenka et al. 2016b). As cross-sectional studies provide limited inference regarding the exact association between them, future studies involving longitudinal design may aid in achieving deeper insights.

Conclusion

In summary, we conclude that patients with PD-P have atrophy of several hippocampal subfields compared to HC. The volumetric changes in PD-P are differential in nature, as majority of the subfields had lower volume (compared to controls), whereas the HF had increased volume (compared to both HC and PD-NP) and the volume changes had significant correlations with specific cognitive functions. In addition, the CA3 subfield volumes in PD-P illustrated a strong correlation with severity of psychosis, thus facilitating deeper understanding of the role of hippocampus in VHs. The study reinforces the possibility that structural alterations in specific hippocampal regions in PD-P are associated with cognitive impairment and hippocampal subfield volumetry merits further exploration in this context.