Introduction

Electroencephalography (EEG), the measurement of scalp electrical potential differences originating mainly from postsynaptic currents of signaling neurons, is a traditional method for the measurement of brain function. Some modern methods, such as, functional magnetic resonance imaging (fMRI), positron emission tomography (PET), and near-infrared spectroscopy (NIRS), differ from EEG and other electrophysiological methods in that they measure hemodynamic changes related to brain activity. Hemodynamic imaging is possible because the underlying neuronal activity is linked to brain hemodynamics through the so-called neurovascular coupling (NVC) (Fox et al. 1988; Raichle and Mintun 2006). To better understand the results from these techniques, the relationship between neuronal activity and hemodynamic responses (HDRs) has been studied intensively in recent years. Studies in humans suggest that, under certain circumstances, the HDR is linearly coupled to a stimulus parameter that can be, e.g., the stimulus duration, frequency, or intensity and that roughly approximates the neuronal activity induced by the stimulus (Boynton et al. 1996; Soltysik et al. 2004; Wobst et al. 2001). In this context, linearity means that scaled and summed neuronal activity or stimulus parameter produces scaled and summed HDRs. Mathematically expressed, if h 1(t) and h 2(t) are the HDRs produced by the neuronal activities n 1(t) and n 2(t), respectively, then the HDR to a n 1(t) + b n 2(t) is a h 1(t) + b h 2(t), where a and b are scaling factors. In many situations, however, the HDRs do not follow this linear relationship. For instance, HDRs are nonlinearly coupled to visual stimulus contrasts (e.g., Michelson contrast) and concentration changes of oxyhemoglobin (HbO2) to visual stimulus train durations (Boynton et al. 1996; Rovati et al. 2007; Wobst et al. 2001).

Electrophysiological measurements can predict the neuronal activity more realistically than a simple stimulus parameter, such as the duration, frequency, or intensity, as the stimulus and the evoked neuronal activity may be nonlinearly related. Combined electrophysiological and hemodynamic measurements have mainly been carried out in animals (Devor et al. 2003; Franceschini et al. 2008; Logothetis et al. 2001), but also some human studies have been published (Janz et al. 2001; Obrig et al. 2002; Ou et al. 2008; Rovati et al. 2007). They report both linear and nonlinear relationships between neuronal responses and HDRs, depending on the stimulus type and parameters and the electrophysiological method.

Simultaneous measurement of electrical and vascular brain activity is technically challenging, especially in human subjects. For example, fMRI, using high magnetic fields and changing field gradients, cannot be combined with magnetoencephalography (MEG), which is very sensitive to external magnetic fields. A combined fMRI/EEG measurement is possible but difficult (Bonmassar et al. 1999, 2001; Janz et al. 2001). In contrast to fMRI, NIRS does not affect nor is affected by the electromagnetic fields measured or produced by other techniques and is therefore well-suited for multimodal studies. Compared to fMRI, NIRS has a poorer spatial resolution but a higher temporal resolution and it can measure both HbO2 and deoxyhemoglobin concentration changes (HbR), whereas the blood-oxygen-level-dependent (BOLD) signal of fMRI is mostly sensitive to HbR. Some simultaneous NIRS and EEG measurements have already been reported in humans (Herrmann et al. 2008; Horovitz and Gore 2004; Kennan et al. 2002; Koch et al. 2006, 2008; Moosmann et al. 2003; Noponen et al. 2005; Obrig et al. 2002; Rovati et al. 2007). However, only Obrig et al. (2002) and Rovati et al. (2007) have attempted to model the relationship between the evoked potentials and the HDRs among these studies.

Currently, there is no clear evidence whether NIRS responses evoked by short (3–12 s) visual stimuli of different durations are linearly correlated with visual-evoked potentials (VEPs) in humans and in animals, or not. Wobst et al. (2001) have shown that HbR responses are linearly coupled to the pattern-reversing (10 Hz) checkerboard stimulus duration (6–24 s) in humans, but HbO2 responses are not. Furthermore, Obrig et al. (2002) have provided evidence that the relative amplitudes of NIRS responses and VEPs habituate approximately the same amount during a long (1 min) pattern-reversing stimulus, suggesting a linear relationship between them. Our study combines a NIRS measurement of HbO2 and HbR responses to short (3–12 s) visual stimuli with a simultaneous EEG measurement of VEPs in humans. The area-under-the-curves of VEPs summed over the stimulus train duration \((\Upsigma\hbox{VEP})\) provide alternative and more direct estimates of neuronal activity in addition to the stimulus train durations. Using area-under-the-curves of NIRS responses and VEPs, we try to incorporate not only habituation but also other processes, such as attention, gaze fixation, and eye movements, affecting the responses. Our hypothesis is that the linear correlation of the HbO2 and HbR responses is stronger with the \(\Upsigma\hbox{VEP}\) as a regressor than with the stimulus train duration as a regressor indicating a more linear relationship between these.

Materials and methods

Instrumentation

We used a 16-channel frequency-domain NIRS instrument which utilized two laser diodes (685 and 830 nm) modulated at 100 MHz as light sources (Nissilä et al. 2005b). Two source fibers transmitted the light with an optical power of 10–15 mW to the subject’s head. The light scattered and absorbed in the tissue was guided from the head to detectors (photomultiplier tubes) through 16 fiber bundles.

We recorded VEPs with a 60-channel EEG device (eXimia EEG, Nexstim Ltd.) with a sampling frequency of 1,450 Hz per channel and a passband of 0.1–350 Hz. Eye movements were monitored with an electrooculogram (EOG) for which one electrode was placed laterally and another electrode superiorly to the right eye.

For simultaneous NIRS/EEG measurements, we integrated NIRS fiber holders into the EEG cap (Fig. 1). One NIRS source fiber and eight detector fiber bundles were attached over the occipital lobe on each hemisphere in two semicircles at 2 and 4 cm source-to-detector (SD) distances (Fig. 2a). A source fiber above each hemisphere was used to avoid disturbances in the optical signals caused by the longitudinal fissure and the superior sagittal sinus. In addition, the two SD distances provided signals that contained differing contributions from superficial and brain tissues (Firbank et al. 1998; Germon et al. 1998).

Fig. 1
figure 1

EEG cap with integrated NIRS fiber holders

Fig. 2
figure 2

a Layout of NIRS sources and detectors along with some electrodes in the occipital areas. b Layout of EEG electrodes. Recorded channels are marked in black, other channels in gray. Some electrodes are labeled according to the International 10–20 electrode positioning system

EEG was recorded mainly above the occipital lobe with 22 Ag/AgCl electrodes (Fig. 2b). The reference electrode was placed on the left cheek, but the data were afterwards rereferenced to Fz (International 10–20 System) to obtain standardized waveforms (American Clinical Neurophysiology Society 2006a; Odom et al. 2004).

An anesthesia monitor (AS/3 Anesthesia Monitor, Datex-Ohmeda, Finland) recorded pulse waveform and arterial oxygen saturation with a pulse oximeter placed on the finger tip and head movements with an inclinometer attached on the forehead.

Subjects

We measured ten healthy male subjects (age 20–36, mean 25.3), who gave their signed informed consent before their enrollment for the study.

Study design

The subjects sat comfortably in a dimmed room during the measurements. They looked at a reversing checkerboard pattern projected on a screen at a 1.5-m distance from their eyes (Fig. 3). The size of the checkerboard pattern was 31.5 × 31.5 cm (12° of the visual field), and each check was 1.3 × 1.3 cm (0.5° of the visual field). A pattern-reversing stimulus was selected, since its VEPs vary less across subjects compared to other typical visual stimuli, and the VEP waveform contains the clearly detectable peaks N75, P100, and N135 (Odom et al. 2004).

Fig. 3
figure 3

Measurement set-up

Three stimulation runs of about 14 min were presented to each subject with the Presentation® software (version 9.30, http://www.neurobs.com), each followed by rest periods of about 10 min during which the subjects could move (see Fig. 4). Each stimulation run began with a gray screen visible for 30 s, followed by 22 stimulus trains. Each train consisted of 3, 6, or 12 s checkerboard stimulus reversing at 2 Hz and a randomized inter-stimulus-interval (ISI) of 25–35 s (mean 30 s) during which the screen was gray. The ISI was randomized to reduce the interference of slow systemic oscillations in the averaged HDRs (Elwell et al. 1999; Obrig et al. 2000), and it was sufficiently long for the HDRs to return back to baseline between the stimulus trains (Dale 1999). Each of the three runs comprised seven 6 and 12 s and eight 3 s stimulus trains in a randomized order. The checkerboard pattern and the gray screen had a small red fixation cross in the center on which the subjects were asked to focus their eyes. Each run contained 278 pattern reversals, resulting in total of 834 pattern reversals for each subject.

Fig. 4
figure 4

Stimulation protocol. Three stimulation runs of approximately 14 min were presented to each subject interleaved with approximately 10-min rest periods. The stimulation runs consisted of trains of reversing checkerboard patterns and a gray screen during the ISIs

The study protocol was approved by the Ethics Committee of the Helsinki University Central Hospital, and it was in compliance with the Declaration of Helsinki.

Data processing (NIRS)

The NIRS amplitude and phase signals were calibrated before further data processing (Nissilä et al. 2005b; Tarvainen et al. 2005). Systemic slow oscillations and fiber contact variations were suppressed by dividing the amplitude signals with low-pass filtered signals (−3 dB cutoff frequency at 0.017 Hz). Higher frequency components, such as the heart beat, were attenuated with low-pass filtering (−3 dB cutoff frequency at 0.33 Hz). Changes in the modulation amplitude of the light were transformed into changes in HbO2 and HbR with the modified Beer–Lambert law, where the optical path length was determined from the calibrated phase (Elwell 1995). Channels having a standard deviation of the HbO2 signal higher than 1.7 μM or a standard deviation of the HbR signal higher than 0.7 μM were rejected, since their SNR was not sufficiently high to detect the HDRs. The low SNR of these channels was probably due an excess of hair between the skin and the detector. Epochs containing movements were rejected from the NIRS data using the inclinometer data. Also, epochs containing over 5 μM HbO2 or 2 μM HbR peak-to-peak changes in the time window from −5 to 25 s with respect to the stimulus onset were rejected because they most likely contained movement or contact artifacts and no reliably detectable HDRs.

The HbO2 and HbR changes of each subject were averaged for each detector channel and each stimulus train duration in the time window from −5 to 25 s with respect to the stimulus onset. These averaged HDRs of each channel and stimulus train duration were tested with a two-tailed Student’s t test for statistically significant differences between the mean of the baseline (from −5 to 0 s) and the positive peak value of HbO2 and negative peak value of HbR in the time window from 4 to 20 s. The significance level was set at 0.05. Channels with statistically significant HbO2 and HbR responses to a stimulus train duration were accepted for the linearity analysis.

The area-under-the-curves of rectified HbO2 and HbR responses \((\Upsigma\hbox{HbO}_2\,\hbox{and}\,\Upsigma\hbox{HbR})\) were calculated for the accepted channels in the time window from the stimulus onset to the return of the HDRs to baseline to quantify the vascular changes produced by the three different stimuli. Area-under-the-curves were used instead of peak values of the responses, since the response amplitudes saturate when long stimuli are used. The \(\Upsigma\hbox{HbO}_2\) and \(\Upsigma\hbox{HbR}\) to different stimuli were not calculated with a fixed time window, since the responses to shorter stimulus trains return faster to baseline than the responses to longer stimulus trains. If the time window was fixed, the \(\Upsigma\hbox{HbO}_2\) and \(\Upsigma\hbox{HbR}\) to short stimulus trains would be overestimated because of nonzero noise at the end of the rectified responses. The time point at which the HDR returned to baseline was determined manually for each subject and stimulus separately.

A time-locked average of the heart rate calculated in the time-window from −5 to 25 s from the pulse oximeter data provided an estimate of average systemic changes in the circulation due to stimulation. It was used to ensure that HDRs are evoked by local cerebrovascular changes rather than by global changes in circulation. The data of subjects having greater than 3 bpm changes in their average heart rate were rejected from the data analysis.

Data processing (EEG)

The EEG data were divided into epochs ranging from −100 to 400 ms with respect to each pattern reversal and the initial display of the checkerboard pattern at the start of each stimulus train. Epochs containing eye blinks or clear motion artifacts were rejected, and the mean value of the baseline of the remaining epochs (from −100 to 0 ms) was shifted to zero. The accepted epochs were averaged and band-pass filtered (−3 dB cutoff frequencies at 0.6 and 106.5 Hz).

The VEPs corresponding to the first checkerboard display and to each of the subsequent pattern reversal in the stimulus trains were averaged across the stimulus trains for each channel and subject, resulting in a sequence of averaged VEPs (VEPi, i = 1, ..., 24, where 24 is the number of consecutive VEPs in the sequence). A higher SNR was obtained for VEPs from VEP1 to VEP6, since they were evoked by stimuli present in all of the 3, 6, and 12 s trains. Progressively fewer responses were averaged for the later pattern reversals in the longer trains. Also, a grand average VEP (VEPave) over all shown pattern reversals (not including the initial display of the pattern, i.e., VEP1) was calculated for each subject and channel separately.

To minimize noise, the VEPave was fitted to VEPi (i = 2, ..., 24) by scaling and latency shifting it. The VEP1 was left out from the fitting because it is a response to a flash stimulus; the checkerboard showed up from the gray background at the beginning of each stimulus train. This original VEP1 and a sequence of scaled and latency shifted VEPave (VEP iave , i = 2, ..., 24) were used in subsequent analysis instead of VEPi (i = 1, ..., 24), since the original VEPi sequence had a variable and low SNR depending on i. The fitting procedure allowed the VEP iave to change in amplitude and latency but not in waveform, which was a reasonable assumption, since the stimulus was unchanged over time.

The fitting procedure of VEPave to VEPi was performed so that first a latency shifted VEPave(t) was calculated for each pattern reversal (i = 2, ..., 24):

$$ \hbox{VEP}^i_{{\text{ave}}, \tau}(t) = \hbox{VEP}_{\text{ave}}(t-\tau_i), $$
(1)

where t is the time index and τ i the latency shift corresponding to the ith pattern reversal. The parameter τ i was defined as the time index that maximized the unbiased estimate of the cross-correlation r(t) between VEPi(t) and VEPave(t):

$$ \tau_i = \mathop{\arg \max}_{t \in [-T+1 \; T-1]} r(t), $$
(2)

where T is the number of time samples in VEPi(t).

After latency shifting, the final VEP iave (t) was obtained by scaling the \( \hbox{VEP}^i_{{\text{ave}}, \tau}(t) \) VEP iave, τ(t) to match its amplitude with the amplitude of the original sequence VEPi(t):

$$ \hbox{VEP}^i_{\text{ave}}(t) = a_i \hbox{VEP}^i_{{\text{ave,}}\tau}(t) + b_i, $$
(3)

where a i and b i are scaling factors obtained by minimizing the sum of squared residuals between VEPi(t) and VEP iave (t) in the time window from 20 to 150 ms. This time window was selected, since it contains all of the three commonly known peaks N75, P100, and N135 of the pattern-reversal VEPs (American Clinical Neurophysiology Society 2006b).

To quantify the magnitude of the VEPs, the area-under-the-curves of rectified VEP iave (i = 2, ..., 24) and \(\hbox{VEP}^1 (\Upsigma\hbox{VEP}^i_{\text{ave}}\, \hbox{and} \Upsigma\hbox{VEP}^1)\) were calculated separately for each subject and channel in the time window from 0 to 400 ms. Time windows of less than 400 ms were also tested, but the length of the time window had no significant effect on the results. The \(\Upsigma\hbox{VEP}^i_{\text{ave}}\) and \(\Upsigma\hbox{VEP}^1\) were averaged over electrodes 54–60 (Fig. 2b) and summed over the stimulus train duration to quantify the mean evoked potential activity induced by each of the three stimulus trains. In other words, these sums over the stimulus train durations \((\Upsigma\hbox{VEP})\) were obtained for the different stimulus train durations as follows:

$$ \begin{aligned} 3\,\hbox{s}\!: \Upsigma\hbox{VEP} &= \Upsigma\hbox{VEP}^1 + \sum_{i=2}^{6} \Upsigma \hbox{VEP}^i_{\text{ave}}\cr 6\,\hbox{s}\!:\Upsigma\hbox{VEP} &= \Upsigma\hbox{VEP}^1 + \sum_{i=2}^{12} \Upsigma \hbox{VEP}^i_{\text{ave}}\cr 12\,\hbox{s}\!:\Upsigma\hbox{VEP} &= \Upsigma\hbox{VEP}^1 + \sum_{i=2}^{24} \Upsigma \hbox{VEP}^i_{\text{ave}}\cr \end{aligned} $$

where \(\Upsigma\hbox{VEP}^1\) and \(\Upsigma\hbox{VEP}^i_{\text{ave}}\) are the averages over electrodes from 54 to 60.

We also used the difference between amplitudes of P100 and N135 peaks to quantify the evoked potential activity [adopted from Obrig et al. (2002)], but these did not qualitatively change the results.

Linearity

The linearity between the \(\Upsigma\hbox{HbO}_2\) or the \(\Upsigma\hbox{HbR}\) and the stimulus train duration or the \(\Upsigma\hbox{VEP}\) was measured by fitting a regression line y = β0 x + β1 to the data in the least-squares sense, where y is the \(\Upsigma\hbox{HbO}_2\) or the \(\Upsigma\hbox{HbR}\) evoked by the stimulus train and x the stimulus train duration or the corresponding \(\Upsigma\hbox{VEP}.\) The constants β0 and β1 are obtained from the fitting procedure. The regression was performed for each subject separately with y containing all the statistically significant \(\Upsigma\hbox{HbO}_2\hbox{s}\) or \(\Upsigma\hbox{HbR}\hbox{s}\) of the subject and for all subjects together with y containing the mean over statistically significant \(\Upsigma\hbox{HbO}_2\hbox{s}\) or \(\Upsigma\hbox{HbR}\hbox{s}\) of each subject. Also, the Pearson’s correlation coefficients (r) were calculated and tested with one-sample t test for a null hypothesis that r is zero (data uncorrelated) against an alternative hypothesis that r is not zero (data correlated). The squares of the correlation coefficients—the coefficients of determination (R 2)—were also obtained to quantify the fraction of the variance in y explained by the regression model.

Results

NIRS data

The data of two subjects (subjects 2 and 8) were rejected from further analysis, since in these subjects, the stimulation trains induced over 3 bpm changes in the heart rate. Over all subjects, the stimulation trains induced a change of 2.2 ± 0.3 (mean ± SEM) bpm in the heart rate.

Statistically significant HDRs to at least one of the three stimulus train durations appeared in the NIRS data of the remaining eight subjects (Table 1). Five of these subjects produced statistically significant HbO2 and HbR responses to all three durations, one to two durations, and two to one duration. The long SD-distance channels had more statistically significant HDRs than the short SD-distance channels during all stimulus train durations. The subjects with responses to only one or two stimulus train durations (subjects 3, 4, and 7) were left out from the further analysis since no regression line could be fitted in their data reliably.

Table 1 Numbers of NIRS detector channels having statistically significant HDRs to different stimulus train durations for all accepted subjects

The averages over all statistically significant HDRs of the five accepted subjects (subjects 1, 5, 6, 9, and 10) to each of the three stimulus train durations are shown in Fig. 5. The mean peak amplitudes of statistically significant HbO2 responses of the accepted subjects were larger than the absolute mean peak amplitudes of the HbR responses, and the shortest stimulus train duration produced lower amplitudes than the two longer ones (Table 2). The \(\Upsigma\hbox{HbO}_2, \Upsigma\hbox{HbR}\) and the peak latency of HbO2 increased with increasing stimulus duration.

Fig. 5
figure 5

Mean HbO2 (black line) and HbR (gray line) responses over all statistically significant channels of the five subjects accepted into the regression analysis and \(\Upsigma\hbox{VEP}^i_{\text{ave}}\) (gray area) to 3, 6, and 12 s stimulus train durations. Thin black curves indicate the SEM of the \(\Upsigma\hbox{VEP}^i_{ave}\) and error bars the SEM of the HDRs. The dotted line indicates stimulus train onset

Table 2 Descriptive statistics of the statistically significant HDRs of accepted subjects (mean ± SEM)

VEPs

All subjects showed clear N75, P100, and N135 peaks in their VEPs. The grand average VEPs over all subjects are shown in Fig. 6 for occipital channels. The average of \(\Upsigma\hbox{VEP}\) values over accepted subjects were 2.8 ± 0.3, 5.7 ± 0.7, and 11.4 ± 0.8 μVs (mean ± SEM) for the 3, 6, and 12 s stimulus train durations, respectively. The values increased approximately linearly with increasing stimulus train duration. The \(\Upsigma\hbox{VEP} ^i_{\text{ave}}\) values to different stimulus train durations are shown in Fig. 5 along with the average HDRs. The \(\Upsigma\hbox{VEP} ^i_{\text{ave}}\) values did not show clear habituation during the stimulation trains.

Fig. 6
figure 6

Grand average VEPs over all subjects over the occipital lobe. The dotted line indicates the stimulus onset

Linearity

The results of the linear regression are shown in Table 3 and in Fig. 7. With the \(\Upsigma\hbox{VEP}\) as a regressor, the correlation coefficients over all subjects were 0.74 for \(\Upsigma\hbox{HbO}_2\) and 0.86 for \(\Upsigma\hbox{HbR},\) and with stimulus train duration as a regressor 0.67 for \(\Upsigma\hbox{HbO}_2\) and 0.72 for \(\Upsigma\hbox{HbR}.\) All of these four correlation coefficients differed statistically significantly from zero (p < 0.05) indicating that \(\Upsigma\hbox{HbO}_2\) and \(\Upsigma\hbox{HbR}\) are correlated with the \(\Upsigma\hbox{VEP}\) and the stimulus train duration. The correlation coefficients of \(\Upsigma\hbox{HbR}\) were higher than the ones of \(\Upsigma\hbox{HbO}_2\) with both regressors. Of the regressors, the \(\Upsigma\hbox{VEP}\) gave higher correlation coefficients than the stimulus duration, and it explained 55% of the \(\Upsigma\hbox{HbO}_2\) and 74% of the \(\Upsigma\hbox{HbR}\) variance, whereas the stimulus duration explained only 45% of the \(\Upsigma\hbox{HbO}_2\) and 51% of the \(\Upsigma\hbox{HbR}\) variance.

Table 3 Pearson’s correlation coefficients (r), their p values [p(r)], and coefficients of determination (R 2) separately within all accepted subjects (rows with a subject number) and for all of the accepted subjects together (row ‘All’)
Fig. 7
figure 7

Average \(\Upsigma\hbox{HbO}_2\) (upper row) and \(\Upsigma\hbox{HbR}\) (lower row) as a function of the \(\Upsigma\hbox{VEP}\) (left) and a stimulus train duration (right) for subjects showing statistically significant HDRs. The vertical error bars represent the SEMs of the averaged \(\Upsigma\hbox{HbO}_2\hbox{s}\) or \(\Upsigma\hbox{HbR}\hbox{s}\) and horizontal error bars the SEMs of the \(\Upsigma\hbox{VEP}\hbox{s}.\) Error bars are not shown for subjects having only one statistically significant HDR for each stimulus train duration. Pearson’s correlation coefficient is marked with r and its p value with p. The fitted regression line is shown in gray

The correlation coefficients within subjects (Table 3) showed similar trends as the correlation coefficients over all subjects. The correlation coefficients of \(\Upsigma\hbox{HbR}\) were higher than the correlation coefficients of \(\Upsigma\hbox{HbO}_2\) in all cases except subject 6. The improvement in the correlation coefficients of individual subjects is however small when the regressor is changed from stimulus train duration to \(\Upsigma\hbox{VEP}.\) The reason for this is that the correlation coefficients for individual subjects are calculated using all statistically significant channels of the corresponding subject, whereas the correlation coefficients over all subjects are calculated over the \(\Upsigma\hbox{HbO}2\) or \(\Upsigma\hbox{HbR}\) values averaged over the statistically significant channels of each subject, resulting in reduced variability in \(\Upsigma\hbox{HbO}_2\) and \(\Upsigma\hbox{HbR}.\) Furthermore, there is only one \(\Upsigma\hbox{VEP}\) value per stimulus train duration for each subject. When correlation coefficients over all subjects are calculated, the individual \(\Upsigma\hbox{VEP}\) values are used, resulting in more than one \(\Upsigma\hbox{VEP}\) value per stimulus train duration.

Discussion and conclusions

The main findings of our study are that (1) \(\Upsigma\hbox{HbO}_2\) and \(\Upsigma\hbox{HbR}\) correlate linearly with the \(\Upsigma\hbox{VEP}\) and the stimulus train duration (3, 6, and 12 s), (2) \(\Upsigma\hbox{HbR}\) is more linearly correlated with the stimulus train duration and the \(\Upsigma\hbox{VEP}\) than \(\Upsigma\hbox{HbO}_2\) , and (3) the linear correlation of \(\Upsigma\hbox{HbO}_2\) and \(\Upsigma\hbox{HbR}\) is stronger with the \(\Upsigma\hbox{VEP}\) than with the stimulus train duration.

Our first finding implies that the HDRs are approximately linearly coupled to the stimulus train duration (3–12 s) and the simultaneously measured \(\Upsigma\hbox{VEP}\hbox{s}.\) However, according to our second finding, the HbR responses seem to possess a stronger linear relationship with the regressors than the HbO2 responses. Our third finding implies that there is probably some variability in the neuronal responses caused, e.g., by changes in attention or gaze fixation that is not accounted by the stimulus train duration. It also suggests that interindividual differences in HbO2 and HbR responses may be explained with differences in the evoked potential activity because we calculated the correlation coefficients over the data of all subjects. Thus, a small evoked potential activity of a subject indicates that the \(\Upsigma\hbox{HbO}_2\) and \(\Upsigma\hbox{HbR}\) were also probably also small.

The investigation of the NVC with EEG and NIRS includes many challenges. Because these techniques sample partly different volumes, the measured neuronal and hemodynamic signals do not arise from exactly the same area of the brain. The spatial localization of activity sources could be improved by solving the inverse problems from the measurement signals (Arridge 1999; Hämäläinen et al. 1993; Nissilä et al. 2005a).

The visual stimuli used in this study produced relatively weak HDRs, and only five accepted subjects out of eight gave statistically significant positive HbO2 and negative HbR responses to all three stimulus train durations. Our 2-Hz stimulation frequency was a compromise between a good VEP stimulus and a good NIRS visual stimulus, and in favor of the VEPs. A higher stimulation frequency would have facilitated the detection of HDRs, since their amplitudes increase linearly as a function of the stimulation frequency up to about 8 Hz (Fox and Raichle 1984; Ozus et al. 2001). However, increasing the stimulation frequency would have distorted the VEPs and single VEP waveforms could not have been obtained (American Clinical Neurophysiology Society 2006b). Most of the studies on visual NIRS responses have used a frequency in the range of 6–8 Hz (Plichta et al. 2006, 2007; Rovati et al. 2007; Wobst et al. 2001). To the best of our knowledge, 3 Hz is the lowest frequency of a 1-min pattern-reversing checkerboard stimulus that has previously been reported to evoke HDRs detectable with NIRS (Obrig et al. 2002). Moreover, a moderate reproducibility rate of visual HDRs with an optimal stimulation frequency of 6 Hz has been reported previously (Plichta et al. 2006).

A greater number of presented stimulus trains could have improved the SNR of the HDRs, but increasing the measurement time would have increased the possibility of physiological noise in the signals. Experienced subjects gave more reliable results than first-timers, indicating that breathing, neck tension, attention, etc., may have affected the results of the first-timers through systemic blood flow changes and hemodynamic changes in the superficial layers of the head. Two inexperienced subjects showed increases in their heart rate during the stimulation, which supports this hypothesis. However, activation occurred mostly in long SD-distance channels, suggesting that the measured signals originate mostly from the brain. The channels with a long SD distance should namely sample a greater amount of brain tissue than the short SD-distance channels (Firbank et al. 1998; Germon et al. 1998).

Despite some challenges in combining NIRS and EEG, we obtained HDRs and VEPs from several subjects. We also confirmed our hypothesis that the linear correlation of \(\Upsigma\hbox{HbO}_2\) and \(\Upsigma\hbox{HbR}\) improves when \(\Upsigma\hbox{VEP}\hbox{s}\) is used as a regressor instead of the stimulus train duration. In addition, our results are in concordance with previous studies. Wobst et al. (2001) have found that HbR responses are linearly coupled to the stimulus durations of a 10 Hz pattern-reversing checkerboard pattern in the range of 6–24 s. Moreover, BOLD signals, closely related to HbR responses, are linearly coupled to 7.5 and 8 Hz pattern-reversing checkerboard stimulus durations from 3–6 to 24 s (Boynton et al. 1996; Soltysik et al. 2004). Also, the weaker linearity of the HbO2 response to stimulus duration is in agreement with the results of Wobst et al. (2001).

In conclusion, our study of simultaneous NIRS and EEG measurements in healthy adults suggests that the relationship between brain HDRs and VEPs is approximately linear for 3–12 s long stimulus trains consisting of checkerboard patterns reversing at 2 Hz. However, the results indicate that a linear model is better suited for \(\Upsigma\hbox{HbR}\) than \(\Upsigma\hbox{HbO}_2,\) and the linear correlations are stronger for the \(\Upsigma\hbox{VEP}\) as a regressor than for the stimulus train duration as a regressor. Thus, when interpreting the hemodynamic responses, it is beneficial to relate them to parameters linked to the evoked potentials rather than to indirect parameters (e.g., stimulus duration). In addition, interindividual differences in the HbO2 or HbR responses to visual stimuli may be explained with interindividual differences in the VEP activity.