Introduction

Across the primate species, the lateral geniculate nucleus (LGN) is characterized by two ventral M-magnocellular layers of large cells and a series (four in human) of smaller cell P-parvocellular layers located more dorsally. In addition, there is a less laminated distribution of koniocellular neurons with tiny cell somata. Primate neurophysiologists (Kaplan and Shapley 1986; Maunsell et al. 1999) have established that geniculate M neurons demonstrate relatively high contrast gain, response saturation at high contrast, low chromatic sensitivity and high temporal frequency cut-offs compared with the P neurons, which typically show chromatic tuning, high spatial sensitivity, lower contrast gain without saturation, and lesser temporal frequency performance.

Cortically, however, the separation of magnocellular and parvocellular contributions becomes more complicated (Nassi and Callaway 2009), with mixing of M and P inputs at early cortical levels. Despite this, two basic streams of visual processing have been identified—the vision for perception set of visual cortical areas extending from occipital cortex into the temporal lobe, and the vision for action dorsal cortical stream comprising multiple representations in parietal cortex.

The development of multifocal ERG techniques through the VERIS system (EDI, San Mateo, USA) (Sutter 1992) provided multiple decorrelated pseudorandom binary stimuli to test retinal responses across the visual field. The random stimulation also allows for the analysis of higher-order kernels through the Wiener kernel decomposition (Sutter 1992). These techniques for multifocal visual field mapping have been extended from retina to cortex (Baseler et al. 1994), with immediate application for using cortical visual evoked potentials to monitor retinal disease (Hood and Greenstein 2003) and then to MRI and MEG techniques. Multifocal fMRI mapping (Henriksson et al. 2012; Vanni et al. 2005) provides complementary information to the more conventional phase encoding methods (Engel et al. 1994; Sereno et al. 1995). Multifocal MEG has been employed for establishing retinotopy, mainly with contrasting reversing stimuli and analysed usually with dipole fitting methods (Nishiyama et al. 2004; Tabuchi et al. 2002).

A breakthrough in the biological significance of higher-order kernels came with the realization that slices of the second-order kernels mimicked the contrast response and saturation of LGN magnocellular and parvocellular neurons (Klistorner et al. 1997). These kernel slices largely separated the M and P contributions as a function of interaction time. In addition, there was a 25-ms latency difference between peaks of cortical activation observed for these waveforms, giving priority to magnocellularly driven information in cortical processing. Such latency advantage—termed the magnocellular advantage (Laycock et al. 2007, 2008)—supports the notion that early information reaching primary visual cortex has sufficient time to feed forward to early dorsal cortical areas and back to V1 to coordinate with parvocellular input (Bullier 2001). The role of such retro-injection appears to lie in figure/ground segmentation (Hupe et al. 1998, 2001; Super and Lamme 2007a, b).

The current research extends the study of temporal nonlinearities in visual evoked response from VEP to MEG, particularly investigating the timing of activation of visual cortex by the magnocellular and parvocellular streams, and importantly, allowing estimation of order of activation of such brain sources, testing the hypotheses inherent in the notion of the magnocellular advantage (Laycock et al. 2007). VEP typically is recorded from a single electrode over occipital cortex. As such, it is insensitive to the location of multiple brain sources of signal. MEG using several hundred sensors is better able to localize sources than EEG mainly because of the transparency of the skull to magnetic fields. Thus, our intention was to study the temporal (nonlinear) structure of the visual evoked magnetic fields with an aim of locating brain sources and measuring the relative timing of their activations.

Methods

Participants

Upon approval from the Swinburne Human Research Ethics Committee, five young adult participants were recruited for the MEG study and gave written informed consent (in accordance with the Declaration of Helsinki) prior to testing. They had normal or corrected-to-normal vision.

Stimuli

A set of pseudorandom m-sequence binary temporal codes was programmed using LabView (National Instruments, USA). A 9-segment dartboard stimulus (circular centre patch with two rings of 4 segments) was programmed in VPixx (http://www.VPixx.com—see Fig. 1a). Each segment was diffusely coloured and fluctuated between two luminance levels using the m-sequence (m = 14) broken into 4 segments of approximately 1 min. The sequences were maximally offset in time so that mutual decorrelation was achieved, allowing the independent simultaneous recording from each patch. Only the central patch, subtending 4° of visual angle, when viewed at 120 cm, was analysed in the current paper, as our primary interest was to extend the study of higher-order Wiener kernels from EEG-based visual evoked potentials to MEG recordings. Two separate acquisitions were made with binary stimulation between black and white (96 % temporal contrast) and two levels of grey (24 % contrast). The stimuli were projected into the magnetically shielded room via a data projector operating at a frame rate of 60 Hz. Triggers were set and reset for stimulus onset and offset, respectively, allowing for first order and the first two slices of the second-order kernels to be easily estimated. The participants had no task to perform apart from keeping fixation on the centre of the stimulus.

Fig. 1
figure 1

Butterfly plots, sensor array maps and interpolated sensor amplitudes (from the planar gradiometers) for participant R1054. The first-order response K1 is shown in (a) along with the 96 % contrast multifocal stimulus (shown as an inset). The first two slices of the second-order kernel K2.1, K2.2 are shown in (b, c). Note that these kernel elements K1, K2.1 and K2.2 are derived from the same recordings. The recordings show good signal-to-noise ratio, with a concentration of response centred over occipital sensors. Comparing Fig. 1a–c, both the butterfly plots and the interpolated disc plots show differences in timing and in relative strength of the peaks. In particular, the response density shown at the latency (79 ms) in the disc plots shows evident difference between K1 and K2.1 in terms of a best fitting dipole for such a field distribution. Scale bars (butterfly plots) represent 100fT. In ac, the green line represents the global field potential, and the scale bar represents 100fT

MEG

MEG recordings were made using an Elekta TRIUX MEG scanner with 102 magnetometer and 204 planar gradiometer SQUID detectors. Recordings were carried out in a dimly illuminated magnetically shielded (two layers of µ-metal and one layer of copper) room (MSR) with active shielding coils within and outside the MSR in order to provide active shielding of magnetic transients. Only the first- and second-order responses from the central patch were analysed.

Each 4-min recording resulted in 214-1 frames of stimulation. The VPixx binary stimulus sent different triggers to the MEG recording system each time stimulus level 1 or stimulus level 2 was set. The first-order kernel (K1) is the sum of all responses to stimulus one (R 1) minus the sum of all responses to stimulus two (R 2) throughout the perfectly balanced pseudorandom sequence, i.e. 0.5*(R 1 − R 2) (Sutter 1992). The first two slices of the second-order response (K2.1 and K2.2) of the MEG evoked fields (MEF) were similarly extracted (Klistorner et al. 1997; Sutherland and Crewther 2010). The second-order response takes account of the history of stimulation, with the first slice (K2.1) representing a comparison between two consecutive frames when a transition has occurred (R 12, R 21) and where a transition has not occurred (R 11, R 22), i.e. K2.1 = 0.25*(R 11 + R 22 R 12 R 21). The second slice (K2.2) is similar, but derived with an additional intervening frame of either polarity. Thus, K2.1 measures neural recovery within the duration of one frame (16.7 ms with a 60 Hz frame rate), and K2.2 measures recovery over two frames (33 ms). On the basis of contrast gain, contrast saturation and peak latencies from VEP recordings at similar frame rates (Jackson et al. 2013; Klistorner et al. 1997), it was expected that the K2.1 response would be dominated by inputs of magnocellular origin, while K2.2 would be dominated by parvocellular inputs.

Analysis

Recorded data with associated stimulus hardware-generated triggers were initially pre-processed using MaxFilter (Elekta Neuromag) to project out sources of noise external to the spherical surface of the SQUID sensors. This also allowed for any bad channels to be rejected. In addition, the temporal signal space separation (tsss) routine (Taulu and Simola 2006) was used to minimize time-varying noise sources. The data were then imported into Brainstorm (Tadel et al. 2011) and epoched, and the kernel responses were calculated at the sensor level. Activations were projected onto the cortical surface via the MNE (minimum norm estimation) technique using the common brain Colin27 (http://packages.bic.mni.mcgill.ca/tgz/).

Results

Following MaxFiltering (to remove projected noise sources), the epoched data were averaged around the triggers for onset and offset of the central stimulus patch. The waveforms of Fig. 1 show the outputs from the 204 planar gradiometer sensors. The difference in averages provided the first-order responses (K1—see Fig. 1a for a single participant). The second-order kernel slices were derived via a similar analysis with addition and subtraction of the averages of the four possible combinations of stimuli for the successive frames (K2.1—Fig. 1b and K2.2—Fig. 1c). See (Sutter 2000) for a simple description of the kernel extraction process.

Comparison of the low (24 %) and high (96 %) contrast mean amplitudes derived from an occipital cluster showed a marked amplitude growth with contrast for the major N60P90 peak in the first-order response (see Fig. 2). Also, at high contrast there is a “notch” in the waveform at a latency of approximately 65 ms, similar to that reported in K1 for the VEP (Sutherland and Crewther 2010). However, by comparison, the amplitude of the second-order first slice (K2.1) response was already saturated at the lower contrast (a similar reduction in amplitude of the K2.1 response is seen at high contrast in VEP (Klistorner et al. 1997)). The major peak of the second slice of the second-order kernel (K2.2) response (N90P110) showed twice the amplitude for high compared with low contrast. These findings are consistent with contrast response functions recorded with VEP (Jackson et al. 2013; Klistorner et al. 1997).

Fig. 2
figure 2

Average cluster MEF (principal components derived) for low contrast (24 %—thin line) and high contrast (96 %—thick line) from a cluster of planar gradiometer sensors maximally activated over occipital cortex. The fringe around each mean curve represents ±1SE. The second slice of the second-order response (K2.2) shows evidence at low contrast of an early peak (N50P60) that is dominated by a longer latency wave for stimulation at high contrast

The process of estimating cortical activation was carried out with the MNE tools available within Brainstorm, using the surface map of the common (Colin27) brain produced using Freesurfer (https://surfer.nmr.mgh.harvard.edu). The butterfly plot of average planar gradiometer activation across the five participants shows an excellent signal/noise ratio (Fig. 3a). The MNE-derived brain sources clearly demonstrate the expected pattern of activations, not only in calcarine occipital cortex, but also with discrete early cortical activations in the extrastriate cortex (see Fig. 3b). Five scout activations (average of MNE contributions from dipoles situated over a small group of mesh vertices on the cortical surface) located in occipital cortex were selected for further analysis. These are shown on the brain surface (Fig. 3b) and their location details (Talairach coordinates and nearest grey matter locations from Talairach Daemon: http://www.talairach.org/client.html#CommandLine) are to be found in Table 1. Activation in Right MT+ is highlighted in MRI viewer format in Fig. 3c.

Fig. 3
figure 3

MNE source localization of first-order kernel response recorded at high contrast: a Average butterfly plot for the five participants, with showing an initial early activation peaking at around 55 ms and two later responses peaking at 80 and 118 ms (the green line represents the global field potential). b Minimum norm estimates of first-order responses on the brain surface at 118 ms demonstrate the major occipital cortical regions activated. c 3D cortical image of MEG activation (K1, 118 ms) superposed onto the structural brain image. The datum lines shown intersect on MT+

Table 1 Selected scout activation sites from MNE source localization

The question of which cortical regions are activated first was investigated through close comparison of individual and mean power generated by the V1 and MT+ scouts. The first order and first slice of the second-order evoked response power are shown for V1 and MT4 in Fig. 4 (the second slice shows greater latency and hence was not included in this analysis). Inspection of the four graphs shows that activation occurs in all regions well before 50 ms after stimulus onset. The pattern of K1 and K2.1 mean curves shows that the first-order power builds to reach a maximum at around 110 ms (Fig. 4a, b) while the K2.1 graphs show much more abrupt onset with peak power at around 80 ms (Fig. 4c, d). The regularity across participants is most apparent in the K2.1 response from MT + (Fig. 4d), indicative of a transient, rapidly adapting neural source.

Fig. 4
figure 4

Individual time series of activation of the V1 and MT+ scout activations (derived from both magnetometers and gradiometers for the high-contrast stimulus condition) for the K1 and K2.1 responses. Also displayed in heavier line is the mean across the 5 participants’ data. It is clear that the initial activation of the visual cortex occurs essentially simultaneously in these two sites. Note the rapid activation, consistent across participants, displayed in the K2.1 response from MT+

In order to answer the question of which response activation is most rapid, estimation was made, for each of the graphs in Fig. 4, of the mean and confidence interval of noise in the period (−50, 20 ms), prior to cortical activation. Estimation was then made of when the mean response curves first exceeded the upper 95 % confidence level of the noise. For the first-order K1 responses, initial activations were V1: 42 ms, MT+: 44 ms. For the second-order first slice K2.1 responses, initial activations were V1: 43 ms, MT+: 44 ms.

Time–frequency (TF) analyses of the five scout activations—from left occipital V1, left lingual gyrus (V2), left and right A19 and right MT+—were estimated for K1, K2.1 and K2.2 waveforms, using a Morlet wavelet transform available in Brainstorm (see Fig. 5). Here, a central frequency of 1 Hz and a time resolution of 1 s were chosen for the mother wavelet, with all other wavelets shifted and scaled variants of the mother wavelet.

Fig. 5
figure 5

Time–frequency analysis of high-contrast (96 %) scout activations (Morlet transform of the average from the MNE-defined scouts of the five participants selected) of the first- and second-order (K1, K2.1 and K2.2) kernel responses from the five occipital cortical scouts. All second slice K2.2 responses show peak power that is delayed compared with first slice K2.1 by approximately 30 ms. Relatively low K2.2 nonlinear intensity compared with K1 was observed in V1 compared with such comparison in V2. Colour bars represent a common scale across regions for K1, K2.1 and K2.2. The responses for each of the time–frequency maps have units of intensity of 10−24 (pA.m)2

The first-order TF responses from both scouts show maximum power for frequencies in the high alpha—low beta band, and with peak power occurring at around 100–110 ms. Some high-frequency (gamma band) activity is observable in all sites. The first slice K2.1 TF responses show much earlier activation for peak power (around 65 ms with frequency for peak power around 20 Hz in V1 and a little less in MT+). Gamma band activity is more prevalent and is prominent in V1, V2 and MT+ TF plots. The second slice K2.2 time–frequency plots show maximum power at 12–15 Hz, but with longer latency for peak (105–110 ms) compared with the K2.1 TF responses.

Discussion

This manuscript presents the first nonlinear analysis of MEG evoked fields and sources. The spatiotemporal analysis of results presented here with diffuse flashed stimuli is an extension of previous visual evoked potential multifocal studies (Jackson et al. 2013; Klistorner et al. 1997; Sutherland and Crewther 2010), demonstrating contrast responses and latency properties indicative of the ability to code magnocellular and parvocellular activity. In addition, the ability to localize sources on the cortical surface has opened up the question of the timing of initial cortical activation, previously only really studied in primates. Meta-analyses of primate early visual processing demonstrate first neural spikes to activate V1 at around 35–40 ms with first spikes recordable from middle temporal area MT at around 40–42 ms (Bullier 2001; Lamme and Roelfsema 2000). The MNE source activations of Fig. 4 demonstrate that cortical activity in human is not far behind in timing compared with the early spike activity of monkey, with mean MEG activations becoming discernible above noise levels at around 42–44 ms for both areas V1 and MT+. These MEG latencies are in accordance with multifocal VEP recordings of around 43 ms (Jackson et al. 2013).

While conventionally it is thought that the major route to activation of V5/MT+ is through the LGN -> V1 (4Cα) -> V1 (layer IVB) -> MT/V5 (Nassi and Callaway 2009), the results presented here might give cause for a rethink. Even if the time loss for each synapse is 2 ms (Maunsell et al. 1999), MT activation via this path should be delayed by at least 5–6 ms, compared with the earliest activation in V1. Instead, activation in MT/V5 is delayed by less than 2 ms compared with V1. Thus, it is possible that other inputs of the magnocellular ganglion cell output of the retina, e.g. magno inputs via superior colliculus and pulvinar to MT+, could provide an alternative pathway, as recently demonstrated (Berman and Wurtz 2010).

Comparison of the first and second slices of the second-order responses from the five scout activations produces some interesting contrasts. Overall, differences in peak power must be discounted on the basis that the power recorded is dependent on the size of the mesh that is selected for each scout. However, as the K1, K2.1 and K2.1 TF plots were generated by the same recordings for each scout site, relative activity differences between the first- and second-order responses are worthy of report. The first such characteristic is the relative delay of K2.2 peak activity relative to that shown in K2.1. This is of the order of 30 ms, perhaps reflecting again the magnocellular advantage, given the fact that the overall contrast response of the first slice K2.1 response already appears saturated at a temporal contrast of 24 %. In addition, the K2.1 response showed higher-frequency contributions than K2.2 indicative of a neural drive more capable of high-frequency operation. One observation of interest is the fact that the K2.2 TF response for the V1 scout is relatively weak compared with first order, compared with the K2.2 versus K1 comparison for area V2 alongside. This could indicate that the V1 neurons, particularly those driven by the parvocellular input, have relatively high neural efficiency, a feature that could derive from relatively weak stimulation of orientation selective simple cell receptive fields by diffuse patches of light.

The application of nonlinear analysis of MEG evoked signals to understanding the neuroscience of disorders of cognition is now a definite possibility. People with high autistic tendency show significantly larger second-order components driven by the magnocellular system (Jackson et al. 2013) and also abnormal high-contrast magnocellular function (Sutherland and Crewther 2010). In addition, early perceptual abnormalities in schizophrenia and autism have been addressed by other techniques involving nonlinear analysis (Butler et al. 2007; Lalor and Foxe 2009; Lalor et al. 2006; Sehatpour et al. 2010). However, to date, the ability to separate the responses from the different cortical areas for such transient signals with high temporal frequency content has been difficult.

The ability of MEG Wiener kernel analysis to clearly resolve V1 from MT+ and other extrastriate brain regions, as well as the ability to separate magnocellular from parvocellular contributions and the temporal priority of activation of different cortical regions, should prove increasingly useful in the analysis of neurodevelopmental disorders of perception. It will provide clues as to the role that magnocellular activation plays in the early processes of perception, in particular, foreground/background segmentation and global versus local perception.