Background

The interpretation of ground magnetic data has long been realized as having an important role in understanding physical processes in near-Earth space. Much of the signal arises from electric currents in the ionosphere and flowing along magnetic field lines. An ideal inversion method would allow the characteristics of these currents to be found at any time, reflecting physical variations in near-Earth electrodynamics.

Methods such as that of Kamide, Richmond, and Matsushita (called KRM) (Ahn et al. 1995) and assimilative modeling of ionospheric electrodynamics (AMIE) (Richmond 1992) allow determination of electrodynamic quantities of the upper atmosphere, and models exist for the entire near-Earth system with input from the solar wind (Ridley et al. 2002). These models incorporate auroral and magnetospheric physics in ways which are linked to the inversion in a complex way. The KRM and AMIE techniques are based on a global representation of electric and magnetic fields related through electric currents and conductivities. The latter parameter in particular is difficult to determine, although AMIE attempts to refine it through the ingestion of many types of data. A simpler approach, which does not require a global representation, can be given through direct application of the Biot-Savart integral to a specified configuration of electric currents in space and the ionosphere, with corrections for Earth conductivity. Specifying where the currents flow constitutes making a forward model, and the parameters of this model can be adjusted to obtain a best fit to data. If an acceptable fit is attained, one can claim that the configuration of electric currents usefully represents the physical current system. This simpler approach was implemented using an optimization procedure and referred to as automated forward modeling (AFM), by Connors (1998).

Within the more general context of magnetic data inversion, AFM would be considered to be “parametric inversion” (Li and Oldenburg 1996). Parametric inversion requires simple causation and “a great deal of a priori knowledge.” In the case of auroral zone currents, desirable physical quantities would be, for example, the current strength and latitudinal location of the auroral electrojets. These are useful parameters to allow computation of a model, and they clearly correspond, at least at some level of approximation, to physical quantities, based on long-standing (Boström 1964) knowledge. Here, we describe a new method of optimizing a simple forward model involving auroral zone currents, including field-aligned currents (FACs), for inversion of magnetic data alone, without the use of electrodynamic quantities such as conductivity. We give examples of use of this technique for simple models of electrojets over meridians and refer to this restricted form of use as automated meridian modeling (AMM). Connors et al. (2014) used a similar approach (the same computer code with a more general forward model and constraints) to deduce parameters for the substorm current wedge, using data from the North American region, and compare them to those inferred using an independent space-based dataset. We usually refer to the technique used and cursorily described in supplemental material, by Connors et al. (2014) as automated regional modeling (ARM), and explain below its relation to the present work.

The thrust of this study is to automate forward modeling techniques, most particularly those based on direct precision integration of the Biot-Savart integral. The parameters are physically meaningful variables, and their variation is taken to indicate physical changes responsible for the observed perturbations dB. In practice, the parameters used are simple geometric ones and the current. The method is thus complementary to other methods such as KRM and AMIE that necessitate involvement of electrodynamic parameters such as the conductivity. That the interplay of parameters involved in more complex models can lead to a possible reduction in their physical meaning has been discussed by Ridley et al. (2002). The fact that ground magnetic data cannot be reproduced well by current MHD models supplying boundary conditions to AMIE has also been discussed (Ridley et al. 2001). In our technique, the primary condition is a good match of optimized forward model output to ground magnetic data, and physical assumptions are kept to a minimum.

Regional techniques with some resemblance to AFM have already been developed but generally use a simpler magnetic field model than we do. Lühr et al. (1994) developed an automated fitting routine to optimize parameters for simple current systems consisting of a line current, a strip of constant east–west current density, or a region in which the east–west current density varied as a quadratic (parabolic model). The current densities obtained from optimizing the models using the Marquardt technique (very similar to that used in this study) were compared with those obtained by the EISCAT radar. The current density estimated from the ground was found to be about 15 % higher than that derived from the radar (and in two cases rocket) measurements. The discrepancy was attributed to induction effects. Another technique based on line currents and a flat Earth was developed by Popov et al. (2001). At any given time, this technique gives a profile of current density, and total current across the meridian can also be determined. Earth induction is taken into account. Presentation of the data for a day allows contour plots to be made showing the changes in the patterns of current density. Our focus differs in that we aim for the simplest possible parameters in order to eventually compare them with each other and with outside factors. Since our parameters imply simple causation and are based on a priori knowledge, we consider the optimized forward model to represent data inversion in the sense of Li and Oldenburg (1996). We now proceed to discuss our method in detail and then give examples in which the output parameters are useful in understanding auroral events.

Methods

Here, we describe first how magnetic perturbations due to electric current systems in near-Earth space may be calculated to form a forward model. Next, the general techniques of optimization that allow automation of the modeling procedure are described. This is followed by a description of constraints and weighting that allow use of AFM on global, regional, or local scales. Finally, we describe local application to auroral electrojets as used in the data analysis part of the paper below.

Magnetic field model

Methods for calculating the perturbations due to physically realistic current systems were presented by Kisabeth (1972). He proposed combinations of those current systems which reproduced the input data. The data were most often in the form of perturbations observed along meridian chains of magnetometers. The proposed current systems constitute a forward model, and if the perturbations they produce agree with the data, confidence can be placed in the physical parameters of the model. Kisabeth and Rostoker (1977) performed studies of electrojets using these techniques. Kisabeth (1979) developed several methods of resolving the Biot-Savart integral that could be employed depending on circumstances, and Earth induction was included in the models. In this early forward modeling approach, determination of parameters was done manually. In practice, and partly due to limitations of the manual approach, application of forward modeling to complex situations involving arbitrary distributions of stations has been difficult. An element of “trial-and-error” (Kamide and Baumjohann 1993, p. 98) was involved. In complex situations, the number of parameters involved became large, and their determination from data at the many stations needed to uniquely optimize them became problematic. An automated technique for finding an optimal set of parameters for modeling magnetic data is needed.

The meaning of the word “optimal” is that a set of parameters in the forward model has been changed (possibly) from initial values guessed at, to improve the model’s representation of data. The deviation of a scalar model F from a scalar data at a set of N observation points x i may be represented by the scalar value chi-squared \( {\displaystyle {\chi}^2}={\displaystyle \sum_{i=1}^N}{\left(\frac{{\displaystyle {y}_i}-F\left(\mathbf{a},{\displaystyle {x}_i}\right)}{{\displaystyle {\sigma}_i}}\right)}^2 \), where a is the vector of parameters in the model, and σ i an error value applicable for each measurement. In geomagnetism, the observations are usually along three orthogonal axes, X being northward, Y being eastward, and Z being downward. The X and Y directions are usually with respect to magnetic north (usually local magnetic as determined at the time of instrument installation). In some cases, for example permanent observatories, measurements are done in the geodetic system, and if there is a significant deviation between magnetic and geodetic north (i.e., declination), such observations should be rotated into a geomagnetic coordinate system (Untiedt and Baumjohann 1993). Also, the squared separation in a vector space with three dimensions is the sum of the squares of separations along the orthogonal axes, so one can simply express the chi-squared for magnetic measurements as

$$ {\displaystyle {\chi}^2}={\displaystyle \sum_{i=1}^N}\left[{\left(\frac{{\displaystyle {y}_{iX}}-{F}_X\left(\mathbf{a},{\displaystyle {x}_i}\right)}{{\displaystyle {\sigma}_{iX}}}\right)}^2+{\left(\frac{{\displaystyle {y}_{iY}}-{F}_Y\left(\mathbf{a},{\displaystyle {x}_i}\right)}{{\displaystyle {\sigma}_{iY}}}\right)}^2+{\left(\frac{{\displaystyle {y}_{iZ}}-{F}_Z\left(\mathbf{a},{\displaystyle {x}_i}\right)}{{\displaystyle {\sigma}_{iZ}}}\right)}^2\right], $$
(1)

where subscripts corresponding to the axes have been added to the measurement, model, and error values. In some fields where a modeling approach is applied, errors may be rigorously determined, but it is usually the case in geomagnetism that instrumental error is small, so that most of the error arises from the model not being able to completely represent the data. Setting a large error value for a certain station’s data is equivalent to applying a small weighting to it in the sum. The geometry of the χ 2 space, path to optimization, and possibly the final result are all affected by the weightings. Standard methods of applying weightings define classes of AFM, and the weightings are not changed during a run. Application of weightings is a standard practice in geophysical data inversion, often done to reduce dominant sources that can mask an overall signal (Li and Oldenburg 1996). Calculation of equivalent currents often is based on only the horizontal components of the surface magnetic field (Untiedt and Baumjohann 1993), and this corresponds to applying zero weight (in the view above, infinite error) to the vertical component. Explicit or implicit use of weighting functions that vary by component, as here, is an established practice, and details of its implementation in AFM will be given below.

Optimization of parameters

Parameters of a linear model can be optimized in one step that minimizes the squared (and perhaps weighted) deviation, χ 2, but in nonlinear cases such as those arising in magnetic problems, the minimization equations are generally not analytically soluble. Solution requires numerically following gradients of χ 2 in the parameter space. The Levenberg-Marquardt method is now widely used for such problems. It “works very well in practice and has become the standard” (Press et al. 1992, p. 683). It is a variable metric method that attempts to find an optimal step size in parameter space and is related to Newton’s method in the sense of following gradients toward a solution but enhanced in the region of a minimum by a quadratic approach. Details of the algorithm are given by Press et al. (1992) and by Lampton (1997). A brief description is also given by Lühr et al. (1994). As in that study, the Y or eastward magnetic component is not normally used in our meridian chain inversions. The option exists to include it, however, for example in regional applications (such as ARM) rather than on a meridian. Since our technique includes a weighting applied to any component of data from any station, suppression or inclusion of effects from the Y component is readily done by changing its weighting.

Levenberg-Marquardt method

We assume that χ 2 for a forward model can be represented in the space of its parameters as a differentiable function. This is the case for magnetic field models if using M physically meaningful parameters, which define a space within which we denote those parameters by a vector a. The χ 2 function can be expanded about any point a 0 in this parameter space of M dimensions, to have a value at another point a (i.e., for a different set of parameters) which is

$$ {\chi}^2\left(\mathbf{a}\right)={\chi}^2\left({\mathbf{a}}_0\right)+{\displaystyle \sum_{\mathbf{i}=1}^{\mathbf{M}}\frac{\partial {\chi}^2}{\partial {\mathbf{a}}_{\mathbf{i}}}}\left|{}_{{\mathbf{a}}_0}\left({\mathbf{a}}_{\mathbf{i}}-{\mathbf{a}}_{0\mathbf{i}}\right)+{\scriptscriptstyle \frac{1}{2}}{\displaystyle \sum_{\mathbf{i}=1}^{\mathbf{M}}{\displaystyle \sum_{\mathbf{j}=1}^{\mathbf{M}}\frac{\partial^2{\chi}^2}{\partial {\mathbf{a}}_{\mathbf{i}}\partial {\mathbf{a}}_{\mathbf{j}}}}}\right|{}_{{\mathbf{a}}_0}\left({\mathbf{a}}_{\mathbf{i}}-{\mathbf{a}}_{0\mathbf{i}}\right)\left({\mathbf{a}}_{\mathbf{j}}-{\mathbf{a}}_{0\mathbf{j}}\right)+\dots $$
(2)

This may be truncated within the domain of convergence and sufficiently close to a and written more compactly in vector form as

$$ {\chi}^2\left(\mathbf{a}\right)\approx {\chi}^2\left({\mathbf{a}}_0\right)-\mathbf{b}\cdot \left(\mathbf{a}-{\mathbf{a}}_0\right)+{\scriptscriptstyle \frac{1}{2}}\left(\mathbf{a}-{\mathbf{a}}_0\right)\mathbf{A}\left(\mathbf{a}-{\mathbf{a}}_0\right), $$
(3)

where \( {b}_i=-\frac{\partial {\chi}^2}{\partial {a}_i}\left|{}_{{\mathbf{a}}_0}=-{\left(\nabla {\chi}^2\right)}_{\mathrm{i}}\right|{}_{{\mathbf{a}}_0} \) is the negative gradient vector of χ 2 in parameter space. A, with \( {A}_{ij}=\frac{\partial^2{\chi}^2}{\partial {a}_i\partial {a}_j}\Big|{}_{{\mathbf{a}}_0} \), is the so-called Hessian matrix. The truncated expansion is a quadratic form. If the gradient is set to zero and the point a 0 is thus regarded as the extremal point, Newton’s method simply gives A (a − a 0) = b, with solution a = a 0 + A 1 b. This solution is to be regarded in an iterative sense, since the Hessian matrix and the gradient vector b are both evaluated at the expansion point and may not be representative of the behavior of the χ 2 function all the way to its extremum. If necessary, as is generally the case in modeling magnetic fields, the derivatives can be evaluated numerically. If one is far from the extremum, linear terms will dominate. Then, the gradient can be followed downslope toward the minimum. The problem with this approach is that one does not know how far to follow the gradient. The Levenberg-Marquardt process attempts to solve this problem by combining use of the quadratic form near the extremum of χ 2 with following a gradient, including a self-correcting method of determining how far to follow it. The modified system A′ (a − a 0) = b contains the desired solution, with \( {\mathbf{A}}_{\mathbf{ij}}^{\mathbf{\prime}}={\mathbf{A}}_{\mathbf{ij}}\left(1+\lambda {\delta}_{\mathbf{ij}}\right) \) the elements of the new combined matrix A′ and δ the identity matrix. The parameter λ is an effective scale length and is modified as the solution progresses, to move most efficiently toward the minimum. For λ = 0, we recover the quadratic approach, while if λ is very large, A′ becomes diagonally dominant and the linear case is effectively in force. Ultimately, in the case of successful approach to an extremum, the value of λ is reduced to near zero, the pure quadratic form appropriate to being near a minimum is used, and convergence accelerates. Based on these observations, the “recipe” for the Levenberg-Marquardt technique given by Press et al. (1992), p. 684, may be used. An optimal starting value of λ must be determined by the user. The recipe is:

  1. 1.

    For a starting guess at the parameters a 0, compute χ 2 (a 0).

  2. 2.

    Pick a small value of λ.

  3. 3.

    Solve A′ (a − a 0) = b to get a = a 0 + A′ 1 b then evaluate χ 2(a).

  4. 4.

    If χ 2(a) > χ 2(a 0), one has failed to approach the minimum and must increase λ significantly (they suggest a factor of 10) and repeat step 3.

  5. 5.

    If χ 2(a) < χ 2(a 0), one is nearer to the minimum, so a is a better guess than a 0. One must replace a 0 in step 3 by a, decrease λ to reflect being nearer the minimum, and redo step 3.

Steps 4 and 5 must include some criterion to determine when iteration is complete. This may be by tracking of successive values of χ 2 so that when it stops changing much, one stops. While this is a desirable approach, it is more practical simply to count and limit the number of iterations. This should be coupled with examination of output at each step so that the behavior of χ 2 is monitored and can be seen to be reasonable.

Variants in automated forward modeling

Although the basic methods of modeling and optimization are applicable generally, constraints and weighting also play a large role in data inversion, and nonuniqueness can be a special challenge (Li and Oldenburg 1996). For parametric inversion, the role of initial parametrization or “an initial guess for parameter values” can also be important. All of these factors play a role in the functional division of AFM into global (AGM), regional (ARM), and meridian (AMM) approaches, which share a compute module but have different constraints, weighting (including not using certain components), and initial condition requirements. These approaches are summarized in Table 1.

Table 1 Contrasting versions of automated forward modeling and global EM solvers

Within Table 1, various versions of automated forward modeling are presented in columns, along with a final column applying to the KRM and AMIE techniques, which have some similarities. Some comments applying to table entries follow. Automated global modeling (AGM) was found by Connors (1998) to obtain results comparable to those of AMIE. In practice, multiple current systems must be used for modeling on a global scale, and constraint equations are used to join them by requiring portions of some current systems to share common elements. Even with this constraint, which reduces the number of parameters to be solved for, the model is very sensitive to initial conditions, and global data remains sparsely distributed (for example, due to the presence of oceans). The result is that, although parametric inversion on a global scale may in principle present advantages, such as a possibly higher resolution than spherical harmonic gridded methods, AGM is difficult to use. In selected regional studies, a high density of stations may constrain simple models well (Connors 1998). In this case, ARM may be applicable. Connors et al. (2014) found that the one current system of the substorm current wedge (SCW) as envisaged by McPherron et al. (1973) could represent perturbations associated with a substorm over North America well and verified that the current found by the ARM model closely matched that determined independently by the Active Magnetosphere and Planetary Electrodynamics Response Experiment (AMPERE) spacecraft constellation. ARM has many similarities to AGM, with station weighting increasing with distance from the auroral zone, but may require only one current system to regionally represent magnetic perturbations. If only stations very near the electrojet are used, AMM can have uniform station weighting as indicated in the table, however (for example, in the second example below, with further discussion in the Appendix), it is also possible to judiciously apply weightings. AMM uses only the perturbations along the local magnetic north (dB X ) and vertically down (dB Y ) directions.

To represent surface magnetic field perturbations due to near-Earth electric currents, one requires a model specifying the perturbations that would arise at the surface, as a function of parameters which usually can be chosen to be physical quantities, as described in the next section. Those parameters can be varied through the above recipe until an optimized match between model output and the observed fields at many points is obtained. The parts of the model circuit which are nearest the observing sites contribute most to the modeled perturbations at those sites, so that the entire closed current system in space is not used. Not only are the details of the distant parts of the system the least well known but compared to smaller local current systems and inaccuracies resulting from a simplified model, the contributions from those parts are small (Friedrich et al. 2000).

Electrojet magnetic field model

In practice, the method of Kisabeth (1972, 1979) and Kisabeth and Rostoker (1977) has been used in calculating the magnetic perturbations due to model current systems. This technique has a spherical Earth, induction treated through a superconductor at depth (usually chosen as 250 km), and FAC following dipole field lines. In this simple view, an auroral electrojet may be regarded as being characterized by FAC flow into the ionosphere, flow within the ionosphere for some distance, and then FAC flow out of the ionosphere. The regions in which FAC intersects the ionosphere may be regarded as sheets of current without much loss of generality: in this case, the coordinates of two points must be given to specify the end points of the each sheet. Since there is one sheet of downward FAC and one of upward FAC for each electrojet, eight geometric parameters are required to specify such a system, a ninth being the total electric current (Connors 1998; Connors and Rostoker 2002). If one sheet is north of the other and the system has large east–west extent, the resulting north–south (poloidal) ionospheric currents joining them would be primarily Pedersen currents and their ground perturbations small (Boström 1964). Associated auroral arcs would be aligned east–west, and toroidal current parallel to them would produce most of the magnetic effect on the ground. The electric field must be known to determine if a given ionospheric current is a Hall or Pedersen current, but to perform the Biot-Savart integral, it is not necessary to know the nature of the electric current, merely where it is and its strength.

The “long narrow toroidal current” case is often applicable to modeling data from magnetic meridian chains and is emphasized in this article. In this case, only three parameters (two latitudinal limits and a current strength) characterize the electrojet as observed far from the FAC, whose influence is taken to be small. Of course situations arise, such as being near an auroral surge (Weimer et al. 1994), where intense FAC are near an observing station and this approximation is not valid. In this case, a large Y (eastward) component field magnitude would be present, so that it is easy to identify and exclude such cases.

The north–south perturbation component due to an east–west aligned ionospheric current system has a maximum in intensity directly below the center of the current (Kisabeth 1979). This is flanked by extrema of the vertical component. Application of the modeling technique described above is straightforward with such current systems. The squared deviation χ 2 between measured ground signatures and perturbations calculated from the forward model current system is determined. The above procedure is used to vary the parameters of the model until the minimum of χ 2 is obtained. Since the ground effects of the electrojets are largely due to roughly east–west aligned (nominally Hall) current flow, this simple technique allows the basic parameters of this current flow to be determined.

The perturbations observed by satellites traversing FACs associated with the poleward (region 1 or R1) and equatorward (region 2 or R2) borders of the auroral oval are familiar (Iijima and Potemra 1978). The ground perturbations due to such currents are usually small and often ignored but have a characteristic disturbance pattern differing from that of the Hall electrojets (Kisabeth 1979). Although the perturbations from the north–south systems are small, they are not zero, while Fukushima’s (1969) theorem is often cited to suggest that they should be. There is no contradiction, since the theorem applies only to a uniformly conducting ionosphere, and that is not the case in the auroral zone. The fact that the perturbation pattern from the R1/2 (region 1 and 2 FAC) and Pedersen ionospheric system is different from that of the Hall electrojets allows, at least in principle, the detection of the R1/2 FAC from the ground. Tamao (1986) pointed out that oblique FAC may also be detectable from the ground, but that this effect is most pronounced for systems whose east–west length is short compared to their north–south extent. Here, we concentrate on the opposite case of long electrojets. In contrast, Kawasaki and Rostoker (1979) found that the ground magnetic effects of Ps 6 perturbations, which are mostly in the Y and Z components, could be well explained by eastward drifting current systems of short east–west extent: this interpretation is consistent with the results of Tamao (1986). This study concentrates on cases in which there is not a large Y component and does not attempt to determine FAC associated with electrojets. An important aspect of FAC associated with net current supply to the toroidal electrojet is that in cases where such FAC is distant, the distinction between physical current which is determined by AFM and equivalent current determined by some other methods disappears. This issue is treated below.

Having outlined how AFM can be applied to inversion of magnetic data, we now present its use in event studies where the AMM variant can be applied to inversion in single meridians.

Results

April 1, 1986 event

We will first examine an older substorm event to give an overview of the uses of magnetic inversion. Due to good auroral imaging and ground data available, this event was selected as “Event A” for the CDAW-9 campaign of substorm studies. With expansive phase onset at 18:50 UT April 1, 1986, it was presented as prototypical of the “eye” appearance of UV manifestation of substorm expansive phases as seen from the Viking satellite by Rostoker et al. (1987). It was also discussed with focus on the use of satellite optical data to interpret substorm onsets by Murphree et al. (1991). A magnetic data set put together by Ahn et al. (1995) allowed them to study this event using the KRM method, with instantaneous conductivity based on Viking images. An inversion method determining magnetic local time (MLT) of currents (but not their strength) from midlatitude variations permitted Sergeev et al. (1996) to study this event in conjunction with images from the Viking satellite. Another study (Lu et al. 1997) is based primarily on AMIE and also presents GOES results, with an emphasis on mapping and no specific attention to the SCW total FAC. Here, we describe the event and assess the role of AFM in studying electrojets in such events. The overall situation with the auroral event well developed is shown in the transformed Viking spacecraft image of Fig. 1. By this time, the “eye of the substorm” had expanded northward (Rostoker et al. 1987). The auroral currents and in particular the westward electrojet flowed over the then extant Russian Kara Sea chain. The Russian observations were presented at low cadence (5 min) to CDAW-9 for analysis and obtained from the CDAW-9 CD-ROM.

Fig. 1
figure 1

Viking view of “eye of the substorm” in magnetic coordinates, showing the Kara Sea Russian Chain on April 1, 1986, at 19:05 UT

Figure 2 presents a stack plot of magnetic perturbations for four periods in the development of the substorm as a function of latitude. Dots represent observations, and model results are shown as solid lines for the X (northward) and Z (downward) components. The Y component is not used in simple electrojet modeling, and a model value is not shown. In the middle two panels, the Y component follows the X component and is negative (westward). This could arise from the electrojet being tilted toward the northwest, and this image leads to believe this could be a realistic configuration. However, in what follows, we assume the “westward electrojet” flowed due west. Allowing for tilt would give a slightly larger current than derived here. Figure 2a shows −X perturbations centered at about 61°, with +Z to the north and −Z to the south. This is the signature of a westward electrojet. In Fig. 2b, shortly after the time of onset, the electrojet is stronger and now centered at 62°. In Fig. 2c, the electrojet was centered at 64°, and the larger amplitude of perturbations indicates a strengthened current. In the northern half of the electrojet, Z is not large enough in the model, which may be due to structure in the electrojet not present in the model’s uniform electrojet. An enhanced current density at the poleward border would account for this and is possible to implement, but we are striving for a simplest possible model. In Fig. 2d, the perturbations are smaller but in close to the same locations, indicating little change in position in a half hour, but some weakening of current. Stack plots permit one to see that the electrojet model represented the data well, but it is the simple parameters derived from modeling that have the most utility.

Fig. 2
figure 2

Russian chain latitude profiles on April 1, 1986. Two stations slightly displaced in longitude at latitude 62° have slightly differing data values. Dots indicate observations and lines are the model fit. Local magnetic X M (black), Y M (red), and Z (blue) components are shown. Times progress by rows from bottom and are: (a) 18:30 UT (b) 18:50 UT (c) 19:00 UT (d) 19:30 UT

Figure 3 shows (from bottom up) the model parameters of cross-meridian current and electrojet borders, the interplanetary magnetic field northward component, and the standard AL index, the latter two from the OMNI database. It is clear (even from the low cadence data inverted) that substorm onset with the classic signatures of strengthening of the westward electrojet and poleward motion of the electrojet (here only the northern border) took place at 18:45 and was well underway at 18:50, the latter being the time found in other studies. However, the AL (1-min) index did not drop until 18:59 UT. Since this is very shortly after northward turning of the IMF, a causal relation could be inferred if AL was used as the timing indicator for this substorm. In fact, onset took place before the northward turning in the (propagated) OMNI data. As pointed out by Rostoker (1972) and reiterated by Connors (2012), the AL index must be used with care in individual events. However, we have succeeded showing that the use of simple physical parameters, even with low cadence, allows good timing and gives current strength and latitudes that could be used for mapping.

Fig. 3
figure 3

Current over Russian chain from 18.5 to 19.5 UT April 1, 1986 (bottom). Borders of electrojet over Russian chain, second from bottom. OMNI IMF Bz (second from top). Standard AL index (top). Substorm onset took place before Bz northward turning and before AL indicated it

We now proceed to describe another defunct magnetometer chain but one allowing high-quality inversions useful in the THEMIS era.

Polaris chain inversion

The Polaris geophysical project emplaced geophysical instruments in several locations in Canada (Bastow et al. 2015) including the east shore of Hudson Bay and southward, as the Hudson Bay Lithospheric Experiment (HuBLE). The instruments included magnetometers producing high cadence data, available with gaps during the first years (2007–2012) of the THEMIS (Angelopoulos 2008) mission. The Polaris East Hudson Bay array, for which a map is given in Fig. 4, may be seen as the predecessor of the new AUTUMNX array (Connors et al. 2015) and had more stations in its meridian chain aligned with the east coast of Hudson Bay. As such, its dense network of stations is suitable for demonstrating inversion, especially when combined with other nearby stations such as those of MACCS (Hughes and Engebretson 1997) and NRCan Canadian federal government observatories.

Fig. 4
figure 4

Stations of the Polaris East Hudson Bay Chain, 2007–2012, in eastern Canada. MACCS and NRCan stations are also shown

We did inversion using the stations listed in Table 2, using as reference a quiet day on November 6, 2007, whose data were smoothed with a 1-h boxcar filter. Figure 5 shows inversion results for a simple westward electrojet for a substorm with onset time 5:33 UT on November 8, 2007. The bottom panel shows eastward current, plotted as negative values to resemble the AL index, and this attained a magnitude of about 0.2 MA. The electrojet borders widened from the time of onset both toward the north and the south by about 3°. At the end of the current intensification, the poleward border retreated equatorward, and a period of unsettled activity followed. Although the date was after the launch of the THEMIS constellation, no spacecraft were in interesting locations except GOES 10, GOES East at the time, which showed a small dipolarization signature (data not shown). THEMIS ground instrumentation at Rankin Inlet (RANK), on the west shore of Hudson Bay at CGM (Gustafsson et al. 1992) latitude approximately 72.3 °, did optically detect the substorm (the imager south of Rankin, Gillam, had cloudy weather). The imager at RANK detected a brightening near the southern horizon at the time of magnetic onset (5:33 UT) and rapidly poleward moving auroras, approaching the northern horizon at approximately 6 UT, and subsequently retreating equatorward to be followed by auroral activity. We cannot tell from the optical data if there was equatorward expansion of the auroras, but the overall poleward expansion was to about the same width as the magnetically inferred electrojet boundaries. The overall shape in latitude-time space was similar during the expansive/early recovery phase lasting from 5:33 UT to nearly 6:30 UT. Reference to Fig. 1, the “eye” of the substorm, would lead one to expect some differences over the approximately 2 h of local time separating Polaris from RANK, but in general a simple electrojet model shows the same behavior as that inferred from a keogram, with the advantage that the electrojet borders and current are numeric values able to be used for other purposes.

Table 2 Site names and locations for inversion results
Fig. 5
figure 5

Eastward Current across the Polaris chain on November 8, 2007 from 5 UT to 8 UT (bottom). Electrojet borders in magnetic coordinates (middle). THEMIS imager keogram from Rankin Inlet (top). Time of onset is marked by a vertical line

Within the array itself, one can get an idea of the utility of the simple model by comparing input data to the results from the model’s parameters. Figure 6 shows a stack plot of X (black) and Z (blue) perturbations as observations (solid) and model results (dots). In general, there is a very good reproduction of the input data through the use of only the three simple parameters of north and south electrojet borders and total current. The stations with large signal seem to have the best fit, which may suggest that adjusting the weighting of input data could improve the model. We present a detailed statistical analysis of the fitting in the Appendix. We do not know the position of field-aligned current for the SCW of this event (although it could to some extent be inferred from subauroral perturbations). We discuss an event in which it is known from external data in the next section. There is also a period before the substorm onset when perturbations were weak and the model did not converge. Growth phase phenomenology remains of great interest and we have done a more specific study of this substorm phase in a section below for which the perturbations were more pronounced, and optical data from the same meridian were available.

Fig. 6
figure 6

Original perturbation data (lines) and model output (dots) for the Polaris stations modeled in Fig. 5. The scale changes among the panels and the X component is in black, Z in blue

The basic result here is that a chain with good latitudinal spacing can produce useful inversions of electrojet activity. The similar AUTUMNX chain can produce similar results (Connors et al. 2015).

Cross meridian and 1-D equivalent currents

The practice of using a three-dimensional current system representing a physical set of currents, yet placing FAC far away when inverting data from a meridian chain, allows a parallel to be made between AMM and methods that determine an equivalent current, such as spherical elementary current systems or SECS (Amm and Viljanen 1999). When FAC systems are placed far enough away to have minimal effect in the model, the ionospheric current system crossing the meridian is equivalent to the divergence-free elementary system which flows in a toroidal direction in the SECS method. We can thus reconcile the view that parameters of three-dimensional physical current systems may be determined from the ground, as Connors et al. (2014) demonstrated by comparison with AMPERE space-based data (Anderson et al. 2014), and the stricter view that they cannot. For meridian chain inversion with distant FAC, the AFM method should be essentially determining the divergence-free part of the current system and get the same result as SECS used in 1-D form over a meridian chain. We proceed to demonstrate empirically that this is the case. Further, using AMPERE data, we show that the cross-meridian current determined by AMM is the same as that in the net FAC of the SCW, which is added on to pre-existing R1 currents.

By browsing the maps of AMPERE-determined radial current (essentially FAC at auroral latitudes), showing landmasses, available at http://ampere.jhuapl.edu/rBrowse/index.html, we determined that a substorm on January 15, 2010, with onset time 20:37 UT, bracketed Fennoscandia in a way that should have produced a substorm westward electrojet over the IMAGE array. An electrojet inversion was performed using an automated version of SECS available on the IMAGE (Tanskanen 2009) FMI website (http://www.ava.fmi.fi/MIRACLE/iono_1D.php) with baseline relative to a quiet period earlier in the UT day. An AMM inversion was performed using the same data downloaded to a local computer. The eastward current in the event as determined by the two techniques is shown in Fig. 7 in the bottom panel, as before, negative values indicating that the current was actually westward. The two results agree to within about 10 %. The SECS result also produced an eastward current during the early stage, which although not shown would reduce the difference. After the minimum, the SECS result was slightly smaller in magnitude than the AFM result. As the SECS eastward current was not large at this time, this does not explain the difference then.

Fig. 7
figure 7

Cross-meridian physical current from AFM (black dots) and equivalent current digitized every 0.1 h and extremal points (red line) for the IMAGE array on January 15, 2010 (bottom). Electrojet boundaries from AFM (middle panel). IMF GSE magnetic field (top panel)

The middle panel of Fig. 7 shows the electrojet borders. These appeared poorly determined near the time of onset; but when the current became large enough, these followed a pattern similar to that in a color density plot (not shown) based on the online SECS technique. The top panel shows the IMF during the period of interest, which appears to indicate that the event was largely caused by negative B Z in the IMF, in the well-understood manner (Baumjohann 1986).

In order to compare the cross-meridian current, which AMM takes as a physical current modeled by the Biot-Savart integral, and SECS as an equivalent current, to the likely physical current, the integration technique for AMPERE data developed by Connors et al. (2014) was applied to AMPERE current densities. Figure 8 shows a snapshot of these current densities at 20:53 UT, a time marked in Fig. 7 by a vertical bar near the time of maximum (westward) current flow. The pattern shown was typical of that during the substorm: downward current in the morning/midnight sector and upward current in the evening sector clearly represent the FAC of a SCW, much as shown by Connors et al. (2014). There is more than one upward current region in the evening sector: similarly, at earlier times, there had been more than one downward current region in the morning sector. Space does not permit more detailed description of FAC patterns at all stages of the substorm, but these may be examined at http://ampere.jhuapl.edu/rBrowse/index.html. We note that in the evening sector, there is a clear R1/2 pattern corresponding to the pattern found in an averaged sense by Iijima and Potemra (1978). This is less clear in the morning sector but at other stages of development was more in evidence. We proceed here to study the currents in an integrated sense.

Fig. 8
figure 8

North polar view of AMPERE field-aligned current density in μA/m2, scale at right saturating from red to white, and from blue to black for values off scale, at 20:53 UT on January 15, 2010 (time shown by vertical bar in Fig. 7). The magnetic local time (MLT) is shown around the edge and with radial lines, and geomagnetic latitude lines are at 85 °, 80 °, 70 °, 60 °, and 55 ° (outer boundary)

Throughout this event, the IMAGE chain in Fennoscandia was located 2300 MLT, and this was near the central meridian of the substorm, being located between the clearly separated downward and upward FAC regions shown in Fig. 8. During the event, FAC exceeding the threshold value of 1 μA/m2 occurred only between 60 ° and 80 ° geomagnetic latitude. FAC could, however, extend as far as 0900 MLT in the morning sector or 1500 MLT in the afternoon sector. From 2300 MLT to these limits, and using the latitude limits mentioned, large regions were outlined that were used as integration boxes with gridded data in the manner described by Connors et al. (2014). Integrating in the morning sector, a downward current is in R1 and an upward current in R2 south of it. In the evening sector, this pattern is reversed. It is known that large poloidal currents can flow and close from R1 to R2 or vice versa, with little ground effect due to the solenoidal nature of the currents. If the R1/R2 closure is not balanced, then toroidal current flow is expected, crossing the near-midnight central meridian of the SCW in the case of substorms.

We calculated cross-meridian current independently in two ways. The difference of R1/R2 integrated currents from AMPERE data, with the assumption that current missing in the morning must have flowed toward the evening, gives the net current across the meridian. AMM based on the IMAGE magnetometer data should determine the same current independently. The results of AMPERE integration and of AMM are shown in Fig. 9. In each of the morning and evening sectors, we integrated the R1 and R2 currents. The downward and upward current totals in the morning and evening integration boxes are shown with blue and red lines, respectively. Their difference is shown with a green line. In the morning sector, the R1 current is downward, as indicated by a red line, and the R2 current is upward and indicated with a blue line. In the evening sector, the association of R1 and R2 with color is reversed.

Fig. 9
figure 9

Upward (red) and downward (blue) integrated currents in the midnight-morning sector (east of 23 MLT) and evening sector (west of 23 MLT) throughout the interval around the substorm onset time (edge of gray area) at 20:37 UT January 15, 2010. Green lines mark the difference in net FAC in each sector, inferred to be the cross-meridian current, and black dots show this quantity based on AFM

If R1 and R2 currents were the same, then they must have closed in nearly poloidal fashion (that is, locally) with no cross-meridian current. This was the case before substorm onset, with variations in the green line more likely associated with uncertainty in the integrated current magnitudes than with any physical effect (the green line represents the difference of two relatively large numbers while it is small, so that uncertainties are more apparent). The results of AMM integration based on IMAGE magnetometer data are indicated by black dots. The green line in the morning sector (top) panel closely follows the AMM results. This is also the case in the evening sector. We conclude that the difference of R1 and R2 integrated currents in the morning sector was the cross-meridian current and matched that calculated from AMM; in the evening sector, this difference also matched AMM. In the morning sector, the downward current (blue), which is in R1, increased, while in the evening sector the upward current (red), also in R1, increased. These currents flowed across the meridian and caused the ground magnetic perturbations in the IMAGE meridian, as evidenced by the AMM results showing the same current magnitude. Connors et al. (2014) concluded that the bulk of the substorm current flowed in R1: here, this result is verified.

In the pre-substorm time (white region in Fig. 9), R1 and R2 integrated currents as determined by AMPERE were between 0.5 and 1.0 MA but closely balanced, leaving little cross-midnight current. During this period, AFM indicated a low cross-meridian current. At onset time, the downward current in the morning sector and the upward current in the evening sector both rose. Their difference closely tracked the AFM current (which in turn was very close to the SECS equivalent current). As found by Connors et al. (2014), the morning sector FAC current more closely followed the AFM current than did the evening sector FAC current. However, here, the evening sector upward current did not persist as in that case. Quantitatively, an increase of up to 0.6 MA in morning sector downward FAC was measured along with a very similar increase in evening sector upward FAC. These give rise to FAC differences very closely tracking the cross-meridian true (AMM) or equivalent (SECS) currents in the ionosphere.

It was the aim of this subsection to investigate to what degree AFM agreed with SECS equivalent currents in a substorm event, and in turn to verify that the cross-meridian current indicated by those methods corresponded AMPERE’s determination of net FAC difference between morning and evening sectors. Much as in Connors et al. (2014), we find a consistent picture between the parameters derived from independent ground magnetometer and AMPERE in situ data. This verifies that it is possible to quantify the simple SCW model of McPherron et al. (1973) in a physically meaningful way, and this case based only on cross-meridian current that forms part of the SCW.

Growth phase currents

A final example pushes AMM to its limits. The previous examples succeeded during substorm expansion, with magnetic perturbations in the hundreds of nT and currents of several hundred kA, but did not converge well based on the weak currents before onset. Growth phase typically features such much smaller perturbations and in turn, currents. Figure 10 shows magnetic variations on February 22, 1997, using smoothed data from February 19, 1997 as a quiet day reference. Due to the fortuitous occurrence of a good quiet day nearby in time, small variations due to local currents are detected. A small expansive phase onset took place at 3.77 UT (03:46 UT), as manifested by the decrease in the X component at several stations at that time and the corresponding changes in the Z component. We did not model that expansive phase but rather the growth phase preceding it. We used data from the CANOPUS Churchill line, the predecessor array to CARISMA (Mann et al. 2008). The event took place in the evening sector, where the growth phase was manifested mainly by an eastward electrojet.

Fig. 10
figure 10

Perturbations at CANOPUS Churchill line stations as observed (lines) and modeled (dots) for the X (black) and Z (blue) components, prior to substorm onset in the evening sector on February 22, 1997

OMNI data (see Fig. 11) had a southward IMF averaging about −4 nT during the period 1:00–3:30 UT, with a brief northward turning at 2:00 UT. Thus, the normal growth phase pattern of expansion of the auroral oval was taking place. Positive X component perturbations moved to lower latitudes as time progressed, moving from ESKI to GILL, indicating equatorward motion of the evening sector auroral oval, as seen in Fig. 10. Z component changes also took place, but it is difficult from a stack plot to decipher exactly what combination of electrojet movement and change of strength of eastward current occurred. We note from the dots in Fig. 10 that a generally good match to data was obtained with a simple electrojet model.

Fig. 11
figure 11

Cross-meridian current over the CANOPUS Churchill line for the growth phase of Fig. 10 (bottom). Electrojet borders overlain on 557.7-nm auroral emission observed from Gillam (middle). IMF Bz in GSE coordinates

Figure 11 presents the output parameters of that model during the growth phase, overplotted on the output of a meridian scanning photometer for 557.7-nm electron precipitation aurora at Gillam (station GILL). At the top is the OMNI IMF Bz. Examining the AMM output first, we note that the eastward current across the meridian was decreasing until time 3.2 UT (3:11). The equatorward boundary of the electrojet moved steadily southward throughout the period, however accelerated when the IMF turned more southward. At this time, the aurora also brightened considerably. The poleward boundary of the electrojet also moved steadily equatorward. This motion took place even as the current weakened, and the current weakening appeared to have no relation to auroral brightness. After 3.2 UT, the current started to increase, without much change in auroral brightness. The overall width of the electrojet narrowed, with the current near 0.05 MA after 3.5 (3:30) UT. Finally, immediately before onset, the current rose dramatically, nearly doubling. After this point, the signal was dominated by intrusion of the westward electrojet of the expansive phase, with its negative perturbations in X, and was not studied.

An important aspect of Fig. 11 is that it again shows AFM results compared to an independent dataset. While the poleward boundary of the evening sector electrojet is defined optically by electron precipitation causing aurora, the equatorward boundary is dark and only indicated by AMM. The results suggest a narrowing of the electrojet and a rise in the current it carries, as immediate precursors of onset.

Discussion

We have described how the Levenberg-Marquardt algorithm, coupled with a magnetic field model, may be applied to ground magnetic data in general as a procedure we call AFM and then used it for meridian chains of stations in the form we denote AMM. AMM can yield insight into the gross physical parameters of electrojets which change during substorms. Despite using only the two electrojet borders and the cross-meridian current, good reproduction of the input data is demonstrated, so that we can conclude that the simple modeling parameters represent a large part of the variation in physical parameters during the events studied.

We showed the technique applied to a previously extensively studied substorm, and how even with low time resolution, inversion results can give a clearer picture of timing than can the AL index. We provided information about the Polaris predecessor of the AUTUMNX network in Québec (eastern North America) and an example of inversion using data from that chain. We emphasized that comparison of output model results to input is essential in knowing to what degree a simple model represents electrojet activity. We then compared the results of AMM as applied to a meridian for which both the SECS reduction technique and AMPERE integrations could also determine cross-meridian current. We found that all three techniques gave similar results. The physical picture provided by AMPERE’s measurement of FAC reinforced the concept that AFM can invert ground data into a three-dimensional framework resembling the original SCW. We noted that in any case, far from the FAC normally present in the AFM model, its cross-meridian current is divergence-free and thus directly comparable to the equivalent current in a 1-D SECS application. Finally, we found that AMM used on very carefully baselined growth phase data could extract information about the eastward electrojet that was, at least in the motion of the poleward border, verifiable from an independent optical dataset.

Conclusion

The AFM technique in its AMM form has been shown here to allow a simple and efficient representation of magnetic meridian data. Its output parameters have been shown to correspond to physical parameters of the electrojet system by comparison to other indicators. AFM is extensible as has been shown by use of ARM by Connors et al. (2014) and less successfully on a global scale (AGM) by Connors (1998) and should be helpful in our quest to understand substorm electrodynamics.