1 Introduction

Solar flares are localized and intense brightenings of the solar atmosphere, much brighter than the background, and they are evident throughout the entire electromagnetic spectrum. They are associated with in situ acceleration of energetic particles and (often) coronal mass ejections (CMEs), comprising some of the most spectacular and energetic phenomena of the solar system (Fletcher et al., 2011). The overall associated energy, released in a very short time, is even exceeding \(10^{32}\) erg in very large events that encompass both thermal and non-thermal processes.

The soft X-ray flux of a flare in the 1 – 8 Å range, as recorded by the Geostationary Operational Environmental Satellites (GOES), is used to categorize solar flares in order from largest to smallest in a logarithmic scale of classes X, M, C, B, and A. These scales are complemented by decimal subclasses (M1.0, C5.2, etc.). Flares of M1.0 and higher are often referred to as major flares and affect the geospatial environment more than the others. On the other hand, B- and A-class flares are the weakest and often lie within the soft X-ray background produced by the global solar atmosphere.

Flares and CMEs affect the geospatial environment in diverse ways on timescales ranging from a few minutes to days. In flares, these effects are immediate. The electron density in a range of ionospheric altitudes is affected by the enhanced X-ray and EUV radiation, which disrupts radio communications, while the subsequent expansion of the atmosphere increases the drag on low-altitude satellites. Space-borne instrumentation and crew are also vulnerable to direct flare-related electromagnetic radiation and particles. This evident absence of early warning for flares and flare-related effects has necessitated the pursuit of accurate flare prediction. The scientific community has intensified efforts in this direction by incorporating the constant flow of solar magnetic observations that have been achieved during the past decades. This is done towards two complementary and often overlapping directions: on one hand, to understand the fundamental physics behind the flare phenomenon (Shibata and Magara, 2011), and on the other hand, to develop efficient flare prediction schemes (see e.g. Georgoulis, 2012b).

Energy released in flares is known to be stored in the intense, complex magnetic field configurations of solar active regions. The ability of magnetic fields to store energy is associated with their departure from a current-free (potential) state. This non-potentiality is evident in optical (H\(\upalpha \), H\(\upbeta \)), EUV, and X-ray images of the active region solar corona, where significant twist of the coronal loops is seen (Leka et al., 1996; Schrijver, 2016). This twist requires substantial amounts of field-aligned electric currents. Therefore, the magnetic field configuration and the electric current distribution offer two aspects of the same physical reality (see e.g. Melrose, 1995, and references therein). Since inferring the electric currents requires the ability to record all three components of the magnetic field vector, studies focusing on electric currents and their role in flares started after vector magnetograms became widely available (see e.g. Canfield et al., 1993; Zhang, 1995; Leka et al., 1996, and references therein). Today, the regular flow of high temporal and spatial resolution photospheric vector magnetograms has boosted research in this topic (e.g. Ravindra et al., 2011; Gosain, Démoulin, and López Fuentes, 2014; Janvier et al., 2014; Vemareddy, Venkatakrishnan, and Karthikreddy, 2015; Inoue et al., 2015).

A pertinent issue is whether electric currents in solar active regions are neutralized. In the ideal case of an isolated twisted flux tube embedded in a field-free medium, the net current along the tube should be zero, being equal to the sum of two opposite-directed currents: a direct (volume) current along its axis that produces its twist, and a return (surface) current that isolates the flux tubes from the field-free environment (Parker, 1979). Current neutralization subsequently means that direct and return currents balance each other within a given magnetic polarity of an active region, giving rise to a zero net current per polarity. However, early calculations based on vector magnetograms showed that high values of net currents exist within active regions, implying a non-neutralization situation and a subsequent injection of net currents in the solar corona (Melrose, 1991, 1995).

Since then, a series of studies addressed the non-neutralization of photospheric electric currents and their origin. Most of them have shown that there are weak or no return currents in the photosphere (Leka et al., 1996; McClymont, Jiao, and Mikic, 1997; Semel and Skumanich, 1998; Wheatland, 2000; Falconer, 2001), and it is now being largely accepted that the currents that run along the magnetic field lines are non-neutralized. However, there have been some opposing arguments, reporting the existence of return currents in isolated sunspots (Wilkinson, Emslie, and Gary, 1992). Moreover, according to Parker (1996), the observational limitations of magnetographs consequently lead to erroneous calculations of net electric currents with no physical meaning, when the differential form of Ampère’s law is used.

To address the issues of spatial resolution and errors that may hinder the correct interpretation, Georgoulis, Titov, and Mikić (2012) proposed a detailed method for calculating the electric current neutralization. They also incorporated a detailed error analysis and imposed strict criteria on current neutralization. Their study of two active regions showed that intense non-neutralized currents are found exclusively at the vicinity of strong magnetic polarity inversion lines (MPIL) and that higher values are linked with higher flare productivity. Strong MPILs separate opposite polarities and are characterized by high magnetic field strength, leading to a plasma \(\upbeta \) on the order of unity at the photosphere. In such conditions, the Lorentz force can overcome the hydrodynamic inertia of the photospheric plasma and produce shear. It was therefore concluded that non-neutralized currents are injected in the atmosphere with the emergence of flux and that the shear observed in MPILs is generated by the Lorentz force when the cylindrical symmetry of the flux-tube footprints breaks down. Their results have not been reproduced since on a larger sample, but the exclusive relation between non-neutralized currents in MPILs and shearing/twisting motions has been demonstrated via observations and models in subsequent studies (Janvier et al., 2014; Török et al., 2014; Vemareddy, Venkatakrishnan, and Karthikreddy, 2015; Dalmasse et al., 2015). Although there is no general consensus on the causal relationship between shearing motions and the Lorentz force, there seems to be a consensus that substantial net currents are injected in the corona along strong MPILs. Dissipative processes (magnetic reconnection) are required for current neutralization, justifying the use of current-related quantities as flare predictors.

Electric current densities have shown good correlation with flaring and coronal mass ejection (CME) activity (Falconer, Moore, and Gary, 2002; Yang et al., 2012) and have been incorporated in flare prediction schemes in several studies (Leka and Barnes, 2003a,b, 2007; Bobra and Couvidat, 2015). In these studies, predictors related to the electric current rank among the best performing, motivating us to investigate new ones, based on the systematic approach of Georgoulis, Titov, and Mikić (2012), where the effects of numerical artefacts and magnetic field measurement errors are mitigated and the contribution of MPILs with strong shear, such as potential flaring sites in active regions, are intensified. Thus, we extend their results to a large sample of active regions, aiming to produce a new current-based predictor for use in the context of the FLARECAST project.

FLARECAST is a novel endeavor whose purpose is to produce real-time flare predictions of unmatched accuracy. It aims to do so by incorporating the most efficient predictors and methods in a highly sophisticated prediction scheme, exploiting advanced machine-learning techniques. Along with using the predictors proposed so far in the literature, part of the project is devoted to developing new ones. In this context, we investigate the flare-prediction capability of the total and maximum non-neutralized electric currents in active regions. Our sample is statistically significant and consists of thousands of space-based active-region vector magnetograms acquired during the current Solar Cycle 24.

2 Dataset Description: SHARP Near Real-Time Data

We used observations from the Helioseismic Magnetic Imager (HMI; Scherrer et al., 2012; Schou et al., 2012) on board the Solar Dynamics Observatory mission (SDO; Pesnell, Thompson, and Chamberlin, 2012). Aiming to produce data that are specifically directed to space weather research, the HMI team have developed the Space weather HMI Active Region Patches (SHARP; Bobra et al., 2014). These are cutouts that contain the magnetic field vector components, remapped and deprojected as if observed at the solar disk center along with a set of quantities used for predicting solar eruptions. In this study we used the near-real time (NRT) cylindrical equal area (CEA) SHARP data. Because we aim to use the non-neutralized currents as predictors in an automated flare-prediction service that will rely on the automatically produced HARP cutouts, no constraints on the solar disk location have been imposed in this study.

To initially test the algorithm and its performance and determine whether there is a correspondence between the evolution of the parameters and the flaring activity of active regions, a sample of 11 National Oceanic and Atmospheric Administration (NOAA) active regions (AR) was chosen. These ARs and the corresponding time-series durations, starts, and ends were selected randomly and show none, low, moderate, or high flare productivity. Table 1 summarizes the time-series start and end, the Mt.Wilson classification type, and the corresponding flare productivity of the active regions.

Table 1 NOAA AR sample. \(t_{\mathrm{start}}\) and \(t_{\mathrm{end}}\) are the starting and ending dates of each time-series. The B, C, M, and X columns denote the number of flares in the corresponding flare class within this interval, FI is the corresponding flare index, while Mag.Type is the Mt.Wilson classification type of the NOAA AR.

For further testing, a representative sample of SHARP cutouts was processed. We selected 336 random days between September 2012 and May 2016, which comprise 25% of the entire SHARP data coverage. For each day, we processed SHARP cutouts at a cadence of 6 h, resulting in 9454 data points.

The GOESFootnote 1 database was used to derive the flare association of the SHARP observations. For every HARP cutout of our sample (both active region time-series and the representative sample), we derived the flare onset times and classes within the following 24 h. Specifically for the time series, we also calculated the total flare index (FI) for the duration of each time-series, using the formula of Abramenko (2005), that is,

$$ \mathrm{FI}=\bigl(100{S}^{(\mathrm{X})}+10{S}^{(\mathrm{M})}+1.0{S}^{(\mathrm{C})}+0.1{S}^{(\mathrm{B})} \bigr)/\Delta t, $$
(1)

where \(S^{(\mathrm{X},\mathrm{M},\mathrm{C},\mathrm{B})}\) is the per class sum of flare magnitude (the number that follows the class letter) and \(\Delta t\) is the time interval in days. The number of flares per class and the total flare index for the AR sample are shown in Columns 4 – 8 of Table 1. Of the 9454 points of the representative sample of SHARP CEA NRT cutouts, 2573, 212, and 16 are associated with C-, M-, and X-class flares, respectively, over the preset window.

3 Analysis: Non-neutralized Currents in Solar Active Regions

We followed the method of Georgoulis, Titov, and Mikić (2012), which incorporates measurement uncertainties and poses strict criteria on deciding whether the calculated currents are neutralized. Here we briefly describe the method, while we refer to the original article for more details.

The map of the radial component of the SHARP vector magnetic field is partitioned into non-overlapping patches of the same magnetic polarity, using a gradient-based flux tessellation scheme (Barnes, Longcope, and Leka, 2005). This partitioning method has also been used by Georgoulis and Rust (2007) for the calculation of the effective connected magnetic field strength. To ensure that quiet-Sun magnetic elements are excluded and only sizable AR-only magnetic polarities are taken into account, thresholds on magnetic field strength, enclosed magnetic flux, and area are imposed. According to Georgoulis, Titov, and Mikić (2012), small changes in the selected thresholds do not alter the essence of the results. Their conclusion was also verified in our dataset by testing thresholds, e.g. magnetic field strength up to 300 G and enclosed magnetic flux up to \(10^{20}\) Mx. Using lower thresholds will result in more partitions, which will not contribute to the calculations but will dramatically slow up the process. The thresholds selected in magnetic field strength, enclosed magnetic flux, and size per partition are 100 G, \(5\times 10^{19}\) Mx, and 40 pixel. Before partitioning, the input magnetograms were smoothed with a \(5\times 5\) pixel kernel to produce partitions with slightly smoother boundaries and (in the case of time-series) avoid dramatic changes of the partitioning process from frame to frame. The process returns all non-overlapping partitions, along with their flux-weighted centroid positions, boundaries, and corresponding magnetic fluxes.

For each partition, the vertical electric current density is calculated using the differential form of Ampère’s law:

$$ J_{z}=\frac{1}{\mu_{0}}\biggl(\frac{\partial B_{y}}{\partial x}-\frac{ \partial Bx}{\partial y} \biggr), $$
(2)

where \(B_{x}\) and \(B_{y}\) are the two horizontal components of the magnetic field and \(\mu_{0}\) is the magnetic permeability of the vacuum. The total current \(I_{i}\) contained in each partition is calculated as the algebraic sum of the contained \(J_{z}\). The error maps of the three components of the magnetic fields are used to calculate the corresponding error \(\delta I_{i}\) for each \(I_{i}\). To decide whether \(I_{i}\) is non-neutralized, we calculated the corresponding potential magnetic field components (Alissandrakis, 1981) using the vertical magnetic field \(B_{z}\) as boundary condition and repeated the calculation for the potential magnetic field configuration. The value of current \(I_{i}^{\mathrm{pot}}\) in this case should be, by definition, equal to zero. Owing to numerical effects, however, non-zero results are produced. The total electric current, \(I_{i}\), of a given partition is considered non-neutralized, \(I_{i}^{\mathrm{NN}}\), if it satisfies the following criteria: \(I_{i} > 5\times I_{i}^{\mathrm{pot}}\) and \(I_{i} > 3\times \delta I _{i}\). From the set of non-neutralized partition currents (if any) within the active region at a given time, we calculated the maximum value, \(I_{\mathrm{NN},\mathrm{max}}\), and the total unsigned non-neutralized current \(I_{\mathrm{NN},\mathrm{tot}}\). The latter is defined as the sum of the absolute current values of the non-neutralized partitions, \(I_{i}^{\mathrm{NN}}\) i.e.,

$$ I_{\mathrm{NN},\mathrm{tot}}=\sum_{i}\bigl|I_{i}^{\mathrm{NN}}\bigr|. $$
(3)

The rationale for selecting these two quantities is, as already mentioned, that strong flares in ARs are associated with MPILs, which are in turn associated with strong non-neutralized currents (Georgoulis, Titov, and Mikić, 2012; Dalmasse et al., 2015; Vemareddy, Venkatakrishnan, and Karthikreddy, 2015). Several sites in an active region may develop suitable conditions for flaring. \(I_{\mathrm{NN},\mathrm{max}}\) may be an indicator of the non-neutralized current that has increased at a certain site, while if several such locations are indeed present (hence \(I_{\mathrm{NN},\mathrm{tot}}\) is increased), then the active region should be even more prone to strong flares.

It should be pointed out that our method ensures that the calculated parameters are not size dependent but exclusively linked to the presence of strong MPILs. In this sense, \(I_{\mathrm{NN},\mathrm{tot}}\) and \(I_{\mathrm{NN},\mathrm{max}}\) are not extensive but intensive, and therefore neither the size of an active region nor the extent of the cutout (so long as it fully contains the active region) affect the calculations.

We investigated the flare-predictive capability of \(I_{\mathrm{NN},\mathrm{max}}\) and \(I_{\mathrm{NN},\mathrm{tot}}\) in a twofold manner. First, we explored the time evolution of the non-neutralized currents of active regions in comparison with their flaring activity in order to assess whether the results of Georgoulis, Titov, and Mikić (2012) extend to more than the two cases the authors presented. Then, the performance of the two parameters was examined on a sizable sample of point-in-time observations that is representative of the sample that FLARECAST will rely on.

4 Results

4.1 General Trends, and Rationale for Using Non-neutralized Currents in the FLARECAST Project

The purpose of this section is to assess whether 1) more flare-productive ARs are linked with higher non-neutralized currents, 2) the two quantities show an evolution similar to their flare productivity, and 3) this evolution is suitable for prediction, i.e. if the increase in the two parameters precedes flares and shows some correlation with flare class or flare index within a given time-window.

In Figure 1 we plot the temporal evolution of the \(I_{\mathrm{NN},\mathrm{tot}}\) and \(I_{\mathrm{NN},\mathrm{max}}\) for each of the 11 active regions of the first sample. It is clear that flare-productive ARs have \(I_{\mathrm{NN},\mathrm{tot}}\) and \(I_{\mathrm{NN},\mathrm{max}}\) values that are higher by more than an order of magnitude. The strongest flares require the existence of systematically high non-neutralized currents such as in the case of ARs 11429, 11515, 11748, and 11875. Another interesting finding is that many strong flares are preceded by a few hours of increasing \(I_{\mathrm{NN},\mathrm{tot}}\) (e.g. ARs 11158, 11515, and 11875). This increase corresponds to the emergence of new flux and to the development of a strong PIL and is also reflected on their magnetic type evolution (Table 1, last column).

Figure 1
figure 1

\(I_{\mathrm{NN},\mathrm{tot}}\) (black) and \(I_{\mathrm{NN},\mathrm{max}}\) (blue) temporal evolution of the 11 ARs of Table 1. Blue, green, yellow, and red vertical lines mark the start times of B-, C-, M-, and X-class flares.

Active regions with \(I_{\mathrm{NN},\mathrm{tot}}\) and \(I_{\mathrm{NN},\mathrm{max}}\) lower than \(10^{12}\) A exhibit very low or no flaring activity at all. For instance, AR 11072 and AR 11663 produced only B or up to C1.1 class flares, respectively. In the latter case, these flares were clustered around relatively enhanced non-neutralized currents during the second day. This clustering of flares during well-defined peaks of \(I_{\mathrm{NN},\mathrm{tot}}\) is also observed in AR 11640, which exhibits more frequent flaring activity, owing to its overall higher values of \(I_{\mathrm{NN},\mathrm{tot}}\) and \(I_{\mathrm{NN},\mathrm{max}}\).

To some extent, the more sizable flare-productive active regions also exhibit repeated flaring activity during peaks of the two parameters, although these peaks do not stand out clearly, but lie on an increased background because these active regions contain many non-neutralized partitions along intense MPIL, which increase the total amount of non-neutralized current. More careful inspection shows that even for these active regions, the long temporal increase of non-neutralized currents exhibits structure on finer temporal scales, manifested by shorter peaks in addition to the general trend. This is seen mostly at the \(I_{\mathrm{NN},\mathrm{tot}}\) curve, while the \(I_{\mathrm{NN},\mathrm{max}}\) curve is less inclined. This difference may arise because the non-neutralized current of a given partition may not increase above a certain limit. Instead, the increasing number of non-neutralized partitions results in an overall increase of \(I_{\mathrm{NN},\mathrm{tot}}\). We also note that intense flaring activity is found at times when \(I_{\mathrm{NN},\mathrm{tot}}\) is several times higher than \(I_{\mathrm{NN},\mathrm{max}}\). We return to this remark in the next section. By incorporating the added effect of multiple non-neutralized partitions within an active region, \(I_{\mathrm{NN},\mathrm{tot}}\) seems to represent its evolution more efficiently, and in this sense, it may prove a more efficient flare predictor than \(I_{\mathrm{NN},\mathrm{max}}\).

Accumulation of non-neutralized currents in AR partitions as part of an energy build-up process may be clearly seen in ARs 11158, 11429, 11515, and 11875. In AR 11158, the flux emergence that is observed at the end of the second day is accompanied by an increase in the total non-neutralized currents, showing that flux emergence results in the injection of net currents in the AR corona. This is in line with previous results on the origin of the non-neutralized currents. Georgoulis, Titov, and Mikić (2012) used a dimensionless non-neutralization factor to show that two active regions with very different flare productivity were coherent structures. This led the authors to conclude that the non-neutralized currents they contained had a sub-photospheric origin. Their results were corroborated by the magnetohydrodynamics (MHD) simulations of Török et al. (2014), which showed that active regions are born with substantial non-neutralized currents.

The association between non-neutralized current increase and flaring–CME activity has been implied in previous studies (see e.g. Ravindra et al., 2011; Janvier et al., 2014; Vemareddy, Venkatakrishnan, and Karthikreddy, 2015). In most of these studies, which prompt the evolution of an AR, this association was inferred a posteriori by manually selecting regions of interest, based on the sites of recorded flaring activity (around MPILs). Conversely, judging by Figure 1, it seems that the temporal variation of \(I_{\mathrm{NN},\mathrm{tot}}\) and \(I_{\mathrm{NN},\mathrm{max}}\) brings out the evolution of non-neutralized currents at the regions of interest as clear enhancements in addition to the overall evolution. During the strongest flares, the total non-neutralized current continues to increase, showing that the free energy build-up process continues and flares remove only a part of this energy. According to Janvier et al. (2014), a local increase of the current may be observed after the flare at the footpoints of the ribbons.

It has been shown that strong direct currents, with weak or no return currents (implying the existence of non-neutralized currents), are compatible with the existence of twisted flux ropes (Schrijver et al., 2008). For instance, AR 11429 was one of the most flare-productive ARs of Cycle 24, whose total non-neutralized current content is the highest of the sample (along with that of AR 11515). AR 11429 exhibited strong MPIL, with strong shear, indicative of strong deviations from non-potentiality, justifying the build-up of large amounts of \(I_{\mathrm{NN},\mathrm{tot}}\). It produced two X-class flares at the end of day 3, both accompanied by CMEs, which were preceded by the formation of magnetic flux ropes (Chintzoglou, Patsourakos, and Vourlidas, 2015; Syntelis et al., 2016). The X-class flare of the well-studied AR 11158 has also been associated with a flux rope formation (Inoue et al., 2015). These comparisons are useful as they possibly link the proposed predictors with the underlying physics of the flaring phenomenon. A more detailed examination exceeds the scope of the present study and will be reserved for the future.

From a flare-forecasting point of view, we may infer approximate \(I_{\mathrm{NN},\mathrm{tot}}\) thresholds of \(0.5\times 10^{12}\), \(5\times 10^{12}\), and \(15\times 10^{12}\) A for C-, M-, and X-class flares to occur. However, except for one case where a C-class flare occurred for \(0.5\times 10^{12}\) A (AR 11663), for the rest of the sample, a \(I_{\mathrm{NN},\mathrm{tot}}\) value of at least \(3\times 10^{12}\) A was observed. Of course these thresholds are indicative: the actual merit of \(I_{\mathrm{NN},\mathrm{tot}}\) and \(I_{\mathrm{NN},\mathrm{max}}\) is examined in the next section. Still, the good correspondence between \(I_{\mathrm{NN},\mathrm{tot}}\), \(I_{\mathrm{NN},\mathrm{max}}\) and flare productivity is also illustrated in Figure 2. There, we plot the flare index during the observation interval for each AR of Figure 1 along with the corresponding temporal averages of the two examined predictors. From this plot it is also confirmed that quiet and flare-productive ARs differ by more than an order of magnitude in terms of non-neutralized current content. The Pearson correlation coefficients between FI and \(I_{\mathrm{NN},\mathrm{tot}}\), \(I_{\mathrm{NN},\mathrm{max}}\) are 0.869631 and 0.865931, respectively. We therefore conclude that the flare productivity of active regions is very well correlated with their amount of non-neutralized electric currents.

Figure 2
figure 2

Time-averaged \(I_{\mathrm{NN},\mathrm{tot}}\) (blue) and \(I_{\mathrm{NN},\mathrm{max}}\) (green) for each active region of Table 1 and the corresponding flare index (red), calculated for the entire duration of each time-series of Figure 1.

4.2 \(I_{\mathrm{NN},\mathrm{tot}}\) and \(I_{\mathrm{NN},\mathrm{max}}\) as Solar Flare Predictors

Manually selected active region samples are useful to (visually) assess whether there is flare-predictive potential in certain active region properties, in order to test the performance of the calculation algorithms, and to fine-tune the input parameters. The time evolution of these properties also often allows deriving some association with the physical processes that are involved in the flaring phenomenon. Nevertheless, these samples, however randomly selected, are inherently different than the samples that are actually used in automated prediction services, where photospheric magnetogram cutouts are produced in real time and no (human-biased) selection effects apply. To accommodate an examination of \(I_{\mathrm{NN},\mathrm{tot}}\) and \(I_{\mathrm{NN},\mathrm{max}}\) in “realistic” conditions, we used the representative sample of 9454 SHARP CEA NRT vector magnetograms.

The corresponding non-zero \(I_{\mathrm{NN},\mathrm{tot}}\) and \(I_{\mathrm{NN},\mathrm{max}}\) are presented as a scatter plot in Figure 3 along with color-coded flare association information. It is clearly seen that most cutouts that are associated with major flares within the next 24 h (marked with yellow and red crosses) are found towards the upper right corner, i.e. are associated with higher non-neutralized electrical currents. It therefore seems that the two parameters allow a partial separation between flaring and non-flaring regions. For major flares to occur within the following 24 h, approximate thresholds of \(I_{\mathrm{NN},\mathrm{tot}}\) and \(I_{\mathrm{NN},\mathrm{max}}\) may be derived: \(5\times 10^{11}~\text{A}\) (\(I _{\mathrm{NN},\mathrm{tot}}\)) and \(1.9\times 10^{11}~\text{A}\) (\(I_{\mathrm{NN},\mathrm{max}}\)) for M-class flares and \(4.6\times 10^{12}~\text{A}\) (\(I_{\mathrm{NN},\mathrm{tot}}\)) and \(8\times 10^{11}~\text{A}\) (\(I _{\mathrm{NN},\mathrm{max}}\)) for X-class flares.

Figure 3
figure 3

\(I_{\mathrm{NN},\mathrm{tot}}\) vs. \(I_{\mathrm{NN},\mathrm{max}}\) for the representative sample of SHARP CEA NRT cutouts with non-zero non-neutralized currents. Lower to upper diagonal lines mark the \(I_{\mathrm{NN},\mathrm{tot}} = [1,1.5,2,2.5,3,3.5] \cdot I_{\mathrm{NN},\mathrm{max}}\) fits.

From the examination of the time evolution of the parameters in the previous section (Figure 1), we found that intense flaring occurs at times when the difference between \(I_{\mathrm{NN},\mathrm{tot}}\) and \(I_{\mathrm{NN},\mathrm{max}}\) is largest, while in quiet phases, the two quantities are roughly equal (or both close to zero). Figure 3 allows us to extend these results to a larger sample. It can be seen that no major flares are found below the \(I_{\mathrm{NN},\mathrm{tot}} = I_{\mathrm{NN},\mathrm{max}}\) line: M- and X-class flares require \(I_{\mathrm{NN},\mathrm{tot}} > 1.5 \cdot I_{\mathrm{NN},\mathrm{max}}\) and \(I_{\mathrm{NN},\mathrm{tot}} > 3.5 \cdot I_{\mathrm{NN},\mathrm{max}}\), respectively. Therefore, major flares require the presence of many non-neutralized partitions. A large number of non-neutralized partitions denotes the presence of a stronger, more fragmented MPIL. It is well established that strong complicated MPILs are linked with high non-potentiality of the magnetic field (e.g. Falconer, Moore, and Gary, 2002; Schrijver, 2007) and strong shear and that they are consequently related to major flares. It is reasonable to suggest that if an AR contains several MPILs or one extended/more complicated MPIL, it is more prone to flare. While it seems that one single partition may reach a specific amount of non-neutralized current (reflecting high \(I_{\mathrm{NN},\mathrm{max}}\)), the presence of many non-neutralized partitions (either because there are several non-neutralized partitions along an MPIL or because there are several MPILs within an AR) results in \(I_{\mathrm{NN},\mathrm{tot}}\) being several times higher than \(I_{\mathrm{NN},\mathrm{max}}\). The findings of Figure 3 are therefore an alternative way to demonstrate that strong, fragmented MPILs are required for major flares and that this requirement is effectively incorporated in the definition of \(I_{\mathrm{NN},\mathrm{tot}}\).

Next, we compared the performance of \(I_{\mathrm{NN},\mathrm{tot}}\) and \(I_{\mathrm{NN},\mathrm{max}}\) with that of the total unsigned flux, \(\Phi_{\mathrm{tot}}\), and the two electric current-related parameters given by the SHARP team, that is, the total vertical current \(J_{z,\mathrm{tot}}\) and the sum of the modulus of the average net current per polarity \(J_{z,\mathrm{sum}}\) (Bobra et al., 2014). The two latter parameters express the amount of net current in an active region, and the aim of the comparison is to investigate in practice whether \(I_{\mathrm{NN},\mathrm{tot}}\) and \(I_{\mathrm{NN},\mathrm{max}}\) have meaningful or trivial differences with these already established parameters.

The scatter plots between parameters calculated for each of the 9454 points are found in Figure 4. Cutouts associated with flares of different classes that occur over the next 24 hours are denoted differently. The calculation process produced zero or no results for many cases. This is either because some cutouts contained no non-neutralized partitions or because the partitioning process was, at times, unable to provide eligible partitions as per the preset thresholds. Both cases may include decaying or developing AR without MPILs and were treated as corresponding to zero \(I_{\mathrm{NN},\mathrm{tot}}\) and \(I_{\mathrm{NN},\mathrm{max}}\). These points are included in all panels of Figure 4 at an ordinate equal to \(10^{10}\) A.

Figure 4
figure 4

\(I_{\mathrm{NN},\mathrm{tot}}\) (left) and \(I_{\mathrm{NN},\mathrm{max}}\) (right) vs. \(\Phi_{\mathrm{tot}}\) (top), the total unsigned vertical current \(J_{z}\) (middle) and the sum of the modulus of the average net current per polarity \(J_{z,\mathrm{sum}}\)(bottom). The points associated with flares within 24 h are noted in different colors.

In general, all panels in Figure 4 imply at best a weak correlation between the examined parameters. The highest values of \(\Phi_{\mathrm{tot}}\) and \(J_{z}\) correlate better with \(I_{\mathrm{NN},\mathrm{tot}}\) and \(I_{\mathrm{NN},\mathrm{max}}\). These points also correspond to increased flaring activity, as most of the major flares (yellow and red crosses) are concentrated there.

It is also obvious that zero non-neutralized electric currents do not correspond to zero \(\Phi_{\mathrm{tot}}\), \(J_{z}\), and \(J_{z,\mathrm{sum}}\). These are, by definition, extensive parameters, i.e. they increase with the size of the active region and/or the extent of the cutout and acquire non-zero values. On the other hand, \(I_{\mathrm{NN},\mathrm{tot}}\) and \(I_{\mathrm{NN},\mathrm{max}}\) are intensive (non-extensive) parameters, as they depend only on the size and strength of the MPIL and are zero when no MPILs are formed, regardless of the cutout extent. In our sample we find that only a very small fraction of these cutouts (\({\sim}\, 0.5\%\)) are associated with C-class flares and none with M- or X-class flares.

The qualitative characteristics of the scatter plots of Figure 4 are quantified in terms of flare-forecasting potential by the Bayesian-inferred probabilities (Georgoulis, 2012a; Wheatland, 2005) in Figure 5. For a given threshold, normalized to the maximum value of each parameter to facilitate comparison, we count the total number, N, of active regions with a parameter value higher than the threshold and the number, F, of them that are flaring within the next 24 h. Then, the Bayesian probability is given by

$$ {p=\frac{\mathrm{F}+1}{\mathrm{N}+2}} , $$
(4)

with an uncertainty of

$$ {\delta p=\sqrt{\frac{p(1-p)}{\mathrm{N}+3}}}. $$
(5)
Figure 5
figure 5

Bayesian-inferred probabilities of \(I_{\mathrm{NN},\mathrm{tot}}\) (red) and \(I_{\mathrm{NN},\mathrm{max}}\) (orange) compared with those of \(\Phi_{\mathrm{tot}}\) (black), \(J_{z}\) (green), and \(J_{z,\mathrm{sum}}\) (cyan) for C-, M-, X-, above C-, and above M-class flares. \(\Phi_{\mathrm{tot}}\), \(J_{z}\), \(J_{z,\mathrm{sum}}\), \(I_{\mathrm{NN},\mathrm{tot}}\), and \(I_{\mathrm{NN},\mathrm{max}}\) thresholds have been normalized to the corresponding maximum values, i.e. \(1.68917\times 10^{23}\) Mx, \(2.59441 \times 10^{14}\) A, \(4.21399\times 10^{13}\) A, \(7.52898\times 10^{13}\) A, and \(6.06393\times 10^{12}\) A, respectively. Each bin has the same statistical weight to facilitate comparison.

The calculated Bayesian probabilities of the five parameters as functions of the normalized thresholds are shown in Figure 5. The two non-neutralized current parameters produce more significant probabilities than \(\Phi_{\mathrm{tot}}\), \(J_{z}\), and \(J_{z,\mathrm{sum}}\) for a range of normalized thresholds. As expected, \(I_{\mathrm{NN},\mathrm{tot}}\) is more efficient. For C-class flares and flares higher than C-class, the difference with \(\Phi_{\mathrm{tot}}\) starts at ∼0.1 for low thresholds and exceeds 0.4. For major solar flares, the performance of \(I_{\mathrm{NN},\mathrm{tot}}\) is significantly better than the \(\Phi_{\mathrm{tot}}\). It should be noted that for very high thresholds the uncertainties increase, as the number of eligible data points (N) becomes smaller, which decreases the significance of the statistical sample. The results of Figure 5 suggest that \(I_{\mathrm{NN},\mathrm{tot}}\) and \(I_{\mathrm{NN},\mathrm{max}}\) are more efficient than \(\Phi_{\mathrm{tot}}\), \(J_{z}\), and \(J_{z,\mathrm{sum}}\) at separating flaring from non-flaring active regions. They are therefore worth considering as flare predictors.

5 Discussion and Conclusions

In the context of an operational flare-forecasting service, we processed a sizable sample of SDO/HMI SHARP cutouts, aiming to examine the forecasting potential of two predictors related to the amount of non-neutralized electric currents contained in active regions. For the first time, we studied the evolution of the total and maximum unsigned non-neutralized currents, \(I_{\mathrm{NN},\mathrm{tot}}\) and \(I_{\mathrm{NN},\mathrm{max}}\), for 11 active regions, as well as their performance in flare prediction in a representative sample of SHARP CEA NRT data, consisting of 9454 cutouts.

A brief discussion of the temporal evolution of these two new parameters showed that they correspond to physical processes such as MPIL formation, and as per previous studies, to a possible flux rope formation, closely related with intense flaring activity. After focusing on the potential of the \(I_{\mathrm{NN},\mathrm{tot}}\) and \(I_{\mathrm{NN},\mathrm{max}}\) as solar flare predictors, however, we did not discuss the origin and evolution of non-neutralized currents in active regions thoroughly. This is a long-standing question that has been addressed through simulations and observations (Georgoulis, 2017), and a more detailed investigation is pending.

Both parameters are closely associated with flare productivity, confirming and extending the conclusions drawn by Georgoulis, Titov, and Mikić (2012). It was shown for the first time, using a large representative sample of Solar Cycle 24, that both predictors are worth considering in automated flare prediction schemes. Of the two proposed predictors, \(I_{\mathrm{NN},\mathrm{tot}}\) characterizes the flaring potential of an active region more efficiently because it encompasses the contributions of all possible flare-producing sites within an active region.

An important advantage of these predictors is that they include a detailed error analysis in the calculations of the non-neutralized currents. This approach has proven to be reliable for concluding whether the currents in ARs are neutralized (Georgoulis, Titov, and Mikić, 2012). The immediate comparison with the potential field ensures that the measured currents do exist and are significant through existing observations and are not due to the numerical effects that arise when analytical expressions (such as Ampère’s law) are applied to discretized distributions of continuous physical quantities (such as the magnetic flux density). Therefore, possibly erroneous electric currents are constrained. Qualitatively, our results also suggest that the method we used does not depend on the resolution of the processed magnetograms. The method was originally developed for Hinode SOT/SP magnetograms, which have the highest resolution available to date, and in this study was applied to the lower resolution SDO/HMI vector magnetograms. For all the reasons mentioned, \(I_{\mathrm{NN},\mathrm{tot}}\) is more advantageous than other commonly used current parameters, such as the total vertical current density, \(J_{z}\), or the total unsigned current per polarity.

This said, the association between \(I_{\mathrm{NN},\mathrm{tot}}\) and \(J_{z}\) is not trivial since by definition, there is no linear relationship between the two predictors. The total unsigned current of an active region is an extensive parameter, meaning that in principle, it increases with the size of the active region (similarly to \(\Phi_{\mathrm{tot}}\)). Non-neutralized currents, on the other hand, should be zero for active regions that do not contain strong MPILs or for SHARP cutouts that lack one polarity, regardless of their magnetic flux content and vertical electric current density. Therefore, \(I_{\mathrm{NN},\mathrm{tot}}\) and \(I_{\mathrm{NN},\mathrm{max}}\) are non-extensive parameters (as the lack of correlation in Figure 4 also demonstrates), which incorporate in a physical way morphological information in active regions with possible (and plausible) implications for flare triggering (i.e. strong and sheared MPILs). Given that current-related parameters have shown good prospects for flare prediction (Bobra and Couvidat, 2015), we expect improved results for \(I_{\mathrm{NN},\mathrm{tot}}\). In addition, non-extensive parameters were found to outperform the extensive ones in CME prediction (Bobra and Ilonidis, 2016), which means that there may be some potential in extending the usage of \(I_{\mathrm{NN},\mathrm{tot}}\) to CME prediction as well.

In view of numerous studies on solar flare prediction, it has now become apparent that no single flare-predictor suffices to categorically forecast solar flares and that active regions are complex dynamical systems where more than one parameter is involved in flare triggering. Investigating the evolution and absolute magnitudes of non-neutralized electric currents is more attractive and tractable in the era of systematically and regularly available vector magnetograms. It is thus tempting to explore the possibility of producing, in the future, several predictors that involve such currents. For the moment, we are planning to include \(I_{\mathrm{NN},\mathrm{tot}}\) in the set of predictors that will feed machine-learning algorithms in the framework of the FLARECAST project, at the same time investigating their physical significance via the project’s explorative research component.