1 Introduction

The solar cycle is a product of the solar dynamo processes that drive the cyclic regeneration of the Sun's magnetic field (Charbonneau, 2010; Hathaway, 2015). It is quasi-periodic in nature, with a period of approximately 11 years. The rise and fall of solar activity has a direct impact on the geospace environment and on life on Earth. An increase in solar activity brings increased harmful EUV and X-ray emission toward Earth, which alters the temperature and density of our atmosphere and can shorten the orbital lifetime of satellites in low Earth orbit (Pulkkinen, 2007; Hathaway, 2015). Higher solar activity also produces more solar flares and coronal mass ejections (CMEs), which can damage communication systems, power systems, satellites, and various other assets, in addition to posing a hazard to astronauts in space. It is therefore in our best interest to be able to predict the strength of the solar cycle with good accuracy, for long-term planning of space weather impacts on space missions and societal technologies.

Predicting the strength of the solar cycle is a non-trivial task due to the complexity of the solar dynamo. Forecasting strategies and methods have varied with the type of index used to estimate the strength and timing of the solar cycle. Statistical properties such as the periodicity and trends observed in previous cycles have been used to predict the solar cycle (Ahluwalia, 1998; Wilson, Hathaway, and Reichmann, 1998; Javaraiah, 2007). Other methods have been based on geomagnetic precursors (Thompson, 1993; Hathaway and Wilson, 2006; Wang and Sheeley, 2009), polar fields (Layden et al., 1991; Svalgaard, Cliver, and Kamide, 2005; Muñoz-Jaramillo et al., 2012), and flux-transport dynamos (Dikpati, de Toma, and Gilman, 2006; Kitiashvili and Kosovichev, 2008; Nandy, Muñoz-Jaramillo, and Martens, 2011). Neural networks trained on sunspot numbers have also been used in solar cycle predictions (Fessant, Bengio, and Collobert, 1996; Pesnell, 2012). More recently, Pala and Atici (2019) and Covas, Peixinho, and Fernandes (2019) used neural networks to predict the strength of the upcoming Solar Cycle 25.

The variation in the number of sunspots is an indicator of solar activity (Hathaway, 2015). It also corresponds directly to the total sunspot area, which in turn relates to the magnetic field entering the corona. The quasi-periodic nature of the solar cycle makes these datasets good candidates for time series forecasting methods, which have been applied successfully in areas such as finance (Bao, Yue, and Rao, 2017), meteorology (Bai et al., 2016), and signal processing (Khandelwal, Adhikari, and Verma, 2015).

Classical methods for time series forecasting include models based on moving averages (MA) and autoregression (AR), such as the Autoregressive Moving Average (ARMA) and Autoregressive Integrated Moving Average (ARIMA) models, in which the time series of historical observations is assumed to be linear and to follow a known stochastic distribution. Variants of these classical methods, such as the Seasonal Autoregressive Integrated Moving Average (SARIMA), Seasonal Autoregressive Integrated Moving Average with Exogenous Regressors (SARIMAX), Vector Autoregression (VAR), and Vector Autoregression Moving Average (VARMA) models, have also been applied successfully (Box and Jenkins, 1976; Hipel and McLeod, 1994; Cochrane, 1997; Cools, Moons, and Wets, 2009). More recently, machine learning and deep-learning techniques have been used extensively for time series forecasting problems, with better results. Deep-learning methods benefit from not having to assume anything about the long-term or short-term distributions of the data, and from their capability to model complex non-linear systems efficiently. Studies have shown that neural network and deep-learning based models outperform the classical methods for time series forecasting tasks because of their ability to handle the non-linearity that is more likely to be seen in real-world problems (Adhikari and Agrawal, 2013; Siami-Namini, Tavakoli, and Siami Namin, 2018; Pala and Atici, 2019); a minimal example of a classical fit is sketched below.
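To make the classical approach concrete, the following hedged sketch fits an ARIMA model to a univariate monthly series using the statsmodels library; the synthetic series and the (p, d, q) orders are placeholders for illustration, not values used or tuned in this study:

```python
import numpy as np
import pandas as pd
from statsmodels.tsa.arima.model import ARIMA

# Synthetic stand-in for a monthly activity index (placeholder data only).
t = np.arange(600)
series = pd.Series(60 + 50 * np.sin(2 * np.pi * t / 132)
                   + 5 * np.random.randn(600))

# Fit an ARIMA(p, d, q) model; the orders here are illustrative only.
fit = ARIMA(series, order=(4, 1, 2)).fit()
forecast = fit.forecast(steps=132)  # forecast one 11-year cycle (132 months)
```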

Recent research has shown that convolutional networks with features such as dilated convolutions and residual connections can outperform generic recurrent architectures on sequence modeling tasks (Bai, Zico Kolter, and Koltun, 2018). In this study, we propose a model based on WaveNet (van den Oord et al., 2016), which uses one-dimensional dilated convolutions and residual connections, combined with an LSTM (Hochreiter and Schmidhuber, 1997), to model the distribution of the time series data with the capability to learn both very long-term and short-term dependencies. We also carry out a comparison study of three other models built from combinations of LSTMs and 1D ConvNets, to find the deep neural network model that delivers the most accurate forecasts. These models are applied to the monthly datasets of both the observed sunspot number and the total sunspot area with the goal of predicting the strength of the upcoming solar cycle. The objective of this paper is to present an improved method to forecast the next 11-year cycle, Solar Cycle 25. We describe the data preparation, the applied neural network architectures, and the resulting predictions.

2 Dataset Preparation

Predicting each of the sunspot number and the sunspot area for the upcoming Solar Cycle 25 is cast as a univariate multi-step time series forecasting task. A historical time series \([x_{1}, x_{2}, \ldots , x_{T}]\) is used as input to predict the next \(N\) time steps \([x_{T+1}, x_{T+2}, \ldots , x_{T+N}]\), known as the forecast horizon. The data are segmented using the sliding window method: a fixed-size window of observations from the time series is chosen as input, and a fixed number of the following observations form the forecast horizon. This windowing process is repeated over the entire dataset by sliding the window one time step at a time to obtain the next input and forecast horizon pair.

Figure 1 illustrates the sliding window method for multi-step time series forecasting. The sunspot numbers were obtained from the World Data Center SILSO, Royal Observatory of Belgium, Brussels http://sidc.be/silso/datafiles (SILSO World Data Center, 2019). The dataset contains 3251 records of monthly averaged sunspot number observations from the year 1749 to 2019. The sunspot area dataset was obtained from the website http://solarcyclescience.com/activeregions.html made available and maintained by Lisa Upton and David Hathaway. It contains daily sunspot areas from the year 1874 to 2019; averaging the daily data by month produces 1744 records. Both datasets are treated as univariate time series, and the sliding window method is applied to each. Because the magnetic polarities of the sunspots in the northern and southern hemispheres change sign in alternate solar cycles, at least two solar cycles of data are needed to forecast the next cycle. However, two cycles are insufficient to capture the longer trends in the data seen in Figures 2 and 3. Choosing a window size of four cycles ensures that both the polarity changes and the longer trend are visible to the model, helping it produce a more accurate forecast. Therefore, a window size of 528 observations (4 cycles × 11 years/cycle × 12 months/year) and a forecast horizon of 132 observations (1 cycle × 11 years/cycle × 12 months/year) were chosen, as shown in Figure 4. This produces 2560 unique input window and forecast horizon pairs for the sunspot number dataset. The same sliding window method, with a window size of 528 and a forecast horizon of 132 observations, applied to the monthly averaged sunspot area dataset produces 1085 unique pairs (a sketch of the windowing step is given below). Since deep neural network models require large datasets, this large number of input and forecast pairs helps the models produce accurate forecasts. Figure 5 shows that when the sunspot number and total sunspot area datasets are normalized and aligned in time, they exhibit similar variation; each dataset is therefore indicative of the other in both strength and time of occurrence.
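A minimal sketch of the windowing step (our illustration, assuming the monthly series is a 1D NumPy array; the function name is ours) using the window and horizon sizes from this study:

```python
import numpy as np

def sliding_windows(series, window=528, horizon=132):
    """Return (inputs, targets) pairs produced by sliding a fixed-size
    window one time step at a time over a 1D series."""
    X, y = [], []
    for i in range(len(series) - window - horizon + 1):
        X.append(series[i : i + window])                    # input window
        y.append(series[i + window : i + window + horizon]) # forecast horizon
    return np.asarray(X), np.asarray(y)

# Example: a 1744-month series (the sunspot area record length) yields
# 1744 - 660 + 1 = 1085 window/horizon pairs.
X, y = sliding_windows(np.arange(1744, dtype=float))
print(X.shape, y.shape)  # (1085, 528) (1085, 132)
```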

Figure 1

Sliding window method illustrated with an example sequence of numbers from 1 through 10. Here each number represents one time step. The input window is slid one time step at a time throughout the whole sequence of data available to form four unique input and forecast horizon pairs (\(T=4\), \(N=3\)).

Figure 2

Monthly averaged sunspot number for the years 1749 to 2019.

Figure 3

Monthly averaged total sunspot area for the years 1874 to 2019.

Figure 4

Input window of size 528 time steps and forecast horizon of size 132 time steps for the sunspot area dataset.

Figure 5

Sunspot number (SSN) and total sunspot area (TSA), normalized and aligned to the same dates between the years 1874 and 2019.

In order to make an accurate forecast based on historical observations, a model should ultimately be trained on the complete dataset. However, this makes it impossible to judge the model's performance, since no ground truth remains against which to verify the forecast. Therefore, we divide our datasets into training and validation splits and pick the model with the best validation performance to be trained on the entire dataset for the final forecast. We use a time series cross-validation scheme known as "TimeSeriesSplit", implemented in the widely used machine learning library Scikit-learn (Pedregosa et al., 2011). The idea is to divide the data into training and validation sets at each fold, or iteration, such that the validation data are always ahead of the training data. Figure 6 shows the cross-validation scheme (Qamar-ud-Din, 2019). This preserves the chronological order of the observations, allowing the model to identify trends in the data; a short example follows.
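The following sketch (our illustration) shows the scheme with the actual Scikit-learn class; the placeholder array stands in for the time-ordered window/horizon pairs:

```python
import numpy as np
from sklearn.model_selection import TimeSeriesSplit

data = np.arange(100)  # placeholder for time-ordered training pairs

for fold, (train_idx, val_idx) in enumerate(
        TimeSeriesSplit(n_splits=5).split(data)):
    # The validation indices always lie after the training indices,
    # so chronological order is preserved at every fold.
    print(f"fold {fold}: train ends at {train_idx[-1]}, "
          f"validation spans {val_idx[0]}-{val_idx[-1]}")
```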

Figure 6

“TimeSeriesSplit” cross-validation scheme using Scikit-learn. The sample index represents time steps (monthly data) for both SSN and TSA datasets. The blue represents the training set and orange represents the validation set.

3 Deep Neural Networks

Machine learning methods such as Time Delay Neural Networks (Waibel et al., 1990), Support Vector Machines (SVM) (Cortes and Vapnik, 1995), and Random Forests (Breiman, 2001), as well as deep-learning methods such as Recurrent Neural Networks (RNN) and Long Short-Term Memory networks (LSTM) (Hochreiter and Schmidhuber, 1997), have been used extensively for time series forecasting problems in domains such as finance, meteorology, statistics, and signal processing. RNNs maintain a hidden state that passes pertinent information learned at the current time step on to the next time step, allowing the model to learn the temporal dependencies in the data. LSTMs are a type of RNN whose memory and forget gates make it possible to learn long-term dependencies. Studies have indicated that deep-learning models outperform stochastic models for time series forecasting problems (Siami-Namini, Tavakoli, and Siami Namin, 2018; Pala and Atici, 2019). Another way to process sequential data such as time series is with one-dimensional (1D) convolutions. In a 1D convolution, each output time step is computed from a small patch, or sub-sequence, of the input sequence; the patch is transformed by the layer's weights and biases to produce a single output time step. Because the same transformation is applied to every patch, a pattern learned at one position can be recognized at any other position in the sequence (a minimal causal convolution example is sketched below). 1D ConvNets can compete with RNNs, delivering similar or better performance with much shorter training time. More recently, WaveNet (van den Oord et al., 2016), which is based on 1D dilated convolutions, achieved state-of-the-art performance in audio generation. Since audio data are also sequential, the same techniques can be applied to time series forecasting. In this study, for faster training and for identifying longer trends in the data, we propose a deep neural network model based on a combination of WaveNet and LSTM, combining dilated 1D convolutional and recurrent methods.
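The following minimal sketch (our illustration, not code from this study) applies a causal 1D convolution to a univariate sequence in tf.keras; with "causal" padding, each output step depends only on the current and earlier inputs:

```python
import tensorflow as tf

# A batch of one univariate series with 528 monthly time steps.
x = tf.random.normal((1, 528, 1))  # (batch, time steps, channels)

# A causal 1D convolution: with kernel size 2, each output step is computed
# from the current and the previous time step only.
conv = tf.keras.layers.Conv1D(filters=32, kernel_size=2, padding="causal")
print(conv(x).shape)  # (1, 528, 32): the sequence length is preserved
```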

3.1 WaveNet Architecture

WaveNet is an autoregressive generative model that operates directly on the time series data. It learns to model the conditional probability distribution of the data using a stack of 1D dilated causal convolution layers. For any given time series \(x = \{x_{1}, \ldots , x_{T}\}\), its joint probability is factorized as a product of its conditional probabilities.

Equation 1 shows this factorization: each time step \(x_{t}\) is conditioned on the samples of all preceding time steps. Dilated causal convolutions form the main ingredient of WaveNet models. They ensure that the model does not violate the temporal order in which the data is modelled. In a dilated causal convolution layer, the filters are applied over the input sequence while skipping inputs at a constant dilation rate. The dilation rate is increased exponentially from layer to layer, which gives the model an exponentially growing receptive field in each successive layer.

$$ p(x) = \prod _{t=1}^{T} p(x_{t} \mid x_{1}, \ldots , x_{t-1}) \,. $$
(1)

Figure 7 shows a stack of simple 1D convolution layers alongside a stack of dilated causal convolution layers as used in the WaveNet architecture. For the dilated stack, with dilation rates of 1, 2, and 4 and a filter size of 2, the output has a receptive field of 8 input units, compared with only 4 units when no dilations are used. By stacking a few dilated convolutional layers we therefore increase the receptive field exponentially, allowing models using dilated convolutions to learn much longer time dependencies than traditional recurrent models (Bai, Zico Kolter, and Koltun, 2018). In addition to having a larger receptive field, WaveNet uses gated activations and residual and skip connections in its architecture. Gated activation units allow finer control than rectified activations, whereas residual and skip connections enable faster convergence. In our proposed model, shown in Figure 8, we add a single LSTM layer at the end of the WaveNet model, which ensures that the model learns both very long-term and short-term time series dependencies from the input data; a sketch of one residual block and the stacked model follows.
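The sketch below is our reading of the WaveNet + LSTM idea in tf.keras, not the authors' released code; the function name, the block width (32 filters), and the skip aggregation are illustrative assumptions, and the dropout and batch normalization used in the study are omitted for brevity:

```python
import tensorflow as tf
from tensorflow.keras import layers

def wavenet_block(x, filters, dilation_rate):
    # Gated activation unit: a tanh "filter" branch modulated by a
    # sigmoid "gate" branch, as in van den Oord et al. (2016).
    f = layers.Conv1D(filters, 2, padding="causal",
                      dilation_rate=dilation_rate, activation="tanh")(x)
    g = layers.Conv1D(filters, 2, padding="causal",
                      dilation_rate=dilation_rate, activation="sigmoid")(x)
    z = layers.Multiply()([f, g])
    skip = layers.Conv1D(filters, 1)(z)      # 1x1 convolution, skip path
    return layers.Add()([x, skip]), skip     # residual output, skip output

inputs = layers.Input(shape=(528, 1))        # one 4-cycle input window
x = layers.Conv1D(32, 1)(inputs)             # project to the block width
skips = []
for rate in [1, 2, 4, 8, 16, 32, 64, 128, 256, 512]:
    x, s = wavenet_block(x, 32, rate)
    skips.append(s)
x = layers.Activation("relu")(layers.Add()(skips))
outputs = layers.LSTM(132)(x)                # single LSTM layer; its 132 units
                                             # map to the 132-month horizon
model = tf.keras.Model(inputs, outputs)
```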

Figure 7

a) A stack of 1D convolution layers. b) A stack of 1D dilated causal convolution layers using dilation rates of 1, 2, and 4.

Figure 8

Proposed WaveNet + LSTM scheme showing the dilation rates, input and output sequences.

4 Experimental Setup and Results

In this study, we compare the performance of four deep neural network models in order to find the one with the most accurate forecasts. The first model is a single LSTM layer with 132 units. The second consists of two stacked LSTM layers with 132 units each. The third consists of a 1D convolution layer without time dilations stacked with an LSTM layer of 132 units. The fourth is the WaveNet architecture with dilation rates of 1, 2, 4, 8, 16, 32, 64, 128, 256, and 512, stacked with a single LSTM layer of 132 units. All four models are compared with a naive average forecast in which all cycles are rescaled to a common 132-month length and averaged over each time step; this provides a baseline against which to measure the accuracy of the forecasts (the baseline and metric are sketched below). To ensure unbiased results, we use the same hyper-parameters, such as a dropout of 30% and batch normalization, in each model. All experiments were performed on an NVIDIA DIGITS workstation dedicated to deep learning, running Ubuntu 16.04 and equipped with four NVIDIA TITAN X GPUs with 12 GB of memory and 7 TFlops of single-precision performance per GPU board. All models were created with TensorFlow 2.0 and Keras in the Python programming language (Abadi et al., 2015; Géron, 2019). The root mean squared error (RMSE) was chosen as the performance metric for all experiments. The RMSE is the standard deviation of the residuals, or prediction errors, and measures how far the predictions lie from the actual data.
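The baseline and the metric can be sketched as follows (our illustration; rescaling cycles of unequal length to a common 132-month grid by linear interpolation is an assumption about the procedure, and the function names are ours):

```python
import numpy as np

def naive_average_forecast(cycles, horizon=132):
    """Rescale each historical cycle to `horizon` months by linear
    interpolation, then average them time step by time step."""
    grid = np.linspace(0.0, 1.0, horizon)
    rescaled = [np.interp(grid, np.linspace(0.0, 1.0, len(c)), c)
                for c in cycles]
    return np.mean(rescaled, axis=0)

def rmse(y_true, y_pred):
    """Root mean squared error between a forecast and the observations."""
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    return np.sqrt(np.mean((y_true - y_pred) ** 2))
```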

The datasets for the sunspot number and total sunspot area were processed as described in Section 2. The data were also normalized to values between 0 and 1 to speed up the convergence of the gradients. To ensure there is enough data in the validation set, we implement a 5-fold cross-validation scheme for both datasets using TimeSeriesSplit from the Scikit-learn library. Table 1 describes the cross-validation scheme and the split used for training and validation data. The Adam optimization algorithm was used to update the weights while training on sequences with a batch size of 32, a learning rate of \(5\times 10^{-4}\), and a decay rate of \(10^{-6}\) for 100 iterations; this allowed for faster convergence of the loss function. To avoid overfitting, we use the "early-stopping" scheme from the tensorflow.keras "callbacks" module, which stops training when the model starts to overfit; its "patience" value (set to 5 iterations) controls the number of times the validation loss is allowed to exceed its previous best value. We also use this scheme to save the model and its weights at the checkpoint with the best performance (a configuration sketch is given below). Figure 9 shows the training and validation loss curves for the sunspot number and total sunspot area datasets. The curves show that the model stopped training before reaching 100 iterations for both datasets.
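A training-configuration sketch consistent with the text (our illustration: `model` is the WaveNet + LSTM sketch from Section 3.1, `X_train`/`y_train`/`X_val`/`y_val` are window/horizon pairs from one cross-validation fold reshaped to `(n, 528, 1)`, and the checkpoint file name is hypothetical):

```python
import tensorflow as tf

# Adam with the stated learning and decay rates (TF 2.0-era signature).
model.compile(
    optimizer=tf.keras.optimizers.Adam(learning_rate=5e-4, decay=1e-6),
    loss="mse",
    metrics=[tf.keras.metrics.RootMeanSquaredError()])

callbacks = [
    # Stop once the validation loss has not improved for 5 iterations.
    tf.keras.callbacks.EarlyStopping(monitor="val_loss", patience=5,
                                     restore_best_weights=True),
    # Save the weights at the best-performing checkpoint.
    tf.keras.callbacks.ModelCheckpoint("best_model.h5", monitor="val_loss",
                                       save_best_only=True),
]

history = model.fit(X_train, y_train,
                    validation_data=(X_val, y_val),
                    batch_size=32, epochs=100, callbacks=callbacks)
```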

Figure 9

Training and validation losses using the WaveNet + LSTM model for the sunspot number and sunspot area datasets, respectively.

Table 1 Cross-validation scheme using TimeSeriesSplit showing the different cross-validation (CV) folds with the number of training and validation pairs.

Table 2 summarizes the performance of all models on the sunspot number and total sunspot area datasets. The WaveNet + LSTM model performed the best on both. Figures 10, 11, and 12 show the true history and forecasts of the sunspot number and total sunspot area datasets for the Stacked LSTM and WaveNet + LSTM models. The figures show the training data as "History", the actual forecast horizon as "True Future", and the predicted forecast as "Forecast". From Figures 10 and 11, it can be seen that the forecasts of the WaveNet + LSTM model are far better than those of the Stacked LSTM model on the sunspot number dataset, even though the difference in their RMSE values is comparatively small. This reflects the ability of WaveNets to learn very long-term as well as short-term dependencies from the input data. The RMSE values across all models are higher for the total sunspot area dataset than for the sunspot number dataset. This is expected because of the limited amount of data available for the total sunspot area. To verify this, we extracted data from the sunspot number dataset to match the time period covered by the total sunspot area dataset and trained our WaveNet + LSTM model on it, which resulted in an RMSE value of 8.51, closer to the RMSE value for the total sunspot area dataset.

Figure 10

Forecast graphs for the sunspot number dataset for the Stacked LSTM model. "History" is represented by the solid blue line, "True Future" by the blue dots, and "Forecast" by the red dots.

Figure 11

Forecast graphs for the sunspot number dataset for the WaveNet + LSTM model. "History" is represented by the solid blue line, "True Future" by the blue dots, and "Forecast" by the red dots.

Figure 12

Forecast graphs for the total sunspot area dataset for the WaveNet + LSTM model. "History" is represented by the solid blue line, "True Future" by the blue dots, and "Forecast" by the red dots.

Table 2 Performance summary of all models on the sunspot number and total sunspot area datasets.

As discussed in Section 2, we chose the model with the best validation performance and trained it on the whole dataset to produce our 11-year forecast for the upcoming Solar Cycle 25. Figures 13 and 14 show the actual vs. forecast predictions for the sunspot number and total sunspot area on the validation data with the WaveNet + LSTM model. The forecasts for Solar Cycle 23 and Solar Cycle 24 show that the model is able to predict the trends in the data and to forecast the strength of those cycles accurately for both datasets, although the total sunspot area forecast shows a time lag of about 1.5 years. Figures 15 and 16 show the forecast for the upcoming Solar Cycle 25 made after training the WaveNet + LSTM model on the entire dataset. Both forecasts show similar strength and suggest that Solar Cycle 25 will be slightly weaker than the previous Solar Cycle 24. The forecast for the sunspot number dataset suggests a peak of 106 \(\pm 19.75\) occurring in March 2025, whereas the forecast for the total sunspot area suggests a peak of 1771 \(\pm 381.17\) in May 2022. The discrepancy in the peak dates is expected, as seen in the validation forecasts; as stated earlier, however, both forecasts suggest a similar strength for the cycle. Using the mean absolute error, we estimate the uncertainty in the predictions to be 8% for the sunspot number dataset and 12% for the total sunspot area dataset.

Figure 13

Actual vs forecast graph for the sunspot validation data using WaveNet + LSTM model showing the predictions for the last two solar cycles and Solar Cycle 25.

Figure 14

Actual vs forecast graph for the total sunspot area validation data using WaveNet + LSTM model showing the predictions for the last two solar cycles and Solar Cycle 25.

Figure 15

Solar Cycle 25 forecast for the sunspot number data.

Figure 16

Solar Cycle 25 forecast for the total sunspot area data.

Forecasts for Solar Cycle 25 have ranged from a cycle stronger than Solar Cycle 24 to the weakest cycle ever recorded, with predicted sunspot numbers ranging from 57 to 167. Covas, Peixinho, and Fernandes (2019) used spatial-temporal data with neural networks to predict that the upcoming Solar Cycle 25 would be the weakest cycle ever recorded, with a sunspot number of 57 ± 17 and a total sunspot area of ≈ 700, peaking around 2022-2023. Upton and Hathaway (2018) used a flux transport model and predicted that Solar Cycle 25 would be similar in size to Solar Cycle 24, with a 15% uncertainty. Labonville, Charbonneau, and Lemerle (2019) used a dynamo-based model to forecast the upcoming solar cycle and predicted a maximum sunspot number of 89 +29/−15. An international panel co-chaired by NOAA/NASA released a preliminary forecast on April 5, 2019, with the consensus that Solar Cycle 25 would be similar in size to Solar Cycle 24; the minimum and maximum sunspot number forecasts were 95 and 130, respectively. Pala and Atici (2019) used two stacked LSTM layers and predicted that the upcoming Solar Cycle 25 would have a maximum sunspot number of 167.3, with the peak being reached in 2023.2 \(\pm 1.1\). The low RMSE values of our model and its forecasts of Solar Cycle 23 and Solar Cycle 24 give us confidence that it is the best deep neural network based approach using the temporal indices of sunspot number and total sunspot area for predicting the strength of the upcoming solar cycle.

5 Conclusion

In this study, we presented four deep neural network models with the goal of predicting the strength of the upcoming Solar Cycle 25 using monthly averaged data for the sunspot number and total sunspot area. We demonstrated that the proposed WaveNet + LSTM model performed the best among the deep neural network models considered. It is capable of modeling both long-term and short-term dependencies and of identifying trends in time series data, as seen in the forecasts on the validation data and for Solar Cycle 23 and Solar Cycle 24. Our forecasts indicate that Solar Cycle 25 will be slightly weaker than Solar Cycle 24, with a maximum sunspot number of 106 ± 19.75 (8% uncertainty) and a total sunspot area of 1771 ± 381.17 (12% uncertainty), the cycle reaching its peak in May 2025 (± one year). Our forecast falls within the uncertainty of the Upton and Hathaway (2018) and NOAA/NASA forecasts, and is consistent with the consensus forecast of a weak cycle ahead.

Our proposed method can be applied to any univariate time series that exhibits properties such as trend and seasonality. One limitation is that we cannot expect to forecast too far into the future: when a forecast window is fed back into the model as input, the errors accumulate and grow over time. This work can also be extended to include other related parameters that are indicative of the solar cycle and to forecast them as a multivariate time series. We hope that this study will encourage the use of deep neural networks for forecasting tasks in heliophysics.