Abstract
Earthquakes and tsunamis can trigger acoustic and gravity waves that could reach the ionosphere, generating electron density disturbances, known as traveling ionospheric disturbances. These perturbations can be investigated as variations in ionospheric total electron content (TEC) estimated through global navigation satellite systems (GNSS) receivers. The VARION (Variometric Approach for Real-Time Ionosphere Observation) algorithm is a well-known real-time tool for estimating TEC variations. In this context, the high amount of data allows the exploration of a VARION-based machine learning classification approach for TEC perturbation detection. For this purpose, we analyzed the 2015 Illapel earthquake and tsunami for its strength and high impact. We use the VARION-generated observations (i.e., dsTEC/dt) provided by 115 GNSS stations as input features for the machine learning algorithms, namely, Random Forest and XGBoost. We manually label time frames of TEC perturbations as the target variable. We consider two elevation cut-off time series, namely, 15° and 25°, to which we apply the classifier. XGBoost with a 15° elevation cut-off dsTEC/dt time series reaches the best performance, achieving an F1 score of 0.77, recall of 0.74, and precision of 0.80 on the test data. Furthermore, XGBoost presents an average difference between the labeled and predicted middle epochs of TEC perturbation of 75 s. Finally, the model could be seamlessly integrated into a real-time early warning system, due to its low computational time. This work demonstrates high-probability TEC signature detection by machine learning for earthquakes and tsunamis, that can be used to enhance tsunami early warning systems.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
Introduction
Natural hazards such as volcanic eruptions, earthquakes, and tsunamis can perturb the ionosphere (Astafyeva 2019; Huang et al. 2019; Calais and Minster 1995; Peltier and Hines 1976; Hargreaves 1992; Occhipinti 2015; Rolland et al. 2010; Meng et al. 2019; Artru et al. 2005; Chou et al. 2017; Zettergren et al. 2017). In detail, these events can generate acoustic and gravity waves (AGWs), that are amplified as atmosphere density decreases and can reach the ionosphere. These waves interact with the ionospheric plasma and cause ionospheric electron density disturbances, known as traveling ionospheric disturbances (TIDs; Galvan et al. 2012; Astafyeva 2019). Here, we mention acoustic gravity waves generated near the epicenter (AGWepi) and internal gravity waves (IGWtsu; Occhipinti 2015). AGWepi, related to the uplift at the source, reaches the ionosphere in around 8 min, whereas IGWtsu, linked to tsunami offshore propagation, takes about 45–60 min (Lognonné et al. 2006; Occhipinti et al. 2011).
These perturbations are detected through variations in ionospheric total electron content (TEC; Coster et al. 2013; Hofmann-Wellenhof et al. 2008), retrieved by global navigation satellite system (GNSS) measurements, and measured in TEC units (1 TECU = 1016 electrons/m2). Specifically, we refer to the slant TEC (sTEC) measured by dual-frequency GNSS receivers and encountered by the GNSS signal during its path through the ionosphere from the satellite to the receiver.
The VARION algorithm (Variometric Approach for Real-Time Ionosphere Observation) is an established tool to estimate sTEC variations from GNSS observations (Savastano et al. 2017; Ravanelli et al. 2021). It is based on a single time difference of geometry-free combinations, being suitable for real-time applications (Ravanelli et al. 2020).
TIDs detection considering sTEC time series has been conducted using traditional techniques reliant on human expertise, including the analysis of the ionospheric power index (Manta et al. 2020), the wavelet analysis threshold-based (Torrence and Compo 1998), and the 2D principal component analysis (Lin 2022). Despite the effectiveness of these traditional approaches, there is increasing recognition that artificial intelligence (AI), particularly machine learning, holds potential for advancing TIDs detection. Machine learning uses data-driven algorithms for autonomous decision-making, offering computational efficiency and data handling capacity (Kuglitsch et al. 2022; Crocetti et al. 2021). During the past years, the application of machine learning algorithms has become widespread in ionospheric research for different purposes: automatic detection methods to study TID signatures (Brissaud and Astafyeva 2022; Constantinou et al. 2023a, b); forecasting TEC data (Cesaroni et al. 2020; Huang and Yuan 2014; Natras et al. 2022; Liu et al. 2020); nowcasting TEC data (Zhukov et al. 2018; Camporeale 2019; Łoś et al. 2020); improvement of regional and global TEC models (Zhukov et al.2020); ionospheric scintillation parameter predictions (Atabati et al. 2021; McGranaghan et al. 2018; Linty et al. 2019); and the general analysis of TEC variations induced by earthquakes and tsunamis (Zhukov et al. 2020).
Within the framework of automatic detection methods to study TID signatures, the previous studies by Constantinou et al. (2023a, b) considered computer vision and convolutional neural networks, while Brissaud and Astafyeva (2022) applied the Random Forest classifier, similar to our study, focusing on classifying ionospheric waveforms into TIDs and noise, picking TID arrival times, and associating arrivals across a satellite network in near real-time. In this context, our study aims to use machine learning algorithms for classifying ionospheric TEC variations caused by earthquakes and tsunamis using the large amount of GNSS time-series data (provided by every satellite-station link available). It aligns with the existing body of research while contributing novel insights into the classification of TIDs in large-scale GNSS data sets. Automatic TIDs detection, as planned by the NASA-JPL GUARDIAN system (Martire et al. 2023), underscores the importance of our research within the scientific community.
This paper investigates the 2015 Illapel earthquake and tsunami, a valid case study with a well-documented high magnitude and tsunami signature (Reddy et al. 2017; Ravanelli et al. 2021; Shrivastava et al. 2021).
The main objective is to examine if and how machine learning algorithms are suitable to find TEC time-series signatures related to earthquakes and tsunamis.
To reach our aim, we consider two machine learning classifiers, namely, Random Forest and XGBoost (Breiman 2001; Chen and Guestrin 2016), and apply them to the first temporal derivative of the sTEC (dsTEC/dt), representing VARION-core output. It depicts the rate of change of sTEC with respect to time, to which an elevation cut-off of 15° and 25° is implemented. Several experiments are carried out to determine the optimally performing model, such as data pre-processing, and feature-selection. The following Section provides an overview of the analyzed event, the study region, and the data used. The Methods Section describes the methodologies: the VARION algorithm and the machine learning techniques. Moreover, it presents the set-up of the machine learning classification, the explanations of the pre-processing features, and the assessment of the model performance. In the Results Section, the outcomes are presented, analyzed, and discussed. Here, we also validate the model by applying it to time series with no seismic-induced variations in sTEC. Specifically, we consider data with alterations in dsTEC/dt values not related to the earthquake, testing the algorithm's capability to exclusively detect variations directly linked to the earthquake, aligning with our primary objective. Finally, the last Section summarizes the outcomes derived from this study, identifies the most effective configuration for an efficient model, and offers a prospective outlook on potential enhancements yet to be explored.
Study context and data set overview
On September 16, 2015, at 19:54:33 Chile Standard Time (22:54:33 UTC), a devastating earthquake with a moment magnitude of Mw 8.3 occurred 46-km offshore of the Coquimbo region of Chile. The primary seismic event lasted between 3 and 5 min, followed by multiple aftershocks (Ravanelli et al. 2021). Both the NOAA Pacific Tsunami Warning Centre (https://www.tsunami.gov/; https://earthquake.usgs.gov/earthquakes/eventpage/us20003k7a/executive) and the Servicio Hidrográfico y Oceanográfico de la Armada (Chile’s National Tsunami Warning System) (https://www.armada.cl/noticias-navales/shoa-difunde-oportuna-alerta-de-tsunami; https://www.snamchile.cl/) issued tsunami threat messages and alarms, respectively. Within 10 min, tsunami waves measuring 4.5 m struck the Chilean shoreline between Chañaral (~ 26°S) and Constitución (~ 35°S), causing substantial impact (https://www.tsunami.gov/events/PAAQ/2015/09/16/nuskyv/22/WEAK51/WEAK51.txt). Pichidangui (~ 32°S) was reached by the first wave 13.70 min after the mainshock. The coastal area between Coquimbo (~ 30°S) and Valparaíso (~ 33°S) recorded the highest wave height (over 1.5 m), leading to significant flooding in Coquimbo (Shrivastava et al. 2021).
Data set
Data from 115 GNSS stations mostly located in Chile but also spread all over the South American continent (Fig. 1) are processed with VARION (Ravanelli et al. 2021). The GNSS receivers collect data at 10-, 15-, and 30-s rates. Following Reddy et al. 2017 research, we consider G12, G24, and G25 satellites, resulting in a data set composed of 345 observations.
In detail, our data set consists of 345 VARION-obtained sTEC variations over time, i.e., dsTEC/dt [TECU/s], which represent the primary and the real-time output of VARION and do not require any additional post-processing (Ravanelli et al. 2021). Specifically, the dsTEC/dt time series are representative as the earthquake has high magnitude and causes evident signatures in the ionosphere. However, while our data set provides insights into TID detection, it has limitations. Indeed, seismic and non-seismic external factors such as space weather, noise, satellite angles, geomagnetic field, and observation geometry influence the induced TEC observation (Meng et al. 2022; Bagiya et al. 2019). We analyze both GPS days, i.e., DOY 259 and 260 (the earthquake day and the day after), considering the time series of the whole DOY 259 and part of the time series of DOY 260, namely, until satellites descend above the elevation cut-off (Fig. 2). Moreover, the data set considers only 31 selected links satellite-stations following four criteria:
-
Data availability on both days (DOY 259 and 260)
-
Considerable (that can be visually identified) variations in the dsTEC/dt time series (related to the earthquake)
-
Only one gap during the day (due to satellite visibility)
-
Data availability from the first observation of the day (which is on the first 15 s of the day), for both DOY 259 and 260
The frequency of the observations related to the selected 31 samples is 15 s.
Methodology
The VARION algorithm is applied to the observations of GPS DOY 259 and 260. VARION estimates real-time sTEC variations relying on stand-alone GNSS receivers and standard GNSS broadcast products. It is based on single time differences of a geometry-free combination of GNSS carrier-phase measurements (Savastano et al. 2017; Ravanelli et al. 2020, 2021). A crucial tool in this study, VARION provides the time series for both train the machine learning algorithm and validate the results.
In this analysis, we use the elevation cut-off angles of 15° and 25° to mitigate the impact of observational noise, a prevalent challenge in ionospheric studies. These cut-off angles represent a strategic filter, effectively excluding observations derived by satellites with lower elevation angles where data tend to be noisier due to increased atmospheric interference. This aligns with common practices in the literature, where such elevation cut-offs are used to enhance the signal-to-noise ratio and improve the overall quality of the data set, ensuring a more robust and reliable foundation for our analysis (Occhipinti et al. 2011; Astafyeva 2019; Ravanelli et al. 2023).
To achieve our final aim of detecting the TIDs caused by earthquakes and tsunamis, we formulate a binary classification problem using supervised machine learning algorithms. Data are classified into two categories: 0 (if there are no sTEC variations related to the earthquake and tsunami in a 30-min window that runs through the time series) and 1 (if there are sTEC variations). The TIDs detection is based on established criteria derived from prior studies available in the literature. These include factors such as the arrival time of the perturbation in the ionosphere, its shape, absence of geomagnetic disturbances, and frequency content, as identified and validated in the previous research (Reddy et al. 2017; Ravanelli et al. 2021; Shrivastava et al. 2021; Sanchez et al. 2023).
We consider Random Forest (RF) and XGBoost (XGB) classifiers among the several available for classification tasks (Zhang et al. 2017), for their well-known good performances (Brissaud and Astafyeva 2022). Indeed, Crocetti et al. 2021 show that RF and XGB algorithms perform better than others (i.e., Linear Support Vector Classification, Perceptron, K-Nearest Neighbor, and more). Specifically, RF excels in handling high-dimensional data and mitigating overfitting through bootstrap aggregation, while XGBoost improves weak learners iteratively, enhancing accuracy and computational efficiency. Both algorithms, as ensemble learning methods, capture intricate relationships within the data and enhance predictive accuracy. They are suitable for our purposes, especially since we do not require deep learning methods (i.e., Convolutional Neural Networks—CNNs; Albawi et al. 2017) given our moderate data set size and the division into short chunks of the time series.
In detail, RF consists of decision trees, nonlinear models with multiple linear boundaries. Decision tree nodes use data-related questions linked to specific feature values, recursively dividing layers into child nodes. The iterative process creates a tree with a predefined depth. The algorithm selects bootstrapped samples and a random feature subset for model evaluation. The final prediction aggregates results from all decision trees (Breiman 2001). In contrast, XGBoost builds trees sequentially, each minimizing the error of the previous tree. It starts with a constant, iteratively trains trees on residuals, and combines them with the previous model to reduce error. Finally, strong learner results from combining all weak learners (Wang and Liu 2020; Chen and Guestrin 2016).
The classification is conducted for the selected 31 links (described in the Data set Section), split into 80% (25 links) and 20% (6 links) for training and testing. The dsTEC/dt time series of each link are divided into individual chunks, as described in the following Section. The model is trained based on the dsTEC/dt time series of the 25 training links, while tested considering the ones of the six testing links. Finally, the validation of the model and the evaluation of its performance are analyzed by considering 18 unseen dsTEC/dt time series, also related to the Illapel event. However, these show variations in dsTEC/dt not linked to the seismic event. This additional analysis allows us to assess the model's ability to distinguish variations in dsTEC/dt specifically linked to the earthquake and tsunami from those unrelated to the seismic event.
Feature matrix
We use the dsTEC/dt time series of 31 links satellite-stations as features for the machine learning algorithms. In detail, we exclude the gap due to the satellite visibility of each time series and consider the time frames (1) from the first 15 s of DOY 259 until the start of the gap of DOY 259 and (2) from the end of the gap of DOY 259 until the start of the gap of the DOY 260 of each time series.
The time series are split into k chunks to create the feature matrices. Each chunk is m = 30 min long and shifted by n = 1 min from the next one: The first chunk contains dsTEC/dt values from the first 15 s (the first observation for both the days) until m, while the second one from 75 s (15 s + n) until 75 s + m, and so on. The feature matrices have the dimension [k, 120] as 120 is the number of 15 s in m. Finally, we obtain 31 feature matrices (one per link), where we combine 25 to create the feature matrix used for training (80%), and 6 to create the one used for testing (20%). The structure of the feature matrix is shown in Fig. 3.
Target vector
The target vector denotes whether sTEC variations due to the tsunami occurred within their corresponding chunks, classifying them as “1” if the perturbation is present or “0” if not. As previous mentioned, this attribution is based on well-defined conditions validated in the literature, which demonstrate the link between sTEC perturbations and the seismic event. To perform this binary classification, the time frame of sTEC perturbation related to the event is manually labeled from the time series, i.e., the initial and finishing times of the sTEC perturbation for every link satellite-station. Therefore, the target vector is labeled as 1 whenever the 30-min chunks contain any point in time of the manually labeled time frame of the sTEC perturbation.
In this way, we create a target vector with the dimension [k, 1] for every link (Fig. 3). As for the feature matrix, we use the same 80% of the links for training and 20% for testing.
Data pre-processing
The pre-processing of the data set consists of cleaning and preparation of the data, to improve the quality of the data set and ensure better performance of the model. In this study, we consider standardization and normalization, two methods used to scale the data set. In detail, standardization transforms the data to a specific range (e.g., 0 and 1 or − 1 and + 1); while normalization changes the data so that they resemble a normal distribution (Ali and Faraj 2014; Vieira et al. 2020).
Finally, we also include additional features in the feature matrix, namely, the value range, defined as the maximum value minus the minimum one before the normalization, and the variance of each chunk, both individually and together.
Model evaluation
To evaluate the performance of the model, we consider the confusion matrix, the difference between the labeled and predicted middle epochs of the perturbations time frames, the receiver operating characteristic (ROC) curve and the area under the curve (ROC-AUC).
In detail, the confusion matrix shows the number of false negatives (FNs), false positives (FPs), true negatives (TNs), and true positives (TPs) generated from the machine learning classification (Crocetti et al. 2021). In our case, TPs and TNs indicate correctly classified chunks with (or without, for TNs) earthquake- and tsunami-induced sTEC variations. Conversely, FNs identify chunks with undetected sTEC variations, while FPs designate chunks without sTEC variations, wrongly classified as containing them. Due to our aim of not overlooking any tsunamis, FNs are considered the most crucial errors in the analysis. However, to evaluate the best model, we consider the confusion matrix in terms of the well-known performance measures of precision, recall, and F1 score (Ting 2017).
Moreover, we compare the labeled perturbation time frame with the one generated by the model, represented by the time frame containing all the TPs (assumed to be at the middle epoch of the 30-min time period of the chunk). To achieve this goal, we calculate the numerical difference between the middle epoch of both time frames and assess how many 15 s of difference there are between them (according to the time-series resolution).
Results and discussion
To reach the final aim of finding signatures in TEC time series related to earthquakes and tsunamis using machine learning algorithms, we performed several experiments considering two classifiers, i.e., Random Forest and XGBoost, and two elevation cut-off angles, i.e., 15° and 25° (see the following Section). We conducted hyperparameter tuning through grid search for each of the four combinations classifier elevation cut-off. We also assessed the influence of additional features (see the Section related to the impact of added features) and pre-processing techniques (refer to the Section concerning the pre-processing effects) on the model's performance. The model with the best performance in terms of F1 score was then selected for each combination, resulting in four models. Finally, our final best model was selected in terms of F1 score, difference between the middle epochs of the labeled and predicted perturbation time frames, ROC curve and ROC-AUC.
The best result is presented in the first part of this section, while the following subsections show how we came to this conclusion by comparing the best model with the others.
We trained our model on a machine with a 2.20-GHz i7-Intel Core processor, 32 GB of memory, and an Intel(R) UHD Graphics 630 graphics card. So, it can even be trained on a laptop without a proper GPU.
From this study, the best model is the XGBoost classifier that uses 15° elevation cut-off dsTEC/dt time series and includes the value range of each chunk as an additional feature. A hyperparameter tuning was performed to optimize the overall model performance. In the case of XGBoost, the number of boosting rounds (or the number of trees to be built in the ensemble), the maximum depth of trees, the number of columns to be randomly sampled for each tree, and the learning rate (the step size at each iteration while the model optimizes toward its objective) were analyzed. To determine the most suitable hyperparameters, we used a grid search, which systematically searches through a grid of a manually predefined set of hyperparameter combinations. In this study, the performance of all hyperparameter value combinations is evaluated based on a threefold cross-validation. This involves partitioning the data set into three subsets, with two-thirds used for direct model training and one-third reserved for validation. This process was repeated three times, ensuring each subset served as a validation set once. The average performance across these three runs for each hyperparameter combination was then compared to identify the set of hyperparameter values that reach the best performance. Table 1 shows the tested combinations and the best and default hyperparameter values for XGB-15°, our best model.
Regarding the testing samples, the best-performing model correctly classifies 183 of 247 (74.09%) samples of sTEC variations related to the earthquake and the tsunami (TP). Furthermore, 2975 of 3021 (98.49%) of the testing samples are correctly classified, containing no earthquake-induced sTEC variations (TN). Thus, 64 of 247 samples (25.91%) are wrongly classified as containing no seismic-induced sTEC variations (FN), while 46 of 3021 (1.51%) are the number of wrongly classified chunks as containing sTEC variations related to the event (FP), as shown in Table 2. The model achieves an F1 score of 0.77, recall of 0.74, and precision of 0.8; while the accuracy considering the training and the testing data is 0.98 and 0.97, respectively (Table 3). Figure 4 depicts these results for two links and the difference between the labeled and predicted middle epochs of the perturbation time frames.
The average numerical difference between the epoch of the TID is 70.8 s, considering the 25 links used to train the model, and 75 s for the 6 links used for testing (Table 4). As the time series are 15-s step discontinuous, achieving a 5-step average differences between the two epochs both in training and in testing is relevant. This highlights the algorithm's precise temporal detection of the AGWepi-related perturbations, underlining its critical role in timely TIDs identification for effective early warning systems. Furthermore, having similar results for training and testing data shows the model’s generalization ability, avoiding overfitting problems. In this context, the model's capability to generalize is ensured by incorporating an additional feature (i.e., value range), which prevents overfitting and captures essential information. The generalization is also achieved by the threefold cross-validation. Moreover, the sensitivity to noisy observations (i.e., selection of elevation cut-off angles) improves the model's focus on relevant information.
Figure 5 illustrates the ROC curves and AUC values for both algorithms and cut-off angles. RF reaches better ROC curves and higher AUC values; however, we opted for XGB-15° considering its greater performance in the F1 score and in the difference between the labeled and predicted middle epochs. In particular, a good performance in the last metric is essential for good results in real-time applications, aligning with our aim scope.
Overall, the algorithm demonstrates good computational efficiency, detecting in about 2–3 min, with hyperparameters tuning as the most critical step. An effective computational performance is crucial for real-time operational requirements, essential for effective early warning systems. However, our analysis uses data sampled at larger intervals than the 1-s real-time data rate, which can increase the computational time to 30–40 min.
Finally, the algorithm holds potentials as a highly viable tool, considering its ability to operate in real-time using only sTEC time series. However, several crucial aspects must be addressed for practical implications. In fact, successful real-world implementation requires access to real-time data from networks in high-risk areas prone to tsunamis. Furthermore, incorporating buffer time into the analysis process allows for a thorough examination of the data and its quality in real-time (i.e., data integrity, and bias and outlier identification). Establishing the necessary infrastructure is also crucial. This includes dedicated servers that can perform complex computations, efficiently run AI algorithms, and transfer data to cloud platforms. Finally, the use of servers and cloud storage is essential for storing data collected in real-time, which can be used for training new models and conducting retrospective analyses.
Comparison of different elevation cut-off angles
We tested two different elevation cut-off angles, 15° and 25°, as input features for our model. The results for the two classifiers, and the elevation cut-off angles, are shown in Table 3, where the first column shows the results of our best model.
For both classifiers, the performance related to the different cut-off angles is similar in terms of accuracy and precision. However, the recall is a bit higher when using an elevation cut-off of 25° (Table 3), whereas the F1 score is higher for an elevation cut-off of 25° for RF and similar for XGBoost. Conversely, when comparing the results of the two classifiers, we note that while the precision is higher when using RF, the F1 score and recall are significantly higher for XGBoost (Table 3).
Furthermore, Table 4 shows that for both classifiers, the differences between the labeled and predicted middle epochs for the links used for training and testing are smaller when using an elevation cut-off of 15° instead of 25°. Even though the average difference between the middle epochs for the training links is higher for XGBoost than for RF, the one for the testing links is smaller (Table 4). Moreover, RF presents a high dissimilarity between the average difference of the middle epochs considering the training and testing links, so the model proves not to be suitable to generalize well over unseen data sets.
Thus, we conclude that the XGBoost algorithm using dsTEC/dt time series with an elevation cut-off of 15° is the best model since it has high accuracy, precision, recall, and F1 score and performs best when investigating the average differences between labeled and predicted middle epochs.
Impact of additional features
We evaluate the impact of adding additional features, namely, the value range and variance of the chunk, to the feature matrix used in the machine learning algorithm. The choice of incorporating those features in our model is motivated by their capacity to capture different aspects of the dsTEC/dt time series. The value range represents the amplitude of variations within each chunk, providing insights into the overall perturbation magnitude. Meanwhile, variance quantifies the internal dynamics and temporal variability, describing the temporal characteristics of the ionospheric perturbations. For all combinations (XGB-15°, XGB-25°, RF-15°, and RF-25°), this addition improves the results in terms of precision, recall, and F1 score, as shown in Table 5. Indeed, they enable the model to better distinguish seismic-induced variations in dsTEC/dt time series, thus improving predictive accuracy through a deeper understanding of ionospheric perturbations. In particular, in XGB-15°, RF-25°, and XGB-25°, including value range or variance leads to similar results, having thus a similar impact on the model. On the other hand, in RF-15°, including the value range has a greater impact than including variance as it doubles the values of the metrics obtained with the raw data. This sensitivity of the algorithm to the range values aligns with the nature of Random Forest, which benefits from capturing a broad spectrum of information for robust decision-making, and the value range seems to capture the data variability more effectively than the variance. Finally, the study of the feature importance reveals that, for our best model (XGB-15°), the value range has the greatest impact. Thus, this shows that the value range significantly contributes to reduce errors in the tree ensemble.
Including both value range and variance in the feature matrix improves the raw case results. However, in XGB-15° and XGB-25°, the outcomes are worse than the ones related to the addition of value range or variance individually. Only in the case of RF-25°, adding both value range and variance presents better results than including them individually. This is likely because we do not have that much training data, and having more data can help the model better learn the patterns and relationship within the features. In this case, the joint inclusion of the features might introduce redundancies or interactions that the model struggles to effectively leverage.
Impact of the pre-processing
In this section, we evaluate the impact of standardization and normalization of the feature matrix chunks, done separately for each chunk, on the performances of the different models (Table 6), which commonly should be small on tree-based algorithms (García et al. 2015; Dougherty 2013). We highlight that only standardization applied to XGB-15° outperforms the raw case. This improvement can be attributed to the sensitivity of the XGBoost algorithm to the scale of input features. Standardization helps align the features to a common scale, facilitating more effective convergence during the boosting process. Furthermore, with 15° elevation cut-off, we observe more noisy data compared to the ones at 25°. This leads to more pronounced fluctuations and a high amount of data, underscoring the greater impact of standardization.
In contrast, in RF-15°, RF-25°, and XGB-25°, both standardization and normalization have a negative impact on the model, worsening the raw case results in terms of F1 score and recall. This outcome shows that RF and XGB are in this case not highly sensitive to the scale of features.
Validation of the model
To validate our results, we apply the best model to unseen dsTEC/dt time series related to the same event but generated from different satellite-station links. Specifically, we select the time series with variations in dsTEC/dt values occurring before the seismic event, ensuring that these perturbations are not caused by the earthquake. To select them, we apply some variations to the first two selection criteria used before:
-
Data availability only on DOY 259, excluding DOY 260 (as those time series are not registering at the end of DOY 259, when the earthquake occurred, so there would be a gap between DOY 259 and 260)
-
Considerable (that can be visually identified) variations in the dsTEC/dt time series, but not related to the earthquake (caused by noise or ionospheric background)
In this way, 18 dsTEC/dt time series are selected, enabling us to evaluate the performance of the model. Indeed, the expectations for this application are to predominantly capture all or most TNs, and minimize the occurrence of FPs. This is particularly crucial, as we are specifically considering time-series data with variations in sTEC that is not induced by the earthquake, ensuring a focus on our primary objective. The confusion matrix shows good results: 96.89% TNs and 3.11% FPs (in numbers: 5119 of TNs and 164 of FPs). These successful results are also confirmed by an accuracy of 0.97 (Fig. 6).
Conclusions
This study successfully used two machine learning algorithms, Random Forest and XGBoost, to detect TEC variations induced by the significant 2015 Illapel earthquake and tsunami, known for its substantial ionospheric TEC signatures. We approached the problem as a supervised binary classification task and used the VARION-generated dsTEC/dt time series as the input for our model. We followed specific criteria to select 31 links satellite-station. Then, we split the corresponding dsTEC/dt time series into individual 30-min chunks to create the feature matrix used in the machine learning algorithms.
We considered different classifiers, elevation cut-off angles, additional features, and pre-processing techniques. The best result, based on F1 score, average difference between labeled and predicted middle epochs of the perturbation time frames, and ROC curves, was with XGBoost classifier applied to 15° elevation cut-off time dsTEC/dt time series. The best performance was achieved by including the value range of the chunks as an additional feature and by tuning the hyperparameter using grid search.
Applying our final model to unseen test data, we obtained an overall performance of an F1 score of 0.77, a recall of 0.74, and a precision of 0.80. Indeed, focusing on the testing samples, 183 of 247 (74.09%) were correctly classified to contain sTEC variations related to the earthquake and the tsunami (TP). Furthermore, 2975 of 3021 (98.49%) of the testing samples were correctly classified, containing no sTEC variations caused by an earthquake (TN). Thus, 64 samples (25.91%) were wrongly classified as containing no sTEC variations caused by the event (FN), and 46 (1.51%) were the number of wrongly classified chunks as containing sTEC variations related to the earthquake and tsunami (FP).
Our model, with a smaller data set and single event, achieves competitive TP and TN rates when compared to the study by Brissaud and Astafyeva (Brissaud) with a substantial data set of 12 earthquake events, underscoring its potential utility in operational early warning systems.
This model demonstrated a 75-s average difference in predicting perturbation time frames for testing links, equivalent to an average difference of five steps considering the 15-s steps time series. This highlights the algorithm's potential for early detection of ionospheric perturbations caused by earthquakes and tsunamis, aiding in early warnings purposes.
The model’s versatility allows application in an operational real-time setting using real-time GNSS data, as it only needs the VARION-generated real-time sTEC time series (dsTEC/dt). In that case, buffer time conversion of the sTEC time series, tools for store the processed data, powerful servers, and adequate computational resources need to be considered.
The model takes a few minutes to detect TIDs, presenting a very low percentage of FPs (1.51%) and showing a high computational efficiency, crucial for effective early warning systems. However, real-time situations may increase the computational time due to higher data frequency (1 s) and bigger data set size, emphasizing the need for continuous improvement.
Finally, we validated our results applying the best model to dsTEC/dt time series related to the same event but generated from different satellite-station links. In detail, those time series had variations in dsTEC/dt values occurring before the seismic event, thus not caused by the earthquake. We obtained promising results: 96.89% TNs, 3.11% FPs, and 97% accuracy. However, we know that validation process also involves applying the model to similar events. We acknowledge the importance of external validation on entirely different data sets to assess the model's performance in different scenarios, which will be discussed in the future works.
In conclusion, we have demonstrated a powerful tool for timely and accurate identification of ionospheric perturbations linked to seismic events. Our study not only provides a valuable contribution to the field of ionospheric research but also sets the stage for the integration of advanced machine learning techniques into operational early warning systems, improving our ability to respond proactively to seismic events and associated hazards.
However, recognizing the current model's limitations, particularly referring to the low frequency and low copiousness of the data, future studies should fine-tune the algorithm for real-time, high-resolution data, widening the analysis to different kinds of TIDs and data sets. Furthermore, the model should be improved by incorporating additional features and optimizing the computational efficiency considering parallel processing, together with the analysis of its generalization capabilities. This will enhance the model's robustness and effectiveness in real-time application. Moreover, a database that stores the events (such as earthquakes, tsunamis, and volcanic eruptions) with their characteristic features (time frame, waveform, frequency content, and period) should be established. In this way, it will be possible to collect several events that can be adopted within different algorithms, reaching a way of continuous learning. Those outcomes could then be used as tools that enable early warning systems to combine data derived from the ionosphere with other information to achieve integrated systems that work synergically, enhancing the overall effectiveness of disaster prediction and mitigation strategies.
Data availability
The GNSS data set used in this study is kindly provided by the Centro Sismológico Nacional, Universidad de Chile (CSN), the Low-Latitude Ionospheric Sensor Network (LISN), the Red Argentina de Monitoreo Satelital Continuo (RAMSAC) of Instituto Geográfico Nacional de Argentina (IGN), the Rede Brasileira de Monitoramento Contínuo dos sistemas GNSS (RBMC), the NASA Global GNSS Network (GGN), the Deutsches Geodätisches Forschungsinstitut (DGFI-TUM), the Geodesy & Geodynamics Group of The Ohio State University (G2/OSU), the Universidad de Concepción, the Jicamarca Radio Observatory, the Institut national de l’information géographique et forestiére (IGN-France), the Centre National d’Études Spatiales (CNES), the European Space Operations Centre (ESA/ESOC), the University of Miami, the Instituto Geofísico de Ecuador (EPN), the Servicio Geográfico Militar (IGM), and the Universidad Nacional del Sur (UNS).
References
Albawi S, Mohammed TA, Al-Zawi S (2017) Understanding of a convolutional neural network. In: International conference on engineering and technology (ICET), Antalya, Turkey, pp 1–6. https://doi.org/10.1109/ICEngTechnol.2017.8308186
Ali PJM, Faraj RH (2014) Data normalization and standardization: a technical report. Mach Learn Tech Rep 1:1–6. https://doi.org/10.13140/RG.2.2.28948.04489
Artru J, Ducic V, Kanamori H, Lognonné P, Murakami M (2005) Ionospheric detection of gravity waves induced by tsunamis. Geophys J Int 160(3):840–848. https://doi.org/10.1111/j.1365-246X.2005.02552.x
Astafyeva E (2019) Ionospheric detection of natural hazards. Rev Geophys 57(4):1265–1288. https://doi.org/10.1029/2019RG000668
Atabati A, Alizadeh MM, Schuh H, Tsai L (2021) Ionospheric scintillation prediction on S4 and ROTI parameters using artificial neural network and genetic algorithm. Remote Sens 13:2092. https://doi.org/10.3390/rs13112092
Bagiya MS, Sunil AS, Rolland L, Nayak S, Ponraj M, Thomas D, Ramesh DS (2019) Mapping the impact of non-tectonic forcing mechanisms on GNSS measured coseismic ionospheric perturbations. Sci Rep 9:18640. https://doi.org/10.1038/s41598-019-54354-0
Breiman L (2001) Random forests. Mach Learn 45:5–32. https://doi.org/10.1023/A:1010933404324
Brissaud Q, Astafyeva E (2022) Near-real-time detection of co-seismic ionospheric disturbances using machine learning. Geophys J Int 230(3):2117–2130. https://doi.org/10.1093/gji/ggac167
Calais E, Minster JB (1995) GPS detection of ionospheric perturbations following the January 17, 1994, Northridge Earthquake. Geophys Res Lett 22(9):1045–1048. https://doi.org/10.1029/95GL00168
Camporeale E (2019) The challenge of machine learning in space weather: nowcasting and forecasting. Space Weather 17:1166–1207. https://doi.org/10.1029/2018SW002061
Cesaroni C, Spogli L, Aragon-Angel A, Fiocca M, Dear V, De Franceschi G, Romano V (2020) Neural network based model for global total electron content forecasting. J Space Weather Space Clim 10:11. https://doi.org/10.1051/swsc/2020013
Chen T, Guestrin C (2016) XGBoost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining (KDD '16). Association for Computing Machinery, pp 785–794. https://doi.org/10.1145/2939672.2939785
Chou MY, Lin CCH, Yue J, Tsai HF, Sun YY, Liu JY, Chen CH (2017) Concentric traveling ionosphere disturbances triggered by Super Typhoon Meranti. Geophys Res Lett 44:1219–1226. https://doi.org/10.1002/2016GL072205
Constantinou V, Ravanelli M, Liu H, Bortnik J (2023a) A Deep Learning Approach for Detection of Internal Gravity Waves in Earth’s Ionosphere, IGARSS 2023 - 2023 IEEE International Geoscience and Remote Sensing Symposium, Pasadena, CA, USA, pp. 1178–1181. https://doi.org/10.1109/IGARSS52108.2023.10282501
Constantinou V, Ravanelli M, Liu H, Bortnik J (2023b) Deep learning driven detection of tsunami related internal GravityWaves: a path towards open-ocean natural hazards detection. https://doi.org/10.48550/arXiv.2308.04611
Coster A, Williams J, Weatherwax A, Rideout W, Herne D (2013) Accuracy of GPS total electron content: GPS receiver bias temperature dependence. Radio Sci 48:190–196. https://doi.org/10.1002/rds.20011
Crocetti L, Schartner M, Soja B (2021) Discontinuity detection in GNSS station coordinate time series using machine learning. Remote Sens 13(19):3906. https://doi.org/10.3390/rs13193906
Dougherty G (2013) Feature extraction and selection. In: Pattern recognition and classification: an introduction. Springer, New York, pp 123–141. https://doi.org/10.1007/978-1-4614-5323-9_7
Galvan DA, Komjathy A, Hickey MP, Stephens P, Snively J, Tony Song Y, Butala MD, Mannucci AJ (2012) Ionospheric signatures of Tohoku-Oki tsunami of march 11, 2011: Model comparisons near the epicenter. Radio Sci. https://doi.org/10.1029/2012RS005023
García S, Luengo J, Herrera F (2015) Introduction. In: Data preprocessing in data mining. Intelligent system reference library, vol 72. Springer, Cham, pp 1–17. https://doi.org/10.1007/978-3-319-10247-4_1
Hargreaves JK (1992) The solar-terrestrial environment: an introduction to geospace—the science of the terrestrial upper atmosphere, ionosphere, and magnetosphere. Cambridge University Press, Cambridge. https://doi.org/10.1017/CBO9780511628924
Hofmann-Wellenhof B, Lichtenegger HI, Wasle E (2008) GNSS—global navigation satellite systems: GPS, GLONASS, Galileo, and more. Springer, Vienna. https://doi.org/10.1007/978-3-211-73017-1
Huang Z, Yuan H (2014) Ionospheric single-station TEC short term forecast using RBF neural network. Radio Sci 49(4):283–292. https://doi.org/10.1002/2013RS005247
Huang CY, Helmboldt JF, Park J, Pedersen TR, Willemann R (2019) Ionospheric detection of explosive events. Rev Geophys 57:78–105. https://doi.org/10.1029/2017RG000594
Jin S, Occhipinti G, Jin R (2015) GNSS ionospheric seismology: recent observation evidences and characteristics. Earth Sci Rev 147:54–64. https://doi.org/10.1016/j.earscirev.2015.05.003
Kuglitsch M, Pelivan I, Ceola S, Menon M, Xoplaki E (2022) Facilitating adoption of AI in natural disaster management through collaboration. Nat Commun 13:1579. https://doi.org/10.1038/s41467-022-29285-6
Lin JW (2022) Generalized two-dimensional principal component analysis and two artificial neural network models to detect traveling ionospheric disturbances. Nat Hazards 111:1245–1270. https://doi.org/10.1007/s11069-021-05093-x
Linty N, Farasin A, Favenza A, Dovis F (2019) Detection of GNSS ionospheric scintillations based on machine learning decision tree. IEEE Trans Aerosp Electron Syst 55(1):303–317. https://doi.org/10.1109/TAES.2018.2850385
Liu L, Zou S, Yao Y, Wang Z (2020) Forecasting global ionospheric TEC using deep learning approach. Space Weather 18:e2020SW002501. https://doi.org/10.1029/2020SW002501
Lognonné P, Artru J, Garcia R, Crespon F, Ducic V, Jeansou E, Occhipinti G, Helbert J, Moreaux G, Godet P-E (2006) Ground-based GPS imaging of ionospheric post-seismic signal. Planet Space Sci 54(5):528–540. https://doi.org/10.1016/j.pss.2005.10.021
Łoś M, Smolak K, Guerova G, Rohm W (2020) GNSS-based machine learning storm nowcasting. Remote Sens 12(16):2536. https://doi.org/10.3390/rs12162536
Manta F, Occhipinti G, Feng L, Hill EM (2020) Rapid identification of tsunamigenic earthquakes using GNSS ionospheric sounding. Sci Rep 10:11054. https://doi.org/10.1038/s41598-020-68097-w
Martire L, Krishnamoorthy S, Vergados P, Romans LJ, Szilágyi B, Meng X, Anderson JL, Komjáthy A, Bar-Sever YE (2023) The GUARDIAN system-a GNSS upper atmospheric real-time disaster information and alert network. GPS Solut 27(1):32. https://doi.org/10.1007/s10291-022-01365-6
McGranaghan RM, Mannucci AJ, Wilson BD, Mattmann CA, Chadwick R (2018) New capabilities for prediction of high-latitude ionospheric scintillation: a novel approach with machine learning. Space Weather 16:1817–1846. https://doi.org/10.1029/2018SW002018
Meng X, Vergados P, Komjathy A, Verkhoglyadova O (2019) Upper atmospheric responses to surface disturbances: an observational perspective. Radio Sci 54:1076–1098. https://doi.org/10.1029/2019RS006858
Meng X, Ravanelli M, Komjathy A, Verkhoglyadova OP (2022) On the north-south asymmetry of co-seismic ionospheric disturbances. Geophys Res Lett 49:e2022GL098090. https://doi.org/10.1029/2022GL098090
Natras R, Soja B, Schmidt M (2022) Ensemble machine learning of random forest, AdaBoost and XGBoost for vertical total electron content forecasting. Remote Sens 14:3547. https://doi.org/10.3390/rs14153547
Occhipinti G (2015) The seismology of the planet Mongo. In: Morra G, Yuen DA, King SD, Lee SM, Stein S (eds) Subduction dynamics. American Geophysical Union (AGU), Washington, pp 169–182. https://doi.org/10.1002/9781118888865.ch9
Occhipinti G, Coïsson P, Makela JJ, Allgeyer S, Kherani A, Hebert H, Logonné P (2011) Three-dimensional numerical modeling of tsunami-related internal gravity waves in the Hawaiian atmosphere. Earth Planet Space 63:847–851. https://doi.org/10.5047/eps.2011.06.051
Occhipinti G, Rolland L, Lognonne P, Watada S (2013) From sumatra 2004 to Tohoku-Oki 2011: the systematic GPS detection of the ionospheric signature induced by tsunamigenic earthquakes. J Geophys Res Space Physics 118(6):3626–3636. https://doi.org/10.1002/jgra.50322
Peltier WR, Hines CO (1976) On the possible detection of tsunamis by a monitoring of the ionosphere. J Geophys Res 81(12):1995–2000. https://doi.org/10.1029/JC081i012p01995
Ravanelli M, Occhipinti G, Savastano G, Komjathy A, Shume EB, Crespi M (2021) GNSS total variometric approach: first demonstration of a tool for real-time tsunami genesis estimation. Sci Rep 11(1):1–12. https://doi.org/10.1038/s41598-021-82532-6
Ravanelli M, Astafyeva E, Munaibari E, Rolland L, Mikesell TD (2023) Ocean-ionosphere disturbances due to the 15 January 2022 Hunga-Tonga Hunga-Ha’apai eruption. Geophys Res Lett. https://doi.org/10.1029/2022GL101465
Ravanelli M, Crespi M, Foster J (2020) Tids detection from ship-based GNSS receiver: first test on 2010 Maule tsunami. In: IGARSS 2020—2020 IEEE international geoscience and remote sensing symposium. Waikoloa, HI, USA, pp 6846–6849. https://doi.org/10.1109/IGARSS39084.2020.9324549
Ravanelli M (2021) An innovative approach for real-time GNSS ionosphere seismology: assessment, potentialities, applications and issues. Ph.D. thesis, Sapienza University
Reddy CD, Shrivastava MN, Seemala GK, González G, Baez JC (2017) Ionospheric plasma response to Mw 8.3 Chile Illapel Earthquake on September 16, 2015. In: Braitenberg C, Rabinovich A (eds) The Chile-2015 (Illapel) Earthquake and Tsunami. Pageoph Topical Volumes. Birkhäuser, Cham, pp 145–155. https://doi.org/10.1007/978-3-319-57822-4_12
Rolland L, Occhipinti G, Lognonné P, Loevenbruck A (2010) Ionospheric gravity waves detected offshore Hawaii after tsunamis. Geophys Res Lett. https://doi.org/10.1029/2010GL044479
Sanchez SA, Kherani EA, Astafyeva E, de Paula ER (2023) Rapid detection of co-seismic ionospheric disturbances associated with the 2015 Illapel, the 2014 Iquique and the 2011 Sanriku-Oki Earthquakes. J Geophys Res (space Phys). https://doi.org/10.1029/2022JA031231
Savastano G, Komjathy A, Verkhoglyadova O, Mazzoni A, Crespi M, Wei Y, Mannucci AJ (2017) Real-time detection of tsunami ionospheric disturbances with a stand-alone GNSS receiver: a preliminary feasibility demonstration. Sci Rep 7:46607. https://doi.org/10.1038/srep46607
Shrivastava MN, Maurya AK, Gonzalez G, Sunil PS, Gonzalez J, Salazar P, Aranguiz R (2021) Tsunami detection by GPS-derived ionospheric total electron content. Sci Rep 11:12978. https://doi.org/10.1038/s41598-021-92479-3
Ting KM (2017) Confusion matrix. In: Sammut C, Webb GI (eds) Encyclopedia of machine learning and data mining. Springer, Boston. https://doi.org/10.1007/978-1-4899-7687-1_50
Torrence C, Compo GP (1998) A practical guide to wavelet analysis. Bull Am Meteorol Soc 79(1):61–78. https://doi.org/10.1175/1520-0477(1998)079%3c0061:APGTWA%3e2.0.CO;2
Vieira S, Lopez Pinaya WH, Mechelli A (2020) Chapter 2—main concepts in machine learning. In: Mechelli A, Vieira S (eds) machine learning. Academic Press/Elsevier, London, pp 21–44. https://doi.org/10.1016/B978-0-12-815739-8.00002-X
Wang XW, Liu YY (2020) Comparative study of classifiers for human microbiome data. Med Microecol 4:100013. https://doi.org/10.1016/j.medmic.2020.100013
Zettergren MD, Snively JB, Komjathy A, Verkhoglyadova OP (2017) Nonlinear ionospheric responses to large-amplitude infrasonic-acoustic waves generated by undersea earthquakes. J Geophys Res Space Physics 122:2272–2291. https://doi.org/10.1002/2016JA023159
Zhang C, Liu C, Zhang X, Almpanidis G (2017) An up-to-date comparison of state-of-the-art classification algorithms. Expert Syst Appl 82:128–150. https://doi.org/10.1016/j.eswa.2017.04.003
Zhukov AV, Sidorov DN, Mylnikova A, Yasyukevich YV (2018) Machine learning methodology for ionosphere total electron content nowcasting. Int J Artif Intell 16:144–157. https://doi.org/10.13140/RG.2.2.19349.83685
Zhukov AV, Yasyukevich YV, Bykov AE (2020) GIMLI: Global ionospheric total electron content model based on machine learning. GPS Solut 25(1):19. https://doi.org/10.1007/s10291-020-01055-1
Acknowledgements
F. F. was supported by the Zegna Foundation Grant and by Doctoral Program fellowship within the Data Science Course of Sapienza University of Rome founded by INGV. M.R. was supported by the AXA Research Fund and the Intergovernmental Oceanographic Commission of UNESCO (IOC-UNESCO) on Coastal Livelihood grant (B83C23002490007). L.C. and B.S. were supported by the ETH Zurich.
Funding
Open access funding provided by Università degli Studi di Roma La Sapienza within the CRUI-CARE Agreement.
Author information
Authors and Affiliations
Contributions
F.F. wrote the main manuscript, F.F. and L.C. carried out the experiments, and M.R. and B.S. conceived the experiments. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Fuso, F., Crocetti, L., Ravanelli, M. et al. Machine learning-based detection of TEC signatures related to earthquakes and tsunamis: the 2015 Illapel case study. GPS Solut 28, 106 (2024). https://doi.org/10.1007/s10291-024-01649-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10291-024-01649-z