Abstract
This paper addresses a new data analysis method which is suitable to cluster flight data and complement current exceedance-based flight data monitoring programmes within an airline. The data used for this study consists of 296 simulated approaches from 4.5 NM to 1 NM to the runway threshold, flown by 74 participants (both pilots and non-pilots) with either a conventional sidestick or a gamepad in the future flight simulator at Cranfield University. It was clustered and analysed with the use of Kohonen’s Self-Organising Maps (SOM) algorithm. The results demonstrate that SOM can be a meaningful indicator for safety analysts to accurately cluster both optimal and less-optimal flying performance. This methodology can therefore complement current deviation-based flight data analyses by highlighting day-to-day as well as exceptionally good performance, bridging the cap of current analyses with safety-II principles.
Access provided by Autonomous University of Puebla. Download conference paper PDF
Keywords
1 Introduction
Flight data analysis has become an important part of each airline’s safety management system (SMS) to the extent that establishing a flight data monitoring (FDM) programme is nowadays mandatory for aircraft operators over 27 tons as per ICAO regulations [1]. Airlines and regulators collect a vast amount of data, but only a small portion of it is typically analysed in depth [2]. This is due to the fact that only events which are considered “abnormal” are analysed, leaving most of the data to be stored without further use. This data about everyday performance can, however, contain useful safety information for airlines, both in terms of trend analyses and learning about exceptionally good performance [3]. Many studies have already been published about different algorithms and data analysis methods to derive new learning opportunities from existing data [4]. This study focuses on the use of self-organising maps (SOM) to cluster flight data collected during a simulated approach. Different performance metrics were then collected to assess the clustering performance and the possibilities of deriving new knowledge from the existing data.
2 Relative Work
This study focuses on applying self-organising maps, which is a machine learning algorithm, to flight data analysis methods. Many studies which focus on data analysis algorithms and techniques, especially about data clustering, have been published in the past. This study aims to contribute to the previous research in this field.
Airlines typically use FDM to analyse their daily safety performance. Various flight parameters (over 1000 different parameters for modern aircraft such as the Airbus A350 or Boeing 787) are recorded, downloaded, and processed by flight data analysts. Flight data analyses as of today mostly focus on capturing deviations from acceptable range of parameters, such as unusually high or low airspeeds for a specific flight phase, markers of an unstable approach (e.g. the aircraft not being fully configured for landing below a specific height) and dual pilot inputs. The data is usually also classified within a safety matrix to analyse trends both in terms of event frequency and severity [5, 6]. Some recent developments now also include performing big data analyses to monitor ongoing trends within an airline, consistent with a safety-II approach [3]. Deviations from an acceptable range of parameters can help airlines to monitor their pilots’ proficiency and take mitigation measures such as specific training exercises if required [7]. As experts may accept minor deviations (that are within the acceptable range) in order to leave cognitive space for other objectives, an understanding of how these deviations interact with safety performance can prove very beneficial for airlines with pilots from diverse backgrounds featuring different piloting techniques, exposure to different equipment types, cultural values, and perspectives on individual safety objectives [8]. However, most analysis methods used in flight operations remain focused on identifying threshold exceedances, which leads to roughly only 3 to 5% of the data being analysed [9]. The 95% left could, however, lead to additional information about daily operations and normal occurrences, consistent with safety-II principles [10]. Furthermore, it heavily relies on the correct setting of the threshold for the exceedance event detection. If the detection threshold is too narrow, many events would be flagged, resulting in many false positives. Conversely, a too wide threshold would flag few events or no events at all, leading to many false negatives. Finally, it also relies on the supposition that a specific incident could occur. Unimagined potential incidents are hard to notice in the best case if no detection method has been designed previously [11]. Analysing flight data is not only done during everyday operation but also during the design stage of a new technology, such as using a touchscreen as means of flight control [12, 13].
Much previous research has demonstrated the benefits of adding machine learning to current flight data analyses. Machine learning can be defined as a programme’s ability to increase its performance with experience, i.e. through learning from the data it is fed with [14]. Machine learning can be divided into two categories: supervised and unsupervised learning. Supervised learning involves knowing the correct solution for a given dataset, whereas unsupervised learning involves not knowing any solution for a specific dataset [9]. Previous research about machine learning in flight analysis comprises Bayesian networks [15], local outlier probability [11], Multiple Kernel Anomaly Detection (MKAD) [16] and clustering [17, 18] to cite a few.
Self-organising Maps (SOM), often used as a synonym for Kohonen’s Self-Organising Map is a part of a broader type of machine learning techniques called artificial neural networks (ANN), which has not been researched extensively in conjunction with flight data analysis. ANNs are designed to simulate the sensory processing of the human brain. It simulates a network of model neurons, which can ‘learn’ many different types of problems, especially classifications [19]. The SOM algorithm specifically maps the data pattern from an input space (the original data patterns) into a n-dimensional space, known as the output space (Fig. 1). The mapping aims at preserving the topological relations between both spaces. In order to ease visualisation, the output space is usually one or two-dimensional. To map the output space, the SOM algorithm uses a neighbourhood function, which is responsible for the interactions between the different units. Therefore, SOM can be used effectively for clustering tasks and performs similarly to k-means clustering [20]. SOM are able to extract stabilised phases of flight as well as transient changes in flight parameters. Furthermore, SOM can handle large datasets, which makes them well-suited to analyse flight data and an interesting alternative to k-means clustering [21, 22].
3 Methodology
3.1 Data Source
The dataset consists of 296 simulated approaches flown in an engineering flight simulator called the Future Systems Simulator at Cranfield University [24].
74 participants (55 males, 18 females, 1 preferred not to say) were asked to fly four approaches and landings. Table 1 displays the participants’ demographical data. The first two approaches were performed with the use of a sidestick, the first one without any turbulence and the second one with simulated turbulence. The following two approaches, again with and without turbulence, were performed this time with a gamepad. The participants were asked to follow a standard three-degree descent path while tracking the runway, assisted with the instrument landing system (ILS) through the use of the Flight Director (FD). Additionally, participants could follow the cues displayed to them by the precision approach path indicator (PAPI), while the auto throttle (ATHR) would control the engine thrust to maintain a constant approach speed. For the purpose of this study, the following parameters were considered: the anonymous participant ID, the altitude, calibrated airspeed (CAS), deviation in altitude from the three-degree glideslope, aircraft pitch, presence of a disturbance (turbulence) and the mean of controlling the aircraft (sidestick or gamepad).
3.2 Research Procedure
The dataset was first cleaned to only keep the last 4.5 NM to 1 NM to the runway threshold. As most participants were not trained pilots, flying the last 1NM to the runway threshold and landing the aircraft as per standard operating procedures (SOPs) turned out to be challenging and led to high data variability within the last section of the flight. Therefore, it was removed in addition to 22 different approaches, where the data featured too many inconsistencies. The data was also discretised and interpolated for every 0.05 NM by the distance to the threshold. A new variable called ‘\(\Delta GP\)’ was created, which measures the participants’ ability to track the three-degree glideslope given by the PAPI. It represents the difference in altitude from the aircraft’s altitude compared to the reference altitude corresponding to an ideal 3-degree glide path. It has been determined as follows:
The following parameters were used: xthr, representing the aircraft distance to the runway threshold in NM and CHthr – the crossing height, which is 50 ft above the runway threshold height.
The data was then divided into two sets: one for approaches flown with the sidestick and one for the approaches flown with the gamepad. The root mean square error was calculated for each \(\Delta GP\), both of the sidestick dataset (RMSES) and of the gamepad dataset (RMSEG). Each dataset was clustered by corresponding \(\Delta GP\) variable using the self-organising map (SOM) algorithm to compare the participants’ performance. Following the clustering, Welch ANOVAs, and subsequent Games Howell post-hoc tests were conducted on each cluster’s RMSE to assess the clustering performance. Python 3.10.11 was used to conduct the analysis.
3.3 Statistical Tools Used
The SOM algorithm requiring the Python ‘MiniSom’ package was used. The SOM are a type of Artificial Neural Networks (ANN) which convert nonlinear statistical relationships on higher dimensions into a low-dimensional discretised representation map. The map consists of output neurons, usually arranged in a two-dimensional grid and trying to preserve topological relations. SOM and k-means algorithms are identical when the radius of the neighbourhood function in the SOM is equal to 0 [20]. The maximum number of clusters which can be obtained is equal to the number of output neurons. To obtain the optimal number of clusters, the Silhouette score was applied. It is equal to \(S = \frac{{b_{i} - a_{i} }}{{\max a_{i} ,b_{i} }}\), where a represents the mean cluster centroid distance and b the average nearest cluster distance for every sample i. The Silhouette score is a measure of how similar an object is to its own cluster versus the other clusters. It returns a value between −1 and 1, 1 being the best clustering. For each clustering, a variable optimisation algorithm was used, which calculated the SOM σ and learning rate for the best Silhouette score over 10′000 iterations. The optimal grid dimension corresponds to \(C = 5\sqrt N\), where C corresponds to the number of neurons and N the number of samples in the dataset [25]. For the size of the used dataset, it corresponds to a 3 × 3 grid. To analyse the SOM performance, several metrics were used: the normalised quantization error, the topographical error, the trustworthiness, and the neighbourhood preservation. The quantization error represents the mean difference between the input samples and the winning neurons. It is equal to \(QE\left( M \right) = \frac{1}{n}\mathop \sum \nolimits_{i = 1}^{n} \left\| {x_{i} - w_{{c\left( {x_{i} } \right)}} } \right\|\), n representing the number of data points and wcxi the weight vector of the best matching unit in the map for the data point xi. The quantization error was then normalized by calculating the average quantization error for each node as follows: \(NQE = \left\{ {\begin{array}{*{20}c} {\frac{1}{N}\mathop \sum \nolimits_{j = 1}^{N} \left( {\begin{array}{*{20}c} {\frac{1}{n}\mathop \sum \nolimits_{i = 1}^{n} \left\| {\emptyset \left( {x_{i} } \right) - x_{i} } \right\|} \\ {norm\left( {w_{j} } \right)} \\ \end{array} } \right)} \\ {1\, if\, no\, data\, point\, matches\, the\, unit} \\ \end{array} } \right.\), k corresponding to the number of vectors mapped of each unit. The topographical error is equal to \(TE = \frac{1}{n}\sum\nolimits_{i = 1}^{n} {d_{i} }\) where n is the number of input vectors and di the distance between the best matching and second-best matching units. The trustworthiness is equal to \(M_{1} \left( k \right) = 1 - \frac{2}{{Nk\left( {2N - 3k - 1} \right)}}\sum\nolimits_{i = 1}^{N} {\mathop \sum \nolimits_{{x_{j} \in U_{k} \left( {x_{i} } \right)}} \left( {{\text{r}}(x_{i} ,x_{j} } \right) - k)}\). The neighbourhood preservation is equal to \(M_{2} \left( k \right) = 1 - \frac{2}{{Nk\left( {2N - 3k - 1} \right)}}\sum\nolimits_{i = 1}^{N} {\mathop \sum \nolimits_{{x_{j} \in V_{k} \left( {x_{i} } \right)}} \left( {r^{ \wedge } (x_{i} ,x_{j} } \right) - k)}\). In both formulas, N represents the data set. Uk(xi) represents the data points which are k closest to the input space xi and Vk(xi) the data points which are the k closest to xi in the output space. r(xi, xj) represents the rank of xj when the data points are ordered by distance from xi and r^(xi,xj) represents the rank of xj when ordered by distance in the projection. The elbow method was used to determine the optimal k value. As the data does not show variance homogeneity (pBartlett < 0.05 for both sidestick and gamepad clustering datasets), a Welch ANOVA and Games Howell post-hoc test were used to determine the clustering performance.
4 Results and Discussion
4.1 Sample Characteristics
A total of 278 approaches from 4.5 NM to 1 NM to the runway threshold were analysed through the SOM clustering methodology. Focus was set on analysing the participants’ vertical performance, i.e. their ability to maintain a stable descent path according to a standard 3° descent. The clustering of the participants’ tracking of the 3-degree glideslope making it possible to assess the participants’ manual flying skills both in terms of good performance and common errors. The flight parameters look as displayed in Fig. 2.
4.2 Clustering of the Deviation from the 3 Degrees Glideslope for Approaches Flown with a Gamepad
The variables shown in Table 2 were used to optimise the SOM algorithm to cluster the approaches flown with a gamepad. The Silhouette score indicates a moderately strong clustering performance with an optimised cluster number of four clusters [26]. The results from Table 3 demonstrate that SOM can be a meaningful tool to cluster the gamepad flight data. Although the NQE remains fairly high, indicating that some data points do not match the unit’s weight vector [27], the topographic structure of the original data is well-preserved on the map [28, 29].
The approaches flown with a gamepad grouped by cluster are displayed in Fig. 3, which shows the CAS, deviation from the three degrees glideslope and pitch. The trials with disturbance appear to have been more difficult for some participants, as clusters 3, 6 and 7 are almost only composed of landings with disturbances and feature a comparatively higher RMSE (Fig. 3 and Table 4). The clusters are well-defined, the Welch ANOVA indicates significant differences between clusters (F = 3015, p < 0.05). The post hoc analysis shows significant differences in RMSE (p < 0.05) between the clusters (Table 5). Cluster 5 displays the least RMSE, and cluster 6 displays the highest RMSE (Table 4). The difference between experienced pilots and novices is less pronounced than within the sidestick dataset. A cause for this can be the smaller experience gap in using gamepads compared to flying with a sidestick.
4.3 Clustering of the Deviation from the 3 Degrees Glideslope for Approaches Flown with a Sidestick
The variables shown in Table 6 were used to optimise the SOM algorithm to cluster the approaches flown with a sidestick. The Silhouette score showed moderately strong clustering performance with an optimised cluster number of four clusters [26], although lower than the one corresponding to the gamepad flight data. The results from Table 7 demonstrate that SOM can be a meaningful tool to cluster the sidestick flight data. Although the NQE remains fairly high, indicating that some data points do not match the unit’s weight vector [27], the topographic structure of the original data is well-preserved on the map [28, 29]. Overall, the clustering result metrics are very similar compared to the gamepad flight data, but the flying performance is better when participants flew with the sidestick. This can be due to the difference in experience, as qualified pilots would perform better than non-pilots with the sidestick whereas the overall flying performance is lower with the gamepad (Table 8).
The approaches flown with a sidestick grouped by cluster are displayed in Fig. 4, which shows the CAS, deviation from the three degrees glideslope and pitch. The trials with disturbance appear to have been more difficult for some participants, as clusters 3 and 7 are almost only composed of landings with disturbances and feature a comparatively higher RMSE, similar to the results of the gamepad landings (Fig. 3 and Table 4). The clusters are well-defined, the Welch ANOVA indicates significant differences between clusters (F = 5272, p < 0.05). The post hoc analysis shows significant differences in RMSE (p < 0.05) between the clusters (Table 9). Cluster 4 displays the least RMSE (Table 8) and cluster 2 displays the highest RMSE. The results are interesting with regards to the participants’ experience, as both experienced pilots and novices are present in cluster 4.
4.4 Limitations
Several limitations are present in this study. Firstly, there is a high variance within the pilots’ experience, ranging from novice to expert, which distorts the data, compared to an FDM dataset. Moreover, it can be difficult for non-pilots to follow the precision approach indicator (PAPI) and ILS guidance for the first time. The control forces on the sidestick are those of a generic aircraft which might also differ from some actual aircraft types. Finally, the SOM algorithm parameters could be optimised, and a stronger algorithm could be used for the clustering instead of the MiniSom package, which is an introductory package into SOM. This would provide a better overall clustering performance.
5 Conclusion
The clustering method through SOM provides useful information to analyse flight performance beyond exceedance events. For these datasets, it shows that, in overall, the flying performance is less susceptible to variability when flying with a sidestick compared to flying with a gamepad. This might indicate that for a novice, the use of a gamepad is easier than the sidestick. However, experienced pilots perform better when flying with the sidestick, as the average RMSEs for the sidestick data are lower than for the gamepad. Based on the deviation from the glidepath clusters, it is also possible to analyse the pilots’ pitch inputs on the flight controls and determine the effects of the different flying techniques on the flight path. Furthermore, the results show that some participants had more difficulty handling approaches with disturbances, both with the sidestick and with the gamepad. A more detailed study could be accomplished with further tuning of the SOM parameter to increase the clustering performance. The use of SOM and clustering in general could prove beneficial for airlines to perform big data and trend analyses in addition to a purely exceedance-based analysis and so take a step further towards safety-II by considering the contexts behind detected exceedances which may be influenced by previous experience levels and familiarity.
References
ICAO: Annex 6 Part 1 (2010). Accessed 27 Aug 2021. https://store.icao.int/en/annex-6-operation-of-aircraft-part-i-international-commercial-air-transport-aeroplanes
Li, W.-C., Nichanian, A., Lin, J., Braithwaite, G.: Investigating the impacts of COVID-19 on aviation safety based on occurrences captured through Flight Data Monitoring. Ergonomics, 1–39 (2022). https://doi.org/10.1080/00140139.2022.2155317
Flight Safety Foundation: Learning From All Operations: Expanding the Field of Vision to Improve Aviation Safety, July 2021. Accessed 18 Nov 2021. https://flightsafety.org/wp-content/uploads/2021/07/Learning-from-All-Operations-FINAL.pdf
Alreshidi, I., Moulitsas, I., Jenkins, K.W.: Advancing aviation safety through machine learning and psychophysiological data: a systematic review. IEEE Access 12, 5132–5150 (2024). https://doi.org/10.1109/ACCESS.2024.3349495
Walker, G.: Redefining the incidents to learn from: safety science insights acquired on the journey from black boxes to Flight Data Monitoring. Saf. Sci. 99, 14–22 (2017). https://doi.org/10.1016/j.ssci.2017.05.010
Maille, N.: On the use of flight operating procedures for the analysis of FOQA data. In: 6th European Conference for Aeronautics and Space Sciences (EUCASS), Krakow: ONERA, July 2015. Accessed 9 Nov 2021. https://www.researchgate.net/publication/314973879
Bromfield, M.A., Landry, S.J.: Loss of control in flight – time to re-define? In: AIAA Aviation 2019 Forum, pp. 1–10 (2019). https://doi.org/10.2514/6.2019-3612
Chan, W., Li, W.-C.: Perception of causal factors in flight operations between Ab-Initio and expatriate pilots. In: 2023 7th International Conference on Transportation Information and Safety (ICTIS), pp. 1728–1732. IEEE, August 2023. https://doi.org/10.1109/ICTIS60134.2023.10243970
Jasra, S.K., Valentino, G., Muscat, A., Camilleri, R.: Hybrid machine learning-statistical method for anomaly detection in flight data. Appl. Sci. 12(20), 10261 (2022). https://doi.org/10.3390/app122010261
Hollnagel, E.: Safety–I and Safety–II, 1st edn. CRC Press, London (2018). https://doi.org/10.1201/9781315607511
Oehling, J., Barry, D.J.: Using machine learning methods in airline flight data monitoring to generate new operational safety knowledge from existing data. Saf. Sci. 114, 89–104 (2019). https://doi.org/10.1016/j.ssci.2018.12.018
Li, W.-C., Wang, Y., Korek, W.T.: To be or not to be? Assessment on using touchscreen as inceptor in flight operation. Transp. Res. Procedia 66, 117–124 (2022). https://doi.org/10.1016/j.trpro.2022.12.013
Korek, W.T., Li, W.C., Lu, L., Lone, M.: Investigating pilots’ operational behaviours while interacting with different types of inceptors. In: Harris, D., Li, WC. (eds.) HCII 2022, vol. 13307. LNAI, pp. 314–325. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-06086-1_24
Mitchell, T.M.: Machine Learning. McGraw-Hill Science/Engineering/Math (1997)
Barry, D.J.: Estimating runway veer-off risk using a Bayesian network with flight data. Transp. Res. Part C Emerg. Technol. 128 (2021). https://doi.org/10.1016/j.trc.2021.103180
Das, S., Li, L., Srivastava, A.N., John Hansman, R.: Comparison of algorithms for anomaly detection in flight recorder data of airline operations. In: 12th AIAA Aviation Technology, Integration and Operations (ATIO) Conference and 14th AIAA/ISSMO Multidisciplinary Analysis and Optimization Conference. American Institute of Aeronautics and Astronautics Inc. (2012). https://doi.org/10.2514/6.2012-5593
Li, L., Das, S., Hansman, R.J., Palacios, R., Srivastava, A.N.: Analysis of flight data using clustering techniques for detecting abnormal operations. J. Aerosp. Inf. Syst., 587–598 (2015). https://doi.org/10.2514/1.I010329
Fernández, A., et al.: Flight Data Monitoring (FDM) unknown hazards detection during approach phase using clustering techniques and AutoEncoders. In: 9th SESAR Innovation Days. SESAR (2019)
Krogh, A.: What are artificial neural networks? Nat. Biotechnol. 26(2), 195–197 (2008). https://doi.org/10.1038/nbt1386
Bação, F., Lobo, V., Painho, M.: Self-organizing maps as substitutes for K-means clustering. In: Sunderam, V.S., van Albada, GDick, Sloot, P.M.A., Dongarra, J. (eds.) Computational Science – ICCS 2005. LNCS, vol. 3516, pp. 476–483. Springer, Heidelberg (2005). https://doi.org/10.1007/11428862_65
Bardet, J.-M., Faure, C., Lacaille, J., Olteanu, M.: Design aircraft engine bi-variate data phases using change-point detection method and self-organizing maps. In: Grenada: ITISE, September 2017. Accessed 18 Jan 2024. https://www.safran-aircraft-engines.com
Bektas, O.: Visualising flight regimes using self-organising maps. Aeronaut. J. 127(1316), 1817–1831 (2023). https://doi.org/10.1017/AER.2023.71
Andrades, I.S., Castillo Aguilar, J.J., García, J.M.V., Carrillo, J.A.C., Lozano, M.S.: Low-cost road-surface classification system based on self-organizing maps. Sensors 20(21), 6009 (2020). https://doi.org/10.3390/s20216009
Flight simulator at Cranfield University wins international award. Accessed 31 Jan 2024. https://www.cranfield.ac.uk/press/news-2021/flight-simulator-at-cranfield-university-wins-international-award
Vesanto, J., Alhoniemi, E.: Clustering of the self-organizing map. IEEE Trans. Neural Netw. 11(3), 586–600 (2000). https://doi.org/10.1109/72.846731
Shahapure, K.R., Nicholas, C.: Cluster quality analysis using silhouette score. In: 2020 IEEE 7th International Conference on Data Science and Advanced Analytics (DSAA), pp. 747–748. IEEE, October 2020. https://doi.org/10.1109/DSAA49011.2020.00096
Ayadi, T., Hamdani, T.M., Alimi, A.M.: A new data topology matching technique with multilevel interior growing self-organizing maps. In: 2010 IEEE International Conference on Systems, Man and Cybernetics, pp. 2479–2486. IEEE, October 2010. https://doi.org/10.1109/ICSMC.2010.5641936
Bauer, H.-U., Pawelzik, K.R.: Quantifying the neighborhood preservation of self-organizing feature maps. IEEE Trans. Neural Netw. 3(4), 570–579 (1992). https://doi.org/10.1109/72.143371
De Bodt, E., Cottrell, M., Verleysen, M.: Statistical tools to assess the reliability of self-organizing maps. Neural Netw. 15(8–9), 967–978 (2002). https://doi.org/10.1016/S0893-6080(02)00071-0
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Nichanian, A., Li, WC., Korek, W.T., Wang, Y., Chan, W.TK. (2024). Self-organising Maps for Comparing Flying Performance Using Different Inceptors. In: Harris, D., Li, WC. (eds) Engineering Psychology and Cognitive Ergonomics. HCII 2024. Lecture Notes in Computer Science(), vol 14693. Springer, Cham. https://doi.org/10.1007/978-3-031-60731-8_8
Download citation
DOI: https://doi.org/10.1007/978-3-031-60731-8_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-60730-1
Online ISBN: 978-3-031-60731-8
eBook Packages: Computer ScienceComputer Science (R0)