Introduction

Real-time remote health monitoring is important given that patients who live far from hospitals and use telemedicine may suffer from different chronic diseases, such as chronic heart disease, chronic blood pressure (BP), fall detection and diabetes. Chronic diseases are an increasingly important concern for e-healthcare systems worldwide. For example, clinical expenses for chronic diseases in the United States are projected to reach 80% of the total medical costs and more than 150 million people who are prone to chronic diseases will be included in “Large Scales of Data’ by 2020 [1]. The Large Scales of Data in healthcare refer to the large and complex electronic health data sets that are difficult (or impossible) to manage either with traditional software and/or hardware or traditional or common data management tools and methods [2,3,4,5]. In healthcare, large scales of data are overwhelming not only because of their volume but also because of the diversity of data types and the speed at which they must be managed. The totality of data related to patient healthcare and well-being are the main components of large scales of data in the healthcare industry [2, 6, 7]. Thus, the applications in healthcare take advantage of data explosion to gain insights into making informed decisions [8]. However, such massive amounts of data, which are continuously created by sensing technologies, add to the big data problem [9]. Although the large scales of data concept and techniques [10] are used in various fields, such as Smart Cities, they are not widely used in biomedicine and telemedicine or patient monitoring for the use and integration of data from biosensors [11,12,13,14,15,16,17,18,19,20]. Despite some modern solutions, current data mining techniques, which involve patient monitoring in the medical domain, are generally at an alarmingly early state [11]. Continuous and daily monitoring of physiological data, such as heart rate or electrocardiogram (ECG) signals, is important in managing chronic diseases. Thus, healthcare researchers and developers have focused on health monitoring in out-of-hospital conditions, specifically in home environment, where telemedicine is used. Automatic diagnosis of heart diseases is an important and actual medical concern because they affect the health and working performance of patients specifically during disasters and peak times and in the elderly. The World Health Organisation estimated that 12 million deaths occur annually worldwide because of heart diseases [1].

Chronic heart diseases include several types of diseases and symptoms that manifest in patients. For example, cardiac arrhythmia is a life-threatening medical emergency that can result in cardiac arrest and sudden death. According to a medical report by the American Heart Association in 2010, approximately 55% of patients with heart diseases die because of arrhythmia [21]. Serious arrhythmia cases, such as ventricular tachycardia or fibrillation, are induced by vortex-like re-entrant electric waves in cardiac tissues. In addition, vital signs, such as ECG and oxygen saturation sensor (SpO2), are important in triage setting because they objectively complement the triage decision and optimise inter-rater consistency [22]. Certain medical guidelines are followed in triage setting and prioritisation based on vital signs and chronic disease-related features of patients.

Patients who are physically present in the emergency department (ED) of a hospital are prioritised by triage nurses. Triage setting traditionally relies on the ability of nurses to prioritise cases. Triage setting and prioritisation become complicated when patients live far from the hospital and use telemedicine during disasters and peak times; in such cases, triage nurses and doctors are not physically available to help the patient; triage setting and prioritisation are more complex in telemedicine than that in actual ED situations [23]. Triage setting and prioritisation of patients who require the most urgent attention in telemedicine has gained considerable attention. In telemedicine, patients are triaged and prioritised for treatment and transportation to hospitals by assessing their vital signs [23, 24]. Patient condition must be primarily assessed to determine the priority category according to medical guidelines [23,24,25].

Technically, triage setting and prioritisation processes during disasters and peak times are complex decision-making procedures [26]. Thus, several triage scales have been designed to correspond to decision support systems and provide a guide for making correct decisions [26,27,28,29,30]. Triage setting and prioritisation processes during disasters and peak times involve simultaneous consideration of multiple attributes, including vital signs and features, and assignment of the proper weight for each feature to score a patient based on the most urgent case [24]. Patients under the most emergency cases must receive the highest priority level, whereas patients with less emergency cases must be given the lowest priority levels compared with other patients over the telemedicine environment. However, setting this prioritisation is a very difficult and challenging task because each patient with chronic heart disease requires a multi-attribute sensor for evaluation of their vital signs. For example, ECG and SpO2 have been proven to be important in triage setting because they provide an objective complement to the triage decision-making process and optimise inter-rater consistency. Consequently, certain medical guidelines must be followed in the triage setting and prioritisation based on the vital signs and chronic heart disease-related features of patients. Furthermore, each decision maker (DM) provides different weights to these attributes (vital signs). On one hand, a server who aims to score a patient might give more weight to the vital feature rather than the other less interesting ones. On the other hand, developers who aim to use software to solve this problem will probably target different attributes as the most important ones. Thus, the triage setting and prioritisation processes of patients with chronic heart diseases are a multi-complex attribute problem during disasters and peak times, in which each patient is considered an available alternative for the DM.

This study presents a new real-time approach to aid the decision-making process for patients with chronic heart diseases in the telemedicine environment during disasters and peak times. An integrated model is proposed to evaluate and score patients based on Multi-layer for Analytic Hierarchy Process (MLAHP) and Technique for Order Performance by Similarity to Ideal Solution (TOPSIS). The remaining sections of this paper are organised as follows. Section “Literature review” presents the literature review. Section “Methodology” describes the decision-making methodology for evaluation and scoring of patients with chronic heart diseases. Section “Results and discussion” reports the results and discussion. Section “Validation and evaluation” discusses the results of validating and evaluating the proposed method. Section “Summary points” highlights the contributions of the research as summary points. Section “Conclusion” concludes the report.

Literature review

Current literature on patient prioritisation methods is largely limited and scattered. Some attempts have been made to create strategies for these methods. Several studies investigated patient prioritisation to overcome the potential shortcomings of priority assignment in the traditional triage process [31].

A previous study [32] presented the use of utility theory in healthcare, specifically in improving triage decision-making, productivity and reducing cognitive load on triage nurses in EDs. The inherent uncertainty in this problem is the reason behind selecting utility theory to solve the problem of sorting patients in EDs. In this study, patients were ranked based on an emergency severity index and three descriptive variables, namely, age, gender and pain level. However, the small sample size of 21 patients and a determinant, such as pain level, may affect the accuracy of the results; moreover, this study did not consider the conflict in between the data and its resolution. The same author [33] also proposed dynamic grouping and prioritisation (DGP) algorithm to identify appropriate patient groups and prioritise them based on patient and system benefits. Based on discrete event simulation, the results provide statistical evidence that the DGP system outperforms alternative prioritisation methods in terms of all performance measures. However, the algorithm does not improve patient throughput over large scales of data based on multi-performance measures.

An approach referred to as the ‘floating patient’ method was proposed for optimising the schedule of patients for ED examination [31]. However, this study was applied inside the ED settings without considering multiple evaluations, which can affect the optimisation of the scheduling of patients based on their states.

Another study aimed to provide insights into the problem of patient prioritisation during complete evacuations in healthcare facilities [34]. The authors of this study proposed a dynamic programming model for emergency patient evacuation and concluded that continuous discussions among healthcare workers concerning the ethical dilemmas associated with making evacuation decisions are necessary. An algorithm with a column generation approach was also proposed [35]. However, this study was applied in a disaster situation and restricted to the allocation of emergency medical resources. A dynamic programming scheduling algorithm that included halting and a proposed simulation method were applied to crowded patients inside the ED [36]. However, both studies did not consider the problem of patient prioritisation based on emergency cases by using multi-measurement attributes.

Ref. [37] conducted an exploratory work using a hypothetical example of a methodology and multi-attribute utility analysis of healthcare. The hypothetical sample problem presented involves patient prioritisation in an ED. This study [38] also investigated the potential for integrating technology and multi-attribute utility theory in developing a dynamic decision support system for patient prioritisation in ED settings. The attributes studied in this research complied with preferential and utility independence with one another. However, this condition may not hold true when considering other attributes, such as complaints. Furthermore, both studies did not consider multiple attributes, which are related to the emergency state of patients.

Ref. [39] proposed a system using the electronic triage tag (e-Triage), which enables emergency medical technicians to identify the locations and conditions of patients. However, this work focused on the transportation scheduling problem after secondary triage of patients. Another study [40] attempted to maximise the number of expected saved patients under limited medical resources. The proposed heuristic algorithm was based on depth-limited search. The results showed that the average number of saved patients is 10% higher than that when using greedy methods. However, the algorithm cannot calculate patient priority based on multiple attributes. The priority level is only assumed in the simulation, and the patients are assumed to be already triaged.

The above-mentioned methods show disadvantages, and no particular study for any type of chronic disease is available and did not consider the large scales of data during disasters and peak times. Ref. [23] proposed a multi-source healthcare architecture (MSHA) to improve healthcare services by enhancing remote triaging and remote prioritisation processes for telemedicine of patients with chronic heart disease. The general scheme of telemedicine system consists of three tiers, namely, sensors/sources (e.g. ECG, SpO2), base station and server. However, the simulation of MSHA is implemented only at the base station (Tier 2), indicating that patients who are physically present in the hospital (server side, Tier 3) are not addressed. The exclusion of in-hospital patients raises an ethical healthcare issue, including the consideration for all patients (i.e., in-hospital and telemedicine patients) in the prioritisation process and providing them with compatible healthcare services based on their emergency level. As an approach for prioritising patients with the most urgent status, the inclusion of in-hospital patients with telemedicine patients requires a robust method that can accommodate an increasing number of patients and handle an increasing data size as ‘large scales of data’ during disasters and peak times because triage setting and prioritisation processes involve simultaneous consideration of multiple attributes (vital signs) and assigning proper weight for each feature to score the patients based on the most urgent cases. Therefore, this process can be considered a multi-criterion decision problem. Multi-sensor sources are available. For each sensor source, the subset of features displays a range of different, conflicting data used in various triage levels for the difficult task of increasing data size during disasters and peak times.

Different methods have been applied to prioritise patients. However, healthcare decisions are generally complex and involve confronting trade-offs between multiple and often conflicting attributes. Specifically, the prioritisation of patients based on their medical condition and chance of survival is a complex decision-making problem [37, 41, 42] because the decision is made based on a set of attributes [43]. Additionally, patient prioritisation involves simultaneous consideration of multiple attributes (vital signs) and requires assigning proper weights for each feature to score the patients based on the most urgent cases. Therefore, using structured and explicit approaches in decisions involving multiple attributes can improve the quality of decision making, and a set of techniques, which are classified under the collective heading multiple criterion decision analysis (MCDA), can be used for this purpose. MCDA is a sub-discipline of operational research and explicitly considers multiple criteria in decision-making conditions, which occur in various actual situations of medical diagnosis [44]. Several useful techniques can be used to deal with multi-attribute decision-making or multi-criterion decision-making (MADM/MCDM) problems in real world. These methods help DMs organise the problems to be solved and perform analysis, ranking and scoring of alternatives [44, 45]. Accordingly, the scoring of a suitable alternative(s) must be performed. MADM/MCDM methods can solve the scoring problem of big data for patients with chronic diseases based on the most urgent cases in a telemedicine environment. In any MADM/MCDM ranking, fundamental terms must be defined, and these terms include decision or evaluation matrix (EM), alternatives and criteria [25]. EM consists of m alternatives and n criteria, which must be created. Considering the intersection of each alternative and criteria as x ij , we obtain the following matrix(x ij )m ∗ n:

$$ {\displaystyle \begin{array}{c}\kern3.75em {C}_1\kern1.00em {C}_2\kern1.00em \begin{array}{cc}\dots & {C}_n\end{array}\\ {} DM/ EM=\begin{array}{c}\begin{array}{c}{A}_1\\ {}{A}_2\end{array}\\ {}\begin{array}{c}\vdots \\ {}{A}_m\end{array}\end{array}\left[\begin{array}{cc}\begin{array}{cc}{x}_{11}& {x}_{12}\\ {}{x}_{21}& {x}_{22}\end{array}& \begin{array}{cc}\dots & {x}_{1n}\\ {}\dots & {x}_{2n}\end{array}\\ {}\begin{array}{cc}\vdots & \vdots \\ {}{x}_{m1}& {x}_{m2}\end{array}& \begin{array}{cc}\vdots & \vdots \\ {}\dots & {x}_{mn}\end{array}\end{array}\right]\end{array}} $$

where A1, A2, …, A m are the possible alternatives scored by DMs (i.e. patients); and C1, C2, …, C n are criteria against which each alternative performance is measured (i.e. measurements). Finally, x ij is the rating of an alternative A i with respect to criterion C j , and W j is the weight of the criterion C j . Certain processes, such as normalisation, maximisation of indicators and addition of weights must be completed to rank the alternatives.

Several MCDM theories or methods have been investigated. The most popular methods of MADM that use different concepts include multiplicative exponential weighting (MEW), weighted product method (WPM), weighted sum model (WSM), simple additive weighting (SAW), hierarchical adaptive weighting (HAW), analytic hierarchy process (AHP), analytic network process (ANP) and TOPSIS [45]. To our knowledge, none of these methods have been used in scoring large scales of data of patients with chronic diseases during disasters and peak times.

According to [46,47,48,49,50,51,52,53,54,55,56,57,58], the drawbacks, benefits and recommendations for the popular MCDM techniques can be summarised as follows: HAW and WSM techniques are easy to use and understand, but the weights of the attributes are arbitrarily assigned; moreover, both techniques become difficult to use as the number of criteria increases [59,60,61,62,63]. Another problem with these methods stems from the use of common numerical scales to obtain the final score. SAW considers all of the criteria, perform decisions intuitively and offers simple calculation. All values of the criteria must be maximum and positive. Moreover, SAW does not always reflect the real situation. The strengths of MEW and WPM include the ability to remove any unit of measure and the use of relative values rather than actual ones. However, no solution with an equal weight of decision matrices is offered. Meanwhile, the ANP model provides full understanding of the level of importance that a criterion can have by agreeing to its interrelationship with the other elements of the model. A benefit of the ANP model is that it allows assessment of the consistency of judgments, and such assessment is impossible to evaluate using a method that assigns weights by compromise. Another positive aspect of the ANP model is that it facilitates assigning weights because it splits up the problem into small parts, in which a group of academics can have a manageable discussion and only two criteria can be compared to assign the judgments. However, ANP suffers from two disadvantages. Firstly, providing a correct network structure for the criteria is difficult even for experts, and different structures lead to varying results. Secondly, to form a super matrix, all criteria must be pair-wisely compared with regard to all other criteria, and the comparison is difficult and unnatural [64, 65]. AHP enables DMs to arrange a decision-making problem into a hierarchy to understand and simplify the problem. This technique is time consuming because of the mathematical calculations and number of pairwise comparisons, which increase as the number of alternatives and criteria increases or changes. Ranking of alternatives in AHP depends on the alternatives considered for evaluation. Adding or deleting alternatives can change the final ranking (rank reversal problem).

The MCDM techniques discussed in this section are used to score big data of patients with chronic heart diseases and prioritise the most urgent cases. However, these techniques lack indicators of how well the healthcare service can satisfy the needs of patients. Another problem with these techniques is the non-adoption of a requirement-driven approach, which makes them inadequate for priority scoring based on decision making [44]. TOPSIS is functionally associated with problems of discrete alternatives and is one of the most practical methods for solving real-world problems. The relative advantage of TOPSIS is its ability to rapidly identify the optimal alternative. Therefore, TOPSIS is suitable for cases with numerous alternatives and attributes [66]. However, the major weakness of TOPSIS is the lack of provision for weight elicitation and consistency of checking for judgments. In this regard, TOPSIS requires an efficient technique to obtain the relative importance of different criteria with respect to the objective; AHP provides such a procedure. Given that AHP is used to set weights for objectives based on stakeholder preferences [66] and has been significantly restrained by the human capacity for information processing, the value of 7 ± 2 would be the ceiling for comparison [67]. From this viewpoint, TOPSIS alleviates the requirement of paired comparisons, and the capacity limitation may not significantly influence this process [68].

The newest trend with respect to MCDM is to combine two or more methods to compensate for the shortcomings of a single method [69,70,71]. TOPSIS and AHP have become widely accepted integrated MCDM methods for the following reasons: ability to provide complete ranking results, use of weights and objective data to calculate relative distances, suitability to be combined with stochastic analysis, smoothing of trade-offs by dealing with nonlinear relationships and easy conversion of the method into a programmable procedure [66, 70]. Hence, combining AHP and TOPSIS has been recommended for ranking patients with chronic diseases by using remote large scales of data and for prioritising the most urgent cases during disasters and peak times.

Methodology

This study provides a detailed overview of the alternative large scales of data of patients with chronic heart diseases based on a set of measures through the relatively infrequent route of actual measurement of patient data and reporting of hands-on evaluation results based on the ECG and SpO2 sensors and other variables as raw data. Data alignment of raw data and decision-making matrix was performed by converting each extracted feature into an integer, which represents their state in the triage level based on medical guidelines. The input to this part (sources and inclusion criteria of subject articles) is discussed in subsequent subsections. Ranking of patients with chronic heart diseases by using big data is based on inputs received from medical guidelines, medical sensor evaluation and text frame sources. The output ranks the patients based on a constructed decision-making matrix by using integrated MLAHP and TOPSIS. All of the elements of our study are shown in the overall conceptual framework in Fig. 1.

Fig. 1
figure 1

Conceptual framework for triage and prioritisation patients with chronic heart diseases by using large scales of data

Evaluation of medical sensors and sources

Several medical devices are used to measure the vital signs of patients. The number and type of sensors depend on the type of disease that must be monitored on the patient. Given that the focus of this research is chronic heart diseases, this section evaluates four relevant medical sources that demonstrate heart performance and reflect the medical symptoms of the patients. The three sensors used as signal sources and one text source [23] are shown in Table 1. For each source, certain medical features are considered (e.g. the ECG signal includes rhythm and ST elevation). The medical assessment of each feature with the related source is shown in Table 1.

Table 1 Description of four relevant medical sources used in monitoring patients

Each sensor signal is represented by an array. In the dataset used as sensory data, each element in the signal represents two values, namely, time and voltage. The array of each signal contains two columns (each column represents a value). The number of rows is defined by the number of elements in the signal, starting from 0 to n. Meanwhile, the array of the text feature is 1 × 4 because four variables represent four non-sensory features. The mathematical representations for the multi-function feature extraction algorithm for each source are displayed in Fig. 2.

Fig. 2
figure 2

Feature extraction algorithm

In Fig. 2, the early stage input data are processed and analysed using logically defined medical guidelines. The medical guidelines consider the rules that are defined and validated not only by research but also by practical experiences of medical doctors and experts to support the decision of an emergency level user [5]. Medical guidelines demonstrate the relation between user input vital sign as input data and medical diagnosis as output. In this study, 11 features from four heterogeneous sources are modelled (Table 1) and used as criteria in the proposed decision-making algorithm. The procedure for computing the features and details about the medical guidelines for each source are described in the following sections.

  1. 1.

    ECG Sensor

A real-time data processing multi-function algorithm was designed to extract ECG features. The ECG signal is represented by an array of two columns, which correspond to time (ms) and voltage (mv), and their values were used to extract the features. The ECG signal exhibits many cycles, as shown in Fig. 3. One ECG cycle demonstrates many ECG features, such as rhythm, QRS, ST and R–R regularity, which are represented in Fig. 3a. For each cycle, the signal values in time varied around the zero line, and those values were used to split the ECG cycle into up and down halves (Fig. 3b). Subsequently, the upper half was sorted based on voltage values to determine the maximum point, R (Fig. 3c). The upper half of the ECG cycle was split into right and left halves (Fig. 8c). The location of Q and S points was located (Fig. 3d) using certain functions presented in Fig. 1 to sort the values of the ECG cycle for each half (Up_Lift and Up_right) based on (t) and (v) values.

Fig. 3
figure 3

ECG feature extraction processes

According to medical guidelines [23], the normal value of ECG rhythm is within 60–100 beats per minute. The normal range of QRS width is 0.06–0.12 ms. ST elevation is considered normal when the ST segment is straight and is abnormal when the ST segment is evaluated to be up or down. Finally, regular and irregular patterns of peak-to-peak interval were regarded as normal and abnormal, respectively.

Technically, in Fig. 3, \( {\mathrm{e}}_{\mathrm{i}}^0 \) to \( {\mathrm{e}}_{\mathrm{v}}^0 \) represent all elements that belong to the ECG signal for 1 min. The (get Peaks ();) function is used to calculate the number of R in the ECG signal within 1 min. Therefore, the number of R represents the rhythm. The function (getAvgCycleInterval ()) is used to determine the average width cycle and the QRS width in the (getQRS ();) function. The ST elevation can be determined using the (isSTSegmentUP();) function, which is based on the differences in ∆t and ∆v values by using subtraction functions (Fig. 3e).

  1. 2.

    BP Sensor

BP measurements are commonly classified into normal and abnormal cases on the basis of the measurement of high and low BP levels for a patient [45]. According to medical guidelines [23], for patients with regularly high BP, values between 110 and 140 mmHg, 140 and 180 mmHg >180 mmHg are considered normal, abnormal and at risk, respectively. For patients with regularly low BP, values between 60 and 90 mmHg, 90 to 110 mmHg and >110 are regarded as normal, abnormal and at risk, respectively. Technically, the value of high and low BP is calculated using the (get HighValue ();)and (get LowValue ();) functions, respectively. These two functions are implemented for all elements of BP signals in 1 min, starting from \( {b}_i^0 \) and finishing at \( {b}_v^0 \) (Fig. 2).

  1. 3.

    SpO2 Sensor

One feature value is extracted from SpO2 sensors. The accelerometer (SpO2) value can be changed for the same user in different activities. According to medical guidelines [59], the levels of SpO2 vary according to different types of chronic heart diseases. The normal, abnormal and at risk values of SpO2 are 96%–100%, 90%–96% and <90%. Technically, the value of SpO2 in 1 min can be extracted using the (get s value ();) function for all elements of SpO2 signal starting from \( {\mathrm{s}}_{\mathrm{i}}^0 \) to \( {\mathrm{s}}_{\mathrm{v}}^0 \) (Fig. 2).

  1. 4.

    Text Source

Text features are important for chronic heart diseases because these features present the activity of muscles surrounding the heart [me]. In addition, text features are non-sensory data that add context to the user or patient. Patients generally present their complaints to physicians by manually completing forms to describe their pathological conditions. A series of discussions with doctors revealed that text features are important to chronic heart diseases [23]. Given that these features present the activity of muscles surrounding the heart, abnormal text features are common in all heart diseases, regardless of whether ECG features are normal or abnormal [23]. The presence of abnormal features in the ECG indicates that they are also present in the texts. In this instance, a patient is considered an urgent case. By contrast, when the ECG is normal and the text features are abnormal, a certain type of heart disease that is not classified under urgent cases and not previously classified is reflected. Technically, text inputs related to chronic heart diseases are chest pain, shortness of breath, palpitation and current activity of the patient (resting or exercising). A system that includes a graphical user interface equal to the diagnostic procedure is proposed to enable a user to answer four questions corresponding to four text features. Additional questions increase processing time and system complexity. Users answer either ‘Yes’ or ‘No’ to the questions, and the value of each feature is determined using functions (chest pain, shortness of breath, palpitation and rest or exercise; Fig. 1).

Using a single platform to consolidate all results from the evaluated features from various sources is challenging. This approach can improve the delivery of healthcare services, especially when the features exert a common medical influence on the diagnostics of a certain disease. Thus, the developed and proposed framework facilitates the integration of ECG, SpO2 and BP sensors and text in a single approach dedicated to the diagnostics of chronic heart diseases. Table 2 demonstrates the features of the proposed framework for a sample of 40 patients over 500. However, the score evaluation of each feature requires validation based on medical guidelines.

Table 2 Evaluation matrix

Employing decision-making theory to the raw data presented in Table 2 is not mathematically applicable because of the following: (1) the raw data show inconsistent format (strings and numbers); and (2) the numbers are different in medical diagnostics. For example, the value of 80 in low BP represents ‘risk triage level’, whereas 80 in SpO2 is regarded as ‘normal triage level’. Thus, data alignment process must be designed and applied between raw data processing and decision-making matrix. Data alignment designates an integer to each extracted feature. The integers represent the state of the feature in the triage level based on medical guidelines. The measurements of SpO2 and BP sources represented by five triage levels, namely, normal, cold state, sick, urgent and at risk, are replaced by the numbers 0, 1, 2, 3 and 4, respectively. The other two sources, namely, ECG and text sources, which correspond to normal and abnormal triage levels, are substituted by 0 and 1, respectively. According to the data alignment process, the raw data shown in Table 2 in the decision matrix are presented in Table 3.

Table 3 Evaluation matrix of Table 2 in decision-making matrix

As shown in Table 3, different sources exhibit varying features that show various triage levels. In a medical perspective, the vital features can simultaneously indicate more than one triage level for the same patient. For example, patients are in the normal triage level when they demonstrate shortness of breath and peak-to-peak feature (ECG features). However, the same patients are also considered to show abnormal triage levels according to the SpO2 data and low BP features. Consequently, triage nurses cannot determine the triage levels of these patients. Therefore, the final decision that represents the triage levels of patients is difficult to make through paper-based triage system or basic automated triage system, which is generally used in hospitals. A decision-making algorithm and computer-based approach are necessary for mitigating the complexity of patient triage setting and prioritisation and addressing incongruent patient data.

Prioritising patients with chronic heart diseases

Patients with chronic heart diseases were evaluated through several criteria, such as ECG, SpO2, BP sensors, diastolic BP and non-sensory measurement (text frame). Problems emerge when each patient with chronic heart disease exhibit several attributes, and each DM correspond to different weights for these attributes. In typical situations, weights can be assigned by experts, such as doctors. This situation is attributed to the varied opinions of doctors on critical criteria that influence focus on patients. This situation is incompatible with the objectives of different layers of criteria. Therefore, prioritisation of patients with the most urgent condition is difficult. In addition, a server might prefer ECG, SpO2, BP sensors, diastolic BP and non-sensory measurement (text frame) over other features. By contrast, developers of this software may target different attributes. The ranking of patients with chronic heart diseases, particularly in software development, is a multi-attribute measurement problem.

Weights can be assigned to solve this problem in several ways through three scenarios. Firstly, weights are set according to the objectives of doctors. This scenario is recommended for researchers and academics who plan their evaluation and prioritisation. Secondly, multiple weights are assigned to evaluate and prioritise patients under different circumstances. In this instance, doctors are required to set different weights that represent different cases. Finally, three to six evaluators are used to set the preference weight. This scenario is recommended for servers that rank patients according to the number of emergency cases; that is, those with the most number of emergency cases represent high priority levels, whereas those with the minimum number of emergency cases acquire low priority levels. Notably, different evaluators exhibit different preferences, which create different ranking schemes. The final stage in this process is to consolidate these ranking orders into a final ranking scheme. Several recommended techniques, such as AHP, can be used to measure the weights of the criteria. This technique is highly recommended by researchers. The AHP algorithm generates pairwise comparisons among the criteria. In addition, AHP is a popular MCDM method [67]. A multilayer process of the AHP, that is, MLAHP, is adapted to calculate the weight of each attribute. Subsequently, TOPSIS is used to rank available patients. On the basis of the integration of TOPSIS and MLAHP, the proposed methodology can be used to solve complex multi-attribute measurements and selection problems in various medical diseases. The ranking scheme is shown in Fig. 4 and described as follows:

Fig. 4
figure 4

Integrated TOPSIS and MLAHP model for patient prioritisation

Weight calculation based on MLAHP

Weights are assigned to each basic attribute in the hierarchy of criteria through MLAHP. Each basic criterion is rated in the hierarchy for each patient considered for evaluation. MLAHP is used to derive ratio scales from pairwise comparisons, allowing small inconsistencies in judgment because humans are typically consistent. The ratio scales are derived from principal eigenvectors, and consistency index is derived from the principal eigenvalue. The number of required pairwise comparisons is equal to n × (n − 1)/2, where n is the number of criteria used during the evaluation. Evaluators are instructed to complete the comparisons among the criteria. The answers must follow the form designed according to the number of criteria. The MLAHP measurement matrices are processed to obtain weights according to the preference of evaluators. The level of emergency cases with different objectives can be obtained through the assignment of these weights. The MLAHP measurement steps for the weight preferences assumed for each evaluator are shown in Fig. 5.

Fig. 5
figure 5

MLAHP measurement steps

In Fig. 5, E = ECG, S = Spo2 level, B = blood pressure, T = text, R = rhythm, RP = R-R regularity, ST = ST elevation, H = high blood level, L = low blood level, CP = chest pain, SOB = shortness of breath, PAL = palpitation, PIR = patient in rest and Q = QRS. Six doctors who specialise in cardiologist and with >3 years of experience are selected to complete MLAHP. A total of six copies, which includes 19 pairwise comparisons, among all criteria on layer 1 and sub-criteria on layer 2 are presented to the doctors, and their responses to these criteria are obtained. The relative scale (1 to 9) is created to quantify the preferred criteria of the doctors. Each doctor critically analyses these criteria according to their experience and knowledge. After attaining the responses on the pairwise comparisons, four reciprocal matrices are created from the pairwise comparisons; the results obtained from the MLAHP sequence process are listed in Table 4. Eleven features for each doctor are calculated, and the results provide the corresponding relative weight of the criteria.

Table 4 Results of weight calculated for six evaluators

Ranking patients using TOPSIS

The available patient scores are ranked in descending order, and the most urgent patients are prioritised according to TOPSIS (Fig. 5). The aggregate scores roughly classify the patients. Individuals are expected to score the most urgent cases, similar to other ranking options. TOPSIS allocates scores to each alternative (per patient) according to their geometric distance from the positive and negative ideal solutions [44, 45]. The patients are ranked in this method, and those with the most number of emergency cases exhibit the shortest geometric distance to the positive ideal solution and the longest geometric distance to the negative ideal solution, as described in the following steps.

  • 1: Construction of normalised decision matrix

This process attempts to transform the dimensions of various attributes (vital features) into non-dimensional attributes. It also allows comparisons across the attributes. The matrix (xij)m*n is normalised from (xij)m*n to the matrix R = (rij)m*n by the following method:

$$ {r}_{ij}={\boldsymbol{x}}_{\boldsymbol{i}\boldsymbol{j}}/\sqrt{\sum \limits_{\boldsymbol{i}=1}^{\boldsymbol{m}}{\boldsymbol{x}}_{\boldsymbol{i}\boldsymbol{j}}^2}\kern0.5em $$
(1)

This process will generate a new matrix R, which is shown below

$$ R=\left[\begin{array}{cc}\begin{array}{cc}{r}_{11}& {r}_{12}\\ {}{r}_{21}& {r}_{22}\end{array}& \begin{array}{cc}\dots & {r}_{1n}\\ {}\dots & {r}_{2n}\end{array}\\ {}\begin{array}{cc}\vdots & \vdots \\ {}{r}_{m1}& {r}_{m2}\end{array}& \begin{array}{cc}\vdots & \vdots \\ {}\dots & {r}_{mn}\end{array}\end{array}\right] $$
  • 2: Construction of MLAH-based weighted and normalised decision matrix

In this process, the weights for each attribute are calculated according to the MLAHP model. A set of weights w = w1, w2, w3, ⋯, wj, ⋯, wn from the DM are integrated into the normalised decision matrix. The resulting matrix can be calculated by multiplying each column of the normalised decision matrix R with its associated weight wj. Moreover, the set of weights is equal to 1.

$$ \sum \limits_{j=1}^m{w}_j=1 $$
(2)

This process will produce a new matrix V, such that

$$ \mathrm{V}=\left[\begin{array}{cc}\begin{array}{cc}{v}_{11}& {v}_{12}\\ {}{v}_{21}& {v}_{22}\end{array}& \begin{array}{cc}\dots & {v}_{1n}\\ {}\dots & {v}_{2n}\end{array}\\ {}\begin{array}{cc}\vdots & \vdots \\ {}{v}_{m1}& {v}_{m2}\end{array}& \begin{array}{cc}\vdots & \vdots \\ {}\dots & {v}_{mn}\end{array}\end{array}\right]=\kern0.5em \left[\begin{array}{cc}\begin{array}{cc}{w}_1{r}_{11}& {w}_2{r}_{12}\\ {}{w}_1{r}_{21}& {w}_2{r}_{22}\end{array}& \begin{array}{cc}\dots & {w}_n{r}_{1n}\\ {}\dots & {w}_n{r}_{2n}\end{array}\\ {}\begin{array}{cc}\vdots & \vdots \\ {}{w}_1{r}_{m1}& {w}_2{r}_{m2}\end{array}& \begin{array}{cc}\vdots & \vdots \\ {}\dots & {w}_n{r}_{mn}\end{array}\end{array}\right] $$
  • 3: Determining ideal and negative ideal solutions

In this process, two artificial patients, namely, A* (Most critical case for each criterion) and A (Least critical case for each criterion), are defined as follows:

$$ {A}^{\ast }=\left\{\left(\left(\underset{i}{\max }\ {v}_{ij}\left|j\in J\right.\right)\left|i=1,2,\dots, m\right.\right)\right\}=\left\{{v}_1^{\ast },{v}_2^{\ast },\dots, {v}_j^{\ast },\cdots {v}_n^{\ast}\right\} $$
(3)
$$ {A}^{-}=\left\{\left(\left(\underset{i}{\min }\ {v}_{ij}\left|j\in J\right.\right)\left|i=1,2,\dots, m\right.\right)\right\}=\left\{{v}_1^{-},{v}_2^{-},\dots, {v}_j^{-},\cdots {v}_n^{-}\right\} $$
(4)
  • 4: Calculation of separation measurement according to Euclidean distance

In this process, separation measurement is performed by calculating the distance between each patient in V and the ideal vector A* through Euclidean distance, which is provided by the following:

$$ {S}_{i^{\ast }}=\sqrt{\sum \limits_{j=1}^n{\left({v}_{ij}-{v}_j^{\ast}\right)}^2},\kern0.5em i=\left(1,2,\cdots m\right) $$
(5)

Similarly, separation measurement for each patient in V from the negative ideal A is provided by the following:

$$ {S}_{i^{-}}=\sqrt{\sum \limits_{j=1}^n{\left({v}_{ij}-{v}_j^{-}\right)}^2},\kern0.5em i=\left(1,2,\cdots m\right) $$
(6)

At the end of step 4, Si* and Si for each patient are counted. These two values represent the distance between a patient and both the ideal and negative ideal.

  • 5: Calculation of closeness to the ideal solution

In this step, the closeness of Ai to the ideal solution A* is defined as

$$ {C}_{i^{\ast }}={S}_{i^{-}}/\left({S}_{i^{-}}+{S}_{i^{\ast }}\right),\kern0.75em 0<{C}_{i^{\ast }}<1,\kern0.5em i=\left(1,2,\cdots m\right) $$
(7)

Evidently, Ci* = 1, if and only if Ai = A*. Similarly, Ci* = 0, if and only if Ai = A

  • 6: Ranking patients according to their closeness to the ideal solution

The set of patient Ai can now be ranked according to the descending order of Ci*. High values indicate optimal performance. A patient who is near the high record and far from the worst record is an emergency case and must be given the highest priority level.

Decision-making contexts

Two main decision-making contexts are emphasised: decision making based on a single individual DM and group decision making based on multiple DMs. Multi-criterion group decision-making (GDM) is a situation in which individuals collectively select from the alternatives before them. The decision is no longer attributable to any single individual who is a member of the group because all individuals contribute to the outcome. GDM techniques systematically collect and combine knowledge and judgment of experts from different fields. In a group context, each expert provides his/her judgment to the criteria that require subjective judgment. In addition, the weight of each criterion is also assigned by the same expert. TOPSIS can be extended to a group-decision environment through two methods, namely, internal and external aggregation [76]. Internal aggregations aim to apply the aggregation process at the separation stage. In this case, the separation is a group separation by aggregating different decision values of distances to positive and negative ideals and proceeding to the next process. The mathematical operator in this case is a geometric mean. In external aggregation, the same operators are used to aggregate the total score achieved from each evaluator.

Results and discussion

The proposed system was simulated in JAVA, which possess many desirable features, such as real-time implementation, parallel execution, portability, cross-platform usability and compatibility with different operating systems (e.g. Android, Windows, and Linux).

Figure 6 demonstrates that XAMMP can be used on the server side (Tier 3). XAMPP is a small and light Apache distribution tool that contains the most used web development technologies in a single package. XAMPP is an open-source software program. Each letter in XAMPP represents the following: (X) cross-platform for Web server, (A) HTTP Apache Server, (M) MySQL database, (P) PHP script writing language and (P) Perl programming language. Researchers prefer XAMPP when they develop and test applications because of its content, size and portability [77]. In the simulation, JAVA was used as the programming language.

Fig. 6
figure 6

Block diagram scheme of the simulation architecture of the proposed system

The data sent from Tier 2 to Tier 3 include the vital signs of a user, and such data are used by doctors in Tier 3 to personalise the healthcare services for that user. The improvement in remote triage, prioritising and healthcare services using such data was previously demonstrated. However, the objective of this section is to demonstrate how adding this data affects the total message size and how these data are considered big data in the server (Tier 3). The size of the message sent by the user can satisfy the big data requirement through several stages (number of users, number of requests for a single user per day, telemedicine users and users from certain departments of hospitals, such as ED). Figure 8 shows that data size increased in the healthcare server (Tier 3) for many users per day when the size of the sending message reached 314.647460 KB; the data included three sensory signals, namely, ECG, SpO2 and BP, for 1 min and the text features. When a user updated his or her profile in the hospital server every 5 min, the total size of data in the server for one day nearly reached 91 MB. In addition, nearly 1000 individuals used telemedicine, and 1000 users were physically present in the hospital. As a result, the total size of the data reached almost 2.2 TB for one day. Figure 7 demonstrates the increase in data size of telemedicine users inside and outside of the hospital during disasters and peak times.

Fig. 7
figure 7

Calculation of data size during disasters and peak times

Current technologies introduce new Internet and mobile cellular communication protocols that can easily and successfully accommodate the increase in message size between Tier 2 and Tier 3. However, computing the large scales of data in Tier 3 to extract knowledge on the emergency level of a user remains difficult.

The input dataset used to evaluate the proposed model varied in several aspects, such as gender, age, patient location and available medical history of the patient in the server of the hospital. For all datasets used, males and females constituted 60% and 40% of the dataset, respectively. Moreover, 50% of the patients aged between 40 and 65 years old, 40% were older than 65 years old and 10% were below 40 years old. Approximately 4.5% of the patients used telemedicine, and the remaining patients (95.5%) were distributed in different departments of the hospital. In addition, 50% of the patients had medical records.

In the evaluation, the values for the evaluation metric are presented as follows: ECG sensor, SpO2 sensor, BP sensor, diastolic BP and non-sensory measurement (text frame). These values show the data in three aspects, namely, normal, abnormal and at risk, which are presented according to the data on the abovementioned alignment. An experiment based on the evaluation metric was performed by integrating MLAHP to calculate the weight; TOPSIS was also used to score the patients with chronic heart diseases with respect to urgency. The dataset presented different symptoms defined by doctors. These symptoms were related to chronic heart diseases. The standard ECG, SpO2 and BP datasets from different data packages were used to validate the reliability of the standard dataset [77]. Each package includes the symptoms of chronic heart disease of each user.

Results of TOPSIS decision-making contexts

Two main decision-making contexts are stressed, namely, individual and group decision-making contexts. In this section, the results of the individual and group TOPSIS decision-making contexts are presented in the following subsections.

TOPSIS results of individual context for different weights of experts

Available alternative scores are ranked in descending order based on TOPSIS. TOPSIS allocates the scores to each alternative (per patient) based on its geometric distance from the positive and negative ideal solutions. The patients were ranked based on emergency cases (from the highest to the lowest). According to this technique, the patient with the highest emergency case will exhibit the shortest geometric distance to the positive ideal solution and the longest geometric distance to the negative ideal solution.

According to Table 4, the preference weights revealed the features of scoring the patients from the perspective of experts. Table 4 shows the results of six experts, who stated the importance of the evaluation criteria from the viewpoint of each expert. TOPSIS was used to prioritise 500 patients based on the urgency of their cases through the perspectives of six experts. TOPSIS identify the highest and lowest emergency cases of patients and compare each patient with the positive ideal (highest emergency case) and negative ideal (lowest emergency case). S− and S* represents the closeness of a patient to the lowest and highest emergency cases, respectively.

The weights of the main sources provided by the six experts according to the MLAHP results are shown below:

  • First expert: 48.5% for ECG, 28.9% for BP, 14% for SpO2 and 8.6% for text. The values of the four sub-features of ECG, namely, ST, Rhy, QRS and P to P are 29.6%, 3%, 3% and 12.9%, respectively. Finally, the values for text, which represents the four sub features chest pain, shortness of breath, palpitation and patient at rest, are 3.9%, 1.4%, 2.8% and 0.4%, respectively. Each patient was evaluated according to these weights. Accordingly, TOPSIS ranking results provided by the first expert attained an average of 0.4932 ± 0.1868. The highest and lowest rank values are 0.8230 and 0.0078, respectively.

  • According to the AHP results, the weights of the main sources generated by the second expert were 35.5% for ECG, 31.2% for BP, 26.6% for SpO2 and 6.7% for text. ST, Rhy, QRS and P to P are 6.5%, 2.4%, 13.6% and 13%, respectively. Finally, the values for chest pain, shortness of breath, palpitation and patient at rest are 0.6%, 0.8%, 1.6% and 3.7%, respectively. Each patient was evaluated according to these weights. Accordingly, TOPSIS ranking results provided by the first expert attained an average of 0.4912 ± 0.1517. The highest and lowest rank values are 0.7698 and 0.0117, respectively.

  • According to the AHP results, the weights of the main sources generated by the third expert are 52.4% for ECG, 22.7% for BP, 18.7% for SpO2 and 6.2% for text. The values of ST, Rhy, QRS and P to P are 34.5%, 2.6%, 7.7% and 8.7%, respectively. Finally, the values for chest pain, shortness of breath, palpitation and patient at rest are 0.8%, 3.9%, 1.2% and 0.3%, respectively. Each patient was evaluated according to these weights. Accordingly, TOPSIS ranking results provided by the first expert attained an average of 0.4748 ± 0.2069. The highest and lowest rank values are 0.8402and 0.0057, respectively.

  • According to the AHP results, the weights of the main sources generated by the fourth expert are 54.6% for ECG, 26.8% for BP, 12.5% for SpO2 and 6.1% for text. The values for ST, Rhy, QRS and P to P are 27.4%, 13.6%, 10% and 3.7%, respectively. Finally, the values for chest pain, shortness of breath, palpitation and patient at rest are 2.8%, 1%, 2% and 0.3%, respectively. Each patient was evaluated according to these weights. Accordingly, TOPSIS ranking results provided by the first expert attained an average of 0.4691 ± 0.2239. The highest and lowest rank values are 0.9219 and 0.0056, respectively.

  • According to the AHP results, the weights of the main sources generated by the fifth expert are 45.5% for ECG, 30.6% for BP, 17% for SpO2 and 6.8% for text. The values of ST, Rhy, QRS and P to P are 27.5%, 7.8%, 7.4% and 2.8%, respectively. Finally, the values for chest pain, shortness of breath, palpitation and patient at rest are 0.8%, 0.9%, 2.3% and 2.8%, respectively. Each patient was evaluated according to these weights. Accordingly, TOPSIS ranking results provided by the first expert attained an average of 0.4920 ± 0.2079. The highest and lowest rank values are 0.9416 and 0.0169, respectively.

  • According to the AHP results, the weights of the main sources generated by the sixth expert are 38.8% for ECG, 32.8% for BP, 20% for SpO2 and 8.4% for text. The values of ST, Rhy, QRS and P to P are 14.2%, 14.6%, 5.9% and 4.1%, respectively. Finally, the values for chest pain, shortness of breath, palpitation and patient at rest are 1.1%, 5.3%, 1.7% and 0.4%, respectively. Each patient was evaluated according to these weights. Accordingly, TOPSIS ranking results provided by the first expert attained an average of 0.4929 ± 0.1795. The highest and lowest rank values are 0.9135 and 0.0082, respectively.

TOPSIS sample results of the first and last forty patients after applying the weights provided by the six experts are shown in Tables 11, 12, 13, 14, 15 and 16 in Appendix. After presenting the results of the TOPSIS ranking as an individual context for each expert, describing and discussing the ranking results generated by the six experts and recognising the closeness and variance among them are necessary. The TOPSIS final ranking results of the six experts are presented in Fig. 8.

Fig. 8
figure 8

Overall TOPSIS final ranking for six experts: (a) first group ranking, (b) second group ranking, (c) third group ranking, (d) fourth group ranking, (e) fifth group ranking, and (f) sixth group ranking

As illustrated in Fig. 8 and Tables 11, 12, 13, 14, 15 and 16 in Appendix, the ranking results of the TOPSIS individual context from the six experts are presented. Discussion of the ranking results in the individual contexts is needed to illustrate differences in patient ranking among the six experts. The first five patients with the most emergency cases and the last five patients with the least emergency cases were chosen from the ranking results of each expert for comparison. In other words, the five patients with the most emergency cases of each expert were compared to demonstrate the matches and variances among the ranking provided by the experts. The five patients with the least emergency cases of each expert were compared to show the matches and variances among the ranking made by the experts. Regarding the five patients with the most emergency cases, the ranking results from five of the experts show that patient number 288 as the first patient with the most emergency case. However, the ranking from the other expert allocate patient number 464 as the first patient. The second patient position corresponds to patient numbers 287, 280 and 463, as recommended by three, two and one experts, respectively. The third patient position is allotted to patient numbers 284, 280 and 460, as recommended by three, two and one experts, respectively. The fourth patient position is given to patient numbers 297, 283 and 459, as recommended by two, one and one experts, respectively. The fifth patient position matches patient numbers 286 and 462, as recommended by five and one experts, respectively. Regarding the five patients with the less emergency cases, the first patient position is allocated to patient numbers 66 and 75, as recommended by four and two experts, respectively. The second patient position corresponds to patient numbers 65 and 69, according to four and two experts, respectively. The third patient position is determined for patient numbers 69, 77 and 73, as recommended by two experts for each patient. The fourth patient position matches patient numbers 70 and 74, according to two and two experts as a fourth patient. The fifth patient position is assigned to patient numbers 67 and 75 according to four and two experts, respectively. In summary, the ranking results of TOPSIS in the individual context for six experts are presented, and the first and last five patients with most and least emergency cases are described and discussed. Clearly, the results of the individual context show variances among the ranking from six experts’. Thus, applying a group TOPSIS decision-making context is required to provide patient ranking considering the overall DMs. The following section presents the results of the group TOPSIS decision-making context.

Group TOPSIS with internal and external aggregation

TOPSIS was extended to a group-decision environment through two strategies, namely, internal and external aggregation. Internal aggregations aim to apply aggregation process at the separation stage. For this case, the separation involves group separation by aggregating different decision values for distance positive and negative ideals, and the next process was then performed. Internal aggregation is calculated by the summation values of the negative separation divided by the negative separation values plus the positive separation values for each evaluator, whereas the external aggregation is computed by finding the average summation of the ranking values for each expert, as mentioned in “Ranking patients using TOPSIS” section. The results of the first and last forty patients of group TOPSIS with internal and external aggregation are presented in Table 5 below.

Table 5 Group decision making of TOPSIS with internal and external aggregation (for the first and last 40 patients)

Furthermore, the external and internal aggregation results are illustrated in Figs. 9 and 10.

Fig. 9
figure 9

Results of external aggregation

Fig. 10
figure 10

Results of internal aggregation

In external aggregation, the result of patient ranking attained a mean ± SD average of 0.4855 ± 0.1824, and the internal aggregation of patient ranking achieved a mean of 0.4851 ± 0.1810. The results of mean ± SD for the internal and external aggregation ranking demonstrated the similarity in the ranks estimated by the mentioned methods. For this reason, the external aggregation method will then be considered for validation and evaluation in the next section.

Validation and evaluation

This section describes in detail the validation and evaluation of the proposed work. The validation process is presented in Section “Validation process”, the ranking results are validated objectively and subjectively based on different characteristics in this section. Section “Evaluation process” demonstrates the evaluation process by providing scenarios and checklist benchmarking.

Validation process

Validation is an important measure for many empirical studies to prove the validity and accuracy of results. Two validation processes, namely, objective and subjective validation, were used in this research, as presented in Figure 11, and explained in the following subsections.

Fig. 11
figure 11

Structure of validation processes

Objective validation

At this stage, the final ranking results of the prioritised patients are divided into four equal groups, as in Qader et al. [78]. Each group comprised 125 patients. Mean ± SD is calculated for each group to ensure that the prioritised patients undergo systematic ranking. The results of the first and second groups are presented in Table 6, and the results of the third and fourth are presented in Table 7. The numbers in the tables represent the scores generated by the TOPSIS process, and scores of 25 patients are shown in a single column.

Table 6 Results of the first and second groups (TOPSIS scores)
Table 7 Results of the third and fourth groups (TOPSIS scores)

The data on prioritised patients presented in Tables 6 and 7 above are visualised in graphical formats to further discuss their comparisons. In Fig. 12, Figures A, B, C and D illustrate patient groups 1, 2, 3 and 4, respectively. The ranking results of the four patient groups are combined in Figure E. Initial observation of the ranking results of the four patient groups show that the patient groups are systematically distributed as the ranking results of the second group start from the end of the ranking results of the first group. Furthermore, the ranking results of the third and fourth groups start from the end of the ranking results of second and third groups, respectively.

Fig. 12
figure 12

Results of four groups of patients. a 1st group. b 2nd group. c 3rd group. d 4th group. e Four patient groups

Statistical analysis was performed among the groups after dividing the prioritised patients into four groups. The results are presented in Table 8 and Fig. 13.

Table 8 Results of patients according to groups
Fig. 13
figure 13

Bar chart for mean and standard deviation results of the four groups

The results of statistical analysis for the four groups are presented in Table 8. In the first group, the value is M = 0.716 ± 0.075 with a minimum value of 0.612 and a maximum value of 0.859. The first group was the highest scoring among the four groups because the mean and SD exhibited the highest values. The second group obtained a value of M = 0.539 ± 0.039 with minimum and maximum values of 0.484 and 0.611, respectively. The second group obtained lower scores than the first group but higher scores than the third and fourth groups. The third and fourth groups showed a value of M = 0.436 ± 0.029 and M = 0.250 ± 0.105 with a minimum value of 0.370 and 0.025 and a maximum value of 0.483 and 0.370, respectively. The fourth group demonstrated the lowest scores among the four groups. For each group, the patient with the highest ranking result (with maximum value) acquired a lower value than the patient with the lowest ranking result (with minimum value) in the next patient group. To ensure that significant differences are present amongst the four groups of the prioritised patients, p value is calculated, as shown in Table 9.

Table 9 P Value of four groups

As shown in Table 9, the P value results showed significant differences among the four groups with a P < 0.05. The first group was significantly different from the second, third and fourth groups by 1.99E-12, 6.85E-23 and 1.77E-04, respectively. The second group was significantly different from the third and fourth groups by 1.33E-03 and 6.52E-25, respectively. Finally, the third group was significantly different from the fourth group by 1.33E-03. These results indicated that all the groups are statistically different among one another with a P < 0.05. Thus, the statistical results indicate that the ranking results underwent systematic ranking.

Subjective validation

At this stage, the patient prioritisation results have been validated by a medical committee. To prove the effectiveness of prioritising the patients, we decided to validate the patient ranking through evaluation of vital signs of patients by the medical committee. Asking the expert to intervene with the ranking of 500 patients was difficult in this case. Thus, after discussing with the experts, the medical committee was presented with the first and last five patients from the patient prioritisation list. In other words, rather than judging all patients, the medical committee only evaluate the first five patients with the most emergency cases who require fast response and the last five patients with the less emergency cases who can wait. The experts used their clinical experience and their knowledge to confirm patient ranking and to ensure whether the first five patients on the list are the most emergency cases and medically deserving to be in the highest priority level upon evaluating their clinical symptoms. This method ensures that the last five patients are given the lowest priority level as they demonstrate the less emergency cases. Furthermore, this strategy allows determining if a patient needs to be moved up in the queue when one or more of the vital signs approach critical values. The scenario process that is followed by the three experts to check the patient position within the ranking based on the group decision-making method is as follows.

The first five patients are considered as the patients with the highest priority level, so they require rapid response from the server to receive healthcare services. According to the evaluation of the expert for the overall 10 features used in the proposed method, the first five patients exhibited six common abnormal features (Spo2, BP, shortness of breath, Rhythm, QRS width and ECG ST Elevation) and one common normal feature (peak-to-peak interval). However, the other three features (chest pain, palpitation and patient at rest) demonstrated different medical classifications. Patient 1 showed three abnormal features (chest pain, palpitation and patient at rest), so this patient is ranked first and triaged to the most emergent case. Patient 2 presented with one normal feature (patient at rest) and two abnormal features (chest pain and palpitation). Patient 3 also displayed one normal feature (chest pain) and two abnormal features. Patient 4 also exhibited one normal feature (palpitation). Although patients 2, 3 and 4 demonstrated the same number of abnormal features, they differed in the type of features and obtained different medical assessment and weight for each feature. In other words, patient 2 ranked higher than patient 3 because the weight of the chest pain feature is higher than that of being at rest, and patient 3 was ranked higher than patient 4 because the weight of the palpitation is higher than the weight of chest pain. Finally, two features were abnormal for patient 5 (chest pain and patient at rest). Thus, patient 5 was ranked fifth. The last five patients exhibited six common normal features (SpO2 level, blood pressure, ECG rhythm, QRS width, peak-to-peak interval and ST elevation). However, the other four features (chest pain, shortness of breath, palpitation and patient in rest) demonstrated different medical classifications. Patients 499, 498, 497 and 496 showed only one abnormal feature, which are patient at rest, chest pain, palpitation and shortness of breath, respectively. Although the four patients presented with the same number of abnormal features, patient 498 ranked before patient 499 because the patient at rest feature is of lower importance in terms of medical assessment than the chest pain feature. Patient 497 ranked before 498 because the chest pain feature is lower in importance in terms of medical assessment than palpitation feature. Furthermore, patient 496 ranked before patient 497 because palpitation feature is of lower importance in terms of medical assessment than shortness-of-breath feature.

In summary, based on the actions of experts, the proposed method has suggested and confirmed that the first five patients deserve to be in the high priority levels with the most emergency case. On the other hand, the last five patients deserve to be in the lowest priority levels because they exhibit less emergency cases. Thus, we subjectively indicate that the ranking results are systematically accurate.

Evaluation process

In this section, a number of scenarios are presented to demonstrate situations and cases requiring patient prioritisation methods.

Scenario 1 considers a large scale of patients expected to occur in several aspects, such as population aging, disasters and MCIs reported in a specific area. From one perspective, this area exhibits a large population of elderly patients remotely monitored by their providers. From another perspective, remote patients are critically affected by MCIs and disasters [79]. The server of a hospital or healthcare agency monitoring these patients must assess the situations of patients and prioritise their services and treatments with regard to the urgency of their medical condition. However, for remotely monitored patients, the overwhelming heterogeneous data can cause difficulty in deciding among numerous patients to whom the care must first be given and who follows [80]. Therefore, the prioritisation method must support a multi-criteria ranking that considers the scalability issue and can handle large data. Furthermore, identifying the targeted tier and the environment where the prioritisation process is executed is necessary to provide services and treatments for each prioritised patients, from the most to the least emergency ones.

Scenario 2 demonstrates two or more home patients with a slight difference in their healthcare emergency conditions and who must be prioritised. In this case, the healthcare providers encounter a problem in recognising slight differences from the many vital signs of patients regardless of the time needed for the process to prioritise them at the end. Meanwhile, traditional triage and prioritisation methods allocate patients in a similar scale or category [38]. Therefore, the prioritisation method must consider the smallest difference between two patient records and improve patient prioritisation accuracy. Providing patient order through supporting feature weighing method and multi-criteria ranking is therefore necessary.

Scenario 3 presents two patients with different emergency conditions and who sent requests to the server but at different times. However, the patient who first sends the request suffers from a less urgent situation. In this case, the most urgent case must be prioritised before the previous one, as they deserve to be serviced and treated. Thus, applying FCFS in these cases may jeopardise the life of patients, and the FCFS cannot be used in reality [37, 81]. Therefore, the prioritisation methods must consider the medical condition of patients and support feature weighting method. In summary, three main scenarios showing situations and cases covering patient prioritisation are identified and explained. After describing the scenarios, some issues were recognised and highlighted for each scenario and further required consideration in the patient prioritisation method. These issues are extracted, and the connections between each scenario and the related points are described in Fig. 14.

Fig. 14
figure 14

Relations between comparison issues and scenarios

After describing the scenarios and their related issues, these issues are defined to consider as comparison points in the checklist benchmarking. Three general points must be considered in patient prioritisation methods, namely, targeted disease, whether to support vital signs and chief complaints. Each method must be validated and evaluated. The descriptions of terms on the checklist comparison points are presented as follows:

  • Targeted disease: The disease is adopted as the case study for applying and testing the prioritisation process. Targeting a specific disease helps to directly and accurately identify vital signs and complaints indicating disease severity [23], in contrast with setting the prioritisation process for general illness and diseases.

  • Support vital signs: This point indicates that the vital signs were considered and used in the prioritisation process, considering that vital signs are important in evaluating patient condition [23, 29].

  • Support chief complaints: This point indicates that the chief complaints were taken into consideration and used in the patient prioritisation process because non-sensory data are necessary in remote healthcare monitoring [23].

  • Targeted tier: Identifying the tier involved in the prioritisation process is important because the remote healthcare monitoring and telemedicine architecture embodies three tiers, namely Tiers 1, 2 and 3, which represent sensors, base station and remote server [23, 82].

  • Support scalability: This point shows whether scalability was accommodated and handled. In this study, scalability is defined as the increase in the number of patients. It a challenge in both remote healthcare monitoring and the prioritisation process [23].

  • Remote environment: This point shows whether the patient prioritisation process was executed in the remote environment. The prioritisation process is important to support continuous care of remote patients in a pervasive environment [80]. In a remote environment, the overwhelming heterogeneous data from patients hinder the prioritisation process.

  • Prioritisation method category/order: This shows whether the prioritisation process supports categories or order methods. Categorisation method classifies patients and prioritises them according to prioritisation levels, whereas order method provides a rank for patients according to their emergency situations. Although most triage systems categorise patients into a priority group, the patient order is typically determined using an FCFS principle [38].

  • Feature weighting: This point shows the technique used for feature weighting. A server aiming to provide a score for a patient may assign more weight to the vital features rather than to other features that gain less interest than these attributes. Additionally, the judgments and preferences of experts are important in extracting the weights of the vital signs [33, 37, 83, 84].

  • Multi-criteria ranking: This point indicates whether the study addresses the multi-criteria in the prioritisation process. Patient prioritisation is a complex decision-making problem [26, 37, 41, 42], and the decision is made based on a set of attributes [43].

  • Handling large data: This point concerns the handling of overwhelming data from multiple sources for a large number of patients. Supporting large data is important because overwhelming data can cause difficulty in deciding over numerous patients requiring prioritisation and those that follow [80].

  • Patient prioritisation accuracy: This point indicates the accuracy of patient prioritisation process in representing the ranking of patients according to their urgent situation in consideration of their medical condition. Likewise, accuracy is reflected in the recognition of slight differences in the urgent situation between patients.

  • Validation: This point shows whether a validation was provided, including the methods used for validation.

  • Evaluation: This point shows whether an evaluation was provided, including the methods used for evaluation.

After recognising and defining the checklist comparison issues, the proposed patient prioritisation method is compared with the most relevant study in this research area regarding these specific issues. Based on literature review analysis, the study of Salman et al. [23] is the most relevant remote patient prioritisation study. After describing the comparison issues of the checklist, the checklist comparison between the proposed and the benchmark studies are presented in Table 10.

Table 10 Checklist benchmarking

As presented in Table 10, the checklist benchmarking points are discussed between the proposed and benchmark method. The first three general checklist points, namely, targeted disease, support vital signs and chief complaints, are addressed by both methods. Investigating whether the scenarios covered by these methods address the comparison points of each scenario is necessary to compare the proposed and benchmark methods. Regarding the first scenario, only three issues, namely, targeted tier, support scalability and environment representing 60% of the scenario, are addressed by both methods. The proposed method alone handles multi-criteria ranking and large data. Thus, two out of five main issues related to the first scenario are disregarded by the benchmark method, which indicates that the first scenario is covered by the benchmark method with a percentage of 60%. The proposed method addresses all issues covering the first scenario with a percentage of 100%. Regarding the second scenario, only two issues, namely, targeted tier and prioritisation method category/order representing 40% of the scenario, are addressed by both methods, whereas the proposed method alone addresses feature weighting, multi-criteria ranking and patient prioritisation accuracy. Thus, three out of five main issues related to the second scenario are disregarded by the benchmark method, which means the second scenario is covered by the benchmark method with a percentage of 40%. The proposed method addresses all issues covering the second scenario with a percentage of 100%. Regarding the third scenario, two main issues must be considered, namely, feature weighting and multi-criteria ranking, each of which represents 50% of the scenario. For the benchmark method, the two main issues related to the third scenario are disregarded, indicating that the third scenario is not covered (0%). Meanwhile, the proposed method addresses all issues covering the third scenario (100%). The differences in method coverage for the scenarios are presented in Fig. 15. As shown in Table 10, validation and evaluation are considered in the proposed method, whereas no validation or evaluation is provided in the benchmark method.

Fig. 15
figure 15

Differences in method coverage for the scenarios

As shown in Fig. 15, the proposed method covers all scenarios and their related issues, yielding 100% for all scenarios, whereas the benchmark method does not cover the scenarios, considering that not all issues related to each scenario are addressed, obtaining 60%, 40% and 0% for the first, second and third scenarios, respectively. The advantages and strengths of the issues considered by the proposed method and ignored by the benchmark are the following. Firstly, multi-criteria ranking is crucial for patient prioritisation as the complex decision-making problem and decision making are based on a set of attributes. Secondly, handling large data is important because it facilitates the prioritisation decisions with overwhelming data and helps in deciding who among the numerous patients must first be attended. Third, the feature weighting technique extracts the importance of each source and feature against the others involving judgments from experts to specify a fixed weight for each feature. Lastly, the accuracy of patient prioritisation is important in simultaneous consideration of multiple attributes, namely, vital signs and complaints, with respect to the proper weight assigned for each attribute to score patients based on the most urgent cases and the disregarding FCFS technique. Thus, the proposed method addressing these issues is essential for patient prioritisation. In conclusion, regarding the first, second and third scenarios, the proposed method offers an advantage over the benchmark method with percentages of 40%, 60% and 100%, respectively.

Summary points

What is already known?

  • No particular study was conducted on multi-attribute analysis on triage setting and prioritization for large-scale data of patients with any type of chronic disease in telemedicine during disasters and peak seasons.

  • A multi-attribute sensor for evaluating vital signs and the multi-attribute patients are considered complex problems for the existing approaches. Each decision matrix (DM) is assigned with different weights for these attributes. Other issues include a server, which scores a single type of patient and provides more weight to a certain criterion than other features. Ranking of patients is a multi-attribute problem.

  • Improper triage and prioritization of patients may result to wrong strategical decisions, especially when the patients with emergency case are assigned to a lower than required triage priority, thereby endangering their health.

What does this study contribute?

  • A new real-time approach is developed to prioritize “Large-scale Data” of patients with chronic heart diseases by using body sensor information and communication technology. These patients require urgent attention as indicated by the multi-measurement criterion records in telemedicine during disasters and peak seasons.

  • Integrated multi-layer for analytic hierarchy process (MLAHP) is used to calculate the weight of each attribute for each patient with chronic heart disease. Technique for order performance by similarity to ideal solution (TOPSIS) is subsequently used to score the available alternatives that can be considered. The proposed algorithms are used to solve the complex multi-attribute selection problems in patients with chronic heart diseases.

  • The results of the proposed work are validated and evaluated. During validation, the ranking results are validated objectively and subjectively based on different characteristics. Evaluation is conducted by providing scenarios and checklist benchmarking to assess the proposed and existing prioritization methods. The patients who exhibited the most and least emergency cases obtained the highest and lowest priority levels, respectively, when their scores are identical.

Conclusion

This study presented a new real-time approach in allocating and prioritising large-scale data on patients with chronic heart diseases in a telemedicine environment during disasters and peak times. An integrated model was formulated to evaluate and score the patients according to MLAHP and TOPSIS. A hands-on study was performed, and 500 patients exhibiting different symptoms and emergency levels of heart diseases were separately evaluated according to the following main measurements: ECG, SpO2 and BP sensors and non-sensory measurement, that is, text frame. The patients were then scored based on the measurement outcomes from the integrated methods. The results demonstrated that the subjective and statistical evaluations exhibited similar results such that patients with the most and least emergency cases obtained the highest and lowest priority levels, respectively. An adaptive and integrated decision-making platform for different chronic diseases, such as heart disease, diabetes and abnormal BP, requires further investigation in the future. The platform can be used to designate priority levels to patients, considering the diversity of diseases and emergency levels of each patient. Moreover, increasing the sources, such as, video, audio, image, medical sensors and GPS, for triage for the prioritisation and designation of appropriate emergency levels to patients remains an issue. Finally, the challenge in formulating a single model that can integrate servers and databases from several hospitals for triage and prioritisation requires further studies, analyses and justifications.

Potential future research

The findings of this study can be used in providing healthcare services. In normal scenario of providing healthcare services in telemedicine architecture, the medical center server (Tier 3) is connected with distributed hospitals to control the healthcare services and send the services as a response to client side (Tier 2). Telemedicine architecture has many drawbacks associated with providing healthcare services. These shortcomings can be solved by this study as a future direction of the new designs of telemedicine architecture as provided below:

New design of telemedicine architecture for providing healthcare services within medical center server (Tier 3)

The medical center server monitors and makes decisions for a selected appropriate hospital to provide healthcare services to patients at home (considering that the movement of patient is static within a specific area). Providing healthcare services to patients with chronic heart diseases based on triage level and number of available services in hospitals is a challenging task. On the one hand, the numbers of services vary from one hospital to another, thus requiring the medical center to balance and provide healthcare services after selecting the appropriate hospital. On the other hand, the medical center server can receive vital signs of 500 patients exhibiting different symptoms and emergency levels of heart diseases from the following main measurements: ECG, SpO2, and BP sensors and non-sensory measurement, namely, text frame. Their triage level is calculated by Dempster–Shafer theory as a three emergency levels (risk, urgent, or sick). Hospitals can provide healthcare services to patients with chronic heart disease based on their triage levels. The three healthcare services packages are package 1 (Alarm1), package 2 (Alarm), and package 3 (Direction). Therefore, accurate triage system and integrated platform between triage levels and healthcare packages can solve the major issues by using Dempster–Shafer theory. Hospital selection should be based on triage level and the available services in distributed hospitals. A new design in telemedicine architecture can be adopted by developing a new integrated model to evaluate and score the hospitals according to analytic hierarchy process (AHP) and TOPSIS methods. Such new design can provide healthcare services from appropriate hospital for only one patient. This condition is considered as a limitation in providing healthcare services because the medical center should provide real-time healthcare services to many patients. Another problem is the prioritization of patients. The patients with the most emergency case (for instance, risk level) should be prioritized and provided healthcare services before attending to other patients (urgent and sick levels). Therefore, this study can be used in the new design to solve these major problems. The new design of telemedicine architecture that uses the new prioritization approach is explained in the Fig. 16 below.

Fig. 16
figure 16

New design of telemedicine architecture for providing healthcare services within medical center server (Tier 3)

First, this study can integrate a new design to a medical center to prioritize the patients. After being prioritized, the patients can be provided with healthcare services through three decision matrices based on triage level from an accurate triage system. The dummy data can represent the number of healthcare services in distributed hospitals (12 hospitals as a proof of concept) to complete the full scenario of the decision-making process. Such new design constructs a decision-making matrix within the medical center server based on a crossover of multi-services and hospitals lists. In the decision-making matrix, the hospitals represent the alternatives, whereas the services represent the multi-criteria that are used to evaluate the hospitals according to their numbers of healthcare services. In conclusion, hospitals with many available healthcare services obtain the highest ranking, whereas hospitals with few services obtain low ranking.

New design of telemedicine architecture for providing healthcare services within mHealth (Tier 2) in case of medical center server (Tier3) failure

Selecting a hospital within mHealth to provide healthcare services to patient with chronic heart diseases based on triage level is a challenging task due to many issues. In a normal scenario of telemedicine as explained in section “New design of telemedicine architecture for providing healthcare services within medical center server (Tier 3)”, the medical center server monitors and decides the selection of appropriate hospital and fully controls the telemedicine architecture. Typical healthcare services in telemedicine are based on client–server architecture, where the availability of such structure is a complex issue due to many possible configurations and scalability challenges. Therefore, any disruption to the telemedicine network and medical center server can lead to link outage and potentially to severe consequences. The availability of healthcare services varies from one hospital to another. Therefore, mHealth can connect directly with distributed hospital servers as a backup of first design (section “New design of telemedicine architecture for providing healthcare services within medical center server (Tier 3)”) to identify the available healthcare services of each hospital and ensure the continuity of providing healthcare services in case of any failure in the medical center server. mHealth can calculate the triage level of patients and decide the package services after receiving the vital signs of the patients with heart diseases. Vital signs are measured using ECG, SpO2, and BP sensors and non-sensory measurement, namely, text frame. Their triage level is calculated by Dempster–Shafer theory as a three emergency levels (risk, urgent, or sick). Furthermore, patient location with reference to each hospital is an important factor in hospital selection and in patient life. In this design, the situation of patient location is a frequent movement (dynamic). The new design aims to aid decision-makers in hospital selection based on multi-criteria decision making within mHealth doctor (Tier 2) in a telemedicine environment. This design can construct a DM based on a crossover of patient location/multi-services and hospital lists. In this decision-making matrix, the hospitals represent the alternatives, whereas the services and patient location represent the multi-criteria that are used to evaluate and score the hospitals and their available healthcare. Both patient location and availability of healthcare services in distributed hospitals can be represented by using dummy data to complete the full scenario of decision making. The hospitals can be subsequently prioritized and ranked by using multi-criteria decision-making techniques, namely, multi-analytic hierarchy process (MAHP).

Problem arises when several patients use mHealth and are admitted at the same time in the same hospital. The selected hospital then needs to provide priority healthcare services for the patient with a high emergency level rather than those with other levels. Therefore, this study can solve this problem by using a prioritization approach for the patients in each hospital. The Fig. 17 below illustrates the new design of telemedicine architecture for providing healthcare services within mHealth (Tier 2) in case of medical center server (Tier3) failure, which adopts the new prioritization approach.

Fig. 17
figure 17

New design of telemedicine architecture for providing healthcare services within mHealth (Tier 2) in case of medical center server (Tier3) failure

In the Fig. 17, mHealth (patient) can deal with the distributed hospitals and send a request to receive the data representing their healthcare services. mHealth will identify these services and obtain a copy record of the local database for only available services from each hospital. Hence, the decision-making matrix will rank the hospitals according to the last update of package services (available services) for each local server in the distributed hospitals. mHealth will then select the appropriate hospital and update the services in the database of the selected hospital (local server).