Keywords

1 Introduction

Clinical Decision Support Systems (CDSSs) in Health Care have a long history going back to the 1970s. A CDSS is the computer software designed to contribute to clinical treatments and diagnoses [1]. More precisely, Clinical Decision Support (CDS) can be defined as “a process for enhancing health-related decisions and actions with pertinent, organized clinical knowledge and patient information to improve health and healthcare delivery”, and a CDSS is an implementation of one or more CDS interventions [2]. CDSSs can support many different activities such as diagnosis, therapy, monitoring, or prevention and are used in all kinds of medical domains such as chronic illness, acute care, primary care, and patient advice lines. CDSSs may provide many different services such as access to knowledge, statistical calculations and individual adaptations, recommendations, reminders or alerts to different user groups.

There are two ways to facilitate the CDSSs: the knowledge-based systems and the data-intensive systems. The medical knowledge based systems perform a variety of functions, including knowledge acquisition, knowledge translation, logical reasoning based on evidence-based medical literature and the approved clinical practice guidelines. While the data intensive systems utilize the data from patients’ Electronic Medical Records (EMR). Compared to medical knowledge based way, data-intensive way yields a significant performance overhead.

Through collaborating with hospitals, we have already put many efforts to apply big data and machine learning technologies to deal with health-related data. In [3], a deep learning based information extraction framework is designed to extract medical entities and relations from Chinese EMR. In [4], for the problem of patient’s cost profile estimation, a model based on a patient’s historical visits in EMR is presented. In [5], several deep learning algorithms are proposed to classify the topics of documents in diabetics. In [6], we investigate the influences of different language models on the sentiment classifications for Chinese health forums. In [7], a QA prototype is developed to answer health related questions. Based on these efforts, we decided to implement a system to integrate them to help doctors and hospitals.

This paper provides a unified framework based on knowledge graph to integrate knowledge-based and data-intensive methods, and designs a data-intensive clinical decision support platform - IDS. It follows the process of a patient’s visit. This platform provides a series of clinical decision support services including inquiry, inspection, diagnosis, medication & treatment and prognosis.

This paper is organized as follows. Section 2 introduces the related work. Section 3 provides the integration framework. Section 4 presents the architecture of IDS. Section 5 shows the process of a patient visit in IDS. Section 6 demonstrates the application of IDS. Finally we conclude the paper in Sect. 7.

2 Related Work

In this section, we will discuss the related work from three aspects: knowledge-based CDSSs, data-intensive CDSSs and the application of CDSSs.

Knowledge-Based CDSSs:

Shen et al. [8] described the construction and optimization of the sensitivity and specificity of a decision support system named IDDAP, which is based on ontologies for infectious disease diagnosis and antibiotic therapy. Nazari et al. [9] utilized expert systems and fuzzy logic, and developed an intelligent system, which is capable of diagnosing occurrence of heart diseases. Ohno-Machado [10] considered the update of medical knowledge base and the contextual information.

Data-Intensive CDSSs:

In [11], a Semantic Web-based, multi-strategy reasoning approach is presented and integrates deductive and plausible reasoning and exploits Semantic Web technology to solve complex clinical decision support queries. Beaudoin et al. [12] evaluated the machining learning for CDSS to enhance antimicrobial stewardship programs. Piri et al. [13] analyzed data from more than 1.4 million diabetics and developed a CDSS for predicting DR.

Application of CDSSs:

Greenes et al. [14] discussed how to integrate workflow-oriented models with a user-interactive mode of CDSS. Huberts et al. [15] introduced an innovative structured modeling approach in which model personalization is guided by sensitivity analysis and in which the effect of input uncertainties and model assumptions are considered during model corroboration. Humm et al. [16] described a personalised CDSS for cancer care. Anya et al. [17] presented the design of an awareness environment for cross-boundary clinical decision support in e-health that takes account of the concept of work practice as a design requirement. Bennett et al. [18] provided a substantive review of international literature evaluating the impact of CDSSs on the care of emergency department (ED) patients. Wajid et al. [19] discussed some of the current challenges in designing an efficient CDSS as well as some of the latest techniques that have been proposed to meet these challenges.

Our method combines these two strategies - knowledge-based method and data-intensive method, and will be discussed detailly in the next section

3 Integration Framework

Current CDSSs use either the knowledge-based method or the data-intensive method. There are few works to consider them together. This paper designs a new integration framework which could combine these two methods together.

The simplest way is to use them separately in one system. For a decision problem, if there are enough features in the input data, the knowledge-based method will be used. Otherwise, the data-intensive method will be used. These two methods can also be used at the same time, thus the final output will be the weighted average of the outputs from them.

In the knowledge-based method, the foundation is the medical knowledge base. While in the data-intensive method, the foundation is the medical sample base. The key to combine these two methods is how to connect the medical knowledge base and the medical sample base.

We propose a unified framework to connect the medical knowledge base and the medical sample base. Base on this framework, the medical knowledge base can be expanded and the medical sample base can be improved as well. So that the decision support efficiency can be improved. Specifically, it includes two aspects:

  • The medical sample base supports the medical knowledge base: Based on a large number of samples, the framework can discover new rules and even discover problems within the original rules, so that the medical knowledge base can be constantly expanded and verified.

  • The medical knowledge base supports the medical sample base: The relationships between the features of one sample are very important, which can contribute to both the selection of learning models and the adjustment of their parameters. Those relationships between features may be revealed by the medical knowledge base.

In order to support the above two aspects, we use the knowledge graph as a link between the medical sample base and the medical knowledge base. Specifically, we extract the triples from the medical sample base to form a knowledge graph. We perform rule detection in the knowledge graph, abstract those rules into knowledge and add them to the medical knowledge base. Meanwhile, we could apply the medical knowledge base to the knowledge graph, i.e., we can give more semantics to the nodes and edges as well as create more connections between the nodes. Moreover, those semantics and connections can be mapped to the relationships among the features of the data from the medical sample base.

4 Platform Architecture

Guided by the integration framework, we developed IDS - a data-intensive clinical decision support platform, whose architecture is shown in Fig. 1. It has eight modules: data integration (①), knowledge fusion (②), medical sample base (③), medical knowledge graph (④), medical knowledge base (⑤), model integration (⑥), rule reasoning (⑦) and IDS services (⑧).

Fig. 1.
figure 1

IDS Architecture

The main functions of IDS are described below:

  • Data integration: It includes four parts: the raw data storage, the data standard management, the data cleaning and the data management.

  • Knowledge fusion: Based on the open knowledge bases in various fields and the international mainstream standards such as ICD10, this module performs knowledge fusion on various knowledge repositories.

  • Medical sample base: It standardizes the records from the data integration and models those records based on the knowledge graph.

  • Medical knowledge graph: It includes rule detection, relation management, feature relation detection, alignment in medical knowledge base, entity management, alignment in medical sample base and labeling & extract.

  • Medical knowledge base: It includes code management, metathesaurus management, semantic network management, specialist lexicon management, and new concept/relation review.

  • Model integration: It includes model training, model management and model release. The whole process includes model selection, model training, model evaluation, parameter tuning and model persistence. All of the models are stored in the model library.

  • Rule reasoning: For certain specific area such as cardiovascular and diabetes, it performs a rule reasoning based on customized views from the medical knowledge base.

  • IDS Services: It packs the selected models from model management & release module. With the packed models, the platform will provide web services that follow restful standards. It includes chief complaint input formalization, intelligent diagnosis, disease-based intelligent recommendation, similar case matching, re-hospitalization risk prediction.

5 The Process of a Patient Visit in IDS

Around the process of a patient visit, this platform provides a series of clinical decision support services - inquiry, inspection, diagnosis, medication & treatment and prognosis. This process is based on the models which are trained by the data from the medical sample base, or the rules in the medical knowledge base (see Fig. 2). Due to page limit, here we only discuss the implementation based on model integration.

Fig. 2.
figure 2

The process of IDS

5.1 Inquiry

This step includes two functions. The first one is to help doctors regulate their chief complaints and remind doctors of key information that is easy to ignore (see Table 1). The second one is to match the most similar case according to the chief complaint.

Table 1. Chief complaint input formalization.

5.2 Diagnosis

  1. (1)

    Intelligent diagnosis. The input of this function is the structured chief complaint data, and the output is disease risk ranking, e.g.:

    1. 1.

      [ACS cardiogenic chest pain 80%]:

      1. 1.1

        (STEMI 60%)

      2. 1.2

        (Aortic dissection 20%)

    2. 2.

      [Non-ACS cardiogenic chest pain 15%]:

      1. 2.1

        (Arrhythmias 10%)

      2. 2.2

        (Ischemic cardiomyopathy 5%)

      3. 2.3

        (Coronary heart disease xx%)

    3. 3.

      [Non-cardiogenic diseases 5%]:

      1. 3.1

        (Respiratory diseases 3%)

      2. 3.2

        (Digestive system diseases 1%)

      3. 3.3

        (Nervous system disease 1%)

  2. (2)

    Cardiovascular diseases risks ranking. The input of this function is the dynamic or static image files and the output is the disease risk ranking.

  3. (3)

    Similar case matching. The input of this function is the structured chief complaint, inspection result and diagnosis. The output is based on the EMR from hospitals that is stored in the medical sample base, as described above.

5.3 Inspection

  1. (1)

    Intelligent inspection items recommendation. The input of this function is the disease risk ranking and the output is the suggested inspection e.g.:

    • [Sign detection]:

      • (blood oxygen xx%) (blood pressure xxx/xxx mmHg) (ECG)

    • [Blood test]:

      • (creatinine xx umol/L) (BNP xx pg/ml ) (PT-INR x) (creatinine protein xx ng/ml)

  2. (2)

    Similar case matching. The input of this function includes the structured chief complaint and inspection result. The output is based on the EMR from hospitals that is stored in the medical sample base.

5.4 Medication and Treatment

  1. (1)

    Intelligent medication & treatment recommendation. The input of this function is the structured chief complaint, diagnostic result and inspection result. The output is the medication and medical treatment advice, e.g.:

    • Medication advice:

      • [Ant platelet]: (Aspirin)

      • [Anti-angina]: (Nitroglycerin)

      • [Sedative]: (Morphine)

      • [Heart rate, blood pressure]: (Atropine) (Dopamine)

      • [Surgical treatment]: (Interventional surgery)

    • Medical treatment advice:

      • [Conventional disposal]: (CPR) (Oxygen absorption) (Defibrillation)

      • [Surgical treatment]: (Interventional surgery)

      • [Unable to dispose of]: (Enter a high level of specialist treatment)

  2. (2)

    Similar case matching. The input of this function contains the structured chief complaint, diagnostic result and inspection result. The output is based on the EMR from hospitals that is stored in the medical sample base, as described above.

5.5 Prognosis

After going through the whole process of a patient visit (inquiry, inspection, diagnosis, medication & treatment), this platform provides the prognosis prediction services. The purpose is to assess the effectiveness of treatment from the perspective of health economics. Specifically this platform uses re-hospitalization risk prediction model to achieve the above purpose. Meanwhile the input of the service is just like the EMR from hospitals which is stored in the medical sample base. The output of the service is the days of re-hospitalization.

6 Application of IDS

6.1 Motivation

At present, the quality of primary health care in China is at a relatively low level. Many village doctors need professional guidance from experienced doctors who are very busy themselves. CDSS could help village doctors to reduce their misdiagnosis rate. We applied IDS to support the village doctors.

6.2 Implementation

The application was developed using Hadoop, Spark, Scale and Java. Figure 3 shows an example of decision support for diagnosis. The input of this interface contains three parts:

Fig. 3.
figure 3

Example of User Input Interface

  1. 1.

    The upper left part of the figure is the chief complaint input area, such as sex, age and weight.

  2. 2.

    The lower left part of the figure is the current medical history input area which is text input.

  3. 3.

    The right part of the figure is the inspection result input area, such as blood oxygen, blood pressure, creatinine, BNP, PT-INR and creatinine protein.

The output is disease risk ranking (see Fig. 4).

Fig. 4.
figure 4

Example of Result Interface

7 Conclusion

This paper proposes IDS, a data-intensive clinical decision support platform based on knowledge graph. This paper designs a new integration framework which connects the medical knowledge base and the medical sample base. Under this framework, the medical knowledge base can be expanded and the medical sample base can be improved as well. Therefore, the decision support efficiency can be improved. Following the process of a patient visit, this platform provides a series of clinical decision support services. The platform has been integrated in an application for village doctors.