Keywords

1 Introduction

The emergence of large-scale clinical and administrative data repository’s, or “big data”, has provided nurse leaders tremendous opportunities but in the face of enormous challenges. Today’s nurse leaders are literally inundated with data. From clinical documentation in the electronic health record (EHR), to publicly reported outcomes, to business intelligence reports, nurse leaders are awash in a tsunami of data. Clearly the success of nurse executives in the future will depend on their ability to manage this tidal wave of data in a way that meets the Triple Aim of improving quality, improving the patient’s experience and reducing cost.

To be successful, nurse administrators of the future will need to collaborate with experts in the field of data science, and familiarize themselves with different approaches to analyzing big data. These methods include knowledge discovery through data mining and machine learning: techniques that can identify and categorize patterns in the data and predict events such as stroke, heart failure, falls, pressure ulcers and readmission to the hospital. They can also predict which staffing model is most likely to yield the best outcomes for a specific patient population.

Equally important will be nurse administrator’s ability to communicate real time system performance through data visualization dashboards used to report clinical analytics and business intelligence. Fortunately today’s health systems are starting to realize the vast opportunities that big data and data science can unlock. This chapter focuses on the state of big data and data science and what is possible related to nursing leadership.

2 Defining Big Data and Data Science

The terms big data and data science are often used interchangeably. However, there is a fundamental difference between the two areas. Data Science is an interdisciplinary field that seeks to capture the underlying patterns of large complex data sets and then program these patterns (or algorithms) into computer applications. An example of a computer application using a programmed algorithm is the Modified Early Warning (MEW) system which predicts how quickly a patient experiencing a sudden decline receives clinical care. The algorithm uses six factors to predict a MEWs score: respiratory rate, heart rate, systolic blood pressure, conscious level, temperature, hourly urine output (for previous 2 h) (AHRQ Innovations Exchange Webpage 2016).

Big data refers to data sets too large or complex for traditional data management tools or applications (Big Data 2015). Said another way, big data encompasses a volume, variety, and velocity of data that requires advanced analytic approaches. So, in this chapter, when we refer to big data, we are referring to the combination of structured, semi-structured and unstructured data stores and the special analytic approaches required to harness, to analyze, and to communicate insights from these data. Big data in contrast to data science looks to collect and manage large amounts of varied data to serve large-scale web applications and vast sensor networks.

3 Nursing Leader Accountabilities and Challenges

The chief nursing officer is a member of the executive leadership team of the organization. As such, this leader has accountabilities to both the organization and to the profession. At each level of nursing leadership, nurse leaders balance fiduciary and ethical accountabilities to the employer with accountabilities for the professional practice of nursing. This balancing act inevitably leads to some difficult decisions about deployment of scarce or expensive resources, continuing or discontinuing low volume services, or optimizing the model of care delivery to achieve best outcomes at the lowest cost.

4 Systems Interoperability

To meet these accountabilities, nurse leaders use data, lots of data, from many different types of data systems. Generally these data systems can be classified as financial, human resources, operational and clinical. Figure 7.1 summarizes the kinds of data nurse leaders frequently use from these different types of data systems.

Fig. 7.1
figure 1

Data-drive decision making:Traditional Approach

These systems have varying levels of sophistication within a given organization but typically the systems are not interoperable. This means nurse leaders are often analyzing financial data first, followed by human resource data, then operational data, and finally clinical data. Nurse leaders then manually synthesize these separate analyses to make decisions. Figure 7.1 depicts this process.

5 Non-Standardization

To further complicate this traditional approach to data analysis, the data systems may have different data definitions and different time periods that hinder cross-functional analysis. For example, financial systems may be tied to a fiscal calendar with well-defined month-end close activities. Human resource systems may be based on bi-weekly pay periods that do not coincide with the monthly fiscal calendar. Allocating external agency staff hours and costs to the department, shift, and patients across these two data systems for operational or clinical analyses is difficult and may result in different values and conclusions depending on which data source is used. For example, a “day” in the financial system is usually a 24-h period that begins at midnight and ends at 2359 (using the 24-h clock). A “day” in an operational scheduling system is usually defined by shift start and end times. So, the number of patient days or patient encounters in a given department on a specific day may vary, depending on which definition of “day” is used.

6 The Invisibility of Nursing

Financial data poses yet another challenge for nursing leaders. While financial information systems are usually the most mature information systems in the organization, nursing financial information is frequently not cataloged in the systems in a manner that enables analysis in relation to patient outcomes, operational changes or specific nurses. Lags in accounting processes for contract labor and lack of alignment with shift times were previously discussed. The inability to assign individual nursing practitioners to individual patients in financial systems is a significant hindrance to understanding the amount of nursing time each patient is consuming and therefore estimating the cost of care for each patient. This type of analysis is important to understanding the value of nursing care.

In addition to these technical challenges, traditional approaches to data analysis have some significant limitations for nurse leaders. Most importantly, these analyses often lack variables that are important to nurse leaders. Variables such as patient care requirements, nurse competence levels, departmental structure and care processes are often the phenomena of most interest to nurse leaders but are not reflected in the traditional data sources in most organizations. Secondly, traditional data sources may not be timely enough to inform operational decision-making. Many traditional data systems produce metrics on biweekly, monthly or quarterly bases. Cross-functional analyses that include clinical outcomes, such as infection rates, may be several months removed from the implementation of the novel clinical process that is being evaluated. Thirdly, many of the variables in the traditional systems are not actionable for the nurse leader. Clinical and human resource metrics calculated at the organizational level, such as pressure ulcer and turnover rates, do not identify the specific departments that need leadership intervention. Finally, the manual nature of the cross-functional analysis may systematically under- or over-estimate the interactions between the variables in the different systems because it relies on the judgment of the nurse leader.

Nurse leaders also expend significant effort helping non-nursing leaders within an organization understand the value of nursing to the organization, in terms of both patient outcomes and financial outcomes. The ability to measure the value of nursing in relation to the outcomes produced and the cost of that production is critical in an era of value-based health care (Pappas 2013). Pappas cites a need for a reporting framework that combines cost and quality as an important first step in documenting the value of nursing to outcomes and error avoidance. Pappas also identifies a need for data that makes tangible some of the intangible work of nursing, such as surveillance. Big data and data science offer techniques for quantifying or datafying and communicating (Mayer-Schonberger and Cukier 2013) concepts that have traditionally been intangible or subjective, giving nurse leaders a new language for expressing the value of nursing to decision-makers at all levels within the organization. These include concepts such as ‘complexity of care’ when discussing patients or ‘novice to expert’ when discussing nurses.

7 A Common Data Repository Across the System

The chief nursing officer (CNO) role has traditionally been specific to an organization. Thus the data used by the CNO has been confined to data from a single facility. The growth of health systems, and system CNOs (Englebright and Perlin 2008), has created an avenue for aggregating multi-facility data and engaging in big data. The desire to harness the power of big data to answer nursing questions will require nurse leaders within and across organizations to adopt national data standards and to explicitly define data models to generate sharable and comparable data. This is the first for step toward realizing the promise of big data for nursing.

8 The Value of Big Data for Nurse Leaders

Clancy and Reed (2015) discuss the data tsunami that is facing nurse executives. Traditional data sources are being supplemented with new data from sensors on equipment, patients and staff within the care environment, with social media interactions, and with retail information. Finding patterns and correlations among these diverse data sources is the province of big data and data science.

Big data and data science promise to answer some of the most perplexing questions facing nursing leaders today. What is the optimal way to organize and staff a patient care service? Big data provides a way to combine large amounts of different types of data to answer complex questions. Traditionally nurse leaders have had to ask financial questions of financial systems, human resource questions of human resource systems, clinical questions of clinical systems, and operational questions of operational systems. Answering these questions for an emergency department would require analyzing volume, payer mix, revenue and cost in the financial system; payroll costs, turnover, skill mix and staff satisfaction in the human resources systems; patient outcomes in clinical systems; and patient arrival patterns and throughput times in operational systems. Nurse leaders are then forced to use their knowledge, experience and intuition to make a judgment on the best way to organize and staff the department. Data science approaches can help by combining large amounts of data from multiple data sources into a coherent analysis and creating predictive models that provide the foundation for evidence based executive decision-making.

Big data and data science would embrace all these traditional data sources but might also include new data sources such as website hits searching for information on flu symptoms during flu season to anticipate a spike in volumes or police reports of accidents near the emergency department. Big data can search for correlations that might not be apparent to nurse leaders, such as the operating hours of nearby businesses. From this data, a model could be derived that predicts patient arrival patterns and care needs. This model could be used for long range planning, but can also be adjusted in near-real time to help the front line manager adjust staffing to be ready to meet the needs of patients that are about to present to the emergency department. This is big data and data science at its best, combining a large volume of highly variable data at high velocity.

9 The Journey to Sharable and Comparable Data in Nursing

Sounds almost too good to be true. So, what’s the catch? Big data is a group activity and healthcare is still largely an industry of individual or small providers who take great pride in their individuality. To be most effective, big data should rest on a foundation of standardized, coded data elements. SNOMED, LOINC, and ICD10 are well-known data standards in healthcare. However, these taxonomies do not address many of the concepts important to nursing (Rutherford 2008).

Attempts to map the domain of nursing knowledge have been underway for many years. The American Nurses’ Association currently recognizes twelve standardized nursing languages, which are summarized in Table 7.1 (American Nurses’ Association 2012). The Health Information Technology Standards Panel (HITSP) recognized two of these terminologies, Clinical Care Classification System (CCC) and the Omaha System, as meeting the criteria required for the harmonization efforts undertaken as a component of the American Recovery and Reinvestment Act and Meaningful Use Electronic Health Record (EHR) Incentive Program (HITSP 2009; CMS 2016).

Table 7.1 Currently recognized nursing languages

The goal of the significant investment in health information technology funded by the Health Information Technology for Economic and Clinical Health (HITECH) Act was to create the infrastructure for sharable comparable clinical data that promises to improve care processes and care outcomes (Payne et al. 2015). The detailed data now available in the electronic health record offers the promise for more detailed, more specific and more actionable indicators of nursing effectiveness. However, the ability to apply big data techniques to nursing clinical data is limited by the lack of adoption of standard terminologies that are essential for comparable and sharable data. Currently, there is no requirement that healthcare providers or health information technology vendors use these standardized nursing languages. Adoption is voluntary and uptake has been slow across the industry.

The nursing profession has been working toward shareable and comparable data to describe nursing practice for many years. In 2004, the National Quality Forum endorsed National Voluntary Consensus Standards for Nursing-Sensitive Care; see Table 7.2 (National Quality Forum 2004). This set of 15 voluntary metrics included patient-centered outcome measures, nursing-centered intervention measures, and system-centered measures. These metrics are incorporated into the National Database of Nursing Quality Indicators™ (NDNQI®) (Montalvo 2007) and are considered by the American Nurses Credentialing Corporation (ANCC) in the Magnet Recognition Program®. Over 2000 hospitals participate in NDNQI (Press Ganey 2015), submitting standard data elements, generating benchmarks, and stimulating learning and improvement.

Table 7.2 National Quality Forum nurse-sensitive measures

Nurse leaders may be able to learn much about nursing practice from analyzing data from electronic health records. Data generated by nurses comprises a significant portion of the information in the EHR. Yet, most of this data has not been entered into the EHR in a standard format that allows it to be reused, shared, analyzed and compared. Kaiser Permanente and the US Department of Veterans Affairs worked together to compare patient data from the electronic health record related to pressure ulcers (Chow et al. 2015). They created a prototype of a common nursing information model with standard terms that allowed the two organizations to share and compare patient data to improve care coordination and quality of care. The process they used to standardize nursing concepts and assign codes using Clinical LOINC and SNOMED CT was both rigorous and onerous. Chow et al. (2015) cite the challenges of heterogeneous EHR systems, architectural limitations, and lack of data harmonization that were encountered in sharing one clinical data set between two organizations.

Clancy and Reed (2015) highlight the importance of data models in organizing data and standardizing how they relate to one another, how they comport to computer fields within the EHR and how they map to standard nursing terminologies. Data models are the “Rosetta stone” that allows data from distinct systems to be combined for analysis without losing the meaning or context of the original intent or meaning. Data models are particularly important for data originating from clinical documentation due to the variability that exists in both technology and philosophy.

10 Gaining Insight from Data in Real Time

While there are amazing insights to gain from EHR data and from clinical measures of nursing performance, the real value of big data comes from sourcing it from multiple systems to address complex questions such as how to operate a nursing service that generates the best patient and staff outcomes at the lowest cost. Big data and data science provide the tools and techniques to bring together large volumes of data from different types of information systems and to analyze it in novel ways that provide insights and support the evidence based decision-making that has been lacking. When you consider the velocity of big data, you create more than understanding or insight; you can create tools that nurse leaders use to manage nursing services in new ways. No longer do nurse administrators need to wait until the end of the month or the end of the quarter to receive time sensitive reports. Today CNO’s are receiving data in near-real time, enabling point of care application of data insights. In other words, nurse leaders at all levels of the organization can use this data, including the charge nurse in the middle of a busy shift. The trick is how to deliver the output to the user within their workflow so that it adds value and does not detract from the important work of taking care of patients.

11 Strategies for Moving Forward

In 2013, the University of Minnesota School of Nursing began convening an annual meeting of nurses interested in advancing big data and data science applications within the profession. The participants work collaboratively throughout the year to advance key initiatives related to big data (University of Minnesota 2015). One output of the group in 2015 was The Chief Nurse Executive Big Data Checklist, see Table 7.3 (Englebright and Caspers 2016). The checklist identifies three arenas in which nurse leaders can begin to advance the adoption of big data and data science in nursing.

Table 7.3 The chief nurse executive big data checklist

12 Instilling a Data-Driven Culture Through Team Science

The first arena identified on The Chief Nurse Executive Big Data Checklist is instilling a data-driven culture in the organization (Englebright and Caspers 2016). Nurse leaders do this by embedding data into all decision-making processes in the organization, assuring all levels of leaders have access to data, and that data feedback loops to leaders and staff are timely and transparent. The second arena identified on the checklist is developing competencies for understanding and using big data in self and others. This is a rapidly moving target as big data and data science evolves. The final arena on the checklist is creating the organizational infrastructure that ensures nursing and nursing informatics is an important component of data analytics within the organization.

Nurse leaders must connect to a data team in order to engage in big data and data science. In today’s healthcare environment data science is an emerging discipline and hence the question frequently arises as how to form the data team to bring the analytics and technical aspects of this work to fruition. Data scientists are, to read the literature, unicorns, and competition for such talent is fierce. We have found that instead of trying to hire data scientists, it is more fruitful to grow a team.

A critical aspect is to realize that data science or advanced data analytics is distinct from informatics or information technology, it is increasingly part of the operations of the organization. The complexity of the big data and data science questions asked by the leaders of the organization require frequent iteration with the data science team. The necessary players on the data science team starts with a leader who can interact credibly and communicate clearly with both the organizational leadership and the analytical team. That team is composed of nursing domain experts, software developers, user interface specialists, business intelligence developers, statisticians and machine learning experts. They interact with a technical team in information technology (IT) comprised of database architects and engineers, big data architects and engineers, software developers, extract- transform-load (ETL) engineers, testers, and product support. The key for success amid this complexity is communication and teamwork: and like nursing, data science is a team sport.

13 Putting It All Together: An Example

This example illustrates how big data and data science can be applied to drive improved performance on inpatient nursing units. This example shows a progression from descriptive analytics (what happened), through diagnostic analytics (why did it happen), to predictive analytics (what will happen next) and finally to prescriptive analytics (what should I do about it).

A nursing unit is a highly dynamic and complex environment and managing this requires a holistic picture encompassing the most important variables from financial, human resources, operational and clinical domains of performance. The question is which are the most important variables?

13.1 Step 1: Diagnostic Analytics

This project began with a key user analysis in the patient experience domain, identifying those variables correlated, and causative, with high “Willingness to Recommend” and “Overall Patient Satisfaction” scores on patient experience surveys. One theme that emerged was the management of pain. Diagnostic analytics revealed that patients who reported better experiences with pain management had the highest scores on Willingness to Recommend and Overall Satisfaction. This led to the development of indicators of effective pain management practices that could be incorporated into the unit-based dashboard or data portal. A subset of patient experience scores includes patients with catheter associated urinary tract infections.

13.2 Step 2: Diagnostic Analytics

Catheter associated urinary tract infections (CAUTI) are an important, nurse sensitive, measure of clinical quality. In creating a dashboard one might be tempted to display the CAUTI rate in the clinical domain. This is valuable, but is not only in arrears, and a-definitio, unmanageable (the event has already happened) it is also difficult to attribute to a specific unit or nurse as a stimulus for action or improvement. Diagnostic analytics fuses many variables to find the leading variables. Unsurprisingly, the mean indwelling time of urinary catheters on a unit is highly predictive of the CAUTI rate on that unit. So, on a management dashboard we display that mean indwelling time (with navigation down to the details of each catheter and patient) rather than the CAUTI rate. The display of average hours on catheter is both meaningful and actionable for the manager and the nurse and directly correlated to the outcome they are trying to impact, infection rate.

13.3 Step 3: Predictive Analytics

The diagnostic analytics revealed that that urinary catheter indwelling time is predictive of CAUTI.

13.4 Step 4: Prescriptive Analytics

Now that we know that having a urinary catheter indwelling for more than 24 hours is highly predictive of CAUTI, we can simply create a list each shift, ranking patients with catheters by the length of time the catheter has been present and suggesting which need to be cleaned or removed in the present shift. This is actionable for both the manager and the nurse, providing them with tangible steps to take this shift to improve outcomes by driving down CAUTI rates.

This four-step process was repeated for each domain of performance to create a nursing unit dashboard. Figure 7.2 illustrates the first iteration of the dashboard with three domains of performance, clinical, patient experience, and productivity. Still in development are human resources and financial as well as additional clinical indicators. By taking the same approach though all domains areas of nursing performance, identifying key driver variables, displaying them in a consistent way, and combining that with action-enabling tools, we create a truly potent data-driven management approach.

Fig. 7.2
figure 2

Examples of nursing metrics from each type of data system

Big data and data science provide nurse leaders with data that is timely, consistent and relevant and presented in ways that prompt insight and action. This is the realm of clinical decision support and is the effector arm of big data in the clinical setting. The technical components necessary to acquire, process and deliver analytics in support of Clinical Decision Support are significant, most especially when integrating disparate non-clinical sources. Substantial preparation is necessary to ensure success.

14 Conclusions

Big data and big data science are an exciting new frontier for nurse leaders. They offer new tools and techniques for simplifying data-driven decision-making. Nurse leaders operate within a tsunami of data and big data and data science offer strategies for finding the meaningful signals and patterns in the data, allowing the leader to provide focus and direction to the organization with confidence.