1 Introduction

The continuous miniaturization of computational devices and advances in wireless communications have been important drivers for the development of ubiquitous systems. These systems shift the focus of interest from computer technology to users and their needs (Rocha et al. 2011). They are capable of monitoring users and their environments to provide relevant services in a transparent and intuitive way, completely changing the way users interact with systems. This poses new challenges for human–computer interaction (HCI) evaluation in ubiquitous systems.

These issues are even more relevant if we consider that ubiquitous systems are available to users anywhere and at any time, which creates a high risk of users feeling annoyed and overwhelmed by them. No user wants several systems interrupting them everywhere and at any time with irrelevant information (Evers et al. 2014). Considering this scenario, we argue that ubiquitous systems should be delivered to users with the quality of interaction as a priority. Therefore, to properly execute an HCI quality evaluation in ubiquitous systems, it is essential to know which quality characteristics and measures have to be taken into account.

Looking at the international standard Software Quality Requirements and Evaluation (SQuaRE) for any type of system (ISO/IEC 25010 2011), several quality characteristics could be considered to evaluate HCI in ubiquitous systems (e.g., Usability, Freedom from Risk and Context Coverage), and several software measures could be used to evaluate these characteristics. However, considering that ubiquitous systems present a particular type of interaction, namely natural and transparent interaction (Poppe et al. 2007), and particular characteristics, like Context-Awareness and Adaptability (Evers et al. 2014), we believe that new, specific characteristics and measures could also apply in an HCI quality assessment.

We therefore conducted a systematic mapping (SM) study (Petersen et al. 2008) to identify which quality characteristics and measures should be taken into account for ubiquitous systems. The SM is a research method that provides a broad overview of a research area, establishing whether research evidence exists on a topic and indicating the quantity of that evidence (Kitchenham and Charters 2007). Based on the SM study, this paper presents the set of quality characteristics and measures that one should consider when performing an HCI quality evaluation of ubiquitous systems. It is important to point out that we did not find any other work that aggregates characteristics and measures for evaluating ubiquitous systems from an extensive literature review and organizes them using a standard quality model.

The remainder of this paper is organized as follows. Section 2 gives an overview of ubiquitous systems, emphasizing HCI issues. Section 3 describes the research method we used, and Sect. 4 presents the obtained results. In Sect. 5, the results are discussed through a classification of the final set of characteristics according to the SQuaRE quality models. Section 6 presents the threats to validity of the study. Section 7 presents related work and a comparison with our study. Finally, Sect. 8 presents our conclusions and future work.

2 Background

Mark Weiser’s vision of ubiquitous computing is well expressed in his famous quote: “The most profound technologies are those that disappear. They weave themselves into the fabric of everyday life until they are indistinguishable from it” (Weiser 1991). This paradigm thus encompasses the provision of services and information, through a variety of computers, to support users in their everyday tasks. Moreover, this support should happen without users noticing that they are interacting with several technologies.

To achieve this vision, the system should be able to understand the user’s behavior and adapt itself. This is enabled by the context-awareness characteristic, which captures relevant information during the interaction between users and applications and applies it to support users in performing their tasks (Dey 2001). This characteristic allows the system to know, for example, who the user is, where he/she is and what he/she is doing at a given time, which makes it possible to deliver several relevant services to the user.

Adapting the HCI evaluation of ubiquitous systems is even more relevant in this scenario if we consider the following four differences between interaction in traditional systems and in ubiquitous systems, according to Poppe et al. (2007):

  • New possibilities of sensing In traditional systems, user inputs are often provided through hardware devices, such as a keyboard or mouse. In ubiquitous systems, inputs can be captured by sensors (e.g., GPS, accelerometer and magnetometer) without the user noticing, or through voice, gesture and touch. These new sources of input make the interaction more natural.

  • Shift in initiative In traditional systems, the HCI corresponds to an explicit dialogue between the user and the computer, and usually, it is the user who begins the interaction. In ubiquitous systems, dialogues can be initiated by the system itself, given its ability to sense the user, his/her environment and his/her needs.

  • Heterogeneity of physical interfaces Ubiquitous systems can be embedded in several everyday objects. Thus, there is a movement to build ubiquitous systems for both large interfaces, like interactive displays, and small ones, like smartphones and wearable devices.

  • Shift in application purpose Ubiquitous systems focus on the user and on everyday life, whereas traditional systems are, in general, task-based.

In Fig. 1, a ubiquitous system puts a mobile phone into silent mode when it detects the user’s presence at an event like a meeting or a movie session. This system does not use traditional input devices such as a keyboard or mouse; instead, inputs (e.g., activity and location) are captured by both physical and logical sensors (e.g., GPS for location and the calendar for activity) without the user’s perception. Besides that, in this example, the interaction is initiated by the system: the user does not have to take any action. Instead, the system performs an action usually carried out by the user (e.g., putting the mobile phone into silent mode).

Fig. 1 Scenarios in which a ubiquitous application can help
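To make this scenario concrete, the following minimal Python sketch shows one way such a context-aware rule could be implemented. It is purely illustrative and not taken from any of the surveyed systems; the Context and Phone classes and the set of silent activities are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class Context:
    """Context snapshot assembled from sensors (illustrative)."""
    activity: str  # logical sensor, e.g., calendar entry: "meeting", "cinema", "free"
    location: str  # physical sensor, e.g., GPS position mapped to a place: "office", "mall"

class Phone:
    """Stand-in for the device actuator."""
    def set_ringer(self, silent: bool) -> None:
        print("silent mode" if silent else "normal mode")

SILENT_ACTIVITIES = {"meeting", "cinema"}  # hypothetical event categories

def adapt_ringer(context: Context, phone: Phone) -> None:
    """System-initiated adaptation: the user takes no explicit action."""
    # The system substitutes an action normally performed by the user.
    phone.set_ringer(silent=context.activity in SILENT_ACTIVITIES)

adapt_ringer(Context(activity="meeting", location="office"), Phone())
```

Note that the dialogue here is started by the system, illustrating the "shift in initiative" discussed above.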

Besides these differences, Bezerra et al. (2014) mention three challenges for usability testing in ubiquitous systems:

  • Ubiquitous environments have more usability factors that should be evaluated, such as contextual information. Thus, it is necessary to predict all relevant changes in context and analyze when those changes can impact the behavior of the system;

  • Most software measures do not consider the specific factors of ubiquitous applications. It is a challenge to make the usability evaluation of these systems more reliable and to identify measures that account for ubiquitous features in usability tests; and

  • Currently, usability testing methods follow the same activities performed for traditional systems. Research needs to be conducted to elaborate an approach for usability testing with specific tasks and measures to evaluate ubiquitous systems.

Based on all these particularities of HCI in ubiquitous systems and the challenges mentioned above, we were convinced that an adequate quality assessment of ubiquitous applications requires a deep analysis of quality characteristics and measures specific to this type of application and that, to achieve this, we first need to investigate the characteristics and measures already explored in the literature.

3 The research method: systematic mapping

Systematic mapping (SM) is a method to build a classification scheme and to structure a field of interest (Petersen et al. 2008). It is defined as a rigorous, unbiased and auditable procedure for searching the research literature. Systematic mapping studies use the same basic methodology as systematic reviews (Kitchenham et al. 2010), guided by research questions. Nevertheless, the research questions of a mapping study are more general, related to research trends and quite high level, addressing issues such as: which subtopics have been addressed, which empirical methods have been used, and which subtopics have sufficient studies for a more detailed systematic review.

To perform our systematic mapping, we followed a process with the three main activities proposed by Kitchenham and Charters (2007) for systematic studies: (1) planning; (2) conducting; and (3) reporting (see Fig. 2). The definition of the steps of each activity was based on Petersen et al. (2008), Silveira et al. (2011) and Wohlin (2014). The first activity (planning) aims to define the protocol that guides the whole research. The second activity (conducting) aims to execute the defined protocol. In our study, this activity was performed in two phases. In the first one, the selection of primary studies was based on database search, i.e., we used digital libraries to start our search. In the second one, we used the snowballing procedures defined by Wohlin (2014) to complement the set of papers found by the database search, as done by Tahir and Jafar (2011).

Fig. 2 The systematic mapping process

3.1 Planning: definition of protocol

The aim of the planning phase is the definition of a review protocol (the single step in this phase, as shown in Fig. 2). This protocol is composed of the following information:

  A. Research Questions: The aim of our study was to identify the quality characteristics and measures for HCI evaluation of ubiquitous systems. Therefore, we established the following research questions:

RQ1 What quality characteristics have been proposed for ubiquitous systems’ HCI evaluation?

RQ2 What software measures have been proposed for ubiquitous systems’ HCI evaluation?

Knowing that quality characteristics are usually organized in a hierarchical tree [named a quality model (ISO/IEC 25000 2014)] that goes from a generic definition down to measures that allow the product assessment, we established a third research question as follows:

RQ3 Are the characteristics and measures organized in quality models for ubiquitous systems’ HCI evaluation?

  B. Key terms: The key terms were derived from the research questions, identifying the object we were looking for (quality characteristics, measures and quality models), the purpose (HCI evaluation) and the context (ubiquitous systems). After that, some synonyms and alternative words were added to the set of key terms. In the HCI field, standards and research usually discuss assessment, methods and techniques in terms of usability (see, e.g., Nielsen 1994; ISO 9241-11 1998; Sears and Jacko 2009). For this reason, we included usability evaluation as another term. The final key terms are presented in Table 1.

    Table 1 Key terms

Regarding the term “ubiquitous systems,” we avoided using the word “systems,” because we did not want to limit the search to any particular kind of software. Furthermore, we agree with Petersen et al. (2008) that adding specific outcomes is a restriction, whereas a mapping study aims at a broad overview of the research area as a whole. If we had considered only certain types of software (e.g., “systems,” “applications,” “services”), the overview could have been biased and the map incomplete.

We also avoided using the term “mobile” as a synonym of ubiquitous, since not all mobile applications are ubiquitous applications. As the main goal of this research is to find characteristics specifically for ubiquitous systems, we believe that adding “mobile” to our string would have retrieved several papers irrelevant to our research questions. Other SLR studies (Spínola and Travassos 2012; Viana et al. 2014) also do not consider the keyword “mobile” a synonym of ubiquitous.

  C. Search String: Based on the key terms previously presented, the following search string was defined:

    ((characteristic OR measure OR metric OR “quality model” OR framework) AND (HCI OR “human–computer interaction” OR “human–computer interface” OR “user interface” OR interaction OR usability) AND (evaluation OR assessment) AND (ubiquitous OR pervasive))

We used three control papers (Scholtz and Consolvo 2004; Kim et al. 2008; Song et al. 2009), i.e., papers that we expected to appear in the results because we already knew they answered our research questions. Since they were present in the results, the search string was considered validated for executing the systematic mapping.
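As a purely illustrative aid (not part of the original protocol), the snippet below shows how such a string can be assembled programmatically from the key-term groups of Table 1, which is convenient when adapting the syntax to each library’s search engine. The term_groups structure and the build_query helper are our own illustrative names.

```python
# Key-term groups from Table 1: terms within a group are OR-ed,
# and the groups themselves are AND-ed, mirroring the string above.
term_groups = [
    ["characteristic", "measure", "metric", '"quality model"', "framework"],
    ["HCI", '"human–computer interaction"', '"human–computer interface"',
     '"user interface"', "interaction", "usability"],
    ["evaluation", "assessment"],
    ["ubiquitous", "pervasive"],
]

def build_query(groups):
    """AND together the OR-joined term groups into one boolean query string."""
    return " AND ".join("(" + " OR ".join(group) + ")" for group in groups)

print(build_query(term_groups))
```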

  D. Research Sources: To obtain the primary studies, we used two kinds of search: database search and snowballing. For the database search, we selected the most relevant digital libraries used in other systematic studies: ACM Digital Library (http://dl.acm.org/), IEEE Xplore (http://ieeexplore.ieee.org/), Scopus (http://www.scopus.com/scopus/home.url), Science Direct (http://www.sciencedirect.com), SpringerLink (http://www.springer.com/) and Compendex (http://www.engineeringvillage.com/). For the snowballing, we used the backward (i.e., checking the reference lists of the studies) and forward (i.e., checking papers that cited the studies) procedures. For the forward snowballing, we used Google Scholar to obtain a broader search, since the search engines of the databases, such as Scopus and IEEE, limit the search to papers indexed by them. Furthermore, Wohlin (2014) suggests using Google Scholar for forward snowballing.

  E. Study Selection Criteria: We defined the following selection criteria in order to select the most suitable studies:

SC1—The study should be written in English;

SC2—The study should be dated from 1991 or later. We chose this date because the term “ubiquitous computing” appeared in the paper of Weiser (1991), who is considered the father of ubiquitous computing;

SC3—The study should be available on the internet, which means that even if it is not directly available in the digital library, it should be possible to find it through internet facilities;

SC4—The study should present initiatives related to HCI evaluation on ubiquitous applications (no other contexts like desktop, web systems or HCI development); and

SC5—When the same study was published in different papers, only the most complete and recent one was included, as suggested by Silveira et al. (2011) for systematic mappings.

It is important to highlight that no restriction was defined on the kind of publication, which means that all kinds of studies (conference or journal papers, books, book chapters, short and long papers, etc.) were accepted. They were all processed in the same way, considering the above selection criteria.

3.2 Conducting

As presented in Fig. 2, this activity was performed in two phases. The first one is composed of four steps: (1) Conduction of database search, which is performed to find relevant papers in digital libraries using well-defined search strings; (2) Screening of papers; (3) Keyword relevant topics and Data extraction; and (4) Peer review 1.

The second phase is composed of four steps: (1) Conduction of backward snowballing, which implies seeking papers in the reference lists of the papers identified in the first phase of the conducting activity; (2) Conduction of forward snowballing, which implies seeking papers that cite the papers found in the first phase; (3) Data extraction; and (4) Peer review 2. All these steps are described in the next subsections.

3.2.1 Conducting: first phase

3.2.1.1 Conduction of database search

In this step, we searched for papers based on the defined protocol. The selection was done on April 9, 2013. The search string was applied within the search engines (ACM, IEEE, Scopus, Compendex, Springer and Science Direct), and all information about the papers, including titles and abstracts, was downloaded and imported into the StArt tool (Hernandes et al. 2012), a free tool that supports the activities of a systematic review. This tool was selected because it is free and easy to use, and the authors have experience with it. This step retrieved 1170 papers (see Fig. 3): 500 from Compendex (42.7 %), 269 from Scopus (23 %), 268 from Springer (22.9 %), 101 from IEEE (8.6 %), 24 from ACM (2.1 %) and 8 from Science Direct (0.7 %).

Fig. 3 Number of studies per source

3.2.1.2 Screening of papers

This step involved the selection of studies through three filters, described in Fig. 4. The aim of the first filter was to exclude duplicated papers: some papers appeared in several sources, and thus just one instance of each was kept. We identified 302 duplicated papers (26 %) in the initial set of 1170 papers, and the 868 remaining papers (74 %) proceeded to the next filter.

Fig. 4 Screening process

The aim of the second filter was to apply the defined selection criteria by reading the abstracts and titles. This analysis was performed in pairs to avoid bias in the selection process: one researcher reviewed the selection of the other. To that end, we held several face-to-face meetings over one week, in which one peer reviewed the selection of the other and, in case of disagreement, we opened a discussion to reach a consensus. Although this process may seem long, it worked well for all peers, since they had scheduled the meetings in their agendas beforehand to work on the selection process. We rejected 749 papers (86.3 %) and accepted 119 papers (13.7 %).

To apply the third filter, we downloaded the 119 papers and performed a detailed reading. This step was performed by four researchers; two other researchers participated in the review of the papers that raised doubts. The selection criteria were applied once again. As a result, 87 papers were rejected (73 %) and 32 accepted (27 %). Of the rejected papers, 85 were eliminated by selection criterion SC4 and 2 by SC5.

The following 32 accepted papers went to the next step (Keyword relevant topics and Data extraction): (Abi-Char et al. 2010; Cappiello et al. 2009; Chang and Lin 2011; Damián-Reyes et al. 2011; De Moor et al. 2010; Evers et al. 2010; Haapalainen et al. 2010; Iqbal et al. 2005; Jafari et al. 2010; Jia et al. 2009; Kemp et al. 2008; Kim et al. 2008; Ko et al. 2010; Kourouthanassis et al. 2008; Kryvinska et al. 2011; Lee and Yun 2012; Lee et al. 2008; Liampotis et al. 2009; Ranganathan et al. 2005; Ross and Burnett 2001; Rubio and Bozo 2007; Schalkwyk et al. 2010; Scholtz and Consolvo 2004; Sousa et al. 2011; Sun and Denko 2008; Thompson and Azvine 2004; Toch 2011; Wagner et al. 2012; Waibel et al. 2010; Weihong-Guo et al. 2008; Wu and Fu 2012; Zhang et al. 2006).

3.2.1.3 Keyword relevant topics and Data extraction

To finish this phase of the review, we defined the classification scheme, which serves to create our systematic map (the main result of an SM). This classification scheme is composed of at least two facets and is defined by keywording relevant topics in the abstracts of the papers, i.e., searching for keywords and concepts that reflect the contribution of each study. Reading the papers, we identified concepts that reflect the following main contributions: definition of quality models, conceptual frameworks of measures, and quality characteristics. Therefore, we defined the contribution type as one facet of our classification scheme, as shown in Table 2.

Table 2 Contribution type facet

Furthermore, we decided to use the research types of Wieringa et al. (2005) as a second facet (see Table 3). This facet is suggested by Petersen et al. (2008), since it reflects the research approach used in the papers and is independent of a specific topic.

Table 3 Research type facet

Finally, we performed the Data extraction from each selected paper. To that end, we extracted the information necessary to answer each research question, i.e., the quality characteristics presented in the paper (answering RQ1), the software measures (RQ2) and/or the quality models (RQ3). Besides that, each paper was classified according to the defined classification scheme. Table 4 presents the Data extraction form used in our study.

Table 4 Data extraction form
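A minimal sketch of the kind of record such a form produces is shown below. The field names are illustrative rather than the actual form fields, but they mirror the information described above (answers to RQ1–RQ3 plus the two classification facets).

```python
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class ExtractionRecord:
    """One row of the Data extraction form (illustrative field names)."""
    paper_id: str
    characteristics: List[str] = field(default_factory=list)  # RQ1
    measures: List[str] = field(default_factory=list)         # RQ2
    quality_model: bool = False                               # RQ3
    contribution_type: Optional[str] = None  # facet 1 (Table 2)
    research_type: Optional[str] = None      # facet 2 (Table 3)

record = ExtractionRecord(
    paper_id="Scholtz2004",
    characteristics=["Attention", "Trust", "Invisibility"],
    measures=["percent of time user spends switching among foci"],
    quality_model=True,
    contribution_type="Quality Model",
    research_type="Solution Proposal",
)
```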
3.2.1.4 Peer review 1

After the Data extraction, we noticed that several characteristics have the same meaning but are presented under different names. To obtain a suitable set of quality characteristics, we performed an analysis by peer review (see Fig. 5), considering what was described in each paper. Three peer reviewers participated in the process, and the analysis was performed in three steps. First, one researcher read all papers and identified the characteristics and their synonyms. Second, the characteristics and their original definitions from the papers were organized in a document that was peer reviewed by two other researchers, who indicated whether or not they agreed with the synonym identification; if they disagreed, they wrote down a justification and proposed a new organization. Third, a meeting was held to reach a final consensus.

Fig. 5 Peer review 1

3.2.2 Conducting: second phase

3.2.2.1 Conduction of backward and forward snowballing

The snowballing procedure is usually performed from a start set of papers. In our study, the start set corresponds to the 32 papers found in the first phase of the conducting activity. The search was done on January 26, 2016. As previously described, we performed this phase after extracting and analyzing the first set of papers; in this way, we had a consistent and well-defined set of papers to use as the start set for the snowballing.

Moreover, we performed two types of snowballing to find additional papers: backward and forward. Figures 6 and 7 present the detailed procedures to search for additional papers by backward and forward snowballing, respectively. These procedures are adapted from the snowballing process proposed by Wohlin (2014).

Fig. 6 Backward snowballing procedure

Fig. 7 Forward snowballing procedure

For the backward snowballing procedure (Fig. 6), we started by listing all references of the 32 selected papers, obtaining 969 references in total. Then, we applied the two basic criteria from the protocol (i.e., SC1, papers written in English, and SC2, published in 1991 or later) and excluded all references that were only web addresses of research groups, newspapers and/or companies. This reduced the set of references to 877 studies.

Next, we applied the remaining selection criterion (SC4: the study should present initiatives related to HCI evaluation of ubiquitous applications, not of other contexts like desktop or web systems, nor HCI development) by reading the titles. Duplicated studies were also excluded. Of the 877 references, 793 were excluded, which resulted in 84 studies selected for abstract reading. By applying SC4 to the abstracts, only 19 were selected. The fourth step consisted of applying SC4 once more by reading the most relevant parts of each paper. It is worth mentioning that it is not recommended to read the entire paper before the Data extraction; instead, Wohlin (2014) recommends browsing through the paper and reading the most relevant parts in order to make an efficient decision. Following this idea, we browsed each paper looking for the most relevant parts, i.e., those explaining the quality characteristics, the software measures and the kind of system being evaluated. In the end, we obtained three papers for Data extraction.

For the forward snowballing, each of the 32 papers was analyzed based on its citations. We obtained 962 papers that cite at least one of the 32 papers. The same procedure used for the backward snowballing was applied and, in the end, 6 papers were selected for Data extraction.

As a result of these procedures, we obtained 9 papers: 3 from backward snowballing (Chalmers and Sloman 1999; Kim and Lee 2006; Ryu et al. 2006) and 6 from forward snowballing (Karaiskos et al. 2009; Karvonen and Kujala 2014; Sanchez-pi and Carb 2012; Jafari et al. 2011; Santos et al. 2013; Carvalho et al. 2015).
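As a rough illustration of the procedure (an assumed structure, not the tooling we actually used, since all screening was manual), the sketch below expresses one snowballing iteration as a filtering pipeline over candidate papers; the predicate functions stand in for the manual screening steps.

```python
def snowball_iteration(start_set, get_candidates, passes_basic,
                       passes_sc4_title, passes_sc4_abstract,
                       passes_sc4_fulltext):
    """One backward or forward snowballing pass (illustrative).

    get_candidates returns, for a paper, its reference list (backward)
    or the papers citing it (forward); candidates are paper identifiers.
    The predicates stand in for the manual screening steps: SC1/SC2
    first, then SC4 applied at increasing reading depth.
    """
    candidates = {c for paper in start_set for c in get_candidates(paper)}
    candidates = {c for c in candidates if passes_basic(c)}         # SC1, SC2
    candidates = {c for c in candidates if passes_sc4_title(c)}     # titles
    candidates = {c for c in candidates if passes_sc4_abstract(c)}  # abstracts
    return [c for c in candidates if passes_sc4_fulltext(c)]        # relevant parts
```

The set comprehension over the start set also removes duplicated candidates, mirroring the duplicate exclusion described above.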

3.2.2.2 Data extraction

The Data extraction was performed using the same form as in the first phase of the conducting activity (see Table 4). The data from the nine papers were extracted by three reviewers. This extraction considered only quality characteristics related to user interaction, i.e., characteristics that impact the quality of HCI. For example, Ryu et al. (2006) propose characteristics for evaluating ubiquitous systems and middleware; from this work, we extracted only the characteristics related to HCI, not those related to internal quality. In the end, we obtained 52 quality characteristics to be analyzed.

3.2.2.3 Peer review 2

After the Data extraction from the snowballing papers, a new peer review was performed in order to integrate the newly extracted data with the data from the first phase of the conducting activity. The same three reviewers from the first peer review participated in the process, to ensure consistent results. As in the first peer review, one researcher first read all the identified characteristics and their definitions and proposed their integration with the existing set of characteristics. This information was organized in a document that was peer reviewed by the two other researchers (see Fig. 8). The document presented each quality characteristic identified in the snowballing procedures and, when pertinent, its synonym among the 26 quality characteristics identified in the previous peer review. The researchers indicated whether or not they agreed with each proposition; if they disagreed, they wrote down a justification and proposed a new organization. Finally, a meeting was held to discuss the divergences and reach a consensus.

Fig. 8 Peer review 2

The consensus produced the following results: of the 52 characteristics from the Data extraction, 40 were integrated with the existing set of 26 characteristics, 11 were excluded because they were considered not pertinent either to ubiquitous systems or to HCI, and 1 was considered new. As a result, we obtained a final suitable set of 27 quality characteristics for HCI evaluation in ubiquitous systems.

4 Results of the systematic mapping

This systematic mapping found 41 papers that answer the defined research questions. All of them present quality characteristics for evaluating ubiquitous systems; only 23 papers present software measures to evaluate the proposed characteristics, and 3 papers present a model composed of characteristics, similar to the SQuaRE standard ISO/IEC 25000 (2014). Moreover, these papers cover different types of ubiquitous systems, such as mobile applications (Cappiello et al. 2009; Chang and Lin 2011; Damián-Reyes et al. 2011; De Moor et al. 2010), smart homes (Liampotis et al. 2009; Wu and Fu 2012) and whiteboards (Scholtz and Consolvo 2004).

The results of this systematic study are discussed in detail in the following subsections. Section 4.1 discusses the results related to the quality characteristics and proposes a list of suitable quality characteristics for evaluating ubiquitous systems. Section 4.2 presents descriptive statistics about the software measures that have been proposed in the literature; the complete list of all measures found in this literature review, together with the quality characteristics related to some of these measures, is presented in “Appendix.” Section 4.3 discusses the results about the quality models. Section 4.4 presents the systematic map, which is basically two x–y scatter plots with bubbles at the category intersections, where the size of a bubble is proportional to the number of studies in the pair of categories given by the bubble coordinates (Petersen et al. 2008). With this map, it is possible to see several research gaps in this area, which helps researchers direct their efforts.
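For readers who want to reproduce this kind of map, a minimal matplotlib sketch is shown below; the category counts are placeholders, not the actual data of our study.

```python
import matplotlib.pyplot as plt

# Placeholder counts: (research type, contribution type) -> number of studies.
counts = {
    ("Solution Proposal", "Measures Framework"): 13,
    ("Solution Proposal", "Quality Issues"): 7,
    ("Validation Research", "Measures Framework"): 4,
    ("Evaluation Research", "Measures Framework"): 1,
}

xs = sorted({key[0] for key in counts})
ys = sorted({key[1] for key in counts})
for (rtype, ctype), n in counts.items():
    # Bubble area grows with the number of studies in the category pair.
    plt.scatter(xs.index(rtype), ys.index(ctype), s=300 * n, alpha=0.5)
    plt.annotate(str(n), (xs.index(rtype), ys.index(ctype)),
                 ha="center", va="center")
plt.xticks(range(len(xs)), xs)
plt.yticks(range(len(ys)), ys)
plt.title("Systematic map: bubble size ~ number of studies")
plt.tight_layout()
plt.show()
```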

4.1 Quality characteristics proposed for ubiquitous systems’ HCI evaluation

During the Data extraction process, we extracted: (1) characteristics present in the models and conceptual frameworks; (2) characteristics listed in the papers even when they were not organized in a hierarchical way; and (3) issues described in the papers that are important to consider in an HCI evaluation of a ubiquitous system but were not explicitly presented by the papers’ authors as quality characteristics. We treated them all as characteristics in the Data extraction step. As previously explained (see Sect. 3.2.2.2), we identified only characteristics that impact the user interaction with ubiquitous systems. Figure 9 shows the number of quality characteristics presented in each study.

Fig. 9 Number of extracted characteristics per paper

We found 134 quality characteristics in the first phase of the conducting activity, but with some limitations. Some papers propose just a list of characteristics without clearly defining them (Kim et al. 2008; Sousa et al. 2011; Zhang et al. 2006). When they are defined, there is no consensus about them, and several studies use different names for the same concept. For instance, Kim et al. (2008) present the Transparency characteristic, but Scholtz and Consolvo (2004) call it Invisibility and Kourouthanassis et al. (2008) call it Diffusion. Another example is the Context-Awareness characteristic: some studies refer to it as Context Sensitivity (Ranganathan et al. 2005), Contextualization Support (Lee and Yun 2012; Lee et al. 2008), Adaptability (Kim et al. 2008) or even Invisibility (Scholtz and Consolvo 2004). Besides characteristics with the same goal but different names, there are duplicated characteristics that need to be unified; for example, Privacy is cited by Wu and Fu (2012), Abi-Char et al. (2010), Sun and Denko (2008), Scholtz and Consolvo (2004), Jafari et al. (2010), Toch (2011) and Liampotis et al. (2009). After Peer Review 1, these problems were solved and we obtained 26 quality characteristics.

In the second phase of the conducting activity, 52 quality characteristics were extracted. After Peer Review 2, among these 52 quality characteristics, we identified 33 characteristics identical to the 26 existing ones (e.g., Context-awareness, Mobility, Reliability, Privacy), 3 characteristics as synonyms [e.g., Unobtrusiveness, Perceived QoS and Connectivity from Ryu et al. (2006)] and 4 characteristics as parts of other characteristics [e.g., Quality of Context, Natural Interaction Methods, Flexibility and Awareness Support from Ryu et al. (2006)]. Furthermore, 11 characteristics were excluded. Four of them were related to HCI but not pertinent to ubiquitous systems, considering that ubiquitous systems should be transparent and calm and keep the user’s attention on his/her main activities; for example, Controllability, proposed by Ryu et al. (2006), explicitly requires the user’s perception of and interaction with the system, which does not hold when users are interacting with ubiquitous systems. Another 4 characteristics were excluded because they are closer to the measurement level than to the characteristic level [e.g., Bandwidth, Cost, Timeliness and Criticality from Chalmers and Sloman (1999)], 1 does not represent an actual quality characteristic [Support for everyday tasks from Karvonen and Kujala (2014)], 1 is very generic [Accuracy from Ryu et al. (2006)], and 1 is related to internal software quality [Fault Tolerance from Ryu et al. (2006)]. Only one characteristic was considered new and was added to the set [Reversibility from Ryu et al. (2006)]. Therefore, we obtained a final suitable set of 27 characteristics, which are presented in Table 5.

Table 5 Final set of quality characteristics

Usability, along with its subcharacteristics (Efficiency, Efficacy and User Satisfaction), is the most referenced characteristic (13 papers), which reflects the fact that Usability is the most commonly studied characteristic in the HCI area (Ammar et al. 2015). The second most referenced is Context-awareness (12 papers), followed by Transparency (9 papers), Privacy (9 papers) and Mobility (7 papers).

Figure 9 presents the number of characteristics extracted from each of the 41 papers. The papers from which the most quality characteristics were extracted are Kemp et al. (2008), Ryu et al. (2006), Santos et al. (2013) and Sousa et al. (2011). Two of these papers result from the first phase of our systematic mapping (Kemp et al. 2008; Sousa et al. 2011), and the other two result from the backward (Ryu et al. 2006) and forward (Santos et al. 2013) snowballing.

Kemp et al. (2008) propose a set of heuristics to evaluate invisibility and usability in ubiquitous learning systems. Sousa et al. (2011) present a ubiquity measure that takes into account ubiquitous systems’ technical capabilities, which we extracted as quality characteristics. Ryu et al. (2006) present characteristics to evaluate ubiquitous systems and middleware. They propose characteristics from the point of view of users and sensors. Finally, in Santos et al. (2013), we have presented our first results toward the definition of a quality model for HCI evaluation in ubiquitous systems.

Although these papers present most of the characteristics, many of the characteristics have no definition. It is important to clearly define the quality characteristics in order to avoid misinterpretation. For example, Analyzability, Interpretability and Credibility from Sousa et al. (2011), and Device Capability and Network Capability from Santos et al. (2013), have no definitions; thus, it is not possible to clearly understand their meaning from the name of the characteristic alone.

We noticed that several characteristics are generic for any kind of system and are already defined in the general standards of software product quality (ISO/IEC 25010 2011), for example, Reliability, Safety, Trust, Availability, Effectiveness, Efficiency, User Satisfaction, Usability and Security. On the other hand, some characteristics are not present in SQuaRE; they are new ones, which makes us believe they are particular to the ubiquitous systems domain, such as Context-awareness, Mobility, Calmness, Transparency and Attention. However, since all characteristics found in our work are pertinent to the evaluation of ubiquitous systems, and to keep answering the research questions with the same scope as the papers, we decided to keep the whole set of characteristics. An analysis of which characteristics are generic and which are specific, by comparison with the standards, is presented in Sect. 5.

4.2 Software measures proposed for ubiquitous systems’ HCI evaluation

Regarding the identified measures, we should highlight that we did not perform a peer review, because most measures were not clearly defined (in terms of measurement functions and/or quality measure elements) or even properly described in the selected papers, nor applied in practice in a way that would make their meaning understandable. Therefore, we preferred simply to organize them as the list of measures presented in “Appendix,” exactly as they were described in the papers. In this list, the measures are organized according to the characteristics they were defined to evaluate; for example, the “Variety of supported contextual information” measure was defined to evaluate the Context-awareness characteristic. However, the authors of some papers do not explicitly state the quality characteristic, i.e., there is no indication of the characteristic that the measure aims to evaluate.

Of the 41 studies, only 23 presented software measures, from which we extracted 218 measures. Figure 10 presents the number of measures proposed in each study. The papers from which the most software measures were extracted are Scholtz and Consolvo (2004), which proposes a framework of software measures, and Ryu et al. (2006), which provides measures to evaluate ubiquitous systems and middleware.

Fig. 10 Number of extracted measures per paper

The main problems with these measures are the following: (1) most measures do not present any detail about how to compute them (measurement function or measure elements); (2) for twenty-four measures, the papers do not clarify to which characteristics they belong, and these measures need to be better specified to make their meaning understandable (e.g., M126, “User participating degree for bidirectional communication of ubiquitous service while using service,” see “Appendix”); (3) the measures are usually not validated using, for example, case studies or controlled experiments; and (4) most measures are not documented using a more precise model, such as the one defined by Fenton and Pfleeger (1997) or by SQuaRE (ISO/IEC 25000 2014), which defines a structure for describing a measure comprising name, description, measurement function and quality measure elements. Using the concepts from SQuaRE (ISO/IEC 25000 2014), we classified the measures found in our work according to their measurement function, since it allows the identification of the quality measure elements, as follows:

  A. Well defined: We found 39 measures (see “Appendix”) that have a measurement function and more than one quality measure element. For example, Automation: A/(A + B), where A is the number of decisions taken autonomically by the decision engine and B is the number of decisions sent to the user (Toch 2011); a minimal computation of this measure is sketched after this list.

  B. Defined but without measurement function: 64 measures are defined in terms of just a property to quantify, e.g., ratio of error occurrence during service use (Lee et al. 2008).

  C. Not defined: 115 measures do not provide any information about how they are computed [e.g., degree of adaptation to contextual changes (Kourouthanassis et al. 2008)], making it difficult to infer what they really measure.
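As an example of what a well-defined measure permits, the sketch below computes the Automation measure directly from its measurement function; the decision counts are hypothetical.

```python
def automation(autonomic_decisions: int, user_decisions: int) -> float:
    """Automation = A / (A + B), where A is the number of decisions taken
    autonomically by the decision engine and B is the number of decisions
    sent to the user (Toch 2011)."""
    total = autonomic_decisions + user_decisions
    if total == 0:
        raise ValueError("no decisions recorded")
    return autonomic_decisions / total

# Hypothetical log: 42 autonomic decisions, 14 delegated to the user.
print(f"Automation = {automation(42, 14):.2f}")  # 0.75
```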

Regarding the quality characteristics they are intended to evaluate, most of the measures found are defined to evaluate the Context-awareness characteristic (44 measures). We believe the explanation is that context-awareness is an indispensable characteristic of a ubiquitous system. In general, these measures are concerned with the correctness of the collected context, e.g., the probability that an instance of context accurately represents the corresponding real-world situation, defined by Damián-Reyes et al. (2011), and with the benefits that context-awareness brings to the user, e.g., the reduction in the number of configuration actions the user has to take to configure an environment in a context-sensitive manner, defined by Ranganathan et al. (2005).

The characteristic with the second most measures is Usability (24 measures). Eight of the Usability measures specifically evaluate the User Satisfaction subcharacteristic and four evaluate Efficiency; the remaining 12 measures are not tied to any subcharacteristic of Usability.

The other characteristics most cited by measures were Network Capability (19 measures), Transparency (15 measures), Acceptability (15 measures), Attention (12 measures), Privacy (12 measures), Calmness (10 measures) and Trust (10 measures), which indicates that these characteristics are relevant for measurement in HCI evaluations of ubiquitous systems.

Some measures can evaluate more than one characteristic. For example, the M119 (User control over private information: content privacy, identity privacy and location privacy) and M120 (Expressiveness of the security policy: support for mandatory and discretionary rules, context sensitivity, uncertainty handling, conflict resolution) measures evaluate both the Privacy and the Security characteristics.

4.3 Quality models for ubiquitous systems’ HCI evaluation

To answer RQ3, we analyzed whether the papers organized measures, subcharacteristics and characteristics in a hierarchical way, as in the SQuaRE standard (ISO/IEC 25010 2011); such a solution could be an empirical model or a framework. As mentioned previously, of the 41 papers, only 3 described such a solution, as conceptual frameworks or models of characteristics: Scholtz and Consolvo (2004), Lee et al. (2008) and Santos et al. (2013). Although two of these studies (Scholtz and Consolvo 2004; Lee et al. 2008) do not explicitly call their solutions quality models, they use them to perform quality evaluations, so we considered them quality models.

The first study (Scholtz and Consolvo 2004) aims to develop a framework for the user evaluation of ubiquitous applications. It defines nine evaluation areas (Attention, Adoption, Trust, Conceptual Models, Interaction, Invisibility, Impact and Side Effects, Appeal and Application Robustness), which are divided into metrics, which in turn are divided into conceptual measures. For example, in the Attention area, two metrics were created, Focus and Overhead; for the Overhead metric, two measures were created: percent of time the user spends switching among foci and workload imposed on the user attributable to focus. The main limitation of this study is the poor definition of its measures: they have only names, and properties such as the measurement function and interpretation values are not defined. For example, how exactly does one calculate the percent of time the user spends switching among foci? How does one interpret the results of this measure?
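To illustrate what is missing, below is one possible, entirely hypothetical operationalization of the first Overhead measure; nothing like it is given in the original framework, which is precisely the limitation we point out.

```python
def focus_switching_ratio(switch_intervals, session_seconds):
    """Hypothetical measurement function: fraction of session time spent
    switching among foci, given (start, end) timestamps of switch episodes
    captured, e.g., by an eye tracker or an interaction log."""
    switching = sum(end - start for start, end in switch_intervals)
    return switching / session_seconds  # one could interpret lower as better

# Example: 3 switch episodes totalling 45 s in a 10-minute session.
print(focus_switching_ratio([(10, 25), (100, 120), (400, 410)], 600))  # 0.075
```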

The second study (Lee et al. 2008) aims at developing user-centered evaluation metrics for ubiquitous service interactivity attributes. The attributes defined were Contextualization Support, Service Capability, Ubiquity Support and User Experience Support, divided into 15 measures. However, this model has some limitations. First, it does not have a well-defined hierarchy: some attributes bundle a lot of information (e.g., Service Capability covers performance, security and storage abilities) and should be decomposed into subfactors. Second, it does not define all the characteristics necessary for HCI evaluations of ubiquitous systems. For example, it does not mention trust, a very important characteristic with a relevant impact on user interaction: if users do not trust the system, they will hardly use it for their daily activities, as argued by Abi-Char et al. (2010), Evers et al. (2010), Jia et al. (2009), Scholtz and Consolvo (2004), Sousa et al. (2011) and Sun and Denko (2008). Another example concerns the characteristics related to resource limitations, like hardware resources, and to transparent interaction (defended by Scholtz and Consolvo 2004; Wu and Fu 2012).

In the third study (Santos et al. 2013), we proposed a quality model for HCI evaluation in ubiquitous systems. This model consists of characteristics and subcharacteristics that impact the quality of user interaction, together with measures capable of evaluating them for a particular system. The model includes characteristics specific to interaction with ubiquitous systems (context-awareness, transparency, attention, calmness and mobility); however, the paper presents measures only for context-awareness.
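To make this hierarchical organization concrete, the sketch below models a fragment of such a quality model as a simple tree, following the SQuaRE convention of characteristics refined down to measures; the node names are examples, and the measure under Mobility is hypothetical.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class Node:
    """A node of the quality-model tree: a (sub)characteristic or a measure."""
    name: str
    children: List["Node"] = field(default_factory=list)

model = Node("Ubiquity", [
    Node("Context-awareness", [
        Node("Variety of supported contextual information"),  # a measure
    ]),
    Node("Mobility", [
        Node("Number of supported devices"),  # hypothetical measure
    ]),
])

def leaves(node):
    """Measures sit at the leaves of the tree."""
    if not node.children:
        return [node.name]
    return [m for child in node.children for m in leaves(child)]

print(leaves(model))
```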

We noted that the existing quality models are not complete: none of them defines its measures completely. Two of them (Scholtz and Consolvo 2004; Lee et al. 2008) do not describe how the measures can be collected and how they can be interpreted, following the SQuaRE standard (ISO/IEC 25000 2014). Besides this, they fail to cover some issues specific to the HCI evaluation of ubiquitous systems, as discussed previously.

4.4 The systematic map

In order to create our systematic map, we classified all papers according to the defined facets: the contribution type facet (see Table 2) and the research type facet (see Table 3). Figures 11 and 12 present the results of these classifications. In the last step of the SM, we crossed the data from these two classification facets and generated the systematic map presented in Fig. 13. According to Petersen et al. (2008), this analysis enables presenting the frequencies of studies in each category.

Fig. 11 Distribution of selected papers structured by contribution of the paper

Fig. 12 Distribution of selected papers structured by research type

Analyzing the systematic map presented in Fig. 13, we concluded that:

  • Most of the papers (71 %) are “Solution Proposals.” No study was classified as an “Experience Paper.” Only six papers (15 %) were classified as “Validation Research,” i.e., with experiments to validate the proposed work, and only one (2 %) is “Evaluation Research,” which means the proposal is widely used in industry;

  • The only paper classified as “Evaluation Research” defines software measures to evaluate Google Search by Voice, a widely known company and system, which means this work was implemented in practice;

  • Most of the “Validation Research” papers are classified as “Measures Framework,” because they apply their measures in experimental studies, meaning the measures have not yet been used in practice but only in experiments;

  • Most of the “Solution Proposal” papers are classified as “Measures Framework” (13 papers, 45 %), but we also found papers classified as “Quality Issues” (7 papers, 24 %), “Characteristic Framework” (7 papers, 24 %) and “Quality Model” (2 papers, 7 %). However, most of the studies that contain measures do not detail how to measure the ubiquitous system: their software measures lack measurement functions, interpretation values and collection methods. The papers that propose characteristics or quality issues do not follow a pattern in their definitions; some do not provide a clear definition, and others do not take into account important characteristics specific to ubiquitous systems, like context-awareness; and

  • Only three papers are classified as “Quality Model,” which we consider the most complete type of contribution, since it contains characteristics, subcharacteristics and measures. However, of these papers, two are classified as “Solution Proposal” and the other as a “Philosophical Paper.” Thus, there is no quality model for ubiquitous systems that has been validated and/or used in practice.

Based on this analysis, we can say that the area of HCI measurement in ubiquitous systems still has many gaps, and more research is needed on the following aspects:

  • Aggregating all characteristics found in the studies of this SM. They can be complementary, because no single study presents all important ubiquitous features (e.g., Scholtz and Consolvo (2004) do not consider availability, and Lee et al. (2008) do not consider transparency); thus, some studies include important characteristics that others lack;

  • Although many measures have been proposed, they need to be better specified, with a clear and complete definition of their measurement function, collection procedure, interpretation values and so on. These measures should be documented using the format of the SQuaRE standard (ISO/IEC 25010 2011), which ensures that the measure provides all the information an evaluator needs in order to collect it;

  • A complete quality model that organizes characteristics and measures in a hierarchical way should be proposed and evaluated; the characteristics and measures found in this paper can be a starting point for defining it. We proposed a first model in this direction (Santos et al. 2013), organized into four characteristics (Trustability, Resource-limitedness, Usability and Ubiquity), with the other characteristics included as subcharacteristics. However, this model was only a first proposition, and we are still working on it: defining consistent measures for all quality characteristics that can really be applied to different ubiquitous systems is a long-term research effort; and

  • Validating the software measures using, for example, case studies or controlled experiments. As mentioned by Montagud et al. (2012), a validation can corroborate that a measure measures what it is expected to measure.

5 Discussion

All characteristics found in the SM somehow impact the quality of interaction with ubiquitous systems and, therefore, need to be evaluated. In general, we can say that all characteristics are defined taking into account that the interaction with ubiquitous systems needs to be not only efficient and effective, but also transparent and implicit to the user. One could argue that some of the quality characteristics do not appear to be oriented to user interaction, but we contend that they do affect the interaction. For example, Scalability may seem perceivable only by system administrators and maintainers; however, since in a ubiquitous environment a system can be used by many users and by other systems everywhere and at any time, low scalability has a high user-perceivable impact.

To better analyze these characteristics, we considered the SQuaRE standard (ISO/IEC 25010 2011), which organizes quality characteristics in two quality models:

  • System/Software Product Quality Model (left side of Fig. 14): composed of eight characteristics, which are further subdivided into subcharacteristics that can be measured internally or externally.

  • Quality in Use Model (right side of Fig. 14): composed of five characteristics, which are further subdivided into subcharacteristics that can be measured when a product is used in a specific context. These characteristics are related to the outcome of an interaction when a product is used in a particular context of use.

We then analyzed the 27 characteristics presented in the previous section by comparing them with these models. First, we identified that several characteristics found in the SM are already defined, with the same name, in the System/Software Product Quality Model: Usability, Reliability, Availability and Security. These characteristics have the same definitions; however, there are particularities of ubiquitous systems that have to be taken into account when evaluating their quality. For example, Security in general means the degree to which a product or system protects data and information, but in ubiquitous systems security also has to deal with sensitive data (e.g., location, profile) being collected all the time by other systems. Thus, software measures for security in ubiquitous systems have to be defined considering these particularities.

Moreover, regarding the System/Software Product Quality Model, we found characteristics that can be mapped as synonyms of existing SQuaRE characteristics; Privacy, for example, can be mapped to the Confidentiality subcharacteristic. Some characteristics can be included in others, for example: Device Capability and Network Capability as parts of the Capacity subcharacteristic, Robustness as a subcharacteristic of Reliability, and Interconnectivity as a subcharacteristic of Compatibility.

Considering the Quality in Use Model, we found the following characteristics defined with the same name and meaning: Effectiveness, Efficiency, Satisfaction and Trust. We also considered that the Safety characteristic can be mapped as a synonym of Freedom from Risk, and that characteristics such as Familiarity, Reversibility and Simplicity can be subcharacteristics of Usability.

One could also argue that the Context-awareness characteristic is similar to the Context Coverage characteristic of the Quality in Use Model. Nevertheless, these characteristics have different meanings: Context Coverage is the degree to which a product can be used effectively, efficiently, with freedom from risk and with satisfaction both in specified contexts of use and in contexts beyond those initially identified, whereas Context-awareness means the system’s capacity to collect context information and use it to make dynamic and/or static adaptations.

Other characteristics, presented by Nielsen (1994) for user interface evaluation (Acceptability, Utility, Usability and Ease of Use), are usually used to evaluate the characteristics of the Quality in Use Model.

With this analysis, we concluded that twenty-one characteristics are generic for any kind of system, not only for ubiquitous systems; they should, however, be particularized for ubiquitous systems when applied in an evaluation. On the other hand, six characteristics are not present in SQuaRE and are thus new, which makes them particular to the ubiquitous systems domain: Context-awareness, Mobility, Calmness, Transparency, Attention and Predictability. This suggests that one should give special attention to these characteristics when evaluating the quality of interaction with ubiquitous systems.

6 Threats to validity

Although we used a systematic and rigorous process of literature review, there are some threats to the validity (limitations) of the results of our study. We discuss the relevant threats according to the four categories of validity threats for software engineering research proposed by Petersen and Gencel (2013): (1) descriptive validity, the extent to which observations are described accurately and objectively; (2) theoretical validity, determined by our ability to capture what we intend to capture; (3) generalizability, related to internal generalizability (within groups, communities or a company) and external generalizability (between groups or organizations); and (4) interpretive validity, achieved when the conclusions drawn are reasonable given the data. This classification is recommended by the guidelines for systematic mapping presented by Petersen et al. (2015), and it helps to report the threats consistently. In the next subsections, we discuss the relevant threats to the validity of our systematic mapping results according to these categories.

6.1 Descriptive validity

This category comprises the threats related to the factual accuracy of the account. A possible threat is inaccuracy in the Data extraction, which may lead to wrong and/or incomplete results; for example, a quality characteristic may not have been identified during the Data extraction, or a characteristic that is not well defined may have been extracted. To mitigate this threat, a Data extraction form (see Table 4) was designed based on our research questions to support the recording of data. Moreover, this form contains definitions and examples of what a quality characteristic, a software measure, a quality model and an application domain are, which makes the Data extraction process clear and objective. The reviewers extracted the data (name and definition of the quality characteristic, software measures and application domain) exactly as presented in each primary study. Also, a peer review (see Sect. 3.2.1.4) was performed to obtain the final suitable set of characteristics, so all characteristics were analyzed by three researchers. Another threat could be bias in the identification of the research and contribution types. To mitigate this threat, a glossary and examples were discussed among the researchers and, in case of doubt, the classification was also performed by another researcher.

6.2 Theoretical validity

The threats in this category concern whether we captured what we intended to capture (Petersen et al. 2015). To reduce these threats, we designed a systematic mapping protocol including all the important activities proposed by guidelines for systematic studies, such as a search string based on our research questions, inclusion/exclusion criteria and a Data extraction form.

Missing studies due to a poorly designed search string is one relevant threat in this category, since it implies incomplete results. To reduce this threat, we defined our search string using terms well established in the community (e.g., measures) as well as synonyms and alternative words (e.g., metrics; see Table 1). We also performed several tests with the search string to assess whether the resulting papers answered our research questions. Another reason for missing studies could be that we did not use other terms related to ubiquitous systems, like “mobile” and “context aware,” which could imply the loss of important papers. However, the use of such keywords could bias our focus, which is ubiquitous systems. Furthermore, we believe that the lack of such keywords did not harm our findings, since our results included, for example, several papers related to mobile applications that are ubiquitous, and, based on these papers, we identified relevant characteristics such as mobility, device capability and context-awareness. With a first set of papers answering our research questions, we also performed snowballing procedures (backward and forward) aiming to find as many candidate papers for our mapping study as possible.

Another threat is that we did not conduct manual searches, which could mean failing to identify research that proposes new characteristics and measures for HCI in ubiquitous systems. To reduce this threat, we searched broadly in six well-known online databases (ACM Digital Library, IEEE Xplore, Scopus, Science Direct, SpringerLink and Compendex) that index the most reputable publication venues in software engineering, ubiquitous computing and human–computer interaction. Besides, we also applied backward and forward snowballing from the reference list of each primary study, aiming to avoid missing important papers. For the forward snowballing, we used a broader paper indexer (Google Scholar) that covers not only papers indexed by journals and conferences, but also internal periodicals and technical reports from research institutions.
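
The sketch below outlines one round of this procedure under stated assumptions: fetch_references and fetch_citations are hypothetical helpers (in practice, the citing papers were looked up via Google Scholar), and every candidate still passes through the same inclusion/exclusion criteria as the original search results.

    from typing import Callable, Iterable, Set

    def snowball(seeds: Set[str],
                 fetch_references: Callable[[str], Iterable[str]],
                 fetch_citations: Callable[[str], Iterable[str]],
                 is_relevant: Callable[[str], bool]) -> Set[str]:
        candidates: Set[str] = set()
        for paper in seeds:
            candidates.update(fetch_references(paper))  # backward snowballing
            candidates.update(fetch_citations(paper))   # forward snowballing
        # Newly found papers are screened with the same inclusion/exclusion criteria
        return {p for p in candidates - seeds if is_relevant(p)}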

Researcher bias could also have affected the selection and extraction of data. We mitigated these threats by using a rigorous study selection process, as described in the research protocol, and an extraction form. Besides, both the selection and the extraction were performed in pairs (one researcher reviewed the selection/extraction of the other) in consensus meetings. When there was a disagreement between the reviewers, a third reviewer helped to reach a consensus.
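
A minimal sketch of this decision rule, assuming each reviewer's verdict is reduced to an include/exclude boolean:

    from typing import Optional

    # Two reviewers decide independently; a third breaks ties.
    def include_study(reviewer_a: bool, reviewer_b: bool,
                      reviewer_c: Optional[bool] = None) -> bool:
        if reviewer_a == reviewer_b:
            return reviewer_a
        if reviewer_c is None:
            raise ValueError("disagreement requires a third reviewer")
        return reviewer_c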

We also highlight that no restriction was placed on the kind of study (e.g., books and short papers) considered in our mapping. Although we did this aiming to provide a broad overview of the area, we cannot ensure that all relevant literature (e.g., books) has been included, which implies that our set of characteristics, measures and models could be incomplete and that our general conclusions based on the systematic map could be inaccurate. For instance, if a study is not indexed by the databases we used or does not contain the key terms, it is not identified by our search string.

Moreover, as we conducted the searches in the online databases in the middle of 2013, we could have missed interesting papers published after 2012. To reduce this threat, we performed forward snowballing (until 2015) from the 32 papers obtained in the first phase of our conducting activity (see Fig. 2). This does not eliminate the threat that a paper relevant to our mapping, published after 2012, was not captured by our database searches, but we believe that the set of papers resulting from the online searches and the snowballing is representative.

Indeed, by performing snowballing (backward and forward), we observed that only one new characteristic was added to the initial set of quality characteristics identified by the database search. All papers from the forward snowballing (which retrieved papers up to the end of 2015) addressed quality characteristics similar to the 26 previously defined ones. This suggests that we obtained a representative set of quality characteristics and that new papers would likely converge to the same set.

Finally, we performed the second phase of the conducting activity almost three years after the first phase. This could lead to inconsistent results if the Data extraction had been performed differently in each phase. To mitigate this threat, we used the same Data extraction form and performed a peer review with the same researchers in both conducting activities to ensure consistent results. Although we recognize this threat, we argue that the snowballing results suggest that our set of quality characteristics is representative of HCI quality in ubiquitous systems.

6.3 Generalizability

The validity threats in this category concern the ability to generalize the results of the systematic mapping. Petersen and Gencel (2013) distinguish two types: external generalizability (between groups, organizations or different populations) and internal generalizability (within a group or a company). To mitigate the threats related to internal generalizability, we used a systematic mapping process, which is a rigorous literature review process. As a result, we identified a wide range of measures described in 41 papers published between 1999 and 2015. Thus, we believe that internal generalizability is not a major threat. Regarding external generalizability, we cannot guarantee the applicability of the identified measures to every kind of application (e.g., mobile applications, Internet of Things systems), since in some papers the measures were applied to specific kinds of systems. Moreover, our study focuses on ubiquitous computing and, therefore, we cannot generalize our results to other domains.

6.4 Interpretive validity

The threats in this category concern whether the conclusions were based on the data. One such threat is researcher bias. To reduce subjective interpretations by the researchers, the Data extraction and the data analysis were performed by two reviewers (or three, when the two reviewers disagreed on something) for each included paper.

Regarding the classification of the research type, we followed the scheme of Wieringa et al. (2005) to classify the research type of the included papers into six categories, as done in many other secondary studies in software engineering. However, since this classification is defined at a high abstraction level (Wohlin et al. 2013), other researchers could disagree with our classification of some included papers and assign them to a different category. In addition, other researchers might come up with different classification schemes, but we believe that the scheme used in this paper was sufficient to answer our research questions. To mitigate this bias, the classification was also performed in pairs. Moreover, we considered a second classification (the contribution type facet), which we defined by keywording relevant topics in the abstracts of the papers, as described in the guidelines of Petersen et al. (2008). We believe that both schemes used in this paper (the classification of Wieringa and our own) were suitable to answer our research questions.
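
As an illustration of such keywording, the sketch below assigns contribution-type categories to a paper based on trigger phrases in its abstract; both the category names and the trigger phrases are hypothetical examples, not the actual scheme we used.

    # Hypothetical contribution-type categories and their trigger phrases.
    CONTRIBUTION_KEYWORDS = {
        "quality model": ["quality model", "hierarchy of characteristics"],
        "measure": ["measure", "metric"],
        "evaluation method": ["evaluation method", "inspection", "heuristic"],
    }

    def classify_contribution(abstract: str) -> set:
        text = abstract.lower()
        return {category
                for category, phrases in CONTRIBUTION_KEYWORDS.items()
                if any(phrase in text for phrase in phrases)}

    # Example: classify_contribution("We propose a metric for ...") -> {"measure"}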

Finally, another threat relates to bias in the peer review process used to analyze the characteristics (see Sects. 3.2.1.4 and 3.2.2.3), whose goal is the identification of synonyms and the definition of a list of distinct quality characteristics. To mitigate this threat, the process was conducted by three researchers, all experts in the software quality and ubiquitous computing areas, during several meetings. One researcher is a PhD student working on software quality and ubiquitous computing. The other two are professors: one has more than 15 years of experience in the definition and use of software measures, and the other has more than 10 years of experience in mobile and ubiquitous computing.

7 Related work

To the best of our knowledge, this is the first review of quality characteristics and measures for the HCI evaluation of ubiquitous systems. However, there are several papers that present systematic reviews or systematic mappings of the literature in different domains and with different purposes, for instance, software product line testing (Silveira et al. 2011), software evolution visualization (Novais et al. 2013) and security in cloud computing (Da Silva et al. 2013). Considering the topics investigated in our study (ubiquitous applications, quality characteristics/quality models and HCI), we found four relevant studies, which we present in this section.

The first one, by Spínola and Travassos (2012), concerns ubiquitous systems. They present a conceptual framework to support the characterization of ubiquitous software projects according to their level of ubiquity adherence. This work follows a research strategy based on systematic reviews and surveys to acquire UbiComp knowledge and organize it into a conceptual framework. They identified 11 UbiComp features: context sensitivity, adaptable behavior, service omnipresence, heterogeneity of devices, experience capture, spontaneous interoperability, scalability, privacy and trust, fault tolerance, quality of service and universal usability. Although this paper considers the same domain as ours, ubiquitous computing, it differs from our review since the identified characteristics do not focus on the HCI aspect of ubiquitous systems, and the work does not identify measures for the evaluation of ubiquitous systems.

In the second one, Montagud et al. (2012) also aimed to identify quality characteristics (or attributes) and measures, but focused on the software product line (SPL) domain rather than on ubiquitous systems. The attributes and measures were classified using a set of criteria that includes the life cycle phase in which the measures are applied. The study resulted in a catalog of quality attributes and measures covering all SPL development phases and the final product.

The third one, by Oriol et al. (2014), is also related to quality characteristics, more specifically to quality models. They surveyed the state of the art of quality models for web services, finding 47 different models in total. One of their results is that most of the quality models do not take standards such as ISO/IEC 25000 into account, a conclusion we also reached in our review. This work differs from ours because it addresses a different domain (web services) and does not identify software quality measures.

Recently, Reis (2015) performed a systematic mapping looking for approaches to evaluate the usability of mobile applications. They identified 101 usability evaluation approaches for mobile applications, 28 of which were applied to ubiquitous mobile applications. One of the goals of this work was to verify the presence of the ubiquity characteristics defined by Spínola and Travassos (2012) in the approaches found; they found that 5 of the 11 ubiquity characteristics were considered. They also investigated which usability aspects the approaches addressed (i.e., memorability, cognitive load, errors, learnability, efficiency, effectiveness and satisfaction). Although this paper resembles ours in considering usability and ubiquitous applications, it differs in its focus on identifying evaluation approaches rather than quality characteristics and measures, and it is restricted to mobile applications rather than ubiquitous applications, which have the specific characteristics discussed in our work (Figs. 13, 14).

Fig. 13 The systematic map

Fig. 14 System/Software Product Quality Model (ISO/IEC 25010 2011)

8 Conclusion and future work

Characteristics of ubiquitous systems such as context-awareness and invisibility bring new challenges to human–computer interaction. In this scenario, the following question arises: How can we evaluate HCI quality in ubiquitous systems?

Motivated by this question and by the scarcity of work that summarizes the evidence on HCI evaluation in ubiquitous systems, we conducted a systematic mapping study in this context. The search was performed in six important electronic databases and, after a rigorous literature review process, produced a set of 27 quality characteristics and 218 measures useful for HCI quality evaluation in ubiquitous systems.

We then analyzed these 27 characteristics considering the System/Software Product Quality Model and the Quality in Use Model from the SQuaRE standard. We concluded that 21 characteristics are generic to any kind of system, since they are present in the SQuaRE models, although it is still necessary to take ubiquity particularities into account when evaluating them in ubiquitous systems. On the other hand, we identified six characteristics that we believe are particular to the HCI evaluation of ubiquitous systems.

The SM results also show evidence of the need for more research to: (1) aggregate all characteristics found in this SM study; (2) better specify the identified measures; (3) validate the identified measures; and (4) create a complete quality model that organizes characteristics and measures hierarchically.

Furthermore, this study suggests the following future work:

  • The relationships among the measures should be investigated; for example, measures from Context-awareness may impact Usability;

  • Once new measures are defined, they need to be applied in different domains and different types of ubiquitous systems (e.g., tourist guides) to be properly validated and to demonstrate their utility;

  • Knowing the quality characteristics, new development approaches for ubiquitous systems should be defined that consider these characteristics early on, instead of leaving quality evaluation to the end of the development cycle;

  • The software measures can be prioritized with respect to their importance;

  • Software testing procedures should be defined to correctly collect each of the measures in different usage scenarios; and

  • A testing process should be defined for different types of ubiquitous systems based on both the quality characteristics and the software measures presented in this paper.