Keywords

1 Introduction

Motion sickness (MS) is an ‘umbrella’ term, covering many types of motion (or perceived motion) induced ‘sicknesses’ such as ‘car sickness’, ‘seasickness’, ‘virtual reality sickness’ and ‘simulator sickness’. MS research has been considerably under-researched over the past 20 years, however with the development of technologies such as virtual reality devices, the increasing use of driving simulators, and the push to develop self-driving vehicles the demand for MS research is again of relevance.

Reported symptoms of MS include that of nausea, cold sweats, disorientation, increased salivation, fatigue and vomiting amongst others. It is important to note that when a person is reported to be experiencing ‘motion sickness’, at the low end of impact they may not even be feeling ‘sick’ at all. For example, reports of increased salivation and fatigue may be indicative of ‘motion sickness’, although few people would consider themselves ‘sick’, or even uncomfortable. However, in the more advanced stages of MS the symptoms can be very uncomfortable and indeed result in being physically sick. Hence, the requirement to grade MS levels to understand the extent of the sensation. There are currently two main theories that suggest the causes for MS.

1.1 Sensory Conflict

The theory of sensory conflict explains that “mismatches between (or within) the visual, vestibular, and somatosensory inputs” cause MS [1]. This essentially means that if movement is sensed (for example) within the inner ear which does not correlate to the motion that is seen by the eye, then there is a conflict of senses and MS prevails. This conflict can arise if the human eye sees movement around it, but the inner ear senses no motion, or vice-versa. Sensory conflict can also arise through more acute mismatches. This theory is often favored in explaining MS, for reasons as shown by [2].

1.2 Postural Instability

The central hypothesis of postural instability theory is that MS is caused by loss of postural control [2]. As such, an underlying contributor to this theory is postural sway – where the human body is understood to be in constant movement through small, multi-axial swaying motions. These motions that can be exhibited in all states, including sitting, are thought to help the brain predict motion through comparing expected sway against any immediate effects/variations. The postural instability theory is more recently being considered as less of a cause of MS, but perhaps more of a precursor to MS. Where studies have shown postural instability may not necessarily cause MS, but does precede subjective MS symptoms [3, 4].

2 Motion Sickness and Future Technologies

With an understanding of the two theories presented above it is possible to understand why people may become motion sick in certain situations. Historically it has been common to hear complaints from those attempting to read a book whilst being a passenger in a car, or when traveling inside a boat with limited views of the horizon. Incidentally, in both these examples the cause of MS is likely due to the mismatch of the static visual environment and the physical movement sensed by the inner ear. However, with the development of future automotive technologies such as vehicle simulators there is growing concern for other forms of MS and potential impacts. Although vehicle simulators have been around for many years, it is the recent popularization of these which brings the subject to relevance. With the rapid development of technologies (such as self-driving vehicles) and the improvement in simulator capability and design, more and more research is being conducted within simulated environments.

2.1 Motion Sickness in Vehicle Simulators – Simulation Sickness

In a fixed base vehicle simulator, a person would sense movement with their eyes as the simulator displays a moving environment, although the inner ear would realize the vehicle is stationary. Therefore, there is a conflict in senses and MS is likely to occur. This can also happen in moving base simulators, with more acute mismatches. MS in a simulator is referred to as ‘simulation sickness’ - a form of MS. It is not uncommon to see around 25% of participants not able to complete a simulator study due to MS onset. A meta-study of such MS looking at multiple user trials [5], reported the highest dropout rate in one trial as 71% (p. 259). Where the mean drop-out percentage was 14% between the 9 user trials this paper reviewed.

2.2 Impact of Motion Sickness on Human Performance

It is understood that a considerable percentage of the population will likely become motion sick (or simulator sick) in vehicle simulators. There are two main areas of concern considering the impact of MS. The most immediate concern is in relation to comfort and wellbeing of participants in simulator studies. However, limited consideration has been given to understand to what extent human performance is affected by MS.

Considering seasickness, (another form of MS) one study looked at cognitive function and found that “5% of their users did not complete a task when not feeling sick at all” (i.e., in a baseline condition), however “this increased to about 60%’ when subjected to motion primed sickness” [6]. In 2005 NASA sponsored a project with the aim of increasing task performance ability in a commonly MS prone environment – space travel (where MS in this state is commonly referred to as ‘space motion sickness’). In this study [7] they employed a virtual reality training period aimed at reducing MS. The results showed that training reduced nausea scores (a subscale of MS measurement) and also significantly increased task performance when compared to the control group. The task being measured was an industry/job specific “navigation and switch activation task” [7]. A separate US military project from 1995 also touched on this relationship between task performance and MS, where they advised that if a person reports MS whilst using their tank simulator they “should not be required to drive any (real-world) vehicle” [8] as they expected an impact on task performance ability. This recommendation was echoed by another report which concluded MS “could directly affect visuomotor coordination” [9]. The recommendation that participants should not operate heavy machinery within two hours of completing a driving simulator study is considered good practice and is frequently advised in most simulator trials.

2.3 Motion Sickness, Human Performance and Vehicle Simulator Use

A few studies, such as those mentioned previously, have alluded to MS affecting human performance, but there is currently limited understanding of what areas of human performance are affected. The previously discussed research relied on their own industry-specific, non-standardized and non-repeatable measure of performance. The potential impact of MS on human performance could be of concern in vehicle simulator trials for many reasons. If, for example, researchers are measuring human interaction with a new piece of technology and participant performance is affected by MS then it is possible that user trial data may become invalid. Without knowing how simulator use may impact human performance it is not possible to say to what extent simulator-based user trials give representative, valid and/or reliable data for real-world applications. Even validity of post-simulator questionnaires could be of concern if participants’ cognitive ability is affected.

One of the main objectives of any simulator-based study is to ensure that participants interact with the technology in the same way as they would in the ‘real-world’. However, without knowing the impact of MS on human performance, this ‘transferability’ cannot yet be assured. The research objective was therefore set to understand to what extent does MS affect human performance?

3 Method

To answer the research question, it was important to first consider what ‘human performance’ consists of, specifically, what is required of a driver. Figure 1 (below) shows the three key areas of performance in relation to driving: physical, cognitive and visual performance (and the intersections between each). Auditory performance was not considered for this user trial, as this is not a requirement to drive a vehicle.

Fig. 1.
figure 1

Task performance diagram

Considering the range of tasks, a driver is expected to perform, and without being able to predict what interactions future vehicles may require, it was not possible to measure human performance with driving-specific tasks. Instead, the six areas above are likely to cover every human interaction instance at its core performance ability. For example, route planning may fall within the cognitive performance section, pressing a button (requiring visual skill and physical dexterity) would fit in the physical-visual section, and interacting with an out-of-sign control may rely on just physical performance.

Six individual tests were selected, one to assess each intersection of the above human performance diagram. The tests were chosen based on a few criteria: they should be pre-validated and standardized tests, they should each take less than 1 min to complete, they should have no learning affect (i.e., once familiar with the test, repeated exposures should not affect the scores) and they should represent the section they are assigned to test and be independent of others as much as possible. Through an extensive review of the literature, the pre-validated tests decided for each intersection included:

  • Test 1 - Visual performance: A visual acuity ETDRS LogMar test chart was used whereby participants would use only their dominant eye, standing at a set distance and read out the letters presented on the chart. They were scored on the total number of letters read.

  • Test 2 - Physical performance: The card turning test extracted from the ‘Jebson Taylor Hand Function Test’ (see [10]) was used to measure physical skill and dexterity. The cards used were 3” by 5” index cards set on a contrasting table, so visual ability to identify the cards was minimized. They were scored on the time taken to turn over all cards for both their dominant and non-dominant hand independently.

  • Test 3 - Cognitive performance: A ‘Paced Visual Serial Addition Test’ (PVSAT) was used, which is a visual version of an n-back test (for n-back see [11]). Numbers were presented on a screen and participants had to add the current number to the previously shown number and give the answer verbally. A 200pt font size with black text on a white background was used to isolate cognitive performance by reducing any effect of visual performance. Participants were scored on the number of correct answers.

  • Test 4 – Visual-cognitive performance: A mental rotation test was used [12] whereby a ‘target’ 3D shape was presented on paper to the participant with four other shapes, their task was to identify which two of the shapes matched the target shape, despite being rotated in a different plane. Participants were scored on completion time and the number of correct answers (i.e., if they gave the two correct shapes per question).

  • Test 5 - Physical-visual performance: The Perdue Pegboard (see [13]) was used which requires the physical dexterity and visual skill of locating pins in small holes. Participants were tested with their dominant and non-dominant hand independently and were scored on the number of pins they put in the holes in the time given (60 s).

  • Test 6 – Physical-cognitive performance: A reaction time test was used whereby a large traffic light was displayed on the screen, when the bottom green light illuminated (within a random length of time between 1 and 6 s) the participants had to press a physical button. This test depended on cognitive processing speed and the physical response to press the button. Participants were timed for five consecutive repetitions and their score was derived from the average of the five measurements.

Participants completed a demographic questionnaire and the Simulation Sickness Questionnaire (SSQ) [14] to give a baseline result of MS and to counter any pre-existing conditions such as fatigue. They were then guided through all six tests as a learning activity (familiarization run) where the tests were explained and no scores were taken. After familiarization, the tests were given again (in a randomized order) and scores were taken as their baseline results. Participants were then introduced to the driving simulator (the 3xD simulator at the University of Warwick [15]) and were read a pre-prescribed script on the upcoming driving scenario. The participants were asked to drive along a pre-set route following automated voice-command directions. The route takes around 30 min to complete in its entirety including 5-min of a familiarization route at the beginning, followed by a mixture of country, town, and motorway roads.

Throughout the driving participants were asked (via a two-way intercom system) to rate their MS once per minute as per the Fast Motion Sickness Questionnaire (FMS) [16], where a rating out of 20 was noted. After the driving scenario participants completed the six tests once again and scores were taken to give a comparative measure. Participants filled out another SSQ considering their MS feelings whilst completing the six tests. Participants who ended the driving scenario early due to MS were still able to complete the six tasks and SSQ again (as long as they were comfortable to do so). The six tests were given with the aim of completion within the first 6 min of the driving scenario ending, due to the potential for the effects of MS to wear off quickly for some participants. The results are to be analyzed looking at the change in MS score (i.e., delta SSQ) compared to the change of the six human performance scores (delta scores).

The aim of the study was not to deliberately induce MS in participants, but expose a range of people to a scenario that gradually builds up complexity to include activities that are known to increase the likelihood of MS. When adhering to best practice guidelines the 3xD simulator does not cause excessive MS. These best practices include limiting simulator exposure to around 15 min at a time, providing pre-cooled air vented into the drivers’ cabin (this was provided during the current study), completion of a familiarization run (also completed in this study), pre-screening participants for a propensity for MS and avoiding complex junctions or higher speed tight sweeping bends. However, for this study we were specifically interested in the effect of MS on performance, hence a longer duration of scenario was developed (30 min) which in the final 10 min exposed participants to increasingly complex turns and junctions. A prescreening questionnaire was completed and those who indicated responses which might indicate a propensity for MS were not excluded, but given a written and verbal warning that they might experience MS and given the opportunity to withdraw. To counter the effects of extended simulator exposure and, as we are exposing people to a scenario which we know might increase the likelihood of MS, continuous monitoring (through FMS and visually) of the participant was completed to ensure welfare. This study was approved through The University of Warwick BSREC (REGO-2017-2090).

4 Results

51 participants took part in the user trial, including 27 males and 24 females. The minimum participant age was 22 ± 4 years, with a maximum age of 49 ± 4 years, a mean age of 31 and a standard deviation of 10.13. 45% of participants ended the study early due to MS (N = 23), including 26% of males (N = 7) and 67% of females (N = 16). These participants are referred to as ‘dropouts’. All dropouts still completed the post-driving SSQ and six tests. SSQ scores were collected and calculated to give a ‘total’ score and a score for the three subcategories (‘nausea’, ‘oculomotor’ and ‘disorientation’). An exploratory analysis was conducted (see Table 1), where scores were reported for ‘pre-driving’ (i.e., scores reported before the driving task) and ‘post-driving’ (i.e., the scores reported directly after the driving task) including mean and standard deviation (SD):

Table 1. Exploratory analysis of the subscales of the SSQ

Results indicated no gender effect on total post SSQ scores (F = 2.046, p > 0.05), although there was a significant gender effect on participant dropouts (F = 11.411, p = 0.001), as previously quantified. Considering the subcategories of the SSQ, there too was no gender effect on Oculomotor (F = 0.356, p > 0.05) or Disorientation (F = 0.697, p > 0.05) scores, however, there was a gender effect observed for Nausea scores (F = 4.675, p < 0.05) with females reporting higher Nausea. Similar findings were observed when looking at delta SSQ scores (i.e., change in SSQ scores). Results from the SSQ showed a significant difference in the mean scores for SSQ ‘total’, ‘nausea’ and ‘disorientation’ between participants who completed the driving scenario and those who dropped out (Total F = 14.135, p < 0.001), (Nausea F = 19.624, p < 0.001) and (Disorientation F = 15.556, p < 0.001). However, ‘oculomotor’ mean scores between participants who completed the driving scenario and those who dropped out were non-significant (F = 1.570, p > 0.05). No age effect was observed on total post SSQ scores (F = 0.539, p > 0.05) with similar findings for the three subcategories also.

The two methods used to measure MS (SSQ and FMS) were found to have a significant correlation when looking at participants who completed the driving task (n = 28) (Pearson’s correlation = 0.620, p < 0.001). Not all dropouts were able to give a final FMS score, so a correlation test for the total sample was not possible.

An exploratory analysis of the six human performance tests (each denoted by a number 1–6) was conducted, again looking at the scores given ‘pre’ driving task and the scores recorded ‘post’ driving task (i.e., delta sores or ‘Δ’) where in tests 2 and 5 scores were taken for dominant and non-dominant hands independently (see Table 2 below).

Table 2. Exploratory analysis of pre and post driving results

Looking at delta mean scores, test 1 showed a positive change meaning an improved performance. Test 2 shows a positive change which relates to an increased time, indicating a poorer performance. Test 3 shows a negative score, which indicates a poorer performance. Test 4 shows an increased score and decreased time which indicates an improved performance in both areas. Test 5 shows decreased scores which indicates a poorer performance. Test 6 shows an increased time, and thus a poorer performance.

It was shown previously that participants who dropped out reported significantly higher MS scores compared to those who completed the study. It was next tested to see if there was a difference between the delta performance scores of participants who completed compared to those who dropped out. The paired (pre and post) data sets were not normally distributed and symmetry was not observed in the data from test 3, test 4 (time and score), test 5 dominant and test 6. Therefore, data was analyzed using the Sign Test for the non-symmetrical results and the Wilcoxon Signed-Rank Test for the symmetrical. The results are presented below in Table 3.

Table 3. Analysis of dropouts and complete scores (where ** indicates 99% confidence)

5 Discussion

Dropout rates were higher than expected at 45%. The reason for such a high dropout rate may be explained by two factors: firstly, the scenario design and methodology (as explained previously) and secondly the user trial process, where in line with ethics guidelines, participants were repeatedly reminded they are welcome to stop the trial at any time if they feel uncomfortable. Further, the FMS questions may have primed participants for MS, as their comfort was constantly on their mind. Considering the finding that increasing self-efficacy reduced MS [17], the adverse could be true for this user trial. In line with previous literature [18, 19] a greater number of females dropped out compared to males. Interestingly, there was no significant difference observed between female and male total MS scores, as given by the SSQ, with females only reporting higher Nausea scores than males. This finding, considered alongside the finding that females dropped out at a higher rate, could point towards Nausea being a greater impactor for MS discomfort in females. It was also possible to show that total SSQ score, nausea score, and disorientation scores were all significantly related to dropouts when looking at the group as a whole. This finding hints that Oculomotor scores are a less important factor when considering MS sub scores and their predictability of dropouts.

Adding to the conflicting investigations of age effect on MS, we found that age had no effect on SSQ scores or any subcategories. Previous research reported “Individuals who have been studied objectively in the laboratory suggests that towards older age, susceptibility may increase in some individuals” [20]. However, in a questionnaire survey of 3256 coach travelers it was found that “Travel sickness decreased with increasing passenger age and greater travel experience” [21]. The contradiction in this age effect is widespread in the literature, so little conclusions can be drawn at this time. It should be noted this study did not provide a fully representative age demographic, with age being skewed towards younger participants.

Not all dropout participants were able to give a final FMS score, as some expressed a desire to exit the simulator immediately. All those who completed the driving task (N = 28) were able to give a final FMS score however. Comparing this final FMS score to the SSQ total score, correlation was observed (r = 0.620). This bodes well for the use of the FMS in future studies, in that it may be an effective way of tracking subjective MS in ‘real-time’. The creators of the FMS previously showed a Persons correlation of (r = 0.785), so it is beneficial for the continued use of this scale that this study has found a similarly positive result.

The main objective of this user trial was to see to what extent MS affects human performance. To analyse this the results were split into two groups – those who completed the driving scenario (N = 28) and those who dropped out (N = 23). All those who dropped out cited MS as the sole reason for ending the study early, therefore we can assume those who dropped out were significantly affected by MS. We had to make the assumption therefore that those who completed the driving scenario were less affected by MS. Using these two groups it was possible to explore the data for differences in task completion score between those who were severely motion sick, compared to those who were not. Each area of performance has been broken down into the original subcategories of human performance for exploration.

5.1 Discussion of Individual Tests

Test 1 – Visual Performance (LogMar Visual Acuity).

No significant effect of MS on visual performance was observed in this study (despite a small increase in delta scores, indicating improved overall scores), although lighting was a variable that could not be controlled and may have interfered with results. The lighting in the simulator was significantly darker than the lighting in the control room, where participants came from the darker simulator (where the participants are dark-adapted), into the brighter control room to take the visual acuity test. Anecdotally, a few participants noted they felt more ‘alert’ when coming back to the control room due to the change in lighting conditions. At this stage, it is not possible to make any conclusions on the effect of MS on visual acuity. It is recommended this area is explored again in a controlled study. Further, the visual acuity test only looks at one area of human performance, other measures such as depth perception would also be interesting to explore.

Test 2 – Physical Test (Card Turning).

Looking at the group average change of all participants (pre to post driving) as given in Table 2, both dominant and non-dominant hand times increased slightly, indicating that people (on average) were slower at completing the physical test after driving the simulator. The statistical analysis (in Table 3) showed that there was no significant change in either hand for those who managed to complete the study (i.e., those who are presumed to have low MS), however, those who had to end the study early on account of MS performed significantly worse with both dominant and non-dominant hands. From this we can conclude that the use of the driving simulator has no effect on physical task performance, however it appears that MS does significantly affect physical hand/arm performance.

Test 3 – Cognitive Test (N-Back).

Looking at the group average change of all participants (pre to post driving) as given in Table 2, it is shown that people performed worse at the n-back test after the driving than before (on average) – as shown by a negative delta score, although this group difference was not significant. Table 3 however confirms that indeed MS does have a significant effect on cognitive performance as dropouts performed significantly worse on the cognitive test after driving than before. This confirms the previous findings that cognitive performance is affected by MS [6]. However, there was no significant difference in the group of people who completed the driving study, which confirms that the use of the driving simulator alone does not affect cognitive performance.

Test 4 – Visual Cognitive Test (Mental Rotation).

This test revealed perhaps the most surprising results, where it was found that on average, after driving in the simulator people scored higher on the mental rotation test and completed the test quicker (as shown in Table 2). This indicated that the driving simulator (with or without MS) may improve spatial ability. However, there were no significant differences found in scores or times as indicated in Table 3. This is an area which should be explored further as this experiment could not isolate the impact of simulator exposure (and the potential to train spatial skills through use), and motion sickness.

Test 5 – Visual Physical Test (Perdue Pegboard).

Table 2 shows an average decrease in scores given by the Perdue Pegboard test for both dominant and non-dominant hands, interestingly however there does not appear to be a significant difference in pre to post scores in those who dropped out (i.e., those with the highest MS) in their non-dominant hand, although there is a significant difference in their dominant hand. This was somewhat of a surprise to know that MS appears to affect the dominant hand and not the non-dominant hand. As the results in Table 3 show, people who used the simulator and did not drop-out had no significant affect to their scores – indicating that simulator use for non-motion sick participants has no direct effect on visual-physical skills.

Test 6 – Cognitive Physical Test (Reaction Time).

Table 2 indicates a small increase in time taken for the reaction speed test when looking at the entire group average. Looking then at Table 3 it is shown there is a significant difference for both groups in reaction time scores. The difference is greater for those who dropped out (i.e., were more motion sick), but it is interesting to find that use of a driving simulator regardless of MS state can alter cognitive-physical skills – in this instance reaction time.

Overall there were some interesting results when looking at the two groups (complete and dropouts) and their change in task performance. It is understood to be a somewhat binary method (drop out verses completed) of grouping the participants as in both groups there could be significant variation in severity of sickness. However, there are currently no validated methods of categorizing severity of sickness scores into reasonable groups. For a future study, it would be interesting to consider how groups could be broken down further to consider severity of sickness and if severity has an effect on the scale of human performance change.

6 Conclusion

The user trial provided some interesting results and for the first time it has been possible to consider how MS affects all areas of driver task completion. In this study, a simulator – and therefore simulation sickness was used as a type of MS so without further study of other types of MS, transferability cannot be assured. It was beneficial to note that there was minimal impact on performance for participants who completed the driving scenario (and therefore were less motion sick), where only the cognitive-physical test reported a significant negative effect. Other than that, simulator use did not seem to affect human performance in any other category; this finding can help validate simulator-based user trial results in that if participants are not strongly affected by MS their performance is not changed, so the trial results can be considered valid. A follow up study to consider why simulator use affected the cognitive-physical performance is recommended. Given the uncontrollable variable of light/darkness adaptation, it is not possible to conclude how MS affects visual performance. Otherwise, this research project has shown that MS significantly affects cognitive performance, physical performance, physical-visual performance and physical-cognitive performance. A further study to understand how the scale of the impact may change in accordance to the scale of MS severity is recommended. Other interesting findings included that the oculomotor subscale was the only SSQ measurement that had no effect on participant dropout and the category is perhaps less impactful on overall wellness. The finding that MS affected visual-physical performance only in the dominant hand is an interesting finding to consider for future HMI (human machine interface) design research.