Planning paths to multiple targets: memory involvement and planning heuristics in spatial problem solving

Wiener, J. M.; Ehbauer, N. N.; Mallot, H. A.

doi:10.1007/s00426-008-0181-3

Planning paths to multiple targets: memory involvement and planning heuristics in spatial problem solving

Original Article
Published: 08 November 2008

Volume 73, pages 644–658, (2009)
Cite this article

Download PDF

Access provided by CONRICYT-eBooks

Psychological Research PRPF Aims and scope Submit manuscript

Planning paths to multiple targets: memory involvement and planning heuristics in spatial problem solving

Download PDF

J. M. Wiener^1,2,
N. N. Ehbauer¹ &
H. A. Mallot¹

799 Accesses
46 Citations
Explore all metrics

Abstract

For large numbers of targets, path planning is a complex and computationally expensive task. Humans, however, usually solve such tasks quickly and efficiently. We present experiments studying human path planning performance and the cognitive processes and heuristics involved. Twenty-five places were arranged on a regular grid in a large room. Participants were repeatedly asked to solve traveling salesman problems (TSP), i.e., to find the shortest closed loop connecting a start location with multiple target locations. In Experiment 1, we tested whether humans employed the nearest neighbor (NN) strategy when solving the TSP. Results showed that subjects outperform the NN-strategy, suggesting that it is not sufficient to explain human route planning behavior. As a second possible strategy we tested a hierarchical planning heuristic in Experiment 2, demonstrating that participants first plan a coarse route on the region level that is refined during navigation. To test for the relevance of spatial working memory (SWM) and spatial long-term memory (LTM) for planning performance and the planning heuristics applied, we varied the memory demands between conditions in Experiment 2. In one condition the target locations were directly marked, such that no memory was required; a second condition required participants to memorize the target locations during path planning (SWM); in a third condition, additionally, the locations of targets had to retrieved from LTM (SWM and LTM). Results showed that navigation performance decreased with increasing memory demands while the dependence on the hierarchical planning heuristic increased.

Spatial Survey Estimation Is Incremental and Relies on Directed Memory Structures

The many routes of mental navigation: contrasting the effects of a detailed and gist retrieval approach on using and forming spatial representations

Article 23 June 2017

Choice between decision-making strategies in human route-following

Article Open access 26 April 2023

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Planning short paths to multiple targets can be complex and computationally expensive. This is best demonstrated by the traveling salesman problem (TSP) that can be stated as follows: given a number of locations and the costs (here distance) of traveling between them, what is the cheapest round trip route that visits each location once. The number of possible round trips is computed as (N − 1)!/2, with N being the number of locations. For six locations, 60 different round trips are possible, for ten locations, already 181,440 round trips exist. The TSP belongs to the class of NP-hard problems for which no algorithm exists for calculating the optimal solution within a practical time if N is large (Lawler, Lenstra, Kan, & Schmoys, 1985).

For humans, TSP-like planning tasks are actually quite common in our day-to-day wayfinding, for example, on a typical shopping route on which several shops are visited (e.g., Gärling & Gärling, 1988). Obviously, rather than actually calculating and comparing all possible solutions, we rely on simplifying processes that reduce cognitive effort while resulting in reasonably short solutions (c.f., Dry, Lee, Vickers, & Hughes, 2006). Such simplifying processes that replace complex or computationally expensive algorithms have been termed heuristics (Newell & Simon, 1972), and have primarily been studied in the context of judgment and decision making (e.g., Gigerenzer, Todd, & the ABC Research Group 1999). Shah and Oppenheimer (2008) recently suggested that heuristics research should focus on investigating how people reduce the effort associated with complex decision processes. In this work we investigated planning heuristics that humans apply when solving complex TSP-like tasks that require movements through space, and we describe a hierarchical planning strategy that reduces cognitive effort on several levels. Additionally, as path planning for actual navigation usually requires spatial working memory (SWM) as well as spatial long term memory (LTM), we systematically varied memory demands to investigate the impact of the different memory systems for planning performance and planning processes as well as interrelations between the memory systems and planning heuristics.

Spatial optimization in the TSP has primarily been investigated by means of visual versions, in which participants are presented with a number of identical dots on a computer screen. Their task is to connect the dots such that the resulting path is optimal with respect to overall length (e.g., MacGregor & Ormerod, 1996; MacGregor, Ormerod, & Chronicle, 1999; Van Rooij, Stege, & Schactman, 2003; Graham, Joshi, & Pizlo, 2000; Vickers, Lee, Dry, & Hughes, 2003b; Vickers, Lee, Dry, Hughes, & McMahon, 2006; Pizlo et al., 2006; Gibson, Wasserman, & Kamil, 2007; Kong & Schunn, 2007). Generally, results from these studies show that humans reach very good performance levels, often with near optimal solutions (e.g., Vickers, Butavicius, Lee, & Medvedev, 2001). There is an ongoing debate on the optimization-strategies applied in these experiments: MacGregor and Ormerod (1996) have proposed that participants used the convex hull as part of their strategy (see also MacGregor, Ormerod, & Chronicle, 2000, 2004). They argue that the fact that a tour that follows the convex hull method is free of crossings and that humans tend to avoid crossings is one important piece of supporting evidence for this method. Van Rooij et al. (2003), however, argue that participants know that crossings will result in sub-optimal solutions. Hence, they avoid crossings when solving TSPs, rather than following the convex hull method. Vickers, Bovet, Lee, and Hughes, (2003a) proposed a hierarchical nearest neighbor (NN) method, assuming that participants first establish clusters of several dots based on NN distances, which they then sequentially link into a tour, using some variant of the NN algorithm. Graham et al. (2000) proposed another hierarchical model, assuming that from the original stimulus (dot pattern) a series of images are generated which are increasingly blurred and compressed. By these means a hierarchy of images is generated in which neighboring points collapse to clusters. The algorithm then starts with generating a tour in an image with only three blurred clusters. By progressively moving to the next lower layer in the hierarchy further clusters are inserted into the tour, eventually reaching the level of single dots. A common feature shared by many of the approaches presented above is that the spontaneous perceptual organization when perceiving the stimulus pattern can be assumed to play a critical role in solving visual TSPs (see Dry et al., 2006).

Visual TSPs are conducted in figural or pictorial space (Montello, 1993; Hegarty, Montello, Richardson, Ishikawa, & Lovelace, 2006) in which spatial relations between participants and relevant locations remain constant. Usually only relevant locations are presented, all information required to solve the task is visually accessible, and the chosen paths are displayed while solving the TSP. Hence, no memory is required. Path planning during actual navigation, however, has very different characteristics (for a comparison of path planning at different scale levels, see Wiener & Tenbrink, 2008): most importantly, it takes place in larger spaces, i.e., vista or environmental spaces (c.f. Montello, 1993), resulting in a variety of cognitive processes and memory demands that are absent in figural space. Due to movements, for example, spatial relations between navigator and the surrounding constantly change and relevant locations get out of sight. This requires memorizing and updating their positions during navigation. Furthermore, given that no external representation of space is available, the actual locations of relevant places in large scale, environmental spaces (such as cities) has to be retrieved from spatial LTM rather than from perception. Additionally, in order to plan paths covering multiple locations, these locations have to be simultaneously activated and represented in a SWM during the actual planning process. Only few studies investigated path planning and optimization with multiple target locations in the context of actual navigation. And, to the authors’ knowledge, no study so far systematically investigated the role and impact of the various memory related constraints on path planning performance, planning processes, and the planning heuristics applied.

Gärling and Gärling (1988) demonstrated that most shoppers who minimized the total distance of their shopping routes employed strategies similar to the NN algorithm (see also Gärling, Säisä, Böök, & Lindberg, 1986), a simple algorithm for solving TSP-like tasks quickly: from its current location, the NN algorithm repeatedly visits the closest target that has not been visited before until all target locations have been visited (e.g., Golden, Bodin, Doyle, & Stewart, 1980). Wiener and Mallot (2003) demonstrated that environmental regions influenced navigation behavior when planning short paths to visit multiple targets: participants minimized the number of region boundaries they crossed during navigation and preferred paths that allowed for fastest access to the region containing the target. These results corroborate findings suggesting that regional information is explicitly represented in spatial memory (cf. Stevens & Coupe, 1978; Hirtle & Jonides, 1985; McNamara, 1986), and show that such information is taken into account during planning. In everyday navigation, multiple information sources are available that allow for various planning strategies. In a series of navigation experiments, Wiener, Schnee, and Mallot (2004) studied the use and interaction of different planning strategies. In addition to the region-based planning strategy sketched above, two further strategies, the cluster-strategy and the least-decision-load strategy, were identified. The cluster-strategy states that neighboring places are grouped together to form clusters. If two clusters are equidistant, routes are planned such that the larger cluster is visited first, by this means increasing the number of visited targets as fast as possible (c.f. Gallistel & Cramer, 1996). The least-decision-load strategy states that the number of possible movement decisions along a path is taken into account during planning. This strategy predicts preferences for paths that minimize possible movement decisions along the path. It could be employed, because the risk of getting lost is smaller on less complex routes. The NN-, the cluster-, and the region-based-strategy are heuristics that reduce mental effort during planning, either by planning locally rather than globally (NN-strategy), or by aggregating targets in a first planning step (cluster and region-based approaches), thus reducing search or problem space. At the same time, applying these planning strategies reduces working memory demands during planning and might be a response to memory related constraints that do not play a role in visual TSPs.

Motivation and synopsis

The objective of this study was to develop an increased understanding of the cognitive components, processes, and heuristics involved in spatial problem solving in a navigational context. As discussed above, path planning in everyday wayfinding usually requires both, spatial LTM and SWM: the positions of target location beyond the current sensory horizon have to be retrieved from spatial LTM; and, if multiple target locations are to be visited, these locations have to be simultaneously activated and maintained in a SWM during the actual planning process. According to the number of targets to visit, working memory related constraints will influence path planning. To gain first insight into the impact of these memory systems on planning performance, planning processes, and planning strategies applied, we asked participants to solve navigational TSPs of different sizes in a large experimental room. Spaces of this scale are referred to as vista spaces (Montello, 1993) as they can be apprehended from a single place. Vista spaces combine features of figural (pictorial) spaces with characteristics of environmental spaces that are crucial for navigation: while the entire environment can be overlooked (as in figural spaces), spatial relations between observer and targets change during locomotion which requires memory and updating processes (as in environmental spaces). Carrying out the experiments in a vista space also allowed us to control the visual accessibility of symbols defining the target locations. By this means we systematically varied the memory demands required (no memory, SWM, and LTM). To solve the TSPs efficiently, participants needed to judge the local distances between any two target locations. It is well-known that spatial memory of large scale (environmental) spaces is subject to systematic distortions (e.g., McNamara & Diwadkar, 1997) and it has been questioned whether humans do possess Euclidean metric spatial knowledge of such spaces (e.g., Foo, Warren, Duchon, & Tarr, 2005; Foo, Duchon, Warren, & Tarr, 2007). There is, however, convincing evidence that humans can quite accurately judge and estimate distances in vista spaces, as used in this study. If they are, for example, shown a target location and are then asked to walk towards it with their eyes closed, they usually end up near the physical target locations (Philbeck, Loomis, & Beall, 1997; Loomis, Klatzky, Philbeck, & Golledge, 1998). Furthermore, in vista space, participants can reliably distinguish paths composed of multiple segments if these paths differed as little as 1.7% in total length (Wiener, Lafon, & Berthoz, 2008). Knowledge about local distances between goal locations is sufficient for solving the TSPs in this study; full metric embeddings as are discussed by Foo et al. (2005) are not required.

Experiment 1 pursued two main purposes. First, it was designed to establish the novel approach and to test for participants’ general performance in solving navigational TSPs in vista space. For this, participants’ performance of finding the shortest path in TSPs with varying number of targets was evaluated. Second, the experiment examined two simple planning strategies, the NN strategy and the cluster-strategy (see above) that have been suggested to be involved in visual TSPs as well as in path planning in large scale spaces. Interviews with participants after the experiments allowed for insights into further planning strategies particularly relevant in the current experimental approach. This information was used to design Experiment 2, in which memory demands were systematically varied between experimental conditions: similar to the visual TSPs, one condition required no memory; a second condition required to memorize the locations of the targets (SWM) during planning; a third condition additionally required to retrieve the target locations from spatial LTM (SWM + LTM). Comparisons of planning performance and the usage of planning strategies and heuristics between conditions allowed investigating the influence of the different memory systems on spatial problem solving.

Experiment 1

Materials and methods

The experimental setup

The experiment was conducted in a 6.0 × 8.4 m experimental room. Twenty-five small cardboard boxes were arranged on a 5 × 5 squared grid with a mesh size of 1.1m. Twenty-five symbols were randomly distributed about the 25 pillars (see Fig. 1). In order to control for effects of the specific symbol-configuration, two versions of the setup were created that differed only in the specific arrangement of the symbols. Half of the participants conducted the experiment in one configuration, the other half conducted the experiment in the alternative configuration.

Procedure

Participants were repeatedly asked to solve TSPs. For each TSP they received a ‘shopping list’ depicting the symbol defining the start location and the symbols defining the target locations (Fig. 2). Participants were given the lists in random order, one at a time, and upside-down. They were verbally informed about the start location and asked to move to that location. Only after reaching the start location, they were allowed to turn around the shopping list and the trial started. The participants’ task was to navigate the shortest route connecting the start location with all target locations and return to the start location assuming straight line distances between target locations. During navigation, they kept the shopping list and marked visited target locations by placing little black markers on the cardboard pillars.

To control for the influence of the specific order of the symbols depicted on the shopping list, two versions of each shopping list were generated. Half of the participants received one version of the shopping lists, the other half received the other version.

Types of navigation tasks

Each participant solved 36 different TSPs consisting of a start location plus 4, 5, 6, 7, 8 or 9 target places (TSP sizes therefore range between 5 and 10) in random order. The TSPs could be subdivided into three types, NN-adequate tasks, NN-inadequate tasks, and cluster-tasks (NN-ambiguous tasks) (see Fig. 3; Table 1 for a complete list of all TSPs).

1.
NN-adequate/inadequate tasks For NN-adequate tasks, the predictions of the NN algorithm were identical with the optimal (i.e., the shortest possible) path. For NN-inadequate tasks, the NN algorithm did predict a clearly sub-optimal path (see Fig. 3).
2.
Cluster tasks (NN-ambiguous) Here target locations were distributed in two distinct clusters of unequal size. These TSPs were NN-ambiguous (i.e., the NN-algorithm did not make clear predictions for a single path): the closest target locations were always equidistant from the starting place and similar situations re-occurred during navigation (i.e., close target locations were equidistant from the current position).

Table 1 The table lists all navigation tasks of Experiment 1

Full size table

Participants

Twenty-four participants (12 women, mean age: 22.88 years) participated in the experiment. They were mostly university students and were paid 8 Euro an hour.

Analysis

The sequence of places visited were recorded for each TSP and the length of the resulting tour was calculated, assuming linear route segments between target points. For each TSP we also computed the optimal solution by comparing the length of all possible permutations. Performance of planning and executing a short route was assessed by comparing the length of the chosen path with the length of the optimal solution and was described in percentage above optimal (PAO; Wiener et al., 2008). A PAO value of 10 corresponded to a path that was 10% longer than the optimal solution. Furthermore, the percentage of trials in which participants found the optimal solution was calculated (found shortest route). For each trial the start time (the time from revealing the shopping list until initiating locomotion) was recorded.

Due to the large number of main effects and interactions tested in this experiment, an α level of 0.01 was used. The error bars of all plots in this study display standard errors of the mean (SEM).

Predictions

It was expected that performance of finding the shortest possible route decreased with increasing number of targets of the TSP. This expectation was supported by two considerations. First, the number of route alternatives that had to be considered during planning increases with increasing TSP size. Second, working memory load is higher if more targets have to be memorized and dealt with. For sufficiently large numbers of targets, it will not be possible to simultaneously hold their positions in working memory. Thus, paths cannot be planned taking all targets into account. In the following the specific predictions for the different types of navigation tasks are summarized.

1.
NN-adequate/inadequate tasks If participants employed the NN-strategy, they should find and navigate the optimal path when confronted with NN-adequate tasks. When confronted with NN-inadequate tasks, on the other hand, they should systematically fail to find the optimal paths (see Fig. 3). In other words, if participants employed the NN-strategy we expect better performance in NN-adequate than in NN-inadequate tasks.
2.
Cluster tasks (NN-ambiguous) If participants applied the cluster-strategy, stating that they plan paths in order to visit as many targets as fast as possible, they should first visit the large rather than the small target cluster.

Results

Neither the specific configuration of symbols on the 5 × 5 grid, nor the order of the symbols depicted on the shopping list influenced participants performance.

Experimental condition

Percentage above optimal (PAO) On average, PAO was 5.86. Even for the largest TSPs (nine targets plus start place), participants produced less than 10 PAO (see Fig. 4). For three TSPs with nine targets plus start place, the PAO values for all 181,440 path alternatives was exemplarily calculated. Less than 0.08% of all path alternatives had PAO values equal or below the values produced by participants. The performance of the participants was thus remarkably above chance.

An ANOVA revealed a significant main effect of the number of targets [F(5, 115.86) = 17.25, P < 0.001], while no main effect for the type of navigation task [F(2, 46.22) = 1.57, P = 0.22] and no interaction [F(10, 233.01) = 1.53, P = 0.13] were found. PAO increased with increasing number of targets (Pearson’s product-moment correlation: r = 0.94, P < 0.01). PAO did not differ between female and male participants [6.71 vs. 4.76%, t-test: t(22) = 1.63, P = 0.12].

Found shortest route On average participants found the shortest possible route in 47.3% of the trials. An ANOVA revealed a significant main effect for the number of targets [F(5, 116.02) = 25.37, P < 0.001] and the type of navigation task [F(2, 46.27) = 79.09, P < 0.001] as well as a significant interaction [F(10, 233.01) = 6.88, P < 0.001]. While a Pearson’s product-moment correlation revealed only a marginally significant correlation between performance of finding the optimal route and the number of target locations (r = −0.80, P = 0.06), a significant difference was found between small TSPs (with 4–6 targets plus start place), and larger TSPs [with 7–9 targets plus start place; 32.7 vs. 61.6%, t-test: t(23) = 8.89, P < 0.001]. Performance of finding the optimal solution did not differ between female and male participants [44.13 vs. 50.87%, t-test: t(22) = −1.43, P = 0.17].

Performance in finding the optimal route did not differ between cluster tasks and NN-inadequate tasks [34.28 vs. 35.86%, t-test: t(23) = 0.47, P = 0.64], but differed both, between cluster tasks and NN-adequate tasks [34.28 vs. 72.32%, t-test: t(23) = 10.88, P < 0.001], and between NN-adequate tasks and NN-inadequate tasks [72.32 vs. 35.86%, t-test: t(23) = 10.19, P < 0.001].

Start time On average start time was 22.10 s. An ANOVA revealed a significant main effect for the number of targets [F(5, 115) = 24.02, P < 0.001] while no main effect for type of navigation task [F(2, 46) = 1.75, P = 0.19] and no interaction [F(10,230) = 1.21, df = 10, P = 0.29] was found. Start time increased with increasing number of targets (Pearson’s product-moment correlation: r = 0.95, P < 0.01). Start time did not differ between female and male participants [23.2 vs. 20.9 s, t-test: t(22) = 0.63, P = 0.53].

Predictions of the NN-algorithm The PAO predictions when using a NN-strategy were calculated for the different types of navigation tasks: for NN-adequate tasks it was obviously 0, for NN-inadequate tasks it was 16.92 and for cluster-tasks it was 8.13 (cluster tasks were NN-ambiguous: the NN strategy did not predict a single but multiple solutions as it was faced with situations in which the closest target locations were equidistant from its current position. PAO values were calculated by averaging over the different solutions predicted by the NN strategy). Participants’ PAO for both, the cluster-tasks and the NN-inadequate tasks were significantly smaller than predicted by the NN-algorithm [cluster-tasks: 5.38 vs. 8.13%, t-test: t(23) = 4.39, P < 0.001; NN-inadequate tasks: 6.60 vs. 16.92%, t-test: t(23) = 10.56, P < 0.001]. For NN-adequate tasks, PAO was significantly higher than predicted by the NN-algorithm [5.24 vs. 0%: t-test: t(23) = 7.05, P < 0.001].

Correlations between participants’ start time and overshoot performance Mean start time was negatively correlated with PAO (r = −0.42, P = 0.04), demonstrating that participants who took longer before initiating their trials showed better planning performance.

Cluster tasks In the cluster tasks the target locations were distributed in two distinct target clusters of unequal size. Overall, participants showed a significant preference to first visit the large cluster [59.02% vs. chance level (50%), t test: t(23) = 3.09, P < 0.01].

Discussion

Overall, PAO performance when solving the TSPs was remarkably good. On average, participants produced PAO values of less than 6. Even for the most complex navigation tasks with nine targets (plus start place), participants produced PAO values of ∼10. The fact that less than 0.08% of all path alternatives of the largest TSPs tested produce PAO values below 10 emphasizes participants’ remarkably good performance. With increasing TSP size, performance for finding the optimal solution decreased while start time increased. These results were expected for two reasons: (1) with increasing number of target locations the computational complexity of a TSP increases as more alternative solutions have to be taken into account; (2) the task of localizing and memorizing the positions of all target locations becomes more challenging as the number of target locations increases (i.e., SWM load increases).

Types of navigation tasks The most important result with respect to the planning strategies applied was that participants outperformed the NN-algorithm on NN-inadequate tasks and on cluster tasks. Together with the result that performance on NN-adequate tasks was significantly worse than predicted by the NN-algorithm, this clearly demonstrates that the NN-algorithm is not sufficient to explain human path planning in such navigational TSPs (for similar results in visual TSPs, see Graham et al., 2000). In cluster-tasks the target locations were distributed in two distinct target clusters of unequal size. Participants showed a preference to first visit the large target cluster as compared to the small target cluster. This result is in line with earlier work (Wiener et al., 2004) providing additional support for the cluster-strategy, stating that participants plan their routes in order to visit as many targets as fast as possible (for similar results in vervet monkeys see Cramer & Gallistel, 1997). While for both, path planning performance (PAO) and start time, no significant differences could be found between the three types of navigation tasks, performance of finding the shortest route was almost twice as good for NN-adequate tasks than for NN-inadequate tasks and cluster-routes. This dissociation between planning performance (PAO) and performance of finding the optimal route suggests that many errors on cluster- and NN-inadequate tasks were insignificant with respect to the resulting PAO values.

Interviews with participants Further insights into planning strategies came from informal interviews with participants after the experiments. Most of them reported to have applied one of two strategies when faced with larger TSPs: (1) Participants subdivided the 25 locations into a (differing) number of regions. During planning, they assigned the actual target locations to these regions and planned a coarse route on that region level. Such coarse routes are simple and easily remembered and a fine-detailed plan can be created by inserting close-by target locations during navigation; (2) Participants first selected a subset of target locations depicted on the shopping list according to some criteria, for example, color. They then planned a coarse route taking into account only this subset. Again, this route plan is simple and easily remembered and can be refined either before or during navigation by inserting the missing target locations into the route. Both of the reported navigation strategies follow essentially the same logic: they simplify the planning task by applying a hierarchical planning scheme. First, a coarse and simple path plan is generated on basis of an abstraction of the environment or the planning task itself. This path plan is then refined during navigation by inserting target locations.

Experiment 2

Motivation

Experiment 2 served two main purposes: (1) to test the region based planning strategy reported by participants in informal interviews after Experiment 1; (2) to test for the role and impact of different memory systems for spatial problem solving and optimization.

Informal interviews with participants in Experiment 1 suggested that one planning strategy was based on participants’ regionalizations of the environment. If based on regions, path planning becomes a hierarchical process. First, a coarse route plan is generated on the level of the regions exclusively. This plan is then refined during navigation. Such a planning scheme states that first all target locations in one region are visited before the next region is entered. Experiment 2 tested this region-based planning strategy. For this, the environment was subdivided into different objective regions and participants solved similar TSPs as in Experiment 1.

Results from Experiment 1 furthermore suggested that capacity limits of SWM had a crucial impact on both, planning performance and start time. Here we specifically tested for the role and impact of both SWM and spatial LTM for path planning by systematically manipulating memory demands between the experimental conditions.