Improving Parallel State-Space Exploration Using Genetic Algorithms

Renault, Etienne

doi:10.1007/978-3-030-00359-3_9

Etienne Renault¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11181))

Included in the following conference series:

International Conference on Verification and Evaluation of Computer and Communication Systems

276 Accesses
1 Citations

Abstract

The verification of temporal properties against a given system may require the exploration of its full state space. In explicit model-checking this exploration uses a Depth-First-Search (DFS) and can be achieved with multiple randomized threads to increase performance.

Nonetheless the topology of the state-space and the exploration order can cap the speedup up to a certain number of threads. This paper proposes a new technique that aims to tackle this limitation by generating artificial initial states, using genetic algorithms. Threads are then launched from these states and thus explore different parts of the state space.

Our prototype implementation runs 10% faster than state-of-the-art algorithms. These results demonstrate that this novel approach worth to be considered as a way to overcome existing limitations.

Access provided by CONRICYT-eBooks. Download conference paper PDF

Improving swarming using genetic algorithms

Article 02 June 2020

Many-core on-the-fly model checking of safety properties using GPUs

Article Open access 29 April 2015

Parallelizing Heuristics for Generating Synchronizing Sequences

1 Introduction and Related Work

Model checking aims to check whether a system satisfies a property. Given a model of the system and a property, it explores all the possible configurations of the system, i.e., the state space, to check the validity of the property. Typically two kind of properties are distinguished, safety and liveness properties. This paper focus on safety properties that are of special interest since they stipulate that some “bad thing” does not happen during execution. Nonetheless the adaptation of this work for liveness properties is straightforward.

The state-space exploration techniques for debugging and proving correctness of concurrent reactive systems has proven their efficiency during the last decades [3, 13, 18, 21]. Nonetheless they suffer from the well known state space explosion problem, i.e., the state space can be far too large to be stored and thus explored in a reasonable time. This problem can be addressed using symbolic [4] or explicit techniques even if we only consider the latter one in this paper.

Many improvements have been proposed for explicit techniques. On-the-fly exploration [5] computes the successors of a state only when required by the algorithm. As a consequence, if the property does not hold, only a subset of the state space is constructed. Partial Order Reductions (POR) [15, 19, 23] avoid the systematic exploration of the state space by exploiting the interleaving semantic of concurrent systems. State Space Caching [9] saves memory by “forgetting” states that have already been visited causing the exploration to possibly revisit a state several times. Bit-state Hashing [11] is a semi-decision procedure in which each state is associated to a hash value. When two states share the same hash value, one of this two states (and thus its successors) will be ignored.

The previous techniques focus on reducing the memory footprint during the state-space exploration. Combining these techniques with modern computer architectures, i.e., many-core CPUs and large RAM memories, tends to shift from a memory problem to an execution time problem which is: how this exploration can be achieved in a reasonable time?

To address this issue multi-threaded (or distributed) exploration algorithms (that can be combined with previous techniques) have been developed [2, 7, 12, 18]. Most of these techniques rely on the swarming technique presented by Holzmann et al. [13]. In this approach, each thread runs an instance of a verification procedure but explores the state space with its own transition order.

Nowadays, best performance is obtained when combining swarming with Depth-First Search (DFS)^{Footnote 1} based verification procedures [3, 21]. In these combinations, threads share information about states that have been fully explored, i.e. states where all successors have been visited by a thread. Such states are called closed states. These states are then avoided by other threads explorations since they can not participate in invalidating the property. These swarmed-DFS algorithms are linear but their scalability depends on two factors:

Topology problems. :: If the state space is linear (only one initial state, one successor per state), using more than one thread cannot achieve any speedup. This issue can be generalized to any state space that is deep but not wide.
Exploration order problems. :: States are tagged closed following the DFS postorder of a given thread. Thus, a state s can only be marked closed after visiting at least N states, where N is the minimal distance between the initial state and s.

		1 thread	2 threads	4 threads	8 threads	12 threads
Time in milliseconds		2 960 296	1 796 418	118 6344	981 222	978 711
Speedup		1	1.65	2.50	3.016	3.025

The table above highlights this scalability problem over the benchmark^{Footnote 2} used in this paper. It presents the cumulated exploration time (in a swarmed DFS fashion) for 38 models extracted from the literature. It can be observed that this algorithm achieves reasonable speedup up to 4 threads but is disappointing for 8 threads and 12 threads (the maximum we can test).

This paper proposes a novel technique that aims to keep improving the speedup as the number of threads increases and which is compatible with all memory reduction methods presented so far.

The basic idea is to use genetic algorithms to generate artificial initial states (Sects. 2 and 3). Threads are then launched with their own verification procedure from these artificial states (Sects. 4 and 5). We expect that these threads will explore parts of the state space that are relatively deep regarding to (many) DFS order(s). Thus, some states may be marked as closed without processing some path between the original initial state to these states.

Our prototype implementation (Sect. 6) has encouraging performances: the proposed approach runs 10% faster (with 12 threads) than state-of-the-art algorithms (with 12 threads). These results are encouraging and show that this novel approach worth to be considered as a way to overcome existing limitations.

Related Work. To our knowledge, the combination of parallel state space exploration algorithms with the generation of artificial initial states using genetic algorithms has never been done. The closest work is probably the one of Godefroid and Khurshid [8] that suggests to use genetic programming as an heuristic to help random walks to select the best successor to explore. The generation of other initial states have been proposed to maximize the coverage of random walks [22]: to achieve this, a bounded BFS is performed to obtain a pool of states that can be used as seed states. This approach does not help the scalability when the average number of successors is quite low (typically when mixing with POR).

In the literature there are some work that combine model checking with genetic programming but they are not related to the work presented here: Katz and Peled [14] use it to synthesize parametric programs, while all the other approaches are based on the work of Ammann et al. [1] and focus on the automatic generation of mutants that can be seen as particular “tests cases”.

2 Parallel State Space Exploration

Preliminaries. Concurrent reactive systems can be represented using Transitions Systems (TS). Such a system \(T = \langle Q, \iota , \delta , V, \gamma \rangle \) is composed of a finite set of states Q, an initial state \(\iota \in Q\), a transition relation \(\delta \subseteq Q \times Q\), a finite set of integer variables V and \(\gamma : Q \rightarrow \mathbb {N}^{|V|}\) a function that associates to each state a unique assignment of all variables in V. For a state \(s\in Q\), we denote by \(\texttt {post}(s) = \{ d \in Q \mid (s,d) \in \delta \}\) the set of its direct successors. A path of length \(n\ge 1\) between two states \(q,q'\in Q\) is a finite sequence of transitions \(\rho = (s_1, d_1)\ldots (s_n, d_n)\) with \(s_1 = q\), \(d_i = q'\), and \(\forall i \in \{1,\ldots ,n-1\},\,d_i = s_{i+1}\). A state q is reachable if there exists a path from the initial state \(\iota \) to q.

Swarming. Checking temporal properties involves the exploration of (all or some part) of the state space of the system. Nowadays, best performance is obtained by combining on-the-fly exploration with parallel DFS reachability algorithms. Algorithm 1 presents such an algorithm.

This algorithm is presented recursively for the sake of clarity. Lines 4 and 5 represent the main procedure: ParDFS takes two parameters, the transition system and the number n of threads to use for the exploration. Line 5 only launches n instances of the procedure DFS. This last procedure takes three parameters, s the state to process, tid the current thread number and a color used to tag new visited states. Procedure DFS represents the core of the exploration. This exploration relies on a shared hashmap visited (defined line 2) that stores all states discovered so far by all threads and associate each state with a color (line 1):

open indicates that the state (or some of its successors) is currently processed by (at least) a thread,
closed indicates that the states and all its successors (direct or not) have been visited by some thread.

The DFS function starts (lines 7 to 8) by checking if the parameter s has already been inserted, by this thread or another one, in the visited map (line 7). If not, the state is inserted with the color open (line 7). Otherwise, if s has already been inserted we have to check whether this state has been tagged closed. In this case, s and all its successors have been visited: there is no need to revisit them. Line 9 grabs all the successors of the state s that are then shuffled to implement the swarming. Finally lines 10 to 14 perform the recursive DFS: for each successor \(s'\) of the current state, if \(s'\) has not been tagged closed a recursive call is launched. When all successors have been visited, s can be marked closed.

One can note that a shared Boolean \(\textit{stop}\) is used in order to stop all threads as soon as a thread closes the initial state. This Boolean is useless for this algorithm since, when the first threads ends, all reachable states are tagged closed and every thread is forced to backtrack. Nonetheless this Boolean will be useful later (see Sect. 4). Moreover the visited map is thread safe (and lock-free) so that it does not degrade performances of the algorithm.

Problem Statement. The previous algorithm (or some adaptations of it [3, 21]) obtains the best performance for explicit model checking. Nonetheless this swarmed algorithm suffers from a scalability problem. Figure 1 describes a case where augmenting the number of threads will not bring any speedup^{Footnote 3}. This figure describes a transition system that is linear. The dotted transitions represent long paths of transitions. In this example, state x cannot be tagged closed before state y and all the states between x and y have been tagged closed. The problem here is that all threads start from state s. Since threads have similar throughput they will discover x and y approximately at the same time. Thus they cannot benefit from the information computed by the other threads. This example is pathological but can be generalized to any state space that is deep and narrow.

Suppose now that there are 2 threads and that the distance between s and x is the same than the distance between x and y. The only way to obtain the maximum speedup is to launch one thread with a DFS starting from s and launch the other thread from x. In this case, when the first thread reaches state x, x has just been tagged closed: the first thread can backtrack and stop.

A similiar problem arise when performing on-the-fly model checking since (1) there is only one initial state and (2) all states are generated during the exploration. Thus a thread cannot be launched from a particular state. Moreover, the system’s topology is only known after the exploration: we need a technique that works for any kind of topology.

The idea developed in this paper is the automatic generation of state x using genetic algorithms. The generation of the perfect state (the state x in the example) is a utopia. Nonetheless if we can generate a state relatively deep regarding to many DFS orders, we hope to avoid redundant work between threads, and thus maximize the information shared between threads. In practice we may generate states that do not belong to the state space, but Sect. 6 shows that more than 84% of generated states belongs to it.

3 Generation of Artificial Initial State

Genetic Algorithms. For many applications the computation of an optimal solution is impossible since the set of all possible solutions is too large to be explored. To address this problem, Holland [10] proposed a new kind of algorithms (now called genetic algorithms) that are inspired by the process of natural selection. These algorithms are often considered as optimizer and used to generate high-quality solutions to search problems. Basically, genetic algorithms start by a population of candidate solutions and improve it using bio-inspired operators:

Crossover: selects multiple elements in the population (the parents) and produces a child solution from them.
Mutation: selects one element in the population and alters it slightly.

Applying and combining these operators produces a new generation that can be evaluated using a fitness function. This fitness function allows to select the best elements (w.r.t the considered problem) of this new population. These best elements constitute a new population on which mutation and crossover operations can be re-applied. This process is repeated until some satisfying solution is found (or until a maximal number of generations has been reached).

Genetic algorithms rely on a representation of solutions that is chromosome-like. In the definition of a transition system we observe that every state can be seen as a tuple of integer variables using the \(\gamma \) function. Each variable can be considered as a gene and the set of variables can be considered as a chromosome composed of 0 and 1. For instance, if a state is composed of two variables \(a=42\) and \(b=51\) the resulting chromosome (considering 8 bits integers) would be the one described Fig. 2.

Crossover. Concurrent reactive systems are generally composed of a set of \(N_p\) processes and a set of shared variables (or channels). Given a transition system \(T = \langle Q, \iota , \delta , V, \gamma \rangle \) we can define \(E: V \rightarrow [0, N_p]\), such that if v is a shared variable, E(v) returns 0 and otherwise E(v) returns the identifier of the process where the variable v is defined.

Algorithm 2 defines the crossover operation we use. This algorithm takes a parameter S which represents the population to use for generating a new state. Line 2 instantiates a new state s that will hold the result of the crossover operation. Lines 3 to 5 set up the values of the shared variables of s: for each shared variable v, an element of S is randomly selected to be the parent. Then, at line 5, one can observe that \(\gamma (s)[v]\) (the value of v in s) is set according to \(\gamma (parent)[v]\) (the value of v in the parent). Lines 6 to 9 perform a similar operation on all the remaining variables.

These variables are treated by batch, i.e., all the variables that belong to a same process are filled using only one parent (Line 7). This choice implies that in our Crossover algorithm the local variable of a process cannot have two different parents: this particular processing helps to exploit the concurrency of underlying system. A possible result of this algorithm is represented Fig. 3 (with 8 bits integer variables, only one process, no shared variables, \(S=\{parent1,\) \(parent2\}\) and child the state computed by Crossover(S)).

Mutations. The other bio-inspired operator simulates alterations that could happen while genes are combined over multiples generations. In genetic algorithms, these mutations are performed by switching the value of a bit inside of a gene. Here, all the variables of the system are considered as genes.

Algorithm 3 describes this mutation. For each variable in the state s (line 2), a random number is generated. A mutation is then performed only if this number is above a fixed threshold (line 4): this restriction limits the number of mutations that can occur in a chromosome. We can then select randomly a bit in the current variable v and flip it (line 5). Finally, line 6 exploits the information we may have about the system by restricting the mutated variable to its bounds.

Indeed, even if all variables are considered as integer variables there are many cases where the bounds are known a priori: for instance Boolean, enumeration types, characters, and so on are represented as integers but the set of value they can take is relatively small regarding the possible values of an integer. A possible result of this algorithm is represented Fig. 4 (with 8 bits integer variables and only two character variables, i.e., that have values between [0..255]).

Fitness. As mentioned earlier, every new population must be restricted to the only elements that help to obtain a better solution. Here we want to generate states that are (1) reachable and (2) deep with respect to many DFS orders. These criteria help the swarming technique by exploring parts of the state space before another thread (starting from the real initial state) reaches them.

We face here a problem that is: for a given state it is hard to decide whether it is a good candidate without exploring all reachable states. For checking deadlocks (i.e., states without successors) Godefroid and Khurshid [8] proposed a fitness function that will only retains state with few transitions enabled^{Footnote 4}.

Since we have different objectives a new fitness function must be defined. In order to maximize the chances to generate a reachable state, we compute the average outgoing transitions (\(T_{\textit{avg}}\)) of all the states that belong to the initial population. Then the fitness function uses this value as a threshold to detect good states. Many fitness function can be considered:

equality: the number of successors of a good state is exactly equal to \(T_{\textit{avg}}\). The motivation for this fitness function is that if there are \(N > 1\) independent processes that are deterministic then at every time, any process can progress. Thus a good state has exactly N (equal to \(T_{\textit{avg}}\)) outgoing transitions.
lessthan: the number of successors of a good state is less than \(T_{\textit{avg}}\). The motivation for this fitness function is that if there are \(N > 1\) independent deterministic processes that communicate then at any time each process can progress or two processes can be synchronized. This latter case will reduce the number of outgoing transitions
greaterthan: the number of successors of a good state is greater than \(T_{\textit{avg}}\). The motivation for this fitness function is that if there are \(N > 1\) independent and non-deterministic processes then at any time each processes can perform the same amount of actions or more.

Generation of Artificial State. Algorithm 4 presents the genetic algorithm used to generate artificial initial states using the previously defined functions.

The only parameter of this algorithm is the initial population S we want to mutate: S is obtained by performing a swarmed bounded DFS and keeping trace of all encountered states. From the initial population S, a new generation can be generated (lines 4 to 8). At any time the next generation is stored in \(S'\) (lines 7 and 3). The algorithm stops after nb_generation generations (line 2). Note that this algorithm can report an empty set according to the fitness function used.

4 State-Space Exploration with Genetic Algorithm

This section explains how Algorithm 1 can be adapted to exploit the generation of artificial initial states mentioned in the previous section. Algorithm 5 describes this parallel state-space exploration using genetic algorithm. The basic idea is to have a collaborative portfollio approach in which threads will share information about closed states. In this strategy, half of the available threads runs a the DFS algorithm presented Sect. 2, while the other threads perform genetic exploration. This exploration is achieved by three steps:

1.
Perform swarmed bounded depth-first search exploration that stores into a set \(\mathcal {P}\) all encountered states (line 7). This exploration is swarmed, so that each thread has a different initial population \(\mathcal {P}\). (Our bounded-DFS differs from the literature since it refers DFS that stops after visiting N states.).
2.
Apply Algorithm 4 on \(\mathcal {P}\) to obtain a new population \(\mathcal {P}'\) of artificial initial states (line 8).
3.
Apply the DFS algorithm for each element of \(\mathcal {P}'\) (lines 9 to 11). When the population \(\mathcal {P}'\) is empty, just restart the thread with the initial state \(\iota \) (line 12).

One can note (line 1) that the color enumeration has been augmented with open_gp. This new status may seem useless for now but allows to distinguish states that have been discovered by the genetic algorithm from those discovered by the traditional algorithm. In this algorithm open_gp acts and means exactly the same than open but: (1) this status is useful for the sketch of termination proof below and, (2) the next section shows how we can exploit similar information.

Termination. Until now we have avoided mentioning one problem: there is no reason that a generated state is a reachable state. Nonetheless even if the state is not reachable, some of its successors (direct or not) may be reachable. Since the number of unreachable states is generally much larger than the number of reachable states, we have to ensure that Algorithm 5 terminates as soon as all reachable states have been explored.

First of all let us consider only threads running the DFS algorithm. Since this algorithm has already been prove (see. [21] for more details), only the intuition is given here. When all the successors of an open state have been visited, this state is tagged as closed. Since all closed states are ignored during the exploration, each thread will restrict parts of the reachable state space. At some point all the states will be closed: even if a thread is still performing its DFS procedure, all the successors of its current state will be marked closed. Thus the thread will be forced to backtrack and stop.

The problem we may have with using genetic algorithm is that all the threads performing the genetic algorithm may be running while all the other ones are idle since all the reachable states have already been visited. In this case, a running thread can see only unreachable states, i.e. open_gp, or closed ones. To handle this problem, a Boolean stop is shared among all threads (line 2). When this Boolean is set to \(\top \) all threads stop regardless the exploration technique used (line 10, Algorithm 1). We observe line 9 that the use of other artificial states is also stopped, and no restart will be performed (line 12). This Boolean is set to \(\top \) only when all the successors of the real initial state have been explored (line 16, Algorithm 1). Thus, one can note that even if a thread using the genetic algorithm visits first all reachable states it will stop all the other threads.

5 Checking Temporal Properties

Safety properties cover a wide range of properties: deadlock freedom (there is no state without successors), mutual exclusion (two processes execute some critical section at the same time), partial correction (the execution terminates in a state that does not satisfies the postcondition while the precondition of the run was satisfied), etc. One interesting characteristic of safety properties is that they can be checked using a reachability analysis (as described Sect. 2). Nonetheless, our genetic reachability algorithm (Algorithm 5) cannot be directly used to check safety properties. Indeed, if a thread (using genetic programming) reports an error we do not know if this error actually belongs to the state space.

Algorithm 6 describes how to adapt Algorithm 5 to check safety properties. To simplify things we focus on checking deadlock freedom, but our approach can be generalized to any safety property. This algorithm^{Footnote 5} relies on both Algorithms 1 and 5 The basic idea is still to launch half of the threads from the initial state \(\iota \) and the remaining ones from some artificial initial state (line 7).

For a thread performing reachability with genetic algorithm the differences are quite few. When a deadlock state is detected (line 24) we just tag this state as deadlock_gp rather than closed. This new status is used to mark all states leading to a deadlock state. Indeed since we do not know if the state is a reachable one we cannot report immediately that a deadlock has been found. Moreover we cannot mark this state closed otherwise a counterexample could be lost. This new status helps to solve the problem: when such a state is detected to be reachable, a deadlock is immediately reported. The other modifications are lines 20 and 22: when backtracking, if a deadlock has been found no more states will be explored.
For a thread performing reachability without genetic algorithm the differences are also quite few. Lines 16 to 18 only check if the next state to process has been marked deadlock_gp. In this case this state is a reachable one and it leads to a deadlock state. We can then report that a deadlock has been found and stop all the other threads. A deadlock can also be reported directly (line 23), if the current state is a deadlock.

Deadlock – Sketch of Proof. Due to lack of space only the schema of a proof, that the algorithm will report a deadlock if and only if there exists a reachable state that has no successors, is given here.

Theorem 1

For all systems S, the algorithm terminates.

Theorem 2

A thread reports a deadlock iff \(\exists s \in Q, post(s) = \emptyset \).

To simplify the sketch of proof, we denote by classical thread a thread that does not perform genetic algorithm while the other threads are called gp threads. The following invariants hold for all lines of Algorithm 6:

Invariant 1. :: If stop is \(\top \) then no new state will be discovered.
Invariant 2. :: A deadlock state can only be open, open_gp or deadlock_gp.
Invariant 3. :: No direct successor of a closed state is a deadlock state.
Invariant 4. :: A state is closed iff all its successors that are not on the thread’s recursive stack are closed.
Invariant 5. :: Only gp threads can tag a state deadlock_gp.
Invariant 6. :: A state is deadlock_gp iff it is a deadlock state or if one of its successors (direct or not) is a deadlock state.
Invariant 7. :: Only classical thread can report that a deadlock has been found.
Invariant 8. :: If a state is reachable then all its direct successors are reachable.

Invariants, combined to the sketch of proof of the previous section, helps to prove Theorem 1: the algorithm stops either because a deadlock is detected or because all reachable states have been explored. These invariants establish both directions of Theorem 2: invariant 7 and 8 are the most important for correctness.

Discussion. The verification of complex temporal properties involves the exploration of an automaton which is the result of the synchronous product between the state space of the system and the property automaton. Thus a state is composed of two parts: the system state and the property state. Genetic algorithms presented so far can then be applied by considering that the property state is a variable just like the other system’s variables. The adaptation of Algorithm 6 for checking liveness properties is straightforward: when a gp thread detects an accepting cycle, all the states forming it are tagged with an accepting_cycle status. When a classical thread detects such a state, a counterexample is raised.

6 Evaluation

Benchmark Description. To evaluate the performance of our algorithms, we selected 38 models from the BEEM benchmark [16] that cover all types of models described by the classification of Pelánek [17]. All the models where selected such that Algorithm 1 with one thread would take at most 40 min on Intel(R) Xeon(R) @ 2.00 GHz with 250 GB of RAM. This six-core machine is also used for the following parallel experiments^{Footnote 6}. All the approaches proposed here have been implemented in Spot [6]. For a given model the corresponding system is generated on-the-fly using DiVinE 2.4 patched by the LTSmin team^{Footnote 7}.

Reachability. To evaluate the performance of the algorithm presented Sect. 4 we conducted 9158 experiments, each taking 30 s on the average. Table 1 reports selected results to show the impact of the fitness function and the threshold over the performance of Algorithm 5 with 12 threads (the maximum we can test). For each variation, we provide nb the number of models computed within time and memory constraints, and Time the cumulated walltime for this configuration (to run the whole benchmark). For a fair-comparison, we excluded from Time models that cannot be processed. Table 1 also reports state-of-the-art and random (used to evaluate the accuracy of genetic algorithms by generating random states as seed state). This latter technique is irrelevant since it is five time slower than state-of-the-art and only process 32 models over 38.

If we now focus on genetics algorithms, we observe that the threshold highly impacts the results regardless the fitness function used: the more the threshold grows, the more models are processed within time and memory constraints.

Table 1. Impact of the threshold and the fitness function on Algorithm 5 with 12 threads (nb_generation = 3, init = 1000, pop_size = 50). The time is expressed in millisecond and is the cumulated time taken to compute the whole benchmark (38 models); nb is the number of instances resolved with time and memory limits.

Full size table

The table also reports the best threshold^{Footnote 8} for all fitness function, i.e. 0.999. It appears that greaterthan only processed 37 models: this fitness function does not seem to be a good fitness function since (1) it tends to explore useless parts of the state-space and (2) the variations of the threshold highly impacts the performance of the algorithm. All the other fitness function provide similar results for a threshold fixed at 0.999. Nonetheless we do not recommend equality since a simple variation of the threshold (0.7) could lead to extremely poor results. Our preference goes to lessthan and lessstrict since they seem to be less sensitive to threshold variation while achieving the benchmark 9% faster than state-of-the-art algorithm. Thus, while the speedup for 12 threads was 3.02 for state-of-the-art algorithm, our algorithm achieves a speedup of 3.31.

Note that the results reported Table 1 include the computation of the artificial initial states. On the overall benchmark, this computation take in average slightly less than 1 s per model (30 s for the whole benchmark). This computation has a negligible impact on the speedup of our algorithm.

We have also evaluated (not reported here, see webpage for more details) the impact of the size of the initial population and the size of each generation over the performance. It appears that augmenting (or decreasing) these two parameters deteriorate the performance. It is worth noting that the best value of all parameters are classical values regarding to state-of-the-art genetic algorithms. Finally, for each model (and lessthan as fitness), we compute a set of artificial initial states and run an exploration algorithm from each of these states. It appears that 84.6% of the 7 866 005 486 generated states are reachable states.

Safety Properties. Now that we have detected the best values for the parameters of the genetic algorithm we can evaluate the performance of our deadlock detection algorithm. In order to evaluate the performance of our algorithm we conduct 418 experiments. The benchmark contains 21 models with deadlocks and 17 models without. Table 2 compares the relative performance of state-of-the-art algorithm and Algorithm 6. For this latter algorithm, we only report the two fitness functions that give the best performance for reachability. Indeed, since Algorithm 6 is based on Algorithm 5 we reuse the best parameters to obtain the best performance. Results for detecting deadlocks are quite disappointing since our algorithm is 15% to 30% slower. A closer look to these results show that deadlocks are detected quickly and Algorithm 6 has degraded performance due to the computation of artificial initial states.

On the contrary we observe that our algorithm is 10% faster (regardless whether we use lessthan or lesstrict) than the classical algorithm when the system has no deadlock. One can note that this algorithm performs better than simple reachability algorithm. Indeed, even if the system has no deadlock: the algorithm can find non-reachable deadlock. In this case, the algorithm backtracks and the next generation is processed. This early backtracking force the use of a new generation that will helps the exploration of the reachable states. To achieve this speedup, we observe an overhead of 13% for the memory consumption. The use of dedicated memory reduction techniques could help to reduce this footprint.

Table 2. Comparison of algorithms for deadlock detection. Each runs with 12 threads, and we report the variation of two different fitness functions: lessstrict and lessthan. Results presents the cumulated time and states visited for the whole benchmark.

Full size table

Discussion. Few models in the benchmark have a linear topology, which can be considered as the perfect one for the algorithms presented in this paper. Nonetheless, we observe a global improvement of state-of-the-art algorithm. We believe that other fitness function (based on interpolation or estimation of distribution) could help to generate better states, i.e. deep with respect to many DFS orders.

7 Conclusion

We have presented some first and new parallel exploration algorithms that rely on genetic algorithms. We suggested to see variables of the model as genes and states as chromosomes. With this definition we were able to build an algorithm that generates artificial initial states. To detect if such a state is relevant we proposed and evaluate various fitness functions. It appears that these seed states improve the swarming technique. This combination between swarming and genetic algorithms has never been proposed and the benchmark show encouraging results (10% faster than state-of-the-art). Since the performance of our algorithms highly relies on the generation of good artificial states we would like to see if other strategies could help to generate better states.

This work mainly focused on checking safety properties even if we proposed an adaptation for liveness properties. A future work would be to evaluate the performance of our algorithm in this latter case. We also want to investigate the relation between artificial state generation and POR, since both rely on the analysis of processes variables. Finally, we strongly believe that this paper could serve as a basis for combining parametric model-checking with neural network.

Notes

1.
It should be noted that even if DFS-based algorithms are hard to parallelize [20] they scale better in practice than parallelized Breadth-First Search (BFS) algorithms.
2.
See Sect. 6 for more details about the benchmark.
3.
This particular case will certainly degrade performance due to contention over the shared hashmap.
4.
Godefroid and Khurshid [8] do not generate states but finite paths and their fitness fonction analyzes the whole paths to keep only those with few enabled transitions.
5.
Main differences have been highlighted to help the reader.
6.
For a description of our setup, including selected models, detailed results and code, see http://www.lrde.epita.fr/~renault/benchs/VECOS-2018/results.html.
7.
See http://fmt.cs.utwente.nl/tools/ltsmin/#divine for more details. Also note that we added some patches (available in the webpage) to manage out-of-bound detection.
8.
We evaluate other thresholds like 0.9999 or 0.99999 but it appears that augmenting the threshold does not increase performance, see the webpage for more details.

References

Ammann, P.E., Black, P.E., Majurski, W.: Using model checking to generate tests from specifications. In: ICFEM 1998, pp. 46–54, December 1998
Google Scholar
Barnat, J., Brim, L., Ročkai, P.: Scalable shared memory LTL model checking. STTT 12(2), 139–153 (2010)
Article Google Scholar
Bloemen, V., van de Pol, J.: Multi-core SCC-based LTL model checking. In: Bloem, R., Arbel, E. (eds.) HVC 2016. LNCS, vol. 10028, pp. 18–33. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-49052-6_2
Chapter Google Scholar
Burch, J.R., Clarke, E.M., McMillan, K.L., Dill, D.L., Hwang, L.J.: Symbolic model checking: \(10^{20}\) states and beyond. In: Proceedings of the Fifth Annual IEEE Symposium on Logic in Computer Science, pp. 1–33. IEEE (1990)
Google Scholar
Courcoubetis, C., Vardi, M., Wolper, P., Yannakakis, M.: Memory efficient algorithms for the verification of temporal properties. In: Clarke, E.M., Kurshan, R.P. (eds.) CAV 1990. LNCS, vol. 531, pp. 233–242. Springer, Heidelberg (1991). https://doi.org/10.1007/BFb0023737
Chapter MATH Google Scholar
Duret-Lutz, A., Lewkowicz, A., Fauchille, A., Michaud, T., Renault, É., Xu, L.: Spot 2.0 — a framework for LTL and \(\omega \)-automata manipulation. In: Artho, C., Legay, A., Peled, D. (eds.) ATVA 2016. LNCS, vol. 9938, pp. 122–129. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46520-3_8
Chapter Google Scholar
Garavel, H., Mateescu, R., Smarandache, I.: Parallel state space construction for model-checking. Technical report RR-4341, INRIA (2001)
Google Scholar
Godefroid, P., Khurshid, S.: Exploring very large state spaces using genetic algorithms. In: Katoen, J.-P., Stevens, P. (eds.) TACAS 2002. LNCS, vol. 2280, pp. 266–280. Springer, Heidelberg (2002). https://doi.org/10.1007/3-540-46002-0_19
Chapter MATH Google Scholar
Godefroid, P., Holzmann, G.J., Pirottin, D.: State space caching revisited. In: von Bochmann, G., Probst, D.K. (eds.) CAV 1992. LNCS, vol. 663, pp. 178–191. Springer, Heidelberg (1993). https://doi.org/10.1007/3-540-56496-9_15
Chapter Google Scholar
Holland, J.H.: Genetic Algorithms. Scientific American (1992)
Google Scholar
Holzmann, G.J.: On limits and possibilities of automated protocol analysis. In: PSTV 1987, pp. 339–344. North-Holland, May 1987
Google Scholar
Holzmann, G.J., Bosnacki, D.: The design of a multicore extension of the SPIN model checker. IEEE Trans. Softw. Eng. 33(10), 659–674 (2007)
Article Google Scholar
Holzmann, G.J., Joshi, R., Groce, A.: Swarm verification techniques. IEEE Trans. Softw. Eng. 37(6), 845–857 (2011)
Article Google Scholar
Katz, G., Peled, D.A.: Synthesis of parametric programs using genetic programming and model checking. In: INFINITY 2013, pp. 70–84 (2013)
Google Scholar
Laarman, A., Pater, E., Pol, J., Hansen, H.: Guard-based partial-order reduction. STTT 18, 1–22 (2014)
Google Scholar
Pelánek, R.: BEEM: benchmarks for explicit model checkers. In: Bošnački, D., Edelkamp, S. (eds.) SPIN 2007. LNCS, vol. 4595, pp. 263–267. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-73370-6_17
Chapter Google Scholar
Pelánek, R.: Properties of state spaces and their applications. Int. J. Softw. Tools Technol. Transf. (STTT) 10, 443–454 (2008)
Article Google Scholar
Pelánek, R., Hanžl, T., Černá, I., Brim, L.: Enhancing random walk state space exploration. In: FMICS 2005, pp. 98–105. ACM Press (2005)
Google Scholar
Peled, D.: Combining partial order reductions with on-the-fly model-checking. In: Dill, D.L. (ed.) CAV 1994. LNCS, vol. 818, pp. 377–390. Springer, Heidelberg (1994). https://doi.org/10.1007/3-540-58179-0_69
Chapter Google Scholar
Reif, J.H.: Depth-first search is inherently sequential. Inf. Process. Lett. 20, 229–234 (1985)
Article MathSciNet Google Scholar
Renault, E., Duret-Lutz, A., Kordon, F., Poitrenaud, D.: Variations on parallel explicit model checking for generalized Büchi automata. Int. J. Softw. Tools Technol. Transf. (STTT) 19, 1–21 (2016)
MATH Google Scholar
Sivaraj, H., Gopalakrishnan, G.: Random walk based heuristic algorithms for distributed memory model checking. Electron. Not. Theor. Comput. Sci. 89(1), 51–67 (2003)
Article Google Scholar
Valmari, A.: Stubborn sets for reduced state space generation. In: Rozenberg, G. (ed.) ICATPN 1989. LNCS, vol. 483, pp. 491–515. Springer, Heidelberg (1991). https://doi.org/10.1007/3-540-53863-1_36
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

LRDE, EPITA, Kremlin-Bicêtre, France
Etienne Renault

Authors

Etienne Renault
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Etienne Renault .

Editor information

Editors and Affiliations

Uppsala University, Uppsala, Sweden
Mohamed Faouzi Atig
Grenoble Alpes University, St Martin d’Hères, France
Saddek Bensalem
Inria Lille - Nord Europe, Villeneuve d’Ascq, France
Simon Bliudze
École Nationale Superieure de Techniques Avancées, Palaiseau, France
Bruno Monsuez

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Renault, E. (2018). Improving Parallel State-Space Exploration Using Genetic Algorithms. In: Atig, M., Bensalem, S., Bliudze, S., Monsuez, B. (eds) Verification and Evaluation of Computer and Communication Systems. VECoS 2018. Lecture Notes in Computer Science(), vol 11181. Springer, Cham. https://doi.org/10.1007/978-3-030-00359-3_9

Download citation

DOI: https://doi.org/10.1007/978-3-030-00359-3_9
Published: 31 August 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00358-6
Online ISBN: 978-3-030-00359-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Improving Parallel State-Space Exploration Using Genetic Algorithms

Abstract

Similar content being viewed by others

Improving swarming using genetic algorithms

Many-core on-the-fly model checking of safety properties using GPUs

Parallelizing Heuristics for Generating Synchronizing Sequences

1 Introduction and Related Work

2 Parallel State Space Exploration

3 Generation of Artificial Initial State

4 State-Space Exploration with Genetic Algorithm

5 Checking Temporal Properties

Theorem 1

Theorem 2

6 Evaluation

7 Conclusion

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Improving Parallel State-Space Exploration Using Genetic Algorithms

Abstract

Similar content being viewed by others

Improving swarming using genetic algorithms

Many-core on-the-fly model checking of safety properties using GPUs

Parallelizing Heuristics for Generating Synchronizing Sequences

1 Introduction and Related Work

2 Parallel State Space Exploration

3 Generation of Artificial Initial State

4 State-Space Exploration with Genetic Algorithm

5 Checking Temporal Properties

Theorem 1

Theorem 2

6 Evaluation

7 Conclusion

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation