1 Introduction

The increasing complexity of hardware designs and systems-on-chip (SoC), together with shortening timelines from prototype to mass production, has challenged the traditional RTL-based design procedures. A new paradigm was needed to allow modeling at higher levels of abstraction, gradual refinement of the model, and execution of the model during each design stage. SystemC has emerged as one of the leading solutions to the “design gap.”

SystemC is a system modeling language built as an extension of C++, providing libraries for modeling and simulation of systems on chip. It leverages the object-oriented encapsulation and inheritance mechanisms of C++ to allow for modular designs and IP transfer/reuse [24]. Various libraries provide further functionality, for example, SystemC’s Transaction-Level Modeling (TLM) library defines structures and protocols that streamline the development of high-level models. Thanks to its open-source license, actively involved community, and wide industrial adoption, SystemC has become a de facto standard modeling language within a decade after its first release.

Together, the growing popularity of SystemC and the increasing complexity of designs have motivated research efforts aimed at the verification of SystemC models using assertion-based verification (ABV), a widely used method for validation of hardware and software models [26]. With ABV, the designer asserts properties that capture design intent in a formal language, e.g., PSL [17] or SVA [45]. The model is then verified against the properties using runtime verification or formal verification techniques.

Most ABV efforts for SystemC focus on runtime verification (also called dynamic verification, testing, and simulation). This approach involves executing the model under verification (MUV) in some environment, while running monitors in parallel with the model. The monitors observe the inputs to the MUV and ensure that the behavior or the output is consistent with the asserted properties [24]. The complementary approach of formal verification attempts to produce a mathematical proof that the MUV satisfies the asserted properties. Our focus in this paper is on runtime verification.

A successful ABV solution requires two components: a formal declarative language for expressing properties, and a mechanism for checking that the MUV satisfies the properties. There have been several attempts to develop a formal declarative language for expressing temporal SystemC properties by adapting existing languages (see [42] for a detailed discussion). Tabakov et al. [42] argued that standard temporal property languages such as PSL and SVA are adequate to express temporal properties of SystemC models, after extending them with a set of Boolean assertions that capture the event-based semantics of SystemC. Enriching the Boolean layer, together with existing clock-sampling mechanisms in PSL and SVA, enables specification of properties at different levels of abstraction. Tabakov and Vardi [41] then showed how a nominal change of the SystemC kernel enables monitoring temporal assertions expressed in the framework of [42] with overhead of about 0.05–1% per monitor. (Note that [41] used hand-generated monitors, while this work focuses on automatically generated monitors.)

The second component needed for assertion-based verification, a mechanism for checking that the MUV satisfies the asserted properties, requires a method for generating runtime monitors from formal properties. For simple properties it may be feasible to write the monitors manually (cf., [20]); however, in most industrial workflows, writing and maintaining monitors manually would be an extremely high-cost, labor-intensive, and error-prone process [1]. This has inspired both academia and industry to search for methods to automate this process.

Formal, automata-theoretic foundations for monitor generation for temporal properties were laid out in [32], which showed how a deterministic finite word automaton (DFW) can be generated from a temporal property such that the automaton accepts the finite traces that violate the property. Many works have elaborated on that approach, cf. [2, 3, 14, 18, 19, 23]; see the discussion below of related work. Many of these works, e.g. [2], handle only safety properties, which are properties whose failure is always witnessed by a finite trace. Here, as in [14], we follow the framework of [32] in its full generality and we consider all properties whose failure may be witnessed by a finite trace. For example, the failure of the property “eventually q” can never be witnessed by a finite trace, but the failure of the property “always p and eventually q” may be witnessed by a finite trace.

A priori it is not clear how monitor size is related to performance, and most works on this subject have focused on underlying algorithmics, or on heuristics to generate smaller monitors, or on fast monitor generation. This paper is an attempt to shift the focus toward optimizing the runtime overhead that monitor execution adds to simulation time. We believe that this reflects more accurately the priorities of the industrial applications of monitors [2].

A large model may be accompanied by thousands of monitors [5], most of which are compiled once and executed many times, so lower runtime overhead is a crucial optimization criterion, much more than monitor size or monitor-generation time. In this paper we identify several algorithmic choices that need to be made when generating temporal monitors for monitoring frameworks implemented in software. (Please note that here we ignore the issue of integrating the monitor into the monitored code; cf. [41].) We conduct extensive experimentation to identify the choices that lead to superior performance.

We identify four issues in monitor generation: state minimization (should nondeterministic automata be determinized online or offline?); alphabet representation (should alphabet letters be represented explicitly or symbolically?); alphabet minimization (should mutually exclusive alphabet letters be eliminated?); and monitor encoding (how should the transition function of the monitor be expressed?). These choices give us a space of 33 different workflows for generating a monitor from a nondeterministic automaton.

We evaluate the performance of different monitor implementations using a SystemC model representing an adder [41]. Its advantages are that it is scalable and that it creates events at many different levels of abstraction. For temporal properties we use linear temporal logic (LTL) formulas. We use a mixture of pattern and random formulas, giving us a collection of over 1,300 temporal properties. We employ a tool called CHIMP (CHIMP Handles Instrumentation and Monitoring of Properties) to manage the transformation of LTL formulas into monitors using each of the 33 workflows. Our experiments identify a specific workflow that offers the best performance in terms of runtime overhead.

2 SystemC

Many contemporary systems consist of application-specific hardware and software, and tight production cycles make it impossible to wait for the hardware to be manufactured before starting to design the software. In a typical system-on-chip architecture [10] (for example, a cell phone), there are hardware components that are controlled by software. In addition, many hardware design decisions, for example, numeric precision or the width of communication buses, are determined based on the needs of the software running on them. This has led to a design methodology where hardware and software are co-designed in the same abstract model. The partitioning between what will be implemented in hardware and what will be written as software is intentionally left blurry at the beginning, allowing the designers to consider different configurations before committing a functional block to silicon or software.

SystemC is a system-level design framework that is capable of handling both hardware and software components. It allows a designer to combine complex electronic systems and control units in a single model, to simulate and observe the behavior, and to check if it meets the performance objectives. In the strict sense of the word, SystemC is not a new language. In fact, it is a library of C++ classes and macros that model hardware components, like modules and channels; provide hardware-specific data types, like 4-valued logic types; and define both abstract and specific communication interfaces, like Boolean input. SystemC is built entirely on standard C++, which means that every SystemC model can be compiled with a C++ compiler. The compiled model has to be linked with a SystemC simulator (for example, the OSCI-provided reference implementation) to produce an executable program.

Software typically executes sequentially, partly because most computer architectures have a single CPU core, and partly because a single thread of execution is easier to manage by the operating system. However, in a hardware system, many components execute simultaneously. For example, when using a cellphone to make a call, we activate simultaneously a radio subsystem that handles two-way communication with the cell tower, a signal processing unit that converts voice to signal and signal to voice, and a display controller that shows details about the conversation on the screen. Simulating such a system in software requires the ability to simulate a large number of tasks executing simultaneously, and is critical for the early stages of the design process.

SystemC addresses this issue by providing mechanisms for simulating (in software) parallel execution. This is achieved by a layered approach where high-level constructs share an efficient simulation engine [24]. The base layer of SystemC provides an event-driven simulation kernel that controls the model’s processes in an abstract manner. The kernel leverages a concept borrowed from hardware design languages, called delta cycles, to give the executing processes the illusion of parallel execution.

In SystemC, modules are the most fundamental building blocks. Similar to C++ objects, modules allow related functionality and data to be incorporated into individual entities and to remain inaccessible by the other components of the system unless exposed explicitly. This allows modules to be developed independently and to be reused or sold in commercial libraries [8]. As an example, the skeleton of a SystemC module is presented in Listing 1.

Listing 1
Skeleton code for defining a SystemC module

In this code fragment, SC_MODULE is one of SystemC’s macros; it declares a C++ class named “Nand.” Like any other C++ class, a module can declare local variables and functions. SC_CTOR is another predefined macro that simplifies the definition of a constructor for the module. A constructor of a module serves the same purpose as a constructor of a C++ class (i.e., initializing local variables, executing functions, etc.), but has some additional functionality that is specific to SystemC. For example, the processes of the module have to be declared inside the constructor. This is done using predefined SystemC macros that specify which class functions should be treated by the SystemC kernel as runnable processes. After declaring each process, the user can optionally specify its sensitivity list. The sensitivity list may include a subset of the channels and signals defined in the module, as well as externally defined clock objects or events. Whenever there is a change of value of any of the channels or signals listed in the sensitivity list, the corresponding process is triggered for execution. Listing 2 illustrates these concepts.

Listing 2
A SystemC module of a NAND gate

This code fragment declares one output and two input signals of type bool. The function some_function() implements the expected functionality of the NAND gate. The macro SC_METHOD declares it to be a SystemC process. When triggered, a method process executes from start to finish. In particular, a method process cannot suspend while waiting for some resource to become available. In contrast, a thread process may suspend its execution by calling wait(). The state of the thread process at the moment of suspension is preserved, and upon subsequent resumption (for example, when the waited-for resource becomes available) the execution continues from the point of suspension. Thread processes are declared using the macro SC_THREAD. Both thread and method processes can define a sensitivity list. Each sensitivity list declaration applies to the process immediately preceding the declaration. The sensitive declaration at the end of the module indicates that the method process some_function() should be triggered as soon as one of the input signals changes its value.

3 Related work

Most related papers that deal with monitoring focus on simplifying the monitor or reducing its number of states. Smaller monitors are important for in-circuit monitoring, for example, in post-silicon verification [5], but for pre-silicon verification, lower-overhead monitors matter more. There is a paucity of prior work focusing on minimizing runtime overhead.

For early work on constructing temporal monitors see [29]. Several papers focus on building monitors for informative prefixes, which are prefixes that violate input assertions in an “informative way.” Kupferman and Vardi [32] define informative prefixes and show how to use an alternating automaton to construct a nondeterministic finite word automaton (NFW) of size \(2^{O(|\psi|)}\) that accepts the informative prefixes of an LTL formula ψ. Kupferman and Lampert [31] use a related idea to construct an NFW of size \(2^{O(|\psi|)}\) that accepts at least one prefix of every trace that violates a safety property ψ. Two constructions that build monitors for informative prefixes are by Geilen [19] and by Finkbeiner and Sipma [18]. Geilen’s construction is based on the automata-theoretic construction of [22], while that of Finkbeiner and Sipma is based on the alternating-automata framework of [32]. Neither provides experimental results.

Armoni et al. [2] describe an implementation based on [32] in the context of hardware verification. Their experimental results focus on both monitor size and runtime overhead. They showed that the overhead is significantly lower than that of commercial simulators. Stolz and Bodden [40] use monitors constructed from alternating automata to check specifications of Java programs, but do not give experimental results. For other works that focus on minimization see [4, 30, 33].

Giannakopoulou and Havelund [23] apply the construction of [22] to produce nondeterministic monitors for X-free LTL formulas, and simulate a deterministic monitor on the fly. They provide one experimental result from the early testing of their implementation. A weakness of their approach is that their LTL semantics distinguishes between finite and infinite traces, which implies that LTL properties may have different meanings in the context of dynamic and formal verification.

Morin-Allory and Borrione [35] show how to construct hardware modules implementing monitors for properties expressed in the simple subset [25] of PSL. Pierre and Ferro [37] describe an implementation based on this construction, and present some experimental results that report runtime overhead, but they make no attempt to minimize it. Boulé and Zilic [5] show a rewriting-based technique for constructing monitors for the simple subset of PSL. They provide substantial experimental results, but focus on monitor size rather than runtime overhead.

Chen et al. describe a general framework of Monitoring-Oriented Programming (MOP) [11]. In MOP, runtime monitoring is supported as a fundamental principle for building reliable software: monitors are automatically synthesized from specified properties and integrated into the original system to check its dynamic behaviors.

D’Amorim and Roşu [14] show how to construct monitors for minimal bad prefixes of temporal properties, without any restriction on whether the property is a safety property. They construct a nondeterministic finite automaton of size \(2^{O(|\psi|)}\) that extracts the safety content of ψ, and simulate a deterministic monitor on the fly. They present two optimizations: one reduces the size of the automaton, while the other searches for an ordering of the outgoing transitions that minimizes the overall expected cost of running the monitor. They measure experimentally the size of the monitors for a few properties, but do not measure their runtime performance. A similar construction, but without any of the optimizations, is also described by Bauer et al. [3].

4 Theoretical background

Let AP be a finite set of atomic propositions and let \(\varSigma = 2^{AP}\) be a finite alphabet. Given a temporal specification ψ over AP, we denote the set of models of the specification by \(\mathcal{L}(\psi) = \{w \in\varSigma^{\omega}\ |\ w \models\psi\}\). Let \(u \in \varSigma^{*}\) denote a finite word. We say that u is a bad prefix for \(\mathcal{L}(\psi)\) iff \(\forall\sigma\in\varSigma^{\omega}: u\sigma\not\in \mathcal{L}(\psi)\) [32]. Intuitively, a bad prefix cannot be extended to an infinite word in \(\mathcal{L}(\psi)\). A minimal bad prefix does not have a bad prefix as a strict prefix.

A nondeterministic Büchi automaton (NBW) is a tuple \(\mathcal{A}={\langle \varSigma, Q, \delta, Q^{0}, F \rangle }\), where Σ is a finite alphabet, Q is a finite set of states, \(\delta: Q \times \varSigma \to 2^{Q}\) is a transition function, \(Q^{0} \subseteq Q\) is a set of initial states, and \(F \subseteq Q\) is a set of accepting states. If \(q' \in \delta(q, \sigma)\) then we say that we have a transition from q to q′ labeled by σ. We extend the transition function \(\delta: Q \times \varSigma \to 2^{Q}\) to \(\delta: 2^{Q} \times \varSigma^{*} \to 2^{Q}\) as follows: for all \(Q' \subseteq Q\), \(\delta(Q', a) = \bigcup_{q \in Q'} \delta(q, a)\), and for all \(\sigma \in \varSigma^{*}\), \(\delta(q, a\sigma) = \delta(\delta(q, a), \sigma)\). A run of \(\mathcal{A}\) on a word \(w = a_{0} a_{1} \ldots \in \varSigma^{\omega}\) is a sequence of states \(q_{0} q_{1} \ldots\) such that \(q_{0} \in Q^{0}\) and \(q_{i+1} \in \delta(q_{i}, a_{i})\). For a run r, let \(\operatorname{\mathit{Inf}}(r)\) denote the set of states visited infinitely often. A run r of \(\mathcal{A}\) is called accepting iff \(\operatorname{\mathit{Inf}}(r) \cap F \not= \emptyset\). The word w is accepted by \(\mathcal{A}\) if there is an accepting run of \(\mathcal{A}\) on w. For a given Linear Temporal Logic (LTL) or PSL/SVA formula ψ, we can construct an NBW that accepts precisely \(\mathcal{L}(\psi)\) [44]. We use SPOT [16], an LTL-to-Büchi translation tool that is among the best available in terms of performance [39]. Using our framework for PSL or SVA would require an analogous translator.

A nondeterministic automaton on finite words (NFW) is a tuple \(\mathcal{A}= \langle\varSigma, Q, \delta, Q^{0}, F \rangle\). An NFW can be determinized by applying the subset construction, yielding a deterministic automaton on finite words (DFW) \(\mathcal{A}' = \langle \varSigma, 2^{Q}, \delta', \{Q^{0}\}, F' \rangle\), where \(\delta'(Q', a) = \bigcup_{q \in Q'} \delta(q, a)\) for \(Q' \subseteq Q\), and \(F' = \{Q' \subseteq Q : Q' \cap F \not= \emptyset\}\). For a given NFW \(\mathcal{A}\), there is a canonical minimal DFW that accepts \(\mathcal{L}(\mathcal{A})\) [28]. In the remainder of this paper, given an LTL formula ψ, we use \(\mathcal{A}_{\mathrm{NBW}}(\psi)\) to mean an NBW that accepts \(\mathcal{L}(\psi)\), and \(\mathcal{A}_{\mathrm{NFW}}(\psi)\) (respectively, \(\mathcal{A}_{\mathrm{DFW}}(\psi)\)) to mean an NFW (respectively, DFW) that rejects the minimal bad prefixes of \(\mathcal{L}(\psi)\).

Building a monitor for a property ψ requires building \(\mathcal{A}_{\mathrm{DFW}}(\psi)\). Our work is based on the construction by d’Amorim and Roşu [14], which produces \(\mathcal{A}_{\mathrm{NFW}}(\psi)\). Their construction assumes an efficient algorithm for constructing \(\mathcal{A}_{\mathrm{NBW}}(\psi)\) and is, therefore, applicable to properties expressed in a wide variety of specification languages (for example, if the property is expressed in LTL, \(\mathcal{A}_{\mathrm{NBW}}(\psi)\) can be constructed using [16]; for PSL specifications, the construction of \(\mathcal{A}_{\mathrm{NBW}}(\psi)\) can be done using [9]; etc.). Below we sketch the construction of [14] and then show how we construct \(\mathcal{A}_{\mathrm{DFW}}(\psi)\).

Given an NBW \(\mathcal{A}= {\langle \varSigma, Q, \delta, Q^{0}, F \rangle }\) and a state \(q \in Q\), define \(\mathcal{A}^{q} = \langle\varSigma, Q, \delta, \{q\}, F \rangle\). Intuitively, \(\mathcal{A}^{q}\) is the NBW defined over the structure of \(\mathcal{A}\), but with the set \(Q^{0}\) of initial states replaced by {q}. Let \(\operatorname{\mathit{empty}}(\mathcal{A}) \subseteq Q\) consist of all states \(q \in Q\) such that \(\mathcal{L}(\mathcal{A}^{q}) = \emptyset\), i.e., all states that cannot start an accepting run. The states in \(\operatorname{\mathit{empty}}(\mathcal{A})\) are “unnecessary” in \(\mathcal{A}\), because they cannot appear on an accepting run. We can compute \(\operatorname{\mathit{empty}}(\mathcal{A})\) efficiently using nested depth-first search [13]. Deleting the states in \(\operatorname{\mathit{empty}}(\mathcal{A})\) is done using the function call spot::scc_filter(), which is available in SPOT.

To generate a monitor for ψ, d’Amorim and Roşu build \(\mathcal{A}_{\mathrm{NBW}}{(\psi)}\) and remove \(\operatorname{\mathit{empty}}(\mathcal{A}_{\mathrm{NBW}}{(\psi)})\). They then treat the resulting automaton as an NFW, with all states taken to be accepting states. That is, the resulting NFW is \(\mathcal{A}= {\langle \varSigma, Q', \delta', Q^{0}\cap Q', Q' \rangle }\), where \(Q' = Q \setminus \operatorname{\mathit{empty}}(\mathcal{A})\), and δ′ is δ restricted to \(Q' \times \varSigma\). Let the automaton produced by this algorithm be \(\mathcal{A}_{\mathrm{NFW}}^{dR}{(\psi)}\).

Theorem 1

(See [14])

\(\mathcal{A}_{\mathrm{NFW}}^{dR}{(\psi)}\) rejects precisely the minimal bad prefixes of ψ.

From now on we refer to \(\mathcal{A}_{\mathrm{NFW}}^{dR}{(\psi)}\) simply as \(\mathcal{A}_{\mathrm{NFW}}(\psi)\). \(\mathcal{A}_{\mathrm{NFW}}(\psi)\) is not useful as a monitor because of its nondeterminism. One way to construct a monitor from \(\mathcal{A}_{\mathrm{NFW}}(\psi)\) is to determinize it explicitly using the subset construction. In the worst case the resulting \(\mathcal{A}_{\mathrm{DFW}}(\psi)\) is exponential in the size of \(\mathcal{A}_{\mathrm{NFW}}(\psi)\), which is why explicit determinization has rarely been used. We note, however, that we can minimize \(\mathcal{A}_{\mathrm{DFW}}{(\psi)}\), obtaining a minimal DFW. It is not clear, a priori, what impact this determinization and minimization have on runtime overhead.

An alternative way of constructing a monitor from \(\mathcal{A}_{\mathrm{NFW}}(\psi)\) that avoids the potential exponential blowup in the number of states is to use \(\mathcal{A}_{\mathrm{NFW}}{(\psi)}\) to simulate a deterministic monitor on the fly. d’Amorim and Roşu describe such a construction in terms of nondeterministic multi-transitions and binary transition trees [14]. Instead of introducing these formalisms, we use the approach of [2, 43], which presents the same concept in automata-theoretic terms. The idea in both papers is to perform the subset construction on the fly, as we read the inputs from the trace. Given \(\mathcal{A}_{\mathrm{NFW}}{(\psi)} = {\langle \varSigma, Q, \delta, Q^{0}, Q \rangle }\) and a finite trace \(a_{0}, \ldots, a_{n-1}\), we construct a run \(P_{0}, \ldots, P_{n}\) of \(\mathcal{A}_{\mathrm{DFW}}(\psi)\) as follows: \(P_{0} = Q^{0}\) and \(P_{i+1} = \bigcup_{s\in P_{i}}\delta(s, a_{i})\). The run is accepting iff \(P_{i} = \emptyset\) for some i≥0 (i.e., no transition is enabled), which means that we have read a bad prefix. Notice that each \(P_{i}\) is of size linear in the size of \(\mathcal{A}_{\mathrm{NFW}}{(\psi)}\); thus we avoid the exponential blowup of the determinization construction, at the price of having to compute transitions on the fly [2, 43].

We do not consider the property as failing if eventualities are not satisfied by the end of the simulation. Doing so would require changing the semantics of the specification and would require special treatment of the last state. Our approach maintains the same semantics for dynamic and formal verification runs and only bad prefixes are reported as failures.

The workflows that we use to generate monitors can be grouped into two types, summarized in Fig. 1.

Fig. 1
The two types of workflows we used to generate monitors. The focus of our work is on the paths from NBW to a monitor; we use SPOT as a pre-processor to generate pruned NBWs from LTL formulas

5 Monitor generation

We now describe various issues that arise when constructing \(\mathcal {A}_{\mathrm{DFW}}{(\psi)}\).

5.1 State minimization

As noted above, we can construct \(\mathcal{A}_{\mathrm{DFW}}{(\psi)}\) on the fly. We discuss in detail below how to express \(\mathcal{A}_{\mathrm{DFW}}{(\psi)}\) as a collection of C++ expressions. The alternative is to feed \(\mathcal{A}_{\mathrm{NFW}}{(\psi)}\) into a tool that constructs a minimal equivalent \(\mathcal{A}_{\mathrm{DFW}}(\psi)\). We use the BRICS Automaton tool [34]. Clearly, determinization and minimization, as well as the subsequent C++ compilation, may incur a nontrivial computational cost. Still, such a cost might be justifiable if the result is reduced runtime overhead, as assertions have to be compiled only once but are then run many times. A key question we want to answer is whether it is worthwhile to determinize \(\mathcal{A}_{\mathrm{NFW}}{(\psi)}\) explicitly, rather than on the fly.

5.2 Alphabet representation

In our formalism, the alphabet Σ of \(\mathcal{A}_{\mathrm{NFW}}{(\psi)}\) is \(\varSigma = 2^{AP}\), where AP is the set of atomic propositions appearing in ψ. In practice, tools that generate \(\mathcal{A}_{\mathrm{NBW}}(\psi)\) (SPOT in our case) often use \(\mathcal{B}(\mathit{AP})\), the set of Boolean formulas over AP, as the automaton alphabet: a transition from state q to state q′ labeled by the formula θ is a shortcut denoting all transitions from q to q′ labeled by \(\sigma \in 2^{AP}\) such that σ satisfies θ. When constructing \(\mathcal{A}_{\mathrm{DFW}}(\psi)\) on the fly, we can use formulas as letters. Automata-theoretic algorithms for determinization and minimization of NFWs, however, require comparing elements of Σ, which makes it impractical to use Boolean formulas as letters. We therefore need a different way to describe our alphabet. Below we show two ways to describe the alphabet of \(\mathcal{A}_{\mathrm{NFW}}{(\psi)}\) in terms of 16-bit integers.

5.2.1 Assignment-based representation

The explicit approach is to represent Boolean formulas in terms of their satisfying truth assignments. Let \(AP = \{p_{1}, p_{2}, \ldots, p_{n}\}\) and let \(\mathcal{F}(p_{1}, p_{2}, \ldots, p_{n})\) be a Boolean function. An assignment to AP is an n-bit vector \(a = [a_{1}, a_{2}, \ldots, a_{n}]\). An assignment a satisfies \(\mathcal{F}\) iff \(\mathcal{F}(a_{1}, a_{2}, \ldots, a_{n})\) evaluates to 1. Let \(A_{n}\) be the set of all n-bit vectors and let \(I: A_{n} \to \mathbb{Z}^{+}\) return the integer whose binary representation is a, i.e., \(I(a) = a_{1} 2^{n-1} + a_{2} 2^{n-2} + \cdots + a_{n} 2^{0}\). We define \(\mathit{sat}(\mathcal{F}) = \{I(a): a\ \mathrm{satisfies}\ \mathcal{F}\}\). Thus, the explicit representation of the automaton \(\mathcal{A}_{\mathrm{NFW}}{(\psi)} = {\langle \mathcal{B}(\mathit{AP}), Q, \delta, Q^{0}, F \rangle }\) is \(\mathcal{A}_{\mathrm{NFW}}^{abr}(\psi)={\langle \{0,\ldots,2^{n}-1\}, Q, \delta_{abr}, Q^{0}, F \rangle }\), where \(q' \in \delta_{abr}(q, z)\) iff \(q' \in \delta(q, \sigma)\) and \(z \in \mathit{sat}(\sigma)\).

5.2.2 BDD-based representation

The symbolic approach to alphabet representation leverages the fact that Ordered Binary Decision Diagrams (BDDs) [6, 7] provide canonical representations of Boolean functions. A BDD is a rooted, directed, acyclic graph with one or two terminal nodes labeled 0 or 1, and a set of variable nodes of out-degree two. The variables respect a given linear order on all paths from the root to a leaf. Each path represents an assignment to each of the variables on the path. For a fixed variable order, two BDDs are the same iff the Boolean formulas they represent are the same.

The symbolic approach uses SPOT’s spot::tgba_reachable_iterator_breadth_first::process_link() function call to get references to all Boolean formulas that appear as transition labels in \(\mathcal{A}_{\mathrm{NFW}}{(\psi)}\). The formulas are enumerated using their BDD representations (using SPOT’s spot::tgba_succ_iterator::current_condition() function call), and each unique formula is assigned a unique integer. We thus obtain \(\mathcal{A}_{\mathrm{NFW}}^{\mathit{bdd}}(\psi)\) by replacing transitions labeled by Boolean formulas with transitions labeled by the corresponding integers. While the size of \(\mathcal{B}(\mathit{AP})\) is doubly exponential in |AP|, the automaton \(\mathcal{A}_{\mathrm{NBW}}(\psi)\) is exponential in |ψ|, so the number of Boolean formulas used in the automaton is at most exponential in |ψ|.

5.2.3 From NFW to DFW

We provide both \(\mathcal{A}_{\mathrm{NFW}}^{abr}(\psi)\) and \(\mathcal{A}_{\mathrm{NFW}}^{\mathit{bdd}}(\psi)\) as inputs to BRICS Automaton, producing, respectively, minimized \(\mathcal{A}_{\mathrm{DFW}}^{abr}(\psi)\) and \(\mathcal{A}_{\mathrm{DFW}}^{\mathit{bdd}}(\psi)\). We note that neither of these two approaches is a priori a better choice. LTL-to-automata tools use Boolean formulas rather than assignments to reduce the number of transitions in the generated nondeterministic automata. However, when using \(\mathcal{A}_{\mathrm {DFW}}^{\mathit{bdd}}{(\psi)}\) as a monitor, the trace we monitor is a sequence of truth assignments, and \(\mathcal{A}_{\mathrm{DFW}}^{\mathit{bdd}}(\psi)\), while deterministic with respect to the BDD encoding of the transitions, is not deterministic with respect to truth assignments to atomic propositions. As a consequence, there is no guarantee that at each step of the monitor at most one state is reachable.

5.3 Alphabet minimization

While propositional temporal specification languages are based on Boolean atomic propositions, they are often used to specify properties involving non-Boolean variables. For example, we may have the atomic formulas (a == 0), (a == 1), and (a > 1) in a specification involving the values of a variable int a. Notice that in this example not all assignments in 2AP are consistent. For example, the assignment (a == 0) && (a == 1) is not consistent, and a transition guarded by (a == 0) && (a == 1) is never enabled. Note that such a guard can be generated even if the guard is not a subformula in the specification. By eliminating inconsistent assignments we may be able to reduce the number of letters in the alphabet exponentially without in any way changing the correctness of the monitor. The advantage of this optimization is that by identifying transitions that always evaluate to false we can exclude them from the generated monitor and thus improve its run-time performance. Identifying inconsistent assignments requires calling an SMT (Satisfiability Modulo Theory) solver [36]. Here we would need an SMT solver that can handle arbitrary C++ expressions that evaluate to type bool. Not having access to such an SMT solver, we use the compiler as an improvised SMT solver.

A set of techniques called constant folding allows compilers to reduce constant expressions to a single value at compile time (see, e.g., [12]). When an expression contains variables instead of constants, the compiler uses constant propagation to substitute values of variables in subsequent subexpressions involving the variables. In some cases the compiler is able to deduce that an expression contains two mutually exclusive subexpressions, and issues a warning during compilation. We construct a function that uses conjunctions of atomic formulas as conditionals for dummy if/then expressions, and compile the function. (We use gcc 4.0.3.) To gauge the effectiveness of this optimization we apply it using two sets of conjunctions. Full alphabet minimization uses all possible conjunctions involving atomic formulas or their negations, while partial alphabet minimization uses only conjunctions that contain each atomic formula, positively or negatively.

We compile the function and then parse the compiler warnings that identify inconsistent conjunctions. Before constructing the Büchi automaton we augment the original temporal formula to exclude those conjunctions from consideration. For example, if (a == 0) && (a == 1) is identified as an inconsistent conjunction, we augment the property ψ to ψ ∧ G(¬((a == 0) ∧ (a == 1))).
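As a concrete illustration of the compiler-based consistency check, a generated probe function might take the following shape; the function name, the boolean flag, and the particular branches are illustrative and do not reproduce the tool's actual output:

```cpp
// Hypothetical probe for the atomic formulas (a == 0), (a == 1),
// and (a > 1). Each candidate conjunction guards a dummy branch;
// via constant folding and propagation an optimizing compiler can
// prove some conditions always false and flag the dead branches,
// identifying those conjunctions as inconsistent.
bool probe(int a) {
    bool any = false;
    if ((a == 0) && (a == 1)) any = true;   // inconsistent: provably false
    if ((a == 0) && !(a > 1)) any = true;   // consistent
    if (!(a == 0) && (a > 1)) any = true;   // consistent
    return any;
}
```

The warnings emitted while compiling such a function are then parsed to recover the set of inconsistent conjunctions.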

5.4 Monitor encoding

We describe seven ways of encoding automata as C++ monitors. Not all can be used with all automata directly, so we identify the transformations that need to be applied to an automaton before each encoding can be used.

The strategy in all encodings based on automata that are nondeterministic with respect to truth assignments (i.e., \(\mathcal{A}_{\mathrm{NFW}}{(\psi)}\) and minimal \(\mathcal{A}_{\mathrm{DFW}}^{\mathit{bdd}}(\psi)\)) is to construct the run \(P_0, P_1, \ldots\) of the monitor using two bit-vectors of size |Q|: current[] and next[]. Initially next[] is zeroed, and current[j] = 1 iff \(q_j \in Q_0\). Then, after sampling the state of the program, we set next[k] = 1 iff current[j] = 1 and there is a transition from \(q_j\) to \(q_k\) that is enabled by the current program state. When we are done updating next[], we assign it to current[], zero next[], and repeat the process at the next sample point. Intuitively, current[] keeps track of the set of automaton states that are reachable after seeing the execution trace so far, and next[] maintains the set of automaton states that are reachable after completing the current step of the automaton.
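The update described above can be sketched generically as follows; the Transition struct, the guard representation, and the reduction of the program state to proposition values are simplifications of ours, not CHIMP's internal representation:

```cpp
#include <functional>
#include <vector>

// One step of a monitor built from an automaton that is
// nondeterministic with respect to truth assignments (sketch).
// A transition's guard is a predicate over the sampled program
// state; here the program state is reduced to the truth values
// of the atomic propositions (an illustrative simplification).
struct Transition {
    int from, to;
    std::function<bool(const std::vector<bool>&)> guard;
};

// current[j] == 1 iff state q_j is reachable after the trace so far.
// Returns next[], with next[k] == 1 iff some enabled transition
// leads from a reachable state to q_k.
std::vector<char> step(const std::vector<char>& current,
                       const std::vector<Transition>& delta,
                       const std::vector<bool>& props) {
    std::vector<char> next(current.size(), 0);
    for (const Transition& t : delta)
        if (current[t.from] && t.guard(props))
            next[t.to] = 1;
    return next;
}
```

Repeatedly applying step and then swapping next into current realizes the run construction; an all-zero next[] signals a bad prefix.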

Notice that when the underlying automaton is deterministic with respect to truth assignments (i.e., \(\mathcal{A}_{\mathrm{DFW}}^{abr}(\psi)\)), after each step there is at most one reachable state. In that case it is inefficient to use a bit-vector encoding of the set of reachable states, because the set is guaranteed to be a singleton (or empty). Thus, when constructing monitors from deterministic automata, we use int current and int next to keep track of the run of the automaton. Initially, current = j iff \(q_j\) is the initial state. Then we set next = k iff the transition from \(q_j\) to \(q_k\) is enabled at the first sample point; since the automaton is deterministic, at most one transition is enabled. We continue in this fashion until the simulation ends or until none of the transitions in the monitor is enabled, indicating a bad prefix.

The details of the way we update current[] (respectively, current) and next[] (respectively, next) distinguish the different encodings. As a running example, we show how to construct a monitor for the property φ = G(p → (q ∨ X q ∨ XX q)). The first step is to use SPOT to construct an NBW that accepts all traces satisfying φ. Next, we use SPOT to construct \(\mathcal{A}_{\mathrm{NFW}}{(\varphi)}\), which is presented in Fig. 2.

Fig. 2

\(\mathcal{A}_{\mathrm{NFW}}{(\varphi)}\) constructed from the specification φ = G(p → (q ∨ X q ∨ XX q)) using the algorithm of d’Amorim and Roşu. Double circles represent accepting states, and state 2 is the initial state

5.4.1 Nondeterministic encodings

Two novel encodings, which we call front_nondet and back_nondet, expect that the automaton transitions are Boolean formulas, and do not assume determinism. Thus, front_nondet and back_nondet can be used with \(\mathcal{A}_{\mathrm{NFW}}{(\psi)}\) directly. They can also be used with \(\mathcal{A}_{\mathrm{DFW}}^{abr}(\psi)\) and \(\mathcal{A}_{\mathrm{DFW}}^{\mathit{bdd}}(\psi)\), once we convert back the transition labels from integers to Boolean formulas as follows. In \(\mathcal{A}_{\mathrm {DFW}}^{abr}{(\psi)}\), we calculate the assignment corresponding to each integer, and use that assignment to generate a conjunction of atomic formulas or their negations. In \(\mathcal{A}_{\mathrm{DFW}}^{\mathit{bdd}}(\psi)\) we relabel each transition with the Boolean function whose BDD is represented by the integer label.

The front_nondet encoding uses an explicit if to check whether each state s is reachable, i.e., whether current[s] is set. For each outgoing transition t from s it then uses a nested if, whose condition is a verbatim copy of the transition label of t, to determine whether the destination state of t becomes reachable. Listing 3 illustrates this encoding.

Listing 3

Illustrating front_nondet encoding of the automaton in Fig. 2
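To convey the shape of this encoding in a self-contained way, the following sketch applies front_nondet to a hypothetical two-state automaton recognizing good prefixes of G(p → X q); it is not the automaton of Fig. 2, and all names are ours:

```cpp
// Hypothetical automaton for G(p -> X q):
// state 0 (initial): on !p stay in 0, on p go to 1;
// state 1 ("q is due"): on q && !p go to 0, on q && p stay in 1.
// front_nondet: an outer if per reachable state, then a nested if
// per outgoing transition whose condition copies the label verbatim.
void front_nondet_step(const bool current[2], bool next[2],
                       bool p, bool q) {
    next[0] = next[1] = false;
    if (current[0]) {
        if (!p) next[0] = true;
        if (p)  next[1] = true;
    }
    if (current[1]) {
        if (q && !p) next[0] = true;
        if (q && p)  next[1] = true;
    }
}
```

An update that leaves both next[] entries false means no transition is enabled, i.e., a bad prefix has been detected.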

The back_nondet encoding uses a disjunction that represents all of the ways in which a state in next[] can be reached from the currently reachable states. Listing 4 illustrates this encoding.

Listing 4

Illustrating back_nondet encoding of the automaton in Fig. 2
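The shape of back_nondet can be sketched on a hypothetical two-state automaton for G(p → X q) (not the automaton of Fig. 2); each next[] entry is computed by a single disjunction over its incoming transitions, and all names are ours:

```cpp
// Hypothetical automaton: state 0 (initial) loops on !p and moves
// to state 1 on p; state 1 returns to 0 on q && !p and stays on
// q && p. back_nondet computes each next[] entry as one
// disjunction over all ways of reaching that state.
void back_nondet_step(const bool current[2], bool next[2],
                      bool p, bool q) {
    next[0] = (current[0] && !p) || (current[1] && q && !p);
    next[1] = (current[0] && p)  || (current[1] && q && p);
}
```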

5.4.2 Deterministic encodings

Three novel deterministic encodings, which we call front_det_switch, front_det_ifelse, and back_det, expect that the automaton has been determinized using assignment-based encoding. Thus, these three encodings can be used only with \(\mathcal{A}_{\mathrm{DFW}}^{abr}(\psi)\). Note that we work with \(\mathcal{A}_{\mathrm{DFW}}^{abr}(\psi)\) directly and do not convert the automaton alphabet from integers back to Boolean functions. Instead, at the beginning of each step of the automaton we use the state of the MUV (i.e., the values of all public and private variables, as exposed by the framework of [42]) to derive an assignment a to the atomic propositions in AP(ψ). We then calculate an integer representing the relevant model state mod_st=I(a), where a is the current assignment, and use mod_st to drive the automaton transitions.
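For two atomic propositions, computing I(a) amounts to packing the truth values into an integer; the sketch below assumes p occupies bit 0 and q bit 1, which is our own choice and need not match the encoding fixed in Table 1:

```cpp
// Sketch: compute mod_st = I(a) for the assignment a to the atomic
// propositions {p, q}. The bit order (p = bit 0, q = bit 1) is an
// assumption for illustration; the tool fixes its own encoding.
int mod_st_index(bool p, bool q) {
    return (p ? 1 : 0) | (q ? 2 : 0);
}
```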

Referring to the running example automaton presented in Fig. 2, we first show how to convert the Boolean expressions on the transitions to integers using assignment-based integer representation. Table 1 shows the integer encoding of all possible assignments of values to p and q. We then construct \(\mathcal{A}_{\mathrm{NFW}}^{abr}{(\varphi)}\) in Fig. 3. Determinizing and minimizing \(\mathcal{A}_{\mathrm{NFW}}^{abr}{(\varphi)}\) using BRICS Automaton produces \(\mathcal{A}_{\mathrm {DFW}}^{abr}{(\varphi)}\), which in this case is identical to \(\mathcal{A}_{\mathrm{NFW}}^{abr}{(\varphi)}\).

Fig. 3

\(\mathcal{A}_{\mathrm{NFW}}^{abr}{(\varphi)}\) for φ = G(p → (q ∨ X q ∨ XX q)). Determinizing \(\mathcal{A}_{\mathrm{NFW}}^{abr}{(\varphi)}\) using BRICS Automaton produces \(\mathcal{A}_{\mathrm{DFW}}^{abr}{(\varphi)}\), which is then minimized. The minimized \(\mathcal{A}_{\mathrm{DFW}}^{abr}{(\varphi)}\) in this case is identical to \(\mathcal{A}_{\mathrm{NFW}}^{abr}{(\varphi)}\)

Table 1 Assignment-based encoding for the transitions of the \(\mathcal{A}_{\mathrm{NFW}}{(\psi)}\) in Fig. 2

The back_det encoding is similar to back_nondet in that it encodes the automaton transitions as a disjunction of the conditions that allow a state in next[] to be enabled. The difference is that here we use an integer instead of a vector to keep track of the (at most one) state reachable in the current step of the automaton, and the transitions are driven by mod_st instead of by Boolean functions. See Listing 5 for an illustration of this encoding.

Listing 5

Illustrating back_det encoding of the automaton in Fig. 3
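A sketch of one back_det step for a hypothetical deterministic automaton for G(p → X q) (not the automaton of Fig. 3), assuming the letter encoding 0: !p∧!q, 1: p∧!q, 2: !p∧q, 3: p∧q, which is our choice for illustration:

```cpp
// back_det sketch: an int tracks the (at most one) reachable
// state; -1 signals that no transition is enabled (bad prefix).
// Hypothetical automaton: state 0 stays on !p (letters 0, 2) and
// moves to 1 on p (letters 1, 3); state 1 requires q, returning
// to 0 on letter 2 and staying on letter 3.
int back_det_step(int current, int mod_st) {
    int next = -1;
    if ((current == 0 && (mod_st == 0 || mod_st == 2)) ||
        (current == 1 && mod_st == 2))
        next = 0;
    else if ((current == 0 && (mod_st == 1 || mod_st == 3)) ||
             (current == 1 && mod_st == 3))
        next = 1;
    return next;
}
```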

The front_det_switch and front_det_ifelse encodings are similar, but differ in the C++ constructs used to take advantage of the determinism in the automaton. Applying front_det_switch encoding to the automaton in Fig. 3 is illustrated in Listing 6, and front_det_ifelse encoding is illustrated in Listing 7.

Listing 6

Illustrating front_det_switch encoding of the automaton in Fig. 3

Listing 7

Illustrating front_det_ifelse encoding of the automaton in Fig. 3
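The flavor of front_det_switch can be sketched on a hypothetical deterministic automaton for G(p → X q) (not the automaton of Fig. 3), with the assumed letter encoding 0: !p∧!q, 1: p∧!q, 2: !p∧q, 3: p∧q; front_det_ifelse replaces the switch statements with chains of if/else:

```cpp
// front_det_switch sketch: an outer switch on the current state,
// then a switch on mod_st selects the unique enabled transition.
// Returns -1 when no transition is enabled (bad prefix).
int front_det_switch_step(int current, int mod_st) {
    switch (current) {
    case 0:
        switch (mod_st) {
        case 0: case 2: return 0;   // !p: no obligation incurred
        case 1: case 3: return 1;   // p: q must hold at next step
        }
        break;
    case 1:
        switch (mod_st) {
        case 2: return 0;           // q && !p
        case 3: return 1;           // q && p
        }
        break;
    }
    return -1;
}
```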

5.5 Deterministic table-based encodings

In the encodings discussed above, the transition function of the automaton is encoded using if or switch statements. The final two encodings, described below, are based on table look-up: we create a table such that, given the current state and the system state index mod_st, the next state can be found in a single look-up operation, avoiding the overhead associated with large nested if or switch statements.
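The look-up itself is a single array indexing operation; the row-major layout and the -1 sentinel in the sketch below are our assumptions for illustration:

```cpp
// Sketch: one monitor step via table look-up. The table is stored
// row-major: row = current state, column = mod_st. The sentinel -1
// marks "no transition" (bad prefix detected).
int table_step(const int* table, int alphabet_size,
               int current, int mod_st) {
    return table[current * alphabet_size + mod_st];
}
```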

We illustrate both table-based encodings using the determinized, minimized automaton \(\mathcal{A}_{\mathrm{DFW}}^{abr}{(\varphi)}\) presented in Fig. 3 and the associated integer encodings of all possible assignments of values to p and q in Table 1. We can construct a look-up table as illustrated in Table 2.

Table 2 Look-up table corresponding to the automaton in Fig. 3. Given the current state and the integer representation of the alphabet, find the next state or detect failure. For example, if the current state is 0 and the alphabet representation is 3, the next state is state 0

Two novel deterministic encodings, which we call front_det_file_table and front_det_memory_table, expect that the automaton has been determinized using assignment-based encoding. Like the other deterministic encodings, these two encodings can be used only with \(\mathcal{A}_{\mathrm{DFW}}^{abr}(\psi)\). Again, we work with \(\mathcal{A}_{\mathrm{DFW}}^{abr}(\psi)\) directly and do not convert the automaton alphabet from integers back to Boolean functions; for table encodings we take advantage of the fact that the automaton alphabet integers can be stored easily in a state-transition look-up table.

As we did for the encodings of Sect. 5.4.2, we use the state of the MUV (i.e., the values of all public and private variables, as exposed by the framework of [42]) at the beginning of each step of the automaton to derive an assignment a to the atomic propositions in AP(ψ). Again, we calculate an integer representing the relevant model state mod_st=I(a), where a is the current assignment.

The encodings front_det_file_table and front_det_memory_table are similar, but differ in the way the state-transition look-up table is stored and used by the monitor.

5.5.1 The file-based table encoding

In the front_det_file_table encoding we store the automaton \(\mathcal{A}_{\mathrm{DFW}}^{abr}{(\varphi)}\) in a text file using the LBT format [38]. Briefly, the LBT format is a text-based encoding of automata: each state is described in turn (whether it is accepting, initial, both, or neither), together with a unique state ID, and each of its outgoing transitions is listed immediately after the state description, giving the destination state ID and a transition letter/guard.

When the monitor is instantiated, it uses an LBT parser that is automatically included with the monitor’s code to parse the automaton from the file and to construct the look-up table. Applying front_det_file_table encoding to the automaton in Fig. 3 is illustrated in Listing 8.

Listing 8

Illustrating front_det_file_table encoding of the automaton in Fig. 3

One advantage of using the file-based table encoding is that it separates the code implementing the monitor from the definition of the automaton. Such decoupling allows the monitor to be compiled and linked with the MUV before the LBT representation of the automaton is even created. It further allows monitored properties to be changed on the fly, without having to recompile the MUV, by simply replacing the contents of the LBT file.

5.5.2 The memory table encoding

In the front_det_memory_table encoding we declare the state-transition look-up table explicitly in the monitor’s constructor. The table is declared directly as a one-dimensional, row-major array, forgoing the need for an LBT parser library or a file containing the automaton. Applying the front_det_memory_table encoding to the automaton in Fig. 3 is illustrated in Listing 9.

Listing 9

Illustrating front_det_memory_table encoding of the automaton in Fig. 3

Note that in this encoding the variable table is a class member of type int[] with the same capacity as the number of elements in local_table. The C++ standard in use at the time (C++03) does not allow a member array to be initialized from another array after declaration, so we use local_table to hold the initial values and std::memcpy to copy them from local_table into table.
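The initialization pattern just described can be sketched as follows; the class name, table contents, and dimensions are illustrative:

```cpp
#include <cstring>

// Sketch of the front_det_memory_table initialization pattern:
// pre-C++11, a member array cannot be initialized from another
// array after declaration, so a local initializer array is copied
// into the member with std::memcpy. Contents are illustrative.
struct Monitor {
    static const int NSTATES = 2, NLETTERS = 4;
    int table[NSTATES * NLETTERS];
    Monitor() {
        const int local_table[NSTATES * NLETTERS] = {
             0,  1, 0, 1,   // row for state 0
            -1, -1, 0, 1    // row for state 1 (-1: bad prefix)
        };
        std::memcpy(table, local_table, sizeof(table));
    }
};
```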

5.6 Workflow space

The different options allow 33 possible combinations for generating a monitor, summarized in Table 3. The first decision is whether state minimization is required. If it is not required, one of the three alphabet minimization options is applied, and one of the two non-deterministic monitor encodings (front_nondet or back_nondet) is used to create the final monitor.

Table 3 The workflow space for generating monitors

When using state minimization it is necessary to select the alphabet representation (BDD- or assignment-based) to be used during minimization. The three alphabet minimization options can be selected independently of the alphabet representation. Recall that BDD-based minimization produces automata that are non-deterministic with respect to assignments; therefore only the two non-deterministic monitor encodings are available. Alternatively, if assignment-based minimization is employed, all non-deterministic and deterministic encodings (seven total) can be used.

In summary, there are six workflows that require no state minimization, six workflows that use BDD-based state minimization, and 21 workflows that use assignment-based state minimization.

6 Experimental setup

6.1 SystemC model

Our experimental evaluation is based on the AdderFootnote 6 model presented in [41]. The Adder implements a squaring function by repeated incrementing by 1. We used the Adder to calculate 100² with 1,000 instances of a monitor for the same property. Since we are mostly concerned with monitor overhead, we focus on the time difference between executing the model with and without monitoring. We established a baseline for the model’s runtime by compiling the Adder model with a clean installation of SystemC (i.e., without the monitoring framework of [41]) and averaging the runtime over 10 executions. To calculate the monitor overhead we averaged the runtime of each simulation over 10 executions and subtracted the baseline time. Notice that the overhead thus calculated includes the cost of the monitoring framework itself as well as the slow-down due to all 1,000 monitors.

6.2 Properties

We used specifications constructed using both pattern formulas and randomly generated formulas. We used LTL formulas, as we have access to explicit-state LTL-to-automata translators (SPOT, in our case). Note, however, that the framework is applicable to any specification language that produces NBWs and is not restricted to LTL formulas. Minimization of finite-state automata was performed by BRICS Automaton. SPOT, BRICS Automaton, and CHIMP, the tool that manages the different workflows, are available for download.Footnote 7

We adopted the pattern formulas used in [21] and presented below:

In addition to these formulas we also used bounded F and bounded G formulas, and a new type of nested U formulas, presented below:

In our experiments we replaced the generic propositions \(p_i\) in each pattern formula with atomic formulas (a == 100² − 100(n−i−1)), where a is a variable representing the running total in the Adder. For each pattern we scaled up the formulas until all 33 workflows either timed out or crashed. Most workflows can be scaled up to n = 5, except for the bounded properties, which can be scaled to n = 17. We identified 127 pattern formulas for which at least one workflow could complete the monitoring task.

The random formulas that we used were generated following the framework of [15], using the implementation from [39]. For each formula length there are two parameters that control the number of propositions used and the probability of selecting a U or a V operator (formula length is calculated by adding the number of atomic propositions, the number of logical connectives, and the number of temporal operators). We varied the number of atomic propositions between 1 and 5, the probability of selecting a U or a V was one of {0.3, 0.5, 0.7, 0.95}, and we varied the formula length from 5 to 30 in increments of 5. We used the same style of atomic propositions as in the pattern formulas. For each combination of parameters we generated 10 formulas at random, giving us a total of 1,200 random formulas.

7 Results for non-table-based workflows

The results described in this section are based on experiments on Ada, Rice’s Cray XD1 compute cluster.Footnote 8 Each of Ada’s nodes has two dual core 2.2 GHz AMD Opteron 275 CPUs and 8 GB of RAM. We ran with exclusive access to a node so all 8 GB of RAM were available for use. We allowed 8 hours (the maximal job time on Ada) of computation time per workflow per formula for generating a Büchi automaton, automata-theoretic transformations, generating C++ code, compilation, linking with the Adder model using the monitoring framework presented in [41], and executing the monitored model 10 times.

We first evaluate the individual effect of each optimization. For each formula we partition the workflow space into two groups: those workflows that use the optimization and those that do not. We form the Cartesian product of the overhead times from both groups and present them on a scatter plot.

7.1 State minimization

Figure 4 shows the effect of determinization and state minimization on the automaton size. We observe that in most cases minimizing the automata (i.e., minimizing \(\mathcal{A}_{\mathrm{DFW}}^{abr}{(\varphi)}\) and \(\mathcal{A}_{\mathrm{DFW}}^{\mathit{bdd}}{(\varphi)}\)) produces smaller automata than the equivalent \(\mathcal{A}_{\mathrm{NFW}}{(\varphi)}\). It is known [28] that in the worst case nondeterministic automata are exponentially more succinct than the corresponding minimal deterministic automata. Our experimental results show that the worst-case blow-up is avoided for the types of formulas that are likely to be used in practice; in fact, for some formulas the deterministic automata are three orders of magnitude smaller. This observation goes against the traditional justification for constructing monitors from nondeterministic rather than deterministic automata.

Fig. 4

The size of the determinized/minimized automaton in most cases is smaller than the size of the corresponding nondeterministic automaton. Points fall above the diagonal when this is the case

In Fig. 5 we show the effect of state minimization on the runtime overhead. A few outliers notwithstanding, using state minimization lowers the runtime overhead of the monitor.

Fig. 5

Monitor overhead with and without state minimization. State minimization lowers the overhead by orders of magnitude. Points fall above the diagonal when monitor overhead with state minimization is lower

7.2 Alphabet representation

Figure 6 shows that using assignments leads to better performance than BDD-based alphabet representation. Our data show that in most cases, using assignments leads to smaller automata, which again suggests a connection between monitor size and monitor efficiency.

Fig. 6

Using assignments for alphabet representation leads to better performance than using BDDs. Points fall above the diagonal when assignment-based representation is better

7.3 Alphabet minimization

Our data show that partial and full alphabet minimization typically slow down the monitor (see Fig. 7). We believe the reasons are twofold. On the one hand, gcc performs poorly as a decision engine for discovering mutually exclusive conjunctions: in our experiments it was able to discover only 10–15% of the possible mutually exclusive conjunctions. On the other hand, augmenting the formula increases the formula size, but SPOT does not take advantage of the extra information in the formula and typically generates bigger Büchi automata. When we manually augment the formula with all mutually exclusive conjunctions we do see smaller Büchi automata, so we believe this optimization warrants further investigation.

Fig. 7

Effect of alphabet minimization on monitor overhead. Points fall below the diagonal when alphabet minimization results in lower overhead. We do not see a significant advantage to using alphabet minimization, but this may be due to the particular tool chain that we used

7.4 Monitor encoding

Finally, we compared the effect of the different monitor encodings (Fig. 8). Our conclusion is that no encoding dominates the others, but two (front_nondet and front_det_switch) show the best performance relative to all others, while back_det has the worst performance. Comparing front_nondet and front_det_switch directly to each other (Fig. 9) indicates that front_det_switch delivers better performance for all but a few formulas.

Fig. 8

Comparison of the monitor overhead when using different encodings. Each subplot shows the performance when using one of the encodings (x-axis) vs. all other encodings (y-axis). Points fall above the diagonal when the featured encoding results in lower overhead

Fig. 9

Comparison of the monitor overhead when using the two best encodings (front_det_switch and front_nondet). Points fall above the diagonal when we see better performance using front_det_switch, which is the case for all but a few formulas

7.5 Best non-table-based workflow

As a final check of our conclusion, in Fig. 10 we plot the performance of the winning workflow against all other workflows. There are a few outliers, but overall the winning workflow outperforms all others.

Fig. 10

Best overall performance for non-table-based workflows. Points fall above the diagonal when the front_det_switch encoding, with minimization, assignment-based alphabet representation, and without alphabet minimization results in lower monitor overhead

Based on the comparison of individual optimizations we conclude that front_det_switch encoding with assignment-based state minimization and no alphabet minimization is the best overall workflow.

8 Results for table-based workflows

Soon after we completed the experiments described in Sect. 7, the compute cluster Ada was decommissioned, preventing us from evaluating the table-based encodings on the same hardware. To make an objective comparison between the different encodings, we re-ran all original experiments, together with the new experiments involving the table-based encodings, on the Shared University Grid at Rice (SUG@R), Rice’s Intel Xeon compute cluster.Footnote 9 Each of SUG@R’s 134 SunFire X4150 nodes has two quad-core Intel Xeon processors running at 2.83 GHz and 16 GB of RAM per node. SUG@R runs Red Hat Enterprise Linux 5 with a 2.6.18 kernel. We ran with exclusive access to a node, so all 16 GB of RAM were available for use. As before, we allowed 8 hours of computation time per workflow per formula for generating Büchi automata, automata-theoretic transformations, generating C++ code, compilation, linking with the Adder model using the monitoring framework presented in [41], and executing the monitored model 10 times.

First we confirmed that the conclusions based on the initial experiments on Ada remain valid when applied to the experimental results obtained on SUG@R. For example, we compared the performance of the winning workflow identified in Sect. 7 against the performance of the 27 non-table workflows. The results are presented in Fig. 11. We observe that the front_det_switch encoding with assignment-based state minimization and no alphabet minimization dominates the other non-table-based workflows on SUG@R, thus validating our earlier conclusion.

Fig. 11

Comparison of the monitor overhead when using the front_det_switch encoding with assignment-based state minimization and no alphabet minimization (x-axis) vs. all other encodings (y-axis). Points fall above the diagonal when the winning workflow identified on Ada also dominates the (non-table-based) workflows when executing on SUG@R

Next we consider the performance of the two table-based workflows. Each was run on the same set of formulas as the other workflows. First we show the runtime overhead when using the file-based table encoding, compared to the overhead of all non-table-based encodings (Fig. 12). Although for some formulas the file-based table encoding shows significantly smaller overhead, for others it shows much larger overhead. Our interpretation of this data is that the cost of accessing the disk to read the file containing the automaton incurs an overhead that cannot be offset by the workflow’s runtime performance.

Fig. 12

Comparison of the monitor overhead when using the front_det_file_table encoding (x-axis) vs. all other encodings (y-axis). Points fall above the diagonal when the front_det_file_table encoding results in lower monitor overhead

We evaluate the performance of the memory-based table encoding in a similar manner (Fig. 13). We see that avoiding disk access improves performance significantly over the file-based table encoding. This observation is confirmed by a direct comparison of the file-based and memory-based table encodings (Fig. 14). For all formulas evaluated by the two workflows, the memory-based encoding is at least as fast as, and in most cases significantly faster than, the file-based encoding.

Fig. 13

Comparison of the monitor overhead when using the front_det_memory_table encoding (x-axis) vs. all other encodings (y-axis). Points fall above the diagonal when the front_det_memory_table encoding results in lower monitor overhead

Fig. 14

Comparison of the monitor overhead when using the front_det_memory_table encoding (x-axis) vs. the front_det_file_table (y-axis). Points fall above the diagonal when the memory-based table encoding results in lower overhead

These data indicate that the memory-based table encoding is very competitive, but it is not clear whether it outperforms the winning workflow identified in Sect. 7. A direct comparison of the runtime overhead is presented in Fig. 15. Our conclusion is that for the majority of formulas the runtime overhead of the winning workflow identified earlier is smaller. Thus, the front_det_switch encoding with assignment-based state minimization and no alphabet minimization remains the best overall workflow that we have evaluated.

Fig. 15

Comparison of the monitor overhead when using the front_det_memory_table encoding (x-axis) vs. the best encoding identified earlier (y-axis). The latter shows better runtime overhead, indicated by points falling above the diagonal. Although there are more than 900 data points in this figure, most of them lie on top of each other

9 Discussion and future work

In this paper we focus on minimizing monitor runtime. We identify an exploration space consisting of monitor encodings, alphabet encodings, transition representations, and other possible optimizations. We use off-the-shelf components (SPOT, BRICS Automaton, gcc) to perform some of the transformations, and a custom tool (CHIMP) to manage the different workflows. Together with the specification formalism proposed in [42] and the monitoring framework described in [41], this work provides a general ABV solution for temporal monitoring of SystemC models. Since the starting point is \(\mathcal{A}_{\mathrm{NBW}}(\psi)\), the techniques presented here are easy to integrate with a wide variety of specification languages; for example, by applying [9] we can extend the scope of this work to efficient monitoring of PSL properties. We have identified a workflow that generates low-overhead monitors, and we believe that it can serve as a good default setting.

Although the two table-based workflows have higher runtime overhead, they offer other important advantages. Both table-based workflows allow us to reduce the size of the monitor from hundreds of thousands of lines of code in some cases, to hundreds of lines of code. This avoids compilation problems and reduces the compilation time significantly. Another advantage of using the file-based table encoding is the flexibility to change the monitored properties without recompiling the MUV. The focus of this paper is on runtime overhead and exploring these issues is beyond its scope, but we believe that they are worthy of further consideration.

Practical use of our tool may involve monitoring tasks that differ from the synthetic load used in our tests. Recent developments in the area of self-tuning systems show that even highly optimized tools can be improved by orders of magnitude using search techniques over the workflow space (cf. [27]). One possible extension of our work is to apply different optimizations to different types of formulas. For example, our data show that when the minimized automaton (\(\mathcal{A}_{\mathrm{DFW}}^{\mathit{bdd}}(\psi)\) or \(\mathcal{A}_{\mathrm{DFW}}^{abr}(\psi)\)) has more states than the unminimized automaton (\(\mathcal{A}_{\mathrm{NFW}}(\psi)\)), generating a monitor from \(\mathcal{A}_{\mathrm{NFW}}(\psi)\) leads to smaller runtime overhead. This observation can be used as a heuristic, and further investigation may reveal that different classes of formulas are best served by different workflows. For this reason, we have left the user full control over the tool’s workflow.