1 Introduction

Software is in every part of our life. It is software that wakes us up in the morning, makes us coffee, and tells us the early morning news. It is software that drives us to our workplace, controls the traffic lights, and moves the elevator. It is software that flies planes and keeps nuclear reactors under control. These software components are becoming more and more complex and have to be maintained and updated. Testing is therefore crucial. In academia and industry, many people are working on new technologies that reduce both the number of bugs in software and the cost of finding them.

Automating the test process comprises multiple facets, ordered here by increasing complexity: (a) executing tests, (b) generating empty test classes and methods, (c) generating test cases, and (d) generating test data. Many tools automate test execution, for example the JUnit framework. Most state-of-the-art integrated development environments (IDEs), such as Microsoft Visual Studio and Eclipse, include tools that automatically generate empty test classes and methods. Current research efforts focus on automatically generating test cases and test data. The former automates the search for method sequences that may reveal an error. The latter automates the generation of primitive values and, especially, non-primitive objects that can be used in test cases.

This survey gives an overview of available commercial and academic tools with respect to their test data generation capabilities. To that end, we compiled a list of test generation tools and filtered them with respect to their availability, maturity, and activity. The remaining seven tools, i.e., AgitarOne, CodePro AnalytiX, AutoTest, C++test, Jtest, RANDOOP, and PEX, are challenged with a total of 31 benchmark tests: 24 benchmark tests probe the tools' capabilities to generate primitive values; 7 benchmark tests show how well they perform on non-primitive types and complex specifications. The information collected in this survey reflects the status in 2011.

This survey continues as follows: Sect. 2 introduces the criteria for tool selection and evaluation. Thereafter, Sect. 3 presents the results of evaluating the tools; each tool is briefly introduced and its evaluation results are discussed. Related work is covered in Sect. 4. The survey concludes in Sect. 5.

2 Evaluation procedure

AgitarOne, CodePro AnalytiX, AutoTest, C++test, Jtest, RANDOOP and PEX are the seven tools that satisfy all criteria to be part of this survey. Section 2.1 (a) shows a classification of all candidate tools, and (b) introduces the selection criteria availability, maturity and activity. Furthermore, Sect. 2.2 (a) introduces the evaluation criteria, and (b) describes the evaluation procedure.

2.1 Candidate tools

Figure 1 maps all relevant tools for automatic test generation. The tools are categorized along two dimensions:

Fig. 1 Classification of evaluated tools

  1. source code required/present

  2. specification usage

On the one hand, we distinguish tools with respect to their access to source code. On the other hand, we distinguish between tools that use no specification, tools that use the specification as test oracle only, and tools that use the specification both as test oracle and for steering test input generation.

Figure 1 clusters the tools with respect to the well-known terminology [1, p. 21] of black-box, white-box, and gray-box testing. Black-box tests are derived from external descriptions of the software, e.g., specifications. White-box tests are derived from source code internals, e.g., branch conditions. The term gray-box testing is used for test generation approaches that use both source code internals and external descriptions of the software.

This survey focuses on state-of-the-art test data generating tools. To ensure the quality of this survey, we further filter the candidate list. First, only white-box or gray-box testing tools are considered. Second, the remaining tools are rated with respect to availability, maturity, and activity.

  • Availability Tools have to be publicly available, either as a free download or as a commercial tool.

  • Maturity Only tools that have already been applied to industrial size applications are considered. We therefore rate all tools from 1 to 4:

    1. commercial tool

    2. applied to (at least one) industrial size case study

    3. applied to (at least one) case study

    4. no information about case studies available

  • Activity Tools have to be maintained. In other words, only tools updated within the last 3 years (i.e., since 2009) are considered.

  • Citation The number of (scientific) publications that reference the tool. Figures were extracted from Google Scholar in December 2011. The delta value in brackets shows the number of additional citations since October 2010.

Table 1 lists all white- and gray-box testing tools and summarizes the rating with respect to the introduced classification criteria. AgitarOne, CodePro AnalytiX, AutoTest, C++test, Jtest, RANDOOP, and PEX satisfy the criteria and are therefore part of the evaluation for this survey presented in Sect. 3. They are highlighted in the table.

Table 1 Candidate list

2.2 Evaluation criteria

We aim for a uniform comparison and evaluation of the state-of-the-art test data generation tools. Therefore, in Sect. 3 each tool is briefly introduced, its test data generation technique is explained in detail, and finally the evaluation result is presented and discussed.

The short introduction of the tool is summarized in a table that includes information on input and output of the tool, supported programming and specification languages, licensing issues and the user pace.

Additionally, we summarize in a table the particular test data generation techniques incorporated by the tool. This summary includes the following attributes:

  • Approach primitive types What approaches are used to generate values for Boolean, byte, character and integer types (e.g., random, constraint solver)?

  • Approach non-primitive types What approaches are used to generate instances for all object types?

  • Specification usage In what sense does the approach use a given specification (e.g., as oracle only, to steer input data generation)?

  • Specification dependencies Is the approach able to deal with specifications where one parameter value depends on another, e.g., \(param1 > param2.size()\)?

  • Quantifiers Can the approach deal with quantifiers in the specification?

  • Object pooling Does the tool store already instantiated objects for later reuse?

  • Manual objects Is it possible to add manually constructed objects to the tool?

  • Special values Which hard-coded values does the tool use (e.g., min and max value of a data type)?

The actual evaluation of the selected tools is based on analysing the automatically generated tests for the given benchmarks. To find the limitations of each tool, we designed a structured set of benchmark tests. The benchmark tests are based on the different problem divisions [44] of the annual SMT solver competition (SMT-COMP): (a) integer difference logic, (b) real difference logic, (c) linear integer arithmetic, (d) linear real arithmetic, (e) non-linear integer arithmetic, and (f) (quantified) arrays.

In addition, we added non-linear real arithmetic and similar expressions for Boolean, character and string types. Furthermore, we added the following tests that explicitly exercise specification dependencies with respect to parameters:

  • Dependencies between parameters The value of one parameter depends on an attribute of another parameter.

  • Requested null objects Explicitly requesting a null parameter.

  • Object type parameters Explicitly requesting an object in a given state.

  • Triangle example [32] The specification for a scalene triangle.

Tables 2 and 3 show the specifications used for each of the benchmark tests. The evaluation should determine which tools are able to generate data that satisfies a given specification. Therefore, we implemented for each benchmark test a method that requires input values satisfying the corresponding specification. We stated the requirement either as a precondition or as an assertion in the first line of the benchmark method. Tools that are able to execute the return statement of the benchmark method are able to call it with input data that satisfies the benchmark test's specification. An example benchmark method implementation is presented for each tool.

Table 2 Type support tests
Table 3 Structural tests

Note that throughout the paper we use logic notation for all given specifications, i.e., the single assignment character represents equality. Furthermore, we had to require non-null objects and non-empty string instances for object and string types, respectively, to avoid null-pointer exceptions while evaluating the Design by Contract™ specifications or Java assertion statements.
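
As an illustration, a minimal Java sketch of such a benchmark method could look as follows (the class name, method name and constants are hypothetical; the specification is stated as an assertion in the first line):

    // Hypothetical benchmark method: a tool passes this benchmark if it
    // generates an argument that satisfies the assertion and thereby
    // reaches the return statement.
    public class BenchmarkSketch {
        public static boolean intLinear(int param) {
            assert param == 3 + 5;  // specification: param = 3 + 5
            return true;            // reached only with satisfying input
        }
    }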

3 Evaluation

AgitarOne, CodePro AnalytiX, AutoTest, C++test, Jtest, RANDOOP and PEX meet all filter criteria and were therefore evaluated. The presentation order of the evaluation results is determined by the time of publication of the original paper or the launch of the tool.

3.1 Jtest

3.1.1 General information

Jtest is a comprehensive testing product of Parasoft [38] for Java, first introduced in 1997. It supports development teams both in building new Java applications and in improving the quality of legacy ones. Jtest facilitates static analysis, runtime analysis, code review process automation and unit testing. Static analysis includes coding standard checks, data flow analysis and common well-known coding style rules. Runtime analysis mainly provides different kinds of coverage information and detects race conditions and security attack vulnerabilities. The code review process is supported through notification, documentation and tracking functionalities. In this evaluation we focus on the unit testing support of Jtest.

Jtest supports automatic generation of JUnit tests and automatic generation of regression tests. It can be used with and without Design by Contract™ specifications. In the former case, the method's postconditions are used as oracle, and only tests that satisfy the precondition are exported to JUnit test files. In the latter case, Jtest uses thrown exceptions and assertion errors as oracle. For regression tests, the initial test execution run determines the expected return values, which are recorded and used in further execution runs. Jtest incorporates a powerful test data generation engine, whose features are discussed in detail in Sect. 3.1.2. Furthermore, Jtest includes a tracing facility, which can be used to capture and replay the interaction of Java applications with remote clients and servers.

Table 4 summarizes Jtest's general information. Jtest does not depend on the presence of Design by Contract™ specifications, but they improve test quality and reduce test effort if present. It is a commercial tool with a 14-day evaluation license and comes with extensive documentation.

Table 4 Jtest: general information

3.1.2 Data generation approach

Jtest mainly operates on an object repository. This repository is pre-populated with test data for all primitive types: e.g., the minimum and maximum value, 0, \(-1\), and \(+1\) are instances of the value pool for the integer type. The pool can also be populated manually with values.

Jtest uses those values and tries possible combinations for a given method under test. In addition, Jtest includes more sophisticated value generation strategies for primitive as well as non-primitive data types. It is not documented which techniques are used for generating primitive values that satisfy a given precondition, but our evaluation showed that Jtest was able to generate values that were not initially in the pool.

Furthermore, Jtest automatically generates stub objects. A stub is an object that overrides the real object's implementation and returns only hard-coded values [21]. Jtest stubs are able to return different values each time a method is called.
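
To illustrate the concept (this is not Jtest's generated code), a hand-written stub in Java might look like the following sketch; the interface and the returned values are hypothetical:

    // Hypothetical collaborator used by a method under test.
    interface PriceService {
        double lookupPrice(String article);
    }

    // A minimal hand-written stub: it ignores any real implementation and
    // returns hard-coded values, varying them per call as Jtest-generated
    // stubs can do.
    class PriceServiceStub implements PriceService {
        private final double[] cannedValues = {0.0, 9.99};
        private int calls = 0;

        @Override
        public double lookupPrice(String article) {
            return cannedValues[calls++ % cannedValues.length];
        }
    }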

If a value combination satisfies the precondition of the method under test, the test is executed, the result is recorded and a JUnit test method is exported. The postcondition is evaluated and violations are reported.

Table 5 summarizes Jtest’s different strategies for test data generation.

Table 5 Jtest: test data generation details

3.1.3 Evaluation

Figure 2 shows an example implementation of a benchmark method. For the evaluation we executed the Jtest command “Generate Unit Tests” followed by “Run Unit Tests (Report All Severities)”. The result is a set of JUnit tests and a report containing coverage information.

Fig. 2 Jtest: example evaluation criteria method. Jtest was only able to generate valid tests after adding the f to all float constants
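
The original figure is not reproduced here; a hedged sketch of what such a float benchmark method might look like is given below (the constants are illustrative). Note the f suffix on the constants, without which Jtest did not generate valid tests:

    // Hypothetical float benchmark: without the trailing 'f' the constant
    // expression is evaluated as double, and Jtest failed to produce
    // valid input in that case.
    public class FloatBenchmarkSketch {
        public static boolean floatLinear(float param) {
            assert param == 3.0f + 5.0f;  // specification: param = 3.0 + 5.0
            return true;
        }
    }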

Table 6 summarizes the results of the evaluation on the set of test data benchmark problems. Jtest managed to generate valid input data for all benchmarks. We therefore conclude that Jtest incorporates more sophisticated techniques than those mentioned in the official documentation.

Table 6 Jtest: results of data type benchmark tests

Table 7 shows the results of the structural benchmark tests. Jtest is able to generate values even for specifications in which one variable is constrained by the value of another variable. The approach does not work for more complex dependencies, as imposed by the scalene triangle specification, or for quantifiers.

Table 7 Jtest: results of structural tests

3.2 C++test

3.2.1 General information

Parasoft's C++test [37] is a commercial software quality improvement tool for C/C++, introduced in 1999. It comes as a stand-alone IDE or as an Eclipse plugin. C++test supports coding standard checks, static analysis, and runtime analysis, and automates code review and unit test generation. C++test incorporates best practice rules such as those proposed by Meyers [30]. C++test can be seen as the little brother of Jtest: both have very similar feature lists, but due to advanced technologies provided by Java, e.g., Java reflection, Jtest implements more sophisticated data generation techniques than C++test.

Table 8 summarizes general information about C++test. C++test does not support any kind of specification. It is best used for generating regression tests, which detect changes in the behavior of the system under test over time.

Table 8 C++test: general information

3.2.2 Test data generation

C++test does not support any form of specification except simple assertion statements. Therefore, all generated combinations of test input data are exported as unit tests. Tools that support Design by Contract™ specifications, in contrast, can already discard tests whose input values violate the precondition of the method under test.

Based on the evaluation we can identify two different test data generation strategies: one for primitive and one for non-primitive types. For primitive types, C++test randomly selects a value from

  • a pre-defined pool of values, such as the minimum and maximum value, \(-1\), \(+1\) and 0 for integer types, and a string value that has more than 256 characters,

  • the path of the file containing the method under test,

  • the method under test's signature,

  • constant values given within the method under test's body.

A non-primitive value is always constructed by means of a constructor call. If a method requires multiple parameters, random combinations are generated. Manual inspection of the generated tests shows that not all combinations are generated; it is not documented which combinations are chosen.

C++test does not explicitly use any form of object pool, but it is able to use manually written factory methods. These methods allow the user to establish a repository of valid object instances that are used in automatically generated tests. Furthermore, C++test supports both user-defined and automatically generated stubs. It only generates stubs if no user-defined version is available. If it is not able to generate a complete stub definition, it generates a template which has to be completed manually.

The summary of C++test data generation techniques is given in Table 9.

Table 9 C++test: test data generation details

3.2.3 Evaluation

Since C++test does not support any Design by Contract™ specification, we implemented all benchmark problems by means of assertion statements in the first line, as can be seen in Fig. 3. Note that we could not evaluate the string linear benchmark test due to the missing native regular expression support in C++.

Fig. 3 C++test: example evaluation criteria method

For each of those benchmark methods we let C++test generate unit tests, following Parasoft's recommendation for automatically generating and executing unit tests [37]:

  1. Generate test cases

  2. Generate stubs

  3. Build test executable

  4. Execute test cases

C++test generated more than 200 tests. Most of them triggered the assertion statement and were therefore classified as meaningless, but some parameter combinations successfully passed the assertions. Tables 10 and 11 summarize the evaluation results.

Table 10 C++test: results of data type benchmark tests
Table 11 C++test: results of structural tests

For the character benchmark tests, C++test used the integer code of the character given in the assertion (e.g., the character b is represented by the integer value 98 in ASCII) and the off-by-one values (i.e., 97 and 99). Furthermore, it created two tests that passed the maximum and minimum character value, respectively, to the method under test. These two simple techniques for data generation enabled C++test to pass all character benchmarks.

C++test generated nine tests for the integer constant benchmark. Besides the special predefined values already mentioned in the general description (0, \(\pm 1\), max/min value), C++test uses the values within the assertion along with their off-by-one values. Furthermore, C++test uses the results of mathematical expressions present in the source code. For example, C++test generated tests that pass 8 (\(3+5\) taken from the linear integer specification) and 15 (\(3\,*\,5\) taken from the non-linear integer specification) to the method under test.

The predefined values for float and double types include, in addition to the already mentioned values from the integer domain, the minimum negative and positive values. However, the resulting test cases do not include any values slightly modified by means of mathematical operations, such as the off-by-one values used for integer and character types.

For non-primitive types C++test always chooses a constructor. Therefore, C++test is not able to generate test input that is required to be NULL. Furthermore, C++test does not modify the object any further, i.e., no other methods are called after the constructor to change the object's state. This conclusion is based on our empirical evaluation of C++test; the C++test User's Guide [37] does not say anything about the object creation strategy.

C++test also failed on the array and quantifier benchmark tests. For the array it passed null, which did not satisfy the specification. C++test has the technologies at hand to pass the quantifier benchmark tests, but unfortunately it did not use the required combination of parameter values.

3.3 AgitarOne

3.3.1 General information

In 2004 Agitar Technologies released AgitarOne, a commercial tool based on academic research results. AgitarOne can be used as a standalone IDE, as an Eclipse add-on, or from the command line. It includes automated JUnit test generation, code rule enforcement technologies and a management dashboard. Furthermore, it suggests assertions that it derived while dynamically analyzing the software under test.

In our evaluation we focus on the automated JUnit test generation feature. Since AgitarOne does not base its analysis on a specification language, it lacks an oracle for the generated tests. AgitarOne is therefore most useful for automatically generating a regression test suite that captures the current behavior of the software under test.

During test generation AgitarOne collects observations of the code's behavior. Those observations are similar to the invariants detected by Daikon [17]. In fact, Daikon and AgitarOne use very similar technologies, which were developed independently at about the same time. The user can then decide whether those observations should be promoted to assertions. Thus, AgitarOne helps the software engineer to write specifications by means of Java assertions. Those assertions are used in later iterations of the test generation process (Table 12).

Table 12 AgitarOne: general information

AgitarOne includes a mocking library and is able to run automatically generated and hand-written JUnit tests side-by-side. It is a good example of transforming academic research into a user-friendly and scalable commercial product.

3.3.2 Data generation approach

Agitar Technologies coined the term agitation [5] for its test data generation process. It includes: (a) static and dynamic analysis of the software under test, (b) automatic input generation, (c) exercising the code based on the generated input, and (d) collecting observations in the form of mathematical relationships between variables.

Static and dynamic analysis focus on collecting path constraints. A constraint solving system then provides the input data required to steer the execution along a specific path. In all those analysis steps, AgitarOne focuses on performance and scalability. Thus, AgitarOne prefers a fast “good guess” over a correctly calculated value that requires more time. The following paragraphs are based on ‘From Daikon to Agitator’ [5] and on manually inspecting the generated JUnit tests for all benchmarks.

For all numerical types, i.e., integer, float, and double, AgitarOne uses all constants found in the source code, their negations, and the constants plus/minus a delta value. For example, if AgitarOne finds the integer constant five in the source code, it uses the constants four, five and six as test input. For double values, it uses a delta of 0.001.
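
For concreteness, a hedged sketch of a method containing such a constant, together with the inputs this strategy would be expected to produce:

    // Hypothetical method containing the integer constant 5. Following
    // the strategy described above, AgitarOne would try the constant
    // itself, its negation and the constant plus/minus one, i.e. 5, -5,
    // 4 and 6 (and, for a double constant d, additionally d +/- 0.001).
    public class ConstantDeltaSketch {
        public static boolean aboveThreshold(int value) {
            assert value > 5;
            return true;
        }
    }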

AgitarOne provides a string solver that can handle constraints imposed by the string API. This solver enables AgitarOne to generate string values that satisfy a regular expression specified through the matches(...) method of the java.lang.String class. Furthermore, AgitarOne uses NULL, the empty string, any string constants from the source code, and random combinations of alphanumeric characters (Table 13).
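
A hedged Java sketch of the kind of string-constrained benchmark this covers (the regular expression is illustrative):

    // Hypothetical benchmark: the argument must match the regular
    // expression, e.g. a string consisting of exactly three digits.
    // AgitarOne's string solver can produce such a value.
    public class StringMatchesSketch {
        public static boolean stringMatches(String param) {
            assert param != null && param.matches("[0-9]{3}");
            return true;
        }
    }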

Table 13 AgitarOne: test data generation details

Object types are generated by randomly calling a constructor and zero or more (state-changing) methods of the requested type. Required arguments are generated recursively. All generated objects are kept in a pool to be modified and reused at a later stage in the test generation process. Furthermore, AgitarOne includes a mocking library and enhanced technologies to specify the expected behavior of those mock objects. It records the interaction with the mock object and generates a unit test that expects the same behavior.

3.3.3 Evaluation

We implemented all benchmark tests by means of Java assertions and checked whether AgitarOne is able to generate tests that cover the return statement after the assertion. If so, AgitarOne generated a value that satisfies the specification. Figure 4 shows one of the implemented benchmark methods.

Fig. 4 AgitarOne: example evaluation criteria method

AgitarOne generated 112 JUnit tests. Tables 14 and 15 show that AgitarOne passed all benchmark tests. AgitarOne's capability to generate tests for the forall and exists benchmarks is very impressive. Due to the lack of a supported specification language we had to write the quantifiers manually by means of Java assertions (see Fig. 5). The automatically generated JUnit test is presented in Fig. 6. AgitarOne creates a new array and adds random values (Fig. 6, Lines 5–6). Finally, it determines that it has to set at least one element in the array to “abc” (Fig. 6, Line 7) to satisfy the assertion statement in Line 10 of Fig. 5.

Fig. 5 AgitarOne: exists benchmark
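
A minimal hedged sketch of how such an exists benchmark can be written with Java assertions (the names and layout are illustrative and do not reproduce the original figure):

    import java.util.List;

    public class ExistsBenchmarkSketch {
        // Hypothetical reconstruction: the assertion passes only if at
        // least one element of the list equals "abc".
        public static boolean exists(List<String> values) {
            boolean found = false;
            for (String v : values) {
                if ("abc".equals(v)) {
                    found = true;
                }
            }
            assert found;
            return true;
        }
    }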

Fig. 6 AgitarOne: generated test for the exists benchmark

Table 14 AgitarOne: results of data type benchmark tests
Table 15 AgitarOne: results of structural tests

The forall benchmark test looks very similar to the exists benchmark test, but it asserts in each iteration that the current value has to be equal to “abc”. The generated JUnit test features AgitarOne's Mockingbird mock library. Figure 7 shows the generated test. Wherever complex objects have to be constructed, AgitarOne replaces the actual object with a mock object (Line 3). Lines 5–12 define the behavior of the mock object: for each expected method call the return value is defined, and the sequence of the method calls is defined and asserted.

In this case, the ArrayList has a size of two, and will return “abc” for both calls of get—once with argument 0, once with argument 1. This generated test satisfies the assertion.

Fig. 7 AgitarOne: generated test for the forall benchmark

Note that sometimes AgitarOne used the mock library for the exists test too, which leads us to conclude that some non-deterministic approaches are used.

3.4 AutoTest

3.4.1 General information

AutoTest [12] started as a research tool at ETH Zürich and is by now part of the commercially available Eiffel Studio, the integrated development environment for Eiffel. Eiffel is to date the only programming language with built-in support for Design by Contract™ specifications. AutoTest automatically generates tests for Eiffel programs. It uses different random-based approaches for generating test input data and exports only tests that reveal an error. Furthermore, AutoTest implements two different minimization algorithms to reduce the number of exported tests.

Table 16 summarizes the general information about AutoTest. It is limited to the Eiffel programming language with its built-in support for Design by Contract™ specifications. Eiffel has very prominent clients, as listed, but no information is available on which of them use AutoTest as well.

Table 16 Eiffel: general information

3.4.2 Data generation approach

AutoTest implements a random-based test data generation approach. It uses the Design by Contract™ specification as oracle only. In other words, AutoTest generates test input first and then checks whether it satisfies the precondition of the method under test.

AutoTest has two slightly different approaches for generating primitive and object type input data [29].

Primitive types For the Eiffel primitive types INTEGER, BOOLEAN, CHARACTER, and REAL, AutoTest maintains a list of preset values for each type. Candidate values for the INTEGER type are, e.g., the minimum and maximum value as well as 0, \(-1\), \(+1\), \(-2\), \(+2\), \(-10\), \(+10\). On request it randomly chooses one of those values.

Object types AutoTest maintains a pool of already created objects for each type. On request it randomly chooses one of the existing object instances from the pool. A predefined probability determines how often (in case of an empty pool, always) a new instance of the requested type is generated and added to the pool. Furthermore, again with a preset frequency, AutoTest randomly chooses an instance from the pool and calls modifier features (state-changing methods) on it to diversify the pool.

Whenever a new instance has to be created AutoTest executes the following steps (taken from [29]):

  1. choose one of the creation procedures (constructors) of the class

  2. generate values for all arguments, recursively

  3. call the creation procedure with those arguments

Table 17 summarizes all analyzed aspects of AutoTest's test data generation techniques.

Table 17 AutoTest: test data generation details

Since there is a very close connection between Eiffel Software Inc. and ETH Zürich, research initiatives eventually become part of Eiffel Studio.

Two AutoTest features recently developed at ETH Zürich are ‘Adaptive Random Testing for Object-Oriented Software’ [13] and ‘Satisfying Test Preconditions through Guided Object Selection’ [51].

The former enhances the random selection of values from the pool. Instead of selecting a value purely at random, it selects the value with the highest distance to all values selected in previous iterations. The distance of two integer values is their mathematical difference. The distance function for objects recursively takes the distance of all members and the distance in the inheritance hierarchy into account. Details are explained by Ciupa et al. [13].

The latter enhances the object pool by replacing it with a map from specification predicates to objects. For each object it is recorded which predicates it satisfies. The pool can therefore deliver objects that will likely satisfy a given precondition, provided that similar preconditions occur for multiple methods within the same system under test.

3.4.3 Evaluation

Figure 8 shows the implementation of one of the benchmark methods in Eiffel syntax. require and ensure are the Eiffel keywords for specifying a method's pre- and postcondition, respectively. Note that AutoTest only exports test cases that violate the postcondition. Therefore, we implemented all methods such that they cause a postcondition exception, i.e., all methods return true and the postcondition requires false. All test cases that satisfy the precondition will fail on the postcondition and are therefore exported as unit tests.

Fig. 8 AutoTest: example evaluation criteria method

AutoTest generated 117 tests. Tables 18 and 19 summarize the results.

Table 18 AutoTest: results of data type benchmark tests
Table 19 AutoTest: results of structural tests

AutoTest was able to generate valid test input for all inequality tests. Since each specification consists of only one inequality expression, the likelihood of selecting a value different from the one given in the specification is very high.

Furthermore, AutoTest was able to generate tests for the constant and linear integer benchmarks. In both cases it generated exactly the required value, which leads us to assume that AutoTest may include some more sophisticated approaches for integer values than pure random selection. AutoTest failed on all other tests, which is reasonable because it is very unlikely to randomly generate a value such as 16.32, which is required to satisfy the non-linear float benchmark.

3.5 CodePro AnalytiX

3.5.1 General information

Google, Inc. bought CodePro AnalytiX from Instantiations, Inc. in 2010. Along with the change in ownership, the previously commercial tool became publicly available under the Apache License 2.0.

CodePro AnalytiX is a tool that helps to improve the quality of Java programs. It seamlessly integrates into Rational Developer, IBM WebSphere Studio or any Eclipse development environment [25]. CodePro AnalytiX includes, as all commercial tools do, a rich set of metrics and user-friendly reporting. Furthermore, CodePro AnalytiX is able to find similar code snippets in the system under test and can check the source code against security and style conventions. In the following we focus on CodePro AnalytiX's capabilities for automated JUnit test generation.

CodePro AnalytiX provides a rich set of configuration options, such as (a) which parts of the project should be tested, (b) how many tests should be generated, (c) whether tests that cause an exception should be exported, and (d) where the generated tests should be saved. For each method under test CodePro AnalytiX

  • generates input values for all parameters,

  • determines combinations,

  • computes the result of executing the method under test,

  • validates the result, and

  • generates JUnit test files.

The process of test input data generation is described in Sect. 3.5.2. Typically, not all combinations of generated test input data can be tested due to limited resources. Therefore, CodePro AnalytiX includes rules to reduce the number of combinations to a reasonable level. Afterwards, the result of executing the method under test with the determined set of combinations is calculated. CodePro AnalytiX records the return value of a non-void method and all thrown exceptions and determines how it can check these results in the JUnit test. Finally, exporting the results to JUnit test files is straightforward.

CodePro AnalytiX claims to support simple Design by Contract™ specifications for class invariants and method pre-/postconditions within JavaDoc comments. Unfortunately, we could not see any difference in the generated tests when adding Design by Contract™ specifications. Manually writing a test that clearly violated the contract of the tested method did not result in any Design by Contract™-specific violation message. Thus, we conclude that Design by Contract™ support does not work in our setting (Table 20).

Table 20 CodePro AnalytiX: general information

3.5.2 Data generation approach

Only few details on the test data generation approach are available. CodePro AnalytiX analyzes the method under test to determine how the parameters are used. Based on that analysis, CodePro AnalytiX tries to generate values that help to explore the different behaviors of the method. For example, if an integer parameter is used in a switch statement, it uses each of the values explicitly listed in non-empty case labels as well as some values that are not in any of the case labels [25].
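
A hedged sketch of a method with such a switch statement, annotated with the inputs this strategy would be expected to produce:

    // Hypothetical method under test: following the documented strategy,
    // CodePro AnalytiX would generate tests with mode = 1, mode = 2 and
    // mode = 7 (the case labels) plus at least one value not covered by
    // any label, so that the default branch is also explored.
    public class SwitchExampleSketch {
        public static String describe(int mode) {
            switch (mode) {
                case 1:  return "low";
                case 2:  return "medium";
                case 7:  return "high";
                default: return "unknown";
            }
        }
    }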

In case CodePro AnalytiX does not find any values in this first phase it uses pre-defined default values for all well-known types. Well-known types are all primitive types and non-primitive types such as java.lang.String.

For all other cases, CodePro AnalytiX searches in the given order for zero-argument static accessor methods, constructors and multi-argument static accessors. It uses the first entry found to instantiate an object of that type. Values required as arguments are generated recursively.

CodePro AnalytiX features EasyMock [19], a well-known mocking library which provides easy instantiation of mock objects and configuration of their expected behavior. CodePro AnalytiX can be configured to use EasyMock objects for all interfaces by default. In addition, one can manually specify which classes should be mocked as well (Table 21).

Table 21 CodePro AnalytiX: test data generation details

3.5.3 Evaluation results

We started our evaluation with Design by Contract™ specifications as claimed in the documentation [25]. Unfortunately, we did not manage to get them working. Therefore, we again added assertion statements in the first line of each benchmark method. Figure 9 shows the floatNonLinear benchmark test method, including the Design by Contract™ specification that did not work and the Java assertion statement in Line 7.

Fig. 9 CodePro AnalytiX: example evaluation criteria method

The generated test suite of in total 78 tests satisfied most of the benchmark tests. Tables 22 and 23 summarize the evaluation results.

Table 22 CodePro AnalytiX: results of data type benchmark tests
Table 23 CodePro AnalytiX: results of structural tests

CodePro AnalytiX satisfies all primitive constant, linear and non-linear benchmark tests due to the fact that the required input data is present as constants, or as mathematical operations on constants, in the source code of the method under test. For example, CodePro AnalytiX is able to generate the input value \(16.32\) for the floatNonLinear() benchmark given in Fig. 9, since it finds the constant term \(3.2*5.1\) and the result satisfies the assertion statement.
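
A hedged reconstruction of the essential part of this benchmark (the non-working Design by Contract™ JavaDoc comment is omitted):

    // Sketch of the floatNonLinear benchmark: CodePro AnalytiX finds the
    // constant term 3.2f * 5.1f in the source and passes its result
    // (approximately 16.32) as test input, which trivially satisfies the
    // equality in the assertion.
    public class FloatNonLinearSketch {
        public static boolean floatNonLinear(float param) {
            assert param == 3.2f * 5.1f;
            return true;
        }
    }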

The inequality benchmarks are not satisfied for the same reason. To satisfy those specifications, the result has to be modified slightly by means of an addition, but CodePro AnalytiX only uses exactly the constants present in the source.

For a similar reason CodePro AnalytiX does not perform very well on the structural benchmarks, which mostly deal with object type parameters. After calling the constructor of the object type, CodePro AnalytiX does not call any further methods on the object. Therefore, it does not change the initial state of the object, which in turn does not satisfy the precondition of the method under test. The same reason prevents CodePro AnalytiX from generating tests for the forall and exists benchmarks.

No constants are present in the scalene triangle benchmark test. Therefore, CodePro AnalytiX uses only the set of pre-defined values, which is not sufficient to find a combination that passes the scalene triangle benchmark test.

3.6 RANDOOP

3.6.1 General information

Pacheco et al. introduced RANDOOP [34, 36] in 2007 (citation count: 17/136). In 2008 Pacheco ported the Java version to .NET and used it internally at Microsoft to test a very important component of the .NET framework [35].

RANDOOP is a tool implementation of feedback-directed random testing, which addresses the random generation of unit tests for object-oriented programs. A non-primitive type is created by building a method sequence. Each generated method sequence is immediately executed to ensure that only non-redundant and legal objects are used. Two objects are redundant if their construction sequences are equivalent, in other words, if the generated code for the two sequences is equal modulo variable names. An object is legal if it satisfies all contracts and filters. Contracts are methods that inspect the current state of the system and return either violates or satisfies. Users can write contracts by implementing a class that inherits from randoop.UnaryObjectChecker. In addition, RANDOOP provides a default set of contracts, such as checks for NullPointer occurrences and assertion violations. Furthermore, for objects RANDOOP checks that o.equals(o) holds and that methods such as equals(), hashCode(), and toString() do not throw any exception.

3.6.2 Data generation approach

RANDOOP is a test generation tool for object-oriented programs. Therefore, it incorporates only weak data generation techniques for primitive types.

Primitive types RANDOOP selects randomly an element from the pool. In the implementation the pool contains a small set of primitives:

  • Boolean: true, false

  • char: ‘a’, ‘4’

  • byte, integer: \(-1\), \(0\), \(1\), \(3\), \(10\), \(100\)

  • float: \(0.0f\), \(1.0f\), \(10.0f\), \(100.0f\)

  • double: \(0.0d\), \(1.0d\), \(10.0d\), \(100.0d\)

Object types For object types RANDOOP either uses NULL or uses a sequence from the pool. New sequences are generated by combining two sequences from the pool with \(m\) calls to a randomly selected method. Candidate methods are the public methods of the corresponding class. RANDOOP adds \(m\) calls to the existing sequence because container classes in particular often require more than one element in the container; it therefore makes sense to call, for example, add(...) multiple times in a row. A newly generated sequence is executed to ensure that it is not redundant and constructs an object that does not violate any contracts (Table 24).
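
For illustration (this is not actual RANDOOP output), the kind of sequence RANDOOP assembles for a container type might look as follows in plain Java, with a placeholder method under test:

    import java.util.ArrayList;

    // Hypothetical sketch: a constructor call followed by several
    // state-changing add(...) calls that reuse primitive values from the
    // pool, before the object is passed to the method under test.
    public class RandoopSequenceSketch {
        static boolean methodUnderTest(ArrayList<Integer> list) {
            return list.size() > 1;  // placeholder specification
        }

        public static void main(String[] args) {
            ArrayList<Integer> var0 = new ArrayList<Integer>();
            var0.add(1);
            var0.add(10);
            var0.add(100);
            boolean result = methodUnderTest(var0);
            assert result;
        }
    }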

Table 24 RANDOOP: general information

3.6.3 Evaluation results

We evaluated the Eclipse plugin of RANDOOP for Java; the .NET implementation is equivalent to the Java implementation. By default, RANDOOP uses Java assertion statements to filter sequences that generate illegal object states. Therefore, we implemented our benchmark methods by means of Java assertions, as can be seen in Fig. 10.

Fig. 10 Example evaluation criteria method

In addition to the class containing the benchmark methods, we told RANDOOP to use java.util.ArrayList, java.util.LinkedList and java.util.Stack, since these types are required by some of the benchmark tests but are otherwise unknown to RANDOOP, and we set the null object generation probability to \(0.3\). The results did not improve when we increased the default timeout from 100 to 300 s, or even 1,000 s (Table 25).

Table 25 RANDOOP: test data generation details

RANDOOP mainly targets the challenge of testing object-oriented programs. It is therefore not surprising that it does not perform very well on the primitive type benchmarks. Table 26 summarizes the expectedly weak performance of RANDOOP on the primitive benchmark tests. Comparing the inequality benchmark specifications with the primitive values in the pool (see Sect. 3.6.2) shows that the pool contains at least one element for each type that satisfies the specification, except for the character type: the pool contains only 'a' and '4', but the benchmark expression requires a character greater than 'b'.

Table 26 RANDOOP: results of data type benchmark tests

More interesting are the structural benchmark tests, which include more object types. Table 27 summarizes RANDOOP's performance on this set of benchmark tests. Unfortunately, RANDOOP did not perform that well here either. We expected that RANDOOP could not satisfy the specifications for the parameter dependencies and the scalene triangle benchmarks, since those two tests require sophisticated primitive value generation capabilities.

Table 27 RANDOOP: results of structural tests

Manually inspecting why RANDOOP did not pass the two quantifier benchmarks revealed that it was able to generate java.util.List objects with enough elements, but never with the expected values. This can again be attributed to RANDOOP's weak primitive value generation capabilities.

3.7 PEX

3.7.1 General information

Microsoft Research started the development of PEX [45], a white-box test generation tool for .NET, several years ago. Meanwhile it is not only a research tool but also part of the Visual Studio 2010 Power Tools that help with unit testing .NET applications. PEX started as a tool that creates a test suite achieving high branch coverage based on dynamic symbolic execution. Today, it incorporates other tools and research results. PEX uses the Z3 [31] SMT solver for solving the path constraints collected during dynamic symbolic execution. It uses REX [47] for generating string values specified by means of regular expressions; it supports Code Contracts [3], a Design by Contract™ specification language for .NET; and it features Moles [16], a light-weight mocking library from Microsoft Research. Furthermore, PEX fully integrates with Microsoft Visual Studio.

Table 28 summarizes the general information about PEX. It works on the intermediate language of .NET, so it can be used for testing programs written in any .NET language. Note that currently only unit tests for C# can be exported.

Table 28 PEX: general information

The core of PEX is a test data generator. It supports not only test data generation for Design by Contract™ specifications but also specifications given as parameterized unit tests [46]. Parameterized unit tests are unit test methods that have parameters. In other words, a parameterized unit test specifies the behavior of the method under test for all possible input values; one specific parameter combination is equivalent to a traditional unit test.

3.7.2 Data generation approach

PEX starts with simple random input for a given method under test. While executing the method, PEX collects runtime information, e.g., symbolic values for all variables and path constraints. At each conditional statement PEX records information about the branching criteria. PEX then re-executes the method with input values computed to satisfy a so far unexplored combination of path conditions. This process is called dynamic symbolic execution [9, 20]; it is also known as concolic (concrete symbolic) execution [43].

Therefore, PEX is able to explore all feasible paths of the method under test. The values are calculated by passing the path constraints to the Z3 SMT solver. Z3 is able to solve constraints over propositional logic, fixed-size bit-vectors, tuples, arrays and quantifiers. Arithmetic constraints over floating point numbers are approximated by a translation to rational numbers; heuristic search techniques are used outside of Z3 to find approximate solutions for floating point constraints [41]. Recently, PEX integrated REX, a technology for generating string values that are formalized by means of regular expressions [47].
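
To illustrate the idea, the following hedged sketch (written in Java syntax for readability, although PEX itself operates on .NET code) shows a method together with the path constraints dynamic symbolic execution would collect and solve:

    // Hypothetical method: a first random run with, say, x = 0 records the
    // path constraint !(x > 10); solving the negated constraint yields an
    // input with x > 10; the next run adds the constraint x * x == 169,
    // and the solver finally produces x = 13, which reaches the innermost
    // branch.
    public class PathConstraintSketch {
        static void methodUnderTest(int x) {
            if (x > 10) {
                if (x * x == 169) {
                    throw new IllegalStateException("deep branch reached");
                }
            }
        }
    }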

Implementation details regarding the instrumentation process for symbolic execution and the symbolic representation of values, pointers and objects can be found in numerous technical reports and publications of Microsoft Research [41, 45] (Table 29).

Table 29 PEX: test data generation details

3.7.3 Evaluation results

To evaluate PEX we implemented one method for each evaluation criterion. Each method body consists of a single return true statement. Figure 11 shows an example implementation.

Fig. 11 Example evaluation criteria method

We evaluated the data generation facility of PEX by letting it explore all paths. The result is a set of test data combinations such that all feasible paths are executed. The Code Contracts preconditions are recognized, and PEX interprets them as additional branching statements. In other words, it tries to generate test data such that each clause of the specification is fulfilled at least once and violated at least once. The result is a set of test input data; tests not satisfying the precondition are marked as meaningless. Tables 30 and 31 show that PEX is able to pass all benchmark tests. This does not necessarily mean that PEX can test everything out of the box, but it is definitely the most advanced tool at the moment.

Table 30 PEX: results of data type benchmark tests
Table 31 PEX: results of structural tests

For all tests that included parameters of type float or double, PEX issues a ‘testability issue in floating point equality’ warning. This warning tells the user that PEX only uses heuristics for floating point operations. Still, PEX was able to generate correct input data for all those benchmarks.

4 Related work

Throughout the paper we focused on tools for object-oriented languages; this holds for the related work as well. Furthermore, we do not include any UML-based tools. Therefore, we do not consider tools such as the QuviQ testing tools [2], CONFORMIQ [24], LEIRIOS [26], and the BZ testing tool [28].

This section is categorized in two parts:

  • test data generation tools that were not considered to be part of the evaluation for this survey due to not fulfilling the required criteria listed in Sect. 2.1, and

  • Black-Box testing tools.

The tools mentioned in the upcoming sections are ordered chronologically. We use the citation count to decide which tools are mentioned and which are not: Sect. 4.1 includes only tools that are cited at least 150 times, and Sect. 4.2 requires at least 30 citations. The citation counts were determined through Google Scholar on October 8th, 2010.

4.1 Test data generation tools

Korat [7] (citation count: 384) is a test case generation tool based on Design by Contract™ specifications. It uses a method's postcondition as oracle and uses the precondition to generate complex test input data. Korat uses a repOK() and a finitization method for constructing all non-isomorphic test inputs up to a given bound. The finitization method implements the search for new input; Korat observes accesses to precondition predicates and class fields to prune the search space, and the bound is specified in the finitization method. Korat provides a preliminary implementation, which the user is able to enhance if necessary. The repOK method implements the precondition check: it returns true if the generated input satisfies the specification, and false otherwise.
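
For illustration, a repOK method is an ordinary boolean predicate over the object's state; a hedged sketch for a hypothetical bounded stack class (the class itself is invented for illustration):

    // Hypothetical class used with a Korat-style generator: repOK()
    // encodes the invariant that every generated input must satisfy.
    public class BoundedStack {
        private int[] elements;
        private int size;

        public boolean repOK() {
            if (elements == null) return false;
            if (size < 0 || size > elements.length) return false;
            return true;
        }
    }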

Visser et al. [49] introduced JPF in 2003 as a tool for model checking Java programs. It is a very mature tool, which has already been applied to real-world case studies, among them the real-time operating system DEOS from Honeywell [40] and a prototype Mars Rover [8]. Based on the JPF framework, Visser et al. introduced a test input data generation extension [50] (citation count: 198) some years later. Similar to PEX, the test input data generation extension of JPF uses symbolic execution of a repOK method, the method's precondition, to generate all (non-isomorphic) input data. A manual bound for the input data size is given. Multiple extensions to JPF exist that even add Design by Contract™ support, but unfortunately most of them are research prototypes or no more than research ideas. JPF's test data generation capability is not included in this survey due to missing industrial size case studies.

In 2005 Sen co-authored two very successful tools with respect to their publication count: DART and Cute. DART [20] (citation count: 569) is a test data generation tool that tries to cover all paths within the system under test by combining concrete and symbolic execution. Cute [43] (citation count: 395) combines concrete and symbolic (concolic) execution as well, but extends it to pointer structures. Note that concolic execution is equivalent to the dynamic symbolic execution of PEX. DART combines three main techniques: (a) automated interface extraction, (b) automatic generation of a test driver, and (c) automatic generation of new test input data based on dynamic analysis of the program behavior with respect to its input data. Program crashes or assertion violations constitute the test oracle. DART gathers path constraints while executing the program under test with initially random input values. New input values for the same test force the execution to take a new path (for all reachable paths). Due to concolic execution, DART can replace a path constraint which the corresponding constraint/SAT solver cannot solve with a concrete value, e.g., true or false. This allows DART to be used for more complex case studies. Cute is closely related to DART, but improves some steps in the process of test data generation. Sen points out in the related work section [43, p. 271] that, unlike DART, Cute can handle pointers and data structures as input parameters and that it implements a new constraint solver that significantly speeds up the analysis.

EXE [10] (citation count: 258) is the “youngest” test data generation tool that has already achieved enough citations to be mentioned in this survey. EXE uses symbolic execution to generate input data that forces the program under test to crash. The programmer can mark variables, i.e., memory locations, to be traced symbolically. The program is then instrumented to execute all feasible paths. In case a path terminates, e.g., the program crashes, exit() is called, or an assertion fails, a concrete value is generated which can reproduce the error/crash when executing the original program without instrumentation. EXE performed well on the BSD and Linux packet filter implementations, the udhcpd DHCP server, the pcre regular expression library, and three Linux file systems [10].

4.2 Black-Box testing tools

SpecExplorer [11] (citation count: 71), [48] (citation count: 69) is a model-based testing tool from Microsoft for .NET programs. Initially, models had to be written in the Abstract State Machine Language (AsmL) format. Later, the Spec# [4] language was developed to bring the syntax of the specification language closer to the syntax of the programming language used for the implementation [22]. Spec# is a superset of the prominent C# programming language and basically adds Design by Contract™ keywords to it. Programs written in Spec# are thus model programs that include a formal specification and can be executed. The latest release of SpecExplorer further reduces the gap: the model can be written in C# syntax and fully integrates into the Visual Studio 2010 integrated development environment for .NET. A SpecExplorer solution in Visual Studio 2010 therefore consists of three projects: (1) the model in C#, (2) the implementation in any .NET language, e.g., C#, and (3) the test suite in the Visual Studio Unit Test format. Being able to write the model in the same language as the implementation improves the applicability of the approach and tool, since developers can reuse their knowledge of the programming language. They can focus on what the best model abstraction of the implementation is, and not on how to write it [22]. The SpecExplorer Visual Studio tool is very mature and able to automatically generate test input data for complex models by means of a combination of generation techniques; for example, it integrates combinatorial testing techniques and SMT constraint solving [23]. Still, the SpecExplorer approach is to separate the model from the implementation. SpecExplorer is designed to handle non-deterministic and multi-threaded software.

Furthermore, SpecExplorer provides a facility to specify accepting states through a condition. This is important since multi-threaded and non-deterministic programs do not always terminate, and a model program may correspond to an infinitely large automaton. A test purpose can be used to slice a model to the parts a test is interested in. SpecExplorer supports both offline and online testing. Offline tests are generated from the model either to provide some kind of coverage or based on a random walk through the state space. Online tests are created on the fly as testing proceeds [48]: at each step one controllable action is selected for execution, based on predefined or dynamically updated weights.

UniTesK [6] (citation count: 40) is a general architecture for test generation with two specialized versions: JavaTesK for Java and CTesK for C and C++ programs. The UniTesK test sequence generation tool family works on the specification only. It requires Mediator instances to link the specification with a given implementation and to keep those two independent systems synchronized during test execution. UniTesK traverses all states of the specification, limited by an arbitrary coverage criterion. This coverage criterion can be described by a set of predicates whose values are calculated based on the system's state, the current operation and all its parameters. As with all other approaches and tools, UniTesK uses the given specification as test oracle. UniTesK and its related tools for Java and C are developed to be used in testing industrial software.

5 Conclusion

For this survey we evaluated only tools that are able to automatically generate tests, including test input, in a white- and gray-box scenario. Only tools that (a) are publicly available, (b) have already been applied to industrial size case studies, and (c) received an update within the last 3 years are considered.

Table 32 summarizes the necessary input to each tool and the provided output. Most of the tools work on the actual source code; only C++test and RANDOOP require the compiled executable, either in addition to or instead of the source code. All tools produce unit tests in the format that is most common for the corresponding programming language, i.e., JUnit tests for Java, Eiffel tests for Eiffel, and CppTest tests for C++. Only PEX, which is the most advanced tool of all since it is not only a research tool but also part of the Visual Studio distribution, interfaces with multiple unit test frameworks available for .NET applications.

Table 32 Tool input and output

In addition to the source code, some tools are able to understand contracts. Depending on the tool, these contracts can be either assertions, as provided by the standard language definition of the corresponding programming language, or Design by Contract™ specifications integrated into the programming language or supported through add-ons.

Eiffel is the leading programming language with respect to the integration of Design by Contract™ specifications. From the very beginning of the language definition, Bertrand Meyer focused on creating a programming language that fully integrates mathematical correctness techniques and tools into the language. Based on the success of Eiffel, research groups began to develop Design by Contract™ add-ons for other languages as well. The JMLSpec initiative became the de facto standard for Java applications. Unfortunately, Jtest took another approach and developed its own set of Design by Contract™ specifications; the syntax is very similar to JMLSpec but unfortunately not identical. It is again PEX that provides the most flexible and advanced support for additional specifications. It understands assertions and seamlessly integrates with Microsoft's Code Contracts project. In addition, PEX is the only tool in this evaluation that collects path constraints based on standard programming language features, such as if-statements, for- or while-loops, and method calls. Therefore, PEX is able to produce not only a single input that satisfies a given specification, as evaluated in this survey, but a set of input values that aims to achieve path coverage for the provided software.

Each of the tools was evaluated on a standard set of benchmarks that is presented in Sect. 2.2. This set of benchmarks was designed to find the capability boundaries of each tool. The evaluation results are summarized and discussed in detail in each tool's section. Furthermore, Tables 33 and 34 provide an overview of all tools, so that they can be compared directly.

Table 33 Primitive data type benchmark results overview
Table 34 Object type benchmark generation results overview

Jtest is a very mature tool that can deal with Java programs with and without Design by Contract™ specifications. The supported Design by Contract™ syntax is simple but misses important specification features, such as quantifiers. Jtest satisfied most of the benchmark tests due to a very useful practical technique: it uses constants present in the method under test, as well as slightly modified values of those constants, as test input. For example, Jtest adds and subtracts one from each integer value found. Such manipulation rules are available for all primitive data types.

C++test is the small brother of Jtest: it is Parasoft's test generation tool for C/C++ programs. C++test includes most of Jtest's features, but due to limitations of C/C++ compared to Java (e.g., the lack of reflection), it does not perform as well as Jtest on object types. For primitive types it uses constants found in the source code and manipulates them, similar to Jtest, CodePro AnalytiX, and AgitarOne. Therefore, these four tools, together with PEX (which uses a completely different approach), are able to generate tests for all evaluated primitive data type benchmarks. For object type tests C++test uses public constructors of the requested type. Therefore, C++test was able to generate the "trivial" test that checks whether the tool can construct an object at all, but it failed on all other tests that required some form of advanced generation technique.

AgitarOne is very similar to Jtest. It successfully made the transition from a research project to a commercial application. It is one of two tools (the other is PEX) that passed all benchmark tests. Again, the key to success is the strategy of using values found in the method under test, slightly modified by means of mathematical operations. In addition, AgitarOne generates mock object stubs for all object types, which can be used to manually add objects of interest. It is the only tool that provides this kind of mechanism; of course, others such as PEX use mock objects as well, but they are able to generate them fully automatically. For primitive data generation, AgitarOne differentiates itself from Jtest by incorporating constraint solving techniques.

AutoTest and RANDOOP are the only two pure random tools that qualified to be part of this survey. AutoTest performed a little better on the primitive type benchmark tests than RANDOOP, which can be explained by the tools' different generation approaches: AutoTest randomly chooses values for primitive types, whereas RANDOOP selects a value from a predefined pool. On the structural benchmark tests RANDOOP performed better. Since both tools work on different programming languages, they do not compete against each other; AutoTest is the only evaluated tool for Eiffel, which is still the only programming language that fully incorporates Design by Contract™ specifications and has a precise mathematical semantics. RANDOOP, on the other hand, competes with Jtest, CodePro AnalytiX, and AgitarOne for Java programs, and with PEX for .NET programs. In both categories it is outperformed by all of the other evaluated tools.

CodePro AnalytiX is another commercial tool for testing Java applications. It incorporates technologies similar to Jtest's, but lacks the functionality of modifying constants found in the method under test. Furthermore, it does not include any constraint solving techniques. These are the reasons why CodePro AnalytiX does not pass as many primitive type benchmark tests as Jtest does. CodePro AnalytiX claims to support Design by Contract™ specifications, which we could not confirm. It looks like Design by Contract™ support is planned for the future but is not part of the current release that was given to us for evaluation purposes.

PEX features the most sophisticated and recent data generation techniques. Research on string generation, mock object instantiation, parameterized unit tests, and constraint solving is incorporated seamlessly in PEX. This allowed PEX to pass all benchmark tests. One has to note that PEX is the only evaluated tool that does not merely try to generate individual unit tests, but a set of unit tests that achieves code coverage. It therefore collects path constraints with each generated test and tries to generate values that take another path in the next generation iteration.
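The parameterized unit tests mentioned above are ordinary test methods that take their inputs as parameters; the generation tool, not the developer, supplies the argument values. The following Java-flavoured sketch only illustrates the concept (PEX itself works on C#/.NET code, and the class and method names are our own).

// Conceptual sketch of a parameterized unit test: the tool explores values for x
// and reports any input that violates the assertion. Dynamic symbolic execution
// would eventually propose x = Integer.MIN_VALUE, the one value for which
// Math.abs returns a negative result.
public class AbsParameterizedTest {
    public void absIsNonNegative(int x) {
        assert Math.abs(x) >= 0 : "abs must not be negative";
    }
}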

Nevertheless, Tillmann et al. [45] list limitations of PEX that are mentioned here because they point out limitations common to other tools as well:

  • Concurrency PEX works for single-threaded programs only.

  • Native code Native code cannot be instrumented and therefore PEX cannot collect constraints on it. Nevertheless, PEX will try to generate input even without exact knowledge about the native code.

  • Nondeterminism PEX assumes that the program under test is deterministic. In case PEX detects non-deterministic behavior, by comparing actual behavior with the expected behavior from previous runs, the search space is pruned and a warning is issued.

  • Symbolic reasoning PEX uses Z3 to instantiate concrete values from (path) constraints, but SMT solvers have limitations of their own, which therefore apply to PEX as well.

The first two issues hold for all tools, the third for all tools that try to achieve coverage in a systematic way, and the fourth only for tools that use constraint solving.

The impressive evaluation results shown by AgitarOne and PEX are due to the progress in solving technologies in recent years. Random testing did not perform very well in this evaluation; we therefore conclude that it is not well suited for coverage-oriented testing. Nevertheless, random testing should be used intensively for robustness testing, i.e., testing with unexpected input values.

We do not want to classify the tools according to their generation technique, since the evaluation showed that those tools that incorporate several different techniques work best. Instead, Table 35 gives an overview of which techniques each of the evaluated tools incorporates, according to their publications and to a manual inspection of the generated tests. The following paragraphs briefly explain the authors' understanding of the row captions.

Table 35 Tool generation techniques overview
  • Manual The tool provides a mechanism to add manual values or objects that are used in generated tests. Typically, tools allow the user to write a few lines of code that add manual values or manually constructed objects to a pool of values/objects.

  • Pre-defined values The tool randomly selects one value from a set of hard-coded values for each data type. For example, AutoTest fills its INTEGER pool, from which it randomly selects one value, with the minimum and maximum integer value, 0, +1, -1, +2, -2, +10, and -10 (a minimal sketch of this strategy is given after this list).

  • Random The tool randomly generates a value on the fly.

  • Constants extraction The tool extracts constants from the source file, or uses other constant values such as the name of the method, class, or path of the source file.

  • Constants extraction + manipulation Those tools manipulate the extracted constants by a pre-defined set of rules. For example, AgitarOne generates integer type input values that differ by one from those extracted from the source (extracted value: 6; tested values: 5, 6, 7); see the second sketch after this list.

  • Combinatorial testing In case more than one input value has to be generated, tools try to generate (a subset of) all possible combinations of the values stored in their pools. This row is rather unspecific, since all tools generate more than one test, and no tool generates all combinations. Therefore, all tools do some form of combinatorial testing, but none of them implements the strict definition.

  • Constraint logic Those tools build up a constraint system that models the testing problem and use constraint solvers or SMT solvers to generate input values that satisfy the constraint system (a toy illustration is given after this list). Typically, those tools incorporate tricks and tweaks to improve generation results that are neither well documented nor easy to analyze. Therefore, we do not distinguish between different constraint logic approaches.

  • Null objects Those tools use the null object as test input.

  • Random constructor Those tools construct objects by randomly choosing one of the public constructors and executing it with random parameters.

  • Random constructor + manipulation Those tools further manipulate the object state by randomly calling other methods on the object after construction (a reflection-based sketch is given after this list).

  • Mock/stub generation Those tools generate stub or mock objects. A stub object is only a class or method definition without implementation; a human being has to look at the generated stubs and write code that returns meaningful values/objects (similar to factory methods). Mock objects already include an implementation describing an interaction sequence, for example extracted from a test execution.
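The following sketches are our own minimal illustrations of four of the rows above; all class and method names are invented for illustration and are not taken from any of the evaluated tools. The first sketch shows the pre-defined values strategy; the pool content mirrors the AutoTest INTEGER pool listed above.

import java.util.List;
import java.util.Random;

// Pre-defined values: every request for an int test input picks one element
// from a fixed pool of "interesting" values.
public class PredefinedIntPool {
    private static final List<Integer> POOL = List.of(
            Integer.MIN_VALUE, Integer.MAX_VALUE, 0, 1, -1, 2, -2, 10, -10);
    private final Random random = new Random();

    public int nextInt() {
        return POOL.get(random.nextInt(POOL.size()));
    }
}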
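The second sketch illustrates constants extraction + manipulation: constants found in the method under test are taken as test inputs together with neighbouring values, in the spirit of the Jtest and AgitarOne rules described above (the plus/minus one rule is the only manipulation shown here).

import java.util.LinkedHashSet;
import java.util.Set;

// Constants extraction + manipulation: for every int constant c extracted from the
// method under test, also try c - 1 and c + 1 (extracted value 6 -> tested values 5, 6, 7).
public class ConstantNeighbourGenerator {
    public Set<Integer> inputsFor(Set<Integer> extractedConstants) {
        Set<Integer> inputs = new LinkedHashSet<>();
        for (int c : extractedConstants) {
            inputs.add(c - 1);
            inputs.add(c);
            inputs.add(c + 1);
        }
        return inputs;
    }
}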
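The third sketch illustrates, in deliberately naive form, what constraint logic means: a condition taken from the code or the specification is treated as a constraint, and an input satisfying it is searched for. The brute-force loop below is only a stand-in; the evaluated tools encode such constraints symbolically and hand them to a solver such as Z3.

import java.util.OptionalInt;
import java.util.function.IntPredicate;
import java.util.stream.IntStream;

// Toy "solver": enumerate a bounded range and return the first value that satisfies
// the constraint. Real constraint-logic tools solve this symbolically, not by enumeration.
public class ConstraintSketch {
    public static OptionalInt findSatisfying(IntPredicate constraint, int lo, int hi) {
        return IntStream.rangeClosed(lo, hi).filter(constraint).findFirst();
    }

    public static void main(String[] args) {
        // Example constraint: x > 100 && x % 7 == 0; the first solution in range is 105.
        System.out.println(findSatisfying(x -> x > 100 && x % 7 == 0, -1000, 1000));
    }
}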
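The last sketch shows random constructor + manipulation using Java reflection: a public constructor is chosen at random, invoked with random arguments, and the resulting object is then mutated by further random method calls. For brevity the sketch only supports int parameters and simply skips calls that fail; real tools build non-primitive arguments recursively and record failing calls as potential bug-revealing tests.

import java.lang.reflect.Constructor;
import java.lang.reflect.Method;
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

// Random constructor + manipulation, restricted to int parameters for brevity.
public class RandomObjectSketch {
    private final Random random = new Random();

    public Object createAndMutate(Class<?> type, int mutationCalls) throws Exception {
        List<Constructor<?>> candidates = new ArrayList<>();
        for (Constructor<?> c : type.getConstructors()) {
            if (allInts(c.getParameterTypes())) {
                candidates.add(c);
            }
        }
        if (candidates.isEmpty()) {
            throw new IllegalStateException("no supported public constructor");
        }
        Constructor<?> ctor = candidates.get(random.nextInt(candidates.size()));
        Object obj = ctor.newInstance(randomArgs(ctor.getParameterCount()));

        Method[] methods = type.getMethods();
        for (int i = 0; i < mutationCalls; i++) {
            Method m = methods[random.nextInt(methods.length)];
            if (!allInts(m.getParameterTypes())) {
                continue;                                   // unsupported parameter types
            }
            try {
                m.invoke(obj, randomArgs(m.getParameterCount()));  // mutate the object state
            } catch (Exception e) {
                // a failing call would be an interesting test in a real tool; here we skip it
            }
        }
        return obj;
    }

    private Object[] randomArgs(int count) {
        Object[] args = new Object[count];
        for (int i = 0; i < count; i++) {
            args[i] = random.nextInt(100);                  // int arguments only
        }
        return args;
    }

    private boolean allInts(Class<?>[] parameterTypes) {
        for (Class<?> p : parameterTypes) {
            if (p != int.class) {
                return false;
            }
        }
        return true;
    }
}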

The evaluation summarizes the capabilities of the tools to generate input values that satisfy either a given precondition or an assertion (in case the tool does not support Design by Contract™ specifications). From our point of view it is valid to use this evaluation mechanism, since the asserted expressions are not specific to Design by Contract™ but can occur throughout the source code as (branching) conditions as well. The used expressions are synthetic, i.e., they look very constructed. This is due to the fact that we wanted to clearly find the boundaries of a tool with respect to its value generation capabilities. All used expressions are trivial and focus on one feature at a time. In real applications, assertion statements or branching conditions are combinations of such simple evaluation expressions. We argue that if a tool is not able to solve a trivial evaluation expression, it will not be able to generate data that satisfies combinations of those expressions. Therefore, we conclude that the presented evaluation results are a good starting point for finding the boundaries of state-of-the-art test data generation tools and techniques for object-oriented languages.