A systematic review of fuzzing

Zhao, Xiaoqi; Qu, Haipeng; Xu, Jianliang; Li, Xiaohui; Lv, Wenjie; Wang, Gai-Ge

doi:10.1007/s00500-023-09306-2

A systematic review of fuzzing

Application of soft computing
Published: 31 October 2023

Volume 28, pages 5493–5522, (2024)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Soft Computing Aims and scope Submit manuscript

A systematic review of fuzzing

Download PDF

Xiaoqi Zhao¹,
Haipeng Qu ORCID: orcid.org/0000-0002-1564-8980²,
Jianliang Xu²,
Xiaohui Li²,
Wenjie Lv² &
…
Gai-Ge Wang²

1538 Accesses
3 Citations
Explore all metrics

Abstract

Fuzzing is an important technique in software and security testing that involves continuously generating a large number of test cases against target programs to discover unexpected behaviors such as bugs, crashes, and vulnerabilities. Recently, fuzzing has advanced considerably owing to the emergence of new methods and corresponding tools. However, it still suffers from low coverage, ineffective detection of specific vulnerabilities, and difficulty in deploying complex applications. Therefore, to comprehensively survey the development of fuzzing techniques and analyze their advantages and existing challenges, this paper provides a comprehensive survey of the development of fuzzing techniques, summarizes the main research issues, and provides a categorized overview of the latest research advances and applications. The paper first introduces the background and related work on fuzzing. Research issues are subsequently addressed and summarized, along with the latest research developments. Furthermore, various customized fuzzing techniques in different applications are presented. Finally, the paper discusses future research directions.

Hermes: A Targeted Fuzz Testing Framework

Refined Grey-Box Fuzzing with Sivo

The Research on the Fuzzing

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Fuzzing, also known as fuzz testing, is a powerful software testing technique that has gained significant attention in the field of software and system security testing. It involves automatically generating a large number of test cases and feeding them into the target program to detect bugs, crashes, or vulnerabilities. Today, fuzzing has emerged as a popular technique in both academia and industry. Some prominent software companies, such as Google (Abhishek and Cris 2012; Chris et al. 2011; Max and Kostya 2016), Microsoft (Onefuzz 2020), Cisco and Adobe (Brad 2009), have developed their fuzzing tools and have successfully discovered thousands of vulnerabilities in their products. An increasing number of fuzzing studies appear at security and software engineering-related conferences and journals (Godefroid et al. 2008a; Woo et al. 2013). Designed fuzzing tools (also known as fuzzers) open sourced on GitHub and discovered many vulnerabilities in open-source software. Additionally, fuzzing has been widely employed in various renowned competitions, including the DARPA Cyber Grand Challenge (2016).

Fuzzing was proposed by Miller et al. in 1988. It was primarily employed for testing the robustness of UNIX programs (Miller et al. 1995). In 1999, it was extended to encompass security testing. During this period, blackbox fuzzing was predominantly implemented, with notable fuzzers such as PROTOS (Viide et al. 2008), SPIKE (Godefroid 2020), and Peach (Liang et al. 2018a). Blackbox fuzzing generates test cases randomly, with fast testing speed. However, it lacks access to internal program information, limiting the full exploration of deep program logic. In 2008, Godefroid et al. (2008c) developed SAGE, a whitebox fuzzer that combines symbolic execution and fuzzing techniques to generate test cases. Compared to blackbox fuzzing, whitebox fuzzing can generate test cases correlating to particular paths by exploiting program internal information. Nonetheless, software complexity and solver limitations (Avgerinos et al. 2014; Baldoni et al. 2018) present obstacles to the effectiveness of fuzzing in conducting thorough testing within a restricted time frame. Therefore, researchers have shown considerable interest in achieving a balance between the utilization of program internal information and testing efficiency. This has driven the development of greybox fuzzing. At the end of 2013, Zalewski (2013) released a greybox fuzzer American Fuzzy Lop (AFL). AFL uses instrumentation to collect path information from the target program and uses coverage to guide test case generation during fuzzing process, which has become known as coverage-based greybox fuzzing (CGF) (HonggFuzz (2015); Serebryany 2016).

Despite the successes achieved by AFL and CGF in the field of fuzzing, there are still numerous unresolved challenges. One of the primary challenges is the limited comprehension of target programs, especially for complex programs. Fully comprehending the logic and data flow of programs is an arduous task. Consequently, this lack of comprehension impedes the exploration of in-depth paths within the program, thereby restricting the improvement of code coverage (Lou and Song 2020). Another significant challenge arises from the restrictions of fuzzing in modeling specific vulnerabilities. Fuzzing randomly generates test case, bug it frequently lacks the crucial information concerning particular vulnerability features and their locations. As a result, it struggles to accurately simulate and detect certain types of vulnerabilities (Trickel et al 2023). Additionally, the deployment of fuzzing in complex applications, and their testing efficiency are important current challenges (Beaman et al. 2022; Donaldson et al. 2023).

In recent years, fuzzing has shown a trend toward integration, diversification, and open source (Google: ClusterFuzz 2019; Serebryany 2017). Current research on fuzzing mainly focuses on general fuzzing, vulnerability-oriented fuzzing, combining fuzzing with other techniques, and fuzzing for different applications. General fuzzing aims to improve the process of fuzzing to explore deep program paths and improve code coverage. For example, Skyfire (Wang et al. 2017) improves initial test case generation to increase code coverage, AFLFast (BÖhme et al. 2019) improves energy allocation to discover more paths, FairFuzz enhances mutation strategies to improve path coverage, MooFuzz (Zhao et al. 2021) improves seed schedule for better path discovery, and AFLSmart (Pham et al. 2019) focuses on the input format of the target program to generate test cases that conform to the program’s format to explore deep path. General fuzzing has better generality, but they face certain challenges in detecting specific vulnerabilities. To address this challenge, vulnerability-oriented fuzzing focuses on particular vulnerabilities and conducts relevant fuzzing research based on those vulnerability features. For example, MemLock (Wen et al. 2020) focuses on detecting uncontrollable memory consumption and uncontrollable recursive bugs. PerfFuzz (Lemieux et al. 2018) explores algorithmic complexity vulnerabilities by maximizing the edge count in the control flow graph. ConFuzz (Vinesh et al. 2020) considers the characteristics of concurrency vulnerabilities and focuses on detecting this type of vulnerability. Moreover, fuzzing is combined with other security testing techniques such as taint analysis (Bekrar et al. 2012), symbolic execution (Noller et al. 2018), machine learning (Saavedra et al. 2019), and other techniques to improve its testing performance. Fuzzing is also currently being customized for complex applications to uncover potential vulnerabilities and bugs within them.

This overview is motivated by two main points. Firstly, fuzzing has gained significant attention and undergone rapid development in recent years. It has been widely adopted across various applications and extensively utilized by numerous companies and competitions. This highlights the importance and effectiveness of fuzzing in identifying vulnerabilities and enhancing software security. Secondly, there is a lack of comprehensive surveys specifically focused on fuzzing that cover recent advancements and developments. Previous reviews (Li et al. 2018; Liang et al. 2018b; Manès et al. 2019) have provided summaries of fuzzing achievements up until 2018. Other papers (Eisele et al. 2022; Wang et al. 2020) offer systematic reviews of the historical development of fuzzing but tend to concentrate on specific types of fuzzing techniques. There is a necessity for an up-to-date and comprehensive review that encompasses the recent advancements and developments in fuzzing techniques.

This paper presents a comprehensive review of current research on fuzzing. Firstly, an overview of the basic process and classification of fuzzing is provided to offer readers a holistic understanding. The paper then proceeds to introduce CGF as a widely used and representative technique in fuzzing, establishing a solid theoretical foundation and providing technical support for subsequent research advancements. Subsequently, the latest advancements in fuzzing are categorized and discussed, exploring their applications across various domains. Finally, the paper concludes by summarizing the key findings of the reviewed research and future directions.

In this paper, we make the following main contributions.

We provide an overview of the processes and classifications of fuzzing, give definitions of CGF and related design details.
We discuss the research issues studied in fuzzing and categorize and survey the latest research work.
We survey fuzzing techniques in different application scenarios.
We summarize the challenges and future research directions of fuzzing.

The rest of the paper is organized as shown in Fig. 1. Background and related work are introduced in Sect. 2. Section 3 surveys recent fuzzing research advancements. This is followed by a review of fuzzing in applications in Sect. 4. Section 5 concludes the paper and discusses future directions.

2 Background and related work

In this section, we first provide the inclusion criteria for the papers covered in this review, then provide an overview of the fuzz testing process, discuss the classification of fuzzing, and introduce the current classic coverage-based greybox fuzzing (CGF), and finally discuss related work.

2.1 Inclusion criteria

We reviewed more than 100 papers, mostly significant works published in top conferences and journals in the software engineering and security field from 2018 to 2023. We also included outstanding fuzzing papers published in industrial conferences, such as Blackhat. To ensure a comprehensive comprehension of the development of fuzzing techniques across various applications, we have gathered a number of classical fuzz testing papers, without any limitations on publication dates. In addition, we have collected top journals papers covering various fuzzing applications to offer a holistic perspective. To clearly define the scope, the inclusion criteria adopted are as follows.

Table 1 Security and software engineering top conference papers

A systematic review of fuzzing

Abstract

Similar content being viewed by others

Hermes: A Targeted Fuzz Testing Framework

Refined Grey-Box Fuzzing with Sivo

The Research on the Fuzzing

Explore related subjects

1 Introduction

2 Background and related work

2.1 Inclusion criteria

2.2 Process of fuzzing

2.3 Classification of fuzzing

2.4 CGF

2.4.1 Instrumentation

2.4.2 Seed selection and power schedule

2.4.3 Mutation strategy

2.5 Related work

3 State-of-the-art fuzzing

3.1 General fuzzing

3.1.1 Initial seed selection (RQ1)

3.1.2 Seed selection optimization (RQ2)

3.1.3 Power schedule (RQ3)

3.1.4 Mutation strategy (RQ4)

3.1.5 Summary of general fuzzing

3.2 Vulnerability-oriented fuzzing (RQ5)

3.2.1 Uncontrollable memory consumption & uncontrolled recursive

3.2.2 Integer overflow and array overflow

3.2.3 Memory vulnerability

3.2.4 Consistency error

3.2.5 Use-after-free

3.2.6 Algorithmic complexity vulnerability

3.2.7 Concurrency vulnerability

3.2.8 Side channel attack

3.2.9 Summary of vulnerability-oriented fuzzing

3.3 Combining fuzzing with other techniques (RQ6)

3.3.1 Symbolic execution

3.3.2 Parallel and integration

3.3.3 Instrumentation

3.3.4 Other techniques

3.3.5 Summary of fuzzing integration with other techniques

4 Fuzzing: different applications

4.1 SMT solver

4.2 Virtual machine monitor

4.3 Kernel

4.4 Smart contract

4.5 Protocol

4.6 Machine learning model

5 Conclusion and future direction

Data availability

Code availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation