A Systematic Survey on CAPTCHA Recognition: Types, Creation and Breaking Techniques

Kumar, Mohinder; Jindal, M. K.; Kumar, Munish

doi:10.1007/s11831-021-09608-4

A Systematic Survey on CAPTCHA Recognition: Types, Creation and Breaking Techniques

Survey article
Published: 14 June 2021

Volume 29, pages 1107–1136, (2022)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Archives of Computational Methods in Engineering Aims and scope Submit manuscript

A Systematic Survey on CAPTCHA Recognition: Types, Creation and Breaking Techniques

Download PDF

1743 Accesses
21 Citations
Explore all metrics

Abstract

CAPTCHA stands for Completely Automated Public Turing Test to Tell Computers and Human Apart. CAPTCHA is used for internet security. A few CAPTCHA schemes are available today like, text-based, audio-based, video/animation-based, puzzle based etc. In this paper, all these types are collaborating at single place to analyze. The main aim of this article is to present a literature to identify and recognize CAPTCHA, its types, the creation and breaking techniques. It is a systematic and complete analysis of all available CAPTCHA types. In this paper, 16 text-based CAPTCHA’s generation methods are discussed with usability and security ranges from 3 to 100 and 65 to 100%, respectively. The security and usability measures are not calculated/sustained using some known English schemes. Out of 16 reviewed CAPTCHAs, 12 are based on English language, 1 on Arabic language, 1 on Chinese language, 1 on Devanagari language and 1 on Gurumukhi script. The designs are made segment proof with overlapping random shapes, overlapping characters, clasping, different colors and different shades. For making recognition proof many techniques are used like image masking, local and global warping; broken characters, random rotation, arcs, jaws, etc. Approximately 50 schemes, especially based on the English language, are successfully broken with a success rate that ranges from 2 to 100%. The techniques that are used to break these schemes include shape context matching, distortion estimation, Log Gabor 2D filter, horizontal and vertical projection (for a segment the letters) are used. For recognition CNN, KNN, DNN and MCDNN are used. Almost 15 images-based CAPTCHAs are discussed that are designed with usability and security range 90–100 and 17–100%, respectively. Out of these 5 schemes are successfully broken with a success rate ranging between 7 and 100%. The K-NN and SVM are mostly used algorithms to recognize the images. Audio based CAPTCHAs (5 designs) are discussed with usability and security range from 68.5 to 100 and 100%, respectively. The broken rate of these audio schemes is also 45–75%. These schemes are broken with SVM and K-NN algorithms. The paper also discusses 4 popular video-based designs that provide usability and security that ranges from 75 to 100 and 98 to 100, respectively. These schemes are also compromised with broken rate 16–10% using SIFT, NN and simple OCR techniques. The paper can be a benchmark to precede any specific research to dive into any one of these types.

Two Novel Image-Based CAPTCHA Schemes Based on Visual Effects

Security and Understanding Techniques for Visual CAPTCHA Interpretation

Usability Comparison of Text CAPTCHAs Based on English and Chinese

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

In the present era, the internet is a major interaction for every person. It does not matter what is the age, profession, gender, and sector. The availability of the rich variety of mobile devices and cheaper high-speed data plans increased the interest of the users for the Internet usage. The variety of the content on the web is also multitalented that attracts everyone. Most of the data is also available at free of cost on the Internet that adds the number of users in compounded way. As the Internet is becoming the most popular platform to provide data services the number of websites and blogs are also increasing. Today the web sites are designed for financial services, public services, entertainment services, grocery products, healthcare services, transportation services, hotel bookings etc. But the knowledge of the user has been also not always good for these web sites. Internet Security is always the main challenge for the web developers from the beginning. The increasing number of users also demands the high-end processing units at the deployed web site, but these high-end units are useless if the high-end machine attacks these servers. Ahn et al. [3] highlighted an instance that happened at CMU when the students developed a program for submitting ballots in online polls in their schools’ favor. So, a program can be trained in such a way that can enter a website and makes the server so overloaded with requests that in turn results in crashing of the server. This program is well known as bot program. A few methods are developed by the researchers to stop such kind of attacks. But most of the methods are very expensive and demands a lot of efforts by the experts as well as like One Time Password etc.

A very effective and cheapest method is CAPTCHA. This Reverse Turing Test is also known as Completely Automated Public Turing Test to Tell Computers and Humans Apart or CAPTCHA in short. In such methods a Reverse Turing Test is given to the attacker and depending on the challenge passed, it is decided that the attacker is a human or a computer. Naor [48] explained the term Reverse Turing Test introduced in 1950, when a Turing Test was introduced to check with a human that the other side is introduced by a computer or human. In this paper we call it CAPTCHA from now to make it easy to write. The CAPTCHA is designed in such a way that is easily understandable, but very difficult for a computer program. Coates et al. [24] text-based CAPTCHA shown in Fig. 1a. This includes simple text to be recognized easily by a human being but not by a program due to some noise and distortion. In the last 20 years a lot of CAPTCHAs designs are proposed that includes a large variety of forms. In this paper, we will discuss all these types of designs. We will discuss the design features and the breaking methods of these CAPTCHA’s.

2 Security and Usability Metrics of CAPTCHA

A CAPTCHA must hold the sweet spot between solvability by computers and humans as depicted in the Fig. 2. A delicate balance must be maintained by a valid CAPTCHA challenge. It must not be so easy to break by a computer program and at the same time is must not be hard to break by a human. Although it is not an easy task to achieve this sweet spot as the history of CAPTCHA tells.

A CAPTCHA is assumed to be secure if its success rate is less than 0.0001%. It means that out of 10,000 challenges only not more than one should be broken by a computer program. The usability of a CAPTCHA should be more than 80% for users. It means that out of 100 challenges 80 times, a CAPTCHA should be easily identified by the users within a minimum time e.g. 3–5 s.

3 Motivation

CAPTCHAs schemes are of many types like text-based, image-based, audio-based, animation- based, puzzle-based, video-based, and even now invisible schemes are also introduced by Google. These all schemes are unique in one way. The classical text-based schemes use simple text as a challenge. But these letters are made segment proof and recognition proof. To make this CAPTCHA noise is added, distortion and warping are applied to the text. Sometimes letters are broken and even hidden. In the image-based CAPTCHA schemes, the challenges include images of things, persons, animals, etc. The user is asked to identify an image among the given images. In the animation-based CAPTCHA, the text is moving, and a user is asked to identify the text. In the video-based schemes the user is asked to identify the type of video based on the contents of the video. In puzzle-based schemes, the user is given a number puzzle or sometimes image-based puzzle. In the latest invisible CAPTCHA, the user is provided with a checkbox and the user just need to click in the checkbox to pass the challenge. In the previous literature, all these CAPTCHAs are never analyzed at a single platform. Only the English language text-based schemes are discussed in more details as compared to other schemes. It motivates us to present all the available CAPTCHAs in one platform and perform analysis on all these techniques. Many of the review articles pick one type of scheme most of the times. That does not provide the perfect review for the upcoming research. In this paper, an effort is made to include all the different types of CAPTCHAs under one tree. An effort has been made by analyzing all the text-based CAPTCHAs of different languages. All the image-based, animation-based, video-based, audio-based CAPTCHAs are discussed in detail from their generation to end, like what are the techniques used to create these schemes and what are the techniques to break these challenges? Also, the guidelines are provided to make these successfully broken schemes, stronger. In the following sections, the authors have tried to find answers to some of the very important research questions as shown in the Table 1.

Table 1 Research questions

A Systematic Survey on CAPTCHA Recognition: Types, Creation and Breaking Techniques

Abstract

Similar content being viewed by others

Two Novel Image-Based CAPTCHA Schemes Based on Visual Effects

Security and Understanding Techniques for Visual CAPTCHA Interpretation

Usability Comparison of Text CAPTCHAs Based on English and Chinese

Explore related subjects

1 Introduction

2 Security and Usability Metrics of CAPTCHA

3 Motivation

4 Source the Information

5 Types of CAPTCHAs

6 Applications of CAPTCHA

7 Reported Work on Text Based CAPTCHAs

7.1 Creation of Text Based CAPTCHA’s

7.2 Breaking of Text Based CAPTCHAs

8 Reported Work on Image Based CAPTCHAs

8.1 Creation of Image Based CAPTCHAs

8.2 Breaking Techniques of Image CAPTCHAs

9 Reported Work on Puzzle Based CAPTCHA

9.1 Creation of Puzzle Based CAPTCHAs

9.2 Breaking Techniques Puzzle Bases CAPTCHAs

10 Reported Work on Audio CAPTCHA

10.1 Creation of Audio Based CAPTCHAs

10.2 Breaking Techniques Audio CAPTCHA

11 Reported Work on Animation/Video CAPTCHAs

11.1 Creation of Animation/Video CAPTCHAs

11.2 Breaking Techniques of Animation/Video CAPTCHA

12 Breaking of Mouse CAPTCHA and Invisible CAPTCHA

13 Tools and Techniques for Security Testing of Various CAPTCHAS

14 Guidelines to Make a Strong CAPTCHA

15 Conclusions

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Research Involving Human Participants and/or Animals

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation