1 Introduction

Steganography aims to provide covert transmission of information. Its goal is to embed a message inside a carrier signal so that it cannot be detected by unintended receivers (Shih et al. 2011). Steganalysis is the technique of detecting the presence of concealed data (Das et al. 2011): it discovers hidden signals in suspected carriers or identifies the media that carry the hidden information. The primary problem in steganography is to define and apply a better identification methodology (Al-Kharobi et al. 2017). The workflow of steganography and steganalysis (Badr 2014) is best grasped through the picture depicted in Fig. 1.

Fig. 1

Diagram of the work flow of steganography and steganalysis

Although steganography can hide information in any digital medium, electronic photographs/images are the most common carriers because of their widespread use on the internet (Altaay et al. 2012). Since an image file is large, it can contain an enormous amount of information, and the human visual system cannot distinguish an image carrying secret information from the original picture. Furthermore, digital images contain a large number of redundant bits, which makes them the preferred cover objects (Pal et al. 2017). This work therefore uses images as the cover file. The standard format used for image steganography is the Joint Photographic Experts Group (JPEG) format, which uses lossy compression while preserving the visual quality of the picture (Liu et al. 2010).

Image steganography is commonly partitioned into the spatial domain and the transform domain (Kaur et al. 2014), as illustrated by the block diagram in Fig. 2.

Fig. 2

Classification diagram of image steganography

The two fundamental kinds of steganalysis are targeted and blind steganalysis. Targeted steganalysis is designed for a specific embedding algorithm; it can achieve a higher detection accuracy but is tied to that algorithm. Blind steganalysis, in contrast, is not tailored to any particular algorithm and thus removes that dependency. Moreover, blind steganalysis works on statistical properties of the image, hence it is also known as statistical steganalysis (Sabnis and Awale 2016). The main stages of steganalysis are feature selection, feature extraction and classification. Features that are pivotal to an image are selected, extracted and sent to the classifier. During feature extraction there will also be irrelevant features that may adversely affect the efficacy of the classifier; such features need to be removed by feature reduction (Jain and Singh 2018). In this research, principal component analysis is used for this purpose. Cross validation is a technique for validating a classifier to obtain a better estimate of its efficiency: the data is divided into k folds and classified, hence the name k-fold cross validation. Tenfold cross validation is used in this research. Supervised learning techniques have previously given good results. The classifiers used here are the support vector machine (SVM) and its variant optimised with particle swarm optimisation. SVM is chosen because it has been found to be very robust with high-dimensional inputs, and it is therefore assumed that its optimised variant may give a substantial result.

2 Related work

The effectiveness of steganalysis depends on how well cover and stego images are separated. With transformation and selection of the optimum number of DCT coefficients, data is embedded so that the images are not affected by visual attack (Zeng et al. 2017; Jiang et al. 2019). Transform domain approaches can be integrated to achieve better results with only nominal modifications of the cover image (Attaby et al. 2018). Steganalysis is likewise carried out in the spatial domain, where the embedding happens directly in the pixel intensities of the image (Tuithung et al. 2015). Rabee et al. (2018) suggested a novel way of effectively revealing the presence of a concealed message in a JPEG image. The discrete cosine transform (DCT) is generally incorporated in statistical steganalysis of the JPEG format, which helps reduce memory cost and computation time. Features that are statistically prominent in both the spatial and the transform domain are extracted, since features are the best descriptors of an image (Ker et al. 2013). Combining the spatial and transform domains has yielded better results in previous literature (Fridrich et al. 2012; Kodovsky et al. 2010). A large feature set implies high dimensionality, which can adversely influence the efficiency of the classifier. Previous literature (Cadima et al. 2016) states that principal component analysis (PCA) is well suited to reduce the dimension when a large amount of unrelated data is involved (Han et al. 2012; Lever et al. 2017). Cross validation is a machine learning technique used during classification to avoid overfitting, hence used as an optimal model-selection tool (Liu et al. 2019); it is widely used to assess the generalisability of an algorithm (Bergmeir et al. 2018). The classifiers then decide whether an image is stego or cover. SVM classifiers are among the most popular for classification (Farid et al. 2003), and their applications are diverse, since they can be applied to graphs, sequences and even relational data by designing the corresponding kernels (Ebrahimi et al. 2017). Particle swarm optimisation (PSO) is of great significance due to its flexibility and low computational cost (Liliya Demidova et al. 2016); PSO improves performance when combined with SVM (Garcia Nieto et al. 2016). Similar research has also been done with calibrated images (Shankar and Azhakath 2020). Different embedding percentages and an optimisation variant of the classifier have also been considered (Azhakath et al. 2019), and classification at low embedding percentages with SVM as the classifier has been studied (Shankar and Upadhyay 2020).

3 Problem statement

This research performs blind steganalysis for an embedding rate of 25%. The images used are in JPEG format and are transformed using the discrete cosine transform. Dimensionality reduction of the features is carried out using principal component analysis. The steganographic algorithms used for embedding are LSB replacement, LSB matching, pixel value differencing (PVD) and F5. SVM and SVM-PSO are the classifiers incorporated for the comparative study. Six different kernels and four different sampling methods are considered: the kernels are multiquadric, radial, dot, polynomial, Epanechnikov and ANOVA, and the sampling methods are linear, shuffled, stratified and automatic. The outline of the implementation is given in Fig. 3.

Fig. 3

Implementation block diagram

4 Methodology

This section describes the methodology of the research, which uses the JPEG image format because previous literature (Bedi et al. 2013) states that such images are simple to store and transmit over the internet. A low embedding percentage of 25 is used. The raw images are converted to the transform domain and the appropriate features are extracted. The image attributes are normalised to promote the effectiveness of the steganographic algorithm.

4.1 Dataset

The performance of any framework relies on the quality of the dataset used. This research considers a set of 2300 images from two standard datasets: 1500 images from the UCID image dataset (Schaefer et al. 2004) are used as the training set and 800 images from the INRIA image database (Jegou et al. 2008) are used as the test set. The images are transformed as needed and the features are selected, extracted and classified. The selection and extraction focus on features that are sensitive to changes introduced by embedding.
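A minimal sketch of this fixed train/test split is given below, assuming scikit-learn and placeholder arrays standing in for the actual UCID and INRIA feature vectors; the classifier settings are illustrative, not those reported here.

```python
import numpy as np
from sklearn.svm import SVC

# Placeholder feature matrices standing in for the extracted feature vectors
# of the 1500 UCID training images and the 800 INRIA test images.
X_train = np.random.rand(1500, 274)
y_train = np.random.randint(0, 2, 1500)   # 0 = cover, 1 = stego
X_test = np.random.rand(800, 274)
y_test = np.random.randint(0, 2, 800)

clf = SVC().fit(X_train, y_train)                     # train only on the UCID-derived set
print("test accuracy:", clf.score(X_test, y_test))    # evaluate on the INRIA-derived set
```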

4.2 Feature vector extraction

Four types of features, namely first-order features, second-order features, extended DCT features and Markov features, are considered for extraction. The functionalities of these features are shown in Table 1.

Table 1 Table of extracted features

The regular DCT features (Fridrich 2004) comprise 23 functionals, which can be expanded to obtain the extended DCT features, amounting to 193 functionals (Pevny et al. 2007). Another feature set used is the Markovian features. Their dimensionality is high, so the features are condensed to only 81 vital features using PCA. The DCT features capture inter-block dependencies whereas the Markov features capture intra-block dependencies. The DCT features are extracted and calculated by the following steps:

  • Calculate the difference of cover and stego images

  • Consider the absolute value

  • Find the L1 Norm

  • The result is the DCT feature.

However, some of the pertinent features required for the investigation would be missed during the DCT extraction. Therefore, some functionals with projected differences are applied to the DCT coefficients; these form the extended DCT features.
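A minimal sketch of the difference-based DCT functional described in the steps above, assuming NumPy; the function and argument names are illustrative, and the toy histograms stand in for any of the extracted functionals.

```python
import numpy as np

def dct_feature_distance(f_image, f_reference):
    """L1 norm of the absolute difference between the same functional
    evaluated on the suspect image and on its reference version,
    following the steps listed above. The argument names are
    illustrative and not taken from the paper's implementation."""
    diff = np.abs(np.asarray(f_image, dtype=float)
                  - np.asarray(f_reference, dtype=float))
    return diff.sum()   # L1 norm of the difference

# Toy global histograms of quantised DCT coefficients
h_img = np.array([120, 80, 40, 10], dtype=float)
h_ref = np.array([118, 83, 38, 11], dtype=float)
print(dct_feature_distance(h_img, h_ref))   # -> 8.0
```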

The Markovian features have been mined and it is computed as per the following steps:

  • Find the absolute values of adjacent DCT constants

  • Calculate the difference

The Markovian functionals themselves amount to 324 features. Applied as such, they would cause dimensionality issues, so they are reduced to four sets of dimensionality 81. Since the Markovian and DCT feature sets are combined for the reasons stated above, the resultant combined set carries just 274 features. A stego picture is characterised by its DCT coefficient array dp(i, j), where i and j index the coefficient within a block and p indexes the block (Fridrich et al. 2004). The global histogram is symbolised by Gr, r = P,…,Q, where P = minp,i,j dp(i,j) and Q = maxp,i,j dp(i,j). The dual histogram, which gives an impression of the dispersal of the values, is characterised by

$$ g_{ij}^{d} = \sum\limits_{p = 1}^{n} {\delta (d,d_{p} (i,j))} $$
(1)

where n is the total number of blocks, d is a fixed coefficient value and δ(u, v) equals 1 when u = v and 0 otherwise. The variance (Pevny et al. 2007; Shankar et al. 2011, 2012) can be denoted by

$$ V = \frac{\sum\limits_{i,j = 1}^{8} \sum\limits_{p = 1}^{|I_{r}| - 1} |d_{I_{r}(p)}(i,j) - d_{I_{r}(p + 1)}(i,j)| + \sum\limits_{i,j = 1}^{8} \sum\limits_{p = 1}^{|I_{c}| - 1} |d_{I_{c}(p)}(i,j) - d_{I_{c}(p + 1)}(i,j)|}{|I_{r}| + |I_{c}|} $$
(2)

where Ir and Ic are vectors of block indices when scanned by rows and columns. Blockiness can be signified as

$$ B_{\alpha } = \frac{\sum\limits_{i = 1}^{\lfloor (A - 1)/8 \rfloor} \sum\limits_{j = 1}^{B} |x_{8i,j} - x_{8i + 1,j}|^{\alpha } + \sum\limits_{j = 1}^{\lfloor (B - 1)/8 \rfloor} \sum\limits_{i = 1}^{A} |x_{i,8j} - x_{i,8j + 1}|^{\alpha }}{B\lfloor (A - 1)/8 \rfloor + A\lfloor (B - 1)/8 \rfloor} $$
(3)

where A and B are the dimensions of the image. The probability distribution of pairs of neighbouring DCT coefficients is known as the co-occurrence matrix, which is signified as

$$ C_{st} = \frac{\sum\limits_{p = 1}^{|I_{r}| - 1} \sum\limits_{i,j = 1}^{8} \delta (s,d_{I_{r}(p)}(i,j))\,\delta (t,d_{I_{r}(p + 1)}(i,j)) + \sum\limits_{p = 1}^{|I_{c}| - 1} \sum\limits_{i,j = 1}^{8} \delta (s,d_{I_{c}(p)}(i,j))\,\delta (t,d_{I_{c}(p + 1)}(i,j))}{|I_{r}| + |I_{c}|} $$
(4)

The Markov feature set models the differences between the absolute values of neighbouring DCT coefficients as a Markov process. Four difference arrays are calculated along four directions: horizontal, vertical and the two diagonals. From these, four transition probability matrices are calculated. The original Markovian features amount to 324, which increases the dimensionality; to reduce it, the average of the four 81-dimensional feature sets is taken.
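As a hedged sketch of one direction of this construction, the code below (NumPy) builds the horizontal transition probability matrix; the clipping threshold of 4, which yields the 9 × 9 = 81 entries per direction, and the helper name follow the usual Markov-feature formulation and are assumptions rather than the authors' implementation.

```python
import numpy as np

T = 4   # clipping threshold; (2T + 1)^2 = 81 entries per direction

def horizontal_markov_matrix(abs_dct):
    """Transition probability matrix of the horizontal difference array
    of absolute-valued DCT coefficients, clipped to [-T, T]."""
    diff = abs_dct[:, :-1] - abs_dct[:, 1:]           # horizontal differences
    diff = np.clip(diff, -T, T).astype(int)
    m = np.zeros((2 * T + 1, 2 * T + 1))
    src, dst = diff[:, :-1] + T, diff[:, 1:] + T      # consecutive difference pairs
    for s, t in zip(src.ravel(), dst.ravel()):
        m[s, t] += 1
    row_sums = m.sum(axis=1, keepdims=True)
    return np.divide(m, row_sums, out=np.zeros_like(m), where=row_sums > 0)

# The 81-dimensional Markov feature averages this matrix with the vertical
# and the two diagonal matrices computed in the same way.
example = np.abs(np.random.randn(64, 64)) * 3         # toy absolute DCT plane
print(horizontal_markov_matrix(example).shape)        # -> (9, 9)
```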

4.3 Cross validation

Generally, an image database is divided into a training set and a testing set by random assignment of images, which avoids bias. There is no requirement that the training set and the testing set be of equal size; in an actual scenario the training set is much smaller than the content available on the internet to be tested, which creates a strong variation in performance. Therefore the training and test evaluation is performed multiple times, which is known as k-fold cross validation. This method assesses the stability of the scheme by evaluating the statistical output of the detection scheme. The cross validation used in this study has a value of k = 10.
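A minimal sketch of the tenfold cross validation used here, assuming scikit-learn and placeholder feature data; the kernel choice is illustrative.

```python
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVC

# Toy stand-ins for the 274-dimensional feature vectors and their labels
X = np.random.rand(100, 274)
y = np.random.randint(0, 2, 100)              # 0 = cover, 1 = stego

clf = SVC(kernel='rbf', gamma='scale')
scores = cross_val_score(clf, X, y, cv=10)    # k = 10 folds
print("mean accuracy:", scores.mean(), "std:", scores.std())
```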

4.4 Classification

The classification phase follows the extraction of the features and decides whether a given picture is stego or cover. There are two learning strategies: supervised and unsupervised. In supervised learning, the input values are mapped to known output values and the training is monitored; in unsupervised learning, no such mapping to output values is available. In this study we use supervised learning and therefore employ the support vector machine (SVM) and the support vector machine with particle swarm optimisation (SVM-PSO).

4.4.1 Support vector machine

Given a set of training data, SVM determines an optimal hyperplane that clearly categorises the data. In two dimensions the separation is by a line; in higher dimensions it is by a hyperplane. Support vectors are the data points that lie closest to the hyperplane. These points are the most difficult to classify and are therefore the ones able to change the position of the hyperplane; the support vectors form a subset of the training dataset.

The hyperplane is chosen to give the largest minimum distance, called the margin, to the support vectors. If the separating hyperplane is too close to a sample, it becomes sensitive to noise and the classification will not be proper. Hence the hyperplane should be selected so that it is as far as possible from all the points while still separating the classes; such a hyperplane is called the optimal hyperplane.

Consider a hyperplane of the form

$$ w^{T} x + b $$
(5)

where w is the weight vector normal to the hyperplane and b is the bias.

Let yi = +1 or −1 be the class labels of the training dataset (Fletcher 2008). The separating hyperplane satisfies

$$ w^{T} x + b = 0 $$
(6)

The training data are correctly classified if the support vectors of the two classes lie on the planes H1 and H2, such that

$$ w^{T} x_{1} + b = 1\;{\text{for}}\;{\text{H1}} $$
(7)
$$ w^{T} x_{2} + b = - 1\;\;{\text{for H2}} $$
(8)

The margin needs to be equidistant from H1 and H2. To place the hyperplane as far as possible from the support vectors, the SVM margin needs to be maximised. The hyperplane can be represented in many equivalent ways by rescaling w and b. The distance between a point x and the hyperplane (w, b) is

$$ {\text{Distance }} = \frac{{|w^{T} x + b|}}{||w||} $$
(9)

For a canonical hyperplane the numerator is 1, hence the distance is

$$ {\text{Distance }} = \frac{1}{||w||} $$
(10)

Since the margin is twice the distance to the closest support vectors, the margin M can be denoted as

$$ {\text{M }} = \frac{2}{||w||} $$
(11)

The margin M is maximised (equivalently, ||w|| is minimised) subject to the constraints

yi (xi·w + b) − 1 ≥ 0 for all i.
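As a hedged illustration of this margin maximisation, the sketch below fits a soft-margin linear SVM with scikit-learn on toy data; the regularisation value C = 1.0 and the data are assumptions, not settings reported in this paper.

```python
import numpy as np
from sklearn.svm import SVC

# Toy linearly separable two-class data standing in for cover/stego features
X = np.vstack([np.random.randn(50, 2) + 2, np.random.randn(50, 2) - 2])
y = np.hstack([np.ones(50), -np.ones(50)])

clf = SVC(kernel='linear', C=1.0).fit(X, y)    # solves the margin maximisation
w, b = clf.coef_[0], clf.intercept_[0]
print("margin 2/||w|| =", 2 / np.linalg.norm(w))
print("number of support vectors:", len(clf.support_vectors_))
```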

4.4.2 Support vector machine with particle swarm optimisation

If a machine learning model is to be developed from a collection of data, the data needs to be divided into a training set and a test set. The model is taught through the training set, which then helps to validate it on the test data (Margaritis et al. 2018). Usually 80% of the data is held as the training set and the remaining 20% is used as test data. The images are categorised into distinct groups according to their features (Hou et al. 2017).

The particle swarm optimisation (PSO) algorithm is a population-based search algorithm inspired by the simulation of bird flocking. Like other evolutionary computing algorithms, PSO uses a model of personal information exchange (Eberhart et al. 2001). In SVM-PSO, the suggested solution evolves with each iteration and thus moves towards the ideal one: in each iteration a fresh population is obtained by updating the positions of the previous iteration. PSO initialises the system with a population of candidate solutions and searches for the optimum, with the particles themselves acting as solutions (Huang and Dun 2008; Du et al. 2017). In PSO the bird cluster, called the swarm, forms a population of particles in a D-dimensional feature space. The vector Xi = (xi1, xi2, xi3,…, xiD), i = 1, 2,…, m, represents the position of the ith particle, which acts as a candidate solution. The velocity and the position are updated at each iteration according to

$$ v_{id}^{t + 1} = \omega \,v_{id}^{t} + c_{1} r_{1} (p_{id} - x_{id}^{t} ) + c_{2} r_{2} (p_{gd} - x_{id}^{t} ) $$
(12)
$$ x_{id}^{t + 1} = x_{id}^{t} + v_{id}^{t + 1} $$
(13)

where Vi = (vi1, vi2, vi3,…, viD) is the velocity of the ith particle and Pi = (pi1, pi2, pi3,…, piD) is the best position found by that particle. The best position of the whole swarm is Pg = (pg1, pg2, pg3,…, pgD). At the tth iteration, xtid and vtid are the dth components of the position and velocity of the ith particle. ω is the inertia weight of the PSO algorithm, c1 and c2 are acceleration coefficients, and r1 and r2 are random numbers ranging from 0 to 1. The PSO algorithm helps to optimise the parameters, thereby improving efficiency when paired with SVM.
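The following is a minimal sketch of Eqs. (12) and (13) used to tune the SVM parameters C and gamma, assuming NumPy and scikit-learn; the swarm size, iteration count, inertia weight, acceleration coefficients, search range and placeholder data are illustrative values, not the settings used in this research.

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.model_selection import cross_val_score

# Placeholder feature vectors and labels (0 = cover, 1 = stego)
X = np.random.rand(80, 274)
y = np.random.randint(0, 2, 80)

def fitness(pos):
    # Particle position encodes log10(C) and log10(gamma)
    C, gamma = 10 ** pos[0], 10 ** pos[1]
    return cross_val_score(SVC(C=C, gamma=gamma), X, y, cv=3).mean()

rng = np.random.default_rng(0)
n_particles, dim = 10, 2
w, c1, c2 = 0.7, 1.5, 1.5                       # illustrative inertia and acceleration values
x = rng.uniform(-3, 3, (n_particles, dim))      # particle positions
v = np.zeros((n_particles, dim))                # particle velocities
p_best = x.copy()                               # personal best positions
p_val = np.array([fitness(p) for p in x])       # personal best fitness values
g_best = p_best[p_val.argmax()].copy()          # global best position

for _ in range(20):
    r1, r2 = rng.random((n_particles, dim)), rng.random((n_particles, dim))
    v = w * v + c1 * r1 * (p_best - x) + c2 * r2 * (g_best - x)   # Eq. (12)
    x = np.clip(x + v, -3, 3)                                     # Eq. (13), kept in range
    vals = np.array([fitness(p) for p in x])
    improved = vals > p_val
    p_best[improved], p_val[improved] = x[improved], vals[improved]
    g_best = p_best[p_val.argmax()].copy()

print("best C and gamma:", 10 ** g_best)
```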

4.5 Principal component analysis

The notion of principal component analysis (PCA) is used to reduce the dimensionality (He et al. 2013). The number of principal components obtained is at most equal to the number of original components. PCA works well with normalised data (Miranda et al. 2008) and is implemented as follows. The dataset is first normalised by subtracting from each value the mean of its column, creating a dataset with zero mean. The image is pixel based; after transformation, the matrix is arranged in terms of frequency (Bao et al. 2019). Since the data is multidimensional, the covariance is also multidimensional.

Consider, for example, data with two variables x1 and x2; this results in a 2 × 2 covariance matrix.

$$ \begin{aligned} & {\text{Covariance}} = \begin{bmatrix} {\text{var}}[x_{1}] & {\text{cov}}[x_{1},x_{2}] \\ {\text{cov}}[x_{2},x_{1}] & {\text{var}}[x_{2}] \end{bmatrix} \\ & {\text{var}}[x_{1}] = {\text{cov}}[x_{1},x_{1}]\quad {\text{and}}\quad {\text{var}}[x_{2}] = {\text{cov}}[x_{2},x_{2}] \end{aligned} $$
(14)

Once the covariance matrix is calculated, the eigenvalues and eigenvectors need to be found. λ is an eigenvalue of a matrix A if det(λI − A) = 0, where I is the identity matrix of the same dimension as A. For each eigenvalue λ, the corresponding eigenvector v can be calculated using

$$ (\lambda {\text{I}} - {\text{A}}){\text{ v }} = \, 0 $$
(15)

Once the eigenvalues are calculated, they are arranged in descending order so that the most significant components come first; the eigenvector with the highest eigenvalue is the first principal component of the dataset. To reduce the dimension, the first few eigenvalues are kept and the rest are ignored; if the ignored eigenvalues are small, little information is lost. A feature vector is thus created from the chosen eigenvectors. A matrix of principal components is obtained by multiplying the transpose of the chosen eigenvectors with the transpose of the mean-centred (scaled) original data.

$$ {\text{Final result}} = \left( {\text{feature vector}} \right)^{\text{T}} \times \left( {\text{scaled original data}} \right)^{\text{T}} $$

The final data would form the principal component.
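A minimal sketch of these PCA steps with NumPy; the placeholder data and the choice of 81 retained components are for illustration only.

```python
import numpy as np

X = np.random.rand(100, 274)              # stand-in for the extracted feature matrix
Xc = X - X.mean(axis=0)                   # normalise: subtract each column mean

cov = np.cov(Xc, rowvar=False)            # covariance matrix
eig_val, eig_vec = np.linalg.eigh(cov)    # eigenvalues/eigenvectors of a symmetric matrix

order = np.argsort(eig_val)[::-1]         # arrange eigenvalues in descending order
k = 81                                    # number of components kept (illustrative)
W = eig_vec[:, order[:k]]                 # feature vector of the chosen eigenvectors

principal = (W.T @ Xc.T).T                # final result = (feature vector)^T x (scaled data)^T
print(principal.shape)                    # -> (100, 81)
```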

4.6 Kernels

Kernels are used to perform classification in high-dimensional feature spaces. This paper uses six kernel types: multiquadric, radial, dot, polynomial, Epanechnikov and ANOVA. The radial basis function kernel is given in Eq. (16).

$$ {\text{k}}\left( {{\text{a}},{\text{b}}} \right) \, = {\exp}\left( { - {\text{g}}\left| {\left| {{\text{a}} - {\text{b}}} \right|} \right|^{{2}} } \right) $$
(16)

where g is the gamma parameter of the kernel. A larger value of g produces a larger variance, whereas a smaller value produces a smoother decision boundary with lower variance.

The polynomial kernel is denoted mathematically by

$$ {\text{k}}\left( {{\text{a}},{\text{b}}} \right) = \left( {{\text{a}}*{\text{b}} + {1}} \right)^{{\text{p}}} $$
(17)

where the exponent p is the polynomial degree.

The dot kernel is described as

$$ {\text{k}}\left( {{\text{a}},{\text{b}}} \right) = {\text{a}}*{\text{b}} $$
(18)

The dot kernel is the inner product of the variables a and b.

The multiquadric kernel is defined by

$$ {\text{k}}\left( {{\text{a}},{\text{b}}} \right) = \left( {\left| {\left| {{\text{a}} - {\text{b}}} \right|} \right|^{2} + {\text{c}}^{2} } \right)^{0.5} $$
(19)

where c is a constant.

The ANOVA kernel, whose performance is prominent in multidimensional problems, is defined as

$$ k(a,b) = \sum\limits_{k = 1}^{n} {\exp \left( { - \sigma \left( {a^{k} - b^{k} } \right)^{2} } \right)} $$
(20)

where σ can be derived from the gamma parameter g, with g = 1/(2σ²).

The Epanechnikov kernel, which is parabolic, is defined with the following equation,

$$ k(u) = \frac{3}{4}(1 - u^{2} )\;\;for\,|u|\; \le 1 $$
(21)
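Kernels that are not built into a given SVM implementation can be supplied as callables. The sketch below, assuming scikit-learn and an illustrative value of σ, implements the ANOVA kernel of Eq. (20) and passes it to an SVC; this is one possible realisation, not necessarily the tool chain used in this research.

```python
import numpy as np
from sklearn.svm import SVC

def anova_kernel(A, B, sigma=1.0):
    """ANOVA kernel of Eq. (20): k(a, b) = sum_k exp(-sigma * (a_k - b_k)^2),
    computed for every pair of rows of A and B. The sigma value is illustrative."""
    diff = A[:, None, :] - B[None, :, :]
    return np.sum(np.exp(-sigma * diff ** 2), axis=2)

X = np.random.rand(60, 274)                 # stand-in feature vectors
y = np.random.randint(0, 2, 60)             # 0 = cover, 1 = stego

clf = SVC(kernel=anova_kernel).fit(X, y)    # custom kernel supplied as a callable
print(clf.predict(X[:5]))
```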

5 Results of experimentation

5.1 Results with no cross-validation

The following tables show the results obtained without cross validation.

The details of SVM and PCA on LSB replacement are shown in Table 2.

Table 2 Details with SVM and PCA on LSB replacement

As per Table 2, the radial and Epanechnikov kernels give low results with all sampling methods for LSB replacement in the spatial domain. A better classification result is given by the dot kernel with the stratified sampling method.

The details of SVM and PCA on LSB matching are shown in Table 3.

Table 3 Details with SVM and PCA on LSB matching

In Table 3, all kernels give closely similar classification rates with the linear sampling method.

The radial and Epanechnikov kernels give low classification results. However, the dot kernel with the stratified and automatic sampling methods gives a better classification rate.

The details of SVM and PCA on PVD are shown in Table 4.

Table 4 Details with SVM and PCA on PVD

As in Tables 2 and 3, the radial and Epanechnikov kernels give a comparatively low classification rate, but the dot kernel maintains a good classification rate when the stratified sampling method is applied.

The details of SVM and PCA on F5 are shown in Table 5.

Table 5 Details with SVM and PCA on F5

As per the table, the radial and Epanechnikov kernels give the same low classification rate over the various sampling methods, and lower rates are displayed by the dot and multiquadric kernels with shuffled sampling. The dot kernel gives better rates with linear sampling. However, the best classification rates are shown by ANOVA with the stratified sampling method.

The details of SVM-PSO and PCA on LSB replacement are shown in Table 6.

Table 6 Details with SVM-PSO and PCA on LSB replacement

As per the table, the radial kernel gives a low classification rate with linear sampling but a fairly better result with stratified sampling. The Epanechnikov kernel gives a better classification with linear sampling, and the dot kernel gives a good classification rate overall.

The details of SVM-PSO and PCA on LSB matching are shown in Table 7.

Table 7 Details with SVM-PSO and PCA on LSB matching

As per the table, the best classification rate is achieved by the multiquadric kernel with the linear sampling method, followed by the polynomial kernel with shuffled sampling. The radial and Epanechnikov kernels give a low classification percentage.

The details of SVM-PSO and PCA on PVD are shown in Table 8.

Table 8 Details with SVM-PSO and PCA on PVD

As the table suggests, the multiquadric kernel with linear sampling gives a good classification rate, followed by the polynomial kernel with shuffled sampling. The radial kernel gives a lower classification percentage with the shuffled and stratified sampling methods. The least classification percentage is demonstrated by the dot kernel with linear sampling.

The details of SVM-PSO and PCA on F5 are shown in Table 9.

Table 9 Details with SVM-PSO and PCA on F5

As per the table, the dot kernel gives a good classification rate across all the sampling methods. However, the ANOVA kernel gives a better rate than the dot kernel for shuffled, stratified and automatic sampling. The lowest classification is obtained with the radial kernel on linear sampling.

5.2 Results with cross-validation

Tables 10, 11, 12, 13, 14, 15, 16 and 17 give the details with cross validation, SVM or SVM-PSO, and PCA. Table 10 provides the results on LSB replacement.

Table 10 Details with cross validation, SVM and PCA on LSB replacement
Table 11 Details with cross validation, SVM and PCA on LSB matching
Table 12 Details with cross validation, SVM and PCA on F5
Table 13 Details with cross validation, SVM and PCA on PVD
Table 14 Details with cross validation, SVM-PSO and PCA on LSB replacement
Table 15 Details with cross validation, SVM-PSO and PCA on LSB matching
Table 16 Details with cross validation, SVM-PSO and PCA on PVD
Table 17 Details with cross validation, SVM-PSO and PCA on F5

After cross validation the classification percentages have risen, and the dot kernel gives a decent outcome with stratified sampling, followed by the ANOVA kernel with shuffled sampling. The lowest classification is now given by the radial kernel with the linear sampling method.

Table 11 gives the details with cross validation, SVM and PCA on LSB Matching.

The dot kernel with the shuffled, stratified and automatic sampling methods gives a good classification rate, followed by the polynomial kernel. However, the radial, multiquadric and Epanechnikov kernels give a very low classification rate.

Table 12 gives the details with cross validation, SVM and PCA on F5.

As per the table, the dot and polynomial kernels give good results across all the sampling methods, and better results are given by ANOVA. Linear sampling gives a very low classification rate for the radial, multiquadric and Epanechnikov kernels.

Table 13 provides the details with cross validation, SVM and PCA on PVD.

The classification rate is good with the ANOVA kernel and stratified sampling. The multiquadric kernel with stratified sampling gives the next best classification rate.

Table 14 provides the details of cross validation SVM-PSO and PCA on LSB replacement.

The highest classification rate is given by the dot kernel with stratified and automatic sampling. The next highest classification percentage is exhibited by the dot kernel with shuffled sampling, followed by ANOVA with a classification rate of 83.84%.

Table 15 gives the results of cross-validation SVM-PSO and PCA on LSB matching.

The dot kernel and the ANOVA kernel give good results on par with the other kernels.

Table 16 highlights the results of cross validation, SVM-PSO and PCA on PVD.

The ANOVA kernel gives the best classification rate with the shuffled, stratified and automatic sampling methods. The next best classification is given by the multiquadric kernel with the linear, shuffled, stratified and automatic sampling methods.

Table 17 list the results of cross validation, SVM-PSO and PCA on F5.

The table gives overall better results than the previous tables. The ANOVA results are exemplary with shuffled and stratified sampling, and the dot kernel with stratified and automatic sampling follows ANOVA with better results than before.

6 Conclusions

A feature-based steganalysis has been performed using DCT, extended DCT and Markovian features. The impact of the features has been studied and redundant features eliminated using PCA. Cross validation is employed because of the real-time applicability of the research, and a comparative study is made against the results obtained without cross validation. The extracted features are fed into two different classifiers, SVM and SVM-PSO. The majority of the results indicate that the radial kernel does not perform well with these features across the different sampling methods. A good classification rate is generally produced by the dot kernel for the spatial domain embedding algorithms, whereas for the DCT (transform) domain ANOVA generally gives a good result. Hence the research shows that the radial kernel with linear sampling, which is commonly used for classification, gives a low classification rate here. With the optimisation of the SVM, the removal of redundant data and cross validation, the results improved.