
1 Introduction

Image splicing is a common and simple tampering technique that creates a composite image by cropping regions from the same or different images and pasting them together without postprocessing. Even without any postprocessing, spliced images can be so eye-deceiving that they are scarcely distinguishable from authentic ones. In addition, malicious image splicing may mislead the public and persuade them to believe something that never existed.

Recently, many techniques have been developed to reveal image splicing. Ng et al. [1] proposed features based on third-order moment spectra (i.e. bicoherence) for splicing detection, claiming that bicoherence is sensitive to the quadratic phase coupling (QPC) caused by splicing discontinuities; a detection accuracy of 72% was achieved over the image dataset [2]. Johnson and Farid [3] developed a method to determine whether an image has been tampered with, under the assumption that the original and tampered parts were taken under the same or approximately similar lighting conditions. In [4], Fu et al. generated features from the Hilbert-Huang transform (HHT) and moments of the characteristic functions of wavelet sub-bands, and reported a detection accuracy of 80.15%. Chen et al. [5] utilized 2-D phase congruency and statistical moments of wavelet characteristic functions to capture splicing artifacts, achieving a detection accuracy of 82.32% over the image dataset [2]. In [6], Shi et al. proposed two types of statistical features, derived from moments of characteristic functions of wavelet subbands and from Markov transition probabilities of difference 2-D arrays, which outperformed the prior art in image splicing detection with detection rates of 86.82% and 88.31%, respectively.

The methods mentioned above have achieved promising detection results. In this chapter, image splicing detection is considered from a different perspective. The LBP operator is used to model the magnitude components of the 2-D arrays obtained by applying multi-size block discrete cosine transform (MBDCT) to the test images, and all bins of the histograms computed from the LBP codes serve as discriminative features for image splicing detection. Principal component analysis (PCA) is then used to reduce the dimensionality of the proposed features. It is expected that the proposed method can effectively detect the traces introduced by splicing.

The rest of this chapter is organized as follows. The proposed method is described in Sect. 19.2. The experimental results are reported in Sect. 19.3. Finally, conclusions are drawn in Sect. 19.4.

2 Proposed Method

In this section, a concrete feature extraction procedure for image splicing detection is proposed, as shown in Fig. 19.1. The details are as follows.

Fig. 19.1 Feature extraction procedure

2.1 Preprocessing

From a signal processing viewpoint, image splicing detection can be considered a problem of detecting a weak signal (the splicing artifacts) against a strong background signal (the image content). To reduce the effects caused by the diversity of image content and to enhance the splicing artifacts, it is necessary to preprocess images before feature extraction. The block discrete cosine transform (BDCT) is widely used in popular image and video compression schemes such as JPEG and H.264 owing to its good decorrelation and energy compaction properties. In order to capture the splicing artifacts caused by different possible splicing operations, different test images, and different pasted image fragments, we first preprocess the test images with the multi-size block discrete cosine transform (MBDCT), which has been shown to be effective for image splicing detection [6]; the resulting BDCT coefficient 2-D arrays are then used for feature extraction.

The b × b BDCT of an image can be divided into the following steps:

  1. Split the given image into non-overlapping b × b blocks.

  2. Perform a 2-D DCT on each b × b block independently. The DCT coefficient 2-D array Y for a b × b image block X is given by

     $$ Y = C^T X C $$
     (19.1)

     where

     $$ C(k,l) = \left\{ \begin{array}{ll} \dfrac{1}{\sqrt{b}}, & 0 \leq k \leq b - 1,\ l = 0 \\[1.5ex] \sqrt{\dfrac{2}{b}}\,\cos\!\left(\dfrac{\pi (2k + 1) l}{2b}\right), & 0 \leq k \leq b - 1,\ 1 \leq l \leq b - 1 \end{array} \right. $$
     (19.2)

  3. Combine all the b × b DCT coefficient 2-D arrays of the given image into a single BDCT coefficient 2-D array.

Based on the experimental dataset [2], we empirically choose block sizes of 4 × 4, 8 × 8, and 16 × 16 to perform the BDCT as a compromise between detection performance and computational complexity when preprocessing the test images.
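As a rough illustration of the steps above, the following Python sketch builds the DCT matrix C of (19.2), applies (19.1) blockwise, and repeats the procedure for each block size. The function names (dct_matrix, block_dct, mbdct) are illustrative only and are not part of the chapter's implementation.

```python
import numpy as np

def dct_matrix(b):
    """DCT matrix C of Eq. (19.2) for block size b."""
    C = np.zeros((b, b))
    for k in range(b):
        for l in range(b):
            if l == 0:
                C[k, l] = 1.0 / np.sqrt(b)
            else:
                C[k, l] = np.sqrt(2.0 / b) * np.cos(np.pi * (2 * k + 1) * l / (2 * b))
    return C

def block_dct(image, b):
    """Split the image into non-overlapping b x b blocks, apply Y = C^T X C to
    each block (Eq. 19.1), and reassemble the results into one coefficient array."""
    h, w = image.shape
    h, w = h - h % b, w - w % b              # drop any incomplete border blocks
    C = dct_matrix(b)
    coeffs = np.zeros((h, w))
    for i in range(0, h, b):
        for j in range(0, w, b):
            X = image[i:i + b, j:j + b].astype(np.float64)
            coeffs[i:i + b, j:j + b] = C.T @ X @ C
    return coeffs

def mbdct(image, block_sizes=(4, 8, 16)):
    """One BDCT coefficient 2-D array per block size (the MBDCT preprocessing)."""
    return [block_dct(image, b) for b in block_sizes]
```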

2.2 Feature Extraction

2.2.1 Brief Review of LBP

LBP [7] is a powerful texture classification method. As shown in Fig. 19.2, given a central pixel \( g_c \), \( g_p \) (p = 0, 1, …, P−1) denotes the values of its neighbors, where P is the total number of involved neighbors and R is the radius of the neighborhood. If the coordinates of \( g_c \) are \( (x_c, y_c) \), then the coordinates of \( g_p \) are \( (x_c + R\cos(2\pi p/P),\ y_c - R\sin(2\pi p/P)) \). The gray values of neighbors that do not fall exactly on pixels are estimated by bilinear interpolation. For the central pixel \( g_c \), the LBP coding strategy can be formulated as

Fig. 19.2 Circularly symmetric set of P neighbors with radius R

$$ LBP_{P,R} = \sum\limits_{p = 0}^{P - 1} s(g_p - g_c)\, 2^p $$
(19.3)

where

$$ s(x) = \left\{ \begin{array}{ll} 1, & x \geq 0 \\ 0, & x < 0 \end{array} \right. $$
(19.4)

After the LBP codes of all pixels of a gray-level image are computed, a histogram is built as a texture descriptor, which characterizes important information about the spatial structure of the image texture.
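A minimal sketch of (19.3)–(19.4) for an arbitrary 2-D array is given below; neighbors that do not fall on integer positions are obtained by bilinear interpolation, and the helper names (bilinear, lbp_code, lbp_histogram) are illustrative only, not taken from the chapter.

```python
import numpy as np

def bilinear(arr, y, x):
    """Bilinearly interpolated value of a 2-D array at real coordinates (y, x)."""
    y0 = min(int(np.floor(y)), arr.shape[0] - 2)
    x0 = min(int(np.floor(x)), arr.shape[1] - 2)
    dy, dx = y - y0, x - x0
    return ((1 - dy) * (1 - dx) * arr[y0, x0] + (1 - dy) * dx * arr[y0, x0 + 1]
            + dy * (1 - dx) * arr[y0 + 1, x0] + dy * dx * arr[y0 + 1, x0 + 1])

def lbp_code(arr, r, c, P=8, R=1.0):
    """LBP_{P,R} code of Eq. (19.3) for the element at (r, c)."""
    gc = arr[r, c]
    code = 0
    for p in range(P):
        gp = bilinear(arr, r - R * np.sin(2 * np.pi * p / P),
                           c + R * np.cos(2 * np.pi * p / P))
        if gp - gc >= 0:                      # s(x) of Eq. (19.4)
            code |= 1 << p
    return code

def lbp_histogram(arr, P=8, R=1.0):
    """Normalized 2^P-bin histogram of LBP codes over the interior of the array."""
    h, w = arr.shape
    m = int(np.ceil(R))
    hist = np.zeros(2 ** P)
    for r in range(m, h - m):
        for c in range(m, w - m):
            hist[lbp_code(arr, r, c, P, R)] += 1
    return hist / hist.sum()
```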

Furthermore, the U value of an LBP pattern is defined as the number of spatial transitions (bitwise 0/1 changes) in the pattern; it can be formulated as

$$ U(LBP_{P,R}) = \left| s(g_{P-1} - g_c) - s(g_0 - g_c) \right| + \sum\limits_{p = 1}^{P - 1} \left| s(g_p - g_c) - s(g_{p-1} - g_c) \right| $$
(19.5)

The uniform LBP patterns are those with a U value of at most 2, while the remaining patterns are all assigned to a single non-uniform class. Therefore, the number of bins in a histogram computed from LBP codes can be reduced from \( 2^P \) to P(P−1) + 3 by means of this uniformity mapping; the resulting LBP descriptor is denoted \( LBP_{P,R}^{u2 } \). The mapping can be implemented with a lookup table of \( 2^P \) elements. In this chapter, P = 8 and R = 1 are investigated for image splicing detection. Consequently, a histogram computed from \( LB{P_{8,1 }} \) has 256 bins while one computed from \( LBP_{8,1}^{u2 } \) has 59 bins.
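The uniformity mapping can be sketched as a lookup table over all \( 2^P \) codes, as below; the names u_value and u2_lookup_table are illustrative. For P = 8 the table maps the 256 codes onto 8 × 7 + 3 = 59 bins.

```python
import numpy as np

def u_value(code, P=8):
    """U value of Eq. (19.5): number of circular bitwise 0/1 transitions."""
    bits = [(code >> p) & 1 for p in range(P)]
    return sum(bits[p] != bits[(p - 1) % P] for p in range(P))

def u2_lookup_table(P=8):
    """Map each of the 2^P LBP codes to a bin: one bin per uniform pattern
    (U <= 2) plus a single shared bin for all non-uniform patterns."""
    table = np.zeros(2 ** P, dtype=int)
    next_bin = 0
    for code in range(2 ** P):
        if u_value(code, P) <= 2:
            table[code] = next_bin
            next_bin += 1
        else:
            table[code] = -1
    table[table == -1] = next_bin             # shared non-uniform bin
    return table                              # next_bin + 1 == P*(P-1) + 3 bins

print(len(set(u2_lookup_table(P=8))))         # 59
```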

2.2.2 Capturing the Splicing Artifacts Using Local Binary Patterns of DCT Coefficients

In a spliced image without any postprocessing, sharp splicing edges are exposed; the key to image splicing detection is therefore how to capture these splicing artifacts. The splicing manipulation changes the local frequency distribution of the host image, and BDCT coefficients can reflect these changes to a certain degree. The essence of the LBP technique is that each element of a given 2-D array is compared with its neighboring elements and then binarized; hence, LBP coding records the occurrences of various local patterns. LBP can therefore be employed to model the magnitude components of the 2-D arrays obtained by applying MBDCT to the test images. It is expected that the LBP operator can effectively reflect the changes in the local frequency distribution of the host image.

In order to capture the artifacts caused by image splicing more sensitively and to obtain more discriminative information between authentic and spliced images, (19.4) can be redefined as

$$ s(x) = \left\{ \begin{array}{ll} 1, & x \geq \sigma \\ 0, & x < \sigma \end{array} \right. $$
(19.6)

Based on the experimental dataset [2], \( \sigma \) is set to 0.9 in the proposed method; the details are given in Sect. 19.3.2. When computing \( LB{P_{8,1 }} \) or \( LBP_{8,1}^{u2 } \) features, only block sizes of 4 × 4, 8 × 8, and 16 × 16 are used to generate the MBDCT coefficient 2-D arrays. Therefore, we have 256 × 3 = 768 \( LB{P_{8,1 }} \) features and 59 × 3 = 177 \( LBP_{8,1}^{u2 } \) features for each test image in this specific implementation.
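Putting the pieces together, the following sketch extracts the 768-dimensional \( LB{P_{8,1 }} \) feature vector of one test image under the thresholded rule (19.6). It assumes the mbdct and bilinear helpers from the earlier sketches are in scope; the names lbp_code_thresholded and splicing_features are illustrative only.

```python
import numpy as np

def lbp_code_thresholded(arr, r, c, sigma=0.9, P=8, R=1.0):
    """LBP code using the comparison function of Eq. (19.6) instead of Eq. (19.4)."""
    gc = arr[r, c]
    code = 0
    for p in range(P):
        gp = bilinear(arr, r - R * np.sin(2 * np.pi * p / P),    # helper from the
                           c + R * np.cos(2 * np.pi * p / P))    # earlier sketch
        if gp - gc >= sigma:
            code |= 1 << p
    return code

def splicing_features(image, sigma=0.9, block_sizes=(4, 8, 16)):
    """256 x 3 = 768-dimensional LBP_{8,1} feature vector for one test image."""
    feats = []
    for coeffs in mbdct(image, block_sizes):   # mbdct() from the earlier sketch
        mag = np.abs(coeffs)                   # magnitude components of the BDCT array
        h, w = mag.shape
        hist = np.zeros(256)
        for r in range(1, h - 1):
            for c in range(1, w - 1):
                hist[lbp_code_thresholded(mag, r, c, sigma)] += 1
        feats.append(hist / hist.sum())
    return np.concatenate(feats)
```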

3 Experiments and Results

The Columbia Image Splicing Detection Evaluation Dataset [2] is used to evaluate the efficiency of the proposed method. The dataset contains 933 authentic and 912 spliced images, all in BMP format with a fixed size of 128 × 128 pixels. LIBSVM [8] is used as the classifier, with the RBF kernel. In each experiment, 5/6 of the authentic images and 5/6 of the spliced images are randomly selected to train the SVM classifier, and the remaining 1/6 of the authentic images and 1/6 of the spliced images are used to test it. The optimal parameters of the RBF kernel are obtained by cross-validation and grid search. This procedure is repeated 100 times to eliminate the effect of the random selection of training and testing images. Experimental results are evaluated by the average true positive rate (TPR), average true negative rate (TNR), and average detection accuracy over the 100 random runs.
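The evaluation protocol can be sketched as follows using scikit-learn's SVC (a wrapper around LIBSVM) and GridSearchCV in place of the LIBSVM tools. The 5/6 versus 1/6 split follows the chapter, whereas the particular parameter grid and the labeling convention (1 for spliced, 0 for authentic) are assumptions for illustration.

```python
import numpy as np
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.svm import SVC

def run_once(X, y, seed):
    """One random 5/6 train, 1/6 test split with a grid-searched RBF-SVM."""
    X_tr, X_te, y_tr, y_te = train_test_split(
        X, y, test_size=1 / 6, stratify=y, random_state=seed)
    grid = GridSearchCV(
        SVC(kernel='rbf'),
        {'C': 2.0 ** np.arange(-5, 16, 2), 'gamma': 2.0 ** np.arange(-15, 4, 2)},
        cv=5)
    grid.fit(X_tr, y_tr)
    pred = grid.predict(X_te)
    tpr = np.mean(pred[y_te == 1] == 1)        # spliced images detected as spliced
    tnr = np.mean(pred[y_te == 0] == 0)        # authentic images detected as authentic
    acc = np.mean(pred == y_te)
    return tpr, tnr, acc

# X: numpy feature matrix, y: numpy label vector (1 = spliced, 0 = authentic)
# results = np.array([run_once(X, y, seed) for seed in range(100)])
# print(results.mean(axis=0), results.std(axis=0))
```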

3.1 Experimental Results

To evaluate the effectiveness of the proposed method, a series of experiments on the Columbia Image Splicing Detection Evaluation Dataset is carried out, with \( \sigma = 0.9 \). Detection results of the \( LB{P_{8,1 }} \) and \( LBP_{8,1}^{u2 } \) features are shown in Table 19.1. As can be seen, the \( LB{P_{8,1 }} \) features perform better than the \( LBP_{8,1}^{u2 } \) features even though they are of higher dimensionality. To avoid high computational complexity and possible overfitting of the SVM classifier, PCA [9] is used to reduce the dimensionality of the proposed features. PCA applies a linear transformation that maps a high-dimensional input vector to a low-dimensional one whose components are uncorrelated, and the first few components can be regarded as dominant features for classification. The detection performance of the \( LB{P_{8,1 }} \) and \( LBP_{8,1}^{u2 } \) features after PCA dimensionality reduction is shown in Fig. 19.3; note that the first 100 principal components are used for classification.
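A minimal sketch of the PCA step is given below. It assumes a feature matrix of shape (number of images, 768) for \( LB{P_{8,1 }} \) and, as a common practice not specified in the chapter, fits the projection on the training split only.

```python
from sklearn.decomposition import PCA

def reduce_dim(X_train, X_test, k=100):
    """Keep the first k principal components of the LBP feature vectors."""
    pca = PCA(n_components=k)
    X_train_k = pca.fit_transform(X_train)   # learn the projection on the training set
    X_test_k = pca.transform(X_test)         # apply the same projection to the test set
    return X_train_k, X_test_k
```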

Table 19.1 Detection results of \( LB{P_{8,1 }} \) and \( LBP_{8,1}^{u2 } \) features (standard deviation in parentheses)
Fig. 19.3 Detection performance of \( LB{P_{8,1 }} \) and \( LBP_{8,1}^{u2 } \) features after PCA dimensionality reduction

From Fig. 19.3, the following observations can be made:

  1. For both \( LB{P_{8,1 }} \) and \( LBP_{8,1}^{u2 } \) features, the detection accuracy increases dramatically with the dimensionality of the dominant features obtained by PCA, and then fluctuates within a comparatively small range.

  2. Compared with the results in Table 19.1, PCA features with dimensionality larger than 40 perform as well as the original \( LB{P_{8,1 }} \) and \( LBP_{8,1}^{u2 } \) features.

  3. The \( LB{P_{8,1 }} \) features consistently perform better than the \( LBP_{8,1}^{u2 } \) features once the dimensionality exceeds 30.

3.2 Choice of Threshold \( \sigma \)

In order to select a suitable threshold \( \sigma \), we examine the detection performance for various values of \( \sigma \). If \( \sigma \) is too small or too large, the LBP operator cannot sensitively capture the artifacts caused by image splicing, and the resulting LBP features offer relatively little discriminative information. The detection performance of the original \( LB{P_{8,1 }} \) and \( LBP_{8,1}^{u2 } \) features for \( \sigma \) in the range 0–2 is shown in Fig. 19.4, from which it can easily be seen that the best detection performance of the proposed features is reached at \( \sigma = 0.9 \).

Fig. 19.4 Detection performance of original \( LB{P_{8,1 }} \) and \( LBP_{8,1}^{u2 } \) features with various thresholds \( \sigma \)

4 Conclusion

In this chapter, local binary patterns of DCT coefficients have been investigated for image splicing detection. Specifically, the LBP operator was used to model the magnitude components of the 2-D arrays obtained by applying MBDCT to the test images, and the resulting LBP features served as discriminative features for image splicing detection. Owing to the high dimensionality of the proposed features, PCA was used for dimensionality reduction. Experimental results have shown that both \( LB{P_{8,1 }} \) and \( LBP_{8,1}^{u2 } \) features capture the image splicing artifacts well, with the former outperforming the latter. Furthermore, PCA greatly reduced the dimensionality of the original features without losing discriminative information. This preliminary study indicates that the proposed use of local binary patterns of DCT coefficients is effective for capturing image splicing artifacts. Our future work is to further enhance the detection performance of the proposed method.