1 Introduction

With the rapid development of video sharing technology, it is essential to study algorithms for hiding data in videos for copyright protection, covert communication, integrity authentication, and so on [1, 2]. H.264 is a state-of-the-art video compression standard and has become the most widely deployed video codec. At the H.264 encoder, a residual pixel block is obtained by subtracting a prediction block from its original pixel block in the YUV video. After lossy compression [discrete cosine transformation (DCT) and quantization], the residual block becomes a quantized DCT (QDCT) block, which has one direct current (DC) coefficient and some alternating current (AC) coefficients. After entropy encoding (lossless compression) of each QDCT macro block (MB), the YUV video is encoded into an H.264 video. Therefore, information hidden by changing QDCT coefficients can be fully extracted after entropy decoding. It is most common to embed data into QDCT coefficients, but the distortion caused by hiding data will spread and accumulate [3, 4]. Using general (irreversible) data hiding methods to embed information causes permanent distortion of the host multimedia. However, if data is hidden into QDCT coefficients with a reversible data hiding (RDH) algorithm, the values of the QDCT coefficients can be completely restored after the information is extracted, so important videos can be saved and enjoyed without the embedded information and the distortion caused by hiding data. Consequently, there will not be too many marked videos on the network, which makes it difficult for others to find the stego cover. In addition, RDH methods can also be applied to video error concealment [5–8] and sensitive application fields [9–13] such as multimedia archive management, medical multimedia sharing, military affairs, remote sensing, and law enforcement.

In recent years, many RDH methods such as lossless compression [14–16], difference expansion [5, 6, 8, 11, 13, 17–22], histogram shifting (HS) [7, 23–34] and integer transform [10, 12] have been presented. In an RDH framework based on lossless compression [14], compressible parts, which are extracted nondestructively from the original cover, are compressed by a lossless compression algorithm, and the to-be-hidden information is appended to the compressed parts. Correspondingly, the receiver extracts the information from the end of the sequence and recovers the original cover by decompressing the compressed parts. When this approach is used to hide information, low embedding capacity and high computational complexity are two issues that must be addressed. Furthermore, in the lossless compression scheme [16], a recursive code construction and a lossless compression algorithm are used to hide data. However, it is not easy to use the recursive construction to hide information in H.264 video, since this video is encoded by treating each MB sequentially [35]. In the difference expansion algorithm [17], the difference between the two adjacent pixels of a pixel pair is expanded to hide one data bit. Moreover, prediction-error expansion improves the hiding performance of difference expansion by expanding the difference between a pixel and its prediction [19].

The peak of the image histogram is used to hide information in the HS method proposed by Ni et al. [23], in which each pixel value is changed by at most 1 to hide one data bit. Li et al. [31] proposed a general framework for HS-based RDH, in which an RDH algorithm can be constructed by simply designing the shifting and embedding functions. To achieve a better capacity-distortion trade-off, various prediction methods are used to obtain sharp difference histograms [36].

However, in general HS-based RDH methods, each pixel, difference, or prediction error is changed individually to hide a data bit, which constrains the capacity-distortion performance. To overcome this limitation on capacity-distortion performance and embedding efficiency, an efficient RDH algorithm based on three-dimensional (3D) HS is proposed in this work. Stereo H.264 videos, encoded and decoded with multi-view coding (MVC), are taken as covers in this paper. Any block that is not used to predict other blocks is treated as an embeddable block, from which three QDCT AC coefficients are randomly chosen as an embeddable unit. The coefficient units are divided into disjoint regions, and on the basis of these regions the 3D histogram is expanded or shifted to hide data reversibly. To embed two data bits, two coefficients may have to be modified in the conventional HS, whereas only one coefficient may need to be changed in the proposed scheme. Compared with some state-of-the-art methods, the presented algorithm has superior payload-distortion performance, which is verified by the experimental results.

The rest of the paper is organized as follows: Section 2 presents the main idea of 3D HS. In Sect. 3, the proposed RDH algorithm for MVC video and its implementation details are described. The hiding performance of the presented algorithm is evaluated via experimental results in Sect. 4. Finally, conclusions are drawn in Sect. 5.

2 RDH method using HS

2.1 Conventional HS

Ni et al.’s HS method [23] can be used to hide data in the QDCT coefficients of MVC video. Denote a QDCT coefficient as F x1 and the marked QDCT coefficient as \( F^{\prime}_{x1} \). Information is hidden by expanding and shifting the one-dimensional (1D) histogram as shown in Fig. 1 and (1), where m i ∈ {0, 1} is a to-be-embedded data bit.

$$ F^{\prime}_{x1} = \left\{ {\begin{array}{*{20}l} {0,} & {{\text{if }}(F_{x1} = 0) \wedge (m_{i} = 0)} \\ {1,} & {{\text{if }}(F_{x1} = 0) \wedge (m_{i} = 1)} \\ {F_{x1} + 1,} & {{\text{if }}F_{x1} > 0} \\ \end{array} } \right. $$
(1)
Fig. 1
figure 1

Conventional HS

The positive coefficients are shifted to vacate space. When the value of F x1 is 0, one data bit can be hidden: the coefficient becomes 1 if m i is 1 and remains 0 if m i is 0. Accordingly, the hidden bit m i can be extracted from the marked QDCT coefficient \( F^{\prime}_{x1} \), and the original coefficient value can be completely restored, as follows:

  1. 1.

    If \( F^{\prime}_{x1} \) = 0, the extracted information bit m i  = 0 and the original coefficient F x1 = 0.

  2. 2.

    If \( F^{\prime}_{x1} \) = 1, the extracted information bit m i  = 1 and the original coefficient F x1 = 0.

  3. 3.

    If \( F^{\prime}_{x1} \) > 1, there is no hidden data in the coefficient and the original coefficient F x1 = \( F^{\prime}_{x1} \) − 1.
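The mapping in (1) and its inverse can be summarized in a short Python sketch (illustrative only; the function names are ours, and only nonnegative coefficients are considered, as in Fig. 1):

```python
def hs1d_embed(F, bit):
    """Embed one bit into a nonnegative QDCT coefficient by 1D HS, per (1)."""
    if F == 0:
        return bit        # 0 stays 0 for bit 0 and becomes 1 for bit 1
    return F + 1          # positive coefficients are shifted to vacate the bin at 1

def hs1d_extract(F_marked):
    """Return (bit, original coefficient); bit is None if the coefficient carries no data."""
    if F_marked == 0:
        return 0, 0
    if F_marked == 1:
        return 1, 0
    return None, F_marked - 1   # shifted coefficient, no payload
```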

In this method, the 1D coefficient histogram is defined by (2), where # denotes the cardinality of a set and s 1 is a nonnegative integer.

$$ h\left( {s_{1} } \right) = \# \left\{ {F_{x1} |F_{x1} = s_{1} } \right\} $$
(2)

If the scheme shown in Fig. 1 is used to embed data into each of three QDCT coefficients denoted by F x1, F x2, and F x3, the mapping becomes the conventional 3D HS shown in Fig. 2, where the 3D histogram is defined as

$$ w\left( {s_{ 1} ,s_{ 2} ,s_{ 3} } \right) = \# \left\{ {\left( {F_{x 1} ,F_{x 2} ,F_{x 3} } \right)|F_{x 1} = s_{ 1} ,F_{x 2} = s_{ 2} ,F_{x 3} = s_{ 3} } \right\} $$
(3)

where s 2 and s 3 are nonnegative integers.
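Both histograms can be tallied directly from the chosen coefficients; a minimal sketch (the container type is our choice):

```python
from collections import Counter

def hist_1d(coeffs):
    """h(s1): count of coefficients equal to s1, per (2)."""
    return Counter(coeffs)

def hist_3d(triples):
    """w(s1, s2, s3): count of coefficient triples equal to (s1, s2, s3), per (3)."""
    return Counter(triples)
```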

Fig. 2
figure 2

Conventional 3D HS

2.2 Proposed 3D HS

In HS schemes, different positions are used to represent different messages. As shown in Fig. 2, eight adjacent positions are needed to represent the eight possible three-bit messages (000, 001, 010, 011, 100, 101, 110, and 111), four adjacent positions are needed to represent the four possible two-bit messages (00, 01, 10, and 11), and two neighboring positions are needed to represent the two possible one-bit messages (0 and 1). When only one position is available, no message can be hidden, and the original position is shifted to its neighboring place. It can be observed that the maximum cost (number of unit modifications) of each QDCT coefficient group in Fig. 2 is 3, which may cause obvious distortion.

To reduce this cost, we first search for positions that can represent different messages with at most one change. If only nonnegative coefficient groups are used to store information, four positions [(0,0,0), (1,0,0), (0,1,0), and (0,0,1)] can represent the four possible two-bit messages. When the value of the coefficient group (F x1, F x2, F x3) is (0,0,0), it represents the two data bits 00 with no modification, and it can be expanded to its neighboring positions (1,0,0), (0,1,0), and (0,0,1) to signify the two data bits 01, 10, and 11 with one modification, respectively. When the value of the coefficient group (F x1, F x2, F x3) is (F x1,0,0) (F x1 > 0), it can be expanded to its neighboring positions (F x1, 0, 1), (F x1 + 1,0,0), and (F x1, 1,0) to signify the two data bits 00, 01, and 10 with one modification, respectively, and it can be expanded to (F x1, 1, 1) to signify the two data bits 11, where the cost is 2. When F x1, F x2, F x3 are all positive integers, the group is shifted to (F x1, F x2 + 1, F x3 + 1), where the cost is 2; in Fig. 2, by contrast, the group is shifted to (F x1 + 1, F x2 + 1, F x3 + 1), where the cost is 3. In this way, compared with the conventional HS, we obtain a more efficient 3D HS scheme, as shown in Fig. 3, where the set (denoted as J) of all the points is divided into ten disjoint sets defined as follows:

$$ \begin{aligned} J_{ 1} & = \left\{ {\left( {0,\,0,0} \right)} \right\} \\ J_{ 2} & = \left\{ {\left( {F_{x 1} ,0,0} \right)|F_{x 1} > 0} \right\} \\ J_{ 3} & = \left\{ {\left( {0,F_{x 2} ,0} \right)|F_{x 2} > 0} \right\} \\ J_{ 4} & = \left\{ {\left( {0,0, 1} \right)} \right\} \\ J_{ 5} & = \left\{ {\left( {0,0,F_{x 3} } \right)|F_{x 3} > 1} \right\} \\ J_{ 6} & = \left\{ {\left( {F_{x 1} ,F_{x 2} , \, 0} \right)|F_{x 1} > 0,F_{x 2} > 0} \right\} \\ J_{ 7} & = \left\{ {\left( { 1, \, 0,F_{x 3} } \right)|F_{x 3} > 0} \right\} \\ J_{ 8} & = \left\{ {\left( {F_{x 1} ,0,F_{x 3} } \right)|F_{x 1} > 1,F_{x 3} > 0} \right\} \\ J_{ 9} & = \left\{ {\left( {0,F_{x 2} ,F_{x 3} } \right)|F_{x 2} > \, 0,F_{x 3} > 0} \right\} \\ J_{ 10} & = \left\{ {\left( {F_{x 1} ,F_{x 2} ,F_{x 3} } \right)|F_{x 1} > 0,F_{x 2} > 0,F_{x 3} > 0} \right\} \\ \end{aligned} $$
Fig. 3
figure 3

Proposed 3D HS
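Because the sets J 1–J 10 are disjoint and together cover all nonnegative coefficient triples, membership can be decided by a simple case analysis. A sketch (the function name and return convention are ours):

```python
def region(F1, F2, F3):
    """Return the index k (1..10) of the set J_k containing the nonnegative triple."""
    if F1 == 0 and F2 == 0 and F3 == 0: return 1
    if F1 > 0  and F2 == 0 and F3 == 0: return 2
    if F1 == 0 and F2 > 0  and F3 == 0: return 3
    if F1 == 0 and F2 == 0 and F3 == 1: return 4
    if F1 == 0 and F2 == 0 and F3 > 1:  return 5
    if F1 > 0  and F2 > 0  and F3 == 0: return 6
    if F1 == 1 and F2 == 0 and F3 > 0:  return 7
    if F1 > 1  and F2 == 0 and F3 > 0:  return 8
    if F1 == 0 and F2 > 0  and F3 > 0:  return 9
    return 10                            # F1 > 0, F2 > 0, F3 > 0
```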

We divide the chosen coefficient groups into different sets and hide information based on the set in which the value of the coefficient group resides. Accordingly, the embedding process can be described as follows.

  1. 1.

    if the coefficient group (F x1, F x2, F x3) ∈ J 1, the marked coefficient group denoted by \( (F^{\prime}_{x1} ,F^{\prime}_{x2} ,F^{\prime}_{x3} ) \) will be

    $$ (F^{\prime}_{x1} ,F^{\prime}_{x2} ,F^{\prime}_{x3} ) = \left\{ {\begin{array}{*{20}l} {(F_{x1} ,F_{x2} ,F_{x3} ) ,} & {{\text{if }}\,m_{i} m_{i + 1} = 00} \\ {(F_{x 1} + 1,F_{x 2} ,F_{x 3} ) ,} & {{\text{if}}\, \, m_{i} m_{i + 1} = 01} \\ {(F_{x 1} ,F_{x 2} + 1,F_{x 3} ) ,} & {{\text{if}}\, \, m_{i} m_{i + 1} = 10} \\ {(F_{x 1} ,F_{x 2} ,F_{x 3} + 1) ,} & {{\text{if}}\,\,m_{i} m_{i + 1} = 11} \\ \end{array} } \right. $$
    (4)
  2. 2.

    if the coefficient group (F x1, F x2, F x3) ∈ J 2, the marked coefficient group \( (F^{\prime}_{x1} ,F^{\prime}_{x2} ,F^{\prime}_{x3} ) \) will be

    $$ (F^{\prime}_{x1} ,F^{\prime}_{x2} ,F^{\prime}_{x3} ) = \left\{ {\begin{array}{*{20}l} {(F_{x1} ,F_{x2} ,F_{x3} + 1) ,} & {{\text{if }}\,m_{i} m_{i + 1} = 00} \\ {(F_{x1} + 1,F_{x2} ,F_{x3} ) ,} & {{\text{if }}\,m_{i} m_{i + 1} = 01} \\ {(F_{x1} ,F_{x2} + 1,F_{x3} ) ,} & {{\text{if }}\,m_{i} m_{i + 1} = 10} \\ {(F_{x1} ,F_{x2} + 1,F_{x3} + 1) ,} & {{\text{if}}\, \, m_{i} m_{i + 1} = 11} \\ \end{array} } \right. $$
    (5)
  3. 3.

    if the coefficient group (F x1, F x2, F x3) ∈ J 3 ∪ J 5, the marked coefficient group \( (F^{\prime}_{x1} ,F^{\prime}_{x2} ,F^{\prime}_{x3} ) \) will be

    $$ (F^{\prime}_{x1} ,F^{\prime}_{x2} ,F^{\prime}_{x3} ) = \left\{ {\begin{array}{*{20}l} {(F_{x1} ,F_{x2} ,F_{x3} + 1) ,} & {{\text{if }}\,m_{i} = 0} \\ {(F_{x1} ,F_{x2} + 1,F_{x3} ) ,} & {{\text{if }}\,m_{i} = 1} \\ \end{array} } \right. $$
    (6)
  4. 4.

    if the coefficient group (F x1, F x2, F x3) ∈ J 4, the marked coefficient group \( (F^{\prime}_{x1} ,F^{\prime}_{x2} ,F^{\prime}_{x3} ) \) will be

    $$ (F^{\prime}_{x1} ,F^{\prime}_{x2} ,F^{\prime}_{x3} ) = \left\{ {\begin{array}{*{20}l} {(F_{x1} ,F_{x2} ,F_{x3} + 1) ,} & {{\text{if }}\,m_{i} = 0} \\ {(F_{x1} + 1,F_{x2} ,F_{x3} + 1) ,} & {{\text{if }}\,m_{i} = 1} \\ \end{array} } \right. $$
    (7)
  5. 5.

    if the coefficient group (F x1, F x2, F x3) ∈ J 6, the marked coefficient group \( (F^{\prime}_{x1} ,F^{\prime}_{x2} ,F^{\prime}_{x3} ) \) will be

    $$ (F^{\prime}_{x1} ,F^{\prime}_{x2} ,F^{\prime}_{x3} ) = \left\{ {\begin{array}{*{20}l} {(F_{x1} ,F_{x2} + 1,F_{x3} ) ,} & {{\text{if }}\,m_{i} = 0} \\ {(F_{x1} ,F_{x2} + 1,F_{x3} + 1) ,} & {{\text{if }}\,m_{i} = 1} \\ \end{array} } \right. $$
    (8)
  6. 6.

    if the coefficient group (F x1, F x2, F x3) ∈ J 7, the marked coefficient group \( (F^{\prime}_{x1} ,F^{\prime}_{x2} ,F^{\prime}_{x3} ) \) will be

    $$ (F^{\prime}_{x1} ,F^{\prime}_{x2} ,F^{\prime}_{x3} ) = \left\{ {\begin{array}{*{20}l} {(F_{x1} ,F_{x2} + 1,F_{x3} + 1) ,} & {{\text{if }}\,m_{i} = 0} \\ {(F_{x1} ,F_{x2} ,F_{x3} + 2) ,} & {{\text{if }}\,m_{i} = 1} \\ \end{array} } \right. $$
    (9)
  7. 7.

    if the coefficient group (F x1, F x2, F x3) ∈ J 8, the marked coefficient group \( (F^{\prime}_{x1} ,F^{\prime}_{x2} ,F^{\prime}_{x3} ) \) will be

    $$ (F^{\prime}_{x1} ,F^{\prime}_{x2} ,F^{\prime}_{x3} ) = \left\{ {\begin{array}{*{20}l} {(F_{x1} ,F_{x2} ,F_{x3} + 1) ,} & {{\text{if }}m_{i} = 0} \\ {(F_{x1} ,F_{x2} + 1,F_{x3} + 1) ,} & {{\text{if }}m_{i} = 1} \\ \end{array} } \right. $$
    (10)
  8. 8.

    if the coefficient group (F x1, F x2, F x3) ∈ J 9 ∪ J 10, no information is hidden, and the marked coefficient group \( (F^{\prime}_{x1} ,F^{\prime}_{x2} ,F^{\prime}_{x3} ) \) will be taken as (F x1, F x2 + 1, F x3 + 1).

After data is hidden using these mapping rules, the information can be extracted according to the set in which the marked coefficient group resides, and the original coefficient values can be recovered by reversing the embedding process.
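A compact Python sketch of the embedding rules (4)-(10) and of extraction is given below. It reuses the region helper sketched above; the interface (bit lists, return values) is our assumption, and a practical implementation would replace the brute-force inverse with the explicit case analysis.

```python
from itertools import product

def embed_3dhs(F, bits):
    """Embed up to two bits into a nonnegative coefficient triple F = (F1, F2, F3)
    according to rules (4)-(10); J9/J10 triples are shifted without carrying data.
    Returns (marked_triple, number_of_bits_consumed)."""
    F1, F2, F3 = F
    r = region(F1, F2, F3)
    if r == 1:                                                    # rule (4)
        table = {(0, 0): (0, 0, 0), (0, 1): (1, 0, 0),
                 (1, 0): (0, 1, 0), (1, 1): (0, 0, 1)}
        return table[tuple(bits[:2])], 2
    if r == 2:                                                    # rule (5)
        table = {(0, 0): (F1, 0, 1), (0, 1): (F1 + 1, 0, 0),
                 (1, 0): (F1, 1, 0), (1, 1): (F1, 1, 1)}
        return table[tuple(bits[:2])], 2
    if r in (3, 5):                                               # rule (6)
        return ((F1, F2, F3 + 1) if bits[0] == 0 else (F1, F2 + 1, F3)), 1
    if r == 4:                                                    # rule (7)
        return ((0, 0, 2) if bits[0] == 0 else (1, 0, 2)), 1
    if r == 6:                                                    # rule (8)
        return ((F1, F2 + 1, 0) if bits[0] == 0 else (F1, F2 + 1, 1)), 1
    if r == 7:                                                    # rule (9)
        return ((1, 1, F3 + 1) if bits[0] == 0 else (1, 0, F3 + 2)), 1
    if r == 8:                                                    # rule (10)
        return ((F1, 0, F3 + 1) if bits[0] == 0 else (F1, 1, F3 + 1)), 1
    return (F1, F2 + 1, F3 + 1), 0                                # J9 and J10: shift only

def extract_3dhs(G):
    """Inverse mapping: return (embedded_bits, original_triple) for a marked triple G.
    The overall mapping is injective, so exactly one candidate reproduces G."""
    for O in product(*(range(max(0, g - 2), g + 1) for g in G)):
        for bits in ([0, 0], [0, 1], [1, 0], [1, 1]):
            marked, n = embed_3dhs(O, bits)
            if marked == tuple(G) and (n > 0 or bits == [0, 0]):
                return bits[:n], O
    raise ValueError("not a valid marked triple")
```

For example, embed_3dhs((2, 0, 0), [1, 1]) returns ((2, 1, 1), 2), and extract_3dhs((2, 1, 1)) recovers ([1, 1], (2, 0, 0)).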

2.3 Embedding capacity and distortion

When the 1D HS is used for hiding data, the embedding capacity denoted as EC is h(0). For QDCT coefficients, the embedding distortion denoted as ED in terms of l 2-error can be formulated as

$$ {\text{ED}} = \frac{1}{2}h(0) + \sum\limits_{s_{1} = 1}^{ + \infty } {h(s_{1} )} $$
(11)

The embedding capacities of the conventional 3D HS and the proposed 3D HS, denoted as ECcon and ECpro, can be calculated by (12) and (13).

$$ \begin{aligned} {\text{EC}}_{\text{con}} & = 3\sum\limits_{{(F_{x1} ,F_{x2} ,F_{x3} ) \in J_{1} }} {w(F_{x1} ,F_{x2} ,F_{x3} )} + 2\sum\limits_{{(F_{x1} ,F_{x2} ,F_{x3} ) \in J_{2} \cup J_{3} \cup J_{4} \cup J_{5} }} {w(F_{x1} ,F_{x2} ,F_{x3} )} \\ & \quad + \sum\limits_{{(F_{x1} ,F_{x2} ,F_{x3} ) \in J_{6} \cup J_{7} \cup J_{8} \cup J_{9} }} {w(F_{x1} ,F_{x2} ,F_{x3} )} \\ \end{aligned} $$
(12)
$$ {\text{EC}}_{\text{pro}} = 2\sum\limits_{{(F_{x1} ,F_{x2} ,F_{x3} ) \in J_{1} \cup J_{2} }} {w(F_{x1} ,F_{x2} ,F_{x3} )} + \sum\limits_{{(F_{x1} ,F_{x2} ,F_{x3} ) \in J_{3} \cup J_{4} \cup J_{5} \cup J_{6} \cup J_{7} \cup J_{8} }} {w(F_{x1} ,F_{x2} ,F_{x3} )} $$
(13)

For QDCT coefficients, the embedding distortion in terms of \( l^{2} \)-error of the conventional 3D HS and the proposed 3D HS, denoted as EDcon and EDpro, can be formulated as

$$ \begin{aligned} {\text{ED}}_{\text{con}} & = \frac{3}{2}\sum\limits_{{(F_{x1} ,F_{x2} ,F_{x3} ) \in J_{1} }} {w(F_{x1} ,F_{x2} ,F_{x3} )} + 2\sum\limits_{{(F_{x1} ,F_{x2} ,F_{x3} ) \in J_{2} \cup J_{3} \cup J_{4} \cup J_{5} }} {w(F_{x1} ,F_{x2} ,F_{x3} )} \\ & \quad + \frac{5}{2}\sum\limits_{{(F_{x1} ,F_{x2} ,F_{x3} ) \in J_{6} \cup J_{7} \cup J_{8} \cup J_{9} }} {w(F_{x1} ,F_{x2} ,F_{x3} )} + 3\sum\limits_{{(F_{x1} ,F_{x2} ,F_{x3} ) \in J_{10} }} {w(F_{x1} ,F_{x2} ,F_{x3} )} \\ \end{aligned} $$
(14)

and

$$ \begin{aligned} {\text{ED}}_{\text{pro}} & = \frac{3}{4}\sum\limits_{{(F_{x1} ,F_{x2} ,F_{x3} ) \in J_{1} }} {w(F_{x1} ,F_{x2} ,F_{x3} )} + \frac{5}{4}\sum\limits_{{(F_{x1} ,F_{x2} ,F_{x3} ) \in J_{2} }} {w(F_{x1} ,F_{x2} ,F_{x3} )} + \sum\limits_{{(F_{x1} ,F_{x2} ,F_{x3} ) \in J_{3} \cup J_{5} }} {w(F_{x1} ,F_{x2} ,F_{x3} )} \\ & \quad + \frac{3}{2}\sum\limits_{{(F_{x1} ,F_{x2} ,F_{x3} ) \in J_{4} \cup J_{6} \cup J_{8} }} {w(F_{x1} ,F_{x2} ,F_{x3} )} + 2\sum\limits_{{(F_{x1} ,F_{x2} ,F_{x3} ) \in J_{7} \cup J_{9} \cup J_{10} }} {w(F_{x1} ,F_{x2} ,F_{x3} )} \\ \end{aligned} $$
(15)
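Under these definitions, the capacity and distortion of both mappings can be accumulated in one pass over the 3D histogram; a sketch (w is assumed to be a dict mapping coefficient triples to counts, and the region helper from Sect. 2.2 is reused):

```python
def capacity_and_distortion(w):
    """Return (EC_con, EC_pro, ED_con, ED_pro) computed from a 3D histogram w,
    following the per-region weights in (12)-(15)."""
    cap_con = {1: 3, 2: 2, 3: 2, 4: 2, 5: 2, 6: 1, 7: 1, 8: 1, 9: 1, 10: 0}
    dis_con = {1: 1.5, 2: 2, 3: 2, 4: 2, 5: 2, 6: 2.5, 7: 2.5, 8: 2.5, 9: 2.5, 10: 3}
    cap_pro = {1: 2, 2: 2, 3: 1, 4: 1, 5: 1, 6: 1, 7: 1, 8: 1, 9: 0, 10: 0}
    dis_pro = {1: 0.75, 2: 1.25, 3: 1, 4: 1.5, 5: 1, 6: 1.5, 7: 2, 8: 1.5, 9: 2, 10: 2}
    EC_con = EC_pro = ED_con = ED_pro = 0.0
    for (F1, F2, F3), count in w.items():
        r = region(F1, F2, F3)
        EC_con += count * cap_con[r]
        ED_con += count * dis_con[r]
        EC_pro += count * cap_pro[r]
        ED_pro += count * dis_pro[r]
    return EC_con, EC_pro, ED_con, ED_pro
```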

According to (12) and (13), the difference in embedding capacity between the conventional 3D HS and the presented 3D HS is

$$ {\text{EC}}_{\text{con}} - {\text{EC}}_{\text{pro}} = \sum\limits_{{(F_{x1} ,F_{x2} ,F_{x3} ) \in J_{1} \cup J_{3} \cup J_{4} \cup J_{5} \cup J_{9} }} {w(F_{x1} ,F_{x2} ,F_{x3} )} $$
(16)

According to (14) and (15), the difference in embedding distortion between the conventional 3D HS and the presented 3D HS is

$$ \begin{aligned} {\text{ED}}_{\text{con}} - {\text{ED}}_{\text{pro}} & = \frac{3}{4}\sum\limits_{{(F_{x1} ,F_{x2} ,F_{x3} ) \in J_{1} \cup J_{2} }} {w(F_{x1} ,F_{x2} ,F_{x3} )} + \sum\limits_{{(F_{x1} ,F_{x2} ,F_{x3} ) \in J_{3} \cup J_{5} \cup J_{6} \cup J_{8} \cup J_{10} }} {w(F_{x1} ,F_{x2} ,F_{x3} )} \\ & \quad + \frac{1}{2}\sum\limits_{{(F_{x1} ,F_{x2} ,F_{x3} ) \in J_{4} \cup J_{7} \cup J_{9} }} {w(F_{x1} ,F_{x2} ,F_{x3} )} \\ \end{aligned} $$
(17)

Therefore, compared with the conventional 3D HS, although the capacity of our method is lower, the distortion is greatly reduced. Two examples illustrate the advantage of the proposed method.

  1. 1.

    For the group (F x1, F x2, F x3) = (0, 0, 0) ∈ J 1, in the proposed 3D HS the distortion is 0, 1, 1 and 1 when (m i , m i+1) is (0, 0), (0, 1), (1, 0), and (1, 1), respectively, whereas in the conventional 3D HS the distortion is 2 if (m i , m i+1) is (1, 1). Therefore, when the quantity of data hidden by the proposed method equals that hidden by the conventional 3D HS, the proposed method achieves better quality than the conventional 3D HS.

  2. 2.

    For the group (F x1, F x2, F x3) = (2, 0, 0) ∈ J 2, in the proposed 3D HS the cost is 1, 1, 1 and 2 when (m i , m i+1) is (0, 0), (0, 1), (1, 0), and (1, 1), respectively, whereas in the conventional 3D HS the cost is 1, 2, 2 and 3, respectively. Accordingly, for the coefficient groups in the set J 2, the presented 3D HS yields better cover quality than the conventional 3D HS when the same amount of information is hidden.

In general, for the coefficient groups in J 2, the same embedding capacity, i.e., \( 2\sum\nolimits_{{(F_{x1} ,F_{x2} ,F_{x3} ) \in J_{2} }} {w(F_{x1} ,F_{x2} ,F_{x3} )} \), is obtained by the two methods, but the proposed scheme causes lower distortion, since \( \frac{5}{4}\sum\nolimits_{{(F_{x1} ,F_{x2} ,F_{x3} ) \in J_{2} }} {w(F_{x1} ,F_{x2} ,F_{x3} )} < 2\sum\nolimits_{{(F_{x1} ,F_{x2} ,F_{x3} ) \in J_{2} }} {w(F_{x1} ,F_{x2} ,F_{x3} )} \). For the coefficient groups in J 1, the hiding efficiency (capacity divided by distortion) of the proposed method is \( 2/\tfrac{3}{4} = \tfrac{8}{3} \), whereas that of the conventional method is \( 3/\tfrac{3}{2} = 2 < \tfrac{8}{3} \). Similarly, when the coefficient group (F x1, F x2, F x3) belongs to the other sets and the same quantity of data is hidden, better video quality and higher hiding efficiency are achieved by the proposed 3D HS than by the conventional 3D HS; a small numerical check is sketched below.
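These two comparisons can be checked numerically with the capacity_and_distortion sketch above, using toy histograms that contain only J 1 or only J 2 triples (the counts are arbitrary):

```python
# Only J1 triples: the proposed efficiency is 8/3, the conventional one is 2.
EC_con, EC_pro, ED_con, ED_pro = capacity_and_distortion({(0, 0, 0): 100})
print(EC_pro / ED_pro, EC_con / ED_con)        # 2.666...  2.0

# Only J2 triples: equal capacity (200 bits), but distortion 125 instead of 200.
EC_con, EC_pro, ED_con, ED_pro = capacity_and_distortion({(2, 0, 0): 100})
print(EC_con, EC_pro, ED_con, ED_pro)          # 200.0 200.0 200.0 125.0
```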

Additionally, the presented scheme can be applied in image or video RDH algorithms, since the difference or prediction-error histogram is similar to the coefficient histogram. When the presented scheme is used for some media, especially 8-bit gray-scale images [24], the overflow/underflow problem must be handled. However, this problem need not be considered when the information is embedded into the QDCT coefficients of H.264 video [6].

3 The proposed RDH algorithm for MVC video

3.1 Embeddable blocks limiting distortion drift

The original YUV videos captured by cameras are compressed to decrease the network transmission load. To reduce the redundancy of video sequences, parallax prediction, inter-frame prediction, and intra-frame prediction are employed in the MVC standard to compute the prediction block. The prediction block is then subtracted from the original block in the YUV video at the MVC encoder, and the residual block, denoted as K R0, undergoes 4 × 4 (or 8 × 8) DCT and quantization as shown in

$$ F = {\text{round}}\left[ {(C_{f} K^{R0} C_{f}^{T} ) \otimes (E_{f} /Q)} \right] $$
(18)

where F is a QDCT block with 16 QDCT coefficients numbered by zigzag scan as shown in Fig. 4, \( C_{f} = \left[ {\begin{array}{*{20}r} 1 & 1 & 1 & 1 \\ 2 & 1 & { - 1} & { - 2} \\ 1 & { - 1} & { - 1} & 1 \\ 1 & { - 2} & 2 & { - 1} \\ \end{array} } \right] \), \( E_{f} = \left[ {\begin{array}{*{20}c} {a^{2} } & {ab/2} & {a^{2} } & {ab/2} \\ {ab/2} & {b^{2} /4} & {ab/2} & {b^{2} /4} \\ {a^{2} } & {ab/2} & {a^{2} } & {ab/2} \\ {ab/2} & {b^{2} /4} & {ab/2} & {b^{2} /4} \\ \end{array} } \right] \), a = 1/2, \( b = \sqrt {2/5} \), the matrix \( C_{f}^{T} \) is the transpose of \( C_{f} \), Q is the quantization step size, and \( \otimes \) denotes element-wise multiplication, i.e., each value in the former matrix is multiplied by the value at the corresponding position in the latter matrix.

Fig. 4
figure 4

A 4 × 4 QDCT block with zigzag scan

If one data bit is hidden into one QDCT block F by changing some QDCT coefficients, the QDCT block F will be altered to a marked block denoted by F′, and the deviation (denoted as ∆F) introduced by hiding data is

$$ \Delta F = F^{\prime} - F $$
(19)

To reconstruct the YUV video that users watch on the screen, the decoder computes the prediction block and adds it to the residual block K R, which is obtained by lossless decompression (entropy decoding) followed by lossy reconstruction (inverse quantization and inverse 4 × 4 (or 8 × 8) DCT), as shown in

$$ K^{\text{R}} = {\text{round}}\left[ {C_{d}^{T} \left( {F \cdot Q \otimes E_{d} } \right)C_{d} } \right] $$
(20)

where \( C_{d} = \left[ {\begin{array}{*{20}l} 1 & 1 & 1 & 1 \\ 1 & {1/2} & { - 1/2} & { - 1} \\ 1 & { - 1} & { - 1} & 1 \\ {1/2} & { - 1} & 1 & { - 1/2} \\ \end{array} } \right],\quad E_{d} = \left[ {\begin{array}{*{20}l} {a^{2} } & {ab} & {a^{2} } & {ab} \\ {ab} & {b^{2} } & {ab} & {b^{2} } \\ {a^{2} } & {ab} & {a^{2} } & {ab} \\ {ab} & {b^{2} } & {ab} & {b^{2} } \\ \end{array} } \right]. \)

When a data bit is hidden by modifying some QDCT coefficients of a block, the residual pixel block K R turns into a marked pixel block denoted as \( K^{{R^{\prime}}} \), and the resulting change, denoted by ∆K R, is

$$ \Delta K^{R} = K^{{R^{\prime}}} - K^{R} = {\text{round}}\left[ {C_{d}^{T} (\Delta F \cdot Q \otimes E_{d} )C_{d} } \right]. $$
(21)

Take the QDCT coefficient F 13 as an example to show the distortion caused by embedding information. Suppose an integer r is added to the value of F 13, that is, the change of the QDCT block for embedding information is \( \Delta F = \left[ {\begin{array}{*{20}c} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & r \\ 0 & 0 & 0 & 0 \\ \end{array} } \right] \); then the change of the corresponding pixel block in the YUV video is \( \Delta K^{R} = \frac{1}{2}Qabr\left[ {\begin{array}{*{20}r} 1 & { - 2} & 2 & { - 1} \\ { - 1} & 2 & { - 2} & 1 \\ { - 1} & 2 & { - 2} & 1 \\ 1 & { - 2} & 2 & { - 1} \\ \end{array} } \right] \).

It can be seen that the modification of one QDCT coefficient distorts the whole 4 × 4 transform block in the corresponding YUV video. Similarly, altering one QDCT coefficient in an 8 × 8 transform block changes the whole 8 × 8 pixel block, which covers a larger area than a 4 × 4 block. Thus, 4 × 4 transform blocks are chosen for hiding data in this paper.
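The pixel-domain effect of a single coefficient change can be reproduced directly from (21); a small NumPy sketch (the function name and the choice Q = 10 are ours):

```python
import numpy as np

def pixel_change(dF, Q):
    """Delta K^R = round( C_d^T (dF * Q ⊗ E_d) C_d ), per (21)."""
    a, b = 0.5, np.sqrt(2.0 / 5.0)
    Cd = np.array([[1.0, 1.0, 1.0, 1.0],
                   [1.0, 0.5, -0.5, -1.0],
                   [1.0, -1.0, -1.0, 1.0],
                   [0.5, -1.0, 1.0, -0.5]])
    Ed = np.array([[a*a, a*b, a*a, a*b],
                   [a*b, b*b, a*b, b*b],
                   [a*a, a*b, a*a, a*b],
                   [a*b, b*b, a*b, b*b]])
    return np.round(Cd.T @ (dF * Q * Ed) @ Cd)

# Adding r = 1 to F_13 (row 2, column 3 of the block, as in the example above):
dF = np.zeros((4, 4))
dF[2, 3] = 1
print(pixel_change(dF, 10))   # reproduces the (1/2)Qabr pattern, rounded to integers
```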

Correspondingly, it can be inferred that the boundary pixels denoted as c 0–c 12, shown in Fig. 5, may be modified by embedding information into some QDCT coefficients of the blocks K u,v−1 (integers u and v denote the position of a block), K u−1,v−1, K u−1,v , and K u−1,v+1. Additionally, if intra-frame prediction is utilized by the current block K u,v , its prediction block is calculated from the pixels c 0–c 12. Consequently, the embedding-induced deviation of the blocks K u,v−1, K u−1,v−1, K u−1,v , and K u−1,v+1 will drift to the block K u,v . Otherwise, when the prediction block of the block K u,v is computed using inter-frame prediction or parallax prediction, i.e., by referring to another frame as shown in Fig. 6, any modification of adjacent blocks in the same frame has no impact on the block K u,v .

Fig. 5
figure 5

Intra-frame prediction mode: a block position, b the predictive direction of 4 × 4 and 8 × 8 luma block and c the predictive direction of 16 × 16 luma block. In mode 2 not shown in the figure, all elements are predicted with the average of upper pixels denoted by H and left pixels denoted as V, i.e., mean (H + V)

Fig. 6
figure 6

Prediction structure of MVC video with two views

A 16 × 16 MB with inter-frame prediction or parallax prediction is denoted as an inter-MB. If the current block K u,v is one of the nine 4 × 4 blocks numbered 0–8, which are located in neither the bottom row nor the rightmost column of the inter-MB as shown in Fig. 7, its adjacent blocks K u,v+1, K u+1,v+1 and K u+1,v lie in the current inter-MB. In addition, its neighboring block K u+1,v−1 is either in the current inter-MB or one of the blocks numbered 9, 10, and 11 in the already encoded MB at the encoder (or the already decoded MB at the decoder). Therefore, these adjacent blocks K u,v+1, K u+1,v+1, K u+1,v , and K u+1,v−1 are not influenced by modifications to the block K u,v , and the blocks numbered 0–8 in an inter-MB can be chosen as embeddable blocks to embed data without causing intra-frame distortion drift.

Fig. 7
figure 7

4 × 4 blocks without intra-frame distortion drift

Besides intra-frame distortion drift, inter-frame and parallax distortion drift also decrease the video quality. As illustrated in Fig. 6, hierarchical B coding is used in the prediction scheme of MVC video with two views. For one group of pictures (GOP) with 16 frames, there are eight frames in each view. The horizontal prediction is inter-frame, and the vertical prediction is parallax. The I0 frame and the P0 frame are pivotal pictures at the highest level. Only intra-frame prediction is used for the I0 frame, so it is not affected by hiding data into other frames, but hiding data into an I0 frame will affect all the P0, B1, B2, B3, and b4 frames in the two GOPs predicted from the I0 frame. In contrast, hiding information into P0 or b4 frames in the right view does not cause parallax distortion drift, as P0 and b4 frames are not referenced by the frames in the left view; moreover, hiding information into b4 frames does not cause inter-frame distortion drift either. In addition, only b4 frames may be affected by hiding data into B3 frames, and the six B3 frames in one GOP can provide enough redundancy space. Accordingly, compared with hiding data into the I0 frame, better video quality can be obtained by hiding information into P0, B3 or b4 frames, which can be selected by users on demand.

3.2 Embedding procedure

The presented RDH algorithm for MVC video is shown in Fig. 8. The sender entropy decodes the MVC video and chooses embeddable blocks from the QDCT inter-MBs; the MBs not selected for hiding data are entropy encoded directly. The information is hidden into three QDCT coefficients (F x1, F x2, F x3) chosen from each embeddable 4 × 4 block. The marked MVC video is obtained by entropy encoding each MB after hiding data. Correspondingly, the receiver entropy decodes the marked MVC video to extract the embedded data from the marked QDCT coefficients, which are then completely recovered.

Fig. 8
figure 8

The flowchart of the presented RDH algorithm for MVC video: a information embedding and b information extraction and video recovery

The way of selecting three QDCT coefficients (F x1, F x2, F x3) from a 4 × 4 luminance block is shown in Algorithm 1. A random function is utilized to choose three coefficients from the 15 AC coefficients of a block in the zigzag scan order described in Fig. 4. Figure 9 shows an example of selecting three embeddable coefficients.

figure a
Fig. 9
figure 9

The random selection of three embeddable coefficients

Step 1:

The cursor starts from the first position marked by 1

Step 2:

When 7 is chosen randomly, the embeddable coefficient F x1 is F 7, and the positions of 7 and 1 pointed by the cursor are swapped

Step 3:

Move the cursor forward to point at 2

Step 4:

If 12 is selected at random, the embeddable coefficient F x2 is F 12, and the places of 12 and 2 are swapped

Step 5:

Move the cursor forward to point at 3

Step 6:

When 3 is selected randomly, the embeddable coefficient F x3 is F 3. Therefore, the selected coefficient group (F x1, F x2, F x3) is (F 7, F 12, F 3)

It can be seen that there are 15 ways to choose F x1 from the 15 QDCT AC coefficients, 14 ways to select F x2 from the remaining 14 QDCT AC coefficients, and 13 ways to select F x3 from the remaining 13 QDCT AC coefficients. Accordingly, the number of possible selections of (F x1, F x2, F x3) is 15 × 14 × 13 = 2730. Even if a marked block is found by a third party, the probability of directly guessing the selected coefficient positions (and hence the hidden data) is 1/2730 \( \approx \) 3.66 × 10−4.
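The selection illustrated in Fig. 9 is a partial Fisher–Yates shuffle over the 15 AC positions; every ordered triple is equally likely, which gives the 15 × 14 × 13 = 2730 possibilities counted above. A sketch (how the random seed S is combined with the block index is our assumption):

```python
import random

def select_coefficients(S, block_index):
    """Pick three distinct AC positions (1..15 in zigzag order) for one block."""
    rng = random.Random(S * 100003 + block_index)   # per-block stream derived from seed S
    positions = list(range(1, 16))                  # the 15 AC coefficients F_1 .. F_15
    for cursor in range(3):                         # three cursor-and-swap steps, as in Fig. 9
        j = rng.randrange(cursor, 15)
        positions[cursor], positions[j] = positions[j], positions[cursor]
    return tuple(positions[:3])                     # e.g. (7, 12, 3)
```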

A small distorted area is more difficult to detect than a large one. Therefore, it is necessary to limit the distortion region within an MB. In the hiding procedure shown in Algorithm 2, an embeddable 4 × 4 block, which can be used to hide data without causing intra-frame distortion drift, is selected randomly from the nine blocks shown in Fig. 7. In this way, only one 4 × 4 block may be modified for hiding data in one MB. In addition, a positive integer denoted as Z is set to generate a random threshold denoted by U, so that embeddable blocks are selected randomly according to |F 0| ≥ U. A high threshold U constrains the number of embeddable blocks, so the distortion region is limited. Consequently, the use of randomly chosen embeddable positions, both blocks and coefficients, reduces the distortion of the statistical histogram and enhances the undetectability of the RDH scheme. A sketch of this block-selection logic follows Algorithm 2 below.

figure b
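Algorithm 2 is given above as a figure; the block-selection logic it describes can be sketched as follows (the way U is drawn from Z, the data layout, and the function interface are our assumptions, and rng is any random.Random instance seeded as in the coefficient-selection sketch):

```python
def select_block(rng, candidate_blocks, Z):
    """Pick at most one embeddable 4x4 block per inter-MB.
    candidate_blocks: the nine 4x4 QDCT blocks numbered 0-8 in Fig. 7."""
    U = rng.randrange(0, Z + 1)          # random threshold U derived from the positive integer Z
    k = rng.randrange(0, 9)              # one random candidate block in this MB
    block = candidate_blocks[k]
    if abs(block[0][0]) >= U:            # |F_0| >= U, with F_0 the DC coefficient at the top-left
        return k, block
    return None, None                    # otherwise skip this MB
```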

3.3 Extraction and recovery procedures

Algorithm 3 demonstrates the procedure of information extraction and video restoration. The same random seed generates identical random sequences. Therefore, when the sender and receiver use the same random seed S, the embeddable QDCT blocks and coefficients employed by the sender are in one-to-one correspondence with the extractable QDCT blocks and AC coefficients used by the receiver. Finally, the receiver extracts the embedded data and fully restores the video by reversing the mapping of Fig. 3.

In addition, the computational cost of the presented RDH algorithm depends on the number of video frames, denoted by N F , and the information length, denoted by L I . Therefore, the computational complexity of the presented algorithm is O(N F  × L I ).

figure c

4 Experimental results and discussions

The presented algorithm has been implemented in the H.264 reference software, version JM18.4 [37]. The nine video sequences (640 × 480) [38] shown in Fig. 10 serve as test samples. Two YUV files are encoded into an MVC video with 233 frames, which include 30 I0 frames, 30 P0 frames and 116 b4 frames. The parameter intra-period was set to 8. The capacity of a video sequence is the mean number of bits hidden per I0/P0/b4 frame over all the I0/P0/b4 frames in that sequence. The peak signal-to-noise ratio (PSNR) and structural similarity (SSIM) values, obtained by comparing the marked YUV video with the original YUV video, are averaged over all frames. The embedding efficiency e is defined as

$$ e = L_{\text{hide}} \bigg/ \sum\limits_{i = 1}^{{L_{\text{modi}} }} {{\text{Lcha}}_{i} } , $$
(22)

where L hide is the number of hidden bits, L modi is the number of modified coefficients, and Lcha i is the magnitude of the change to the i-th modified coefficient.
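A direct transcription of (22) (the variable names are ours):

```python
def embedding_efficiency(n_hidden_bits, coefficient_changes):
    """e = L_hide divided by the total magnitude of coefficient changes, per (22)."""
    return n_hidden_bits / sum(abs(c) for c in coefficient_changes)
```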

Fig. 10
figure 10

Test videos (the size of each frame is 640 × 480). a Akko&Kayo, b Ballroom, c Crowd, d Exit, e Flamenco, f Objects, g Race, h Rena and i Vassar

If the coded block pattern parameter of a block is 0, no QDCT coefficients are coded for the block, which actually contains only zero coefficients. Therefore, such a block is not modified for hiding information, so large visual distortion is avoided. The distribution of changeable QDCT coefficients in blocks with a nonzero coded block pattern is shown in Table 1. The schemes hiding data into P0 and b4 frames with intra-frame distortion drift are denoted as P0_drift and b4_drift. The schemes embedding information into P0 and b4 frames without intra-frame distortion drift are denoted as P0_interMB and b4_interMB, in which only the embeddable blocks 0–8 shown in Fig. 7 are considered. The probability of changeable zero coefficients is denoted as p 0. For P0_drift, p 0 is about 0.925; for P0_interMB, about 0.927; for b4_drift, about 0.935; and for b4_interMB, about 0.937. The overwhelming majority of changeable QDCT coefficients are zero, which indicates that the peak of the QDCT coefficient histogram is rather steep. Thus, good payload-distortion performance can be obtained by using HS to embed data. Li et al.’s two-dimensional HS method [33] could be used for embedding information into the QDCT coefficients of MVC video. When the value of the coefficient pair (F x1, F x2) is (0, 0), it is expanded to the neighboring position (F x1 + 1, F x2 − 1) or (F x1 − 1, F x2 + 1) to signify one data bit 1, where the cost is 2. To hide the two data bits 11 into zero coefficients, the cost is therefore 4, whereas it is only 1 with our method, which takes full advantage of the coefficient distribution.

Table 1 Average numbers of different QDCT coefficient values in one frame of Ballroom

Table 2 shows the embedding performance of four schemes where the threshold U is 0. Different hiding capacities are achieved for different videos, so different amounts of data are hidden into different videos. The proposed 3D-HS-based RDH method is used for hiding data in these schemes except Ma et al.’s scheme [3], which is denoted by Ma. Compared with Ma, where data is hidden into the I0 frame without causing intra-frame distortion drift, P0_drift increases PSNR, SSIM and embedding efficiency e by 0.163 dB, 0.00005, and 1.2 for hiding 500 bits of data in one frame of Crowd, respectively. Compared with P0_drift, PSNR, SSIM and embedding efficiency e are enhanced by 0.003 dB, 0.00013 and 0.03 by P0_interMB for Crowd, respectively. Compared with P0_interMB, PSNR, SSIM and embedding efficiency e are enhanced by 0.083 dB, 0.00033 and 0.05 by b4_interMB for Crowd, respectively. On the whole, compared with Ma, b4_interMB is superior, enhancing PSNR, SSIM, and embedding efficiency e by at least 0.156 dB, 0.00008 and 0.71, respectively.

Table 2 Hiding performance of four schemes for hiding data into one frame on average

In Table 3, the three QDCT AC coefficients F 2, F 5, and F 3 are used for hiding information. Compared with the conventional 3D HS, the presented 3D HS improves PSNR, SSIM, and embedding efficiency e by at least 0.02 dB, 0.00004, and 0.41, respectively. The marked frames of Akko&Kayo, Exit and Race are shown in Fig. 11. Frames (a–c) are obtained by employing the conventional 3D HS to embed data, and frames (d–f) by using the proposed 3D HS; their original frames are shown in Fig. 10. Apparent distortion is easy to find in frames (a–c): there is large distortion in the top-left corner of frame (a), many square artifacts can be seen on the back of the man in frame (b), and in the top middle of frame (c) the distortion appears on the trees. By contrast, little distortion can be seen in frames (d–f). The experimental results verify that superior visual quality is obtained by using the proposed 3D HS method to hide information.

Table 3 Embedding performance of the conventional 3D HS and the presented 3D HS for hiding 500 bits of information into the first P0 frame
Fig. 11
figure 11

The marked frames of Akko&Kayo, Exit and Race with conventional 3D HS and the proposed 3D HS

To compare the proposed RDH algorithm based on 3D HS with other RDH schemes in the same environment, embeddable blocks, which can be utilized for hiding data without parallax or intra-frame distortion drift, are chosen from the inter-MBs of P0 frames, as shown in Fig. 7. Huang and Chang’s scheme [30] is denoted as Huang. In Huang and our method, information is hidden into the three QDCT AC coefficients F 2, F 5, and F 3; the coefficient pair (F 2, F 5) is used by Shi et al.’s scheme [13], denoted as Shi, and by Ou et al.’s scheme [36], denoted as Ou; and F 2 is used by Chung et al.’s scheme [7], denoted as Chung. For every line in Fig. 12, data is hidden into the embeddable blocks that satisfy |F 0| ≥ U, and the five points of each line, from left to right, correspond to the thresholds U = 4, 3, 2, 1 and 0, respectively.

Fig. 12
figure 12

Comparison of hiding performance with other RDH methods on MVC videos

It is obvious that the best PSNR, SSIM, and embedding efficiency e, and the smallest bit-rate increase, are obtained by the presented method when the same amount of data is embedded. We aim to hide two data bits with at most one modification, so the embedding efficiency is improved, i.e., more information can be hidden for the same distortion. Consequently, when the same quantity of information is embedded, less modification is induced and the video quality is better, as verified by the higher PSNR and SSIM. Additionally, the small cost leads to a lower bit-rate increase, which demonstrates that hiding data with our method has little impact on the coding efficiency of H.264.

In Table 4, the threshold U is 0, and 700 bits of information are embedded into each P0 frame on average. Compared with the other schemes, the presented method increases PSNR and SSIM by at least 0.006 dB and 0.00005, respectively. The best SSIM and PSNR mean that the best video quality is obtained with our algorithm. Furthermore, the embedding efficiency e of the presented scheme is much higher than that of the other schemes, which demonstrates that when the same quantity of cover bits is changed, more information bits can be embedded by our algorithm than by the other schemes.

Table 4 Embedding performance for hiding 700 bits of information into one P0 frame on average

5 Conclusions

An efficient 3D HS-based RDH algorithm is presented in this paper. This reversible method can be used for hiding data in images and videos; image RDH variants such as 3D difference HS and 3D prediction-error HS will be applied and verified in our future work. In this paper, the proposed 3D HS algorithm is used to embed data into the QDCT coefficients of MVC video. Three coefficients chosen from each embeddable block are used for hiding two bits of information, and in most cases at most one coefficient is changed by adding 1. Superior payload-distortion performance is achieved by the proposed scheme compared with some state-of-the-art RDH methods. To further improve the hiding capacity and decrease the distortion, the presented method will be generalized to hide more than two data bits with at most one modification in the future.