
1 Introduction

Cryptographic hash functions (CHFs) [1] are a relatively recent addition to the cryptographic toolbox. They are used in application areas where the integrity of data, the authenticity of the source, and the non-repudiation of delivery are important, including digital forensics, digital signatures, communication protocols such as SSL, cryptocurrencies, etc. They are also critical for cloud computing services that require ensuring the integrity and correctness of transmitted data, e.g., validation of downloaded files. Security is considered the most critical aspect of all cryptographic primitives, and CHFs are no different. For a majority of the popular CHFs, such as the SHA-x family [2], the notion of security is based on computational complexity. This has adverse implications for performance, as the sequential nature of digest calculations in most algorithms leads to unwanted delays. Because of this sequential design, the software performance of many hash functions on modern architectures, while decent in terms of speed and the no. of cycles used, is not optimal, resulting in wasted CPU cycles. The volume of data, easily running into terabytes, further deteriorates performance, as the compute-intensive nature of these algorithms makes the calculation of hash codes for large files cumbersome and time-consuming. In 2007, the National Institute of Standards and Technology (NIST) announced a call for candidate hash functions for the next standard, SHA-3 [3], even though SHA-2 was still secure, because the growing sophistication of attacks since Wang et al. [4] had reduced the expected lifespan of hash functions. The SHA-3 competition introduced parallelizability for increased performance as a desirable feature for the new hash function, suggesting that hashing algorithms need to be fast and efficient as much as they need to be secure. It is notable, however, that the focus was on parallelizable rather than parallel CHFs, implying that the design approach continued to remain sequential. It is suggested that a parallel and distributed approach toward CHFs would not only improve performance but also enhance their security. This paper presents a new parallel framework for CHFs to deal with the problem of long computations by lowering the computational complexity through parallel implementation on multi-core processors.

The paper is organized as follows: Sect. 2 reviews previous work on parallel CHFs, followed by a detailed description of the proposed framework in Sect. 3. Section 4 presents the security analysis of the framework, giving arguments in support of the pre-image resistance, weak collision resistance, collision resistance, non-correlation resistance, partial pre-image resistance, and pseudorandom oracle preservation properties. Section 5 briefly discusses the performance metrics used to evaluate the framework. The conclusions, along with the scope for future research, are given in Sect. 6.

2 Previous Work

A comprehensive study of the literature suggests that, prior to the SHA-3 competition, parallelization of CHFs was not on the agenda of many researchers [5]. The evolution of parallel CHFs should thus be viewed in the light of the SHA-3 competition, dividing the timeline into three phases, namely pre-SHA-3, the years of the competition itself, and post-SHA-3. In the initial phase, efforts were concentrated on software performance analysis of existing CHFs on the newly arrived Pentium processors [6,7,8], with the focus largely on fine-grained parallelism. Hardware parallelism with FPGAs and ASICs was also prevalent and discussed in a few papers [9,10,11]. The arrival of GPUs in 2006 was a real game-changer for parallel CHFs, as demonstrated by [12, 13]. The SHA-3 competition established parallelism/parallelizability of algorithm designs as an evaluation criterion, leading to the emergence of several promising designs such as BLAKE [14], Grøstl [15], the MD6 message-digest algorithm [16], Skein [17, 18], Keccak [19], etc. These were subject to rigorous tests and analyses, and eventually, in 2013, Keccak was declared the winner of the SHA-3 competition because of its novel sponge construction [20]. The post-SHA-3 era saw the emergence of improved versions of SHA-3 candidates such as BLAKE2 [21], SHAKE [22], ParallelHash [23], etc. Some work was also done in the domain of lightweight hash functions for resource-constrained environments [24,25,26,27]. Presently, efforts are being made toward standardization of tree modes for hash functions in order to achieve medium- to coarse-grained parallelism [28,29,30]. The efforts in the three phases should be seen as a continuation of one another; the three-stage classification does not imply obsolescence of the techniques of the previous stage(s) but merely signifies a change in approach toward parallel CHFs.

In 2014, Kishore and Kapoor [31] proposed the ITRH transformation for a faster, more secure CHF based on recursive tree hashing and presented its framework along with a detailed analysis. An algorithm, RSHA-1, was proposed as well. A speedup of up to 3.5\(\times \) was observed when large files were hashed using the proposed framework. Additionally, linear speedup proportional to the no. of processing elements was observed. However, during security analysis, it was observed that the proposed transformation was vulnerable to certain attacks, a side effect of removing the chaining dependencies of the underlying function.

In order to overcome these deficiencies, this paper proposes an improved parallel transformation, Enveloped Inverted Tree Recursive Hashing (EITRH), which comes with an enhanced security level. The design is modeled along the lines of ITRH but uses enveloping [32] as an additional security measure.

3 The EITRH Transformation

Definition 1

Let \(\{0,1\}^*\) denote the set of all bit strings, let \(M \in \{0,1\}^* \) be a message, and let k be the no. of blocks of size l. Then the keyless EITRH (\(H{:}\,\{0,1\}^* \rightarrow \{0,1\}^n\)) accepts a message \(M \in \mathcal {M}\) and maps it to an n-bit digest as follows:

$$ h(i) = \left\{ \begin{array}{ll} h(M_i,H_0), &{} \quad 0 < i \le k-1 \\ h(M_i,H_{00}), &{} \quad i = k \end{array} \right. $$

and

$$\begin{aligned} H_i = h_1 \parallel h_2 \ldots \parallel h_k, M \rightarrow H_i\,\text {until}\, H_i \rightarrow \{0,1\}^n \end{aligned}$$

Definition 2

Let \(\{0,1\}^*\) denote the set of all bit strings, let \(M \in \{0,1\}^* \) be a message, and let k be the no. of blocks of size l. Then the keyed EITRH (\(H_k{:}\, \{0,1\}^{\mathscr {K}} \times \{0,1\}^* \rightarrow \{0,1\}^n\)) accepts a message \(M \in \mathcal {M}\) along with a fixed-length key \(\mathscr {K} \in \mathcal {M}\) and maps it to an n-bit digest as follows:

$$ h(i) = \left\{ \begin{array}{ll} h(M_i,H_0), &{} \quad 0 < i \le k-1 \\ h(M_i,H_{\mathscr {K}}), &{} \quad i = k \end{array} \right. $$

and

$$ H_i = h_1 \parallel h_2 \ldots \parallel h_{k-1} \parallel h_k, M \rightarrow H_i\,\text {until}\, H_i \rightarrow \{0,1\}^n $$

Definitions 1 and 2 formally describe the keyless EITRH and keyed EITRH, respectively [33]. The sub-sections of this section further explain the structure and properties of the framework.

3.1 Describing the Framework

As depicted in Fig. 1, EITRH has an inverted tree structure. The transformation comprises three recursive steps: “message expansion,” “parallel reduction,” and “hash value generation.” The process terminates when the size \(h_n\) of the final hash value is reached, where \(h_n\) depends upon the EITRH variant being used.

Fig. 1 Enveloped Inverted Tree Recursive Hashing (EITRH) framework

Step 1: “Message Expansion” This step is performed at every level of the inverted tree of height \(h_t\) (determined by the file size and the EITRH variant being used) and is necessary for improving the sensitivity of the hashing process. The message \(M_i\) at level \(h_i\) (\(0< i < h_t\)) is divided into k blocks of size l (determined by the variant used), where \(k = \lceil | M_i | / l \rceil \). Padding is applied to the kth (last) block by appending the bit string 10* followed by the length of \(M_i\). The no. of 0s is adjusted to make the padded block length a multiple of l.

$$ M_i = M_i \parallel 10 \ldots 0 \parallel \left| M_i \right| \quad \text {for each level}\ h_i . $$

Parallelism is achieved by processing the blocks concurrently on a multi-core processor. For enveloping, two distinct initial vectors of size \(h_n\) are used: \(\text {IV}_1 (H_0)\) for the first \(k-1\) blocks and \(\text {IV}_2 (H_{00})\) for the kth block.
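To make the expansion step concrete, the following is a minimal sketch in Python. It works at byte granularity and uses an illustrative padding layout (a 0x80 marker byte, a run of zero bytes, and an 8-byte length field); the function name, byte-level granularity, and exact layout are assumptions for illustration, not the reference specification.

```python
def expand_message(message: bytes, l: int) -> list[bytes]:
    """Step 1 sketch: pad the message with 10...0 || |M_i| and split it
    into k blocks of l bytes each (illustrative byte-level layout)."""
    # reserve 1 byte for the 0x80 (binary 10000000) marker and 8 bytes for the length
    pad_zeros = (-(len(message) + 1 + 8)) % l
    padded = (message
              + b"\x80"
              + b"\x00" * pad_zeros
              + len(message).to_bytes(8, "big"))
    return [padded[i:i + l] for i in range(0, len(padded), l)]
```

The first \(k-1\) blocks would then be hashed with \(\text {IV}_1\) and the kth block with \(\text {IV}_2\), as described in Step 2.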

Step 2: “Parallel Reduction” For each block, reduction is performed in parallel using the nonlinear compression function f() and the corresponding IVs (\(\text {IV}_1 (H_0)\) for the first \(k-1\) blocks and \(\text {IV}_2 (H_{00})\) for the kth block). Once the hash values are computed for each of the blocks, they become internal state hashes for the next level. These values are concatenated to generate \(H_i\), which acts as the message M for the next level of the tree. Thus, an \(h_n\)-bit collective hash \(H_i\) is obtained by recursive application of steps 1 and 2.

$$ h(i) = \left\{ \begin{array}{ll} h(M_i,H_0), &{} \quad 0 < i \le k-1 \\ h(M_i,H_{00}), &{} \quad i = k \end{array} \right. $$

This step is uniform for each block of \(M_i\) and is performed independently, since there is no chaining input from the previous blocks.

Step 3: “Hash Generation” Once all the blocks of level \(h_t - 1\) have been processed using steps 1 and 2, the final digest is generated by \(h = H(M_{h_t-1},H_{00})\).

Algorithm 1 presents the pseudocode for the EITRH domain extender in a data parallel model.

Algorithm 1 Pseudocode for the EITRH domain extender (data-parallel model)
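The following self-contained Python sketch illustrates one way such a data-parallel domain extender could look. SHA-1 from hashlib stands in for the nonlinear compression function f(); the block size, the IV values, and the convention of prepending the IV to the block are illustrative assumptions rather than the reference design.

```python
import hashlib
from concurrent.futures import ProcessPoolExecutor

BLOCK_SIZE = 64                   # l, in bytes (illustrative)
DIGEST_SIZE = 20                  # h_n for an RSHA-1-like variant (160 bits)
IV1 = b"\x00" * DIGEST_SIZE       # H_0   (placeholder value)
IV2 = b"\xff" * DIGEST_SIZE       # H_00  (placeholder value)

def expand_message(message: bytes, l: int = BLOCK_SIZE) -> list[bytes]:
    """Step 1: pad with 10...0 || |M_i| and split into l-byte blocks (as sketched above)."""
    pad_zeros = (-(len(message) + 1 + 8)) % l
    padded = message + b"\x80" + b"\x00" * pad_zeros + len(message).to_bytes(8, "big")
    return [padded[i:i + l] for i in range(0, len(padded), l)]

def compress(block: bytes, iv: bytes) -> bytes:
    """Stand-in for the nonlinear compression function f(M_i, IV)."""
    return hashlib.sha1(iv + block).digest()

def eitrh(message: bytes, iv2: bytes = IV2) -> bytes:
    """Recursive EITRH domain extender sketch: expand, reduce in parallel, recurse."""
    with ProcessPoolExecutor() as pool:
        level = message
        while True:
            blocks = expand_message(level)                    # step 1: message expansion
            ivs = [IV1] * (len(blocks) - 1) + [iv2]           # envelope the last block
            hashes = list(pool.map(compress, blocks, ivs,     # step 2: parallel reduction
                                   chunksize=1024))
            level = b"".join(hashes)                          # H_i = h_1 || h_2 || ... || h_k
            if len(level) <= DIGEST_SIZE:                     # step 3: final digest reached
                return level

if __name__ == "__main__":
    print(eitrh(b"abc" * 1_000_000).hex())
```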

3.2 Properties

As described in Sect. 2, EITRH is an improved construction designed to overcome the security flaws in the ITRH construction. Like ITRH, it is a recursion-based construction. Passing \(\text {IV}_1 (H_0)\) to each block separately removes data dependencies, making parallel reduction possible in both constructions. However, unlike its predecessor, EITRH is an enveloped construction: it uses an additional input \(\text {IV}_2 (H_{00})\) for hashing the last block at each level of the tree. Enveloping the last block in this manner improves the security level of the transformation and its non-correlation, making it resistant to multi-collision attacks and partial pre-image attacks, in addition to the generic attacks. Also, due to \(\text {IV}_2\), it is possible to use EITRH as both a keyless and a keyed construction: if \(H_{00}\) in Definition 1 is replaced with a fixed-length key \(\mathscr {K}\), as defined in Definition 2, a keyless hash function is transformed into a keyed variant simply by replacing \(\text {IV}_2\) in the compression function with \(\mathscr {K}\), without any separate provision for handling the key.
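In terms of the illustrative sketch given after Algorithm 1 (where the second IV is already a parameter of the assumed eitrh function), the keyed variant of Definition 2 then needs no separate key-handling code; the snippet below is a hedged illustration reusing those assumed names.

```python
def eitrh_keyed(message: bytes, key: bytes) -> bytes:
    """Keyed EITRH sketch: the fixed-length key K simply takes the place of IV2 (H_00)."""
    assert len(key) == DIGEST_SIZE        # K is a fixed-length key of h_n bits
    return eitrh(message, iv2=key)
```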

3.3 EITRH Variants

Table 1 lists the CHF variants using the EITRH construction and their internal state sizes. The proposed variants are derived from the SHA family, with EITRH applied to enhance security and parallelization. RSHA-1 [31], like SHA-1, gives a digest of 160 bits. Similarly, the digest sizes of RSHA-224, RSHA-256, RSHA-384, and RSHA-512 correspond to those in the SHA-2 standard.

This paper presents the analysis of EITRH based on the experiments done using RSHA-1. The outcomes can be extended to other variants as well.

Table 1 CHFs based on EITRH

4 Security Analysis of EITRH

The EITRH transformation is designed to be a multi-property preserving domain extension transformation which can preserve multiple properties in addition to the ideal properties, viz. pre-image resistance (PIR), weak collision resistance (WCR), and collision resistance (CR). The sub-sections of this section provide the security analysis of the proposed framework.

4.1 Pre-image Resistance (PIR)

The pre-image resistance property refers to the one-wayness of hash functions: inverting the hash value to retrieve the original message should be computationally infeasible. The PIR property is strongly preserved in EITRH as compared to the MD construction because of its recursive nature. The fact that the final digest is itself a hash of hashes makes backtracking to deduce the input message computationally difficult.

4.2 Weak Collision Resistance (WCR)

Weak collision resistance, also called second pre-image resistance, is the property which ensures that, given a message M, it is difficult for an adversary (A) to find an \(M''\) that generates an identical hash. Due to this property, attacks involving falsification of the message may be thwarted by using an encrypted hash. For an n-bit digest, the effort required to find a second pre-image is proportional to \(2^n\). In Sect. 4.4, it is shown that the EITRH transformation supports the “avalanche effect.” Therefore, for any message, it would be computationally infeasible for an adversary to find another distinct message mapping to the same hash value, as the level of effort required for this type of attack is \(2^n\), which increases exponentially with n.

4.3 Collision Resistance (CR)

The strong collision resistance property implies that it should be computationally difficult for A to find two distinct messages M and \(M''\) such that \(H(M)=H(M'')\). For an n-bit digest, attacking CR requires an effort of \(2^{\frac{n}{2}}\), much less than for pre-image and second pre-image attacks, on account of the birthday paradox. Consider two inputs M and \(M''\) (where \(M\ne M''\)) of the same size, differing by \(\Delta \), and let the difference be confined to only half of the input blocks. A partial collision may occur, affecting the intermediate hashes and/or the input to the next stage. Due to recursion, a collision in one half of the intermediate hashes ensures that the other half still differs by \(\Delta \). Further, the colliding values become input for the nonlinear compression function f() in the next stage, which uses sub-key values distinct from those used in the previous stages. As a result, the internal collision is unlikely to persist in the succeeding stages.

4.4 Non-correlation Resistance (Confusion and Diffusion Analysis)

The avalanche effect for CHFs, as in encryption, requires a complex correlation between the input and hash value bits, such that a change in the former should cause a drastic change in at least half of the output bits.

Fig. 2 Testing EITRH for the confusion and diffusion properties by measuring the distribution of the changed bits \(L_i\) in a simulation. The algorithm used is RSHA-1 with a digest size of 160 bits. The values of \(L_i\) suggest a change in more than 50% of the output bits with every single-bit change in the input file

EITRH demonstrated the avalanche effect when subjected to tests for confusion and diffusion on a simulator. A random text file from the t5-corpus was used for this purpose; its hash was calculated using the EITRH variant RSHA-1. A random bit in the original input file was toggled and the hash was re-calculated. The two hash values were XORed and the set bits were counted, giving the no. of output bits that had changed. The test was repeated N times (where \(N=128\), 256, 512 or 1024), and the results shown in Fig. 2 were plotted after the following calculations.

$$\begin{aligned} \text {Minimum no. of bits changed}{:}\,\, L_{\text {min}} = \min (\{L_i\}^N_{i=1}) \end{aligned}$$
(1)
$$\begin{aligned} \text {Maximum no. of bits changed}{:}\, \,L_{\text {max}} = \max (\{L_i\}^N_{i=1}) \end{aligned}$$
(2)
$$\begin{aligned} \text {Mean no. of bits changed}{:}\,\, \overline{L} = \frac{1}{N} \sum ^N_{i=1} L_i \end{aligned}$$
(3)
$$\begin{aligned} \text {Mean Changed Probability}{:}\,\, \text {Pr} = \frac{\overline{L}}{\text {Hash Size}} \times 100 \% \end{aligned}$$
(4)
$$\begin{aligned} \text {Standard Deviation of bits changed}{:}\,\, \sigma _L = \sqrt{\frac{1}{N-1}\sum ^N_{i=1} (L_i - \overline{L})^2} \end{aligned}$$
(5)
$$\begin{aligned} \text {Standard deviation of changed probability}{:} \,\, \sigma _{\text {Pr}} = \sqrt{\frac{1}{N-1}\sum ^N_{i=1} \bigg (\frac{L_i}{\text {Hash Size}} - \text {Pr}\bigg )^2} \times 100 \% \end{aligned}$$
(6)

Here, N is the total no. of tests, \(L_i\) is the no. of bits changed in the ith test, and \(\sigma _{L}\) and \(\sigma _{\text {Pr}}\) indicate the stability of the confusion and diffusion.
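The test procedure and the statistics of Eqs. (1)–(6) can be reproduced with a short script such as the sketch below. It uses SHA-1 as a stand-in for an RSHA-1 implementation and a random buffer in place of the t5-corpus file; both substitutions, as well as the helper names, are assumptions made for illustration.

```python
import hashlib
import random
import statistics

HASH_BITS = 160
N = 1024                                                  # no. of tests

def digest_int(data: bytes) -> int:
    return int.from_bytes(hashlib.sha1(data).digest(), "big")   # stand-in for RSHA-1

random.seed(1)
message = bytearray(random.randbytes(1 << 20))            # 1 MiB pseudo-random "file"

L = []
for _ in range(N):
    h1 = digest_int(bytes(message))
    bit = random.randrange(len(message) * 8)              # pick a random input bit
    message[bit // 8] ^= 1 << (bit % 8)                   # toggle it
    h2 = digest_int(bytes(message))
    message[bit // 8] ^= 1 << (bit % 8)                   # restore the original input
    L.append(bin(h1 ^ h2).count("1"))                     # no. of changed output bits, L_i

L_bar = statistics.mean(L)
print("L_min =", min(L), " L_max =", max(L))                             # Eqs. (1), (2)
print("mean L = %.2f, Pr = %.2f%%" % (L_bar, 100 * L_bar / HASH_BITS))   # Eqs. (3), (4)
print("sigma_L = %.2f" % statistics.stdev(L))                            # Eq. (5)
print("sigma_Pr = %.2f%%" % (100 * statistics.stdev(x / HASH_BITS for x in L)))  # Eq. (6)
```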

Table 2 records the statistics \(L_{\text {min}}\), \(L_{\text {max}}\), \(\overline{L}\), Pr, \(\sigma _{L}\), and \(\sigma _{\text {Pr}}\) calculated using Eqs. (1)–(6), respectively. The table also shows the mean values of these statistics. It can be noted that the mean values of \(L_i\) and Pr for EITRH-based RSHA-1 are 83.94 and 52.46%, respectively. These are close to the benchmark values for a standard 160-bit CHF such as SHA-1, i.e., 80 bits and 50%. \(\sigma _{L}\) and \(\sigma _{\text {Pr}}\) indicate the stability of the algorithm's confusion and diffusion capability. The results show that the proposed algorithm is resistant to statistical attacks.

Table 2 Statistics of the no. of changed bits \(L_i\)

4.5 Partial Pre-image Resistance (PPR)

Partial pre-image resistance, or local one-wayness, is the computational difficulty of retrieving the message even partially, given that A knows a part of the message. The EITRH transformation preserves the PPR property. Since the final digest is a hash of hashes, it is difficult for A to recover the message even partially: each intermediate hash at every level is itself an n-bit digest and thus can never lead to the discovery of the original message, even partially.

4.6 Pseudorandom Oracle Preservation (PRO-Pr)

The EITRH transformation is an inverted tree hash inspired by the EMD transformation. EITRH, like EMD, uses two distinct initial vectors (IVs). The second IV is used as an input while hashing the last block of each level. This leads to a high probability of the random oracle behaving independently at the last application. Since EMD is proven to be PRO-Pr [32], EITRH, which uses the same principle, is PRO-Pr.

5 Performance Evaluation

The sub-sections of this section present an evaluation of EITRH variants based on the performance metrics for a parallel program, namely complexity, speedup, efficiency, cost optimality and iso-efficiency, using some analytical tools [31].

5.1 Algorithm Complexity

For calculating the algorithmic complexity of EITRH, the following assumptions were made.

Suppose the complexity of calculating the hash value H(M(i)) for each message block M(i) is \(\text {Cmplx}(H(M(i)))\) and the no. of original blocks is b. Then the computational complexity of the standard CHF model is given by Eq. (7).

$$\begin{aligned} b\cdot \text {Cmplx}(H(M(i))) \end{aligned}$$
(7)

The first phase of EITRH on p processors, where \(p \ge 1\), has the complexity

$$\begin{aligned} \frac{b}{p}\cdot \text {Cmplx}(H(M(i))) \end{aligned}$$
(8)

For the parallel reduction phase of EITRH, the no. of message blocks determines the no. of times H(M(i)) is called. After each reduction, the length of the remaining message is \(\frac{x}{y}\) times the length of the message at the previous level, where x is the size of the hash to be generated and y is the block size; x and y depend upon the variant of EITRH. Therefore, to arrive at the x-bit hash value, \(\lceil {\text {log}}_{\frac{x}{y}}b\rceil \) recursive reductions are required, and the no. of times H(M(i)) is called, \(n_{\text {call}}\), is calculated as follows:

$$\begin{aligned} \begin{aligned} n_{\text {call}}&= \bigg \lceil \bigg (\frac{x}{y}\bigg )^1\cdot \frac{b}{p}\bigg \rceil +\bigg \lceil \bigg (\frac{x}{y}\bigg )^2\cdot \frac{b}{p}\bigg \rceil \cdots +\bigg \lceil \bigg (\frac{x}{y}\bigg )^z\cdot \frac{b}{p}\bigg \rceil \\&= \sum ^z_{i=1}\bigg \lceil \bigg (\frac{x}{y}\bigg )^i\cdot \frac{b}{p}\bigg \rceil \\ \end{aligned} \end{aligned}$$
(9)

where

$$ z=\lceil {\log }_{\frac{x}{y}}b\rceil , \quad \frac{x}{y} = \frac{\text {Hash Size}}{\text {Block Size}} $$
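As a small numeric illustration, Eq. (9) can be evaluated directly. In the sketch below, the number of levels z is computed from the magnitude of the exponent, i.e., as \(\lceil \log b / \log (y/x) \rceil \), reflecting the fact that the block count shrinks by the factor \(\frac{x}{y}\) at every level; the concrete values of x, y, b, and p are assumptions chosen for an RSHA-1-like variant.

```python
import math

def n_call(b: int, p: int, x: int = 160, y: int = 512) -> int:
    """Total no. of compression-function calls in the reduction phase, per Eq. (9)."""
    z = math.ceil(math.log(b) / math.log(y / x))      # no. of recursive reductions
    return sum(math.ceil((x / y) ** i * b / p) for i in range(1, z + 1))

# e.g. a 64 MiB file split into 512-bit blocks, hashed on p = 8 cores (illustrative)
b = (64 * 2**20 * 8) // 512
print(b, n_call(b, p=8))          # prints the block count and the reduction-phase call count
```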

Now, the complexity for the complete reduction process is given by:

$$\begin{aligned} O(\text {rdc}) = n_{\text {call}}\cdot \text {Cmplx}(H(M(i))) \end{aligned}$$
(10)

From Eqs. (8) and (10), the overall computational complexity \(f(\text {rdc})\) of EITRH transformation is given as:

$$\begin{aligned} \begin{aligned} f(\text {rdc})&= \frac{b}{p}\cdot \text {Cmplx}(H(M(i)))+n_{\text {call}}\cdot \text {Cmplx}(H(M(i))) \\&= \bigg (\frac{b}{p}+n_{\text {call}}\bigg )\cdot \text {Cmplx}(H(M(i))) \end{aligned} \end{aligned}$$
(11)

Therefore, from Eqs. (7) and (11), it may be concluded that even though the EITRH transformation requires more computations than the Merkle-Damgård (MD) construction, in terms of asymptotic computational complexity EITRH is at par with the MD construction.

5.2 Speedup

Suppose that the reference CHF model (SHA-1 in this case) is able to hash a message block M(i) in time T(M(i)), measured as the no. of elementary steps in the computation (assuming each step completes in constant time).

Thus, for b message blocks, time \(=b\cdot T(M(i))\)

Therefore, the sequential time can be given as:

$$\begin{aligned} T_s=b\cdot T(M(i))=\Theta (b) \end{aligned}$$
(12)

From Fig. 1, we know that the process of calculating a hash using EITRH involves producing the intermediate hashes of the sub-blocks, concatenating these hashes, and using recursion to determine the final hash value. The time complexity of EITRH on p processors, where \(p\ge 1\), is computed as follows:

Let T(M(i)) be the time required to hash the ith block.

Let \(t_c\) be the constant time required for concatenating the intermediate hashes.

From Eq. (9), the no. of recursive calls (\(n_{\text {call}}\)) is calculated as:

$$\begin{aligned} n_{\text {call}}&= \bigg \lceil \bigg (\frac{x}{y}\bigg )^1\cdot \frac{b}{p}\bigg \rceil +\bigg \lceil \bigg (\frac{x}{y}\bigg )^2\cdot \frac{b}{p}\bigg \rceil \cdots +\bigg \lceil \bigg (\frac{x}{y}\bigg )^z\cdot \frac{b}{p}\bigg \rceil \\&= \sum ^z_{i=1}\bigg \lceil \bigg (\frac{x}{y}\bigg )^i\cdot \frac{b}{p}\bigg \rceil \\ \end{aligned}$$

where

$$\begin{aligned} z=\lceil {\text {log}}_{\frac{x}{y}}b\rceil \quad \frac{x}{y} = \frac{\text {Hash Size}}{\text {Block Size}} \end{aligned}$$

Time taken for reduction and recursive calls \(=n_{\text {call}}\cdot T(M(i)).\)

So, parallel time is given by:

$$\begin{aligned} T_p&= \frac{b}{p}\cdot T(M(i))+n_{\text {call}}\cdot T(M(i))+t_c \nonumber \\&=T(M(i))\cdot \bigg (\frac{b}{p}+n_{\text {call}} \bigg ) =\Theta \bigg (\frac{b}{p}+n_{\text {call}} \bigg ) \end{aligned}$$
(13)

Based on the above results, total parallel overhead is given by:

$$\begin{aligned} T_o&= p\cdot T_p-T_s \end{aligned}$$
(14)
$$\begin{aligned}&= p\cdot T(M(i))\cdot \bigg (\frac{b}{p}+n_{\text {call}} \bigg )-b\cdot T(M(i)) \nonumber \\&=p\cdot n_{\text {call}}\cdot T(M(i)) =\Theta (p\cdot n_{\text {call}}) \end{aligned}$$
(15)

Now, from Eqs. (12) and (13), speedup can be given as:

$$\begin{aligned} S(b)&= \frac{T_s}{T_p} =\frac{b}{\bigg (\frac{b}{p}+ n_{\text {call}}\bigg )} =\frac{b}{b\cdot \bigg (\frac{1}{p}+ \frac{n_{\text {call}}}{b}\bigg )}\nonumber \\&=\frac{1}{\bigg (\frac{1}{p}+ \frac{n_{\text {call}}}{b}\bigg )} =\Theta \bigg (\frac{p}{1+ \frac{p}{b}\cdot n_{\text {call}}}\bigg ) \end{aligned}$$
(16)

The speedup S(b) (from Eq. (16)) is a function of both the problem size b and the no. of processing cores p. The growth of S(b) is sub-linear with respect to b. Therefore, if b is increased while keeping p fixed, there is no significant increase in speedup, as observed in Fig. 3. However, there is a proportional increase in speedup if both b and p are increased, as shown in Fig. 4.
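A short calculation based on Eqs. (9) and (16) reproduces this behavior; the variant parameters, file sizes, and core counts below are illustrative assumptions.

```python
import math

def n_call(b: int, p: int, x: int = 160, y: int = 512) -> int:
    """Reduction-phase call count from Eq. (9)."""
    z = math.ceil(math.log(b) / math.log(y / x))
    return sum(math.ceil((x / y) ** i * b / p) for i in range(1, z + 1))

def speedup(b: int, p: int) -> float:
    """S(b) = T_s / T_p = b / (b/p + n_call) from Eq. (16)."""
    return b / (b / p + n_call(b, p))

def blocks(mib: int) -> int:
    """File size in MiB -> no. of 512-bit blocks."""
    return (mib * 2**20 * 8) // 512

# fixed p = 32, growing b: speedup stays nearly flat (cf. Fig. 3)
print([round(speedup(blocks(m), 32), 2) for m in (64, 256, 1024)])

# b and p grown together: speedup grows roughly in proportion (cf. Fig. 4)
print([round(speedup(blocks(64 * k), 8 * k), 2) for k in (1, 2, 4, 8)])
```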

Fig. 3 Speedup versus file size (b): simulation results showing that the speedup is almost constant (around 26) when b is increased while the no. of cores (p) is kept fixed

Fig. 4 Speedup versus the no. of processing elements (p) and problem size (b): simulation results showing that the speedup increases when both b and p are increased proportionally

5.3 Efficiency

Efficiency is an indicator of resource utilization. It is measured as the fraction of time for which the processor is usefully employed. Its value (E(b)) lies in the range \(\frac{1}{b}\le E(b) \le 1.\) The efficiency of the EITRH variant on p processors can be calculated as:

$$\begin{aligned} E(b)&= \frac{\text {Speedup}}{\text {No. of processors}}=\frac{T_s}{p\cdot T_p} \end{aligned}$$

E(b) can be expressed in terms of the parallel overhead, using Eqs. (14) and (15):

$$\begin{aligned}&E(b)= \frac{1}{1+\frac{T_o}{T_s}} \quad \therefore \quad E(b)= \Theta \bigg (\frac{1}{1+ \frac{p\cdot n_{\text {call}}}{b}}\bigg ) \end{aligned}$$
(17)

Generally, \(b \gg p\); nevertheless, \(\frac{p\cdot n_{\text {call}}}{b}\) remains nonzero, so the efficiency cannot reach 1 due to the parallel overhead. From Eq. (17), it is evident that, unlike speedup, efficiency increases if the problem size is increased while keeping the no. of processing elements fixed. However, due to Amdahl's law, the speedup tends to saturate and, correspondingly, the efficiency drops as p increases, as shown in Figs. 4 and 5.
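Under the same assumptions, Eq. (17) can be evaluated by reusing the n_call and blocks helpers from the speedup sketch above; the parameters are again illustrative.

```python
def efficiency(b: int, p: int) -> float:
    """E(b) = 1 / (1 + p * n_call / b) from Eq. (17)."""
    return 1.0 / (1.0 + p * n_call(b, p) / b)

# fixed p = 32, growing file size: efficiency improves slightly and then saturates (cf. Fig. 5)
print([round(efficiency(blocks(m), 32), 3) for m in (1, 8, 64)])
```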

Fig. 5 Efficiency versus the no. of processors (p) and problem size (b): simulation results showing that efficiency improves when b is increased while keeping p constant, and drops as p increases

5.4 Cost Optimality

In parallel systems, cost optimality means that the cost of solving a problem, as a function of input size, has the same asymptotic growth as the fastest known sequential algorithm on a single core [34]. The cost of using EITRH framework for computing hash value is:

$$\begin{aligned} T_{\text {cost}}&= p\cdot T_p \nonumber \\&= p\cdot T(M(i))\cdot \bigg (\frac{b}{p}+n_{\text {call}}\bigg ) \nonumber \\&= \Theta (b+p\cdot n_{\text {call}}) \end{aligned}$$
(18)

From Eq. (18), it may be concluded that as long as the input size \(b=\Omega (p\cdot n_{\text {call}})\), EITRH is cost optimal as the cost is \(\Theta (b)\), which is the same as the cost for sequential algorithm (see Eq. (12)).

5.5 Minimum Execution Time

For a problem of fixed size, increasing the no. of processing cores does not always lead to an improvement in the execution time of the program. Initially, the execution time decreases until it reaches a minimum value; after this minimum, any further increase in p leads to an increase in run-time. Therefore, the minimum parallel execution time \(T_p^{\text {min}}\), given by Eq. (19), can be calculated as follows:

$$\begin{aligned} \frac{d}{dp}T_p&= 0\\ \frac{d}{dp}T_p&=\frac{d}{dp}\bigg (\frac{b}{p}+n_{\text {call}}\bigg )=-\frac{b}{p^2}+c \end{aligned}$$

where c is a constant (taken as unity here).

$$ \therefore \,\,b=p^2 \implies p=\sqrt{b} $$

Substituting \(p=\sqrt{b}\) in Eq. (13), we get

$$\begin{aligned} T_p^{\text {min}}=2\sqrt{b} \end{aligned}$$
(19)

5.6 Scalability

Scalability is defined as the ability of a parallel system to maintain its efficiency at a constant level even as the size of the system is expanded by increasing both the problem size and the no. of processing elements. It is measured using an iso-efficiency function [34], defined as follows:

$$\begin{aligned} \text {Problem size}\ (W)=\frac{E}{1-E}\cdot T_o(T_s,p) \end{aligned}$$

From Eq. (15), the parallel overhead function of EITRH is approximately \(p\cdot n_{\text {call}}.\) Therefore,

$$\begin{aligned} W=\frac{E}{1-E}\cdot p\cdot n_{\text {call}} \end{aligned}$$
(20)

Using asymptotic notation, the iso-efficiency of the EITRH framework is \(\Theta (p\cdot n_{\text {call}}).\) This suggests that, if the no. of processors/cores is increased by a factor of \(\frac{p'}{p}\) (\(p'\) being the increased no. of processors), then in order to gain speedup by the same factor (i.e., to maintain the same efficiency), the problem size b should be correspondingly increased by a factor of \(\frac{p'\cdot n'_{\text {call}}}{p\cdot n_{\text {call}}}.\)

The discussion presented in this section evaluated the EITRH framework with respect to various metrics for parallel programs. The results suggest that the EITRH framework is at par with its sequential counterparts in terms of scalability, efficiency, and cost optimality.

6 Conclusion and Future Work

In this paper, a fast parallel recursive tree-based transformation, called EITRH, was proposed. It can be used to derive parallel hashing algorithms from their sequential counterparts. The proposed transformation combines tree hashing with the EMD construction. It has three recursive steps, viz. “message expansion,” “parallel reduction,” and “hash value generation.” The envelope of the transform, provided by the use of the second initial vector (\(\text {IV}_2\)), makes it suitable for both keyed and keyless CHFs. Further, this paper showed that EITRH satisfies the essential security properties for hash functions, namely PIR, WCR, CR, the avalanche effect, PPR, and PRO-Pr, as discussed in the previous sections. This suggests that EITRH is a secure and flexible hash transformation that is multi-property preserving. Performance evaluation of the framework indicates its scalability for hashing long messages, implying potential applications in digital forensics and the parallelization of digital signatures. In the future, the theoretical results presented in this paper shall be confirmed by practical implementation of CHFs based on the EITRH transformation.