# Novel Efficient Circuit Design for Multilayer QCA RCA



Hamid Reza Roshany<sup>1</sup> · Abdalhossein Rezai<sup>1</sup>

Received: 5 October 2018 / Accepted: 18 February 2019 /Published online: 4 April 2019  $\circled{c}$  Springer Science+Business Media, LLC, part of Springer Nature 2019

# Abstract

The novel emerging technology, QCA technology, is a candidate for replacing CMOS technology. Full Adder (FA) circuits are also widely used circuits in arithmetic circuits design. In this paper, two new multilayer QCA architectures are presented: one-bit FA and 4-bit Ripple Carry Adder (RCA). The designed one-bit multilayer FA architecture is based on a new XOR gate architecture. The designed 4-bit multilayer QCA RCA is also developed based on the designed one-bit multilayer QCA FA. The functionality of the designed architectures are verified using QCADesigner tool. The results indicate that the designed architecture for 4-bit multilayer QCA RCA requires 5 clock phases, 125 QCA cells, and 0.17  $\mu$ m<sup>2</sup> area. The comparison results confirm that the designed architectures provide improvements compared with other adder architectures in terms of cost, cell count, and area.

Keywords Ripple carry adder. Full adder. Quantum-dot cellular automata . Multilayer design, XOR gate

# 1 Introduction

The novel technology, Quantum-dot Cellular Automata (QCA) technology, is a new emerging technology, which is utilized to achieve higher speed and scaling compared to CMOS technology [\[1](#page-12-0)]. Logic states are encoded according to the positions of electrons in this technology  $[2, 3]$  $[2, 3]$  $[2, 3]$  $[2, 3]$ . The XOR gate, majority gate and inverter gate are typical logic gates in the QCA technology [[4,](#page-12-0) [5\]](#page-12-0). Using these QCA gates, the logic circuits can be constructed [\[6](#page-12-0)]. The logic circuits in this technology can be implemented in coplanar or multilayer.

 $\boxtimes$  Abdalhossein Rezai [rezaie@acecr.ac.ir](mailto:rezaie@acecr.ac.ir)

> Hamid Reza Roshany hamidreza.roshany@gmail.com

<sup>&</sup>lt;sup>1</sup> ACECR Institute of Higher Education, Isfahan Branch, Isfahan 84175-443, Iran

On the other hand, Full Adder (FA) circuits are widely used circuits for implementation of complex logic circuits such as Arithmetic Logic Units (ALU) [\[6](#page-12-0), [7\]](#page-12-0). As a result, the implementation of high-performance FA circuits has a great deal of attention. So, many researchers designed FA circuits to increase the efficiency of FA circuit in this technology [[8](#page-12-0)–[12](#page-12-0)]. Balali et al. [[8](#page-12-0)] have designed one-bit QCA FA architecture using a new XOR gate, which requires  $0.02 \mu m^2$  area and 29 cells. Mohamadi et al. [\[9\]](#page-12-0) have developed multilayer onebit FA architecture that requires  $0.02 \mu m^2$  area and 38 cells. Roohi et al. [\[10\]](#page-12-0) have developed multilayer one-bit QCA FA architecture that requires  $0.01 \mu m^2$  area and 23 cells. Navi et al. [[11\]](#page-12-0) have developed one-bit QCA FA, which requires  $0.04 \mu m^2$  area and 73 cells. Hashemi et al. [[12](#page-12-0)] have proposed multilayer one-bit QCA FA, which requires  $0.05 \mu m^2$  area and 79 cells.

We develop a novel QCA XOR gate architecture in this paper and use this novel QCA XOR gate in sum structure for construct a multilayer one-bit QCA FA. Moreover, the novel robust efficient multilayer 4-bit QCA Ripple Carry Adder (RCA) architecture is developed using the designed one-bit QCA FA architecture. The functionality of the designed architectures are verified using QCADesigner tool version 2.0.3. The achieved results demonstrated that the developed architectures provide improvements compared to other QCA architectures in terms of delay time, and area.

The remainder of this study is arranged as follows. In section 2, the background of the QCA technology is presented. The developed architectures are demonstrated in section [3.](#page-8-0) The simulation results and comparison of the developed architectures are described in section [4](#page-10-0), and finally, this study is concluded in section [5](#page-11-0).

# 2 Background

#### 2.1 QCA Cell

Logic design using quantum cells is one of the most recent trends, which allows the scaling is continued to atomistic dimensions [\[8](#page-12-0), [13\]](#page-12-0). In the QCA technology, quantum cells are arranged to define the logic. Quantum cells are composed of 4 dots and 2 electrons that are arranged in a square. These 2 electrons can tunnel between these 4 dots [\[3](#page-12-0), [5,](#page-12-0) [13,](#page-12-0) [14](#page-12-0)]. Based on coulomb repulsion, two electrons always take up the diagonal positions as shown in Fig. 1.



Fig. 1 The QCA cell structure [\[3](#page-12-0), [14](#page-12-0), [15](#page-12-0)]

<span id="page-2-0"></span>

Fig. 2 QCA gates (a) two inverter gates, b two three-input majority gates and c 3-input XOR gate [\[3](#page-12-0)–[5](#page-12-0), [8](#page-12-0), [13](#page-12-0)]

The positions of 2 electrons inside the cell determines the binary levels [[3](#page-12-0), [8,](#page-12-0) [13](#page-12-0), [15](#page-12-0)]. The state of the cell is shown by polarization P that is computed as follows.

$$
P = \frac{(p_1 + p_3) - (p_2 + p_4)}{p_1 + p_2 + p_3 + p_4} \tag{1}
$$

In this technology,  $P = -1$  is utilized to show state "0" and  $P = +1$  is utilized to show state "1" [\[3,](#page-12-0) [8,](#page-12-0) [13,](#page-12-0) [15](#page-12-0)].

#### 2.2 QCA Gates

QCA architectures are created based on three basic gates: The inverter gate, majority gate and XOR gate. Figure 2 shows the QCA majority gate, inverter gate, and XOR gate [[3](#page-12-0), [4,](#page-12-0) [8\]](#page-12-0).

The output of majority gate is shown as follows [\[3,](#page-12-0) [4](#page-12-0), [14,](#page-12-0) [16](#page-12-0)]:

$$
M(A, B, C) = CA + BA + BC \tag{2}
$$

Where A, B and C denote the inputs of this gate.



Fig. 3 The utilized two inputs QCA XOR gate in [\[19](#page-12-0)]

<span id="page-3-0"></span>

Fig. 4 The utilized three inputs XOR gate in [\[17\]](#page-12-0)

The OR and AND gates are constructed by fixing one input of this gate to 1 or 0, respectively. The XOR gate is also a widely used logic gate in digital architectures design. The output of XOR gate is computed as follows [[4](#page-12-0), [8,](#page-12-0) [17](#page-12-0)]:

$$
XOR(A, B, C) = A \oplus B \oplus C \tag{3}
$$



Fig. 5 The two inputs XOR gate in [\[20](#page-12-0)]

<span id="page-4-0"></span>



#### 2.3 Full Adder

A one-bit QCA FA is an architecture that can add three one-bit inputs. Suppose that A, B and  $C_{in}$  are three inputs of the FA, the sum, and carry output for the one-bit QCA FA are computed as follows [\[6](#page-12-0)–[8](#page-12-0)]:

$$
S = A \oplus B \oplus C_{in} \tag{4}
$$

$$
C_{out} = AB + AC_{in} + BC_{in}
$$
 (5)



Fig. 7 The utilized QCA FA in [\[8](#page-12-0)]

<span id="page-5-0"></span>

Fig. 8 Multilayer one-bit QCA FA in [\[9](#page-12-0)] (a) layout, b layers

The expression for  $C_{out}$  can be reformulated in the QCA technology as follows [\[8,](#page-12-0) [18](#page-12-0)]:

$$
C_{out} = M(A, B, C_{in})
$$
\n<sup>(6)</sup>

As a result, the adder architectures are implemented using XOR gate and majority gate [[8](#page-12-0)].

# 2.4 Related QCA Works

In this section, we review and compare the related articles in two main parts.

#### 2.4.1 QCA XOR Gate

Sheikhfaal et al. [\[19\]](#page-12-0) have designed a QCA XOR gate, which has two inputs. It is constructed based on QCA primary gates, which is shown in Fig. [3.](#page-2-0) It has four clock phases latency and consists of 32 cells with  $0.02 \mu m^2$  area.

Ahmad et al. [\[17](#page-12-0)] have developed a three inputs QCA XOR gate by connecting two inputs XOR gates, which is shown in Fig. [4](#page-3-0).



Fig. 9 The utilized one-bit QCA FA in [[10](#page-12-0)], a sum generator architecture, b via layer, c carry generator architecture

<span id="page-6-0"></span>

Fig. 10 The utilized multilayer one-bit QCA FA in [[11](#page-12-0)]

This design has eight clock phases latency and contains 75 cells with  $0.08 \mu m^2$  area. Mustafa and Beigh [[20\]](#page-12-0) have developed a 2-input XOR gate, which is illustrated in Fig. [5](#page-3-0).

This design is implemented with 44 cells and it requires  $0.07 \mu m^2$  area. It has latency of four clock phases.

Ahmad et al. [\[17\]](#page-12-0) have also presented a three inputs QCA XOR gate. Figure [6](#page-4-0) shows this design.

This gate requires area of  $0.02 \mu m^2$  and 14 cells. It has latency of two clock phases. However, these XOR gates are suitable, but the performance of the XOR gates can be improved as described in this paper.



Fig. 11 The utilized QCA FA in [\[12\]](#page-12-0) (a) first layer, b via layer, c crossover cells

<span id="page-7-0"></span>

#### 2.4.2 The One-Bit QCA FA

Balali et al. [\[8](#page-12-0)] have designed one-bit QCA FA, which is shown in Fig. [7](#page-4-0). They enhanced three inputs QCA XOR by employing half distance in XOR gate. This FA architecture is requires  $0.02 \mu m^2$  area and 29 cells. It also has latency of two clock phases.



Fig. 13 The designed multilayer one-bit QCA FA (a) logic diagram, b three layers

<span id="page-8-0"></span>

Fig. 14 The designed 4-bit RCA, a QCA layout, b three layers

Mohammadi et al. [\[9\]](#page-12-0) have developed a multilayer one-bit QCA FA that is illustrated in Fig. [8.](#page-5-0)

In this FA, 38 cells are used with area of  $0.02 \mu m^2$  and latency of 3 clock phases.

The authors of [[10](#page-12-0)] used MG with control cell to develop one-bit QCA FA. Figure [9](#page-5-0) illustrates this design in three layers.

The number of cells in the utilized architecture in [\[10\]](#page-12-0) is 23. The required area is  $0.01 \mu m^2$  and the latency is 3 clock phases. The designed multilayer one-bit QCA FA in [\[11\]](#page-12-0) is illustrated in Fig. [10](#page-6-0). This FA requires  $0.04 \mu m^2$  area, and 73 cells. It also has 3 clock phases latency.

Hashemi et al. [\[12\]](#page-12-0) have developed a multilayer one-bit QCA FA. Figure [11](#page-6-0) depicts this architecture.

This FA requires 0.05  $\mu$ m<sup>2</sup> area and 79 cells. It also has 5 clock phases latency. Although, these QCA FA architectures are suitable, the performance of these architectures can be improved as described in the next section.

#### 3 The Designed Architectures

We design a new three inputs OCA XOR gate in this section, then, we design an efficient multilayer one-bit QCA FA architecture based on this new QCA XOR gate. In addition, a novel multilayer 4-bit QCA RCA architecture is developed using the designed multilayer QCA FA.



Fig. 15 The simulation result of the developed 3-input QCA XOR gate

| Ref.                                   | Cell count | Area $(\mu m^2)$ | Latency(clock phase) | Cost         |
|----------------------------------------|------------|------------------|----------------------|--------------|
| $[17]$ (Design 1)<br>$[17]$ (Design 2) | 75<br>14   | 0.08<br>0.02     |                      | 0.64<br>0.04 |
| $\lceil 8 \rceil$<br>This paper        | 14         | 0.01<br>0.01     |                      | 0.02<br>0.02 |

<span id="page-9-0"></span>Table 1 Comparison table for 3-input OCA XOR gate

#### 3.1 The Designed Three Inputs QCA XOR Gate

Figure [12](#page-7-0) illustrates the designed novel 3-input QCA XOR gate.

The architecture of the designed OCA XOR gate requires 0.01  $\mu m^2$  (=8004nm<sup>2</sup>) area and 11 cells.

#### 3.2 The Designed Multilayer One-Bit QCA FA Architecture

Figure [13](#page-7-0) depicts the designed multilayer one-bit QCA FA.

This FA is developed in two layers and one via layer. The output Sum is designed in third layer and the output carry, Cout, is designed in first layer. The proposed architecture requires 0.01  $\mu$ m<sup>2</sup> area and 25 cells.

#### 3.3 The Designed Multilayer 4-Bit RCA

The developed multilayer 4-bit RCA is displayed in Fig. [14.](#page-8-0)

The designed novel multilayer 4-bit RCA is based on the designed multilayer one-bit QCA FA architecture. It requires  $0.17 \mu m^2 (=171545 nm^2)$  area and 125 cells.



Fig. 16 The simulation results of the developed QCA FA

| Ref.               | Cell count | Area $(\mu m^2)$ | Latency(clock phase) | Cost |
|--------------------|------------|------------------|----------------------|------|
| $\lceil 12 \rceil$ | 79         | 0.05             |                      | 0.25 |
| $[11]$             | 73         | 0.04             |                      | 0.12 |
| $[10]$             | 23         | 0.01             |                      | 0.03 |
| [9]                | 38         | 0.02             |                      | 0.06 |
| $\lceil 8 \rceil$  | 29         | 0.02             |                      | 0.04 |
| This paper         | 25         | 0.01             |                      | 0.02 |

<span id="page-10-0"></span>Table 2 Comparison table for one-bit FA

#### 4 Simulation Results and Comparison

The results of the developed architectures are presented and compared with previous architectures in this section. The cost is determined as follows:

$$
Cost = Area \times Latency
$$
 (7)

Where latency is determined based on clock phase and area is calculated based on  $\mu m^2$ .

#### 4.1 The Developed Three Inputs QCA XOR Gate

Figure [15](#page-8-0) illustrates the simulation results of the developed three-input XOR gate.

These results confirm the accuracy of the functionality of the designed three-input QCA XOR gate. The latency of the developed three inputs QCA XOR gate is two clock phases. Table [1](#page-9-0) compares the developed three-input XOR gate with XOR gates in [[8,](#page-12-0) [17\]](#page-12-0).

Based on Table [1](#page-9-0), our designed 3-input QCA XOR gate has improvements in terms of cost, cell count, and area in comparison with other QCA XOR gate architectures in [\[8](#page-12-0), [17](#page-12-0)]. For instant, it has 97%, 85%, 87%, and 75% improvements in terms of cost, cell count, area, and latency, respectively, in comparison with [\[17](#page-12-0)] (design 1). It also has 50%, 50%, and 21.4% improvements in terms of cost, area and cell count, respectively, in comparison with [\[17\]](#page-12-0) (design 2).

#### 4.2 The Developed Multilayer QCA FA

Figure [16](#page-9-0) depicts the simulation results of the designed multilayer QCA FA.



Fig. 17 The simulation results of the designed multilayer 4-bit QCA RCA

| Ref.               | Cell count | Area $(\mu m^2)$ | Latency (clock phase) | Cost |
|--------------------|------------|------------------|-----------------------|------|
| $\lceil 12 \rceil$ | 308        | 0.29             |                       | 2.32 |
| $[10]$             | 165        | 0.20             |                       | 1.6  |
| [9]                | 237        | 0.24             |                       | 1.44 |
| $^{[8]}$           | 269        | 0.37             |                       | 2.96 |
| This paper         | 125        | 0.17             |                       | 0.85 |

<span id="page-11-0"></span>Table 3 Comparison table for 4-bit OCA RCA

The results confirm the correctness of the designed multilayer QCA FA. In addition, the latency of the designed multilayer one-bit QCA FA is 2 clock phases.

The simulation results of the developed multilayer QCA FA and the multilayer QCA FA architectures in [[8](#page-12-0)–[12](#page-12-0)] are summarized in Table [2](#page-10-0).

Based on achieved results, which are illustrated in Table [2](#page-10-0), our developed multilayer QCA FA has advantages in terms of cost, area, and cell count compared with other multilayer QCA FA architectures in  $[8-12]$  $[8-12]$  $[8-12]$ . For instant, it has 80%, 68%, 92%, and 60% improvements in terms of area, cell count, cost and latency, respectively, in comparison to [[12\]](#page-12-0). It also has 50%, 50% and 13.7% improvements in terms of cost, area, and cell count, respectively, in comparison with [[8](#page-12-0)].

#### 4.3 The Designed Multilayer 4-Bit QCA RCA

Figure [17](#page-10-0) displays the simulation results of the designed multilayer 4-bit RCA.

According to Fig. [17](#page-10-0),  $C_{out}$  is calculated as  $S_4$ . In addition, the results confirm the correctness of the designed multilayer 4-bit QCA RCA. The latency of the designed multilayer 4-bit QCA RCA is 5 clock phases. Table 3 compares the developed multilayer 4-bit RCAwith other architectures.

According to simulation results, which are summarized in Table 3, our developed architecture has improvements in terms of area, cell count, cost and latency compared to other multilayer 4-bit QCA RCA architectures in [\[8](#page-12-0)–[10,](#page-12-0) [12](#page-12-0)]. For instant, it has 54%, 53.5%, 71.2% and 37.5% improvements in terms of area, cell count, cost and latency, respectively, in comparison to [\[8](#page-12-0)]. It also has 15%, 24.2%, 46.8% and 37.5% improvements in terms of cell count, area, latency and cost, respectively, in comparison with [[10\]](#page-12-0).

#### 5 Conclusions

The QCA technology is a new technology for high-speed and dense architectures design. The FA is also a widely used architecture in computer arithmetic architecture design [\[1](#page-12-0)–[6](#page-12-0)]. A new QCA XOR gate was developed in this paper. Then, an efficient one-bit QCA FA was designed based on the developed three inputs QCA XOR gate. In addition, a 4-bit RCA architecture was developed based on this new one-bit QCA FA as structural unit. The developed architectures were simulated in QCADesigner tools version 2.0.3. Based on our simulation results, the developed architecture for 4-bit multilayer QCA RCA has 125 QCA cells, 0.17  $\mu m^2$  area, and 5 clock phases. Based on our comparison results, our designed 3-input QCA XOR gate, multilayer one-bit QCA FA architecture and multilayer 4-bit QCA RCA architecture have improvements in terms of area, cost, and cell count compared to other architectures. Therefore, the designed multilayer 4-bit QCA architecture has a huge potential to become an efficient architecture for implementing QCA RCA architectures.

#### <span id="page-12-0"></span>References

- 1. Compano, R., Molenkamp, L., Paul, D.: Roadmap for nanoelectronics. European commission IST programme, future and emerging technologies (2000)
- 2. Lent, C.S., Tougaw, P.D., Porod, W., Bernstein, G.H.: Quantum cellular automata. Nanotechnology. 4(1), 49–57 (1993)
- 3. Lent, C.S., Tougaw, P.D.: A device architecture for computing with quantum dots. Proc. IEEE. 85(4), 541– 557 (1997)
- 4. Balali, M., Rezai, A., Balali, H., Rabiei, F., Emadi, S.: A novel design of 5-input majority gate in quantumdot cellular automata technology. In: Computer Applications & Industrial Electronics (ISCAIE), 2017 IEEE Symposium on 2017, Pp. 13–16. IEEE (2017)
- 5. Rashidi, H., Rezai, A.: Design of novel efficient multiplexer architecture for quantum-dot cellular automata. J. Nano- Electron. Phys. 9(1), 1012–1011 (2017)
- 6. Rashidi, H., Rezai, A.: High-performance full adder architecture in quantum-dot cellular automata. J. Eng. 1(1), (2017)
- 7. Mokhtari, D., Rezai, A., Rashidi, H., Rabiei, F., Emadi, S., Karimi, A.: Design of novel efficient full adder architecture for quantum-dot cellular automata technology. Facta Univ. Ser.: Electron. Energ. (FU Elec. Energ). 31(2), 279–285 (2018)
- 8. Balali, M., Rezai, A., Balali, H., Rabiei, F., Emadi, S.: Towards coplanar quantum-dot cellular automata adders based on efficient three-input XOR gate. Results Phys. 7, 1389–1395 (2017)
- 9. Mohammadi, M., Mohammadi, M., Gorgin, S.: An efficient design of full adder in quantum-dot cellular automata (QCA) technology. Microelectron. J. 50, 35–43 (2016)
- 10. Roohi, A., DeMara, R.F., Khoshavi, N.: Design and evaluation of an ultra-area-efficient fault-tolerant QCA full adder. Microelectron. J. 46(6), 531–542 (2015)
- 11. Navi, K., Farazkish, R., Sayedsalehi, S., Azghadi, M.R.: A new quantum-dot cellular automata full-adder. Microelectron. J. 41(12), 820–826 (2010)
- 12. Hashemi, S., Tehrani, M., Navi, K.: An efficient quantum-dot cellular automata full-adder. Sci. Res. Essays. 7(2), 177–189 (2012)
- 13. Rashidi, H., Rezai, A., Soltany, S.: High-performance multiplexer architecture for quantum-dot cellular automata. J. Comput. Electron. 15(3), 968–981 (2016)
- 14. Niknezhad Divshali, M., Rezai, A., Karimi, A.: Towards multilayer QCA SISO shift register based on efficient D-FF circuits. Int. J. Theor. Phys. 57, 3326–3339 (2018). <https://doi.org/10.1007/s10773-018-3846-8>
- 15. Ahmad, F., Bhat, G.M., Ahmad, P.Z.: Novel adder circuits based on quantum-dot cellular automata (QCA). Circuits Syst. 5(06), 142–152 (2014)
- 16. Arani, I.E., Rezai, A.: Novel circuit design of serial–parallel multiplier in quantum-dot cellular automata technology. J. Comput. Electron. 1–9 (2018)
- 17. Ahmad, F., Bhat, G.M., Khademolhosseini, H., Azimi, S., Angizi, S., Navi, K.: Towards single layer quantum-dot cellular automata adders based on explicit interaction of cells. J. Comput. Sci. 16, 8–15 (2016)
- 18. Balali, M., Rezai, A.: Design of low-complexity and high-speed coplanar four-bit ripple carry adder in QCA technology. Int. J. Theor. Phys. 1–13 (2018)
- 19. Sheikhfaal, S., Angizi, S., Sarmadi, S., Moaiyeri, M.H., Sayedsalehi, S.: Designing efficient QCA logical circuits with power dissipation analysis. Microelectron. J. 46(6), 462–471 (2015)
- 20. Mustafa, M., Beigh, M.: Design and implementation of quantum cellular automata based novel parity generator and checker circuits with minimum complexity and cell count. (2013)

Publisher's Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.