

# **A novel expeditious switching circuit design for non volatile combinational circuit**

**A. Ranjani Aruna[1](http://orcid.org/0000-0002-0956-4668) · J. Kamala1 · C. R. S. Hanuman2**

Received: 4 May 2021 / Revised: 30 April 2022 / Accepted: 1 August 2022 / Published online: 5 September 2022 © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022

## **Abstract**

Magnetic Tunnel Junction (MTJ) is an important device to store the non-volatile. This MTJ device overcomes the disadvantages of CMOS technology by reducing the leakage power. Also, it stores the data in magnet domain. Non Volatile combinational circuits are nonvolatile memory elements, which distributes over a logic-circuit plane and expected to realize both the ultra-low-power and reduced interconnection delay. The existing Spin Orbit Torque- Magnetic Tunnel Junction method is used to store data with higher density, cost benefts, endurance and Non-Volatility. But it has some limitations, such as high current density needed for writing can infrequently damage the MTJ barrier, achieving reliable reading without making a diference remains a challenge. More constraints limit the designer to attain higher speed with reliable MRAM architectures, because the operations of writing and reading share the same path (via the junction). To overcome these limitations, an innovative nonvolatile combinational circuit depending on Spin Transfer Torque (STT) with perpendicular Magnetic Tunnel Junction (STT-PMTJ) is proposed in this manuscript for fast data storage in real time applications. Here, the nonvolatile combinational circuits are Spin Transfer Torque and Spin Orbit Torque. The proposed method gives higher reliability and the lower resistive writing path acts high-speed with energy-efficient WRITE operation with the help of Read and Write Parallel Switching (RWPS). The RWPS circuit is designed using Spin Orbit Torque (SOT) logic. The RWPS-PMTJ device used to maximize the robustness of entire structure and the noise is apparently reduced. The performance of the proposed method is compared with other existing methods, such as STT-MTJ, SOT-MRAM. The experimental results shows that the proposed RWPS-PMTJ method is efficiently reduced write delay by 64% compared to the STT switching, it also exhibits 61% faster read access with full swing eventually eliminating the setup time requirement. It enables the design of a new era of In-memory computing circuits to meet the challenges in the design of memory-based computation logic circuits. The AND logic gate and Full Adder (FA) circuits are implemented using Cadence virtuoso 45 nm technology.

**Keywords** SOT (Spin orbit torque) · Spin transfer torque (STT) · Read and write parallel switching (RWPS) · Non-volatile · Full adder (FA)

 $\boxtimes$  A. Ranjani Aruna aruna.ranjani@yahoo.in J. Kamala jkamalaa06@gmail.com

> C. R. S. Hanuman crshanuman@sasi.ac.in

<sup>1</sup> Department of Electronics and Communication Engineering, College of Engineering Guindy, Anna University, Chennai, Tamilnadu, India

<sup>2</sup> Department of Electronics and Communication Engineering, Sasi Institute of Technology and Engineering, Tadepalligudem, Andhra Pradesh, India

## **1 Introduction**

MTJ is a vital device to store non-volatile, where it overcomes the disadvantages of CMOS technology by reducing leakage power. It stores data in magnet domain [\[1](#page-10-0)]. The important application of MTJ is Magnetic Random Access Memory (MRAM), Hard Disk Drive read heads and memory mapping [\[2](#page-10-1)[–5](#page-10-2)]. The perpendicular MTJ has high Tunnel Magneto Resistance (TMR), high thermal stability, energyefficient and low switching current compared to the In-plane MTJ (IMTJ) and Tilted MTJ [[6–](#page-10-3)[8](#page-10-4)].

Diferent switching mechanisms have been developed to the write operation. They are Spin Transfer Torque [\[9](#page-10-5), [10](#page-10-6)], Field-Induced, Thermally Assisted, Voltage Assisted and



<span id="page-1-0"></span>**Fig. 1** The architecture of PMT



<span id="page-1-1"></span>**Fig. 2** Perpendicular Magnetic Tunnel Junction in Parallel and Antiparallel state

Spin–Orbit Torque switching techniques. Spin–orbit PMTJs is the most suitable device, due to its faster switching time, higher energy efficiency compared with the spin–orbit IMTJ. Three terminal MTJ devices have higher reliability than twoterminal devices [\[11](#page-10-7), [12\]](#page-10-8).

The three terminals PMTJ is chosen for analysis and design. In this model, the initial SOT torque is very strong and the incubation delay is negligible [[12](#page-10-8)[–14\]](#page-11-0). It supports rail-to-rail swing and eliminates the need of Pre-charge Sense Amplifer to read/write access. An external magnetic feld is applied initially in the Y direction as shown in Fig. [1](#page-1-0). It is designed with four primary layers, two ferromagnetic layers (CoFeB) namely pinned layer (PL) and free layer (FL), separated by Tunneling barrier (MgO) or Insulating layer (IL). Three layers are placed on top of the strong spin–orbit coupling, called the channel (Pt) [[15](#page-11-1)[–19](#page-11-2)]. Channel is the additional layer integrated with the free layer.

The presence of an external magnetic field collinear with the current flow is required to break the symmetry in a spin–orbit PMTJ. The power consumption of memories is predominantly occupied by the writing circuit [[20\]](#page-11-3).

The schematic view of the Perpendicular MTJ model with the terminals is shown in Fig. [2](#page-1-1). It has six terminals; three terminals are used for interfacing. Terminals  $W_1$  and  $W_2$ are used to write into the free layer and fxed/pinned layer respectively. 'R' terminal is used for a read operation.  $M_X$  $,M_{\nu}M_{\nu}$  terminals indicate the magnetization direction of the free layer during write operations. The external magnetic and electric feld are applied; the magnetization direction of non-volatile data is always in the Z-direction in the fxed layer. All are perpendicular to each other.

*Contribution of the Work:* The major contributions in this work are

- To improve the read and write speed of non-volatile data by STT switching with PMTJ compare to the other STT– MTJ,
- To reduce the energy consumption of STT–PMTJ based combinational logic,
- Novel RWPS circuit is designed for write operation using SOT logic,
- To analyse the speed, power and energy parameter in STT and RWPS circuit,
- Noise analysis in the III stage inverter in RWPS circuit,
- To implement the combinational circuit in STT and RWPS based switching logic.

#### **1.1 STT based switching technique**

This section describes the Spin Transfer Torque switching mechanism with the PMTJ-CMOS device model described using PMTJ is represented in Fig. [1](#page-1-0). Circuit operation is verifed for lower supply voltages. Sizing of switching transistors is designed to increase the switching current.

Electrons cross the barrier layer during each writes operation and read operation in STT switching. The spin electrons are crossing through the barrier layer from one ferromagnetic layer into other. In this case, tunnelling barrier thickness decides the critical current. Tunnel Magneto Resistance (TMR) is a magneto resistive effect that occurs in MTJ. It defnes as the ratio of the diference in the antiparallel resistance and the parallel resistance. Parallel state resistance is less than the anti-parallel state resistance. It prevents Boron atoms from segregating into the interface. TMR ratio increases with the spin polarization.

Critical switching current is proportional to the MTJ device area depends on other parameters shown in Eq. [\(1](#page-1-2)).

<span id="page-1-2"></span>
$$
I_C = \frac{2e(40K_BT)\left[d + (\sigma_F/\sigma_{Pt})t\right]}{\hbar L((d = \infty)/J_{e,Pt})\left[1 - \text{sech}(d/\lambda_{sf})\right]} \frac{M_s\left(|I_c|\right)B_{an}^0(|I_c|)}{M_s\left(I = 0\right)B_{an}^0(I = 0)}
$$
\n(1)

Also, the parameters are analyzed to get the desired output.

#### **2 Literature survey**

Among the research works on expeditious switching circuit design for non-volatile combinational circuit, some of the recent works are revised here,

Fakhari et al. [[21](#page-11-4)] have presented CNTFET switching logic base lesser power with area-efficient quaternary adder. The presented design substantially decreases the count of transistors, area, power consume, when managing output driving ability along full swing mode. The presented model was simulated utilizing HSPICE and Stanford CNTFET. To CNTFET-base circuits, the presented circuit was drawn by physical design tool. The outcomes show the efectiveness with respect to area, average power consume, PDP likened to its state-of -art counterparts.

Bastani et al.  $[22]$  $[22]$  $[22]$  have presented energy with area-efficient approximate ternary adder depending on CNTFET switching logic for decreasing area with energy consume. By utilizing the Synopsys HSPICE simulator along Stanford 32 nm CNTFET technology, the presented design was lesser than other CNTFET-base ternary circuits. Moreover, the presented ternary full adder was utilized to lessen the ternary multiplier structure steps with certain metrics, viz energyerror tradeof, normalized error distance were computed to estimate the presented circuit performance.

Hussain and Chaudhury [[23](#page-11-6)] have presented fast and high-performing 1-bit full adder circuit depending on input switching activity patterns and gate difusion input (GDI) technique. The adder was structured in 2 states: (i) XOR–XNOR module, (ii) required outputs. The switching activities of the transistors were lessened by utilizing the switching activity pattern of inputs with GDI in each stage. The presented method lessens delay, power consumption, computational complexity.

Mandal et al. [[24\]](#page-11-7) have presented an optical quaternary adder and subtractor model utilizing polarization switching. The frequency encoding approach was employed to data representation due to its inherent feature of constancy during refection, refraction, transmission. Various bits were processed without any error instead of any fuctuation in intensity, thus, the bit error rate was lessened. Polarization switching properties of semiconductor optical amplifer were adopted to develop these basic units.

Amirany and Rajaei [\[25](#page-11-8)] have presented spin-base fully nonvolatile full-adder circuit for scaling in memory. A nonvolatile with lower cost radiation hardened magnetic full adder (MFA) was presented. When compared with other designs, the presented MFA was able to tolerate particle strikes not considering the quantity of charge induced to a single and multiple nodes. The presented magnetic full adder gives lesser power, lesser area and higher performance. The full non volatility was presented by the magnetic full adder.

Nonvolatile logic circuits eliminate the cost of higher volume data transactions amid the memory and logic, also make possible power gating in logic-in-memory structures.

Alghare et al. [\[26\]](#page-11-9) have presented a Non-volatile spintronic flip-flop design for energy-efficient Single-Event Upset and Double Node Upset resilience. Where, latching circuits depending on soft error resilient CMOS was structured to be leveraged as master and slave latches in NV-FF structure. The presented design use feedback loops and clock-gating Muller C-elements, also maximizing the charging ability of vulnerable nodes. The simulations show that the presented Single-Event Upset and Double Node Upset resilient latching circuits attains 81% and 24% power-delayproduct improvement compared to other designs. The presented latching circuit was combined to develop four radiation hardened NV-FF designs. In this area was increased.

Razi et al. [[27\]](#page-11-10) have presented an energy-efficient radiation-hardened non-volatile magnetic latch. The presented model was deemed to design a reliable non-volatile magnetic latch utilizing MTJs. The presented latch was utilized to create a rad-hard magnetic master–slave fip-fop. Simulations show that the presented design present advantageous fgures of merit over the prior works. The presented design attains 52% and 82% enhancement in power and delay over other designs.

Rajaei and Amirany [\[28\]](#page-11-11) have presented a Nonvolatile low-cost approximate spintronic full adders for computing in memory architectures. The presented magnetic FAs provide full non-volatility, lower area, lower energy consume. The presented magnetic FAs have the beneft of single event upset (SEU) tolerance. The outcomes show that the presented design provides 50% energy efficiency than other fully nonvolatile magnetic FAs. In the approximate Gaussian flter, the fltered noisy images have almost the same results as the accurate Gaussian flter. The presented magnetic FAs have an accurate carry-out output and an approximate sum output with an error distance of 2. The delay was increased.

## **3 Proposed RWPS based switching technique**

Previous research shows read and write '0' accesses by applying opposite direction current pulses through PMTJ. Transfer of data to the cell and from the cell is not illustrated so far. Therefore, this paper proposes a novel switching circuit to write and read data. Instantaneous changes in the input data are immediately transferred to the contents of the memory without hold operation.

#### **3.1 SOT write and read circuit**

Spin–Orbit Torque Switching—In this case, read and write paths are decoupled, so they can be independently optimized.

WRITE: External electric current is applied to the channel due to Spin Hall Efect, the non-volatile data is written in the Z direction of PMTJ. Digital data '0' or '1' is stored, according to the direction of the spin polarization.

NMOS transistors MN1–MN4 generate a positive current pulse for volatile input data'1', the free layer is aligning Antiparallel to the fxed layer of PMTJ. The MTJ write output is '1'. Input data 0 is applied as negative current pulse to PMOS transistors MP1–MP4. It leads to the parallel connections of fxed and free layers. The write operation is based on the input data and does not depend on the "Enable" control signal. A basic current mirror circuit is used to read and write the data.  $I_{D2}(Drain Current)$  is equal to W1 terminal current, MN4, MN3 transistor sizes are chosen equal and half of MN1. Similarly, PMOS transistor sizes are also chosen using the same design technique. The volatile data at the input is transferred to non-volatile data and written in the MZ pin. It has been read out from the point 'B' as shown in Fig. [3.](#page-3-0)

Mn3, Mn4 width are high because 200  $\mu$ A current is passed, it act as a switch (triode region).

Mn1, Mn2–length should be high, due to Mn1 current should mirror of Mn2 and operated in saturation region [\[29\]](#page-11-12).

$$
I_2 = \frac{1}{2} \mu_n C_{ox} (W | L)_2 (V_{GS} - V_{th})^2
$$
 (2)

$$
I_1 = \frac{1}{2} \mu_n C_{ox} (W|L)_1 (V_{GS} - V_{th})^2
$$
\n(3)

<span id="page-3-1"></span>
$$
I_2 = \left(\frac{\left(\frac{w}{L}\right)_2}{\left(\frac{w}{L}\right)_1}\right)I_1\tag{4}
$$

$$
\frac{I_2}{I_1} = \left(\frac{\left(\frac{W}{L}\right)_2}{\left(\frac{W}{L}\right)_1}\right); (W|L)_2 = (W|L)_1
$$
\n(5)

$$
\mathbf{I}_2 = \mathbf{I}_1 \tag{6}
$$



<span id="page-3-0"></span>**Fig. 3** Read Write Parallel switching Circuit

The following conditions are satisfed to mirror the current in Mn1 and Mn2 transistor.

$$
Vgs_1 = Vds_1 = Vgs_2 \tag{7}
$$

Similarly the Mn3 and Mn4 transistors are follow the given equation

$$
(W|L)_{3} = (W|L)_{4}
$$
\n(8)

READ: The read path is from a fxed layer to the free layer through a barrier layer. The read and write accesses can be activated simultaneously. Transistors MN0, MN5, MP0 and MP5 are used for a read operation. The volatile data '0' is applied, MP1 – MP4 transistors are work up for write '0' and MN5,MN0 are turned ON to create a path for read operation. The complement data are processed similarly as shown in Fig. [3](#page-3-0).

## **3.2 SOT based combinational circuit design**

The combinational circuit AND and Full Adder circuits are implemented with PMTJ-CMOS. The resistance value of 100  $\Omega$  is chosen for the readout terminal, to transfer the data with reduced output swing to the NMOS pass transistor logic. PCSA is used to derive rail-to-rain swing for design the combinational circuit and bufer is used at the inputs of NMOS logic to increase the speed.

AND: Inputs 'B' and 'B bar' of the combinational logic is applied through PMTJ. Inputs 'A' and 'Abar' are applied through buffers. Buffers are designed to equalize the delay of volatile and non-volatile inputs of combinational logic.

<span id="page-4-1"></span>**Table 1** Switching states of Transistors AND logic

| A              | B                | Branch resistance                | Transistor<br>Switching<br>State  | AND output | <b>NAND</b><br>Output |
|----------------|------------------|----------------------------------|-----------------------------------|------------|-----------------------|
| $\overline{0}$ | $\overline{0}$   | $R_{\overline{Q_m}} > R_{Q_m}$   | $MN_{5,6}$ -OFF 0<br>$MN_{7}$ -ON |            |                       |
|                | $0 \quad 1$      | $R_{\overline{Q_m}} > R_{Q_m}$   | $MN_{5.6}$ -OFF 0<br>$MN_{7}$ -ON |            | 1                     |
|                | $\boldsymbol{0}$ | $R_{\overline{Q_m}} > R_{Q_m}$   | $MN_{5,6}$ -ON 0<br>$MN_{7}$ -OFF |            | 1                     |
| $1 \quad 1$    |                  | $R_{\overline{O_m}}$ < $R_{Q_m}$ | $MN_{5,6}$ -ON 1<br>$MN_{7}$ -OFF |            | 0                     |

Transistors MN5, MN6 and MN7 are used to implement AND and NAND logic as shown in Fig. [4](#page-4-0). The transistor's states are explained in Table [1.](#page-4-1)

Full Adder: During the read operation, MN0 and MP0 transistors discharge fast and improve the access speed. The combinational circuit of a Full Adder is shown in Fig. [5.](#page-5-0) Two MTJ devices are used in the writing circuit and confgured for FA sum and carry operation. The logic operation is performed in the evaluation mode. Similar to AND logic, buffers are included in the path of inputs 'A' and 'C' to make propagation delay of all signals the same. It reduces the setup requirement of the PCSA logic circuit. The transistor's states are explained in Table [2.](#page-5-1) Table [2](#page-5-1) tabulates the true table along resistance configuration of  $C_0$  logic.  $R_{OFF}$  and *RON* represents MOS transistors close with open resistances.  $R_L$ , $R_R$  represent entire resistance of PCSA left, right branch. To determine *A* and  $C_i$ , the  $AC_i$ ,  $AC_i$  sub branches have no



<span id="page-4-0"></span>**Fig. 4** RWPS based AND and NAND logic gate



NMOS structure for full adder SUM

<span id="page-5-0"></span>**Fig. 5** RWPS based Full Adder

<span id="page-5-1"></span>**Table 2** Switching states of Transistors—PMTJ Full Adder

| A              | B            | $C_i$          | Resistance com-<br>parison | $C_0$            | Sub branch of $AC_i$ | Sub branch of $AC_i$ |
|----------------|--------------|----------------|----------------------------|------------------|----------------------|----------------------|
| $\overline{0}$ | $\mathbf{0}$ | $\overline{0}$ | $\mathbf{c}$               | $\boldsymbol{0}$ | $2R_{OFF}$           | $2R_{ON}$            |
| $\theta$       | $\mathbf{0}$ |                | $R_L > R_R$                | $\mathbf{0}$     | $R_{OFF}$ + $R_{ON}$ | $R_{ON}$ + $R_{OFF}$ |
| $\theta$       | 1            | $\theta$       | $R_I > R_R$                | $\boldsymbol{0}$ | $2R_{OFF}$           | $2R_{ON}$            |
| $\overline{0}$ | 1            | 1              | $R_I < R_R$                | 1                | $R_{OFF}$ + $R_{ON}$ | $R_{ON}$ + $R_{OFF}$ |
|                | $\mathbf{0}$ | $\mathbf{0}$   | $R_L > R_R$                | $\mathbf{0}$     | $R_{ON} + R_{OFF}$   | $R_{OFF}$ + $R_{ON}$ |
|                | $\mathbf{0}$ |                | $R_L < R_R$                |                  | $2R_{ON}$            | $2R_{OFF}$           |
|                |              | $\mathbf{0}$   | $R_L < R_R$                | 1                | $R_{ON}$ + $R_{OFF}$ | $R_{OFF}$ + $R_{ON}$ |
|                |              |                | $R_I < R_R$                |                  | $2R_{ON}$            | $2R_{OFF}$           |

impact in output. The 2 sub branches resistance are same if  $A$  and  $C_i$  are varied. If it is same, their comparison associated with  $R_L$  and  $R_R$  in  $R_{OFF} > R_{AP}$  condition, which is always true. It allows  $AC_i$  to be deleted from Eq. ([4\)](#page-3-1) and get  $C_0$ logic circuit. The output of *Sum* is represented in Eq. ([9](#page-5-2)), the output of *Sum* is expressed in Eq. [\(10](#page-5-3)) and the output of  $C_0$  and  $\overline{C_0}$  are represented in Eqs. [\(11](#page-5-4)) and ([12\)](#page-6-0) respectively.

<span id="page-5-2"></span>
$$
Sum = A \oplus B \oplus C_i = ABC_i + \overline{ABC_i} + \overline{ABC_i} + \overline{ABC_i} \tag{9}
$$

<span id="page-5-3"></span>
$$
\overline{\text{Sum}} = AB\overline{C_i} + \overline{ABC_i} + \overline{ABC_i} + A\overline{B}C_i
$$
\n(10)

<span id="page-5-4"></span>
$$
C_0 = AB + AC_i + BC_i \tag{11}
$$

<span id="page-6-1"></span>

| Parameter                                        | Description                                             | Default Value       |
|--------------------------------------------------|---------------------------------------------------------|---------------------|
| TMR(0)                                           | TMR ratio with zero bias volt-<br>age $V_{\text{bias}}$ | 150%                |
| $V_{dd}$                                         | Write & Read Voltage                                    | 1 V                 |
| $I_c$                                            | Critical Current                                        | $\sim$ 175 µA       |
| T                                                | External Electric Field                                 | $200 \mu A$         |
|                                                  | Anti parallel Resistance                                | $8.2261$ KQ         |
| $\mathbf{R}_{\text{ap}}$ $\mathbf{R}_{\text{p}}$ | Parallel Resistance                                     | $3.2904$ K $\Omega$ |
| L                                                | Length                                                  | $150 \text{ nm}$    |
| Ð                                                | <b>Thickness</b>                                        | 3 nm                |
|                                                  |                                                         |                     |

<span id="page-6-2"></span>**Table 4** Noise analysis in the III stage inverters



$$
\overline{C_0} = \overline{AB} + \overline{AC_i} + \overline{BC_i} \tag{12}
$$

### **4 Simulation result and analysis**

The proposed RWPS switching circuit is implemented using cadence virtuoso 45 nm standard CMOS technology with a perpendicular MTJ model. Table [3](#page-6-1) tabulates the MTJ parameters. The read and write circuits are analyzed for STT and RWPS switching techniques. Logic gates and Full Adder are implemented. A supply voltage of 1 V is used. The threshold value is calculated from the transfer characteristics of the inverter and its value is 0.5 V. The proposed RWPS novel method result is analyzed, the simulation of STT model is analyzed the parameters required for novel method to compare the results of STT model as well as RWPS model.

#### **4.1 Write and read circuit transient response for STT**

The read and write operations are analyzed for input output and control signals in order to represent the volatile data. Compose access is enacted with Enable signal at a signifcant level and read admittance at zero levels. In Pre charge mode, the yields get charged to VDD, when 'CLK' is lower. In the Assessment stage, the put away worth has been perused with the CLK at a signifcant level. The positive edge CLK is applied at the time of Enable signal is

low. Table [4](#page-6-2) shows 86% percentage of improvement with respect to proposed work.

In this segment, life RWPS– PMTJ combinational circuits for AND logic and Full adder using Spintronics are analyzed. Input, output data and control signals of STT for read and write operations are depicted in Fig. [6.](#page-7-0) The 100 MHz volatile data input is applied to the writing circuit of STT–PMTJ. Write access is activated with Enable signal at a high level and read access at zero levels. In Precharge mode, the outputs get charged to  $V_{DD}$ , when 'CLK' is low. In the Evaluation phase, the stored value has been read with the CLK at a high level. The positive edge CLK is applied at the time of Enable signal is low.

Figure [7](#page-7-1) shows the Read and write access with degraded output and without set up time violation in STT AND logic. In this the out waveform of the AND switching circuit provides Input data 'B' (MTJ Data Output) is applied through PMTJ and input 'A' is applied directly in NMOS logic, AND and NAND outputs have taken from the PCSA. The setup time requirement of STT AND logic is 0.06 ns. The operating frequency is 17GHZ. The drawback of this method is that read write operations cannot be done parallel. Therefore, the proposed RWPS method does not require any setup time for RWPS based AND gate.

<span id="page-6-0"></span>Similarly, the input set-up time requirement is not satisfed and it afects the output. The setup time is not violated in 20 ns, the output response for the previous input value of 'A'.

Figure [8](#page-8-0) shows the output waveform of STT based Full Adder circuit simulation result. The operation of full adder shows the Full Adder: Clock of frequency 50 MHz is chosen for the full adder. The setup time requirement of STT FA is 0.09 ns. The sum path read delay is 15% higher than the carry path and programming frequency is 11GHZ.

Figure [9](#page-8-1) shows the output wave form of the Write and read access for RWPS. In the case of RWPS, the 100 MHz volatile data is applied to the current mirror circuit; the data is stored and read from the PMTJ devices without any control signal. The data has written into the PMTJ devices, parallelly read out from the terminal 'R' with full rail to rail. For example logic '0' is written 'B' and complement MTJ output 'B bar'. At the same time, it is readout. The writing a circuit (Fig. [3](#page-3-0)) is used to read and writes access. Read and write access delays are reduced by 65% and 38%, but the energy is increased. Half of the transistors are turned ON permanently and data missing (No hold state) is eliminated.

Figure [9](#page-8-1) shows the Write data in STT VS RWPS. In the RWPS method, the energy is high as 80% since switching is also 80% then the previous methods.

Figure [10](#page-9-0) shows the AND and NAND logic simulation using RWPS. The AND and NAND outputs are evaluated according to the input A and MTJ data with 'CLK' at a high level. In A = 1,  $R_{\overline{Q_m}} < R_{Q_m}$  and B = 1. So, the AND output is



<span id="page-7-0"></span>**Fig. 6** Write and read access for STT



<span id="page-7-1"></span>**Fig. 7** Read and write access with degraded output and without set up time violation in STT AND logic

logic '1' value and complement output of NAND is logic '0'. The other input combination of  $A = 0$ ,  $B = 1$ ,  $R_{\overline{Q_m}} > R_{Q_m}$ 

then  $AND = 0$ . The clock frequency is fixed at 66.6 MHZ. The outputs are very fast (82% and 80%) compared with the STT–PMTJ. The read access time is increased by adding a



<span id="page-8-0"></span>**Fig. 8** STT based Full Adder circuit simulation result



<span id="page-8-1"></span>**Fig. 9** Write data in STT VS RWPS

buffer with 'Input A'. The delay between the outputs is reduced by 79% in RWPS. Its write access is faster compared with the STT.

Table [4](#page-6-2) shows the Noise analysis in the III stage inverters. In this the input noise voltage is applied in the frst stage inverter and the input, output and noise gain also calculated. Here, 3 inverters are implemented instead of one as it is the concept of pipelining which is used and eventually analyzed for power of background noise. In this, the margins of the write operation, which need to be improved, so introduced a pipelining concept, it is benefcial as write margin is available without any noise.

Table [5](#page-9-1) shows the performance analysis of proposed switching circuit design compared with various existing



<span id="page-9-0"></span>**Fig. 10** AND and NAND logic simulation using RWPS

<span id="page-9-1"></span>

methods. The proposed RWPS-PMTJ-NVCC design uses 45 transistors and the number of used transistors for AND gate is 33 and the Full adder is 48. For write delay analysis the proposed method provides 21.44%, 28.04%, 27.37% lower than the existing method such as STT-PMTJ-NVSFF, SHE-STT-MTJ-NVML, SHE-STT-MTJ-NVLCASF respectively. For read delay analysis the proposed method provides 32.86%, 33.05%, 20.73% lower than the existing method such as STT-PMTJ-NVSFF, SHE-STT-MTJ-NVML, SHE-STT-MTJ-NVLCASF respectively. For total delay analysis the proposed method provides 31.65%, 34.03%, 26.97% lower than the existing method such as STT-PMTJ-NVSFF,

SHE-STT-MTJ-NVML, SHE-STT-MTJ-NVLCASF respectively. For PDP analysis the proposed method provides 25.96%, 35.87%, 16.76% lower than the existing method such as STT-PMTJ-NVSFF, SHE-STT-MTJ-NVML, SHE-STT-MTJ-NVLCASF respectively. For Write Energy analysis the proposed method provides 25.43%, 38.96%, 25.87% lower than the existing method such as STT-PMTJ-NVSFF, SHE-STT-MTJ-NVML, SHE-STT-MTJ-NVLCASF respectively. For Read Energy analysis the proposed method provides 35.87%, 27.97%, 39.43% lower than the existing method such as STT-PMTJ-NVSFF, SHE-STT-MTJ-NVML, SHE-STT-MTJ-NVLCASF respectively. For Write voltage analysis the proposed method provides 28.97%, 36.32%, 25.86% lower than the existing method such as STT-PMTJ-NVSFF, SHE-STT-MTJ-NVML, SHE-STT-MTJ-NVLCASF respectively. For read voltage analysis the proposed method provides 25.86%, 20.85%, 29.22% lower than the existing method such as STT-PMTJ-NVSFF, SHE-STT-MTJ-NVML, SHE-STT-MTJ-NVLCASF respectively. For Read Time analysis the proposed method provides 31.83%, 22.34%, 37.93% lower than the existing method such as STT-PMTJ-NVSFF, SHE-STT-MTJ-NVML, SHE-STT-MTJ-NVLCASF respectively. For write Time analysis the proposed method provides 24.02%, 31.83%, 27.94% lower than the existing methods, such as STT-PMTJ-NVSFF, SHE-STT-MTJ-NVML, and SHE-STT-MTJ-NVLCASF respectively.

## **5 Conclusion**

This paper proposes the RWPS–PMTJ combinational circuits for AND logic and Full adder using Spintronics devices. This design uses a minimum number of transistors with an increase in access speed. Timing issues and the conditions on signal transitions for valid outputs are analyzed in detail. The propagation delay of the proposed design is likened with the existing designs. The read and write access time is lower for RWPS switching. The critical path of logic gates lies in the complement output and the delay is reduced by introducing a buffer. The RWPS based full adder is more preferable than STT based full adder. The total propagation delay is less than 70% by introducing a bufer for applied volatile input. The RWPS-FA carry delay is 80% lower than STT-FA. The adder efficiency is improved in this paper. Especially CARRY output delay is signifcantly less and highly suitable for carry dependent adders. Set up time requirement of the STT based device is 67% higher than the RWPS FA. It is preferable to use RWPS based logic for high speed and STT logic for energy-efficient applications.

**Funding** No funding has been received.

**Data availability** Data sharing does not apply to this article as no new data was created or examined here.

#### **Declarations**

**Conflict of interest** The authors state that they have unknown competing fnancial interests or personal relationships that could have appeared to infuence the work reported here.

**Ethical approval** This article does not have any studies with human participants performed by any authors.

## **References**

- <span id="page-10-0"></span>1. Lakys, Y., Zhao, W., Klein, J., & Chappert, C. (2010). Low power, high reliability magnetic fip-fop. *Electronics Letters, 46*, 1493.
- <span id="page-10-1"></span>2. Wang, M., Cai, W., Cao, K., Zhou, J., Wrona, J., Peng, S., Yang, H., Wei, J., Kang, W., Zhang, Y., Langer, J., Ocker, B., Fert, A., & Zhao, W. (2018). Current-induced magnetization switching in atom-thick tungsten engineered perpendicular magnetic tunnel junctions with large tunnel magnetoresistance. *Nature Communications.* 9.
- 3. Prenat, G., Garello, K., Langer, J., Ocker, B., Cyrille, M., Gambardella, P., Tahoori, M., Gaudin, G., Jabeur, K., Vanhauwaert, P., Pendina, G., Oboril, F., Bishnoi, R., Ebrahimi, M., Lamard, N., & Boulle, O. (2016). Ultra-fast and high-reliability SOT-MRAM: From cache replacement to normally-off computing. *IEEE Transactions on Multi-Scale Computing Systems, 2*, 49–60.
- 4. Zabihi, M., Chowdhury, Z., Zhao, Z., Karpuzcu, U., Wang, J., & Sapatnekar, S. (2019). In-memory processing on the SpintronicCRAM: From hardware design to application mapping. *IEEE Transactions on Computers, 68*, 1159–1173.
- <span id="page-10-2"></span>5. Oboril, F., Bishnoi, R., Ebrahimi, M., & Tahoori, M. (2015). Evaluation of hybrid memory technologies using SOT-MRAM for on-chip cache hierarchy. *IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 34*, 367–380.
- <span id="page-10-3"></span>6. Das, J., Alam, S., & Bhanja, S. (2012). Ultra-low power hybrid CMOS-magnetic logic architecture. *IEEE Transactions on Circuits and Systems I: Regular Papers, 59*, 2008–2016.
- 7. Ikeda, S., Miura, K., Yamamoto, H., Mizunuma, K., Gan, H., Endo, M., Kanai, S., Hayakawa, J., Matsukura, F., & Ohno, H. (2010). A perpendicular-anisotropy CoFeB–MgO magnetic tunnel junction. *Nature Materials, 9*, 721–724.
- <span id="page-10-4"></span>8. Kishi, T., Yoda, H., Kai, T., Nagase, T., Kitagawa, E., Yoshikawa, M., & Takahashi, S. (2008). Lower-current and fast switching of a perpendicular TMR for high speed and high density spin-transfer-torque MRAM. In *2008 IEEE International Electron Devices Meeting*, 1–4. IEEE.
- <span id="page-10-5"></span>9. Monga, K., Chaturvedi, N., & Gurunarayanan, S. (2020). Energyefficient data retention in D flip-flops using STT-MTJ. *Circuit World, 46*, 229–241.
- <span id="page-10-6"></span>10. Nasri, A., Fathy, M., & Broumandnia, A. (2017). An energy-efficient 3D-stacked STT-RAM cache architecture for cloud processors: The efect on emerging scale-out workloads. *The Journal of Supercomputing, 74*, 1547–1561.
- <span id="page-10-7"></span>11. Jabeur, K., Pendina, G., Prenat, G., Buda-Prejbeanu, L., & Dieny, B. (2014). Compact modeling of a magnetic tunnel junction based on spin orbit torque. *IEEE Transactions on Magnetics, 50*, 1–8.
- <span id="page-10-8"></span>12. Kazemi, M., Rowlands, G., Ipek, E., Buhrman, R., & Friedman, E. (2016). Compact model for spin-orbit magnetic tunnel junctions. *IEEE Transactions on Electron Devices, 63*, 848–855.
- 13. Zhang, Y., Zhao, W., Lakys, Y., Klein, J., Kim, J., Ravelosona, D., & Chappert, C. (2012). Compact Modeling of Perpendicular-Anisotropy CoFeB/MgO Magnetic Tunnel Junctions. *IEEE Transactions on Electron Devices, 59*, 819–826.
- <span id="page-11-0"></span>14. Deng, E., Zhang, Y., Klein, J., Ravelsona, D., Chappert, C., & Zhao, W. (2013). Low Power Magnetic Full-Adder Based on Spin Transfer Torque MRAM. *IEEE Transactions on Magnetics, 49*, 4982–4987.
- <span id="page-11-1"></span>15. Sivaraman, E., & Manickachezian, R. (2019). Intelligent decision making service framework based on analytic hierarchy process in cloud environment. *International Journal of Networking and Virtual Organisations, 21*(2), 221–236.
- 16. Mythili, S., Thiyagarajah, K., Rajesh, P., & Shajin, F. H. (2020). Ideal position and size selection of unifed power fow controllers (UPFCs) to upgrade the dynamic stability of systems: An antlion optimiser and invasive weed optimisation algorithm. *HKIE Trans, 27*(1), 25–37.
- 17. Rajesh, P., & Shajin, F. (2020). A multi-objective hybrid algorithm for planning electrical distribution system. *European Journal of Electrical Engineering, 22*(4–5), 224–509.
- 18. Shajin, F. H., & Rajesh, P. (2020). Trusted secure geographic routing protocol: Outsider attack detection in mobile ad hoc networks by adopting trusted secure geographic routing protocol. *International Journal of Pervasive Computing and Communications*.
- <span id="page-11-2"></span>19. Thota, M. K., Shajin, F. H., & Rajesh, P. (2020). Survey on software defect prediction techniques. *International Journal of Applied Science and Engineering, 17*(4), 331–344.
- <span id="page-11-3"></span>20. Endoh, T., & Honjo, H. (2018). A recent progress of spintronics devices for integrated circuit applications. *Journal of Low Power Electronics and Applications, 8*, 44.
- <span id="page-11-4"></span>21. Fakhari, S., Bastani, N. H., & Moaiyeri, M. H. (2019). A lowpower and area-efficient quaternary adder based on CNTFET switching logic. *Analog Integrated Circuits and Signal Processing, 98*(1), 221–232.
- <span id="page-11-5"></span>22. Bastani, N. H., Moaiyeri, M. H., & Navi, K. (2018). An energyand area-efficient approximate ternary adder based on CNTFET switching logic. *Circuits, Systems, and Signal Processing, 37*(5), 1863–1883.
- <span id="page-11-6"></span>23. Hussain, I., & Chaudhury, S. (2021). Fast and high-performing 1-bit full adder circuit based on input switching activity patterns and gate difusion input technique. *Circuits, Systems, and Signal Processing, 40*(4), 1762–1787.
- <span id="page-11-7"></span>24. Mandal, S., Mandal, D., Mandal, M. K., & Garai, S. K. (2018). Design of optical quaternary adder and subtractor using polarization switching. *Journal of Optics, 47*(3), 332–350.
- <span id="page-11-8"></span>25. Amirany, A. & Rajaei, R. (2019). Spin-based fully nonvolatile full-adder circuit for computing in memory. In *Spin*, 9(0), 1950007. World Scientifc Publishing Company.
- <span id="page-11-9"></span>26. Alghareb, F. S., Zand, R., & DeMara, R. F. (2019). Non-volatile spintronic flip-flop design for energy-efficient SEU and DNU resilience. *IEEE Transactions on Magnetics, 55*(3), 1–11.
- <span id="page-11-10"></span>27. Razi, F., Moaiyeri, M. H., & Rajaei, R. (2020). Design of an energy-efficient radiation-hardened non-volatile magnetic latch. *IEEE Transactions on Magnetics, 57*(1), 1–10.
- <span id="page-11-11"></span>28. Rajaei, R., & Amirany, A. (2020). Nonvolatile low-cost approximate spintronic full adders for computing in memory architectures. *IEEE Transactions on Magnetics, 56*(4), 1–8.
- <span id="page-11-12"></span>29. Ibrahim, S., & Razavi, B. (2011). Low-power CMOS equalizer design for 20-Gb/s systems. *IEEE Journal of Solid-State Circuits, 46*, 1321–1336.

**Publisher's Note** Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s);

 $\circled{2}$  Springer

author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.



**A. Ranjani Aruna** (AntonyRaj Ranjani Aruna) received B.E. degree in ECE from M.I.E.T. institute of Engineering and Technology, Tiruchirappalli affiliated to Anna University, Tiruchirappalli, Tamil Nadu in 2011. M.E. Degree in VLSI Design from KCG College of Engineering and Technology, Karappakam affiliated to Anna University, Chennai, Tamil Nadu in 2014. She is currently working as a Teaching Fellow toward the Ph.D. degree at the Department of Electronics and Commu-

nication Engineering, College of Engineering Guindy, Anna University, Chennai, India. Her research interests include low power VLSI and VLSI application in spintronics.



**J. Kamala** (Jeyaraman Kamala) received B.E. degree from National Institute Technology, Tiruchirappalli, Tamil Nadu, India. M.E. Degree from National Institute Technology, Tiruchirappalli, Tamil Nadu, India. She is completed Ph.D. degree at the Department of Electronics and Communication Engineering, College of Engineering Guindy, Anna University, Chennai, India. She is currently working as a Associate Professor at the Department of Electronics and Communication

Engineering, College of Engineering Guindy, Anna University, Chennai, India. Her research interests include Solar application and Digital VLSI circuit.



**C. R. S. Hanuman** (Chunduri Ravi Shanker Hanuman) received B.Tech degree in ECE from Jawaharlal Nehru Technological University, Hyderabad, Andhra Pradesh in 2003 . M.Tech Degree in ECE from Jawaharlal Nehru Technological University, Hyderabad, Andhra Pradesh in 2007. He is completed Ph.D. degree at the Department of Electronics and Communication Engineering, College of Engineering Guindy, Anna University, Chennai, India. He is currently working as a Professor &

HOD at the Department of Electronics and Communication Engineering, Sasi Institute of Technology and Engineering, Tadepalligudem, Andhra Pradesh, India. His research interests include optimization of digital circuits and low power VLSI.