

# **Low power content addressable memory using common match line scheme for high performance processors**

**K. Muralidharan<sup>1</sup> · S. Uma Maheswari<sup>1</sup> · T. Balakumaran<sup>1</sup>**

Received: 6 June 2024 / Revised: 6 June 2024 / Accepted: 18 June 2024 © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024

#### **Abstract**

Content Addressable Memory (CAM) is used in artificial neural networks, data compression, IP packet filtering, and network routers because of the high search performance it offers to the microprocessor. However, the use of CAM is limited by its high power consumption, especially in the highly capacitive Match-Lines (ML). The activation of every comparison circuit on every clock cycle is primarily responsible for the significant power dissipation, since it leads to increased pre-charge activity and multiple transitions on the ML. To overcome this issue, a novel Common Match Line Scheme (CMS) with a Pull Up/Pull Down (PUPD) circuit is proposed. In the CMS CAM architecture, the mismatched tagline entries are kept in the pre-discharged phase, and only the matching tagline entry is charged. Consequently, the approach reduces pre-charge activity and evaluation power, lowering CAM power dissipation by 13–45% and delay by 3–16% compared with the existing architectures, without significant impact on the performance of the processor. The proposed CMS CAM also outperforms the existing architectures in terms of noise with minimal area overhead, and it is technology independent, so it can be used in high-performance microprocessor systems.

**Keywords** Power efficient  $\cdot$  NOR type match line  $\cdot$  Content addressable memory  $\cdot$  High speed

## **1 Introduction**

The memory unit is essential for storing and accessing data in the processor. Every instruction in the program is stored in the appropriate memory location, and the instructions are executed one by one under the control unit. Speed has increased in contemporary high-performance processors. Nonetheless, improvements to memory units also focus on low power consumption, higher data storage density in a smaller footprint, and reduced latency without adversely affecting CPU performance. A Content Addressable Memory (CAM) is specifically designed for high-performance systems in order to achieve the desired speed.

$\boxtimes$ K. Muralidharan: muralidharan.k@cit.edu.in

S. Uma Maheswari: umamaheswari@cit.edu.in

T. Balakumaran: balakumaran@cit.edu.in

<sup>1</sup> Department of ECE, Coimbatore Institute of Technology, Coimbatore, India

A CAM provides the addresses of the matched tagline data after the input data has been compared with all of the data taglines stored in the memory cells. On network devices, CAM enables software and hardware-based search engines by offering single-clock-cycle throughput, and it finds applications in high-speed search scenarios. An SRAM cell is a fundamental memory unit that stores data using binary values, 1s and 0s. The number of CMOS transistors within an SRAM cell varies with the specific type of SRAM. Traditionally, the 6T SRAM configuration has been widely used for data storage. However, conventional 6T SRAM tends to consume a significant amount of power, leading to a degradation in system performance [[5](#page-10-0)]. A variety of methods need to be applied to CAM cells to reduce this power consumption and produce low-power memory cells. Because CAM cells enable data retrieval by content rather than by address, they drastically decrease the time needed to find stored data in memory. This unique attribute makes CAM a valuable component in high-speed search applications [\[17\]](#page-10-1). Compared with alternative memory search algorithms currently in use, CAMs provide a noticeable performance advantage: they shorten access times by concurrently comparing the requested data with the complete list of pre-stored data items. A CAM combines SRAM storage with comparison circuitry, which allows an entire search operation within a single clock cycle.
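The content-based lookup described above can be illustrated with a minimal behavioural sketch. It is not the hardware implementation: the function name `cam_search` and the 4-entry table are assumptions made for illustration, and the loop only models the result that a CAM produces in parallel within one cycle.

```python
from typing import List


def cam_search(stored_words: List[int], search_word: int) -> List[int]:
    """Return the addresses of every stored word equal to the search word.

    A hardware CAM compares all rows at once; this loop only models the result.
    """
    return [addr for addr, word in enumerate(stored_words) if word == search_word]


# Hypothetical 4-entry CAM holding 8-bit tags.
table = [0b10110010, 0b00001111, 0b10110010, 0b11110000]
matches = cam_search(table, 0b10110010)
print(matches)  # prints [0, 2]
```

The key point is that the caller supplies data and receives addresses, which is the reverse of a conventional RAM access.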

Common applications of CAMs involve search-intensive tasks, especially data packet forwarding, filtering, splicing, data compression, and data packet classification within network routers. With the expanding range of applications for CAMs, the need for larger CAM sizes has become more noticeable. However, this growth in size also exacerbates the power consumption challenge. Reducing power consumption while maintaining speed and area efficiency has become a central focus of recent research on large-capacity CAMs [[15\]](#page-10-2). In this paper, we first analyse methods that enhance speed while simultaneously reducing power consumption through the use of different architectures. Subsequently, we compare the performance of the existing and proposed architectures.

## **2 Related work and background**

Low-power CAM architectures that use pass-transistor-logic comparison and weak-gate evaluation in high-delay designs have been discussed in [[2,](#page-10-3) [4](#page-10-4), [9](#page-10-5), [11,](#page-10-6) [14,](#page-10-7) [17](#page-10-1), [20–](#page-10-8)[25](#page-10-9)]. High-speed designs have used low-power methods based on NOR-type CAM cells [[7](#page-10-10)–[10](#page-10-11), [13](#page-10-12), [18](#page-10-13), [19](#page-10-14)]. In these cells, the dynamic or transmission-gate comparison unit and the evaluation phase are controlled by rail-to-rail supply voltages. Match lines (MLs) settle more quickly in this configuration [[1](#page-9-0), [3](#page-10-15), [8](#page-10-16), [14](#page-10-7)]. To balance power consumption and delay, the self-power-off approach integrates two supply voltages within the Gated-Power CAM (GP-CAM) [[18](#page-10-13)]. In GP-CAM cells, the MLs are pre-charged to a voltage level lower than the supply voltage, which reduces match-line power consumption. This approach combines a selective pre-discharge/pre-charge technique with charge sharing between master and slave match lines [[5,](#page-10-0) [9](#page-10-5), [14\]](#page-10-7) and among ML segments [[4,](#page-10-4) [12,](#page-10-17) [13](#page-10-12)]. The goal is to further reduce ML power consumption. In precomputation-based architectures, an initial comparison step is performed during the pre-processing of a group of search bits [\[15](#page-10-2), [16](#page-10-18)], so that search power consumption is reduced at the evaluation stage of the search cycle. Segmentation schemes filter out the majority of mismatched MLs, which reduces the number of needless ML discharges during evaluation. To facilitate parallel search and lower power consumption within each segment, multiple NAND ML segments also operate independently [[16\]](#page-10-18). A shared match-line scheme is used to reduce the power in NOR-type ML CAM [[23](#page-10-19)].

To enhance efficiency, a charge-sharing technique is combined with a segmentation scheme. This strategy aims to decrease match-line swing and reduce charge loss during the search process [[17,](#page-10-1) [22\]](#page-10-20). Clamping and current-limiting techniques in the NOR match-line segments reduce the overall power consumption [[18](#page-10-13)]. To lower the ML switching capacitance, a set of match lines is divided into clusters of local NOR CAM cells and global NAND CAM cells. Shorter NAND match lines lower the search power used in the dominant NOR match lines by acting as a filter for mismatched conditions [\[20,](#page-10-8) [21\]](#page-10-21). Pre-charge and pre-discharge prior to each evaluation phase are made possible by the complementary charge characteristics of the nMOS and pMOS transistors in the comparison/evaluation logic, which also helps to reduce match-line swing in the differential match-line segments [[7](#page-10-10), [25](#page-10-9)]. Mismatching match lines are predicted earlier by using dynamically varying signals, limiting match-line swings to the lower precharge phase [[19](#page-10-14)]. The majority of these methods involve gate-keeping and charging or discharging all of the cells [\[6](#page-10-22), [7,](#page-10-10) [9,](#page-10-5) [19](#page-10-14)] or selected groups of cells [[10,](#page-10-11) [13](#page-10-12), [14](#page-10-7), [25,](#page-10-9) [26](#page-10-23)] in a match-line row using extra ML control or sensing circuits. As a result, these methods reduce search speed and add area overhead. According to the literature, the match line, comparison unit, encoder, decoder, and sense amplifier are the power-consuming parts of the CAM architecture. Among these, unnecessary precharge and discharge of the match line is the dominant contributor, and it is therefore the focus of the proposed CAM architecture.
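The filtering effect of segmentation surveyed above can be sketched behaviourally. The following is only an illustration under simplifying assumptions: words are bit strings, each row stops evaluating at its first mismatching segment, and the function name `segmented_search` and the example data are hypothetical rather than taken from any cited design.

```python
from typing import List, Tuple


def segmented_search(stored: List[str], key: str, seg_bits: int) -> Tuple[List[int], int]:
    """Compare words segment by segment, stopping a row at its first mismatch.

    Returns (matching row indices, number of segment evaluations performed).
    """
    matches: List[int] = []
    evaluations = 0
    for row, word in enumerate(stored):
        hit = True
        for start in range(0, len(key), seg_bits):
            evaluations += 1
            if word[start:start + seg_bits] != key[start:start + seg_bits]:
                hit = False  # later segments of this row are never evaluated
                break
        if hit:
            matches.append(row)
    return matches, evaluations


rows = ["10110010", "00001111", "10111111", "10110010"]
result = segmented_search(rows, "10110010", seg_bits=4)
print(result)  # prints ([0, 3], 7)
```

With two 4-bit segments per row, an unsegmented search would evaluate all 8 segments, whereas the sketch evaluates 7 because the second row is filtered out after its first segment.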

## **3 Existing CAM architectures**

The traditional CAM architecture stores data in memory and uses a comparison circuit with a search line to enter the search data. The comparison operation compares the input data supplied on the search tag lines with the stored data [\[7](#page-10-10)]. When comparing the NAND and NOR CAM architectures, it is important to note that NAND CAMs tend to consume less power but operate at a slower search speed, whereas NOR CAMs search faster but consume more power.

Figure [1](#page-2-0)(a) and (b) depict these two architectures, NAND and NOR, respectively.
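The logical difference between the two match-line styles can be sketched as follows. This is a behavioural illustration only, ignoring all electrical detail; the function names are assumptions. In the NOR style, every mismatching cell opens a parallel pull-down path on a precharged line, whereas in the NAND style the cells form a series chain that conducts only when every cell matches.

```python
def nor_matchline(stored: str, key: str) -> bool:
    """NOR-type ML: stays high only if no cell opens a parallel pull-down path."""
    pull_down_paths = sum(s != k for s, k in zip(stored, key))
    return pull_down_paths == 0


def nand_matchline(stored: str, key: str) -> bool:
    """NAND-type ML: the series chain conducts only if every cell matches."""
    return all(s == k for s, k in zip(stored, key))


for word in ("1011", "1001"):
    # Both styles report the same match result; they differ in how it is reached.
    print(word, nor_matchline(word, "1011"), nand_matchline(word, "1011"))
```

Both functions return the same match decision. The trade-off noted in the text comes from the circuit structure: many parallel paths discharge quickly but switch often, while a long series chain switches rarely but evaluates slowly.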



<span id="page-2-0"></span>**Fig. 1 a** NAND CAM architecture, **b** NOR CAM architecture



<span id="page-2-1"></span>**Fig. 2** XOR CAM cell architecture

The XOR CAM cell architecture is shown in Fig. [2.](#page-2-1) The CAM cell is accessed through access transistors [\[9](#page-10-5)]. There are two PMOS transistors in the comparison circuit, and four transistors form the cross-coupled inverters. Bitline, Bitlinebar, WriteLine, Searchline, and SearchlineBar are inputs, and Data, Databar, and Matchline are outputs. WriteLine is activated during the write process; the data to be stored is supplied at Bitline, while its complement is supplied at Bitlinebar. The data is then saved in the CAM cell. During the comparison, the stored data is compared with the search data supplied on Searchline and SearchlineBar.



<span id="page-2-2"></span>**Fig. 3** XNOR CAM Cell architecture



<span id="page-2-3"></span>**Fig. 4** PF CAM Cell architecture

The XNOR CAM cell uses eight MOSFETs and is made up of a comparison circuit and a standard 6T SRAM cell. For the cell access depicted in Fig. [3,](#page-2-2) two NMOS transistors act as access transistors. The comparison circuit is composed of two PMOS transistors. Each inverter is built from one NMOS and one PMOS transistor, and the cross-coupled inverters form the CAM cell's storage unit. In the XNOR CAM cell, the outputs are Data, Databar, and Matchline, and the inputs are Bitline, Bitlinebar, Writeline, Searchline, and Searchlinebar. It functions without requiring an initial Match Line charge.

Figure [4](#page-2-3) shows the Precharge Free (PF) CAM architecture. Its signals include Writeline, Bitline, Bitlinebar, Searchline, and Searchlinebar. Most importantly, it works without requiring Matchline to be fully charged beforehand. When writing, WL is activated, and the data and its complement are supplied at Bitline and Bitlinebar, respectively. During a search, the data stored in the CAM cell is compared with the data being sought. This comparison has two possible outcomes: match and mismatch. When the searched data matches the stored data, Matchline is driven to the high level. When the searched and stored data differ, Matchline remains low.
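The write and search behaviour of such a cell can be summarized in a small logic-level model. This is a sketch under stated assumptions: the class name `PrechargeFreeCell` is hypothetical, timing and transistor-level behaviour are ignored, and Matchline is assumed to be low on a mismatch, in line with the description of the precharge-free cells in this section.

```python
class PrechargeFreeCell:
    """Behavioural model of one precharge-free CAM bit cell (logic only, no timing)."""

    def __init__(self) -> None:
        self.data = 0

    def write(self, bitline: int, write_line: int) -> None:
        # Data is stored only while the write line is active.
        if write_line:
            self.data = bitline

    def search(self, search_line: int) -> int:
        # Matchline is high (1) on a match and low (0) on a mismatch;
        # no precharge phase is modelled before the comparison.
        return 1 if self.data == search_line else 0


cell = PrechargeFreeCell()
cell.write(bitline=1, write_line=1)
print(cell.search(1), cell.search(0))  # prints 1 0
```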

The Self-Controlled Pre-charge Free (SCPF) CAM cell combines a standard 6T SRAM cell with a unique comparison circuit, as shown in Fig. [5](#page-3-0). Ten MOSFETs are used, including two NMOS access transistors for cell access and two NMOS transistors in the comparison circuit. Additionally, the charge control circuit, which consists of an NMOS transistor and a PMOS transistor, operates under both match and mismatch conditions. The Matchline output is activated in accordance with the charge on node S [[14\]](#page-10-7). The inputs of the SCPF CAM cell are Bitline, Bitlinebar, Writeline, Searchline, and Searchlinebar, and its outputs are Data, Databar, and Matchline.

The cell functions well even without pre-charging Matchline. During a search operation, the input data and the stored data of the CAM cell are compared. In the case of a mismatch, the match line stays low, and in the case of a match, it remains high. WL is turned off during comparison. In a matched scenario, node S is high, and Matchline is pulled high through the NMOS transistor. During a mismatch, node S is low, which causes Matchline to discharge through the PMOS transistor [[14\]](#page-10-7).

The Modified Pre-charge Free (MPF) CAM cell uses ten MOSFETs, comprising a basic 6T SRAM cell and a comparison circuit. Two NMOS transistors act as access transistors to allow cell access, and there are two NMOS transistors in the comparison circuit. Furthermore, in both match and mismatch scenarios, the charge control circuit, which consists of a PMOS transistor and an NMOS transistor, drives the output Matchline in response to the charge on node S.

Figure [6](#page-3-1) depicts the circuit of the MPF CAM cell. Its inputs are WriteLine, Searchline, Searchlinebar, Bitline, and Bitlinebar, and its outputs are Data, Databar, and ML. It functions without requiring ML to be pre-charged. When writing, WL is turned on, and the data and its complement are supplied at BL and BLB, respectively. During a search, the CAM's stored data is compared with the input data supplied on Searchline and Searchlinebar. When a match occurs, Matchline rises, and when a mismatch occurs, it stays low. WL is turned off while the comparison is being done [[25](#page-10-9)].



<span id="page-3-0"></span>**Fig. 5** SCPF CAM Cell architecture

<span id="page-3-1"></span>**Fig. 6** MPF CAM Cell architecture

The NMOS transistor allows Matchline to go high if a match is detected, in which case node S stays at a low level. Under mismatched conditions, node S goes high, which causes Matchline to discharge to low through the PMOS transistor.

The SMS architecture of a CAM cell is shown in Fig. [7](#page-4-0). NAND-cell Matchlines suffer from residual charge caused by the cascading of intermediate cells and from long delay caused by the serial discharge of transistors. This limits the word length that can be used in CAM cells with NAND-cell Matchlines, so longer word lengths are less beneficial for them. In the SMS scheme, when the stored data bits and the searched data bits match, the evaluation nodes in both cells go to '0', and consequently the final evaluation node adopts a LOW logic state. Matchline is designed to discharge to ground (GND) through a transistor. In a 2-bit comparison-evaluation, each mismatch is handled by a single Matchline discharge path [\[23](#page-10-19)].

## **4 Proposed CAM architecture**

High-speed searching is accomplished with NOR-type MLs, albeit at the expense of significant power consumption. In an $N \times M$ CAM, a search with a unique word match discharges "N-1" match-lines. Furthermore, different MLs in the CAM array have different numbers of mismatched cells: some MLs discharge through a greater number of mismatched cells, others through fewer. Therefore, during the phase transition from precharge to evaluation, high discharge rates cause large Matchline power consumption and strong Matchline switching activity.
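The "N-1" observation and the uneven number of mismatched cells per row can be checked with a short behavioural sketch. The helper names and the 4-word example array are assumptions chosen for illustration; the per-row count is simply the Hamming distance between the stored word and the search key.

```python
from typing import List


def hamming_distance(a: str, b: str) -> int:
    """Number of bit positions in which two equal-length words differ."""
    return sum(x != y for x, y in zip(a, b))


def matchline_activity(stored: List[str], key: str) -> List[int]:
    """Mismatched cells per row, i.e. parallel discharge paths on each NOR ML."""
    return [hamming_distance(word, key) for word in stored]


stored = ["1011", "0000", "1111", "1010"]  # N = 4 unique words
activity = matchline_activity(stored, "1011")
discharging = sum(1 for d in activity if d > 0)
print(activity, discharging)  # prints [0, 3, 1, 1] 3
```

Only the matching row keeps its precharged state, so N-1 = 3 match-lines discharge, and they do so through 3, 1, and 1 mismatched cells respectively.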

The proposed Common Matchline Scheme (CMS) with a PUPD network is depicted in Fig. [8.](#page-5-0) It aims to enhance the power efficiency of a CAM system that uses NOR Matchlines during the search operation. It comprises the high-speed 10T NOR-type cell shown in Fig. [8](#page-5-0)(c) and (d). The enhancement is accomplished by applying a Match-Line control unit (MLCU) to the segmented parts of the NOR-ML, which allows selective evaluation for every search. This results in a more straightforward precharge between searches. Fig. [8](#page-5-0)(a) displays a block diagram of the proposed CMS-CAM with PUC/PDC networks, and the proposed Matchline structure is illustrated in Fig. [8](#page-5-0)(b) with an N-bit word structure that is divided into N and K bit match-line partitions. Apart from the Matchline sections (ML1 and ML2), the output Match-line (MLN) indicates the match result of the search.

<span id="page-4-0"></span>**Fig. 7** SMS architecture of CAM cell [[23](#page-10-19)]

<span id="page-5-0"></span>**Fig. 8** Proposed architecture of CAM using PUPD scheme

During the search process, the majority of the words in a Content Addressable Memory array are mismatched. In conventional and contemporary Matchline techniques, every one-bit cell contains its own compare-evaluation logic unit. Every mismatching cell in the CAM causes a Matchline state change (precharge to discharge). If all bits on a Matchline mismatch, the worst-case discharge occurs through 'N' paths. The proposed common Matchline scheme (CMS), illustrated in Fig. [8,](#page-5-0) seeks to decrease the number of discharge pathways. Fig. [8](#page-5-0)(b) illustrates a 2-bit CAM cell in which the left and right-side storage are compared in a common evaluation block. Each search is preceded by a default phase (precharge) using the sense amplifier's precharge transistor, as illustrated in Fig. [8](#page-5-0)(c). If the stored bits match the related search bits, the two cells' evaluation nodes attain '0', resulting in LOW logic being transmitted to the final evaluation node. As a result, the common Matchline node ($ML_{1/2}$) keeps its precharged state, indicating a final match. $E_{1/2}$ is driven with HIGH logic if either or both bits are mismatched, and so $ML_{1/2}$ discharges to ground via transistor $M_E$. As a result, each mismatch generates one ML discharge path for the 2-bit comparison-evaluation. Mismatching Matchline rows undergo state changes during evaluation, but matching rows retain their initial precharged states.

In conventional schemes, each ML has as many discharge paths as it has mismatching cells [[1](#page-9-0), [3](#page-10-15), [15](#page-10-2)]. In contrast, the proposed CAM caps the number of paths at one per 2-bit pair, because two mismatching cells in a pair contribute just one path for Matchline discharge. In the worst-case scenario of an N-bit mismatch, there are only 'N∕2' Matchline discharge pathways, which is half the number in traditional and current techniques. As a consequence, the amount of Matchline switching in every row is minimized, allowing the CMS-CAM to save dynamic power during search. The proposed approach may, however, compromise ML latency because of the two-step evaluation per two cells. The proposed CAM design outperforms the existing architectures at the cost of some additional area overhead. In the proposed CAM, the 2-bit evaluation comparison unit using PUPD shares a common match line, which also reduces overall power consumption compared with the existing CAM architectures.
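The path-count argument can be made concrete with a small behavioural sketch. It models only the logic described in this section, where each 2-bit pair has a common evaluation node $E_{1/2} = E_1 + E_2$ that opens a single discharge path; the function names are assumptions, and the sketch says nothing about timing or transistor sizing.

```python
def conventional_paths(stored: str, key: str) -> int:
    """One discharge path per mismatching cell (conventional NOR match line)."""
    return sum(s != k for s, k in zip(stored, key))


def cms_paths(stored: str, key: str) -> int:
    """One discharge path per 2-bit pair whose common node E_1/2 = E1 OR E2 is high."""
    paths = 0
    for i in range(0, len(key), 2):
        # The pair mismatches if either of its two bits mismatches.
        if stored[i:i + 2] != key[i:i + 2]:
            paths += 1
    return paths


word = "10110010"
for key in ("10110010", "10110011", "10110001", "01001101"):
    print(key, conventional_paths(word, key), cms_paths(word, key))
```

For the all-bits-mismatch key `01001101`, the conventional count is 8 while the common-match-line count is 4, which is the N/2 worst case stated above. A single mismatching bit gives one path in both schemes, so the saving appears only when both bits of a pair mismatch.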

## **5 Simulation and results**

The Cadence Virtuoso tool is used to design the various CAM architectures in 45 nm, 20 nm, 16 nm, and 7 nm technologies. Simulations are consistently conducted at 27

<span id="page-6-0"></span>**Table 1** Operation of the 2Bit-CE CAM cell

| Phase      | $E_1$ ($SL_1 \oplus Q_1$) | $E_2$ ($SL_2 \oplus Q_2$) | $E_{1/2}$ ($E_1 + E_2$) | $ML_{1/2}$ (Matchline) | Output (State) |
|------------|---------------------------|---------------------------|-------------------------|------------------------|----------------|
| Precharge  | Don't care                | Don't care                | Don't care              | 1                      | Default        |
| Evaluation | Match                     | Match                     | 0                       | 1                      | Match          |
| Evaluation | Mismatch                  | Match                     | 1                       | 0                      | Mismatch       |
| Evaluation | Match                     | Mismatch                  | 1                       | 0                      | Mismatch       |
| Evaluation | Mismatch                  | Mismatch                  | 1                       | 0                      | Mismatch       |

<span id="page-6-1"></span>**Table 2** Average power consumption in  $\mu$W for a CAM cell with  $8 \times 8$  bits at various voltages

| Voltage (V) | NOR CAM | NAND CAM | XOR CAM | XNOR CAM | PF CAM | SCPF CAM | MPF CAM | SMS CAM | Proposed CAM |
|-------------|---------|----------|---------|----------|--------|----------|---------|---------|--------------|
| 1.2         | 35.47   | 37.23    | 3.5     | 3.236    | 3.498  | 3.808    | 1.859   | 1.58    | 1.36         |
| 1.1         | 26.67   | 28.39    | 2.346   | 2.199    | 2.183  | 2.48     | 1.206   | 1.25    | 1.22         |
| 1.0         | 19.29   | 20.94    | 1.518   | 1.439    | 1.339  | 1.544    | 0.7441  | 0.67    | 0.19         |
| 0.9         | 13.26   | 14.79    | 0.9438  | 0.901    | 0.802  | 0.915    | 0.4322  | 0.32    | 0.09         |
| 0.8         | 8.554   | 9.857    | 0.5615  | 0.5362   | 0.4608 | 0.514    | 0.2347  | 0.21    | 0.07         |
| 0.7         | 5.098   | 6.048    | 0.3166  | 0.3006   | 0.2498 | 0.2709   | 0.118   | 0.10    | 0.05         |

<span id="page-6-2"></span>**Table 3** Delay in nanoseconds for 8×8 bits CAM cells at various voltages

| Voltage (V) | NOR CAM | NAND CAM | XOR CAM | XNOR CAM | PF CAM | SCPF CAM | MPF CAM | SMS CAM | Proposed CAM |
|-------------|---------|----------|---------|----------|--------|----------|---------|---------|--------------|
| 1.2         | 20.385  | 0.5208   | 0.9776  | 1.7255   | 0.3972 | 1.5975   | 0.4598  | 0.3976  | 0.3525       |
| 1.1         | 20.34   | 0.5607   | 0.8518  | 1.7815   | 0.2868 | 1.6835   | 0.4336  | 0.3824  | 0.3245       |
| 1.0         | 20.345  | 0.5966   | 0.6959  | 1.86     | 0.2023 | 1.786    | 0.4014  | 0.3790  | 0.3002       |
| 0.9         | 20.405  | 0.6189   | 0.4954  | 1.9675   | 0.0853 | 1.9105   | 0.3613  | 0.3523  | 0.2908       |
| 0.8         | 20.495  | 0.6051   | 0.2103  | 2.117    | 0.0381 | 2.067    | 0.3118  | 0.3018  | 0.1020       |
| 0.7         | 20.63   | 0.5441   | 0.1060  | 2.347    | 0.0261 | 2.29     | 0.2489  | 0.230   | 0.0150       |

<span id="page-6-3"></span>**Table 4** Power Delay Product (Energy in fJ) for CAM cells with 8×8 bits at various voltages

| Voltage (V) | NOR CAM | NAND CAM | XOR CAM | XNOR CAM | PF CAM | SCPF CAM | MPF CAM | SMS CAM | Proposed CAM |
|-------------|---------|----------|---------|----------|--------|----------|---------|---------|--------------|
| 1.2         | 721.63  | 19.39    | 3.42    | 5.58     | 1.389  | 6.083    | 0.855   | 0.945   | 0.914        |
| 1.1         | 542.47  | 15.92    | 1.99    | 3.92     | 0.626  | 4.175    | 0.523   | 0.510   | 0.429        |
| 1.0         | 392.46  | 12.49    | 1.056   | 2.68     | 0.271  | 2.757    | 0.299   | 0.492   | 0.321        |
| 0.9         | 270.57  | 9.15     | 0.467   | 1.77     | 0.068  | 1.748    | 0.156   | 0.327   | 0.301        |
| 0.8         | 175.31  | 5.96     | 0.118   | 1.135    | 0.017  | 1.062    | 0.073   | 0.051   | 0.039        |
| 0.7         | 105.17  | 3.29     | 0.034   | 0.705    | 0.0065 | 0.620    | 0.029   | 0.018   | 0.009        |

<span id="page-6-4"></span>**Table 5** Noise in dB for 8×8 bits CAM cells



degrees Celsius. With supply voltages ranging from 0.7 to 1.2 V, the test environment was created and exercised using identical input patterns for both the proposed and the conventional CAM designs. The activity of the CAM cell is described in Table [1](#page-6-0) in terms of four different searches of the cell. The average power consumption, delay, and power-delay product of the existing CAM architectures and the proposed CAM design based on the common match line scheme using

<span id="page-7-0"></span>**Table 6** Energy for 8×8 bits CAM cells (fJ/bit/search)

| Voltage (V) | NOR CAM | NAND CAM | XOR CAM | XNOR CAM | PF CAM | SCPF CAM | MPF CAM | SMS CAM | Proposed CAM |
|-------------|---------|----------|---------|----------|--------|----------|---------|---------|--------------|
| 1.2         | 7.956   | 2.23     | 0.973   | 0.411    | 0.246  | 0.315    | 0.446   | 0.286   | 0.251        |

<span id="page-7-1"></span>**Table 7** Area (μm²) for 8×8 bits CAM cells



PUPD at various voltages (from 1.2 to 0.7 V) are shown in Table [2](#page-6-1), Table [3](#page-6-2), and Table [4](#page-6-3), respectively, for the  $8 \times 8$  bit array. Table [5](#page-6-4), Table [6](#page-7-0), and Table [7](#page-7-1) describe the noise, energy, and area of the existing and proposed CAM designs, respectively. From the above results, the









<span id="page-7-2"></span>**Fig. 9 a** Power consumption of existing SMS and proposed CAM schemes for 8×8 array at different nm technologies, **b** Delay analysis of existing SMS and proposed CAM schemes for 8×8 array at different nm technologies, **c** Noise analysis of existing SMS and proposed CAM schemes for 8×8 array at different nm technologies, **d** Energy of existing SMS and proposed CAM schemes for 8×8 array at different nm technologies









<span id="page-8-0"></span>**Fig. 10 a** Power consumption of Existing and proposed Schemes for 8×8 array, **b** power consumption of Existing and proposed Schemes for 16×16 array, **c** power consumption of Existing and proposed Schemes for  $32 \times 32$  array

<span id="page-8-1"></span>**Fig. 11 a** Delay results of Existing and proposed Schemes for 8×8 array, **b** Delay results of Existing and proposed Schemes for 16×16 array, **c** Delay results of Existing and proposed Schemes for 32×32 array



<span id="page-9-1"></span>**Fig. 12** Layout of the proposed CAM cell using 45 nm

proposed CAM design produces better results than the existing CAM architectures.
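The relationship between the power, delay, and power-delay-product tables can be checked with a short calculation. Since 1 µW × 1 ns = 1 fJ, the product of a Table 2 entry and the corresponding Table 3 entry should reproduce the Table 4 entry. The function name below is an assumption, and the two spot checks use the 1.1 V values for the NAND and MPF cells as listed in the tables.

```python
def power_delay_product_fj(power_uw: float, delay_ns: float) -> float:
    """Power-delay product in femtojoules: 1 uW * 1 ns = 1e-6 W * 1e-9 s = 1 fJ."""
    return power_uw * delay_ns


# Spot checks at 1.1 V using the tabulated power (uW) and delay (ns).
nand_pdp = power_delay_product_fj(28.39, 0.5607)
mpf_pdp = power_delay_product_fj(1.206, 0.4336)
print(round(nand_pdp, 2), round(mpf_pdp, 3))  # prints 15.92 0.523
```

Both results agree with the values listed for those cells in Table 4 at 1.1 V.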

A performance comparison of the existing SMS CAM [[23](#page-10-19)] and the proposed CAM for  $8 \times 8$  bits in terms of average power consumption, delay, noise, and energy is shown in Fig. [9](#page-7-2)(a), (b), (c), and (d), respectively, for the 45 nm, 20 nm, 16 nm, and 7 nm technologies. From this analysis, the proposed CAM architecture is technology independent and performs better than the existing SMS CAM architecture in terms of average power consumption (16%), delay (11%), noise (5%), and energy (10%). The average power consumption of the existing SMS CAM [[23](#page-10-19)] and the proposed CAM for  $8 \times 8$,  $16 \times 16$, and  $32 \times 32$  bits is shown in Fig. [10](#page-8-0)(a), (b), and (c), respectively, using 45 nm technology. The delay analysis for the existing SMS CAM [[23\]](#page-10-19) and the proposed CAM architectures is presented in Fig. [11](#page-8-1)(a), (b), and (c) for  $8 \times 8$,  $16 \times 16$, and  $32 \times 32$  bits, respectively, using 45 nm technology. Based on these graphs, the new CMS architecture yields better power consumption and delay results than the existing techniques even as the number of bits in the CAM increases.

Figure [12](#page-9-1) shows the layout of the proposed CAM. To sum up, the data suggest that employing a PUPD network in the CMS CAM leads to 13%–60% lower power consumption compared with conventional memory architectures. This could prove advantageous in scenarios that prioritize power efficiency, such as mobile devices or energy-saving systems. The proposed CMS CAM also produced better results than the existing architectures in terms of noise, delay, and energy.
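A relative reduction of this kind can be computed from any pair of tabulated values. The sketch below is an illustration only: the function name is an assumption, and the example uses the 1.2 V average power of the SMS CAM (1.58 µW) and the proposed CAM (1.36 µW) from Table 2, which is not necessarily the comparison the reported range was derived from.

```python
def percent_reduction(baseline: float, proposed: float) -> float:
    """Relative saving of `proposed` with respect to `baseline`, in percent."""
    return (baseline - proposed) / baseline * 100.0


# Average power at 1.2 V from Table 2: SMS CAM vs proposed CAM (uW).
saving = percent_reduction(1.58, 1.36)
print(round(saving, 1))  # prints 13.9
```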

## **6 Conclusion**

To reduce the number of match-line discharge pathways during search, a common match line scheme (CMS) is introduced. The novel CAM design performs better than conventional designs in terms of noise, power consumption, and latency. The proposed CMS-CAM architecture with a PUPD network demonstrates a 13%–60% reduction in power consumption and a 3–16% decrease in delay compared with conventional architectures. These results were obtained in 45 nm, 20 nm, 16 nm, and 7 nm technologies using the Cadence tool across array sizes of  $8 \times 8$  bits,  $16 \times 16$  bits, and  $32 \times 32$  bits. By integrating a PUPD network, the proposed CMS CAM design achieves a balance between power efficiency and delay optimization, offering a promising solution for high-performance processors without any significant impact on their performance. The search performance of the proposed scheme is also shown to be consistent under a range of operating conditions and efficient for larger macro-sized designs. For data processing in cache tags and data compressors, as well as for information interchange in network routers, the proposed CMS-CAM may prove helpful to HSEs in implementing lookup table management and high-performance systems.

**Author contributions** Muralidharan: wrote the manuscript. Uma Maheswari: supervised the work. Balakumaran: proofread the article.

**Funding** Not applicable.

#### **Declarations**

**Competing interests** The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

**Human and animal rights** This study does not include any human participants or animals; hence, any informed consent or animal welfare statement does not apply to this study.

## **References**

- <span id="page-9-0"></span>1. Qazi, A., Hafeez, A., & Ullah, Z. (2021). Fast mapping and updating algorithms for a binary CAM on FPGA. *IEEE Canadian Journal of Electrical and Computer Engineering, 44*(2), 156–164.
- <span id="page-10-3"></span>2. Wang, C., Zhang, D., Zeng, L., & Zhao, W. (2020). Design of magnetic non-volatile TCAM with priority-decision in memory technology for high speed, low power, and high reliability. *IEEE Transactions Circuits Systems, 14*(3), 464–474.
- <span id="page-10-15"></span>3. Datta, D., Surana, N., Kumar, A., & Mekie, J. (2022). A 10T, 0.22fJ/Bit/Search mixed-VT pseudo precharge-free content addressable memory. *IEEE Transactions on Circuits and Systems II Express Briefs, 69*(3), 1572–1576.
- <span id="page-10-4"></span>4. Garzón, E., Golman, R., Jahshan, Z., Hanhan, R., Vinshtok-Melnik, N., Lanuzza, M., Teman, A., & Yavits, L. (2022). Hamming distance tolerant content-addressable memory (HD-CAM) for DNA classification. *IEEE Access, 10*(3), 28080–28093.
- <span id="page-10-0"></span>5. Kim, H., Cho, M., Lee, S., Kwon, H. S., Choi, W. Y., & Kim, Y. (2022). Content-addressable memory system using a nanoelectromechanical memory switch. *Electronics, 11*(3), 481–495.
- <span id="page-10-22"></span>6. Durai, J., Rajagopal, S., & Ganesan, G. (2020). Design and analysis of power efficient quasi-adiabatic ternary content addressable memory (QATCAM). *IET Circuits, Devices & Systems, 14*(7), 923–928.
- <span id="page-10-10"></span>7. Quashef, M. A. Z., & Alam, M. K. (2022). Ultracompact photonic integrated content addressable memory using phase change materials. *Optical and Quantum Electronics, 54*(3), 1–19.
- <span id="page-10-16"></span>8. Shi, Z., Yang, H., Li, J., Li, C., Li, T., & Wang, B. (2020). MsBV: A memory compression scheme for bit-vector-based classification lookup tables. *IEEE Access, 8*, 38673–38681.
- <span id="page-10-5"></span>9. Goswami, M., Pal, J., Tanwar, R., & Rawat, P. (2022). A modular approach to design ternary content addressable memory architecture in quantum dot cellular automata. *International Journal of Information Technology, 14*(1), 41–47.
- <span id="page-10-11"></span>10. Irfan, M., Sanka, A. I., Ullah, Z., & Cheung, R. C. C. (2022). Reconfigurable content-addressable memory (CAM) on FPGAs: A tutorial and survey. *Future Generation Computer Systems, 128*(3), 451–465.
- <span id="page-10-6"></span>11. Mythil, R., & Kalpana, P. (2020). Comparative analysis of parameter extractor for low-power precomputation based content addressable memory. *Wireless Personal Communications, 111*(2), 1313–1326.
- <span id="page-10-17"></span>12. Pathak, N., Bhoi, B. K., Misra, N. K., & Kumar, S. (2022). Content addressable memory design in 3D pNML for energy-aware sustainable computing. *Journal of Circuits, Systems and Computers, 131*(2), 1328–1786.
- <span id="page-10-12"></span>13. Dhanalakshmi, G., Sundarambal, M., & Muralidharan, K. (2020). Diminution of power in load/store queue for CAM and SRAM-based out-of-order processors. *International Journal of Advanced Intelligence Paradigms, 15*, 89–97.
- <span id="page-10-7"></span>14. Gnawali, K. P. (2022). High-speed memristive ternary content addressable memory. *IEEE Transactions on Emerging Topics in Computing, 10*(3), 1349–1360.
- <span id="page-10-2"></span>15. Hussain, S. W., Mahendra, T. V., Mishra, S., & Dandapat, A. (2020). Low-power content addressable memory design using two-layer P-N match-line control and sensing. *Integration, 75*(11), 73–84.
- <span id="page-10-18"></span>16. Shylashree, N., Vahvale, Y. D., Praveena, N., & Mamatha, A. S. (2021). Design and implementation of 64-bit SRAM and CAM on cadence and open-source environment. *International Journal of Circuits, Systems and Signal Processing, 15*(7), 586–594.
- <span id="page-10-1"></span>17. Gogoi, S., & Peesapati, R. (2022). Design and implementation of gray-coded bitplane based reconfigurable motion estimation architecture using binary content addressable memory for video encoder. *IEEE Transactions on Consumer Electronics, 68*(1), 85–92.
- <span id="page-10-13"></span>18. Bagheri, T., Heikalabad, S. R., & Jabbehdari, S. (2022). Designing a content-addressable memory cell using multiplexer in quantum-dot cellular automata. *ECS Journal of Solid-State Science and Technology, 11*(5), 3007–3016.
- <span id="page-10-14"></span>19. Fan, X., Niklas, M., & Tobias, G. (2021). Compiling all-digital embedded content addressable memories on chip for edge application. *IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 123*(8), 67–75.
- <span id="page-10-8"></span>20. Yatheesh, K. C., & Shanti Prasad, M. J. (2021). An innovative design of spin transfer torque based ternary content addressable memory with match line sense amplifier. *Analog Integrated Circuits and Signal Processing, 107*(6), 637–647.
- <span id="page-10-21"></span>21. Chang, Y.-J., Tsai, K.-L., Cheng, Y.-C., & Lu, M.-R. (2020). Low-power ternary content-addressable memory design based on a voltage self-controlled fin field-effect transistor segment. *Computers and Electrical Engineering, 81*(1), 1–11.
- <span id="page-10-20"></span>22. Hussain, S. W., Mahendra, T. V., Mishra, S., & Dandapat, A. (2023). SMS-CAM: Shared matchline scheme for content addressable memory. *Integration, 88*, 70–79.
- <span id="page-10-19"></span>23. Hussain, S. W., Mahendra, T. V., Mishra, S., & Dandapat, A. (2021). Match-line control unit for power and delay reduction in hybrid CAM. *IET Circuits, Devices & Systems, 15*(3), 272–283.
- 24. Mahendra, V., Hussain, S. W., Mishra, S., & Dandapat, A. (2020). A novel low-power matchline evaluation technique for content addressable memory (CAM). *Institute of Information Science Journal of Information Science and Engineering, 36*(5), 1035–1053.
- <span id="page-10-9"></span>25. Mahendra, T. V., Hussain, S. W., Mishra, S., & Dandapat, A. (2020). Energy-efficient precharge-free ternary content addressable memory (TCAM) for high search rate applications. *IEEE Transactions on Circuits and Systems I: Regular Papers, 67*(7), 2345–2357.
- <span id="page-10-23"></span>26. Pagiamtzis, K., & Sheikholeslami, A. (2006). Content-addressable memory (CAM) circuits and architectures: A tutorial and survey. *IEEE Journal of Solid-State Circuits, 41*(3), 712–727.

**Publisher's Note** Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.



**K. Muralidharan** received his Bachelor's Degree in Electronics and Communication Engineering from Anna University (BIT campus), Tiruchirappalli (India) in 2013 and his Master's Degree in Applied Electronics from Coimbatore Institute of Technology, Coimbatore (India) in 2015. He is currently working as an Assistant Professor with the Department of Electronics and Communication Engineering at Coimbatore Institute of Technology, Coimbatore (India). He has 9 years of teaching experience. He is currently pursuing his Ph.D. under Anna University, Chennai in the area of Low Power VLSI Design. He has published 20 papers in International Journals, National Conferences and International Conferences. He has guided both UG and PG projects. His areas of interest include Electronic circuit design, VLSI design, Advanced digital signal processing and Digital image processing.



**S. Uma Maheswari** obtained her doctoral degree in Electrical and Electronics Engineering, with specialization in Biometrics, from Bharathiar University, Coimbatore. She obtained her Master's Degree in Applied Electronics from Coimbatore Institute of Technology in March 1991 and her Bachelor's Degree in Electronics and Communication Engineering from Government College of Technology in 1985. She is presently working as a Professor in the Department of Electronics and Communication Engineering, Coimbatore Institute of Technology, Coimbatore, India. She has 37 years of teaching experience. Her research interests are VLSI Design, Digital Image Processing and Digital Signal Processing. She has published 85 papers in International Journals, National Conferences and International Conferences. She also guides UG and PG projects and research scholars in different disciplines.



**T. Balakumaran** received his Bachelor's Degree in Electronics and Communication Engineering under Bharathiyar University, Coimbatore in 2003, his Master's Degree in Applied Electronics under Anna University, Chennai in 2005 and his Doctorate Degree under Anna University, Chennai in 2013. His areas of interest include Antenna design and Image Processing. He has about 19 years of teaching experience. He has published 30 papers in International journals and conferences. He is a life member of the Institution of Engineers (IE(I)) and the Indian Society of Technical Engineering (ISTE).