Abstract
An idea to use a magnetic nano-dots array for a reservoir computing is introduced. The mechanism of how the nonlinear calculation is carried out in the magnetic system is explained by showing the simplest case with three nano-dots system. The first trial to prove calculation ability and fabrication ability of the system is demonstrated. Since the proposed reservoir computing system may utilize integration technology of the magnetic random access memory (MRAM), it possesses a possibility to realize a large-scale reservoir computing system.
Access provided by Autonomous University of Puebla. Download chapter PDF
Similar content being viewed by others
Keywords
1 Spin-Glass Model and Spin-Glass Reservoir Computing
The Hopfield model (Hopfield 1982) and the Boltzmann machine (Ackley et al. 1985) are well-known mathematical models of recurrent neural networks (RNN). They are based on a physical model of magnetic material with randomness (Amit and Gutfreund 1985) called a spin glass. Classical view of the spin is a rotation of the electron itself that causes magnetic moment of the atoms. In a spin glass, each atom acts as a magnet and interacts with each other via a quantum mechanical exchange interaction, J. In terms of neural concepts, the direction of magnetic dipole moment of an atom can be considered as the electrochemical potential of a neuron and the interaction J as the synaptic weight. Thus, magnetic materials can be treated as models of RNN and sophisticated techniques from statistical physics can be applied to analyze RNNs. In particular, the “infinite range model” (Fig. 1), where an infinite number of atoms (neurons/nodes) with all inter-atomic exchange interactions (inter-neuron/inter-node connections), is considered, which simplifies the problem tremendously because the mean-field approximation from statistical physics can provide an exact solution for the model. In addition, the Ising spin model, in which the direction of the magnetic moment is constrained along +z- or –z-directions, can also simplify the problem without losing its essential calculation ability. These generalizations and simplifications of the spin-glass model resulted in the Hopfield model and Boltzmann machine.
Therefore, one may expect that magnetic materials can serve as good natural systems to realize RNNs. However, limited researches have been done to produce RNNs using the spin glasses. This is because individual atomic magnets cannot be easily controlled. On the other hand, recent developments in the field of electronics have made it possible to integrate large numbers of nano-sized magnets (hereafter called “nanomagnets”) to construct a solid-state magnetic random access memory (MRAM) (Bhatti et al. 2017) (Fig. 2a). In MRAM, one may read out each bit information, which is stored as the direction of the magnetic moment in a cell, using the magnetoresistive effect (Yuasa et al. 2004; Parkin et al. 2004). The cell is called as a magnetic tunnel junction (MTJ), in which the resistance is dependent on the relative angle between two magnetic dipole moments in the junction, i.e., of free layer and reference layer. One may also write-in the information by applying either a current (Myers et al. 1999) or a voltage (Maruyama et al. 2009). To construct a stable memory, the interaction between the nanomagnets is removed in the MRAM. In contrast, here, we intentionally use the interaction between the nanomagnets to allow the MRAM to work as a spin-glass system and perform as an RNN. Since nanomagnets are electronically separated in the MRAM, the exchange interaction between the cells does not exist. However, magnetic dipole interaction exists that allows nanomagnets to perform linear/nonlinear calculations, as it will be explained later in this chapter (Fig. 2b). The strengths of the magnetic dipole interactions are determined from the size of magnetic dipole moments and distance between the magnets. Therefore, they cannot be modified after the device has been fabricated. This means that the system can be considered as an RNN with fixed synaptic weights. As a consequence, the framework of reservoir computing (Jaeger and Haas 2004) is needed to perform machine learning.
As magnetic systems have rich dynamics, usage of dynamic states can increase the performance of magnetic reservoirs. For example, spin-torque oscillators offer a relatively high performance when used as a reservoir working at high frequency (see the chapters by Grollier and Tsunegi). In this chapter, we show that static magnetic systems have also shown relatively high performance by increasing the number of nanomagnets. In particular, we show that the voltage control of magnetic anisotropy (VCMA) offers a way to realize a large-scale, high-performance, and energy-efficient magnetic RNN.
2 Historical Design of the Magnetic Boolean Calculator
A Boolean operation element using dipole-coupled nanomagnets is called a nanomagnet logic (NML) (Cowburn and Welland 2000). NML comprises nanomagnets with single magnetic domain states. The binary state is defined by the polarity of the magnetic moment of the nanomagnet that is similar to MRAM. The nanomagnets communicate each other via dipole interaction. The magnetic moment of a nanomagnet aligns parallel to the stray field from surrounding nanomagnets. Since the direction of the stray field is determined by the major polarity of the surrounding magnets, the NML gate is basically a majority gate. A transmission line (Cowburn and Welland 2000), a NAND/NOR gate (Imre et al. 2006; Hikaru and Ryoichi 2011), a shift register (Hikaru et al. 2017), etc. have been previously realized (Orlov et al. 2008). Figure 3a, b shows a scanning electron microscope (SEM) image of the NML-NAND/NOR logic gate (Hikaru and Ryoichi 2011) and magnetic force microscope images of a typical operation results, respectively. Until now, NMLs have been designed to perform only Boolean operations. However, if the analog value of the magnetic moment vector is used, the NML becomes an attractive candidate for a physical reservoir.
3 Linear and Nonlinear Calculation Using Nanomagnets
The energy (Hamiltonian: H) of a single nanomagnet with uniaxial anisotropy under an external magnetic field is written as follows:
where \( \vec{M} \) is the magnetic moment vector, \( \hat{K} \) is the anisotropy tensor, \( K_{u} \) > 0 is the uniaxial anisotropy energy par unit volume, V is the volume of the nanomagnet, \( \mu_{0} \) is the magnetic susceptibility of vacuum, and \( \vec{H}_{\text{ext}} \) is the external magnetic field vector. If we can neglect formation of a magnetic domain inside the nanomagnet, the size of the magnetic moment, \( \left| {\vec{M}} \right| \), is the constant whereas it may change the direction. Here, we assumed a disk-like nanomagnet, where the z-axis is perpendicular to the plane of the disk (Fig. 4a). Note that \( \vec{M} \) is stable if its direction minimizes H.
In Fig. 4b, the magnetic hysteresis curves for an external field parallel to \( {\vec{\mathbf{e}}}_{x} + {\vec{\mathbf{e}}}_{z} \) (45° from the normal line of the disk) are shown. The solid and dotted curves show the x- and z-components of \( \vec{M} \), respectively. When there is no external field, because of the uniaxial magnetic anisotropy, there are two equilibrium points, which are indicated as (i) and (i’) in Fig. 4b, i.e., \( \vec{M} = \pm M{\vec{\mathbf{e}}}_{z} \), where \( {\vec{\mathbf{e}}}_{z} \) is the unit vector in the z-direction. Now, we trace the hysteresis starting from a state with \( \vec{M} = - M{\vec{\mathbf{e}}}_{z} \). It means that the x- and z-components of \( \vec{M} \) are 0 and \( - M \), respectively (Fig. 4b (i)). When the external magnetic field increases, \( \vec{M} \) begins to tilt toward the x-direction. Therefore, both Mx and Mz increase (Fig. 4b (ii)). Here, although the energy at point (ii’) is smaller than that at point (ii), the state remains at point (ii), which corresponds to a local minimum in the Hamiltonian. When the magnetic field reaches the coercive force, Hc, the saddle point that separates the two minima disappears. Then, the state does not stay at (iii) but jumps to (iii’). As a consequence, Mz becomes positive and a small jump occurs for Mx. Although a small jump exists, we can assume that Mx is a sigmoid-like function of Hext. In an array of nanomagnets, the magnetic field exerted on a nanomagnet is the sum of all dipole fields made by the other nanomagnets. Therefore, the nanomagnet calculates a sigmoid-like function of the sum of the dipole fields. Hence, the nanomagnet has a nonlinear calculation power. In contrast to Mx, Mz shows a large hysteresis. The jump from point (iii) to point (iii’) corresponds to the firing of a neuron that has a certain threshold. After firing, the nanomagnet stores the information about the sign of the previous external magnetic field until another large external field is applied. Therefore, a nanomagnet has a memory function. In Fig. 4c, the effect of voltage application on the hysteresis curve is shown. The application of a voltage reduces the magnetic anisotropy and correspondingly reduces the coercive force, Hc. As a consequence, state at point (ii) may jump to point (ii’). By using these phenomena, the information about the sum of the dipole fields can be written to the nanomagnet.
The Hamiltonian of an array of nanomagnets is written as follows:
where
Here, \( \vec{M}_{i} ,\,\hat{K}_{i} ,\,\,K_{u,i} ,\,{\text{and }}\,V_{i} \) are the magnetic moment vector, anisotropy tensor, uniaxial anisotropy energy par unit volume, and volume of the nanomagnet at site i, respectively. \( \hat{D}_{dipole,ij} ,{\text{ and }}\vec{r}_{ij} \) are the dipole matrix, and relative position vector at site i with respect to a nanomagnet at site j, respectively.
By observing the first line in Eq. (2), one may find that the Hamiltonian is similar to that of the spin glass although it is a classical Hamiltonian with dipole interaction.
From Eq. (2), one can obtain effective magnetic field, \( \vec{H}_{effct,k} \), exerted on the kth nanomagnet as follows:
The effective field is a sum of anisotropy field, dipole field, and external field. The kth magnetic moment is stable if it is parallel to the effective field at the site. Therefore, it may calculate a sigmoid-like function of the sum of those fields.
A schematic diagram of a dipole magnetic field around a nanomagnet is shown in Fig. 5a. Because of the field, two nanomagnets with perpendicular magnetic moments couple antiparallel with each other when the dipole field is smaller than the anisotropy field. Under large dipole fields, the magnetic moments turn to in-plane and align parallel to the line that connects the two nanomagnets. If there are three nanomagnets on a plane, e.g., at the vertices of an equilateral triangle (see Fig. 5b), “frustration” happens. Then, for a large dipole field, all the magnetic moments turn into the plane and align circularly; this is called the vortex state.
Now, we discuss a realistic case that employs a system with three nanomagnets (see Fig. 5c, d). Three nanomagnets are placed on a plane and arranged at the vertices of an isosceles right triangle. A small magnetic field toward +x-direction is applied to break the time-reversal symmetry (see Fig. 5). For this model, we can find a stable set of \( \left\{ {\vec{M}_{i} } \right\} \) by finding the minima in Harray. Figure 5c, d shows two stable alignments of the magnetic moments. The z and in-plane components of the magnetic moments are also shown. Here, as for an example, we employ nanomagnets with radius, thickness, and saturation magnetization as 20 nm, 2 nm, and 1.3 MA/m, respectively. Magnetic anisotropy field for perpendicular magnetization without voltage application is about 3.1 mT (2.5 kA/m). Distance between magnet 1 and 2 is 50 \( \times \sqrt 2 \) nm. In Fig. 5c, the magnetic moments of both magnets 1 and 2 point toward the +z-direction, which corresponds to the “1” state. As a consequence, the magnetic moment of magnet 0 points toward the –z-direction (“0” state) because of dipole coupling. The state is obtained using the following protocol. First, the magnetic moments of magnets 0, 1, and 2 are set to point toward the +x-, +z-, and +z-directions, respectively. Then, the system is relaxed to find a minimum in Harray near the initial state. After relaxation, a voltage is applied to reduce the perpendicular anisotropy of magnet 0. The voltage is increased slowly compared with the relaxation time to keep the system at the energy minimum. After reaching zero anisotropy, the voltage is slowly removed until it becomes zero. Using this protocol, the information about the dipole field at the position of magnet 0 is written to magnet 0. The alignment shown in Fig. 5c is natural because the distances between magnets 0 and 1 and magnets 0 and 2 are smaller than that of magnets 1 and 2. Figure 5d shows a case where the magnetic moments of magnets 1 and 2 are almost antiparallel. For this case, the frustration of the system makes the perpendicular magnetic moments unstable and all the moments tilt to make a vortex-like in-plane component. The magnetic moment of magnet 0 prefers the +z-direction as a consequence of the delicate balance of the two dipole couplings and the external field.
Without an external magnetic field, the time-reversed states are also stable and degenerate in energy. Note that, by time reversal, all the magnetic moments are reversed. With a small external magnetic field, the time-reversal symmetry is broken, and degeneracy of the time-reversed states is released. As a consequence, the time-reversed state of the antiparallel alignment of magnets 1 and 2 (Fig. 5d) is not stable. Instead, a state with reversed perpendicular moments and non-reversed in-plane moments with respect to the state shown in Fig. 5d becomes stable. The configurations of the magnetic moments of all the states are listed in Table 1. The 0 and 1 in the first and second columns represent the magnetic moments pointing toward the z- and –z-directions, respectively. The third and fourth columns show the normalized values of the z- and x-components of the magnetic moment of magnet 0, respectively. By using those four values, results for the AND, OR, and XOR operations can be obtained using only linear calculus as shown in the fifth, sixth, and seventh columns of Table 1, respectively. This means that the system with three nanomagnets has linear/nonlinear calculation ability and can be used as a reservoir for reservoir computation. Note that such property based on the time-reversal breaking can be obtained without an external field if a system with four nanomagnets is employed. The breaking of the time-reversal symmetry can also be incorporated by taking into account magnetization dynamics with energy dissipation.
4 First Trial of the Dipole-Coupled Nanomagnet Reservoir
As for a first trial, an array system with 8 × 8 × 2 (=128) nanomagnets was tested using a micromagnetic simulation. Figure 6a, b shows the schematic diagram of reservoir computing with dipole-coupled nanomagnet reservoir and the schematic diagram of the top view of the reservoir with perpendicular magnetic anisotropy film, respectively. The radius and thickness of the nanomagnets were 20 nm and 0.3 nm, respectively. The gap between the nearest neighbor dots was 10 nm. The z-components of the normalized static magnetic moments of the nanomagnets were used as the state of the nodes denoted by mz. The nodes (magnetic moments of the nanomagnets) are connected via magnetic dipole interactions. As mentioned previously, the static directions of the magnetic moments of the nanomagnets were determined from the previous direction of the magnetic moments and the dipole interaction between the entire nanomagnets in the reservoir. Therefore, the reservoir can be considered as a recurrent neural network. Some nodes were connected to the input layer u in the computer, and every nodes were connected to the output matrix w made in the computer. A dot product of the node state and the output matrix was stored in the output layer as the output of reservoir computing (o = u ∙w). Binary and analog data were stored in the input layer and the output matrix, respectively.
First, when new data were set in the input layer, they were written to the input nodes (stage 1). To evaluate the system, as will be explained later, we randomly chose 48(=8 × 6) nodes from the 64 nodes of Group I (Fig. 6b) to be the input nodes and connected them to the input layer. The data in the input layer corresponding to the directions of magnetic moment were written using a writing method for MRAM. When the value in the input layer is 0/1, the direction of the magnetic moment of the input node is set to point toward the \( {{ - z} \mathord{\left/ {\vphantom {{ - z} { + z}}} \right. \kern-0pt} { + z}} \)-direction.
Next, the state of the node was updated. To update the node state, we changed the magnetic anisotropy of the nanomagnet using the VCMA method. After writing the data to the input nodes, a bias voltage was applied to the Group II nanomagnets (stage 2) and then removed (stage 3). By applying an appropriate bias voltage to the nanomagnets, the magnetic anisotropy of the nanomagnets disappeared. With this change of magnetic anisotropy, the magnetic moments of the nanomagnets transitioned to the next state. During this transition, linear/nonlinear operations occurred as the directions of the magnetic moments of the nanomagnets changed.
The node state in stage 3 was connected to the output matrix. The node state (i.e., the z-components of the magnetic moments of the nanomagnets) was measured using a reading method for MRAM (e.g., a reading method using tunneling magnetoresistive effect). A dot product of the node state and the output matrix was stored in the output layer using an external hardware. Then, a bias voltage was applied to the Group I nanomagnets (stage 4) and then removed (stage 5). The state of the node was updated by repeating the processes from stage 1 to stage 5.
In order to evaluate the behavior of the magnetic reservoir, we performed micromagnetic simulations. We assumed that the nanomagnets take single-domain states and we used a single cell for a single nanomagnet in the simulations. The static direction of the magnetic moment was calculated by solving the Landau–Lifshitz–Gilbert (LLG) equation using the fourth-order Runge–Kutta method. The LLG equation at 0 K is as follows:
α, γLL, and Ms are the damping constant, gyromagnetic ratio in the Landau–Lifshitz form, and saturation magnetization of the nanomagnets, respectively. The simulation parameters used were as follows: α = 0.5, \( \gamma_{\text{LL}} = {{2.211} \mathord{\left/ {\vphantom {{2.211} {\left( {1 + \alpha^{2} } \right)}}} \right. \kern-0pt} {\left( {1 + \alpha^{2} } \right)}} = 1.7688\;{{\text{m}} \mathord{\left/ {\vphantom {{\text{m}} {\text{As}}}} \right. \kern-0pt} {\text{As}}} \), and \( \left| {\vec{M}_{i} } \right|/V_{i} = 1.3\,\,{\text{MA/m}} \). Here, we employed large damping constant to shorten calculation time for the simulation. The large value of α employed here does not affect the following results since we only use static states for the RC. The typical value of the α is from 0.001 to 0.1 for metals.
We performed the NARMA10 task (Atiya and Parlos 2000) to evaluate the performance of the reservoir, which was previously to evaluate the performance of reservoirs (Appeltant et al. 2011; Nakajima et al. 2015). In the NARMA10 task, the output of step ys+1 is obtained from the previous inputs us–k and outputs ys–k by the following equation:
For the input \( u_{s} \), random values ranging from 0 to 0.5 were used. The input data were normalized to an 8-bit unsigned integer before storing in the input layer. In each step, the 8-bit data was written to the 48 input nodes in the reservoir. This means that each data bit was simultaneously written to six different nodes. Since only the current input was written in a single step, the reservoir is requested to have up to 10 short memory to fulfill the NARMA10 task.
We trained the output matrix using 2408 steps of training data with the least-squares method. The performance of reservoir computing was evaluated using NRMSE (normalized root-mean-square error) expressed by the following equation:
where \( N_{\text{test}} \) is the number of test data, \( o_{\text{s}} \) and \( y_{s} \) are the output data and answer at step \( s \), respectively, and \( \sigma^{2} \left( {y_{k} } \right) \) is the variance of the answer.
Figure 7 shows a typical output result of reservoir computing with dipole-coupled nanomagnets. The solid line shows the output data of reservoir computing with trained matrix and the dotted line shows the answer data with the NARMA10 function. The reservoir with the nanomagnet radius of 40 nm, the thickness of 10 nm, the nearest inter-dot distance of 10 nm, \( K_{\text{u}} \) of 0 or 20 kJ/m3, was used in Fig. 7. The output data of the trained reservoir computing show similar values as compared to the answer data. We calculated the NRMSE of the reservoir with 602 steps of the input data. The results show that the NRMSE between the output and answer data is 0.85, which is not low enough compared with those obtained by other methods. This is because the dipole-coupled nanomagnet reservoir used for the first trial (Fig. 6b) cannot hold sufficient old input values for the NARMA10 task. By changing the position of the node connected to the input layer and the updating procedure of the node state, the short-term memory capacity (Nomura et al. 2018) can be achieved with an NRMSE of 0.23 in the NARMA10 task with dipole-coupled nanomagnet reservoir (Zhu et al. 2018).
Figure 8 shows the SEM image of the prototype reservoir with dipole-coupled nanomagnets. The prototype reservoir comprises \( 8 \times 8 \) nanomagnets with 2 \( \times \) 2 contact holes. The distance between the nanomagnet of the prototypic reservoir was 12 nm. The micromagnetic simulations confirm that this structure can be used as a dipole-coupled nanomagnet reservoir.
5 Guiding Principle for the Future
We showed that the static magnetic moment vector in a dipole-coupled nanomagnet array behaves as a reservoir. The randomness of the inter-layer/inter-node coupling may enhance the performance of the magnetic reservoir. In the first trial, we randomly connected the input layer and the nodes in the reservoir. However, inter-node randomness was not utilized. Such randomness can be implemented by randomly changing the size and arrangement of the nanomagnets. Depending on the arrangement of the nanomagnets, the dipole-coupled nanomagnet array can behave as a spin glass or spin ice (Jensen et al. 2018). These arrangements can be used as a reservoir.
As described above, the MRAM reading/writing technology can be used to read the node state. Moreover, the inter-node connection using magnetic dipole coupling solves the wiring problem in realizing the hardware for RNN. The dipole-coupled nanomagnet reservoir, which is capable of creating interfaces with semiconductor technology by various existing technologies, is a candidate for practical use of physical reservoir computing. With the MRAM technology, one may fabricate giga-nodes class reservoir in principle. The guiding principle, however, to obtain better performance in a very large reservoir has not yet established. Nonetheless, explorations on the human brain class (21 giga-neurons) reservoir computing are not a dream.
References
D.H. Ackley, G.E. Hinton, T.J. Sejnowski, A learning algorithm for boltzmann machines. Cogn. Sci. 9(1), 147–169 (1985)
D.J. Amit, H. Gutfreund, Spin-glass models of neural networks. Phys. Rev. A 32(2), 1007–1018 (1985)
L. Appeltant et al., Information processing using a single dynamical node as complex system. Nat. Commun. 2, 468 (2011)
A.F. Atiya, A.G. Parlos, New results on recurrent network training: unifying the algorithms and accelerating convergence. IEEE Trans. Neural Netw. 11(3), 697–709 (2000)
S. Bhatti et al., Spintronics based random access memory: a review. Mater. Today 20(9), 530–548 (2017)
R.P. Cowburn, M.E. Welland, Room temperature magnetic quantum cellular automata. Science 287(5457), 1466–1468 (2000)
N. Hikaru, N. Ryoichi, NAND/NOR logical operation of a magnetic logic gate with canted clock-field. Appl. Phys. Express 4(1), (2011)
N. Hikaru et al., Controlling operation timing and data flow direction between nanomagnet logic elements with spatially uniform clock fields. Appl. Phys. Express 10(12), (2017)
J.J. Hopfield, Neural networks and physical systems with emergent collective computational abilities. Proc. Natl. Acad. Sci. USA-Biol. Sci. 79(8), 2554–2558 (1982)
A. Imre et al., Majority logic gate for magnetic quantum-dot cellular automata. Science 311(5758), 205–208 (2006)
H. Jaeger, H. Haas, Harnessing nonlinearity: predicting chaotic systems and saving energy in wireless communication. Science 304(5667), 78–80 (2004)
J.H. Jensen, E. Folven, G. Tufte, Computation in artificial spin ice, in The 2018 Conference on Artificial Life: A Hybrid of the European Conference on Artificial Life (ECAL) and the International Conference on the Synthesis and Simulation of Living Systems (ALIFE) (2018), pp. 15–22
T. Maruyama et al., Large voltage-induced magnetic anisotropy change in a few atomic layers of iron. Nat. Nanotechnol. 4(3), 158–161 (2009)
E.B. Myers et al., Current-induced switching of domains in magnetic multilayer devices. Science 285(5429), 867–870 (1999)
K. Nakajima et al., Information processing via physical soft body. Sci. Rep. 5, 10487 (2015)
H. Nomura et al., Reservoir computing with dipole-coupled nanomagnets (2018), arXiv:1810.13140
A. Orlov et al., Magnetic quantum-dot cellular automata: recent developments and prospects. J. Nanoelectron. Optoelectron. 3, 1–14 (2008)
S.S.P. Parkin et al., Giant tunnelling magnetoresistance at room temperature with MgO (100) tunnel barriers. Nat. Mater. 3(12), 862–867 (2004)
S. Yuasa et al., Giant room-temperature magnetoresistance in single-crystal Fe/MgO/Fe magnetic tunnel junctions. Nat. Mater. 3(12), 868–871 (2004)
L. Zhu et al., Remarkable problem-solving ability of unicellular amoeboid organism and its mechanism. R. Soc. Open Sci. 5(12) (2018)
Acknowledgments
This work was supported by the Ministry of Internal Affairs and Communications, JAPAN.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Nomura, H., Kubota, H., Suzuki, Y. (2021). Reservoir Computing with Dipole-Coupled Nanomagnets. In: Nakajima, K., Fischer, I. (eds) Reservoir Computing. Natural Computing Series. Springer, Singapore. https://doi.org/10.1007/978-981-13-1687-6_15
Download citation
DOI: https://doi.org/10.1007/978-981-13-1687-6_15
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1686-9
Online ISBN: 978-981-13-1687-6
eBook Packages: Computer ScienceComputer Science (R0)