1 Introduction

Programmable matter refers to a substance that has the ability to change its physical properties (shape, density, moduli, conductivity, optical properties, etc.) in a programmable fashion, based upon user input or autonomous sensing. The potential applications are endless, e.g., smart materials, autonomous monitoring and repair, or minimally invasive surgery. Thus, there is a high relevance of this topic to industry and society in general, and much research has been invested in the past decade to fabricate programmable matter. However, fabrication is only part of the story: without a proper understanding of how to program that matter, complex tasks such as minimally invasive surgery will be out of reach. (Fekete et al. 2015b)

Since the first visions of massive sensor swarms, more than ten years of work on sensor networks have yielded considerable progress with respect to hardware miniaturization. The original visions of “Smart Paint” (Abelson et al. 2000) or “Smart Dust” (Kahn et al. 2000) have triggered a considerable amount of theoretical research on swarms of stationary processors, e.g., the work in Fekete and Kröller (2006, 2007), Fekete et al. (2004), Kröller et al. (2006). Recent developments in the ability to design, produce, and control particles at the micro and nanoscale and the rise of possible applications, e.g., targeted drug delivery, micro and nanoscale construction, and Lab-on-a-Chip, motivate the study of large swarms of mobile objects. But how can we control such a swarm with only limited computational power and a lack of individual control by a central authority? Local, robotics-style motion control by the particles themselves appears hopeless because the capacity for storing energy for computation, communication, and motion control is proportional to the volume, but volume shrinks with the third power of particle length.

A possible answer lies in applying a global, external force to all particles in the swarm. This resembles the logic puzzle Tilt (http://www.thinkfun.com/tilt), slide and merge games such as the 2048 puzzle (Abdelkader et al. 2016), and dexterity ball-in-a-maze puzzles such as Pigs in Clover and Labyrinth, which involve tilting a board to cause all mobile pieces to roll or slide in a desired direction. Problems of this type are also similar to sliding-block puzzles with fixed obstacles (Demaine et al. 2000; Hoffmann 2000; Holzer and Schwoon 2004; Hearn and Demaine 2005), except that all particles receive the same control inputs, as in the Tilt puzzle. In the real world, driving ferromagnetic particles with a magnetic resonance imaging (MRI) scanner gives a milli-scale example of this challenge (Vartholomeos et al. 2012). At the micro-scale, Becker et al. (2013b) demonstrate how to apply a magnetic field to simultaneously move cells containing iron particles in a specific direction within a fabricated workspace (see Fig. 1a). Other recent examples include using the global magnetic field from an MRI to guide magneto-tactic bacteria through a vascular network to deliver payloads at specific locations (Chanu et al. 2008) and using electromagnets to steer a magneto-tactic bacterium through a micro-fabricated maze (Khalil et al. 2013); however, this still involves only individual particles at a time, not the parallel motion of a whole, massive swarm. How can we manipulate the overall swarm with coarse global control, such that individual particles arrive at multiple different destinations in a (known) complex vascular network such as the one in Fig. 1(b)? And how can we use the complex interaction of the particles to carry out complex computations within the swarm?

Fig. 1

(Top) State of the art in controlling small objects by force fields. (Bottom) A complex vascular network, forming a typical environment for the parallel navigation of small objects. This paper investigates parallel navigation in discretized 2D environments. a (Left) After feeding iron particles to ciliate eukaryon (Tetrahymena pyriformis) and magnetizing the particles with a permanent magnet, the cells can be turned by changing the orientation of an external magnetic field (see colored paths in the center image). (Right) Using two orthogonal Helmholtz electromagnets, Becker et al. (2013b) demonstrated steering many living magnetized T. pyriformis cells. All cells are steered by the same global field. b Biological vascular network (cottonwood leaf). (Photo: http://www.tssphoto.com/index.php?p=980; Royce Bair/Flickr/Getty Images.) Given such a network along with initial and goal positions of N particles, is it possible to bring each particle to its goal position using a global control signal? Note that this arrangement is not a tree, but a graph structure with many cycles. matlab code for driving N particles through this network is available at http://www.mathworks.com/matlabcentral/fileexchange/42892

All this gives rise to the following two families of problems, which we denote by External Computation and Internal Computation.

External computation

Considering the particle swarm as input for a given algorithmic problem, we are faced with a number of questions that need to be resolved externally, such as the following.

1. Given a map of an environment, such as the vascular network shown in Fig. 1, along with initial and goal positions for each particle, does there exist a sequence of inputs that will bring each particle to its goal position?

2. Given a map of an environment, such as the vascular network shown in Fig. 1, along with initial and goal positions for each particle, what is the shortest sequence of moves that will bring each particle to its goal position?

3. Given initial and goal positions for each particle in a swarm, how can we design a set of obstacles and a sequence of moves, such that each particle reaches its goal position?

Deliberate use of existing stationary obstacles leads to a wide range of possible particle configurations. In the first part of the paper (Sects. 4, 5), we give answers to the first two questions by showing that they may lead to computationally difficult situations. We also develop several positive results for the third question (again in Sect. 5). The underlying idea is to construct artificial obstacles (such as walls) that allow arbitrary rearrangements of a given two-dimensional particle swarm. For clearer notation, we will formulate the relevant statements in the language of matrix operations, which is easily translated into plain geometric language. This paper investigates these problems in 2D discretized environments, leaving 3D and continuous environments for future work.

Internal computation

Considering the particle swarm as a complex system that can be reconfigured in various ways, we are faced with issues of the computational power of the swarm itself, such as the following.

1. Can the complexity of particle interaction be exploited to model logical operations?

2. Are there limits to the computational power of the particle swarm?

3. How can we achieve computational universality with particle computation?

In the second part of the paper (Sect. 6), we give precise answers to all of these questions. In particular, we show that the logical operations and, nand, nor, and or can be implemented in our model using dual-rail logic. Using terminology from electrical engineering, we call these components that calculate logical operations gates. We establish a fundamental limitation for particle interactions: we cannot duplicate the output of a chain of gates without also duplicating the chain of gates. This means a fan-out gate cannot be generated. We resolve this missing component with the help of 2 × 1 particles, which can be used to create fan-out gates that produce multiple copies of the inputs without needing duplicate gates. Using fan-out gates, we provide rules for replicating arbitrary digital circuits.

In the following, we start with a brief formal problem definition (Sect. 2) and a discussion of related work (Sect. 3), then continue to provide the results on External Computation (Sects. 4, 5) and Internal Computation (Sects. 6, 7). We conclude with future work (Sect. 8).

2 Problem definition

2.1 Model

Our model is based on the following rules:

1. A planar grid workspace W contains a number of unit-size particles and some fixed unit-square obstacles. A grid cell is referenced by its Cartesian coordinates \({\mathbf {x}}=(x,y)\), and is either free for possible occupation by a particle, or a permanent obstacle, which may never be occupied by a particle. Each particle occupies one grid cell.

2. All particles are commanded in unison: the valid commands are “Go Up” (u), “Go Right” (r), “Go Down” (d), and “Go Left” (\(\ell\)).

3. The particles all move in the commanded direction until forward progress is blocked by a stationary obstacle or another blocked particle. A command sequence \({\mathbf {m}}\) consists of an ordered sequence of moves \(m_k\), where each \(m_k\in \{u,d,r,\ell \}\). A representative command sequence is \(\langle u,r,d,\ell ,d,r,u,\ldots \rangle\). We assume that W is bounded by obstacles and issue each command long enough for the particles to move to their maximum extent.

The algorithmic decision problem globalcontrol-manyparticles is to decide whether a given puzzle is solvable: given a fixed workspace and a start and goal location for each particle, does there exist a sequence of moves that brings every particle to its goal location? See Fig. 2 for two simple instances.
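To make the move model concrete, the following minimal Python sketch simulates one global command under these rules (the data structures and function names are illustrative assumptions, not code from the paper): every particle slides in the commanded direction until it is stopped by an obstacle or by an already-stopped particle.

```python
# Sketch of the move model of Sect. 2.1 (illustrative only).
# Particles and obstacles are sets of (x, y) grid cells; the workspace is
# assumed to be bounded by obstacles, so every slide terminates.

DIRS = {"u": (0, 1), "d": (0, -1), "r": (1, 0), "l": (-1, 0)}

def apply_move(particles, obstacles, cmd):
    """Apply one global command and return the new set of particle positions."""
    dx, dy = DIRS[cmd]
    # Particles furthest along the move direction settle first, so each particle
    # is blocked either by an obstacle or by an already-settled particle.
    order = sorted(particles, key=lambda p: p[0] * dx + p[1] * dy, reverse=True)
    settled = set()
    for x, y in order:
        while (x + dx, y + dy) not in obstacles and (x + dx, y + dy) not in settled:
            x, y = x + dx, y + dy
        settled.add((x, y))
    return settled

def run(particles, obstacles, command_sequence):
    """Execute a command sequence such as ["r", "d", "l"] (cf. Fig. 2, right)."""
    for cmd in command_sequence:
        particles = apply_move(particles, obstacles, cmd)
    return particles
```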

Fig. 2

In this image, black cells are fixed, white cells are free, solid discs are individual particles, and goal positions are dashed circles. For the simple instance on the left, it is impossible to maneuver both particles to end at their goals. The instance on the right has a finite solution: \(\langle r,d,\ell \rangle\)

3 Related work

Related work is categorized into underactuated control, manipulation, and computation.

3.1 Underactuated control

Large robot populations

Due to the efforts of roboticists, biologists, and chemists (e.g., Rubenstein et al. 2012; Ou et al. 2013; Chiang et al. 2011), it is now possible to make and field large (\(N=10^3\)\(10^{14}\)) populations of simple robots. Potential applications for these robots include targeted medical therapy, sensing, and actuation. With large populations come two fundamental challenges: how to (1) perform state estimation for the robots and (2) control these robots.

Traditional approaches often assume independent control signals for each robot, but each additional independent signal requires bandwidth and engineering. These bandwidth requirements grow at O(N). Using independent signals becomes more challenging as the robot size decreases. Especially at the micro- and nano-scales, it is not practical to encode autonomy in the robots. Instead, the robots are controlled and interacted with using global control signals. For this reason, it may be more appropriate to call the moving agents particles and label the external control system as the robot.

More recently, robots have been constructed with physical heterogeneity so that they respond differently to a global, broadcast control signal. Examples include scratch-drive microrobots, actuated and controlled by a DC voltage signal from a substrate (Donald et al. 2013); magnetic structures with different cross-sections that could be independently steered (Floyd et al. 2011); MagMite microrobots with different resonant frequencies and a global magnetic field (Frutiger et al. 2008); and magnetically controlled nanoscale helical screws constructed to stop movement at different cutoff frequencies of a global magnetic field (Peyer et al. 2013). In previous work with robots modeled as nonholonomic unicycles, we showed that an inhomogeneity in turning speed is enough to make even an infinite number of robots controllable with regard to position. All these approaches show promise, but they require precise state estimation and heterogeneous robots. At the molecular scale, only a bounded number of modifications can be made to differentiate robots. In addition, the control law computation requires at best a summation over all the robot states, which takes O(N) time (Becker et al. 2012, 2014c), and at worst a matrix inversion, which takes \(O(N^{2.373})\) time (Becker and Bretl 2012).

In this paper we take a different approach. We assume a population of approximately identical planar particles (which could be small robots) and one global control signal that contains the direction all particles should move. In an open environment, this system is not controllable because the particles move uniformly—implementing any control signal translates the entire group identically; however, an obstacle-filled workspace allows us to break this symmetry. In previous practical work (Becker et al. 2013a), we showed that if we can command the particles to move one unit distance at a time, some goal configurations have easy solutions. Given a large free space, we have an algorithm showing that a single obstacle is sufficient for position control of N particles (video of position control: http://youtu.be/5p_XIad5-Cw). This result required incremental position control of the group of particles, i.e. the ability to advance them a uniform, fixed distance. This is a strong assumption and one that we relax in this work.

Dexterity games

The problem we investigate is strongly related to dexterity puzzles—games that typically involve a maze and several balls that should be maneuvered to goal positions. Such games have a long history. Pigs in Clover, involving steering four balls through three concentric incomplete circles, was invented in 1880 by Charles Martin Crandall. Dexterity games are dynamic and depend on the manual skill of the player. Our problem formulation also applies the same input to every agent but imposes only kinematic restrictions on agents. This is most similar to the gravity-fed logic maze Tilt™, invented by Vesa Timonen and Timo Jokitalo and distributed by ThinkFun since 2010 (http://www.thinkfun.com/tilt).

Sliding-block puzzles

Sliding-block puzzles use rectangular tiles that are constrained to move in a 2D workspace. The objective is to move one or more tiles to desired locations. They have a long history. Hearn (2005) and Demaine and Hearn (2009) showed that tiles can be arranged to create logic gates and used this technique to prove pspace-completeness for a variety of sliding-block puzzles. Hearn expressed the idea of building computers from the sliding blocks—many of the logic gates could be connected together, and the user could propagate a signal from one gate to the next by sliding intermediate tiles. This requires the user to know precisely which sequence of gates to enable and disable. In contrast to such a hands-on approach, with our architecture we can build circuits, store parameters in memory, and then actuate the entire system in parallel using a global control signal.

3.2 Manipulation

Computational geometry: robot box-pushing

Many variations of block-pushing puzzles have been explored from a computational complexity viewpoint, starting with a seminal NP-hardness proof by Wilfong (1991). The general motion-planning problem, in which each command moves particles a single unit in a world containing even a single robot and both fixed and movable squares, is pspace-complete (Demaine and Hearn 2009; Dor and Zwick 1999; Hoffmann 2000).

Ricochet Robots (Engels and Kamphans 2005), Atomix (Holzer and Schwoon 2004), and PushPush (Demaine et al. 2000) have the same constraint that particles must move to their full extent, once they have been set in motion. This constraint reflects physical realities where, due to uncertainties in sensing, control application, and dynamic models, precise quantified movements in a specified direction are not possible. Instead, the input can be applied for a long period of time, guaranteeing that all particles move to their fullest extent. In these games the particles move to their full extent with each input, but each particle can be actuated individually. The problem complexity with global inputs to all particles is addressed in this paper.

Sensorless manipulation

The algorithms in the second half of our paper do not require feedback, and we have drawn inspiration from work on sensorless manipulation (Erdmann and Mason 1988). Sensorless manipulation explicitly maintains the set of all possible part configurations and selects a sequence of actions to reduce the size of this set and drive it toward some goal configuration. Carefully selected primitive operations can make this easier. Sensorless manipulation strategies often use a sequential composition of primitive operations, “squeezing” a part either virtually with a programmable force field or simply between two flat, parallel plates (Goldberg 1993). Some sensorless manipulation strategies take advantage of limit cycle behavior, for example, engineering fixed points and basins of attraction so that parts only exit a feeder when they reach the correct orientation (Lynch et al. 2002; Murphey et al. 2005). These two strategies have been applied to a much wider array of mechanisms such as vibratory bowls and tables (Goemans et al. 2006; Vose et al. 2009, 2012) or assembly lines (Akella et al. 2000; Goldberg 1993; van der Stappen et al. 2002), and have also been extended to situations with stochastic uncertainty (Goldberg et al. 1999; Moll and Erdmann 2002) and closed-loop feedback (Akella and Mason 1999; Murphey and Burdick 2004).

3.3 Computation

Parallel algorithms: SIMD

Another related area of research is Single Instruction Multiple Data (SIMD) parallel algorithms (Leighton 1991). In this model, multiple processors are all fed the same instructions to execute, but they do so on different data. This model has some flexibility, for example, allowing command execution selectively only on certain processors and no operations (NOPs) on the remaining processors.

Our model is actually more extreme. The particles all respond in effectively the same way to the same instruction. The only difference is their location and which obstacles or particles will block them. In some sense, our model is essentially Single Instruction, Single Data, Multiple Locations.

Our efforts have similarities with mechanical computers, computers constructed from mechanical, not electrical, components. For a fascinating nontechnical review, see McCourtney (2001). These devices have a rich history, from the Pascaline, an adding machine invented in 1642 by a 19-year-old Blaise Pascal, to Herman Hollerith’s punch-card tabulator in 1890, to the mechanical devices of IBM culminating in the 1940s. These devices used precision gears, pulleys, or electric motors to carry out calculations. Though our implementations in this paper are rather basic, we require none of these precision elements to achieve computational universality: merely unit-size obstacles and sliding particles sized 2 × 1 and 1 × 1.

Collision-based computing

Collision-based computing has been defined as “computation in a structureless medium populated with mobile objects.” For a survey of this area, see the excellent collection (Adamatzky and Durand-Lose 2012). Early examples include the billiard-ball computer proposed by Fredkin and Toffoli (1982), which uses only spherical balls in a frictionless environment and computes through elastic collisions with other balls and with angled walls. Another popular example is Conway’s Game of Life, a cellular automaton governed by four simple rules (Berlekamp et al. 2001–2004). Cells live or die based on the number of neighbors. These rules have been examined in depth and used to design a Turing-complete computer (Rendell 2002, 2011). Game of Life scenarios and billiard-ball computers are fascinating but lack a physical implementation. In this paper we present a collision-based system for computation and provide a physical implementation.

Programmable matter

Clearly there is a wide range of interesting scenarios for developing approaches to programmable matter. One such model is the abstract Tile-Assembly Model (aTAM) by Winfree (Winfree 1998; Winfree et al. 1998; LaBean et al. 1999), which has sparked a wide range of theoretical and practical research. In this model, unit-sized tiles interact and bond with the help of differently labeled edges, eventually composing complex assemblies. Even though the operations and final objectives in this model are quite different from our particle computation with global inputs (e.g., key features of the aTAM are that tiles can have a wide range of different edge types, and that they keep sticking together after bonding), there is a remarkable geometric parallelism to a key result of our present paper: while it is widely believed that at the most basic level of interaction (called temperature 1) computational universality cannot be achieved in the aTAM with only unit-sized tiles (Doty et al. 2009; Maňuch et al. 2010; Meunier et al. 2014), recent work by Fekete et al. (2015a) shows that computational universality can be achieved as soon as even slightly bigger tiles are used. This resembles the results of our paper, which show that unit-size particles are insufficient for universal computation, but that employing bigger particles suffices.

4 Mazes

We prove that the general problem defined in Sect. 2 is computationally intractable.

Theorem 1

globalcontrol-manyparticles is NP-hard: given a specified goal location and an initial configuration of movable particles and fixed obstacles, it is NP-hard to decide if a move sequence exists that ends with some particle at the goal location.

Proof

We prove hardness by a reduction from 3SAT. Suppose we are given n Boolean variables \(x_1, x_2, \dots , x_n\), and m disjunctive clauses \(C_j = U_j \vee V_j \vee W_j\), where each literal \(U_j,V_j,W_j\) is of the form \(x_i\) or \(\lnot x_i\). We construct an instance of globalcontrol-manyparticles that has a solution if and only if all clauses can be satisfied by a truth assignment to the variables. This instance is composed of variable gadgets for setting individual variables true or false, clause gadgets that construct the logical or of groupings of three variables, and a check gadget that constructs the logical and of all the clauses. A particle is only delivered to the goal location if the variables have been set in such a way that the formula evaluates to true.

Variable gadgets

For each variable \(x_i\) that appears in \(k_i\) literals, we construct \(k_i\) instances of the variable gadget i shown in Fig. 3, with a particle initially at the top of the gadget. The gadget consists of a tower of n levels, designed for the overall construction to make n total variable choices. These choices are intended to be made by a move sequence of the form \(\langle d, \ell /r, d, \ell /r, \ldots , d, \ell /r, d, r\rangle\), where the ith \(\ell /r\) choice corresponds to setting variable \(x_i\) to either true (\(\ell\)) or false (r). Thus variable gadget i ignores all but the ith choice by making all other levels lead to the same destination via both \(\ell\) and r. The ith level branches into two destinations chosen by either \(\ell\) or r, which correspond to \(x_i\) being set true or false, respectively.

In fact, the command sequence may include multiple \(\ell\) and r commands in a row, in which case the last \(\ell /r\) before a vertical u/d command specifies the final decision made at that level, and the others can be ignored. The command sequence may also include a u command, which undoes a d command if done immediately after or else does nothing; thus we can simply ignore the u command and the immediately preceding d, if it exists. We can also ignore duplicate commands (e.g., dd becomes d) and remove any initial \(\ell /r\) command. After ignoring these superfluous commands, assuming a particle reaches one of the output channels, we obtain a sequence in the canonical form \(\langle d, \ell /r, d, \ell /r, \ldots , d, r \rangle\) as desired, corresponding uniquely to a truth assignment to the n variables. (If no particle reaches the output port, it is as if the variable is neither true nor false, satisfying no clauses.) Note that all particles arrive at their output ports at exactly the same time.

Fig. 3

Variable gadgets that are assigned a truth value by executing a sequence of \(\langle d, \ell /r \rangle\) moves. The ith \(\ell /r\) choice sets the variable \(x_i\) to true or false by putting the particle in the left or right column. This selection move is shown in blue. Each gadget is designed to respond to the ith choice but ignore all others. This lets us make several copies of the same variable by making multiple gadgets with the same i. In the figure \(n=4\), and the input sequence \(\langle d,\ell ,d,r,d,\ell ,d,r,d,r,d\rangle\) sets (\(x_1\) = true, \(x_2\) = false, \(x_3\) = true, \(x_4\) = false). a Variable \(x_1\) set true for \(i=1\). b \(x_2\) set false, \(i=2\). c \(x_3\) set true, \(i=3\). d \(x_4\) set false, \(i=4\). (Color figure online)

Clause gadgets

For each clause, we use the or gadget shown in Fig. 4a. The or gadget has three inputs corresponding to the three literals, and input particles are initially at the top of these inputs. For each positive literal of the form \(x_i\), we connect the corresponding input to the left output of an unused instance of variable gadget i. For each negative literal of the form \(\lnot x_i\), we connect the corresponding input to the right output of an unused instance of a variable gadget i. (In this way, each variable gadget gets used exactly once.)

We connect the variable gadget to the or gadget as shown in Fig. 5: place the variable gadget above the clause so as to align the vertical output and input channels, and join them into a common channel. To make room for the three variable gadgets, we simply extend the black areas separating the three input channels in the or gadget. The unused output channel of each variable gadget is connected to a waste receptacle. Any particle reaching that end cannot return to the logic.

If any input channel of the or gadget has a particle, then it can reach the output port by the move sequence \(\langle d,\ell ,d,r \rangle\). Furthermore, because variable gadgets place all particles on their output ports at the same time, if more than one particle reaches the or gadget, they will move in unison as drawn in Fig. 4a, and only one can make it to the output port; the others will be stuck in the “waste” row, even if extra \(\langle \ell ,r,u,d \rangle\) commands are interjected into the intended sequence. Hence, a single particle can reach the output of a clause if and only if that clause (i.e., at least one of its literals) is satisfied by the variable assignment.

Fig. 4

Gadgets that use the cycle \(\langle d,\ell ,d,r \rangle\). The 3-input or gadget outputs one particle if at least one particle enters on an input line and sends any extra particle(s) to a waste receptacle. The 5-input and gadget outputs one particle to the target location, marked in gray, if at least 5 inputs are true. Excess particles are sent to a waste receptacle. a 3-input or \(\langle d,\ell ,d,r\rangle\). b 5-input and (true) \(\langle d,\ell ,d,r \rangle\). c 5-input and (false) \(\langle d,\ell ,d \rangle\)

Fig. 5

Combining 12 variable gadgets, four 3-input or gadgets, and a 4-input and gadget to realize the 3SAT expression \((\lnot x_1\vee \lnot x_3\vee x_4) \wedge (\lnot x_2\vee \lnot x_3\vee x_4) \wedge (\lnot x_1 \vee x_2\vee x_4) \wedge (x_1\vee \lnot x_2\vee x_3)\). a Initial state with particles (colored) on the upper right. The objective is to move one particle into the grey target rectangle at lower left. b Setting variables to (false, true, false, true) does not satisfy this 3SAT instance. c Setting the variables (true, false, false, true) satisfies this 3SAT instance. d Successful outcome. (true, false, false, true) moves a single particle into the target region. (Color figure online)

Check gadget

As the final stage of the computation, we check that all clauses were simultaneously satisfied by the variable assignment, using the m-input and gadget shown in Fig. 4b, c. Specifically, we place the clause gadgets along a horizontal line and connect their vertical output channels to the vertical input channels of the check gadget. Again we can align the channels by extending the black areas that separate the input channels of the and gadget, as shown in the composite diagram Fig. 5.

The intended solution sequence for the and gadget is \(\langle d, \ell , d, r \rangle\). The and gadget is designed with the downward channel exactly m units to the right of the left wall and more than 2m units from the right wall, so for any particle to reach the downward channel (and ultimately, the target location), at least m particles must be presented as input. Because each input channel will present at most one particle (as argued for the clause gadgets), a particle can reach the final destination if and only if all m clauses output a particle, which is possible exactly when all clauses are satisfied by the variable assignment.

Clearly, the size of all parts of the construction is polynomial in the size of the original 3SAT instance. This completes the reduction and the NP-hardness proof. \(\square\)

We conjecture that globalcontrol-manyparticles is in fact pspace-complete. One approach would be to simulate nondeterministic constraint logic (Hearn and Demaine 2005), perhaps using a unique move sequence of the form \(\langle d\), \(\ell /r\), d, \(\ell /r\), \(\dots \rangle\) to identify and “activate” a component. One challenge is that all gadgets must properly reset to their initial state without permanently trapping any particles. However, we are able to prove that a variant of this problem is pspace-complete in Sect. 5.3.

5 Matrices

The previous section investigated pathologically difficult configurations. This section investigates a complementary problem. Given the same particle and world constraints as before, what types of control are possible and economical if we are free to design the environment?

First, we describe an arrangement of obstacles that implements an arbitrary matrix permutation in four commands. Then we provide efficient algorithms for sorting matrices. We finish with potential applications.

5.1 A workspace for a single permutation

For our purposes, a matrix is a 2D array of particles (each possibly of a different color). For an \(a_r \times a_c\) matrix A and a \(b_r \times b_c\) matrix B, of equal total size \(N = a_r a_c = b_r b_c\), a matrix permutation assigns each element in A a unique position in B. Example constructions that execute matrix permutations of total size \(N=15\) and 100 are shown, respectively, in Figs. 6 and 7.

Fig. 6

In this image for \(N=15\), black cells are obstacles, white cells are free, and colored discs are individual particles. The world has been designed to permute the particles from ‘A’ to ‘B’ every four steps: \(\langle u,r,d,\ell \rangle\). See video at http://youtu.be/3tJdRrNShXM. Visually, the distinction between particles of the same color does not matter; however, the arrangement of obstacles induces a specific permutation of individual particles. (Color figure online)

Fig. 7

In this larger example with \(N=100\), the different control sections are easier to see than in Fig. 6. (1) The staggered obstacles on the left spread the matrix vertically, (2) the scattered obstacles on the upper right permute each element, and (3) the staggered obstacles along the bottom reform each row, which are collected by (4). The cycle resets every 740 iterations. See http://youtu.be/eExZO0HrWRQ for an animation of this gadget

Theorem 2

Let A and B be matrices with dimensions as above. Any matrix permutation that transforms A into B can be executed by a set of obstacles in just four moves. For N particles, the constructed arrangement of obstacles requires \((3N+1)^2\) space and \(4N+1\) obstacles. If particles move with speed v, the required time for those four moves is \(12N/v\).

Proof

The reader is referred to Figs. 6 and 7 for examples. The move sequence is \(\langle u,r,d,\ell \rangle\). The lower-left particle starts at (0, 0).

Move 1: We place \(a_c\) obstacles, one for each column of A, spaced vertically \(a_r-1\) units apart, such that moving u spreads the particle array into a staggered vertical line. Each particle now has its own row. Move 2: We place N obstacles to stop each particle during the move r. Because each particle has a unique row, it can be stopped at any arbitrary column by its obstacle. We leave an empty column between each obstacle to prevent collisions during the next move. Move 3: We place N obstacles to stop the particles during the move d, which arranges the particles in their final rows. These rows are spread in a staggered horizontal line. Move 4: We place \(a_r\) obstacles in a vertical line from \((-1,1)\) to \((-1,a_c)\). Moving \(\ell\) stacks the staggered rows into the desired permutation and returns the array to the initial position. \(\square\)

By reapplying the same permutation enough times, we can return to the original configuration. The permutations shown in Fig. 6 return to the original image in two cycles, while Fig. 7 requires 740 cycles. In fact, any permutation of N elements will return to its original position after it is repeated a finite number of times (Dummit and Foote 2009).
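The number of repetitions needed is the order of the permutation, the least common multiple of its cycle lengths. A small sketch (an illustrative helper, not from the paper) computes it; for instance, the construction of Fig. 7 repeats after 740 cycles.

```python
from math import gcd

def permutation_order(perm):
    """Smallest t > 0 such that applying perm t times is the identity,
    i.e. the lcm of the cycle lengths; perm[i] is the position that i maps to."""
    order, seen = 1, [False] * len(perm)
    for i in range(len(perm)):
        if not seen[i]:
            length, j = 0, i
            while not seen[j]:
                seen[j] = True
                j, length = perm[j], length + 1
            order = order * length // gcd(order, length)
    return order

print(permutation_order([1, 0, 3, 4, 2]))  # one 2-cycle and one 3-cycle: order 6
```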

For a two-color image, we can always construct a permutation that resets in two cycles. Because a permutation only rearranges particles, the target image must contain the same number of cells of each color as the source. Entries where source and target already agree can be left as fixed points; entries that must change from the first color to the second can be paired with entries that must change from the second color to the first, and each such pair is swapped. The resulting permutation transforms the source image into the target, and since it consists only of fixed points and disjoint swaps, applying it twice returns every particle to its starting position. Such a permutation is an involution, a function that is its own inverse. An involution realizing a given recoloring often does not exist for images with more than two colors.
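This construction is easy to write down explicitly; the sketch below (with images flattened to 0/1 lists, an illustrative representation) pairs up the mismatched cells and returns the resulting involution.

```python
def two_color_involution(A, B):
    """Return a permutation perm with B[i] == A[perm[i]] and perm[perm[i]] == i,
    i.e. an involution turning two-color image A into B. Both images are flat
    0/1 lists with the same number of cells of each color."""
    n = len(A)
    perm = list(range(n))                    # cells that already match stay fixed
    flip_to_0 = [i for i in range(n) if A[i] == 1 and B[i] == 0]
    flip_to_1 = [i for i in range(n) if A[i] == 0 and B[i] == 1]
    assert len(flip_to_0) == len(flip_to_1), "equal color counts required"
    for i, j in zip(flip_to_0, flip_to_1):   # pair mismatched cells as swaps
        perm[i], perm[j] = j, i
    return perm
```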

5.2 A workspace for arbitrary permutations

Theorem 2 can be exploited to generate larger sets of permutations or even all possible permutations. There is a tradeoff between the number of introduced obstacles and the number of moves required for realizing a permutation.

We start with obstacle sets that require only a small number of moves.

Theorem 3

For an arbitrary set of k permutations of N particles, we can construct a set of O(kN) obstacles, such that we can switch from a start arrangement into any of the k permutations using at most \(O(\log k)\) force-field moves.

Proof

See Fig. 8. Build a binary tree of depth \(\log k\) for choosing between the permutations by a sequence \(\langle r, d, (r/\ell ), d, (r/\ell ), \ldots , d, (r/\ell ), d, \ell , u \rangle\) with \(\log k\) decisions between r and \(\ell\), from the initial prefix \(\langle r,d \rangle\) to the final suffix \(\langle d,\ell ,u \rangle\). This gets the particles to the set of obstacles for performing the appropriate permutation. \(\square\)

Fig. 8

For any set of k fixed but arbitrary permutations of N particles, we can construct a set of O(kN) obstacles, such that we can switch from a start arrangement into any of the k permutations using at most \(O(\log k)\) force-field moves. Here \(k=4\) and ‘A’ is transformed into ‘B’, ‘C’, ‘D’, or ‘E’ in eight moves: \(\langle r,d,(r/\ell ),d,(r/\ell ),d,\ell ,u \rangle\)

Corollary 4

For any N and an arbitrary but fixed \(\varepsilon > 0\), we can construct a set of \((N!)^\varepsilon\) obstacles such that any permutation of N particles can be achieved by at most \(O(N \log N)\) force-field moves.

Proof

This follows from Theorem 3. With \(k=(N!)^{\varepsilon }/N\), the number of obstacles is \(O(kN)=O((N!)^{\varepsilon })\), and \(\log k\) becomes \(\varepsilon \log (N!)- \log N\), which is \(O(N\log N)\) because \(\log (N!)=\Theta (N\log N)\). \(\square\)

Now we proceed to more economical sets of obstacles, with arbitrary permutations realized by clockwise and counterclockwise move sequences. We make use of the following easy lemma, which shows that two base permutations are enough to generate any desired rearrangement.

Lemma 5

Any permutation of N objects can be generated by the two base permutations \(p=(1,2)\) and \(q=(1,2,\ldots , N)\). Moreover, any permutation can be generated by a sequence of length at most \(N^2\) that consists of p and q.
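For small N, the lemma can also be checked by brute force. The sketch below (a sanity check under illustrative naming, not part of the proof) runs a breadth-first search over all products of p and q and reports the worst-case number of base permutations needed.

```python
from collections import deque
from math import factorial

def worst_case_length(N):
    """BFS over products of p = (1,2) and q = (1,2,...,N): verifies that every
    permutation of N elements is reachable and returns the maximum number of
    base permutations used (to be compared with the N^2 bound of Lemma 5)."""
    p = lambda t: (t[1], t[0]) + t[2:]   # swap the first two elements
    q = lambda t: t[1:] + (t[0],)        # cyclic shift
    start = tuple(range(N))
    dist = {start: 0}
    queue = deque([start])
    while queue:
        t = queue.popleft()
        for s in (p(t), q(t)):
            if s not in dist:
                dist[s] = dist[t] + 1
                queue.append(s)
    assert len(dist) == factorial(N)     # p and q generate all permutations
    return max(dist.values())

for N in range(2, 7):
    print(N, worst_case_length(N), N * N)
```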

The proof is elementary and left to the reader. This allows us to establish the following result.

Theorem 6

We can construct a set of O(N) obstacles such that any \(a_r\times a_c\) arrangement of N particles can be rearranged into any other \(a_r\times a_c\) arrangement \(\pi\) of the same particles, using at most \(O(N^2)\) force-field moves.

Proof

See Fig. 9. Use Theorem 2 to build two sets of obstacles, one each for p and q, such that p is realized by the sequence \(\langle u,r,d,\ell \rangle\) (clockwise) and q is realized by \(\langle r,u,\ell ,d \rangle\) (counterclockwise). Then we use the appropriate sequence for generating \(\pi\) in \(O(N^2)\) moves. \(\square\)
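Translating a generating word over {p, q}, e.g. one obtained as in Lemma 5, into the command sequence executed by this obstacle set is mechanical; a tiny helper sketch (hypothetical naming) is:

```python
P_MOVES = ["u", "r", "d", "l"]   # clockwise cycle realizing p = (1,2)
Q_MOVES = ["r", "u", "l", "d"]   # counterclockwise cycle realizing q = (1,2,...,N)

def word_to_moves(word):
    """Expand a word over {'p', 'q'} into force-field moves: four moves per
    base permutation, hence O(N^2) moves for a word of length O(N^2)."""
    moves = []
    for g in word:
        moves.extend(P_MOVES if g == "p" else Q_MOVES)
    return moves

print(word_to_moves("qpq"))  # three base permutations expand to twelve moves
```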

Fig. 9

Repeated application of two base permutations can generate any permutation, when used in a manner similar to bubble sort. The obstacles in (a) generate the base permutation \(p=(1,2)\) in the clockwise direction \(\langle u,r,d,\ell \rangle\) (b) and \(q=(1,2,\ldots , N)\) in the counter-clockwise direction \(\langle r,u,\ell ,d \rangle\) (c)

Using a larger set of base permutations allows us to reduce the number of necessary moves. Again, we make use of a simple base set for generating arbitrary permutations.

Lemma 7

Any permutation of N objects can be generated by the N base permutations \(p_1=(1,2), p_2=(1,3),\ldots , p_{N-1}=(1,N)\) and \(q=(1,2,\ldots , N)\). Moreover, any permutation can be generated by a sequence of length at most N that consists of the \(p_i\) and q.

The proof is again completely straightforward and left to the reader.

Theorem 8

We can construct a set of \(O(N^2)\) obstacles such that any \(a_r \times a_c\) arrangement of N particles can be rearranged into any other \(b_r \times b_c\) arrangement \(\pi\) of the same particles, using at most \(O(N \log N)\) force-field moves.

Proof

Use Theorem 2 to build N sets of obstacles, one each for \(p_1,\ldots ,p_{N-1}, q\). Furthermore, use Lemma 7 to generate any permutation as a sequence of at most N of these base permutations, and Theorem 3 for switching between these \(k=N\) permutations. Then we can realize \(\pi\) with at most N cycles, each consisting of at most \(O(\log N)\) force-field moves. \(\square\)

This is the best possible with respect to the number of moves in the following sense:

Theorem 9

Suppose we have a set of obstacles such that any permutation of a rectangular arrangement of N particles can be achieved by at most M force-field moves. Then M is at least \(\Omega (N \log N)\).

Proof

Each permutation must be achieved by a sequence of force-field moves. Because each decision for a force-field move \(\langle u,d,\ell ,r \rangle\) partitions the remaining set of possible permutations into at most four different subsets, we need at least \(\Omega (\log (N!))= \Omega (N \log N)\) such moves. \(\square\)
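Spelled out with Stirling's approximation, the counting step reads:

\[
4^M \ge N! \quad \Longrightarrow \quad M \ge \log _4 (N!) = \frac{N \ln N - N + O(\ln N)}{\ln 4} = \Omega (N \log N).
\]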

5.3 pspace-Completeness

In Sect. 4, we showed that the problem globalcontrol-manyparticles is computationally intractable in a particular sense: given an initial configuration of movable particles and fixed obstacles, it is NP-hard to decide whether any individual particle can be moved to a specified location. In the following, we show that minimizing the number of moves for achieving a desired goal configuration for all particles is pspace-complete.

Theorem 10

Given an initial configuration of (labeled) movable particles and fixed obstacles, it is pspace-complete to compute a shortest sequence of force-field moves to achieve another (labeled) configuration.

Proof

The proof is largely based on a complexity result by Jerrum (1985), who considered the following problem: Given a permutation group specified by a set of generators and a single target permutation \(\pi\), which is a member of the group, what is the shortest expression for the target permutation in terms of the generators? This problem was shown to be pspace-complete in Jerrum (1985), even when the generator set consists of only two permutations, say, \(\pi _1\) and \(\pi _2\).

As shown in Sect. 5.2, we can realize any matrix permutation \(\pi _i\) of a rectangular arrangement of particles by a set of obstacles, such that this permutation \(\pi _i\) is carried out by a quadruple of force-field moves. We can combine the sets of obstacles for the two different permutations \(\pi _1\) and \(\pi _2\), such that \(\pi _1\) is realized by going through a clockwise sequence \(\langle u, r, d, \ell \rangle\), while \(\pi _2\) is realized by a counterclockwise sequence \(\langle r, u, \ell , d\rangle\). We now argue that a target permutation \(\pi\) of the matrix can be realized by a minimum-length sequence of m force-field moves if and only if \(\pi\) can be decomposed into a sequence of a total of n applications of permutations \(\pi _1\) and \(\pi _2\), where \(m=4n\).

The “if” part is easy: simply carry out the sequence of n permutations, each realized by a (clockwise or counterclockwise) quadruple of force-field moves. For the “only if” part, suppose we have a shortest sequence of m force-field moves to achieve permutation \(\pi\), and consider an arbitrary subsequence that starts from the base position in which the particles form a rectangular arrangement in the lower left-hand corner. It is easy to see that a minimum-length sequence cannot contain two consecutive moves that are both horizontal or both vertical: these moves would have to be in opposite directions, and we could shorten the sequence by omitting the first move. Furthermore, by construction of the obstacle set, the first move must be u or r. Now it is easy to check that the choice of the first move determines the next three ones: u must be followed by \(\langle r, d, \ell \rangle\); similarly, r must be followed by \(\langle u, \ell , d\rangle\). Any other choice for moves 2–4 would produce a longer overall sequence or destroy the matrix by leading to an arrangement from which no recovery to a rectangular matrix is possible. Therefore, the overall sequence of \(m=4n\) moves can be decomposed into n clockwise or counterclockwise quadruples. As described, each of these quadruples represents either \(\pi _1\) or \(\pi _2\), so \(\pi\) can be decomposed into n applications of permutations \(\pi _1\) and \(\pi _2\). This completes the proof. \(\square\)

Note that the result also implies the existence of solutions of exponential length, which is compatible with polynomial space. Binary counters are particular examples of such long sequences that are useful for many purposes.

6 Limitations of particle logic

After considering the complexity of rearranging given arrangements of particles, i.e., external computation, we now turn to using the particles themselves for performing logic operations, i.e., internal computation. We first establish the limitations of \(1\times 1\) particles; details on designing the full range of logic gates with the help of \(2\times 1\) particles are described in Sect. 7.

6.1 Dual-rail logic and fan-out gates

In Sect. 4 we showed that given only obstacles and particles that move maximally in response to an input, we can construct a variety of logic elements. These include variable gadgets that enable setting multiple copies of up to n variables to be true or false (Fig. 3) and m-input or and and gates (Fig. 4). Unfortunately, we cannot build not gates because our system of particles and obstacles is conservative—we cannot create a new particle at the output when no particle is supplied to the input. A not gate is necessary to construct a logically complete set of gates. To do this, we rely on a form of dual-rail logic, where both the state and inverse (A and \(\bar{A}\)) of each signal are propagated throughout the computation. Dual-rail logic is often used in low-power electronics to increase the signal-to-noise ratio without increasing the voltage (Zimmermann and Fichtner 1997). With dual-rail logic we can now construct the missing not gate, as shown in Fig. 10. The command sequence \(\langle d,\ell ,u,r\rangle\) inverts the input.

Fig. 10

Schematic and diagram of dual-rail logic gates. Each gate employs the same clock sequence \(\langle d,\ell ,u,r \rangle\); the four inputs correspond to \(A,\bar{A},B,\bar{B}\). The top row is a not gate and a connector. The middle row is a universal logic gate whose four outputs are and, nand, or, nor. The bottom row gate outputs the xor and xnor of the inputs, as well as constants 1 and 0. See video at http://youtu.be/mJWl-Pgfos0 for a hardware demonstration

We now revisit the or and and gates of Fig. 4, using dual-rail logic and the four inputs \(A,\bar{A},B,\bar{B}\). The gate in the middle row of Fig. 10 can simultaneously compute and, or, nor, and nand, using the same command sequence \(\langle d,\ell ,u,r\rangle\) as the not gate. Outputs can be piped for further logic using the interconnections in Fig. 10. Unused outputs can be piped into a storage area and recycled for later use.
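At the Boolean level (a behavioral sketch only, not the particle mechanics; the function names are illustrative), the dual-rail encoding and the universal gate can be written as follows. Note that for every input assignment exactly two particles enter (one per dual-rail signal) and exactly two of the four outputs carry a particle, as conservation requires.

```python
def dual(x):
    """Encode a Boolean as a dual-rail pair (A, notA): exactly one rail is occupied."""
    return (int(x), 1 - int(x))

def not_gate(sig):
    """Dual-rail not: exchange the two rails; no particle is created or destroyed."""
    a, na = sig
    return (na, a)

def universal_gate(A, B):
    """The four outputs of the middle gate of Fig. 10: (and, nand, or, nor)."""
    a, b = A[0], B[0]
    return (a & b, 1 - (a & b), a | b, 1 - (a | b))

# Conservation check over all dual-rail inputs.
for x in (0, 1):
    for y in (0, 1):
        assert sum(universal_gate(dual(x), dual(y))) == 2
```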

Dual-rail devices open up new opportunities, including xor and xnor gates, which are not conservative using single-rail logic. This gate, shown in the bottom row of Fig. 10, also outputs a constant 1 and a constant 0.

Consider the half adder shown in Fig. 11: with one and and one xor gate we can construct it compactly. However, we are hindered by an inability to construct a fan-out device. The fan-out of a logic gate output is the number of gate inputs it can feed or connect to. In the particle logic demonstrated thus far, each logic gate output could fan out to only one gate. This is sufficient for sum-of-products and product-of-sums operations in CPLDs (complex programmable logic devices) but insufficient for more flexible architectures. Instead, we must take any logical expression and create multiple copies of each input. For example, a half adder requires only one xor and one and gate, but our particle computation requires two A and two B inputs. In the rest of this section we prove the insufficiency of unit-sized particles for the implementation of fan-out gates and design a fan-out gate using \(2\times 1\) particles.
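A Boolean-level sketch of the half adder makes the duplicated inputs explicit (illustrative code, assuming A1/A2 and B1/B2 carry identical dual-rail values):

```python
def half_adder(A1, A2, B1, B2):
    """Half adder of Fig. 11 at the Boolean level. Because the particle gates
    have no fan-out, the circuit consumes two dual-rail copies of each input;
    each signal is a pair (x, not x)."""
    s = A1[0] ^ B1[0]              # sum bit from the xor gate
    c = A2[0] & B2[0]              # carry bit from the and gate
    return (s, 1 - s), (c, 1 - c)  # dual-rail sum and carry
```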

Fig. 11

The half adder shown above requires two copies of A and B

6.2 Only 1 × 1 particles are insufficient

First we provide terminology to define how particles interact with each other. We say that particle q blocks particle p during a move \(m_k\), if p is prevented from reaching location \({\mathbf {x}}=(x,y)\) because particle q occupies this location. As a consequence, at the end of \(m_k\), q’s location is \({\mathbf {x}}\), while particle p’s location is adjacent to \({\mathbf {x}}\), depending on the direction of \(m_k\). Furthermore, the sequence of locations \(\langle {\mathbf {s}},\ldots ,{\mathbf {g}} \rangle\) of a particle p from a start location \({\mathbf {s}}\) to its goal location \({\mathbf {g}}\) describes its path. For an unchanged sequence of force-field moves and obstacles, a particle p can only be prevented from reaching its destination by adding additional particles, or removing existing ones. We argue in the following that this will still lead to \({\mathbf {g}}\) being occupied at the end of the sequence, possibly by a different particle.

Lemma 11

Given a fixed workspace W and a command sequence \({\mathbf {m}}\) that moves a particle p from start location \({\mathbf {s}}\) to goal location \({\mathbf {g}}\), adding additional particles anywhere in W at any stage of the command sequence cannot prevent \({\mathbf {g}}\) from being occupied at the conclusion of sequence \({\mathbf {m}}\).

Proof

Consider the effect of adding a particle to workspace W. If p never gets blocked by any particle q, then p’s path remains the same. Therefore, at the conclusion of \({\mathbf {m}}\), p occupies \({\mathbf {g}}\).

Now suppose p gets blocked by q. By the definition of blocking, q prevents p from reaching some location \({\mathbf {x}}\) because q already occupies this location. After the blocking, the command sequence will continue and so particle q will continue on p’s original path, following the same instructions and therefore ending up in the same location, \({\mathbf {g}}\), unless q gets blocked by yet another particle. By induction, additional particles will have the same effect. If q gets blocked by any other particle, then this particle will continue on p’s original path. Thus by adding more particles, it is impossible to prevent some particle from occupying \({\mathbf {g}}\) at the conclusion of \({\mathbf {m}}\). \(\square\)

Corollary 12

A not gate without dual-rail inputs cannot be constructed.

Proof

By contradiction. A particle logic not gate without dual-rail inputs has one input at \({\mathbf {s}}\), one output at \({\mathbf {g}}\), an arbitrary (possibly zero) number of asserted inputs, which are all initially occupied, and an arbitrary (possibly zero) number of waste outputs.

To satisfy the not gate conditions given a command sequence \({\mathbf {m}}\), the following conditions must be satisfied.

1. If \({\mathbf {s}}\) is initially unoccupied, \({\mathbf {g}}\) must be occupied at the conclusion of \({\mathbf {m}}\).

2. If \({\mathbf {s}}\) is initially occupied, \({\mathbf {g}}\) must be unoccupied at the conclusion of \({\mathbf {m}}\).

By Lemma 11, if leaving \({\mathbf {s}}\) initially unoccupied results in \({\mathbf {g}}\) being occupied by some particle p at the conclusion of \({\mathbf {m}}\), then the addition of a particle q at \({\mathbf {s}}\) cannot prevent \({\mathbf {g}}\) from being filled, resulting in a contradiction. \(\square\)

This shows that dual-rail logic is necessary for the formation of not gates.

Additionally, we show that \(1\times 1\) particles are insufficient to produce fan-out gates. To this end, we must examine the possibilities both when we add additional particles to the scenario and when we remove them.

Lemma 13

Consider a given workspace W with a number of particles, two of which are \(p_1\) and \(p_2\), initially at \({\mathbf {s}}_1\) and \({\mathbf {s}}_2\). Let \({\mathbf {m}}\) be a command sequence that moves \(p_1\) and \(p_2\) to the respective goal locations \({\mathbf {g}}_1\) and \({\mathbf {g}}_2\). Then deleting either \(p_1\) or \(p_2\) from the original configuration results in at least one of \({\mathbf {g}}_1\) or \({\mathbf {g}}_2\) being occupied at the conclusion of \({\mathbf {m}}\).

Proof

Without loss of generality, suppose we remove particle \(p_1\).

First suppose \(p_2\) never gets blocked by \(p_1\). Then the removal of \(p_1\) will not affect the path of \(p_2\). Particle \(p_2\) has the same number of blockings that it had before the removal of \(p_1\) and so \(p_2\) will follow the same path and occupy \({\mathbf {g}}_2\) at the conclusion.

Alternatively, suppose \(p_2\) gets blocked by \(p_1\) when \(p_1\) is occupying location \({\mathbf {x}}\). Because \(p_1\) is removed, it no longer occupies \({\mathbf {x}}\) during this move; because \(p_1\) was itself stopped at \({\mathbf {x}}\) while moving in the same direction, whatever stopped \(p_1\) now stops \(p_2\) at location \({\mathbf {x}}\), previously occupied by \(p_1\). Particle \(p_2\) now proceeds along the path previously traveled by \(p_1\). Effectively, \(p_2\) has replaced \(p_1\) and follows the path until it reaches \({\mathbf {g}}_1\). Successive blockings between \(p_2\) and \(p_1\) in the original scenario are resolved in the same manner. \(\square\)

Table 1 The fan-out operation. This cannot be implemented with \(1\times 1\) particles and obstacles. Our technique uses \(2\times 1\) particles

In the context of programmable matter, it is natural to consider systems in which particles are moved around, but neither created nor destroyed; such a system is called conservative. As it turns out, this has important consequences.

Theorem 14

A conservative dual-logic fan-out gate cannot be constructed using only \(1\times 1\) particles.

Proof

We assume such a fan-out gate exists and reach a contradiction. Consider a fan-out gate W, dual-rail input locations \({\mathbf {s}}_{a}\), \({\mathbf {s}}_{\overline{a}}\), and dual-rail output locations \({\mathbf {g}}_{a_1}, {\mathbf {g}}_{a_2},{\mathbf {g}}_{\overline{a}_1},{\mathbf {g}}_{\overline{a}_2}\). Because particle logic is conservative, there must be at least one additional input location \({\mathbf {s}}_p\) and particle p. A fan-out gate implements the truth table shown in Table 1. Given an arbitrary command sequence \({\mathbf {m}}\):

1. If \({\mathbf {s}}_{a}\) and \({\mathbf {s}}_p\) are initially occupied and \({\mathbf {s}}_{\overline{a}}\) is initially vacant, then at the conclusion of \({\mathbf {m}}\) the locations \({\mathbf {g}}_{a_1}\) and \({\mathbf {g}}_{a_2}\) are occupied and the locations \({\mathbf {g}}_{\overline{a}_1}\) and \({\mathbf {g}}_{\overline{a}_2}\) are vacant.

2. If \({\mathbf {s}}_{a}\) is initially vacant and \({\mathbf {s}}_{\overline{a}}\) and \({\mathbf {s}}_p\) are initially occupied, then at the conclusion of \({\mathbf {m}}\) the locations \({\mathbf {g}}_{a_1}\) and \({\mathbf {g}}_{a_2}\) are vacant and the locations \({\mathbf {g}}_{\overline{a}_1}\) and \({\mathbf {g}}_{\overline{a}_2}\) are occupied.

We will now assume that condition 1, above, is the original scenario and add and subtract particles, applying Lemmas 11 and 13, to show that it is impossible to meet condition 2.

Assume condition 1. Particles a and p start at \({\mathbf {s}}_{a}\) and \({\mathbf {s}}_p\), respectively, and at the conclusion of \({\mathbf {m}}\) the locations \({\mathbf {g}}_{a_1}\) and \({\mathbf {g}}_{a_2}\) are occupied. Now remove particle a. According to Lemma 13, either \({\mathbf {g}}_{a_1}\) or \({\mathbf {g}}_{a_2}\) must be occupied at the conclusion of \({\mathbf {m}}\). Suppose without loss of generality that \({\mathbf {g}}_{a_1}\) is filled. By Lemma 11, adding an additional particle at location \({\mathbf {s}}_{\overline{a}}\) cannot prevent \({\mathbf {g}}_{a_1}\) from being filled. However, to meet condition 2, \({\mathbf {g}}_{a_1}\) must be vacant; thus no such gate is possible. \(\square\)

7 Device and gate design

Now we consider actually designing the clock sequence, logic gates, and wiring, making use of \(2\times 1\) particles.

Choosing a clock sequence

The clock sequence is the ordered set of moves that are simultaneously applied to every particle in our workspace. We call this the clock sequence because, as in digital computers, this sequence is universally applied and keeps all logic synchronized.

A clock sequence determines the basic functionality of each gate. To simplify implementation in the spirit of Reduced Instruction Set Computing (RISC), which uses a simplified set of instructions that run at the same rate, we want to use the same clock cycle for each gate and for all wiring. Our early work in Becker et al. (2014b) used a standard sequence \(\langle d,\ell ,d,r \rangle\). This sequence can be used to make and, or, and xor gates, and any of their inverses. This sequence can also be used for wiring to connect arbitrary inputs and outputs, as long as the outputs are below the inputs. Unfortunately, \(\langle d,\ell ,d,r \rangle\) cannot move any particles upwards. To connect outputs as inputs to higher-level logic requires an additional reset sequence that contains a \(\langle u \rangle\) command. Therefore, including all four directions is a necessary condition for a valid clock sequence for computation that reuses gates. The shortest sequence has four commands, each appearing once. We choose the sequence \(\langle d,\ell ,u,r \rangle\) and by designing examples, prove that this sequence is sufficient for logic gates, fan-out gates, and wiring.

This clock sequence has the attractive property of being a clockwise (CW) rotation through the possible input sequences. One could imagine our particle logic circuit mounted on a wheel rotating about an axis parallel to the ground. If the particles were moved by the pull of gravity, each counter-clockwise revolution would advance the circuit by one clock cycle. A gravity-fed hardware implementation of particle computation is shown in Fig. 12.

Fig. 12

Gravity-fed hardware implementation of particle computation. The reconfigurable prototype is arranged, using obstacle blocks (yellow), as a fan-out gate with a \(2\times 1\) particle (white), three supply particles (blue), and one dual-rail input (red). This paper proves that such a gate is impossible using only \(1\times 1\) particles. See the demonstrations in the video (Becker et al. 2015), https://youtu.be/H6o9DTIfkn0

Limitations

The clock sequence imposes constraints on the set of reachable positions after one cycle, as illustrated in Fig. 13. If at the completion of a \(\langle d,\ell ,u,r \rangle\) cycle a particle is located at \((s_x,s_y)\), the potential locations at the end of the next cycle are any locations except \(([s_x+1,\infty ],s_y)\) and \((s_x-1,[-\infty ,s_y-1])\). For the particle to start at \((s_x,s_y)\) after an r move, it must have been stopped by an obstacle at \((s_x+1,s_y)\), so \((s_x+1,s_y)\) is unreachable. Moving to the right of this obstacle would require a move of length \(\lambda \ne 0\) in the down direction, followed by an r move, followed by a move of \(\lambda\) in the up direction. However, r is the final move in the clock sequence, not the second, so no u move follows it within the cycle; hence locations directly to the right of the start location are unreachable in one cycle. Similarly, to end at any location \((s_x-1,g_y)\) with \(g_y\le s_y\) requires both an obstacle at \((s_x,g_y)\) and an initial down move to at least \((s_x,g_y-1)\), but these requirements are contradictory.
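These exclusions can also be checked empirically. The short experiment below (illustrative only; it assumes a single \(1\times 1\) particle starting at the origin, random obstacle layouts, and the obstacle at \((1,0)\) that the preceding r move must have stopped against) applies one \(\langle d,\ell ,u,r \rangle\) cycle for many layouts and records the resulting end positions; the positions falling in the two regions of Fig. 13 can then be inspected.

import random

DIRS = {"d": (0, -1), "l": (-1, 0), "u": (0, 1), "r": (1, 0)}

def slide(pos, obstacles, cmd):
    """Maximal slide of a single particle in direction cmd."""
    dx, dy = DIRS[cmd]
    x, y = pos
    while (x + dx, y + dy) not in obstacles:
        x, y = x + dx, y + dy
    return (x, y)

def one_cycle(pos, obstacles):
    for cmd in "dlur":
        pos = slide(pos, obstacles, cmd)
    return pos

random.seed(1)
box = {(x, y) for x in range(-7, 8) for y in range(-7, 8) if abs(x) == 7 or abs(y) == 7}
cells = [(x, y) for x in range(-6, 7) for y in range(-6, 7) if (x, y) != (0, 0)]
reached = set()
for _ in range(20000):
    # obstacle at (1, 0): the particle was stopped there by the previous r move
    obstacles = box | {(1, 0)} | set(random.sample(cells, 12))
    reached.add(one_cycle((0, 0), obstacles))

print("end positions directly to the right of the start:",
      sorted(p for p in reached if p[1] == 0 and p[0] >= 1))
print("end positions at x = s_x - 1 below the start:",
      sorted(p for p in reached if p[0] == -1 and p[1] <= -1))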

Fig. 13

Two regions are unreachable in one \(\langle d,\ell ,u,r \rangle\) cycle

A fan-out gate

A fan-out gate with two outputs implements the truth table in Table 1. By Corollary 14, this cannot be implemented with \(1\times 1\) particles and obstacles. Our technique uses \(2\times 1\) particles. A single-input, two-output fan-out gate is shown in Fig. 14. This gate requires a dual-rail input, a supply particle, and a \(2\times 1\) slider. The clockwise control sequence \(\langle d,\ell ,u,r \rangle\) duplicates the dual-rail input.

The fan-out gate can drive multiple outputs. In Fig. 15, a single input drives four outputs. This gate requires a dual-rail input, three supply particles, and a \(2\times 1\) slider. The clockwise control sequence \(\langle d,\ell ,u,r \rangle\) quadruples the dual-rail input. In general, an n-output fan-out gate with control sequence \(\langle d,\ell ,u,r \rangle\) requires a dual-rail input, \(n-1\) supply particles, and one \(2\times 1\) slider. It requires an area of size \((4n+7)\times (2n+4)\).
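For bookkeeping, these resource counts can be restated as a small helper (a convenience function only; the orientation of the area, width versus height, is not specified above and is assumed here).

def fanout_resources(n):
    """Particles and workspace area for an n-output fan-out gate under <d, l, u, r>."""
    if n < 2:
        raise ValueError("a fan-out gate has at least two outputs")
    return {
        "dual_rail_inputs": 1,
        "supply_particles": n - 1,
        "sliders_2x1": 1,
        "area": (4 * n + 7, 2 * n + 4),   # as stated above; orientation assumed
    }

print(fanout_resources(2))   # the two-output gate of Fig. 14
print(fanout_resources(4))   # the four-output gate of Fig. 15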

Fig. 14

A single input, two-output fan-out gate. This gate requires a dual-rail input, a supply particle, and a \(2\times 1\) slider. The clockwise control sequence \(\langle d,\ell ,u,r \rangle\) duplicates the dual-rail input

Fig. 15

The fan-out gate can drive multiple outputs. Here a single input drives four outputs. This gate requires a dual-rail input, three supply particles, and a \(2\times 1\) slider. The clockwise control sequence \(\langle d,\ell ,u,r \rangle\) quadruples the dual-rail input

Data storage

A general-purpose computer must be able to store data. A \(2\times 1\) particle enables us to construct read/writable data storage for one bit. A single-bit data storage latch is shown in Fig. 16. This gate is conservative, and the memory state is given by the position of the \(2\times 1\) slider: if the slider is low, the memory state is true; if the slider is high, the memory state is false. This gate implements the truth table in Table 2 and has three inputs: Set, Clear, and Read. Only one input should be true, making this a tri-rail input. This input can be generated by logic on two dual-rail inputs, a Set/Clear input S and a Read/Write input R, where Set \(= S\wedge \overline{R}\), Clear \(= \overline{S}\wedge \overline{R}\), and Read \(= R\). Depending on which input is active, the clockwise control sequence \(\langle d,\ell ,u,r \rangle\) will read, set, or clear the memory. The gate has a single output M that reports the memory state after the inputs have been computed. The entire gadget requires a \(16\times 8\) area. By combining the n-output fan-out gate shown in Fig. 15 with n data storage devices, we can read from an n-bit memory.
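The logical behavior of this latch can be summarized in a few lines (a behavioral sketch only; the geometric gadget is the one in Fig. 16, and the class and method names here are illustrative).

class OneBitLatch:
    """Behavioral model of the single-bit latch; state Q is held by the 2x1 slider."""

    def __init__(self, q=False):
        self.q = q                                 # Q: slider low = True, slider high = False

    def cycle(self, s, r):
        """One clock cycle with dual-rail inputs S (Set/Clear) and R (Read/Write)."""
        set_, clear, read = (s and not r), (not s and not r), r
        assert [set_, clear, read].count(True) == 1    # tri-rail: exactly one line active
        if set_:
            self.q = True
        elif clear:
            self.q = False
        # on Read, the state is left unchanged
        return self.q, not self.q                  # dual-rail output (M, not M)

latch = OneBitLatch()
print(latch.cycle(s=True,  r=False))   # Set   -> (True, False)
print(latch.cycle(s=False, r=True))    # Read  -> (True, False)
print(latch.cycle(s=False, r=False))   # Clear -> (False, True)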

Fig. 16

A conservative, flip-flop memory gadget. The gadget has a tri-rail input (Set, Clear, or Read) and a \(2\times 1\) state variable. The memory state Q is given by the position of the \(2\times 1\) slider: if the slider is low, the memory state is true; if the slider is high, the memory state is false. Depending on which input is active, the clockwise control sequence \(\langle d,\ell ,u,r \rangle\) will read, set, or clear the memory. The gate has a single output M that reports the memory state Q after the inputs have been computed. The entire gadget requires a \(16\times 8\) area

Table 2 A single-bit data storage latch with state Q; inputs Set, Clear, and Read; and outputs representing the memory state M and its inverse \(\overline{M}\).

A binary counter

Using the fan-out gate described above, we can generate arbitrary Boolean logic. The half adder from Fig. 11 requires a single fan-out gate, one and gate, and one xor gate.

We illustrate how several gates can be combined by constructing a binary counter, shown in Fig. 17. Six logic gates are used to implement a 3-bit counter. A block diagram of the device is shown in Fig. 18. The counter requires three fan-out gates, two summer (xor) gates, and one carry (and) gate. Six \(1\times 1\) particles and three \(2\times 1\) particles are used. The counter has three levels of gates actuated by the CW sequence \(\langle d,\ell ,u,r \rangle\) and requires three interconnection cycles of \(\langle d,\ell ,u,r \rangle\), for a total of 24 moves (6 cycles) per count.
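At the logic level, one count step of the three-bit counter can be sketched as follows (a behavioral reading of the block diagram in Fig. 18, not the particle layout itself; the exact wiring in the figure may differ in detail).

def count_step(b0, b1, b2):
    """One count of a 3-bit counter built from fan-out, XOR (summer), and AND (carry) gates."""
    f0 = (b0, b0)                 # fan-out of bit 0 (two copies)
    f1 = (b1, b1)                 # fan-out of bit 1 (two copies)
    carry = f0[0] and f1[0]       # AND (carry) gate: carry into bit 2
    n0 = not b0                   # bit 0 toggles every count
    n1 = f1[1] != f0[1]           # XOR (summer) gate: bit 1 toggles when bit 0 was 1
    n2 = b2 != carry              # XOR (summer) gate: bit 2 toggles on carry
    return n0, n1, n2

state = (False, False, False)
for tick in range(9):             # one full period is 8 counts
    print(tick, [int(b) for b in reversed(state)])   # printed as b2 b1 b0
    state = count_step(*state)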

Fig. 17

A three-bit counter implemented with particles. The counter requires three fan-out gates, two summer gates, and one carry gate. Six \(1\times 1\) particles and three \(2\times 1\) particles are used. The gates and wire buses are actuated by the CW sequence \(\langle d,\ell ,u,r \rangle\). See the video at https://youtu.be/QRAOaLZjuBY?t=4m9s for an animation

Fig. 18

Gate-level diagram for an n-bit counter

Scaling issues

Particle computation requires multiple clock cycles, workspace area for gates and interconnections, and many particles. In this section we analyze how these scale with the size of the counter, using Fig. 18 as a reference.

Gates :

An n-bit counter requires \(3(n-1)\) gates: n fan-out gates, \(n-1\) summer (xor) gates, and \(n-2\) carry (and) gates.

Particles :

We require n \(1\times 1\) particles, one for each bit, and n \(2\times 1\) particles, one for each fan-out gate.

Propagation delay :

The counter requires n stages of logic and n corresponding wiring stages. Each stage requires a complete clock cycle \(\langle d,\ell ,u,r \rangle\) for a total of 8n moves.

These requirements are comparable to those of a ripple-carry adder: the delay for n bits is n gate delays, and \(5(n-1)+2\) gates are required. Numerous other schemes exist to speed up the computation; however, using discrete gates allows us to apply standard methods for translating a Boolean expression into gate-level logic. If speed were critical, instead of using discrete gates, we could engineer the workspace to compute the desired logic directly.
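The scaling above can be restated as a small helper for quick estimates (a convenience calculation only; the dictionary keys are illustrative, and the particle counts follow the list above rather than the specific layout of Fig. 17).

def counter_resources(n):
    """Gates, particles, and per-count moves for an n-bit counter (n >= 3)."""
    return {
        "fanout_gates": n,
        "xor_gates": n - 1,
        "and_gates": n - 2,
        "total_gates": 3 * (n - 1),
        "particles_1x1": n,            # one per bit
        "sliders_2x1": n,              # one per fan-out gate
        "moves_per_count": 8 * n,      # n logic stages + n wiring stages, 4 moves each
    }

print(counter_resources(3))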

Optimal wiring schemes

With our current CW clock cycle, we cannot have outputs in the same column as their inputs: outputs must lie either one column to the right or three columns to the left. Choosing only one of these options produces a horizontal shift at each stage and thus requires spreading out the logic gates. A more compact wiring scheme cycles through three layers that each shift right by one, followed by one layer that shifts left by three. We also want the wiring to be tight left-to-right. If the height is also limited, wire buses, shown in Fig. 17, provide a compact solution.

Optimized logic

The particle computation presented in this section is general purpose, and Fig. 17 illustrates how a set of gate components can be composed to compute arbitrary logic. Single-purpose logic can often be more compact, as demonstrated in Fig. 19, which shows three counters that use less area and fewer particles than the counter in Fig. 17.

Fig. 19

Custom logic can be compact. Each counter above uses the clock sequence \(\langle d,\ell ,u,r \rangle\) and resets after 8 cycles (32 moves). (Left) An arrangement that cyclically counts from zero to seven: seven particles, one \(2\times 1\) slider, \(8\times 18\) area. (Center) A binary counter with three bits: three particles, three \(2\times 1\) sliders, \(14\times 14\) area. (Right) A gadget that resets every 32 moves: one particle, \(16\times 14\) area

8 Conclusion

This paper analyzes the problem of steering many particles with uniform inputs in a 2D environment containing obstacles. We design environments that can efficiently perform matrix operations on groups of particles in parallel, including a matrix permutation requiring only four moves for any number of particles. These matrix operations enable us to prove that the general motion planning problem is pspace-complete.

We also introduce a new model for mechanical computation. We

  1. prove the insufficiency of unit-size particles for gate fan-out;

  2. establish the necessity of dual-rail logic for Boolean logic;

  3. design fan-out gates and memory latches by employing slightly different particles; and

  4. present an architecture for device integration, a common clock sequence, and a binary counter.

There remain many interesting problems to solve. We are motivated by practical challenges in steering micro-particles through vascular networks, which are common in biology. Though some of these networks are two-dimensional, including the leaf example in Fig. 1 and endothelial networks on the surface of organs, many are three-dimensional. Magnetically actuated systems are capable of providing 3D control inputs, but control design poses additional challenges.

We investigate a subset of control in which all particles move maximally. Future work should investigate more general motion: what happens if we can move all particles a discrete distance, or along an arbitrary curve? We also abstracted away important practical constraints, e.g., ferromagnetic objects tend to clump in a magnetic field, and most magnetic fields are not perfectly uniform.

Finally, our research has potential applications in micro-construction, microfluidics, and nano-assembly. These applications require additional theoretical analysis to model heterogeneous objects and objects that bond when forced together, e.g., MEMS components and molecular chains.