Abstract
This paper focuses on the design and analysis of a P+d+f (Proportional, Derivative, and Force) variable control strategy aimed at delayed bilateral teleoperation of a manipulator robot, the ultimate goal of which is to obtain simultaneous coordination of force and position between the haptic device and the robot. The proposed controller changes the damping based on both the time delay and feedback power signal measured online. Unlike other P+d+f strategies, this proposal avoids terms with discontinuities in the controller, cancellation of human and environment forces and also prevents the explicit use of environment parameters. The proposal uses variable damping dependent on a feedback power signal, which reduces kinetic energy to ensure bounded control errors. Simulations are performed to verify that dual coordination is achieved without using explicit nonlinear damping in the controller or needing the parametric knowledge of the environment model, which is useful to apply the controller to most commercial manipulator robots.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Teleoperation systems broaden human operators’ capabilities allowing them to remotely perform physical work in places that may be too dangerous or even unreachable for humans [1]. They are formed by two stations connected by a communication link which inevitably introduces backward and forward time delay. The robot that performs the physical work is located in the remote station, which we call follower or remote robot. On the other hand, the robot that generates the commands is located in the local station, and we will refer to it as a haptic device or leader. A human operator uses the haptic device to drive the remote robot to complete a task while he or she receives multimedia information (visual and audio) and force feedback from the remote site. This coupling is defined in the literature as bilateral teleoperation and allows the human tactile perception of the remote task (closed-loop control). These systems can be applied to diverse fields such as rescue and surveillance, spatial and ground exploration, explosives deactivation, tele-manufacturing, among others [2].
Delayed bilateral teleoperation systems (DBTS) are modeled from delayed nonlinear multi-degree-of-freedom (DOF) robotic systems, where one main issue is the destabilizing effect caused by the time delay, which is time-varying and asymmetric [3,4,5]. Many controllers mostly proposed and evaluated using DBTS of robot arm manipulators were introduced. Some of these strategies have been modified, adapted, and tested using other types of remote robots such as wheeled mobile robots [6], mobile manipulators [7,8,9], UAV (unmanned aerial vehicles) [10, 11], and teleoperation networks [12, 13]. With respect to the stability of DBTS, the majority of the schemes use passivity-based control [14] including scattering transformation, wave variable and controllers based on time-domain passivity [15,16,17], damping injection [18] and adaptive, impedance and neural network controllers [19, 20]. Other sources that induce non-passive behaviors in the system are the human operator and the environment [21]. To achieve stable position coordination, the simplest controller is the P+d type. In this type of controller, the energy extraction, based on damping, must be high enough to get stable position coordination [18, 22, 23]. But these controllers are calibrated mainly depending on the time delay and the setting is free of the human operator. Even more, the condition to reach stability is frequently fulfilled considering passive human actions represented by a spring-damping model [24]. However, this model type does not always faithfully represent the human operator behavior [21], and therefore, other models should be analyzed to get conclusions about how the controller could be designed or calibrated depending on them.
On the other hand, beyond stability, one of the crucial concepts in bilateral teleoperation is known as transparency, defined in [25] as a measure of how the human is coupled to the remote environment. In any bilateral teleoperator system design, the essential goal is to provide an adequate transmission of signals (positions, velocities, forces) to link the operator as closely as possible to the remote workspace. If the teleoperation system is completely transparent, then operators should feel a direct interaction with the remote task. Usually, there is a balance between stability and transparency. In [26], the transparency definition is quantitative extended to the time domain, isolating three components called remote, local, and instantaneous.
With the intention of achieving stable and transparent systems, different architectures have been proposed; the firstly called four-channel architectures which send a bidirectional flow of force and position, such as position–force domain passivity schemes, based mainly on experimental study and showing that the human operator does not remain passive in the position–force domain for all ranges of frequencies [27], TDPA (time-domain passivity approach) which combines passivity observers and passivity controllers [28,29,30,31] where these schemes can be included into a two-layer approach as in [32] but the main drawback of the schemes TDPA are that they need the whole energy flow which cannot be computed on one side of the teleoperation system and therefore predictors or delayed energy flow are necessary to use, and Lyapunov-based strategies addressed to achieve simultaneous tracking of force and position as in [33, 34], where P+d controllers plus force transmission (P+d+f) for both sides are applied to delayed nonlinear teleoperation of manipulator robots, and recently, in [35] a position/force control scheme is proposed for bilateral teleoperators with time-varying delays based on the estimation of velocities and forces by using only joint position measurements, assuring that position and force tracking errors become ultimately bounded with arbitrarily small ultimate bound in finite time.
A problem with schemes mentioned is that they must use delayed energy flow or rely heavily on sign functions, adding a discontinuity in control actions as well as controller parameters depending on the environment parameters, making it unfeasible for its application in practice.
Due to the aspects described above, we highlight the need for new control aimed to improve the current overall performance of DBTS in the sense of stability and transparency, via a control scheme applicable in practice. This paper proposes a strategy to get stable coordination for DBTS. The result achieved involves the coordination of force and position. To the best of the authors knowledge, this proposal is novel in the following respects: (a) design of a stable controller for force and motion coordination in a DBTS of a manipulator robot, based on the injection of variable linear damping on the haptic device, avoiding nonlinear damping and discontinuities, thus allowing it to be applied in most commercial robots that include internal configurable PID controllers; (b) the linear damping injected on the local site is changed online based on the time delay and feedback power signal, the latter is defined as the product between leader velocity and delayed environment force, and it is computed only on one side of the delayed teleoperation system allowing to detect immediately when the feedback power injects more energy into the system; (c) simulations using a PELICAN robot and simple experiments are performed evaluating the position and force errors, to verify that such errors hold bounded achieving a simultaneous coordination of force and position.
The paper is presented as follows: Sect. 2 describes the dynamic models, properties, and assumptions used in this work. In Sect. 3, a novel control scheme is proposed, and the stability analysis is performed. Section 5 shows simulation results comparing the proposed scheme with a representative controller of the state of the art, as well as simple experiments applying the scheme proposed. Finally, Sect. 6 shows the discussions and conclusions of the performed simulations and experiments.
1.1 Nomenclature
Table 1 presents the nomenclature used in the mathematical development of this paper.
2 System modeling
To analyze the delayed teleoperation system of a manipulator with force feedback, we first briefly describe the assumptions, properties, and models that this work uses.
2.1 Models
The traditional dynamic model for robot manipulators represented in Cartesian coordinates is used,
where \(\mathbf {x_\textrm{m}}, \mathbf {x_\textrm{s}}\in R^n\) are the leader and follower positions in Cartesian coordinates, respectively; \(\dot{{\textbf{x}}}_{\textbf{m}}, \dot{{\textbf{x}}}_{\textbf{s}}\) are the corresponding velocities, \(\mathbf {M_\textrm{m}}(\mathbf {x_\textrm{m}}),\mathbf {M_\textrm{s}}(\mathbf {x_\textrm{s}})\in R^{nxn}\) are the inertia matrices, \(\mathbf {C_\textrm{m}}(\mathbf {x_\textrm{m}}, \dot{{\textbf{x}}}_{\textbf{m}} ), \mathbf {C_\textrm{s}}(\mathbf {x_\textrm{s}}, \dot{{\textbf{x}}}_{\textbf{s}} )\) represent the centripetal and Coriolis forces, \(\mathbf {g_\textrm{m}}(\mathbf {x_\textrm{m}} ), \mathbf {g_\textrm{s}}(\mathbf {x_\textrm{s}} )\) describe the gravitational forces, \(\mathbf {f_\textrm{h}}\) and \(\mathbf {f_\textrm{e}}\) are the human and environment forces, and \(\mathbf {f_\textrm{m}},\mathbf {f_\textrm{s}}\) represent the forces, computed by the controller. We assume that the robots do not operate in workspace singular points.
2.2 Assumptions and properties
In this work, the following assumptions, properties and lemmas are used [14, 15, 36]:
Property 1
The inertia matrices \(\mathbf {M_\textrm{m}}\) and \(\mathbf {M_\textrm{s}}\) are symmetric positive definite.
Property 2
The matrices \({\dot{{\textbf{M}}}}_{\textbf{m}}-2\mathbf {C_\textrm{m}}\) and \({\dot{{\textbf{M}}}}_{\textbf{s}}-2\mathbf {C_\textrm{s}}\) are skew-symmetric.
Property 3
There exist \(\psi _1,\psi _2>0\) such that \(\mathbf {C_\textrm{m}}\dot{{\textbf{x}}}_{\textbf{m}}\le \psi _1\left| \dot{{\textbf{x}}}_{\textbf{m}}\right| \) and \(\mathbf {C_\textrm{s}}\dot{{\textbf{x}}}_{\textbf{s}}\le \psi _2\left| \dot{{\textbf{x}}}_{\textbf{s}}\right| \) for all time t.
Property 4
If \(\dot{{\textbf{x}}}_{\textbf{m}}, \ddot{{\textbf{x}}}_{\textbf{m}}, \dot{{\textbf{x}}}_{\textbf{s}}, \ddot{{\textbf{x}}}_{\textbf{s}}\) are bounded, then the time derivative of \(\mathbf {C_\textrm{m}}\) and \(\mathbf {C_\textrm{s}}\) are bounded too.
Assumption 1
Based on [37]. The communication channel adds forward \(h_1\) and backward \(h_2\) time delays. These delays are time-varying, bounded and asymmetric. Therefore, it is possible to establish \({\bar{h}}_1\) and \({\bar{h}}_2\) such that \(0\le h_1( t )\le {\bar{h}}_1\) and \(0\le h_2( t )\le {\bar{h}}_2\) for all t and \({\dot{h}}_1< \tau < 1\),\({\dot{h}}_2< \tau < 1\).
Assumption 2
Based on [23]. The remote environment is considered as a viscous-elastic force plus a constant force and a non-modeled signal of finite energy. Such force output is represented by:
where \(k_\textrm{e},\alpha _\textrm{e}\) are the elasticity and damping coefficients of the environment model, the vector \(\mathbf {x_\textrm{s}}-\mathbf {x_\textrm{e}}\) represents the distance that penetrates the robot on the elastic medium taking for simplicity \(\mathbf {x_\textrm{e}}={\textbf{0}}\), \({\textbf{B}}\) is a constant value and \(\mathbf {f_{\textrm{a}_\textrm{e}}}\in {\mathcal {L}}_2\cap {\mathcal {L}}_\infty \) represents a bounded-energy non-modeled signal with bounded too, being \({\bar{f}}_{\textrm{a}_\textrm{e}}\) a positive constant.
Assumption 3
Based on [37], the human operator behaves as a P+d like controller plus and a non-modeled signal of finite energy. Such model is described as follows:
where constants \(\alpha _\textrm{h} > 0\) and \(k_\textrm{h}>\) represent the human operator damping and the elastic coefficient. On the other hand, the mental reference of the operator is assumed to be composed of a constant setpoint A plus a variable part \(\mathbf {f_{\textrm{a}_\textrm{h}}}\in {\mathcal {L}}_2\cap {\mathcal {L}}_\infty \) which is assumed to hold \(\Vert \mathbf {f_{\textrm{a}_\textrm{h}}}\Vert \le {\bar{f}}_{\textrm{a}_\textrm{h}}\) and bounded, where \({\bar{f}}_{\textrm{a}_\textrm{h}}\) is a positive constant value. In this sense, \(g_\textrm{h}\) is the internal gain of the human operator that is translated into the intensity by which the setpoint is pursued. From a control perspective, A represents the system convergence point in the steady state. Although most models present in the literature are passive [24], this assumption does not cover most of the tasks that the human operator can perform, which must be explicitly evaluated for each particular task, where the human parameters can be identified following a procedure like [21].
Lemma 1
(Barbalat) [38]. For any \({\textbf{f}}:R_{\ge 0}\rightarrow R^n\), if \(\lim _{t\rightarrow \infty }\int _{0}^{t}{{\textbf{f}}( \tau )}\,\textrm{d}\tau \) exists and is finite, and \({\textbf{f}}( t )\) is uniformly continuous, i.e., \(\dot{{\textbf{f}}}( t )\in {\mathcal {L}}_\infty \), then \(\lim _{t\rightarrow \infty }{\textbf{f}}( t )=0\). Besides, for any \({\textbf{f}}:R_{\ge 0}\rightarrow R^n\), if \({\textbf{f}}( t )\in {\mathcal {L}}_2\cap {\mathcal {L}}_\infty \) and \(\dot{{\textbf{f}}}( t )\in {\mathcal {L}}_\infty \), then \(\lim _{t\rightarrow \infty }{\textbf{f}}( t )=0\).
Lemma 2
[39] For vector functions \({\textbf{a}}(. )\), \({\textbf{b}}(. )\) and a variable value h(t) with \(0\le h( t )\le {\bar{h}}\), the following expression is valid:
3 Controller for force and position coordination
We propose to couple a P+d+f structure (including the interchange of force and position signals between the local and remote sites) with a variable damping depending on a feedback power signal. The controller output \(\mathbf {f_\textrm{m}}\) (applied to the leader) and \(\mathbf {f_\textrm{s}}\) (executed by the follower) is proposed as:
where the parameters \(k_\textrm{s}\) and \(k_\textrm{m}\) represent gain values, \(k_\textrm{g}\) maps the haptic device position to the remote robot set point, \(k_\textrm{f}\) describes a force scaling, \(\alpha _\textrm{m}\) and \(\alpha _\textrm{s}\) are damping coefficients, and \(\Delta \) represents a variable linear damping that will depend on the feedback power signal measurable on the local site.
This scheme is compatible with the internal structure of commercial manipulator robots which commonly are based on PID (or modified PID) controllers. Figure 1 presents the architecture of the control system.
To understand how the proposal works, we analyze the energies involved in the leader. First, we define the feedback power signal \(P_{df}\) as the product of the leader velocity and the delayed environment force (8). This signal will be used in \(\Delta \) to vary the total damping of the leader, and in doing so, indirectly control the total energy
The energy \(E_\textrm{Pd}\) (10) is the result of the force \({\textbf{f}}_\textbf{ctr}\) (9) applied by the P+d (Proportional plus damping) controller used in the haptic device (6)
The human operator energy \({{E}_\textrm{h}}\) is the result of the force that the operator applies to the haptic device (11)
When the force of the remote environment (with its corresponding transmission delay) is applied to the haptic device, a new energetic component is introduced in the haptic device that we call feedback energy \({{E}_\textrm{f}}\) (12)
Finally, the haptic device energy \({{E}_\textrm{m}}\) is computed as the sum of \({{E}_\textrm{Pd}}\), \({{E}_\textrm{h}}\) and \({{E}_\textrm{f}}\) (13)
According to the state of the art, the added constant linear damping depends mainly on time delays. We join this concept with variable damping, obtaining a novel \(\mathbf {f_{ctr}}\) that reduces the kinetic energy to the extent that the feedback power injects more energy into the system. In contrast, when the feedback power draws energy from the leader, the proposed controller raises the damping only as a function of the time delay. This concept is illustrated in Fig. 2.
3.1 Stability analysis
First, case \({\textbf{A}}={\textbf{B}}={\textbf{0}}\) in (3) and (4) will be analyzed. The control error is defined as \({\textbf{e}}=k_\textrm{g}\mathbf {x_\textrm{m}}-\mathbf {x_\textrm{s}}\). A positive definite functional \(V( \mathbf {x_\textrm{m}},\dot{{\textbf{x}}}_{\textbf{m}},k_\textrm{g}\mathbf {x_\textrm{m}}-\mathbf {x_\textrm{s}},\mathbf {x_\textrm{s}},\dot{{\textbf{x}}}_{\textbf{s}}) =V_1+V_2+V_3+V_4\) is proposed and how its time evolution is, along the trajectories of the system from a finite initial condition. The functional is formed by the following terms:
Here, \(\kappa _i\in {\mathbb {R}}^+\) are positive constants defined in “Appendix A”.
Going on the procedure given in “Appendix A”, \(\dot{V}\) along the trajectories of the system is analyzed considering the closed-loop dynamics of both leader and follower, time delay, and the human operator and environment forces. As a result, we get:
where
with \(\kappa _4:=(k_\textrm{m}-g_\textrm{h})/(k_\textrm{s} k_\textrm{g})\).
From (18) to achieve bounded velocities, we design \(\Delta \) to dynamically compensate the term \(x = k_\textrm{f}\dot{{\textbf{x}}}_{\textbf{m}}^{\textsf{T}}\mathbf {f_\textrm{e}}(t-h_2)\), called feedback power signal, measurable on the local site.
We propose \(\Delta \) to comply with the following general conditions:
-
1.
\(\Delta (x)\) varies with the feedback power, i.e., \(\Delta (x) = f({\dot{{\textbf{x}}}_{\textbf{m}}^{\textsf{T}}\mathbf {f_\textrm{e}}(t-h_2)})\).
-
2.
\(\Delta (x)\) only extracts energy from system, then \(\Delta (x) \ge 0 \forall x\).
-
3.
To preserve transparency, \(\Delta (x)\) must remain low when the direction of \(\dot{{\textbf{x}}}_{\textbf{m}}\) and \(\mathbf {f_\textrm{e}}(t-h_2)\) are opposite, i.e., \(\lim _{x\rightarrow -\infty } \Delta (x) = 0\).
-
4.
When the direction of \(\dot{{\textbf{x}}}_{\textbf{m}}\) and \(\mathbf {f_\textrm{e}}(t-h_2)\) are the same, \(\Delta (x) \) must be a monotonically increasing function with respect to the feedback power, i.e., if \( x \le y\) then \(\Delta (x) \le \Delta (y)\).
-
5.
To preserve the stability of the system when the direction of \(\dot{{\textbf{x}}}_{\textbf{m}}\) and \(\mathbf {f_\textrm{e}}(t-h_2)\) are the same, the power extracted by \(\Delta (x)\) must be greater than the power of the feedback signal, i.e., \(\dot{{\textbf{x}}}_{\textbf{m}}^{\textsf{T}}\Delta \dot{{\textbf{x}}}_{\textbf{m}} \ge k_\textrm{f}\dot{{\textbf{x}}}_{\textbf{m}}^{\textsf{T}}\mathbf {f_\textrm{e}}(t-h_2) \forall \dot{{\textbf{x}}}_{\textbf{m}}^{\textsf{T}}\mathbf {f_\textrm{e}}(t-h_2) >0\).
Remark 1
If \(\alpha _\textrm{m}\) and \(\alpha _\textrm{s}\) are sufficiently big (high damping) to fulfill \(\lambda _\textrm{m},\lambda _\textrm{s}>0\) (19), the condition \(k_\textrm{m}>g_\textrm{h}\) is hold, then we can get from V (14)–(17) and \(\dot{V}\) (18) that variables \(\dot{{\textbf{x}}}_{\textbf{m}}\), \(\dot{{\textbf{x}}}_{\textbf{s}}\in {\mathcal {L}}_\infty \).
Next, if (18) is integrated with respect to time, we get:
Remark 2
The integral terms \(\int _{0}^{t}{\dot{{\textbf{x}}}_{\textbf{m}}(\varepsilon )^{\textsf{T}}\rho _\textrm{m}(\varepsilon )}\,\textrm{d}\varepsilon \) and \(\int _{0}^{t}{\dot{{\textbf{x}}}_{\textbf{s}}(\varepsilon )^{\textsf{T}}\rho _\textrm{s}2(\varepsilon )}\,\textrm{d}\varepsilon \) are bounded since \(\mathbf {f_{\textrm{a}_\textrm{e}}},\mathbf {f_{\textrm{a}_\textrm{h}}}\in {\mathcal {L}}_2\) (Assumptions 2 and 3) and \(\dot{{\textbf{x}}}_{\textbf{m}},\dot{{\textbf{x}}}_{\textbf{s}}\in {\mathcal {L}}_\infty \).
The design of the variable damping \(\Delta \) allows that
and due to V is radially unbounded, we get V bounded in (21) for all t, hence \(\mathbf {x_\textrm{m}},\mathbf {x_\textrm{s}}\in {\mathcal {L}}_\infty \). Therefore, \(\left\| \dot{{\textbf{x}}}_{\textbf{m}} \right\| _2^2\) and \(\left\| \dot{{\textbf{x}}}_{\textbf{s}} \right\| _2^2\) are bounded too, which implies that \(\dot{{\textbf{x}}}_{\textbf{m}},\dot{{\textbf{x}}}_{\textbf{s}}\in {\mathcal {L}}_2\). It is important to remark that as greater the magnitude of \(\Delta \) is, the system better avoids the possible increase of the feedback power injected to the leader, while if such control parameter is low, the transparency is higher. Consequently, the selection of \(\Delta \) must take into account a trade-off between stability (energy boundness) and transparency (the human operator perceiving the environment such as it is). Because of this trade-off, the expression of \( \Delta \) is a design factor that must be selected for each particular application. Without losing generality, in Sect. 5 we propose a possible expression for \(\Delta \).
In addition, the closed-loop dynamics of the leader (1) and follower (2), including the proposed controller in (6) and (7), can be represented by the following expressions:
Remark 3
Due to Remark 1 and 2, \(\dot{{\textbf{x}}}_{\textbf{m}},\dot{{\textbf{x}}}_{\textbf{s}}\in {\mathcal {L}}_\infty \cap {\mathcal {L}}_2\), and therefore, \(\int _{0}^{t}{\dot{{\textbf{x}}}_{\textbf{m}}}\,\textrm{d}t\) and \(\int _{0}^{t}{\dot{{\textbf{x}}}_{\textbf{s}}}\,\textrm{d}t\) have finite limit for \(t\rightarrow \infty \). Furthermore, all terms on the right hand of (22) and (23) are bounded from Remark 1 and Assumptions 1, 2 and 3; i.e., \(\dot{{\textbf{x}}}_{\textbf{m}},\dot{{\textbf{x}}}_{\textbf{s}}\) are uniformly continuous. Next, Lemma 1 can be applied concluding that \(\dot{{\textbf{x}}}_{\textbf{m}},\dot{{\textbf{x}}}_{\textbf{s}}\rightarrow {\textbf{0}}\) as \(t\rightarrow \infty \). Lemma 1 can also be applied to \(\int _{0}^{t}{\ddot{{\textbf{x}}}_{\textbf{m}}(\varepsilon )} \,\textrm{d}\varepsilon \) and \(\int _{0}^{t}{\ddot{{\textbf{x}}}_{\textbf{s}}(\varepsilon )}\,\textrm{d}\varepsilon \), which have finite and defined limit for \(t\rightarrow \infty \). Deriving (22) and (23) it is feasible to verify that \(\dddot{{\textbf{x}}}_{\textbf{m}}\) and \(\dddot{{\textbf{x}}}_{\textbf{s}}\) are bounded; then we get as the result that \(\ddot{{\textbf{x}}}_{\textbf{m}},\ddot{{\textbf{x}}}_{\textbf{s}}\rightarrow {\textbf{0}}\) as \(t\rightarrow \infty \). We remark that from this result it is possible to infer that \(\Delta (x) \rightarrow 0\) for \(t \rightarrow \infty \), which implies that the total damping of the haptic device tends to \(\alpha _\textrm{m}\).
3.2 Force and position dual coordination
To obtain 1, 2 and 3, \({\textbf{A}}={\textbf{B}}={\textbf{0}}\) was considered. Otherwise, a variable change is performed in order that a similar stability analysis can be carried out [34].
To get the new Krasovskii-like equilibrium solution the closed-loop dynamics is analyzed for \(t\rightarrow \infty \), in this situation let us assume that \(\dot{{\textbf{x}}}_{\textbf{m}},\ddot{{\textbf{x}}}_{\textbf{m}},\dot{{\textbf{x}}}_{\textbf{s}},\ddot{{\textbf{x}}}_{\textbf{s}}\rightarrow {\textbf{0}}\), considering \({\textbf{A}}\) and \({\textbf{B}}\) as constant values. Next, the following relations in stationary state can be expressed from (22) and (23):
where \(\bar{{\textbf{x}}}_{\textbf{m}}:=\lim _{t\rightarrow \infty }\mathbf {x_\textrm{m}}(t)\), \(\bar{{\textbf{x}}}_{\textbf{s}}:=\lim _{t\rightarrow \infty }\mathbf {x_\textrm{s}}(t)\), \(\bar{{\textbf{f}}}_{\textbf{h}}:=\lim _{t\rightarrow \infty }\mathbf {f_\textrm{h}}(t)\) and \(\bar{{\textbf{f}}}_{\textbf{e}}:=\lim _{t\rightarrow \infty }\mathbf {f_\textrm{e}}(t)\). Observing (24) and (25), it is possible to infer that:
Next, if the proposed human operator model (4) and the environment force (3) are included into (26), the relations represented below can be achieved at steady state:
Finally, analyzing (26) and (27), the following variable change is proposed:
where \(\bar{{\textbf{x}}}_{\textbf{m}}\) and \(\bar{{\textbf{x}}}_{\textbf{s}}\) are constant values. If (28) and (29) are applied to (22) and (23), and on the resulting relations the proposed human operator model (4) and the environment force (3) are included, then it is possible to get the following equations to represent the DBTS:
Now, if \(\hat{{\textbf{f}}}_{\textbf{h}}\) and \(\hat{{\textbf{f}}}_{\textbf{e}}\) are defined by:
Then, considering the relations (26) and (27), equations (30) and (31) can be rewritten, in a similar way to (22) and (23), as follows:
Remark 4
A procedure similar to the one carried on Remark1, 2 and 3 can be applied to the delayed system represented by (33) and (34), getting as a result that the teleoperation system converges for \(t\rightarrow \infty \) to an equilibrium in which a simultaneous synchronism of force (force exerted by the human operator will tend to the scaled environment force) and motion (the position mapped linearly by the leader will tend to the follower’s position) is obtained. Besides, the human operator can change the convergence point handling their internal reference \({\textbf{A}}\).
4 Experimental results
In this section, the system is implemented in practice. In these experiments, a human operator commands a remote manipulator robot through a network, to execute an inspection by contact of a soft medium. Novint Falcon is used as a 3D haptic joystick to drive a Robotis Manipulator-H as the remote robot. The human and the environment forces are measured with a Variense 3-axis force sensor, model fs103. Each device runs in a separate computer and is linked with a Ubiquiti NanoStation long-range antenna. Each device (sensors and robots) is controlled employing independent executable programs developed in C++ language. Moreover, the controllers are implemented within Matlab Simulink running in real-time mode. All internal software modules of each computer communicate with each other through shared memory. A layout of the experimental setup and views of the local and remote sites is shown in Figs. 3 and 4, respectively.
The performed test involves utilizing a remotely teleoperated manipulator, controlled over the network, to conduct a horizontal scan on a soft surface while maintaining a nearly constant vertical force. The human operator’s task is to position the robot in contact with the medium and assess its hardness by exerting a displacement/force along the z-axis. Subsequently, the operator must examine the medium’s resistance to movement by applying a displacement/force along the y-axis while keeping the z-axis position and force as constant as possible. Finally, the operator is required to return the robot to its initial position.
Although the desired position command A is not explicitly given to the operator, maintaining a constant contact force with a passive medium implies A to be constant and \(B = 0[N]\). The medium is positioned at the origin of the z-axis.
A video demonstrating teleoperation with high delay can be viewed here.
Experiments are repeated for three different communication time delays \(h = h_1+h_2 \): low, medium, and high.
For medium and high delay, a simulated time-variant delay is added to both incoming and outcoming signals. These delays \(h_i\) are generated by random sinusoidal signals mounted on random constant with the addition of white noise. The linear damping is qualitatively calibrated for each test based on (19).
4.1 Setup
For the controller implementation, first an expression for \( \Delta (t) \) is chosen, following the guidelines of Sect. 3.1. For a sake of simplicity we propose to use the function:
where \( k_{\alpha i} \) are positive constants, which was adjusted empirically: first performing an experiment and measuring the maximum value of the feedback power, then choosing a value high enough to ensure the condition 3.1. The parameters used in each experiment are summarized in Table 2.
On the other hand, \(\alpha _\textrm{m}\) and \(\alpha _\textrm{s}\) values were calculated for each \( h_i \) using (19).
4.2 Results
Figures 5, 6 and 7 show the results of the experiments. Despite the time delay (last graph in each figure), accurate position tracking on both the y-axis and z-axis is observed. In addition, even when the forces signals are (as usual) very noisy, a good force tracking of the system is observed in both, waveform and magnitude. It should be noted that the force synchronization occurs only when the robot is in contact with the medium, that is, \(F_\textrm{e} \ne 0 [N]\). When \(F_\textrm{e} = 0 [N]\) the force desynchronism occurs because the operator pushes the end effector of the haptic device, and therefore generates a measurement in the sensor.
Next, tracking error index of position \(\textrm{I}_{\textbf{x}}\) and force \(\textrm{I}_{\textbf{f}}\) and time to complete the task \(\mathrm {T_{task}}\) are obtained. Such metrics are calculated as follows:
Remark are summarized in Table 3.
From Table 3 and Figs. 5, 6 and 7, it can be seen that, as usual, system performance degrades as the time delay grows. However, it is observed that even under a high time delay of more than 2 s, the system behaves adequately, keeping the main control variables bounded, which translates into satisfactory performance in terms of stability and transparency.
5 Simulations
In this section, simulation results are presented to compare general aspects of the response of the proposed scheme with the one obtained using the controller from [34], since the latter is a good representative of the controllers were the stability condition is reached through control actions with sign functions.
Some key differences and similarities of our proposal with the most well-known papers of the state of the art are summarized in Table 4. It is important to note that our proposal doesn’t require the use of sign functions or the cancellation of exogenous forces. Its parameters are independent of the elasticity of the medium and it modifies the damping depending on a measurable power/energy signal; this way it avoids the use of observers whose estimation usually degrades with the time delay.
To contrast the results, the response of the system is compared with the case without variable damping. It should be noted that there is no theoretical evidence of the stability of the latter, despite the fact that it performs well in these few simulations.
The controllers are constructed from Eqs. (6) and (7). In the first controller (denoted as Pdf+\(\Delta \)), \(\Delta \) is chosen following the guidelines of Sect. 3.1. For the sake of simplicity, we propose to use the following function
where \( k_{\alpha i} \) are positive constants, which were adjusted empirically: first performing a simulation and measuring the maximum value of the feedback power, then choosing a value high enough to ensure the condition 3.1. For the second controller (denoted as Pdf), \(\Delta =0\) for all t. For the last controller (denoted as Pdf+\(\Gamma \)), the \(\Delta \) term of (6) is replaced with \(\Gamma \) of (39) [34]
For the simulations, the PELICAN robot model presented in [36] and used in [34] is adopted. The simulations are divided into two stages, one of penetration in the medium and another of retreat. The robot starts at a position \(q_0= [45, 135]\) \([^\circ ]\) and the medium is at a distance \(X_0 =[18.83, 7.62]\) [cm]. For the penetration part of the simulation, consider \(A_1 = [33.3, 7.62]\) [cm], which represents a penetration of 15 cm, and \({\textbf{B}}={\textbf{0}}\) (passive medium). For the retraction part, \({\textbf{A}}\) is switched to \({\textbf{A}}_2 = X_0\), which tries to bring the robot to its initial position. Through \({\textbf{A}}_1\) and \({\textbf{A}}_2\), the interaction of the end effector of the robot with the environment occurs mainly in the x axis, keeping the value of y constant.
The parameters of the controllers are set to \(k_2=5\) \([\textrm{kg}/(\textrm{m}^2\cdot \textrm{s})]\), \(k_3=5\) \([\textrm{kg}/\textrm{s}]\), \(k_\textrm{m}=10\) \([\textrm{kg}/\textrm{s}^2]\), \(g_\textrm{h}=8\) \([\textrm{kg}/\textrm{s}^2]\), \(k_\textrm{g}=1\) \([\mathrm{m/m}]\), \(k_\textrm{s}=15\) \([\mathrm{kg/s}^2]\), \(k_\textrm{f}=1\) [N/N], \(k_\textrm{e}=200\), \(k_{\alpha 1}=200\) [kg/s] and \(k_{\alpha 2}=0.1\) \([\textrm{s}^3/(\textrm{kg}\cdot \textrm{m}^2)]\).
The simulation of each control scheme is repeated using three random sinusoidal time delays, with the following maximum values: \(\max (h_\textrm{low})=100\) [ms], \(\max (h_\textrm{medium})=500\) [ms] and \(\max (h_\textrm{high})=1\) [s].
To facilitate the interpretation of the results, \(\mathbf {f_{\textrm{a}_\textrm{h}}} = \mathbf {f_{\textrm{a}_\textrm{e}}} = {\textbf{0}}\) is taken. Conservatively, \(K_\textrm{h} = \alpha _\textrm{e} =\alpha _\textrm{h} = 0\), it should be noted that values greater than 0 would contribute to the stability of the system.
Figure 8 shows that, given the same parameter settings, all schemes behave similarly, especially for cases where the time delay is high. This is a dual coordination of force and position is achieved. However, the scheme based on explicit nonlinear damping (sign functions in the controller) adds chattering.
In Fig. 9, it can be seen that the proposed variable damping only acts in the moments of retraction and rebound of the end effector. Although the magnitude of \(F_\Delta \) increases as the delay increases, its relative magnitude with respect to constant damping gets smaller, and as the speed of movement is also lower, it results in a less evident effect on the total behavior of the system. However, it is observed that the variable damping acts for a short period after retraction (maximum feedback energy) damping the initial effect and subsequently deactivating itself, thus contributing to the signal stabilization time. This behavior can be accentuated by proper parameter settings: tuning \(k_{\alpha 1}\), the time during which the variable damping acts can be adjusted, and through \(k_{\alpha 2}\) its magnitude.
6 Discussions and conclusions
In this paper, a strategy that considers the effect of time delay has been proposed in order to get a stable bilateral teleoperation of a manipulator robot. It is based on injecting linear damping, avoiding the use of nonlinear damping and dependence on the medium properties, which is impractical. The proposed controller can be applied in commercial robots which generally include PID structures.
The stability theoretical analysis together with the performed simulations give us as a result a general law to qualitatively online calibrate the controller to achieve a stable simultaneous synchronization of position and force, where the injected damping mainly depends on the time delay and feedback power that considers the leader velocity and delayed environment force.
The analysis shows that the system achieves bounded force and position errors regardless of the time delay as well as the elasticity of the medium. Although, as usual in bilateral teleoperation, the environment parameters influence the system performance, the study of this phenomenon for this family of controllers is outside the scope of this work. It is important to remark that the proposed scheme avoids the use of discontinuities in the controller and consequent chattering as well as energy observers are not used. Our proposal requires the use of force sensors and the main limitation at theoretical level is that the analysis presented does not allow assure a convergence of the force and position errors in finite time.
The results of this study contribute to the improvement of the trade-off between stability and transparency, rising and extending the potential use of bilateral teleoperation systems to a broader range of applications.
Data availability
No datasets were gererated or analysed during the current study.
References
Sheridan TB (1992) Telerobotics, automation, and human supervisory control. MIT Press, Cambridge
Antonakoglou K, Xu X, Steinbach E, Mahmoodi T, Dohler M (2018) Toward haptic communications over the 5G tactile internet. IEEE Commun Surv Tuts 20(4):3034–3059
Hokayem PF, Spong MW (2006) Bilateral teleoperation: an historical survey. Automatica 42(12):2035–2057
Richard JP (2003) Time-delay systems: an overview of some recent advances and open problems. Automatica 39(10):1667–1694
Sipahi R, Niculescu S-I, Abdallah CT, Michiels W, Gu K (2011) Stability and stabilization of systems with time delay. IEEE Control Syst Mag 31(1):38–65
Slawiñski E, Mut V, Santiago D (2016) Pd-like controller for delayed bilateral teleoperation of wheeled robots. Int J Control 89(8):1622–1631
Farkhatdinov I, Ryu J-H, Poduraev J ( 2008) A feasibility study of time-domain passivity approach for bilateral teleoperation of mobile manipulator. In: Proceedings of the IEEE international conference on control, automation and systems, pp 272– 277
Santiago DD, Slawiñski E, Mut VA (2017) Stable delayed bilateral teleoperation of mobile manipulators. Asian J Control 19(3):1140–1152
Xu G-H, Qi F, Lai Q, Iu HH-C (2020) Fixed time synchronization control for bilateral teleoperation mobile manipulator with nonholonomic constraint and time delay. IEEE Trans Circuits Syst II 67(12):3452–3456
Slawiñski E, Santiago D, Mut V (2017) Control for delayed bilateral teleoperation of a quadrotor. ISA Trans 71:415–425
Lam T, Mulder M, Van Paassen M (2008) Haptic feedback in uninhabited aerial vehicle teleoperation with time delay. J Guid Control Dyn 31(6):1728–1739
Zhao X-W, Han G, Lai Q, Yue D (2020) Multiconsensus of first-order multiagent systems with directed topologies. Mod Phys Lett B 34(23):2050240
Nuno E, Sarras I, Basanez L (2013) Consensus in networks of nonidentical Euler–Lagrange systems using P+d controllers. IEEE Trans Robot 29(6):1503–1508
Nuño E, Basañez L, Ortega R (2011) Passivity-based control for bilateral teleoperation: a tutorial. Automatica 47(3):485–495
Anderson RJ, Spong MW (1989) Bilateral control of teleoperators with time delay. IEEE Trans Autom Control 34(5):494–501
Niemeyer G, Slotline J (1991) Stable adaptive teleoperation. IEEE J Ocean Eng 16(1):152–162
Lee D, Huang K (2010) Passive-set-position-modulation framework for interactive robotic systems. IEEE Trans Robot 26(2):354–369
Nuño E, Ortega R, Barabanov N, Basañez L (2008) A globally stable PD controller for bilateral teleoperators. IEEE Trans Robot 24(3):753–758
Cho HC, Park JH (2005) Stable bilateral teleoperation under a time delay using a robust impedance control. Mechatronics 15(5):611–625
Li Y, Yin Y, Zhang D (2018) Adaptive task-space synchronization control of bilateral teleoperation systems with uncertain parameters and communication delays. IEEE Access 6:5740–5748
Dyck M, Jazayeri A, Tavakoli M (2013) Is the human operator in a teleoperation system passive? In: World haptics conference (WHC). IEEE, Daejeon, pp 683–688
Hua C-C, Liu XP (2010) Delay-dependent stability criteria of teleoperation systems with asymmetric time-varying delays. IEEE Trans Robot 26(5):925–932
Islam S, Liu XP, El Saddik A (2013) Teleoperation systems with symmetric and unsymmetric time varying communication delay. IEEE Trans Instrum Meas 62(11):2943–2953
Li H, Zhang L, Kawashima K (2018) Operator dynamics for stability condition in haptic and teleoperation system: a survey. Int J Med Robot Comput Assist Surg 14(2):1881
Lawrence DA (1993) Stability and transparency in bilateral teleoperation. IEEE Trans Robot Autom 9(5):624–637
Slawiñski E, Mut VA, Fiorini P, Salinas LR (2012) Quantitative absolute transparency for bilateral teleoperation of mobile robots. IEEE Trans Syst Man Cybern A 42(2):430–442
Shahbazi M, Atashzar SF, Tavakoli M, Patel RV (2018) Position-force domain passivity of the human arm in telerobotic systems. IEEE/ASME Trans Mechatron 23(2):552–562
Singh H, Panzirsch M, Coelho A, Ott C (2020) Proxy-based approach for position synchronization of delayed robot coupling without sacrificing performance. IEEE Robot Autom Lett 5(4):6599–6606
Chawda V, O’Malley MK (2014) Position synchronization in bilateral teleoperation under time-varying communication delays. IEEE/ASME Trans Mechatron 20(1):245–253
Panzirsch M, Pereira A, Singh H, Weber B, Ferreira E, Gherghescu A, Hann L, Exter E, Hulst F, Gerdes L et al (2022) Exploring planet geology through force-feedback telemanipulation from orbit. Sci Robot 7(65):6307
Ryu J-H, Artigas J, Preusche C (2010) A passive bilateral control scheme for a teleoperator with time-varying communication delay. Mechatronics 20(7):812–823
Franken M, Stramigioli S, Misra S, Secchi C, Macchelli A (2011) Bilateral telemanipulation with time delays: a two-layer approach combining passivity and transparency. IEEE Trans Robot 27(4):741–756
Hashemzadeh F, Tavakoli M (2015) Position and force tracking in nonlinear teleoperation systems under varying delays. Robotica 33(4):1003–1016
Mohammadi K, Talebi HA, Zareinejad M (2016) A novel position and force coordination approach in four channel nonlinear teleoperation. Comput Electr Eng 56:688–699
Guajardo-Benavides EJ, Arteaga MA (2023) On the finite time force estimation for bilateral teleoperation of robot manipulators with time varying delays. Control Eng Pract 138:105622
Kelly R, Davila VS, Perez JAL (2006) Control of robot manipulators in joint space. Springer, London
Slawiñski E, Moya V, Santiago D, Mut V (2019) Force and position-velocity coordination for delayed bilateral teleoperation of a mobile robot. Robotica 37(10):1768–1784
Hou M, Duan G, Guo M (2010) New versions of Barbalat’s lemma with applications. J Control Theory Appl 8(4):545–547
Hua C, Liu PX (2009) Delay-dependent stability analysis of teleoperation systems with unsymmetric time-varying delays. In: Proceedings of IEEE international conference on robotics and automation, pp 1146–1151
Estrada E, Yu W, Li X (2019) Stability and transparency of delayed bilateral teleoperation with haptic feedback. Int J Appl Math Comput Sci 29(4):681–692
Funding
This work was supported by the National Scientific and Technical Research Council of Argentina (CONICET).
Author information
Authors and Affiliations
Contributions
DDS, ES, LRS and VAM contributed to conceptualization; DDS, ES and LRS provided methodology; DDS and ES performed formal analysis, investigation, and writing—original draft preparation; ES and LRS performed writing—review and editing; ES and VAM contributed to funding acquisition, resources, and supervision.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Appendix A
Appendix A
From the functional \(V( \mathbf {x_\textrm{m}},\dot{{\textbf{x}}}_{\textbf{m}},k_\textrm{g}\mathbf {x_\textrm{m}}-\mathbf {x_\textrm{s}},\mathbf {x_\textrm{s}},\dot{{\textbf{x}}}_{\textbf{s}}) =V_1+V_2+V_3+V_4\) represented by (14), (15), (16) and (17), remembering that \({\textbf{e}}=k_\textrm{g}\mathbf {x_\textrm{m}}-\mathbf {x_\textrm{s}}\); its derivative along the system trajectories will be obtained.
First, the time derivative of (14), including the leader robot dynamics (1)), and Properties 1 and 2, is accomplished as:
Next, if the control action \(\mathbf {f_\textrm{m}}\) (6), including the delayed environment force \(\mathbf {f_\textrm{e}}\) (3) with \(\mathbf {B=0}\), and the human force \(\mathbf {f_\textrm{h}}\) (4), considering \(\mathbf {A=0}\), are included in (A.1), and then adding and grouping terms by convenience, it yields:
Now, \(\dot{V}_2\) is obtained from (15):
Besides, \(\dot{V}_3\) from (16) including the follower dynamics (2) as well as the control action \(\mathbf {f_\textrm{s}}\) (7) is given by:
If the environment force \(\mathbf {f_\textrm{e}}\) (3) and the human force \(\mathbf {f_\textrm{h}}\) (4) are inserted into (A.4), and on the result the terms are rearranged, the below expression is reached:
Stability analysis is hard to complete because there are terms with delayed variables in (A.2) and (A.5). To solve this, \(V_4\) (17) has been proposed, whose derivative considering Assumption 1 is obtained by:
The integral terms in (A.6) can be linked with others of (A.2) and (A.5) using Lemma 2 (5), yielding the following relations:
Finally, \(\dot{V}\) can be built joining equations (A.1)–(A.6), considering the relations given by (A.7)–(A.9) to avoid terms with integrals, knowing that the negative last term of (A.6) always overcome the last term of (A.5) according to Assumption 1, and defining constants \(\kappa _i\) as \(\kappa _1=k_\textrm{g} g_\textrm{h}+k_\textrm{h}\), \(\kappa _2=(k_\textrm{m}-g_\textrm{h})/k_\textrm{g}\), \(\kappa _3=\kappa _4(g_\textrm{h}/k_\textrm{f}+k_\textrm{e})\) and \(\kappa _4=\kappa _2/k_\textrm{s}\) in order to cancel all opposing terms. The final relation obtained to represent \(\dot{V}\) is given by:
The result achieved in (A.10) is used to complete the stability and convergence analysis of force and position signals of the leader–follower teleoperation system.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Santiago, D.D., Slawiñski, E., Salinas, L.R. et al. Force and position coordination for delayed bilateral teleoperation of a manipulator robot. Int. J. Dynam. Control 12, 1679–1693 (2024). https://doi.org/10.1007/s40435-023-01298-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40435-023-01298-z