Keywords

1 Introduction

This chapter discusses a novel approach aimed at determining the relative attitude and position of objects. Such metrological scenarios frequently arise in applications that include robotics (collaborative robots, robotic arm control), military applications, and aeronautics (target configuration). One of the significant aeronautical applications is the autonomous landing control system for aircraft. This application is relevant, e.g., in radar-silence conditions, for the landing of ultralight aircraft, for drones in tracks without a control tower, and for aircraft carriers. Figure 1 illustrates the landing of an aircraft on an aircraft carrier, indicating the optical head (OH), and the target.

Fig. 1
A photograph of an aircraft carrier runway with a super imposed illustration of an inflight U A V on a landing approach towards it. 3 lines connect the O H at the front of the aircraft to 3 poles at the end of the runway labeled target.

Aircraft landing on an aircraft carrier

This metrological aspect is becoming increasingly important also in space engineering: consult Bresciani [1]. To accomplish the task, the technology has to provide quick and reliable measurements of the attitude and position of a spacecraft with respect to the other(s). Several examples of real-world space engineering applications entailing this advanced capability are briefly reviewed next.

The vision system of the STRONG satellite of the European Space Agency (ESA) represents a significant example. It has the scope of determining the line of sight and the range of the relative position between two or more satellites. A second example of interest concerns PROBA-3, ESA’s first precision formation flying mission. A pair of satellites fly together maintaining a fixed configuration as a “large rigid structure” in space to investigate formation-flying technologies. Conceptually, the PROBA-3 mission consists of two independent, three-axis stabilized mini satellites flying in a formation with relative position control accuracy of less than 1 mm, see Fig. 2. The two mini satellites are referred to as Coronograph SpaceCraft (CSC) with metric dimensions of 1.1 × 1.8 × 1.7 m3, and Occulter SpaceCraft (OSC) with dimensions of 0.9 × 1.4 × 0.9 m3. The paired satellites jointly form a 150-meter-long solar coronagraph to study the Sun’s faint corona more closely to the solar rim than has ever been achieved before.

Fig. 2
A 3D illustration of 2 rectangular satellites with thermal insulation in space. The PROBA 3 C S C satellite on the left has a big circular opening in the bottom and a smaller circle on top, with 4 rectangular solar panels on the side. The O S C on the right has a circular disk on one side and a cross shaped solar panel on the other side, with a small circular opening at the bottom.

PROBA-3 CSC (left) and OSC (right). (Credits: ESA)

A further noteworthy example is the Magsat program, a joint project by the National Aeronautics and Space Administration (NASA) and the United States Geological Survey (USGS) to measure near-earth magnetic fields on a global basis. The spacecraft consisted of two distinct parts, see Fig. 3: the instrument module that contains a vector and a scalar magnetometer, in addition to its unique basis apparatus; and the base module that contains the data-handling, power, communication, command, and attitude-control subsystems to support the instrument module. The magnetometers were deployed after launch, using a deployable scissor boom, to a position of 6 m behind the spacecraft. At this distance, the influence of magnetic materials from the instrument and base module was less than 1 nanotesla. The directional accuracy of the vector magnetometer was required to be of 20 arc seconds in all three axes (coordinates). The related error allocation analysis determined that the attitude control subsystem was required to yield an accuracy of 7 arc seconds per axis to meet the overall system requirements. The performance goal was set to 1 arc second for pitch and yaw, and 5 arc seconds for twist (roll). The pitch-yaw acquisition range required was ±0.5° as a goal, with a precision range over ±3 arc minutes. The twist system required a precision range of ±0.08° over the pitch-yaw precision range. Moreover, the dynamic range in the magnetometer platform displacement motion was set to ±0.25° in any transverse direction.

Fig. 3
A schematic diagram of the Mag sat orbital configuration. The rectangular base module on the right has 4 solar cell arrays deployed on each side with small doppler antennas on their outer edge. A thin adjustable aerodynamic trim boom is on the top. An instrument electronics and altitude transfer optical system, an instrument module, and star cameras for altitude determination are at the bottom. A scissors-type boom that has a precision sun sensor, dihedral mirror, scalar, and vector magnetometers, extends toward the left. The curvature of the earth's limb is in the background. The Sun's direction is upwards and the flight direction is toward the right.

Magsat orbital configuration. (Credits: NASA)

Another interesting application is represented by HAYABUSA-2, an asteroid exploration mission by the Japanese Space Exploration Agency (JAXA), aimed at studying Asteroid 1999 JU3. The spacecraft deploys the MASCOT lander built in Europe for an in situ study of surface composition and properties, see Fig. 4. The landing site is selected prior to the MASCOT deployment by evaluating the global asteroid map, but restricted to the illuminated asteroid side. During a sampling “dress rehearsal” maneuver, the main spacecraft descends from its home position (HP) to an altitude of approximately 100 m and deploys the lander by initializing a proper delta-vv) through the separation mechanism. The lander then freefalls to the surface, while HAYABUSA-2 ascends back to its HP located at a distance of 15 km. During the first phase, after being released, MASCOT reaches the asteroid. Here, the metrological aspects are particularly demanding due to the requested accuracy. Similar scenarios occur, e.g., in active debris removal, object rendezvous, and formation-flying.

Fig. 4
An illustration of the phases of the HAYABUSA 2 mission for a noon landing. The rectangular spacecraft with 2 solar panels extended on either side starts from the home position at 15 kilometers from the comet. Within a few hours, the HAYABUSA 2 descends and deploys target markers prior to MASCOT. The MASCOT is deployed at an altitude of 100 meters, and it takes 20 through 30 minutes to descend, while the HAYABUSA 2 ascends to H P. The directions of the Earth and the Sun are upwards.

HAYABUSA-2 deployment phase (left) and MASCOT separation (right). (Credits: JAXA)

The OCSD (Optical Communications and Sensor Demonstration) program is the first in a new series of six NASA-managed demonstration missions adopting CubeSat to test technologies that enable new uses of these miniature satellites, measuring approximately 10 × 10 × 10 cm3. The first mission is a technical demonstration of the various systems testing the laser communication terminal. The next two satellites will be deployed as a pair to demonstrate the ability to maneuver small spacecraft in close proximity (approximately 200 m) to one another using low-cost sensors and a novel water propulsion, see Fig. 5. This technology can enhance the ability of small spacecraft to work in coordination with other satellites to explore asteroids, planets, and moons, as well as inspecting other spacecraft.

Fig. 5
A 3D illustration depicts the proximity operations of 2 Cube Sats. The cube sat 1 on the right has a small circular sun sensor on the top, with an imager and an optical flow sensor on the bottom. A conical laser beam within a bigger radar beam emanates from the bottom of cube sat 1 and reaches cube sat 2 below it.

Two CubeSats in close proximity operations. (Credits: Aerospace Corp)

The Athena (Advanced Telescope for High-ENergy Astrophysics) mission of ESA, see https://sci.esa.int/web/athena, is selected in this chapter as a specific case study for the innovative metrological approach proposed. Athena is based on an X-ray telescope (see Fig. 6) designed to address the Cosmic Vision science theme “The Hot and Energetic Universe.” This satellite will utilize a telescope with a 12-m focal length and two primary instruments: the high-resolution X-ray Integral Field Unit (X-IFU) and the Wide Field Imager (WFI) featuring a moderate resolution and a large field of view. The high-precision measurement of the telescope line of sight (LoS) internal misalignments is crucial. The on-board metrological instrument is expected to be placed as close as possible to the node and focal plane center of the telescope mirror to detect at a very high precision the errors in the LoS vector, due to inevitable thermal distortions.

Fig. 6
A 3D illustration of the Athena telescope. A large elongated conical telescope has a small rectangular block on the rear, a hood in the front, and 2 long solar panels extended on its side. An illustration of a person is next to it for scale.

Athena telescope. (Credits: ESA Concurrent Design Facility)

Metrological problems of the type illustrated above are usually tackled by adopting very complex systems that often use different technologies in parallel: for optical metrology, consult, e.g., Tyson [13] and Yoshizawa [14]. The most frequently used metrological solutions considered in space applications include the Universal Lateral and Longitudinal Integrated Sensor (ULLIS), Hexa-Dimensional Optical Metrology (HDOM), ATV Videometer, and Rendezvous Laser Vision System (RELAVIS).

  • The Universal Lateral and Longitudinal Integrated Sensor (ULLIS) is a sensor composed of a detector, an optical system, and an electronic unit on one side, in addition to a set of retroreflectors on the other side. This device has the capability to measure both the lateral and longitudinal positions of one side with respect to the other.

  • The Hexa-Dimensional Optical Metrology (HDOM) is a metrological system aimed at providing the 3D-position and 3D-attitude of one side with respect to the reference frame of another side.

  • The ATV Videometer is a metrological system based on visual techniques utilized to support the docking of the Automated Transfer Vehicle (ATV, ESA) to the International Space Station (ISS).

  • The Rendezvous Laser Vision System (RELAVIS) is a system designed to support autonomous space operations. RELAVIS provides accurate detection, tracking, and estimation of spacecraft for rendezvous-docking operations, satellite inspection, and servicing operations.

A promising alternative approach is based on the innovative bifocal concept, see Bresciani and Musso [2,3,4]. This system is characterized by a double optical train, which yields information about position and attitude of an object with respect to another, for all six degrees of freedom. It covers a variable range of possible distances, i.e., from ten meters to tens of kilometers, depending on the application and measurement purpose. In the specific case of space engineering (the application field considered in this chapter), the bifocal metrological systems can provide precise information about the mutual position and attitude of two spacecraft in a short, medium, and far range involving only one sensor and covering either fine or coarse accuracies. In fact, the bifocal metrology is the first optical projective system able to tackle six degrees of freedom metrological scenarios that adopts a single sensor. A simple analytical algorithm solves the related six degrees of freedom problem without the need for high computing power. The optical system is very simple and compact: in addition to the light target, composed by only three Light Emitted Diodes (LEDs), has no particular complexity in terms of volume accommodation, technological developments, space qualifications, and on-ground characterization. All the components of this metrological system can be selected among the already space-qualified items.

These key aspects of the bifocal metrology with respect to the existing metrologies make the new approach suitable for a wide range of space applications: rendezvous and docking, co-orbiting satellites, large space instruments based on formation flying technology, extendable structures, and CubeSat networks.

The remainder of this chapter is structured as follows. Section 2 presents the bifocal system concept. Section 3 is dedicated to the mathematical aspects, concerning the projected image generation, the inverse problem of reconstructing the target position and attitude by the projected image measurement, the analytical solution, and the relative error analysis. Section 4 discusses two optimization problems regarding the bifocal system sizing and the light spot shaping, respectively. A real-world application is illustrated in Sect. 5, and conclusions are presented in Sect.6.

2 Bifocal System Concept

The bifocal system discussed here consists of the following components:

  • A light target consisting of three LEDs placed at the vertices of an isosceles triangle.

  • A two-channel optical head (each channel with its own focal length) that focuses the light target image on an image sensor.

The working principle of the system is shown in Fig. 7, when two spacecrafts denoted by S1 and S2 are considered. The optical head consisting of the optical trains OT1 and OT2 is installed on S1, and the three LEDs are positioned on S2.

Fig. 7
A schematic diagram of the bifocal system between 2 satellites S 2 and S1. In scenario 1, S 2 is far from S 1, and O T 1 works. 2 sets of light beams from 3 light sources in S 2 are incident on the optical head of S 1. 1 set of beams enters the O T 1 with convex and square lenses, and another enters the lens of O T 2 through reflective mirrors. All the beams are focused on a square C C D. In scenario 2, below the S 2 is close to the S 1, where O T 2 works.

Bifocal system: basic concept

The joint presence of two focal lengths allows using the device as a single focal metrology system with different operative ranges (coarse metrology) or as a metrology system with a better accuracy than a classical projection metrology system (fine metrology). When the spacecrafts are either far or close, a coarse measurement can be performed. When both target images are projected on the image sensor (charged coupled device, CCD) a refined measurement can be carried out: see Fig. 8.

Fig. 8
An illustration for C C D target image projection. A rectangle that represents the C C D has 2 sets of 3 dots arranged inside it, in a concentric triangular formation. The outer dots are generated by O T 1, while the inner dots are generated by O T 2.

CCD target image projection

Considering the target light spots (three for each optical channel) on the projected image plane, through a simple mathematical algorithm requiring little computing power, it is possible to determine the position and attitude accurately (considering six degrees of freedom, including the roll, pitch, and yaw angles), with respect to the optical head.

3 Mathematical Model

In this section, the analytical computation of position and attitude of a generic target with respect to all the six degrees of freedom using a bifocal optical system is described: for further details, consult Bresciani and Musso [2,3,4].

3.1 3D-Transformation Matrix

Consider an orthogonal right-handed 3D-coordinate system with origin O and axes x, y, and z. Angular rotations around the axes x, y, and z can be represented by the following rotation matrices, see, e.g., Ghali [6]:

$$ {R}_x\left(\theta \right)=\left[\begin{array}{c}\begin{array}{cc}1& \kern0.5em 0\\ {}0& \kern0.75em \cos \theta \end{array}\kern1em \begin{array}{cc}0&\ 0\\ {}\sin \theta &\ 0\end{array}\\ {}\begin{array}{cc}0& -\sin \theta \\ {}0& 0\end{array}\kern1em \begin{array}{cc}\cos \theta & 0\\ {}0& 1\end{array}\end{array}\right], $$
(1-1)
$$ {R}_y\left(\beta \right)=\left[\begin{array}{c}\begin{array}{cc}\cos \beta & 0\\ {}0& 1\end{array}\kern1.25em \begin{array}{cc}-\sin \beta & \kern0.5em 0\\ {}0& \kern0.5em 0\end{array}\\ {}\begin{array}{cc}\sin \beta & 0\\ {}0& 0\end{array}\kern1.5em \begin{array}{cc}\cos \beta & \kern1em 0\\ {}0& \kern1em 1\end{array}\end{array}\right], $$
(1-2)
$$ {R}_z\left(\gamma \right)=\left[\begin{array}{c}\begin{array}{cc}\cos \gamma & \sin \gamma \\ {}-\sin \gamma & \cos \gamma \end{array}\kern1.25em \begin{array}{cc}0& \kern0.5em 0\\ {}0& \kern0.5em 0\end{array}\\ {}\begin{array}{cc}\kern1.75em 0& \kern1.25em 0\\ {}\kern1.75em 0& \kern1.25em 0\end{array}\kern1em \begin{array}{cc}\kern1.5em 1& \kern0.5em 0\\ {}\kern1.5em 0& \kern0.5em 1\end{array}\ \end{array}\right], $$
(1-3)

where θ, β, and γ are the rotation angles with respect to x, y, and z.

The 3D-translation matrix is defined by

$$ {T}_{xyz}\left(\Delta x,\Delta y,\Delta z\right)=\left[\begin{array}{c}\begin{array}{cc}\ 1\kern1.25em & 0\\ {}\ 0\kern1.25em & 1\end{array}\kern1.25em \begin{array}{cc}0& \kern0.5em 0\\ {}0& \kern0.5em 0\end{array}\\ {}\begin{array}{cc}0& \kern1em 0\\ {}\Delta x& \Delta y\end{array}\kern1em \begin{array}{cc}1&\ 0\\ {}\Delta z& 1\end{array}\ \end{array}\right], $$
(2)

where Δx, Δy, and Δz are the displacements along the axes x, y, and z, respectively.

Rotations and translations are applied to a point with coordinates (X, Y, Z) by the following transformation:

$$ {\displaystyle \begin{array}{c}\left[{X}^{\prime },{Y}^{\prime },{Z}^{\prime },1\right]=\left[X,Y,Z,1\right]{R}_x\left(\theta \right){R}_y\left(\beta \right){R}_z\left(\gamma \right){T}_{xyz}\left(\Delta x,\Delta y,\Delta z\right)\\ {} = \left[X,Y,Z,1\right]M\left(\theta, \beta, \gamma, \Delta x,\Delta y,\Delta z\right)\end{array}} $$
(3)

Here (X, Y, Z) is the original point, and (X, Y, Z) is the point after the roto-translation shown above. The matrix M(θ, β, γ, ∆x, ∆y, ∆z) = Rx(θRy(βRz(γTxyz (∆x, ∆y, ∆z) can be expressed as

$$ M\left(\theta, \beta, \gamma, \Delta x,\Delta y,\Delta z\right)=\left[\begin{array}{c}\begin{array}{c@{\quad}c@{\quad}c@{\quad}c}\cos \gamma \cos \beta & \sin \gamma \cos \beta & -\sin \beta & 0 \\\sin \theta \sin \beta \cos \gamma -\sin \gamma \cos \theta & \sin \gamma \sin \beta \sin \theta +\cos \gamma \cos \theta& \sin \theta \cos \beta & 0\\\cos \theta \sin \beta \cos \gamma -\sin \gamma \sin \theta & \sin \gamma \sin \beta \cos \theta -\cos \gamma \sin \theta&\cos \theta \cos \beta &0 \\\Delta x& \Delta y& \Delta z& 1\\\end{array} \end{array}\right] $$
(4)

Hence, the explicit form of Eq. (3) is

$$ \left[{X}^{\prime },{Y}^{\prime },{Z}^{\prime },1\right]=\left[X,Y,Z,1\right]\left[\begin{array}{c}\begin{array}{c@{\quad}c@{\quad}c@{\quad}c}\textrm{cos}\upgamma \textrm{cos}\beta & \sin \gamma \cos \beta & -\sin \beta & 0\\\sin \theta \sin \beta \cos \gamma -\sin \gamma \cos \theta & \sin \gamma \sin \beta \sin \theta +\cos \gamma \cos \theta& \sin \theta \cos \beta & 0\\\cos \theta \sin \beta \cos \gamma -\sin \gamma \sin \theta & \sin \gamma \sin \beta \cos \theta -\cos \gamma \sin \theta &\cos \theta \cos \beta & 0\\\Delta x& \Delta y &\Delta z& 1 \\\end{array} \end{array}\right] $$
(5)

The transformation shown above is used as a basic underlying concept for the optical application discussed in this chapter. Prior to introducing the bifocal approach, the mono-focal case is outlined, consisting of a simple optical projective system. In this scenario, the relationship between the object P and its projected image Pp is illustrated in Fig. 9.

Fig. 9
A lens diagram for the mono-focal projective system with a convex lens at the center. An object P is above the principal axis on the left at a distance of D from the lens. 2 beams from the object enter the lens, one parallel to the axis and 1 through the optical center. They are focused by the lens and form an image P subscript P below the axis at a length of l to the right of the lens.

Mono-focal projective system

In Fig. 9, D is the horizontal distance between the target point P and the lens utilized, while l is the horizontal distance between the lens center and the projection Pp. The distance between the lens focus and the projection is denoted by f. The axes x and y of the right-handed reference frame (x, y, z) considered here lie on the projection plane (y is the vertical axis); the axis z (oriented toward the target point P) is parallel to D and l. The following expression holds:

$$ {P}_{\textrm{p}}\left({X}_{\textrm{P}},{Y}_{\textrm{P}},{Z}_{\textrm{P}}\right)={P}_{\textrm{p}}\left(\frac{lX}{D},\frac{lY}{D},0\right) $$
(6)

where (X, Y, Z) are the coordinates of the target point P.

Considering the relation (6), Eq. (5) links the image point Pp(XP, YP, ZP) and the corresponding Pp(XP, YP, ZP) obtained by the rotations θ, β, and γ and the displacements Δx, Δy, and Δz along the axes. The following equation holds:

$$ \left[{X}_{\textrm{P}}^{\prime },{Y}_{\textrm{P}}^{\prime },0,\frac{l}{D}\right]=\frac{l}{D}\left[X,Y,0,1\right]\left[\begin{array}{c}\begin{array}{c@{\quad}c@{\quad}c@{\quad}c}\cos \gamma \cos \beta & \sin \gamma \cos \beta& -\sin \beta & 0\\\sin \theta \sin \beta \cos \gamma -\sin \gamma \cos \theta & \sin \gamma \sin \beta \sin \theta +\cos \gamma \cos \theta &\sin \theta \cos \beta & 0\\\cos \theta \sin \beta \cos \gamma -\sin \gamma \sin \theta & \sin \gamma \sin \beta \cos \theta -\cos \gamma \sin \theta &\cos \theta \cos \beta & 0\\\Delta x& \Delta y&\Delta z& 1\end{array}\ \end{array}\right]. $$
(7)

3.2 Bifocal Problem Statement

The bifocal system discussed in this chapter is depicted schematically by Fig. 10. Here A is the target and B details the positions of the three light sources P1(X1, Y1, Z1), P2(X2, Y2, Z2), P3(X3, Y3, Z3) selected as the target. C represents the bifocal optical system as a whole (optical head), consisting of two equal-size lenses and four mirrors. D and E are the perpendicular and parallel views, respectively, of the image plane.

Fig. 10
A schematic diagram of the bifocal system. The rectangular C C D with 2 sets of 3 concentric triangular dots is on the top. The bottom has the side view of the optical head with 2 convex mirrors on top and bottom with a square mirror on either side of each lens, at a distance of d 1 and d 2 respectively.

Schematic representation of a bifocal system

The bifocal system is, in fact, a combination of two mono-focal systems involving lenses 1 and 2, with l1 and l2 as the corresponding internal optical lengths. The two generated images (colored in blue and red for lenses 1 and 2) have the magnitudes M(f1) and M(f2), depending on the focal distances f1 and f2, respectively. The two associated sets of projected points, generated by the target three (light-)points for lenses 1 and 2 are denoted by P1p1(X1p1, Y1p1, Z1p1), P2p1(X2p1, Y2p1, Z2p1), P3p1(X3p1, Y3p1, Z3p1), and P1p2(X1p2, Y1p2, Z1p2), P2p2(X2p2, Y2p2, Z2p2), P3p2(X3p2, Y3p2, Z3p2).

The general bifocal problem consists of determining the position and attitude of a target (identified by three light sources) with respect to a given initial condition, by measuring the positions of the corresponding projected six points generated by the bifocal system. Specifically, it is assumed that the initial positions of the three target (light-)points P1(X1, Y1, Z1), P2(X2, Y2, Z2), P3(X3, Y3, Z3) are known with respect to the assigned reference frame (associated with the bifocal optical head), as well as their projected images P1p1(X1p1, Y1p1, Z1p1), P2p1(X2p1, Y2p1, Z2p1), P3p1(X3p1, Y3p1, Z3p1), and P1p2(X1p2, Y1p2, Z1p2), P2p2(X2p2, Y2p2, Z2p2), P3p2(X3p2, Y3p2, Z3p2), generated by lenses 1 and 2. After rotations θ, β, and γ of the target (around the axes x, y, and z, respectively), and displacements Δx, Δy, and Δz of the target along the axes x, y, and z, the projected points P1p1(X1p1, Y1p1, Z1p1), P2p1(X2p1, Y2p1, Z2p1), P3p1(X3p1, Y3p1, Z3p1), generated by lens 1, are correlated with the corresponding initial projected points P1p1(X1p1, Y1p1, Z1p1), P2p1(X2p1, Y2p1, Z2p1), P3p1(X3p1, Y3p1, Z3p1), generated by lens 1, by three equations having the general form of Eq. (5). Similarly, three equations of the general form of Eq. (5) relate the new projected points P1p2(X1p2, Y1p2, Z1p2), P2p2(X2p2, Y2p2, Z2p2), P3p2(X3p2, Y3p2, Z3p2), generated by lens 2, with the corresponding initial projected images P1p1(X1p1, Y1p1, Z1p1), P2p1(X2p1, Y2p1, Z2p1), P3p1(X3p1, Y3p1, Z3p1) generated by lens 2.

In general, a roto-translation of the target θ, β, γ, Δx, Δy, and Δz (with respect to the known initial condition) can be expressed by a system of six equations of the form of Eq. (5), each corresponding to one of the projected points P1p1(X1p1, Y1p1, Z1p1), P2p1(X2p1, Y2p1, Z2p1), P3p1(X3p1, Y3p1, Z3p1), P1p2(X1p2, Y1p2, Z1p2), P2p2(X2p2, Y2p2, Z2p2), and P3p2(X3p2, Y3p2, Z3p2). The bifocal problem consists of inverting this system of equations so that the new configuration of the target (with respect to the initial condition), expressed by θ, β, γ, Δx, Δy, and Δz can be derived by the direct measurements of the projected points shown above: here we assume that P1p1(X1p1, Y1p1, Z1p1), P2p1(X2p1, Y2p1, Z2p1), and P3p1(X3p1, Y3p1, Z3p1) are known.

3.3 Mathematical Solution

In order to solve the inverted system of equations introduced in Sect. 3.2, as per the initial conditions, the three target light-points are specified with respect to the optical head reference frame as follows:

$$ {\displaystyle \begin{array}{c}{P}_1\left({X}_1,{Y}_1,{Z}_1\right)=\left({X}_0,0,0\right),\\ {}{P}_2\left({X}_2,{Y}_2,{Z}_2\right)=\left(\hbox{--} {X}_0,0,0\right),\\ {}{P}_3\left({X}_3,{Y}_3,{Z}_3\right)=\left(0,{Y}_0,0\right).\end{array}} $$

As shown above, P1 and P2 are placed (symmetrically) along the x-axis at a distance X0 from the origin, while P3 is on the y-axis at a distance Y0 from the origin. Since P1 and P2 are not affected by rotations around the x axis, Eq. (5) can be applied to both corresponding projected points, obtained either by lens 1 or 2, by adopting the simplifying condition θ = 0. This yields the following specific equations for the projections P1p1 and P2p1:

$$ \left[{X}_{1\textrm{p}1}^{\prime },{Y}_{1\textrm{p}1}^{\prime },0,\frac{{\textrm{l}}_1}{D+{d}_1}\right]=\frac{l_1}{D+{d}_1}\left[{X}_0,0,0,1\right]\left[\arraycolsep5pt \begin{array}{cccc} \cos \gamma\,\cos \beta & \sin \gamma\,\cos \beta & -\sin \beta & 0\\ {}-\sin \gamma & \cos \gamma & 0& 0\\ {}\sin \beta\,\cos \gamma & \sin \gamma\,\sin \beta & \cos \beta & 0\\ {}\Delta x& \Delta y& \Delta z& 1\end{array}\right], $$
(8-1)
$$ \left[{X}_{2\textrm{p}1}^{\prime },{Y}_{2\textrm{p}1}^{\prime },0,\frac{l_1}{D+{d}_1}\right]=\frac{l_1}{D+{d}_1}\left[-{X}_0,0,0,1\right]\left[\arraycolsep3.8pt \begin{array}{cccc}\cos \gamma\,\cos \beta & \sin \gamma\,\cos \beta & -\sin \beta \kern0.5em & 0\\ {}-\sin \gamma & \cos \gamma & 0& 0\\ {}\sin \beta\,\cos \gamma & \sin \gamma\,\sin \beta & \cos \beta & 0\\ {}\Delta x& \Delta y& \Delta z& 1\end{array}\right], $$
(8-2)

where D + d1 is the working distance of lens 1. After algebraic manipulations that are omitted, the following equations are obtained:

$$ \Delta {X}_{12\textrm{p}1}^{\prime }=\frac{2{l}_1}{D+{d}_1}{X}_0\cos \gamma \cos \beta, $$
(9-1)
$$ \Delta {Y}_{12\textrm{p}1}^{\prime }=\frac{2{l}_1}{D+{d}_1}{X}_0\sin \gamma \cos \beta, $$
(9-2)

where ΔX12p1 ≡ X1p1 − X2p1 and ΔY12p1 ≡ Y1p1 − Y2p1. Further technical details concerning the mathematical formulation outlined here are presented by Bresciani and Musso [2,3,4].

Operating exactly the same way as lens 1, for lens 2 analogous equations are obtained:

$$ \Delta {X}_{12\textrm{p}2}^{\prime }=\frac{2{l}_2}{D+{d}_2}{X}_0\cos \gamma \cos \beta, $$
(10-1)
$$ \Delta {Y}_{12\textrm{p}2}^{\prime }=\frac{2{l}_2}{D+{d}_2}{X}_0\sin \gamma \cos \beta, $$
(10-2)

where ΔX12p2 = X1p2 − X2p2 and ΔY12p2 = Y1p2 − Y2p2.

Combining (9-1), (9-2), (10-1), and (10-2) yields the following relations:

$$ \gamma =a\tan \left(\frac{\Delta {Y}_{12\textrm{p}1}^{\prime }}{\Delta {X}_{12\textrm{p}1}^{\prime }}\right), $$
(11)
$$ D=\frac{l_1{d}_2\Delta {X}_{12\textrm{p}2}^{\prime }-{l}_2{d}_1\Delta {X}_{12\textrm{p}1}^{\prime }}{l_1\Delta {X}_{12\textrm{p}1}^{\prime }-{l}_2\Delta {X}_{12\textrm{p}2}^{\prime }}, $$
(12)
$$ \beta =a\cos \left(\frac{\Delta {X}_{12\textrm{p}1}^{\prime}\left(D+{d}_1\right)}{2{l}_1{X}_0\cos \left(a\tan \left(\frac{\Delta Y{\prime}_{12\textrm{p}1}}{\Delta X{\prime}_{12\textrm{p}1}}\right)\right)}\right). $$
(13)

Additionally, by (8-1), (8-2), (11), (12), and (13), the displacements Δx and Δy can be expressed, respectively, as follows:

$$ \Delta x={X}_{1\textrm{p}1}^{\prime}\frac{D+{d}_1}{l_1}-{X}_0\cos \gamma \cos \beta $$
(14)
$$ \Delta y={Y}_{1\textrm{p}1}^{\prime}\frac{D+{d}_1}{l_1}-{Y}_0\cos \gamma \cos \beta $$
(15)

Applying Eq. (5) to the projected point P3p1 (generated by lens 1), the following equations hold:

$$ {X}_{3\textrm{p}1}^{\prime }=\frac{l_1{Y}_0}{D+{d}_1}\left(\sin \vartheta \sin \beta \cos \gamma -\sin \gamma \cos \vartheta +\frac{\Delta x}{Y_0}\right), $$
(16-1)
$$ {Y}_{3\textrm{p}1}^{\prime }=\frac{l_1{Y}_0}{D+{d}_1}\left(\sin \vartheta \sin \beta \sin \gamma +\cos \gamma \cos \vartheta +\frac{\Delta y}{Y_0}\right). $$
(16-2)

Finally, ϑ can be derived by (16-1) and (16-2), giving rise to the following expressions:

$$ \vartheta =a\sin \left(\frac{B}{\sin \beta}\right), $$
(17)
$$ B=\frac{\left({X}_{3\textrm{p}1}^{\prime }{Y}_0-\Delta xQ\right)\cos \gamma +\left({Y}_{3\textrm{p}1}^{\prime }{Y}_0-\Delta yQ\right)\sin \gamma }{Q{Y}_0}, $$
$$ Q=\frac{l_1{Y}_0}{D+{d}_1}. $$

To summarize the above discussion, the solutions obtained in this section for the inverted system introduced in Sect. 3.2 are represented by relations (11), (12), (13), (14), (15), and (17). The following obvious conditions are postulated:

$$ \gamma \in \left]-\frac{\pi }{2},\frac{\pi }{2}\right[\kern1.75em \Delta {X}_{12\textrm{p}1}\ne 0, $$
(18)
$$ {l}_1\Delta {X}_{12\textrm{P}1}^{\prime }-{l}_2\Delta {X}_{12\textrm{p}2}^{\prime}\ne 0\kern1em \left(\frac{X_{1\textrm{p}1}^{\prime }}{l_2}\ne \frac{X_{1\textrm{p}2}^{\prime }}{l_1}\right), $$
(19)
$$ \beta \in \left]0,\pi \right[, $$
(20)
$$ \vartheta \in \left[-\frac{\pi }{2},\frac{\pi }{2}\right]. $$
(21)

It is understood that X0 > 0, Y0 > 0, l2 > l1 ≠ 0 , d2 > d2 ≠ 0. It should be also noted that, due to the construction of the bifocal system, the following relations hold:

$$ \Delta {X}_{12\textrm{p}1}^{\prime}\ne 0,\frac{l_1{d}_2\Delta {X}_{12\textrm{p}2}^{\prime }-{l}_2{d}_1\Delta {X}_{12\textrm{p}1}^{\prime }}{l_1\Delta {X}_{12\textrm{P}1}^{\prime }-{l}_2\Delta {X}_{12\textrm{p}2}^{\prime }}>0,\textrm{and}\ {l}_1\Delta {X}_{12\textrm{P}1}^{\prime }-{l}_2\Delta {X}_{12\textrm{p}2}^{\prime}\ne 0. $$

During the algebraic manipulation carried out to obtain expression (13), it has been implicitly assumed that \( -1\le \frac{\Delta {X}_{12\textrm{p}1}^{\prime}\left(D+{d}_1\right)}{2{l}_1{X}_0\cos \gamma}\le 1 \). Therefore, expression (13) is well defined for all given values of ΔX12p1 and ΔY12p1 (recall also that, by (18), \( \gamma \ne \pm \frac{\pi }{2} \)). Analogous considerations hold for (17) by (20). Note additionally that the existence conditions for γ can be extended considering the whole angle [0, 2π ] provided that it is possible to identify, each time, the specific sub-intervals involved (e.g., \( \gamma \in \left]-\frac{\pi }{2},\frac{\pi }{2}\right[ \) or \( \gamma \in \left]\frac{\pi }{2},\frac{3\pi }{2}\right[ \)). In any case, the conditions \( \gamma \ne \pm \frac{\pi }{2} \) and \( \gamma \ne \frac{3\pi }{2} \) must be satisfied. Analogous considerations hold for β, while no extension for ϑ would be of use.

3.4 Error Analysis

This section studies the impact of measurement errors, relative to the image-points, on the computation of the position and attitude of the target (by the inverse equation system corresponding to (11), (12), (13), (14), (15), and (17)). More precisely, taking a worst-case perspective, the maximum error for the terms ϑ, β, γ, D, Δx, and Δy is estimated, assuming that the absolute value of the measurement error associated with the variables X1p1, Y1p1, X2p1, Y2p1Y3p1,\( {X}_{1\textrm{p}2}^{\prime } \), and\( {X}_{2\textrm{p}2}^{\prime } \) is less than a given E. The distance D is deemed to represent the most critical aspect, especially when large distances are involved. The maximum error |εD|max with respect to the distance D is briefly discussed next.

Considering Eq. (12), the error ∣εD∣ with respect to the actual distance D is expressed as follows:

$$ \mid {\varepsilon}_{\textrm{D}}\mid =\left|\frac{l_1{d}_2\Delta {X}_{12\textrm{p}2}^{\prime }-{l}_2{d}_1\Delta {X}_{12\textrm{p}1}^{\prime }+{l}_1{d}_2{\varepsilon}_{12\textrm{p}2}^{\prime }-{l}_2{d}_1{\varepsilon}_{12\textrm{p}1}^{\prime }}{l_1\Delta {X}_{12\textrm{p}1}^{\prime }-{l}_2\Delta {X}_{12\textrm{p}2}^{\prime }+{l}_1{\varepsilon}_{12\textrm{p}1}^{\prime }-{l}_2{\varepsilon}_{12\textrm{p}2}^{\prime }}-D\right|, $$
(22)

where ε12p1 and \( {\varepsilon}_{12\textrm{p}2}^{\prime } \) represent the measurement errors corresponding to ΔX12p1 and \( {\Delta X}_{12\textrm{p}2}^{\prime } \), respectively. Depending on the specific technology adopted, with the estimated maximum error E, the following bounds are considered:

$$ \left|{\varepsilon}_{12\textrm{p}1}^{\prime}\right|\le E, $$
(23-1)
$$ \left|{\varepsilon}_{12\textrm{p}2}^{\prime}\right|\le E. $$
(23-2)

Since \( \frac{l_1{d}_2\Delta {X}_{12\textrm{p}2}^{\prime }-{l}_2{d}_1\Delta {X}_{12\textrm{p}1}^{\prime }}{l_1\Delta {X}_{12\textrm{p}1}^{\prime }-{l}_2\Delta {X}_{12\textrm{p}2}^{\prime }}=D \) (D ≥ 0), and l1ΔX12p1 − l2ΔX12p2 < 0 (l2 > l1 and ΔX12p2 > ΔX12p1 due to the way the optical system is constructed), the following relation holds:

$$ {\left|{\varepsilon}_{\textrm{D}}\right|}_{\textrm{max}}=\kern0.5em {\max}_{\vartheta, \beta, \gamma, \Delta x,\Delta y,\kern0.5em {\varepsilon}_{12\textrm{p}1}^{\prime },{\varepsilon}_{12\textrm{p}2}^{\prime }}\left|\frac{l_1{d}_2\Delta {X}_{12\textrm{p}2}^{\prime }-{l}_2{d}_1\Delta {X}_{12\textrm{p}1}^{\prime }+{l}_1{d}_2{\varepsilon}_{12\textrm{p}2}^{\prime }-{l}_2{d}_1{\varepsilon}_{12\textrm{p}1}^{\prime }}{l_1\Delta {X}_{12\textrm{p}1}^{\prime }-{l}_2\Delta {X}_{12\textrm{p}2}^{\prime }+{l}_1{\varepsilon}_{12\textrm{p}1}^{\prime }-{l}_2{\varepsilon}_{12\textrm{p}2}^{\prime }}-D\right|, $$
(24)

where ϑ, β, γ, ∆x, ∆y vary within given intervals (delimiting the roto-translations to consider). To find a first approximation \( {\left|\overset{\sim }{\varepsilon_{\textrm{D}}}\right|}_{\textrm{max}} \) for (24) (by involving a reduced number of variables), the following terms are introduced:

$$ {\eta}_{\textrm{D}}^{-}={\max}_{\vartheta, \beta, \gamma, \Delta x,\Delta y,\Delta z}\left\{D-\frac{l_1{d}_2\Delta {X}_{12\textrm{p}2}^{\prime }-{l}_2{d}_1\Delta {X}_{12\textrm{p}1}^{\prime }+{l}_1{d}_2E+{l}_2{d}_1E}{l_1\Delta {X}_{12\textrm{p}1}^{\prime }-{l}_2\Delta {X}_{12\textrm{p}2}^{\prime }-{l}_1E-{\textrm{l}}_2E}\right\}, $$
(25-1)
$$ {\eta}_{\textrm{D}}^{+}={\max}_{\vartheta, \beta, \gamma, \Delta x,\Delta y,\Delta z}\left\{\frac{l_1{d}_2\Delta {X}_{12\textrm{p}2}^{\prime }-{l}_2{d}_1\Delta {X}_{12\textrm{p}1}^{\prime }-{l}_1{d}_2E-{l}_2{d}_1E}{l_1\Delta {X}_{12\textrm{p}1}^{\prime }-{l}_2\Delta {X}_{12\textrm{p}2}^{\prime }+{l}_1E+{l}_2E}-D\right\}. $$
(25-2)

Here we assume \( \frac{l_1{d}_2\Delta {X}_{12\textrm{p}2}^{\prime }-{l}_2{d}_1\Delta {X}_{12\textrm{p}1}^{\prime }+{l}_1{d}_2E+{l}_2{d}_1E}{l_1\Delta {X}_{12\textrm{p}1}^{\prime }-{l}_2\Delta {X}_{12\textrm{p}2}^{\prime }-{l}_1E-{l}_2E}>0 \), and \( \frac{l_1{d}_2\Delta {X}_{12\textrm{p}2}^{\prime }-{l}_2{d}_1\Delta {X}_{12\textrm{p}1}^{\prime }-{l}_1{d}_2E-{l}_2{d}_1E}{l_1\Delta {X}_{12\textrm{p}1}^{\prime }-{l}_2\Delta {X}_{12\textrm{p}2}^{\prime }+{l}_1E+{l}_2E}>0 \). The approximate term \( {\left|\overset{\sim }{\varepsilon_{\textrm{D}}}\right|}_{\textrm{max}} \) (\( {\left|\overset{\sim }{\varepsilon_{\textrm{D}}}\right|}_{\textrm{max}}>{\left|{\varepsilon}_{\textrm{D}}\right|}_{\textrm{max}} \)) is thus expressed as

$$ {\left|\overset{\sim }{\varepsilon_{\textrm{D}}}\right|}_{\textrm{max}}=\max \left\{{\eta}_{\textrm{D}}^{-},{\eta}_{\textrm{D}}^{+}\ \right\}. $$
(26)

Let us remark that more than one single critical roto-translation in (24) or (26) could occur that gives rise to the maximum error: for example, due to symmetrical conditions. The issue of identifying the whole set of critical roto-translations is not trivial, and it is not considered in this chapter: it could be the subject of further research.

4 Optimization Aspects

Mathematical optimization models have been introduced in Sect. 3.4 concerning the error analysis of the studied optical system. There, a worst-case approach is adopted to estimate the maximum possible error with respect to the distance D (due to measurement errors relative to the projection differences ΔX12p1 and ΔX12p1 associated with lenses 1 and 2, respectively). In this section, two optimization problems are discussed with the scope of maximizing the overall performances of the bifocal system illustrated in this chapter. The first optimization problem, discussed in Sect. 4.1, is aimed at finding the size of the optical head so that the error with respect to distance D becomes as small as possible.

The second optimization problem concerns the specific features of the light spots utilized as the target image for the bifocal system. In fact, the light spots are not idealized geometrical points (as assumed in the mathematical formulation of Sect. 3), but they have an actual size. Therefore, each light spot is identified as a surface by the image sensor (CCD). In general, the light energy intensity on this surface does not have a uniform distribution. The concept of light spot centroid is introduced to represent the distribution of the light energy intensity on the light spot surface. This concept is the equivalent to the center of mass for a mass distribution in a geometric domain. Each light spot centroid is identified with one of the three single light points utilized by the bifocal system, to determine the roto-translation of the whole target as discussed in Sect. 3. In general, a roto-translation of the target yields a variation in the light energy intensity distribution on the surface of each light spot. The position of each light spot centroid typically varies as well with respect to the CCD. This centroid displacement is thus due to not only the actual roto-translation of the whole target, but also to the change in the light energy intensity distribution on the light spot surface. Therefore, the centroid displacement on each light spot surface, obtained after a roto-translation of the target, also depends on the light energy intensity distribution of the spot before the roto-translation. If the roto-translations of the target are limited with respect to the initial (nominal) condition, then the light energy intensity distribution of each light spot in the nominal condition can be properly shaped a priori, to minimize the centroid displacement. Consequently, under these proximity conditions, the precision of the bifocal metrological approach is maximized. This optimization aspect is discussed in Sect. 4.2.

4.1 System Sizing Optimization

The maximum error |εD|max related to the distance D, as expressed by (24) depends implicitly also on the following parameters that characterize the bifocal system: d1, d2 (corresponding to D + d1, D + d2 working distances of lenses 1 and 2, respectively), l1, l2 (internal optical lengths of lenses 1 and 2, respectively), and f1, f2 (focal distances of lens 1 and 2, respectively). These parameters can vary within their ranges: \( {d}_1\in \left[\underline{D}_{1},\overline{D_1}\right] \), \( {d}_2\in \left[\underline{D}_{2},\overline{D_2}\right] \), \( {l}_1\in \left[\underline{L_1},\overline{L_1}\right] \), \( {l}_2\in \left[\underline{L_2},\overline{L_2}\right] \), \( {f}_1\in \left[\underline{F_1},\overline{F_1}\right] \), \( {f}_2\in \left[\underline{F_2},\overline{F_2}\right] \), where \( \underline{D}_{1}, \) \( \underline{D}_{2},\underline{L_1}, \) \( \underline{L_2}, \) \( \underline{F_1}, \) \( \underline{F_2} \) and \( \overline{D_1} \), \( \overline{D_2},\overline{L_1} \), \( \overline{L_2}, \) \( \overline{F_1} \), \( \overline{F_2} \) are the lower and upper technological bounds. Next, the following minimax optimization problem is considered:

$$ {\left|\overline{\varepsilon_{\textrm{D}}}\right|}_{\textrm{min}}=\kern0.75em {\min}_{d_1,{d}_2,{l}_1,{l}_2,{f}_1,{f}_1}\left\{{\max}_{\vartheta, \beta, \gamma, \Delta x,\Delta y,{\varepsilon}_{12\textrm{p}1}^{\prime },{\varepsilon}_{12\textrm{p}2}^{\prime }}\left|\frac{l_1{d}_2\Delta {X}_{12\textrm{p}2}^{\prime }-{l}_2{d}_1\Delta {X}_{12\textrm{p}1}^{\prime }+{l}_1{d}_2{\varepsilon}_{12\textrm{p}2}^{\prime }-{l}_2{d}_1{\varepsilon}_{12\textrm{p}1}^{\prime }}{l_1\Delta {X}_{12\textrm{p}1}^{\prime }-{l}_2\Delta {X}_{12\textrm{p}2}^{\prime }+{l}_1{\varepsilon}_{12\textrm{p}1}^{\prime }-{l}_2{\varepsilon}_{12\textrm{p}2}^{\prime }}-D\right|\right\}. $$
(27)

Here, \( {\left|\overline{\varepsilon_{\textrm{D}}}\right|}_{\textrm{min}}\kern0.5em \)is the minimum error related to the distance D, with respect to the parameters d1, d2, l1, l2, f1, and f2, in the worst case as expressed by (24). Considering the difficulty of this optimization problem, especially if a global optimization point of view is adopted (see, e.g., [5, 8, 11]), the variables \( \vartheta, \beta, \gamma, \Delta x,\Delta y,{\varepsilon}_{12\textrm{p}1}^{\prime },{\varepsilon}_{12\textrm{p}2}^{\prime } \), and \( {\varepsilon}_{12\textrm{p}2}^{\prime } \) can be fixed in (27) at the values obtained by solving optimization problem (24). This implies that problem (27) can be simplified as follows:

$$ {\left|\overline{\varepsilon_{\textrm{D}}}\right|}_{\textrm{min}}=\kern0.75em {\min}_{d_1,{d}_2,{l}_1,{l}_2,{f}_1,{f}_1}\left\{\left|\frac{l_1{d}_2\Delta {X}_{12\textrm{p}2}^{\prime }-{l}_2{d}_1\Delta {X}_{12\textrm{p}1}^{\prime }+{l}_1{d}_2{\varepsilon}_{12\textrm{p}2}^{\prime }-{l}_2{d}_1{\varepsilon}_{12\textrm{p}1}^{\prime }}{l_1\Delta {X}_{12\textrm{p}1}^{\prime }-{\textrm{l}}_2\Delta {X}_{12\textrm{p}2}^{\prime }+{l}_1{\varepsilon}_{12\textrm{p}1}^{\prime }-{l}_2{\varepsilon}_{12\textrm{p}2}^{\prime }}-D\right|\right\}, $$
(28)

where only d1, d2, l1, l2, f1, f1 are considered as variables, while \( \Delta {X}_{12\textrm{p}2}^{\prime },\Delta {X}_{12\textrm{p}1}^{\prime },\\ {\varepsilon}_{12\textrm{p}2}^{\prime },\varepsilon {\prime}_{12\textrm{p}1} \) take instead the values corresponding to the worst-case solution provided by (24). In this framework it is implicitly assumed that exclusively the critical roto-translation corresponding to the solution obtained for (24) is considered. If the number of critical roto-translations providing the same error |εD|max as the solution of (24) is available, then it is possible to properly extend optimization problem (28) in order to consider all of these. For this purpose, the error

$$ {\left(\left|\overline{\varepsilon_{\textrm{D}}}\right|\right)}_{\alpha }={\left|\frac{l_1{d}_2\Delta {X}_{12\textrm{p}2}^{\prime }-{l}_2{d}_1\Delta {X}_{12\textrm{p}1}^{\prime }+{l}_1{d}_2{\varepsilon}_{12\textrm{p}2}^{\prime }-{l}_2{d}_1{\varepsilon}_{12\textrm{p}1}^{\prime }}{l_1\Delta {X}_{12\textrm{p}1}^{\prime }-{l}_2\Delta {X}_{12\textrm{p}2}^{\prime }+{l}_1{\varepsilon}_{12\textrm{p}1}^{\prime }-{l}_2{\varepsilon}_{12\textrm{p}2}^{\prime }}-D\right|}_{\alpha } $$

can be associated with each critical roto-translation solution α (fixing the corresponding values \( \vartheta, \beta, \gamma, \Delta x,\Delta y,{\varepsilon}_{12\textrm{p}1}^{\prime },{\varepsilon}_{12\textrm{p}2}^{\prime } \) obtained from (24)). The extended optimization problem can be then expressed as \( {\min}_{d_1,{d}_2,{l}_1,{l}_2,{f}_1,{f}_1}\sum_{\alpha }{\left(\left|\overline{\varepsilon_{\textrm{D}}}\right|\right)}_{\alpha } \).

4.2 Light Spot Optimization

In this section, first we recall the concept of light spot centroid: then the spot optimization model is formulated. As we pointed out, the concept of centroid in optical metrology is analogous to that of the center of mass relative to a continuous mass distribution within a given domain. As is known, this is formulated as follows:

$$ {u}_{\textrm{c}}=\frac{\int_{\textrm{S}}{u}_{\beta}\rho (u) du}{\int_{\textrm{S}}\rho (u) du}. $$
(29)

Here, uc is the center of mass, in the three-dimensional case uβ with β = 1, 2, 3 are the coordinates in the given reference frame, S is the continuous domain, and ρ(u) is the associated density function. In the optical context, an analogous expression holds, when the mass density is replaced by the light energy intensity I(u). Regarding the discretized image created on the CCD, the following two-dimensional light intensity centroid is introduced:

$$ {X}_{\textrm{c}}=\frac{\sum_{i,j}{X}_{ij}I\left({X}_{ij},{Y}_{ij}\right)}{I_{\textrm{T}}},\kern2.00em {Y}_{\textrm{c}}=\frac{\sum_{i,j}{Y}_{ij}I\left({X}_{ij},{Y}_{ij}\right)}{I_{\textrm{T}}}. $$
(30)

Here, (Xij,Yij) ∈ 0 × 0 (denoting non-negative integer coordinates) are the points (pixels) representing the CCD (discretized) domain with respect to a given orthogonal reference frame (O, X, Y) associated with the CCD. The integer-valued function I(Xij,Yij) ∈  is the light energy intensity. Without loss of generality, it is assumed that (X00, Y00) = (0, 0), and that Xij, Yij ≥ 0 holds for all pairs i, j. Any roto-translation of a target (consisting of light spots) generates, in general, a displacement in the corresponding light spot centroids. Therefore, considering a general light spot, the relative centroid displacement with respect to the CCD reference frame coordinates is expressed as follows:

$$ \Delta {X}_{\textrm{c}}=\left|\frac{\sum_{ij}{X}_{ij}{I}_{ij}-\sum_{ij}{X}_{ij}{I}_{ij}^{\prime }}{I_{\textrm{T}}}\right|,\kern2.00em \Delta {Y}_{\textrm{c}}=\left|\frac{\sum_{ij}{Y}_{ij}{I}_{ij}-\sum_{ij}{Y}_{ij}{I}_{ij}^{\prime }}{I_{\textrm{T}}}\right|. $$
(31)

Here, Iij and \( {I}_{ij}^{\prime } \) represent (with a simplified notation) the intensities I(Xij, Yij) and I ′ (Xij, Yij) before and after the centroid displacement, respectively. Moreover, IT ≡  ∑ Iij with the intensity conservation condition shown below:

$$ \sum_{ij}{I}_{ij}=\sum_{ij}{I}_{ij}^{\prime }. $$
(32)

Therefore, the overall Euclidean centroid displacement is

$$ {\Delta }_{\textrm{c}}=\frac{1}{I_{\textrm{T}}}\sqrt{\Delta {X_{\textrm{c}}}^2+\Delta {Y_{\textrm{c}}}^2}. $$
(33)

In the following, it is assumed that the light spots corresponding to the initial target position and attitude with respect to the bifocal system can be generated so that their intensity distributions are identical to each other. Therefore, the optimization problem discussed here focuses on a single light spot. Its initial intensity distribution w(Xij,Yij) ∈ , expressed in bit units and referred to in the following as the nominal state, is considered together with a changed state w(Xij,Yij) ∈  (also expressed in bit units). In the following, these notations are simplified as wij and wij. The changed state wij corresponds to an overall intensity distribution variation, considered as representative of the perturbations of the nominal light spot in a limited neighborhood of the initial conditions.

The objective of the optimization approach proposed is to determine an intensity distribution wij for the nominal light spot to minimize the overall centroid displacement, as expressed by (33) corresponding to the perturbed intensity state wij. Moreover, we want to determine the intensity distribution wij so that it is highly “regular,” symmetrical with respect to the centroid and strongly concentrated around it (while vanishing as more external areas are reached). The function \( {w}_{ij}^{\prime }={f}_{ij}\Big({w}_{ij} \), ϑ, β, γ, Δx, Δy, Δz) that, for each (Xij,Yij), associates the perturbed intensity wij with the nominal intensity wij and the general roto-translation ϑ, β, γ, Δx, Δy, Δz (recall Sect. 3) is very difficult to express mathematically.

Observe that considering all possible perturbations (ϑ, β, γ, Δx, Δy, Δz) in a limited neighborhood of the nominal condition (corresponding to ϑ = 0, β = 0, γ = 0, Δx = 0, Δy = 0, Δz = 0) would lead to a challenging minimax problem. These aspects are therefore omitted in the model discussed here, providing instead a simplified approximate formulation of the problem. This model is expressed in terms of mixed integer linear programming (MILP), consult, e.g., Hillier [7], Minoux [9], and Nemhauser [10]. The convention of using capital letters for constants and lower-case letters for variables is adopted henceforth.

A discretized domain (sub-domain of the CCD containing the light spot) is described by the integer coordinates (Xij,Yij) assuming that i ∈ [0, N], j ∈ [0, N], and N is an even integer. The perturbed intensity distribution \( {w}_{ij}^{\prime } \) is supposed to be determined by imposing that \( \forall i,j\kern0.5em \left|{w}_{ij}-{w}_{ij}^{\prime}\right|\le \overline{\Delta W} \) and \( \sum_{ij}\left|{w}_{ij}-{w}_{ij}^{\prime}\right|={P}_{\textrm{T}} \), with \( \overline{\Delta W} \) and PT as appropriately chosen positive constants, in addition to the total intensity conservation Eq. (32) applied to wij and \( {w}_{ij,}^{\prime } \), i.e., \( \sum_{ij}w= \) \( \sum_{ij}w{\prime}_{ij} \). All this is, as a matter of fact, just a simplifying trick to avoid the non-trivial challenge of expressing the actual function \( {w}_{ij}^{\prime }={f}_{ij}\Big({w}_{ij} \), ϑ, β, γ, Δx, Δy, Δz) explicitly, as well as of dealing with a far more challenging minimax problem. The resulting formulation is therefore a surrogate model, defined in order to provide approximate practical solutions, while removing excessive complexity.

Following this approach, in addition to the intensity conservation condition (32), we assume that it is possible to choose \( \overline{\Delta W} \)and PT so that the conditions listed above can realistically condition the intensity distributions \( {w}_{ij}^{\prime } \) as approximately representative of all considered perturbations. In this perspective, these conditions, acting on the intensity variations both at a pixel and at an overall level, can be interpreted as a relaxation of the constraint \( {w}_{ij}^{\prime }={f}_{ij}\Big({w}_{ij} \), ϑ, β, γ, Δx, Δy, Δz) and a significant simplification of the original minimax problem. The values for \( \overline{\Delta W} \)and PT have to be estimated a priori, and this necessarily entails some arbitrariness. For instance, these parameters could be chosen bearing in mind the most critical or the most frequent cases. In general, different estimations lead to diverse solutions for the intensity distribution wij of the nominal light spot. The alternative solutions obtained can be compared to each other, to select (by means of dedicated numerical simulations) the most suitable scenario from a practical point of view.

Considering all aspects stated above, the simplified model is formulated next, explaining step by step the meaning of the expressions involved. First, the binary variables σij ∈ {0, 1}, \( {\delta}_{ij}^{-}\in \left\{0,1\right\} \), and \( {\delta}_{ij}^{+}\in \left\{0,1\right\} \) are defined as follows:

$$ {\displaystyle \begin{array}{ll}{\sigma}_{ij}=1& \textrm{if}\kern0.5em \textrm{pixel}\;\left(i,j\right)\;\textrm{is}\ \textrm{active}\ \left(\textrm{i}.\textrm{e}.,{w}_{ij}>0\right)\\ {}{\sigma}_{ij}=0& \textrm{otherwise},\end{array}} $$
$$ {\displaystyle \begin{array}{ll}{\delta}_{ij}^{-}=1& \textrm{if}\kern0.5em {w}_{ij}>{w}_{ij}^{\prime },\\ {}{\delta}_{ij}^{-}=0& \textrm{otherwise},\end{array}} $$
$$ {\displaystyle \begin{array}{ll}{\delta}_{ij}^{+}=1& \textrm{if}\kern0.5em {w}_{ij}<{w}_{ij}^{\prime },\\ {}{\delta}_{ij}^{-}=0& \textrm{otherwise}.\end{array}} $$

Here \( {w}_{ij},{w}_{ij}^{\prime}\in \) 0.

The variables \( {d}_{ij}^{-}\in {\mathbb{N}}_0 \) and \( {d}_{ij}^{+}\in {\mathbb{N}}_0 \)are defined next as follows:

$$ {\displaystyle \begin{array}{ll}{d}_{ij}^{-}={w}_{ij}-{w}_{ij}^{\prime }& \textrm{if}\kern0.5em {w}_{ij}>{w}_{ij}^{\prime },\\ {}{d}_{ij}^{-}=0& \textrm{otherwise},\end{array}} $$
$$ {\displaystyle \begin{array}{ll}{d}_{ij}^{+}={w}_{ij}^{\prime }-{w}_{ij}& \textrm{if}\kern0.5em {w}_{ij}<{w}_{ij}^{\prime },\\ {}{d}_{ij}^{+}=0& \textrm{otherwise}.\end{array}} $$

The variables σij, \( {\delta}_{ij}^{-} \), \( {\delta}_{ij}^{+} \), \( {d}_{ij}^{-} \), and \( {d}_{ij}^{+} \) are interrelated as shown below:

$$ \forall i,j\kern1.5em {\delta}_{ij}^{-}+{\delta}_{ij}^{+}\le {\sigma}_{ij}, $$
(34)
$$ \forall i,j\kern1.5em {d}_{ij}^{-}\ge \underline{\Delta W}{\delta}_{ij}^{-}, $$
(35-1)
$$ \forall i,j\kern1.5em {d}_{ij}^{+}\ge \underline{\Delta W}{\delta}_{ij}^{+}, $$
(35-2)
$$ \forall i,j\kern1.5em {d}_{ij}^{-}\le \overline{\Delta W}{\delta}_{ij}^{-}, $$
(36-1)
$$ \forall i,j\kern1.5em {d}_{ij}^{+}\le \overline{\Delta W}{\delta}_{ij}^{+}. $$
(36-2)

Here the positive integers \( \underline{\Delta W} \)and \( \overline{\Delta W} \) represent the minimum and maximum admissible intensity change, expressed in bit units, for each active pixel (i, j). The lower bounds (35-1) and (35-2) are introduced to easily realize detectable changes in the intensity distribution \( {w}_{ij}^{\prime } \). The following lower and upper bounds hold for both the nominal and perturbed states wij and \( {w}_{ij}^{\prime } \), respectively, in each active pixel:

$$ \forall i,j\kern1.5em {w}_{ij}-{d}_{ij}^{-}\ge \underline{W}{\sigma}_{ij}, $$
(37-1)
$$ \forall i,j\kern1.5em {w}_{ij}+{d}_{ij}^{+}\le \overline{W}{\sigma}_{ij}. $$
(37-2)

Here the positive integers \( \underline{W} \)and \( \overline{W} \) represent the minimum and maximum admissible intensity associated with each pixel (i, j). From (37-1) and (37-2) it can also be observed that all active pixels in the nominal condition coincide with the active pixels of the perturbed state: this way, the overall shape of the light spot remains unaltered.

The next condition represents the overall intensity perturbation imposed, expressed again in bit units:

$$ \sum_{ij}\left({d}_{ij}^{-}+{d}_{ij}^{+}\right)={P}_{\textrm{T}} $$
(38)

Here PT is a positive integer.

The overall intensity (WT) conservation condition, involving both the nominal and the perturbed states wij and wij, is formulated as:

$$ {W}_{\textrm{T}}=\sum_{ij}{w}_{ij}=\sum_{ij}{w}_{ij}^{\prime }. $$
(39)

As a reasonable general condition, the intensity distribution wij is expected to approximate a bell-shaped continuous function with compact support. (For a similar shape, one can think of an appropriately truncated normal distribution.) The conditions introduced next pursue this overall shape by regularity and symmetry criteria. Specifically, the conditions given below determine a monotonically increasing or decreasing trend, with respect to the central pixel \( \left(\hat{i},\hat{j}\right) \)for the nominal intensity distribution wij:

$$ \forall i,j\mid i<\hat{i} - 1\kern2.50em {w}_{ij}+\underline{DW}{\sigma}_{ij}\le {w}_{i+1,j}, $$
(40-1)
$$ \forall j\kern2.5em {w}_{\hat{i}-1,j}\le {w}_{\hat{i}j}, $$
(40-2)
$$ \forall j\kern2.25em {w}_{\hat{i}+1,j}\le {w}_{\hat{i}j}, $$
(40-3)
$$ \forall i,j\mid i>\hat{i} + 1\kern2em {w}_{ij}-\underline{DW}{\sigma}_{ij}\ge {w}_{i+1,j}, $$
(40-4)
$$ \forall i,j\mid j<\hat{j} - 1\kern2.75em {w}_{ij}+\underline{DW}{\sigma}_{ij}\le {w}_{i,j-1}, $$
(40-5)
$$ \forall i\kern2.75em {w}_{i,\hat{j}-1}\le {w}_{i\hat{j}}, $$
(40-6)
$$ \forall i\kern2.5em {w}_{i,\hat{j}+1}\le {w}_{i\hat{j}\prime } $$
(40-7)
$$ \forall \textrm{i},j\mid j>\hat{j} + 1\kern2.25em {w}_{ij}-\underline{DW}{\sigma}_{ij}\ge {w}_{i,j+1}. $$
(40-8)

Here, \( \hat{i}, \hat{j}=\frac{N}{2} \), and the positive integer \( \underline{DW} \) is the minimum intensity increment/decrement (expressed in bit units) corresponding to two adjacent indices i, i + 1 or j, j + 1, except for the indices immediately adjacent to \( \hat{i} \) and \( \hat{j} \). The following “Lipschitzian” conditions are further imposed to the nominal intensity distribution wij to prevent sudden increments/decrements in adjacent indices:

$$ \forall i,j\mid i<\hat{i\ }\kern1em {w}_{i+1,j}-{w}_{\textrm{i}j}\le \overline{DW}, $$
(41-1)
$$ \forall i,j\mid i>\hat{i\ }\kern1em {w}_{ij}-{w}_{i+1,j}\le \overline{DW}, $$
(41-2)
$$ \forall i,j\mid j<\hat{j\ }\kern1em {w}_{i,j+1}-{w}_{ij}\le \overline{DW}, $$
(41-3)
$$ \forall i,j\mid j>\hat{j\ }\kern1.25em {w}_{ij}-{w}_{i,j+1}\le \overline{DW}, $$
(41-4)

where \( \overline{DW} \) is an appropriate bound, compliant with the given technological restrictions.

Several “weak” symmetry conditions are introduced additionally, to induce a desired overall trend of the nominal intensity distribution wi, j. Their formulation is shown below:

$$ \forall j\kern1em {w}_{0j}={w}_{Nj}, $$
(42-1)
$$ \forall j\kern1em {w}_{\hat{i}-1,j}={w}_{\hat{i}+1,j}, $$
(42-2)
$$ \forall j\kern1em \sum_{i<\hat{i}}{w}_{ij}=\sum_{i>\hat{i}}{w}_{ij}, $$
(42-3)
$$ \forall i\kern1em {w}_{i0}={w}_{iN}, $$
(42-4)
$$ \forall i\kern1em {w}_{i,\hat{j}-1}={w}_{i,j+1}, $$
(42-5)
$$ \forall i\kern0.75em \sum_{j<\hat{J}}{w}_{ij}=\sum_{j>\hat{j}}{w}_{ij}. $$
(42-6)

To consider realistic scenarios, it can be further imposed that all the pixels corresponding to non-decreasing intensity with respect to the nominal distribution (i.e., wij ≥ wij) delimit convex areas of the domain (Xij,Yij). For example, additional conditions such as ∀i, j \( \sum_{\overline{i}>i,j}{d}_{ij}^{-}\le {N}^2\left(1-{\delta}_{ij}^{+}\right) \) can be introduced. These conditions express the property that from a certain index i (not determined a priori) all subsequent pixels with index \( \overline{i}>i \) have a non-decreasing intensity and vice versa. (However, these aspects are not discussed in this chapter.)

Since in any real context the intensity is expected to be maximal in a central area of the spot, this criterion is selected as the optimization objective. The overall centroid displacement is instead bounded by the following inequalities:

$$ \frac{\sum_{ij}{X}_{ij}{w}_{ij}-\sum_{ij}{X}_{ij}{w}_{ij}^{\prime }}{W_{\textrm{T}}}\le {\overline{\Delta }}_{\textrm{C}}, $$
(43-1)
$$ \frac{\sum_{ij}{Y}_{ij}{w}_{ij}-\sum_{ij}{Y}_{ij}{w}_{ij}^{\prime }}{W_{\textrm{T}}}\le {\overline{\Delta }}_{\textrm{C}}, $$
(43-2)

Here \( {\overline{\Delta }}_{\textrm{C}} \) is an appropriate estimation of the maximum acceptable displacement for the centroid. Finally, the following optimization criterion is introduced:

$$ \max \sum_{\left(i,j\right)\in {A}_{\textrm{C}}}{w}_{ij}. $$
(44)

where AC represents a proper central area.

It can be observed that the model solutions are necessarily related to the choice of the parameters \( \overline{\Delta W} \)and PT, assumed to represent the specific case under study. As anticipated, a number of different choices can be compared to select the most suitable option from a practical point of view. This approach could however be significantly improved by adopting a multi-scenario optimization point of view. In this perspective, instead of \( \overline{\Delta W} \)and PT a set of parameters \( {\overline{\Delta W}}_{\alpha } \)and PTα (α = 1, …, NS) are considered. A perturbed intensity \( {w}_{ij\alpha}^{\prime } \) is associated with the corresponding parameters \( {\overline{\Delta W}}_{\alpha } \)and PTα, and the whole model is extended by properly replicating and readjusting the conditions and the objective function involved. However, this aspect – as a possible topic of future research – is not investigated here.

5 An Illustrative Case Study

As mentioned in Sect. 1, the bifocal system has recently been considered for possible utilization for the Athena satellite. The case study is only briefly outlined here, omitting (for confidentiality reasons) most of the technical details.

The two instruments, high-resolution X-ray Integral Field Unit (X-IFU) and Wide Field Imager (WFI) mentioned in Sect. 1, are accommodated within the Focal Plane Module (FPM). The Mirror Assembly Module (MAM) is placed at an assigned distance (12 m) from the FPM. Since the telescope line of sight is determined by the telescope mirror position and attitude, the on-board metrology is aimed at supporting the active control of the MAM focal point position with respect to the FPM instruments. The proposed design for the bifocal metrology on board Athena considers:

  • The bifocal optical head mounted on the MAM and aligned with it

  • One light target on each instrument

The overall light target is essentially composed by three LEDs defining an isosceles triangle. With reference to this specific case study, three main aspects have been investigated in depth (by applying the concepts discussed in Sects. 3.4, 4.1 and 4.2):

  • Error analysis for the measured distance between the MAM focal point position and the FPM instruments

  • Optimal system sizing

  • Light spot optimization

The optimization models considered for both error analysis and system sizing have been solved by the global-local nonlinear optimization software package LGO [12]. Again, technical details are not reported here for confidentiality reasons. As a main result, a trade-off concerning the optimal focal distances f1 and f2 has been made. On the one hand, these terms should be close to each other (f1 ~ f2) and as large as possible, compatibly with the maximum admissible dimension of the optical head. On the other hand, a small difference between f1 and f2 can give rise to possible partial overlapping between the light spot projections on the CCD.

The MILP model adopted for the light spot optimization has been solved by the IBM ILOG CPLEX solver (version 12.3). Several possible solutions have been considered. Figures 11 and 12 illustrate two examples of optimal nominal light spots obtained for the case study (the physical units have been omitted for confidentiality reasons).

Fig. 11
A 3D graph for example 1 of the nominal intensity distribution. The plot has a square on top with 4 triangular openings on 4 sides. The whole plot is divided into 7 layers of different shades.

Case study: nominal intensity distribution example1

Fig. 12
A 3D graph for example 2 of the nominal intensity distribution. The plot has a 3-D trapezoidal pyramid shape divided into4 layers of different shades. The bottom layer of the trapezoidal pyramid extends horizontally around its base with fluctuations that eventually become flat.

Case study: nominal intensity distribution example2

6 Concluding Remarks

The work presented here discusses an innovative metrological approach, based on a recently patented bifocal optical system [2,3,4]. This system is aimed at measuring the position and attitude of a target item, identified by means of three light spots, almost instantaneously and with high precision. The approach proposed is simple and efficient: here it is discussed in relation to advanced space engineering applications, but not limited to these.

In this chapter, the concept of the bifocal optical system is discussed in depth, investigating some mathematical aspects relevant the overall methodology adopted, including error analysis, system sizing optimization, and light spot optimization. Finally, a real-word case study is highlighted.

Future research can be aimed at advancements in relation to both the error analysis and system sizing optimization problems discussed here. In this perspective, enhancements of the global optimization approach followed aimed at improving the global search can be expected. Concerning the light spot optimization problem, to extend solution adaptability and reliability, the corresponding MILP model could be considered in a multi-scenario optimization approach.