Introduction to Yasuura’s Method of Modal Expansion with Application to Grating Problems

Matsushima, Akira; Matsuda, Toyonori; Okuno, Yoichi

doi:10.1007/978-3-319-74890-0_8

Akira Matsushima¹⁴,
Toyonori Matsuda¹⁵ &
Yoichi Okuno¹⁶

Part of the book series: Springer Series on Atomic, Optical, and Plasma Physics ((SSAOPP,volume 99))

708 Accesses

Abstract

In this chapter we introduce the theory of the Yasuura’s method based on modal expansion and explain the methods of numerical computation in detail for several grating problems. After a sample problem we discuss the methods for solving two types of problems that require additional knowledge and steps, that is, scattering by a dielectric cylinder and diffraction by a grating. Some numerical results are shown to give an evidence of an experimental rule for the number of linear equations in formulating the least-squares problem that determines the modal coefficients. After confirming the rule we show a couple of examples of practical interest, i.e., scattering by a relatively deep metal grating, plasmon surface waves on a metal grating placed in conical mounting, scattering by a metal surface modulated in two directions, and scattering by periodically located dielectric spheres. To provide supplementary explanations of particular problems, four appendices are given; H-wave scattering from a cylinder, the normal equation and related topics, conical diffraction by a dielectric grating, and comparison of modal functions and the algorithm of the smoothing procedures.

Access provided by CONRICYT-eBooks. Download chapter PDF

Fourier Modal Method and Its Applications to Inverse Diffraction, Near-Field Imaging, and Nonlinear Optics

An SVD in Spherical Surface Wave Tomography

Kirchhoff’s theory for optical diffraction, its predecessor and subsequent development: the resilience of an inconsistent theory

Article 23 February 2016

8.1 Introduction

In Chap. 6 of the last edition [27] we have introduced Yasuura’s method of modal expansion from two main points of view: one was the relation with the fictitious or equivalent source methods ; and another was the employment of smoothing procedures (SP’s) [10, 24, 25, 31, 41, 42] to obtain rapidly converging solutions. We needed the first point to have the method recognized as one of the modal expansion methods having firm theoretical foundations and a wide range of application. While in the second point we tried to explain our tool to cope with the problem of slow convergence . Because we had been working with the separated solutions as modal functions, we were often troubled by their poor approximation power. Accordingly, Yasuura et al. hit upon an idea of the SP, which works to accelerate the convergence of solutions by reducing the higher-order oscillations on the boundary. The SP, hence, is an important step in solving a 2-D problem^{Footnote 1} where the cross section of the obstacle is strongly deformed from the coordinate curves of a separable system of coordinates.

In the last edition we have included: (1) the theory and the method of numerical execution of the original form of Yasuua’s method, which we call the conventional Yasuura’s method (CYM) today; (2) Yauura’s method with a smoothing or a singular-smoothing procedure (YMSP or YMSSP); and (3) numerical examples obtained mainly by the YMSP and YMSSP. In the present chapter, however, we decided to omit a greater part of SP-related topics in view of the recent trend in computational electromagnetics. That is, the methods for 3-D as well as 2-D analysis of structures made of a dielectric are required in various areas. Instead of removing the SP, we include a detailed explanation on the solution process by the CYM. We hope this helps those who are interested in solving their problems by using Yasuura’s methods, CYM, YMSP, and YMSSP. Because the process with the SP’s are almost in common with that of the CYM, the detailed introduction of the CYM execution process would be useful not only for the CYM users but also for those who intend to employ the YMSP or YMSSP.

The contents of this chapter are as follows: In Sect. 8.2 we first introduce the theory of the CYM briefly and explain the method of numerical computation in detail taking a sample problem. Then, we move on to the methods for solving two types of problems that require additional knowledge and steps: (1) scattering by a dielectric cylinder; and (2) diffraction by a grating . In Sect. 8.3 we show some numerical results. The aim of Sect. 8.3.1 is to give an evidence of an experimental rule for the number of sampling points or, in general, the number of linear equations in formulating the least-squares problem that determines the modal coefficients. Computational results show the number should be twice as many as the number of unknown modal coefficients. After confirming the rule we show a couple of examples of practical interest in Sects. 8.3.2–8.3.5: Scattering by a relatively deep metal grating, Plasmon surface waves on a metal grating placed in conical mounting, Scattering by a metal surface modulated in two directions, and Scattering by periodically located dielectric spheres. Section 8.4 is a conclusion where we state some additional remarks. Finally, four appendices follow mainly providing supplementary explanations of particular problems: 1 H-wave scattering from a cylinder; 2 The normal equation and related topics; 3 Conical diffraction by a dielectric grating; and 4 Comparison between two types of modal functions and a brief introduction to the algorithm with the SP.

8.2 Yasuura’s Method of Modal Expansion

In this section we introduce the foundations of Yasuura’s method of modal expansion. We start by formulating a sample problem: plane wave scattering by a perfectly-conducting (PC) cylinder, the problem from which we can learn the essential part of the method together with important concepts and ideas in Yasuura’s method.

8.2.1 Scattering by a Perfectly-Conducting Cylinder

The geometry of the sample problem is shown in Fig. 8.1. The closed curve C is the cross section and $\mathrm{S}_\mathrm{e}$ is the exterior infinite region of C. We denote a point in $\mathrm{S}_\mathrm{e}$ by $\mathbf{r}\,(r,\theta )$; and one on C by an arc-length s along C measured counterclockwise from a fixed point $s_0$. $\mathrm{S}_\mathrm{e}{}_0$ is an arbitrary closed region that is entirely inside $\mathrm{S}_\mathrm{e}$. Let the incident plane wave be polarized in z and

$$\begin{aligned} \mathbf{E}^\mathrm{i}(\mathbf{r}) = \mathbf{u}_z F(\mathbf{r}) = \mathbf{u}_z \exp [-\mathrm{i} kr \cos (\theta -\iota )], \end{aligned}$$

(8.1)

where $\mathbf{u}_z$ is a unit vector in z-direction, $\iota $ is the angle of incidence shown in Fig. 8.1, and $k = 2\pi /\lambda = \omega /c$ is the wavenumber of the incident field. The $e^{\mathrm{i}\omega t}$ time dependence is assumed. This case of polarization is called E-wave,^{Footnote 2} which is one of the two basic polarizations. We will deal with an E-wave problem in this section and summarize important results of an H-wave case, which is another basic polarization, in Appendix 1.

In the present problem a surface current flows in the z-direction exciting a scattered wave polarized in z again:

$$\begin{aligned} \mathbf{E}^\mathrm{s}(\mathbf{r}) = \mathbf{u}_z \varPsi (\mathbf{r}). \end{aligned}$$

(8.2)

Other non-zero components of the scattered wave, $\mathbf{H}^\mathrm{s}(\mathbf{r}) = \mathbf{u}_x H_x^\mathrm{s}(\mathbf{r}) + \mathbf{u}_y H_y^\mathrm{s}(\mathbf{r})$, can be obtained by^{Footnote 3}

$$\begin{aligned} \mathbf{H}^s(\mathbf{r}) = \frac{\mathrm{i}}{\omega \mu _0} \nabla \varPsi (\mathbf{r}) \times \mathbf{u}_z, \end{aligned}$$

(8.3)

where $\nabla = (\partial /\partial x, \partial /\partial y, 0)$ is the 2-D nabla operator. Hence, our target is $\varPsi (\mathbf{r})$ and we can state our sample problem as:

Problem 1 E-wave, PC. Find the scattered electric field $\varPsi (\mathbf{r})$ that satisfies:

(D1):: The 2-D Helmholtz equation in $\mathrm{S}_\mathrm{e}$
$$\begin{aligned} \nabla ^2 \varPsi (\mathbf{r}) + k^2 \varPsi (\mathbf{r}) = 0 \quad (\mathbf{r}\in \mathrm{S}_\mathrm{e}), \end{aligned}$$
(8.4)
(D2):: The 2-D radiation condition at infinity
$$\begin{aligned} \sqrt{r} \left( \frac{\partial \varPsi (\mathbf{r})}{\partial r} + ik\varPsi (\mathbf{r})\right) \rightarrow 0 \quad (r \rightarrow \infty ), \end{aligned}$$
(8.5)
(D3):: The boundary condition
$$\begin{aligned} \varPsi (s) = f(s) \equiv -F(s) \quad (s \in \,\mathrm{C},\, \mathrm{i.e.},\ 0 \le s \le C). \end{aligned}$$
(8.6)

Here, $\nabla ^2 = \partial ^2/\partial x^2 + \partial ^2/\partial y^2$ denotes the 2-D Laplacian. The condition given by (8.6) is called Dirichlet’s or the first-kind boundary condition.

8.2.2 Modal Functions, Approximate Solution, and Least-Squares Boundary Matching

Here we introduce the analytical part of Yasuura’s method [38,39,40]. Because it is one of the modal expansion methods, we need: (i) definition of a set of modal functions; (ii) a method to construct an approximate solution; and (iii) the sense in which the solution approximates the boundary condition. Let us see these points below.

8.2.2.1 Definition of the Set of Modal Functions

Modal functions for the sample problem are solutions of Helmholtz’s equation (8.4) satisfying some additional requirements. Here, we define a set of modal functions $\{\varphi _m(\mathbf{r}):\ m=1,2,\ldots \}$ as a countable set that satisfy the following three requirements:

(M1):: Each $\varphi _m(\mathbf{r})$ satisfies the Helmholtz equation in $\mathrm{S}_\mathrm{e}$;
(M2):: Each $\varphi _m(\mathbf{r})$ meets the 2-D radiation condition;
(M3):: Both the set of boundary values $\{\varphi _m(s):\,m=1,2,\ldots \}$ and the set of normal derivatives $\{\partial \varphi _m(s)/\partial \nu :\,m=1,2,\ldots \}$ are complete (or total)^{Footnote 4} in the function space $\text{ H } = L^2(\mathrm{C})$ consisting of all the square-integrable functions defined on the boundary C.

The first two requirements are natural and easy to understand; but the third is rather complicated and needs explanation. Here, we would like to call readers’ attention to the fact that (M3) is a little different from the original requirement given in [39], which seems to be lacking in concreteness than the statement above. We have modified the original statement to require completeness of the boundary values.

Now, let us see a couple of examples first to facilitate the understanding. Then, we will give additional explanations for this issue throughout this section.

Example 1

The set of radiative separated solutions

$$\begin{aligned} \varphi _m(\mathbf{r}) = H_m^{(2)}(kr)\exp (\mathrm{i}m\theta ) \quad (m=0,\pm 1, \pm 2,\ldots ), \end{aligned}$$

(8.7)

where $H_m^{(2)}(kr)$ is the second kind Hankel function of order m and the coordinate origin should be inside $\mathrm{S}_\mathrm{i}$, the complimentary region of $\mathrm{S}_\mathrm{e}$.^{Footnote 5}

Example 2

Let L be a smooth closed curve that is entirely inside $\mathrm{S}_\mathrm{i}$ and $\mathrm{D}_\mathrm{e}$ be an exterior infinite region of L. As shown in Fig. 8.2, $\mathrm{S}_\mathrm{e}$ is a subregion of $\mathrm{D}_\mathrm{e}$; and $\mathrm{D}_\mathrm{i}$, the complementary region of $\mathrm{D}_\mathrm{e}$, is a subregion of $\mathrm{S}_\mathrm{i}$. Now, let an enumerable set of functions $\{f_m(s): m=1,2,\ldots \}$ be complete in the function space $L^2(\mathrm{L})$. Then, the set of potential functions defined in $\mathrm{D}_\mathrm{e}$ with $f_m(t)$’s as double-layer density functions on L

$$\begin{aligned} \varphi _m(\mathbf{r}) = -\int \limits _{\mathrm{L}} f_m(t)\,\frac{\partial \psi (kR)}{\partial \nu _t}\,dt \quad (\mathbf{r}\in \mathrm{D}_\mathrm{e};\ R=\overline{t\mathbf{r}};\ m=1,2,\ldots ) \end{aligned}$$

(8.8)

is a set of modal functions in $\mathrm{D}_\mathrm{e}$ provided that k does not coincide with a member of $\{k_\mathrm{H}(\mathrm{D}_\mathrm{i})\}$, the set of eigenvalues of the homogeneous H-wave (Neumann) problem in $\mathrm{D}_\mathrm{i}$.^{Footnote 6} Here, R is the distance between t and $\mathbf{r}$, $\psi (kR) = H_0^{(2)}(kR)/4\mathrm{i}$ is the free-space Green’s function , and $\partial /\partial \nu _t$ denotes normal derivative at t. Note that the ensemble of single-layer potentials can also be the set of modal functions provided $k \not \in \{k_\mathrm{E}(\mathrm{D}_\mathrm{i})\}$, the set of eigenvalues of homogeneous E-wave (Dirichlet) problem in $\mathrm{D}_\mathrm{i}$.

Example 3

Monopole fields whose poles $\mathbf{p}_m$ are located on L

$$\begin{aligned} \varphi _m(\mathbf{r}) = H_0^{(2)}(kR_m) \quad (\mathbf{r}\in \mathrm{D}_\mathrm{e};\; R_m = \overline{\mathbf{p}_m\mathbf{r}};\; m = 1,2,\ldots ,M) \end{aligned}$$

(8.9)

form a set of modal functions in $\mathrm{D}_\mathrm{e}$ when we let $M \rightarrow \infty $ while letting $\overline{\mathbf{p}_m \mathbf{p}_{m+1}} \rightarrow 0$ provided there is no internal resonance in $\mathrm{D}_\mathrm{i}$ [26, 30].

Example 4

The set of multiple-multipole fields whose poles $\mathbf{p}_m$ are on L

$$\begin{aligned} \begin{aligned} \varphi _{mn}(\mathbf{r})&= H_n^{(2)}(kR_m)\exp (\mathrm{i}n\theta _m) \\&(\mathbf{r}\in \mathrm{D}_\mathrm{e};\; R_m = \overline{\mathbf{p}_m\mathbf{r}};\; m=1,2,\ldots ,M; \; n=0, \pm 1, \pm 2, \ldots ) \end{aligned} \end{aligned}$$

(8.10)

is also an example of modal functions.

8.2.2.2 Construction of an Approximate Solution

To define an approximate solution, we first choose a set of modal functions from among possible candidates. Let us take the set of separated solutions in the following analysis. This is because the set of separated solutions is one of the most familiar functions and each member has physical meaning.^{Footnote 7} Thus we can define an approximate solution as a finite summation of the outgoing separated solutions (8.7) with unknown coefficients:

$$\begin{aligned} \varPsi _N(\mathbf{r}) = \sum _{m=-N}^N A_m(M)\,\varphi _m(\mathbf{r}). \end{aligned}$$

(8.11)

Here, $A_m(M)$ means that the $A_m$ coefficient depends on $M = 2N+1$, the number of modal functions employed.^{Footnote 8} Because of the definition of modal functions, the approximate solution already satisfies the requirements (D1) and (D2). The $A_m$ coefficients, hence, should be determined so that the solution meets the boundary condition in a sense of approximation. Let us call this procedure boundary matching and keep in mind that the sense of approximation in boundary matching determines a method of solution.

We employ the least-squares approximation in Yasuura’s method, i.e., minimization of mean-squares boundary residual. We will see in Sect. 8.2.2.3 that this is a promising way in boundary matching provided the completeness of the set of boundary values (M3) is guaranteed.

8.2.2.3 Least-Squares Boundary Matching

We employ integral representations of the solutions to explain the method of solution including convergence of the approximate solutions. For this purpose let us define the Green’s function of our problem, $G(\mathbf{r},\mathbf{r}')$, satisfying Helmholtz’s equation with a unit source at r, radiation condition with respect to $\mathbf{r}'$, and a homogeneous boundary condition^{Footnote 9}

$$\begin{aligned} G(\mathbf{r},s) = 0\quad (\mathbf{r}\in \mathrm{S}_\mathrm{e};\; s \in \,\mathrm{C}). \end{aligned}$$

(8.12)

Using the Green’s formula to $\varPsi (\mathbf{r}')$ and $G(\mathbf{r}, \mathbf{r}')$, we have

$$\begin{aligned} \varPsi (\mathbf{r}) = -\int \limits _{s=0}^C \partial _\nu G(\mathbf{r},s)\varPsi (s)\,ds = -\int \limits _{s=0}^C \partial _\nu G(\mathbf{r},s) f(s)\, ds \quad (\mathbf{r}\in \mathrm{S}_\mathrm{e}). \end{aligned}$$

(8.13)

Here, $\partial _\nu $ denotes the normal derivative at s and the second equality comes from (D3). Besides, we get a similar representation for the approximate solution $\varPsi _N(\mathbf{r})$. Subtracting (8.13) from the representation of $\varPsi _N(\mathbf{r})$ side by side, we have

$$\begin{aligned} \varPsi _N(\mathbf{r}) - \varPsi (\mathbf{r}) = -\int \limits _{s=0}^C \partial _\nu G(\mathbf{r},s) [\varPsi _N(s) - f(s)]\,ds\quad (\mathbf{r}\in \mathrm{S}_\mathrm{e}). \end{aligned}$$

(8.14)

Although (8.14) is a formal representation, we can deduce useful results starting from it.

Let the observation point r be inside the closed region $\mathrm{S}_\mathrm{e}{}_0$ in Fig. 8.1. Then, $\partial _\nu G(\mathbf{r},s)$ is a continuous function of s because there is a non-zero distance between s and r. Taking the absolute value of both sides of (8.14) and applying Cauchy–Schwarz’s inequality to the right-hand side, we obtain

$$\begin{aligned} \big |\varPsi _N(\mathbf{r})-\varPsi (\mathbf{r})\big | \le \sqrt{\int \limits _{s=0}^C |\partial _\nu G(\mathbf{r}, s)|^2\, ds}\,\big \Vert \varPsi _N - f \big \Vert \quad (\mathbf{r}\in \mathrm{S}_\mathrm{e}). \end{aligned}$$

(8.15)

Here, $\Vert f \Vert $ stands for the Euclidean norm of a function f(s) defined by

$$\begin{aligned} \Vert f \Vert = \left[ \int \limits _{s=0}^C |f(s)|^2 \, ds\right] ^{1/2}. \end{aligned}$$

(8.16)

Because the integrand on the right of (8.15) is a continuous function of r, the integral, as a function of r, has a maximum inside the closed region $\mathrm{S}_\mathrm{e}{}_0$:

$$\begin{aligned} G(\mathrm{S}_\mathrm{e}{}_0) = \max _{\mathbf{r}\in \,\mathrm{S}_\mathrm{e0}} \sqrt{\int \limits _{s=0}^C |\partial _\nu G(\mathbf{r}, s)|^2\, ds}\quad (\mathrm{S}_\mathrm{e}{}_0 \subset \,\mathrm{S}). \end{aligned}$$

(8.17)

Thus we have an estimation

$$\begin{aligned} \left| \varPsi _N(\mathbf{r}) - \varPsi (\mathbf{r}) \right| \le G(\mathrm{S}_\mathrm{e}{}_0) \big \Vert \varPsi _N - f \big \Vert \quad (\mathbf{r}\in \mathrm{S}_\mathrm{e}{}_0 \subset \mathrm{S}_\mathrm{e}), \end{aligned}$$

(8.18)

which means that the maximal absolute error in $\mathrm{S}_\mathrm{e}{}_0$ cannot exceed the product of the mean-squares boundary residual and a factor of proportionality $G(\mathrm{S}_\mathrm{e}{}_0)$. Note that the latter depends on the region $\mathrm{S}_\mathrm{e}{}_0$ but does not depend on r.

Now, let us remember the completeness (M3) of the set of boundary values of modal functions. Because the given boundary value f(s) is a member of $\mathbf{H}= L^2(\mathrm{C})$, for given any positive number $\varepsilon $, there is a positive integer $N_0$ such that

$$\begin{aligned} \big \Vert \varPsi _N - f \big \Vert < \varepsilon \quad (N > N_0). \end{aligned}$$

(8.19)

That is, there exists a sequence of boundary values of the approximate solutions $\{ \varPsi _0(s), \varPsi _1(s), \varPsi _2(s), \ldots \}$ that converges to the true boundary value f(s) in the mean-squares sense:

$$\begin{aligned} \big \Vert \varPsi _N - f \big \Vert \rightarrow 0\quad (N \rightarrow \infty ). \end{aligned}$$

(8.20)

Referring to (8.18), we can conclude that the corresponding sequence of approximate solutions $\{ \varPsi _0(\mathbf{r}), \varPsi _1(\mathbf{r}), \varPsi _2(\mathbf{r}), \ldots \}$ converges to $\varPsi (\mathbf{r})$ uniformly in the closed region $\mathrm{S}_\mathrm{e}{}_0$ ^{Footnote 10}: for given any positive number $\varepsilon $, there is a positive integer $N_0(\mathrm{S}_\mathrm{e}{}_0, \varepsilon )$ such that

$$\begin{aligned} \left| \varPsi _N(\mathbf{r}) - \varPsi (\mathbf{r}) \right| < \varepsilon \quad (\mathbf{r}\in \mathrm{S}_\mathrm{e}{}_0 \subset \mathrm{S}_\mathrm{e};\; N > N_0(\mathrm{S}_\mathrm{e}{}_0, \varepsilon )). \end{aligned}$$

(8.21)

We can get such a sequence by solving repeatedly the following least-squares problem (LSP) stated in the function space $\mathbf{H}$.

LSP 1: E-wave, PC. Find the coefficients $A_m(M)$ $(m=0, \pm 1, \ldots , \pm N;\; M=2N+1)$ that minimize the normalized mean-squares boundary residual

$$\begin{aligned} E_N = \frac{\big \Vert \varPsi _N - f \big \Vert ^2}{\left\| f \right\| ^2} = \frac{1}{\left\| f \right\| ^2}\, \left\| \sum _{m=-N}^N A_m(M)\varphi _m - f \right\| ^2. \end{aligned}$$

(8.22)

Note that the least-squares boundary matching means a relaxation of the boundary condition because (8.6) implies $\Vert \varPsi - f \Vert = 0$; but the converse is not always true. The smoothing procedure (SP), which we mentioned in Introduction, is an extension of the relaxation idea: we minimize $\Vert \int (\varPsi -f)\, ds \Vert $ instead of $\Vert \varPsi -f \Vert $; and extinction of the latter is stronger than vanishing of the former [10, 24, 31, 41, 42]. Although the Yasuura’s method with the SP is a strong tool for 2-D problems, we shall not get deeply in this subject.

8.2.3 Method of Numerical Solution

Because computers cannot handle continuous functions, we need (i) method of discretization of LSP 1 and (ii) method of solution to the discretized problem.^{Footnote 11}

8.2.3.1 Method of Discretization

To discretize the problem we first locate J $(\ge M)$ sampling points on C, assign a set of integers from 0 through J to them, and get a numbered set of sampling points $\{s_0, s_1, \ldots , s_J\}$. Because C is a closed curve, we give two numbers, 0 and J, to the point $s = 0$. Two methods are usually used in locating the points:

1.
An equal division of the boundary C: In most applications we can recommend to use the points given by
$$\begin{aligned} s_j = \frac{jC}{J} \quad (j=0,1,2,\ldots ,J) \end{aligned}$$
(8.23)
without any reservation in theory. If we take this method, we may have to solve an (transcendental) (8.23) in locating the points.
2.
An equal division with respect to a coordinate variable: For example, if C is represented as $r=r(\theta )$, it must be convenient to use the discretization
$$\begin{aligned} \theta _j = \frac{j2\pi }{J} \quad (j=0,1,2,\ldots ,J). \end{aligned}$$
(8.24)
This choice, however, means a variable transformation in (8.13) and in other integrals on C and will lead us solving a weighted least-squares problem unexpectedly. Users should notice this and be careful in applying this method of location in a problem where the boundary C is strongly deformed from a circle.^{Footnote 12}

Having located the sampling points on C, we can define discretized forms of the functions f(s), $\varphi _m(s)$, and so on:

$$\begin{aligned} \mathbf{f} = \left[ f(s_1)\ f(s_2)\ \cdots \ f(s_J) \right] ^\mathrm{T} \end{aligned}$$

(8.25)

and

$$\begin{aligned} \varvec{\varphi }_m = \left[ \varphi _m(s_1)\ \varphi _m(s_2)\ \cdots \ \varphi _m(s_J) \right] ^\mathrm{T}. \end{aligned}$$

(8.26)

Here, the superscript T denotes a transposed vector or matrix and the discretized forms are J-dimensional complex-valued column vectors. Next, we define a $J \times M$ matrix by

$$\begin{aligned} \varPhi = \left[ \varvec{\varphi }_{-N}\ \varvec{\varphi }_{-N+1}\ \cdots \ \varvec{\varphi }_N \right] = \begin{bmatrix} \varphi _{-N}(s_1)&\varphi _{-N+1}(s_1)&\cdots&\varphi _N(s_1) \\ \varphi _{-N}(s_2)&\varphi _{-N+1}(s_2)&\cdots&\varphi _N(s_2) \\ \vdots&\vdots&\ddots&\vdots \\ \varphi _{-N}(s_J)&\varphi _{-N+1}(s_J)&\cdots&\varphi _N(s_J) \\ \end{bmatrix} \end{aligned}$$

(8.27)

which is usually termed a Jacobian matrix. Finally, defining an M-dimensional solution vector

$$\begin{aligned} \mathbf{A} = \left[ A_{-N}(M)\ A_{-N+1}(M)\ \cdots \ A_N(M) \right] ^\mathrm{T}, \end{aligned}$$

(8.28)

we can represent a discretized form of an approximate solution on C in vector-matrix notation

$$\begin{aligned} \varvec{\varPsi }_N = \sum _{m=-N}^N A_m(M)\varvec{\varphi }_m = \varPhi \mathbf{A}. \end{aligned}$$

(8.29)

Thus, we have an approximation to the mean-squares boundary residual in (8.22):

$$\begin{aligned} E_{NJ} = \frac{\left\| \varPhi \mathbf{A} - \mathbf{f} \right\| ^2}{\left\| \mathbf{f} \right\| ^2}. \end{aligned}$$

(8.30)

Here, $\Vert \mathbf{f}\Vert $ denotes a Euclidean norm of a J-dimensional complex-valued vector f. Because C is a closed curve and $f(s_0)=f(s_J)$, etc., (8.30) can be understood as a trapezoidal-rule approximation of (8.22). Now, we can state a discretized form of LSP 1 as follows:

DLSP 1: E-wave, PC. Find the solution vector A that minimizes the numerator of (8.30).

Here arises an important issue of the number of sampling points^{Footnote 13}: How many J do we need? If we answer to this question in generality, we should say: It depends. However, employing the results of examination in Sect. 8.3.1, we can state an experimental rule:

$$\begin{aligned} J \doteq 2(2N+1) = 2M. \end{aligned}$$

(8.31)

Here, the symbol $\doteq $ means that the number on the right-hand side is usually sufficient in finding the scattered field. This might be considerably smaller than what uninitiates expect because DLSP 1 with the number J of (8.31) does not seem to be a good approximation of LSP 1. This is because an inner product $(C/J) \mathbf{f}^\dag \mathbf{g}$ implicitly included in the norm on the right of (8.30) cannot be a precise approximation of (f, g) in (8.22) if either $f(\mathbf{r})$ or $g(\mathbf{r})$ is a higher-order space harmonic. Nevertheless, DLSP 1 with (8.31) gives an approximate solution having converged with respect to J. We know this welcome nature of DLSP 1 since we started solving the scattering problem on a computer in early 70s. At that time the method in Sect. 8.2.3.2 was not known widely and we solved the problem using a normal equation (NE; see Appendix 2). We found (8.31) was effective even in using the NE where the inner products $(C/J) \mathbf{f}^\dag \mathbf{g}$ appeared explicitly as the matrix elements. In the method of solution that we introduce next, we do not have to calculate these inner products. That is one of the advantages of the method.

8.2.3.2 Solution Method to the Discretized Problem

To solve the least-squares problem in the J-dimensional vector space, we employ orthogonal decomposition of the Jacobian matrix : the singular-value decomposition (SVD) and the QR decomposition (QRD) [15]. They have the following features:

The SVD informs us of the character of the Jacobian matrix through singular values. This is helpful in designing and testing a process of numerical solution, in particular, choice of modal functions, number and location of sampling points, etc. Instead, the computational complexity, in both memory and time, is bigger than that of the QRD.
The QRD needs less computation than the SVD and solves the problem provided no rank deficiency occurs.^{Footnote 14}

Hence, we recommend the use of the SVD for designing and testing the discretized least-squares problem. After the problem is established, application of the QRD is appropriate. Let us see how to use these decompositions in examining and solving DLSP 1.

Utilization of the SVD

Applying the SVD, we get a decomposition of the Jacobian matrix in the form

$$\begin{aligned} \varPhi = \mathrm{U}\varSigma \mathrm{V}^\dag , \end{aligned}$$

(8.32)

where U $(J \times J)$ and V $(M \times M)$ are unitary matrices, and $\dag $ denotes Hermitian conjugation: $\mathrm{V}^\dag = \bar{\mathrm{V}}^\mathrm{T}$. $\varSigma $ is a stack of an $M \times M$ diagonal matrix and a $(J-M) \times M$ zero matrix. The diagonal elements of $\varSigma $, $\sigma _m$, are non-negative and are called the singular values of $\varPhi $. Arranging the M singular values in the order of decreasing magnitude, we have $\sigma _1 \ge \sigma _2 \ge \cdots \ge \sigma _M$ $(M=2N+1)$. Let us call $\sigma _1$ and $\sigma _M$ by $\sigma _\mathrm{max}$ and $\sigma _\mathrm{min}$ because this order of $\sigma _m$ does not necessarily agree with the order of modal functions. The following items are widely known and accepted:

The singular values are non-negative square roots of the eigenvalues of a positive semidefinite Hermitian matrix $\varPhi ^\dag \varPhi $: $\sigma _m(\varPhi ) = \sqrt{\lambda _m(\varPhi ^\dag \varPhi )}$. And, vanish of the smallest singular value, $\sigma _\mathrm{min} = 0$, means $\det \varPhi ^\dag \varPhi =0$. Because $\varPhi ^\dag \varPhi $ is the coefficient matrix of the NE (8.106) in Appendix 2, this is a serious problem: the least-squares problem does not have a unique solution. Although $\sigma _\mathrm{min}=0$ in strict sense seldom occurs in practice, very tiny $\sigma _\mathrm{min}$ is not rare and causes substantial rank deficiency.
The ratio of the maximum singular value to the minimum
$$\begin{aligned} \mathrm{cond}(\varPhi ) = \frac{\sigma _\mathrm{max}}{\sigma _\mathrm{min}} \end{aligned}$$
(8.33)
defines the condition number of $\varPhi $, which shows the degree of numerical difficulty in solving the least-squares problem with the Jacobian matrix $\varPhi $. In general, a problem with a small $\mathrm{cond}(\varPhi )$ is easy to solve and is termed well-conditioned; while one with a huge $\mathrm{cond}(\varPhi )$ is difficult and called ill-conditioned . In this connection an empirical rule is known: if the reciprocal of $\mathrm{cond}(\varPhi )$ is of the same order as or smaller than the machine epsilon^{Footnote 15} of the system of floating-point numbers, effective rank of $\varPhi $ might be less than M and DLSP 1 may not be solved properly.

Although our main purpose to employ the SVD is to check the nature of $\varPhi $, we can solve DLSP 1 in the following way:

(a)
Modifying $\Vert \varPhi \mathbf{A} - \mathbf{f} \Vert ^2$ by insertion of (8.32), we have
$$\begin{aligned} \Vert \varPhi \mathbf{A}-\mathbf{f} \Vert ^2 = \Vert \mathrm{U}^\dag (\varPhi \mathbf{A}-\mathbf{f}) \Vert ^2 = \Vert \varSigma \mathrm{V}^\dag \mathbf{A} - \mathrm{U}^\dag \mathbf{f} \Vert ^2 = \Vert \varSigma \mathbf{B} - \mathbf{d} \Vert ^2. \end{aligned}$$
(8.34)
Here, we have used that the matrices U and V are unitary and that a unitary transformation does not change the norm of a vector. Also, note that the last equal sign defines the vectors B and d.
(b)
We get the solution to DLSP 1 from
$$\begin{aligned} B_m = \frac{d_m}{\sigma _m} \quad (m=1,2, \ldots , M\; (=2N+1)) \end{aligned}$$
(8.35)
and the squared norm by
$$\begin{aligned} \Vert \varPhi \mathbf{A}-\mathbf{f} \Vert ^2 = \sum _{j=M+1}^J |d_j|^2. \end{aligned}$$
(8.36)

Utilization of the QRD

Employment of the QRD leads us to a decomposition of the form

$$\begin{aligned} \varPhi = \mathrm{Q}\tilde{\mathrm{R}} = \mathrm{Q}\left[ \begin{array}{c} \mathrm{R} \\ 0 \end{array} \right] , \end{aligned}$$

(8.37)

where Q is a $J \times J$ unitary matrix and $\tilde{\mathrm{R}}$ is a stack of $M \times M$ upper triangular matrix R and a $(J-M) \times M$ zero matrix. Having the decomposition (8.37), we can solve DLSP 1 by the following procedure:

(a)
Inserting (8.37) into $\Vert \varPhi \mathbf{A}-\mathbf{f}\Vert ^2$, we have
$$\begin{aligned} \Vert \varPhi \mathbf{A}-\mathbf{f}\Vert ^2 = \Vert \mathrm{Q}^\dag (\varPhi \mathbf{A}-\mathbf{f})\Vert ^2 = \Vert \tilde{\mathrm{R}}{} \mathbf{A} - \mathrm{Q}^\dag \mathbf{f}\Vert ^2 \equiv \left\| \left[ \begin{array}{c} \mathrm{R\mathbf{A}} \\ 0 \end{array} \right] - \left[ \begin{array}{c} \mathbf{d} \\ \mathbf{z} \end{array} \right] \right\| ^2, \end{aligned}$$
(8.38)
where the last equality defines the vectors d and z.
(b)
We obtain the solution by solving
$$\begin{aligned} \mathrm{R}{} \mathbf{A} = \mathbf{d}. \end{aligned}$$
(8.39)
Because R is triangular, we need only back substitution to solve (8.39). The residual norm is given by
$$\begin{aligned} \Vert \varPhi \mathbf{A}-\mathbf{f}\Vert ^2 = \Vert \mathbf{z}\Vert ^2. \end{aligned}$$
(8.40)

8.2.4 Application to Dielectric or Metal Obstacles

This section introduces the Yasuura’s method applied to problems with dielectric or metal obstacles [35, 43,44,45]. Although metals have unique nature, we here regard a metal as a dielectric with a complex permittivity depending on the frequency. Therefore we consider a material whose permittivity and refractive index are given by complex numbers $\varepsilon $ and $n=\sqrt{\varepsilon /\varepsilon _0}$. Usually the material is penetrable and there is a non-zero transmitted field in $\mathrm{S}_\mathrm{i}$, the complementary region of $\mathrm{S}_\mathrm{e}$. Thus we have two unknown functions $\varPsi _{\,\mathrm i}(\mathbf{r})$ $(\mathbf{r}\in \mathrm{S}_\mathrm{i})$ and $\varPsi _\mathrm{e}(\mathbf{r})$ $(\mathbf{r}\in \mathrm{S}_\mathrm{e})$; and we need two boundary conditions to determine the two unknown functions. The continuity of tangential components of the electric and magnetic field satisfies the necessity.

8.2.4.1 E-Wave Scattering by a Cylindrical Obstacle Made of a Dielectric

Let us assume that the obstacle in Fig. 8.1 is made of a dielectric and that an E-wave is incident. The electric field in $\mathrm{S}_\mathrm{e}$ is a sum of the incident and the scattered wave: $\mathbf{u}_z(F + \varPsi _\mathrm{e})(\mathbf{r})$; while the field in the interior region $\mathrm{S}_\mathrm{i}$ is the transmitted field $\mathbf{u}_z \varPsi _{\,\mathrm i}(\mathbf{r})$. They are the solutions of Helmholtz’s equation in each region:

$$\begin{aligned} \left\{ \begin{array}{ll} \left( \nabla ^2+k^2 \right) \varPsi _\mathrm{e}(\mathbf{r}) = 0 &{} \quad (\mathbf{r}\in \mathrm{S}_\mathrm{\mathrm{e}}), \\ \left( \nabla ^2+(nk)^2 \right) \varPsi _{\,\mathrm i}(\mathbf{r}) = 0 &{} \quad (\mathbf{r}\in \mathrm{S}_\mathrm{\mathrm{i}}). \end{array} \right. \end{aligned}$$

(8.41)

The exterior solution, in addition, should meet the radiation condition (8.5). The continuity of tangential components of electric and magnetic fields requires the boundary conditions^{Footnote 16}

$$\begin{aligned} \left\{ \begin{array}{l} \varPsi _\mathrm{e}(s) - \varPsi _{\,\mathrm i}(s) = f(s) \equiv -F(s), \\ \dfrac{\partial \varPsi _\mathrm{e}(s)}{\partial \nu } - \dfrac{\partial \varPsi _{\,\mathrm i}(s)}{\partial \nu } = g(s) \equiv -\dfrac{\partial F(s)}{\partial \nu }. \end{array} \right. \end{aligned}$$

(8.42)

Thus we get a boundary-value problem for $\varPsi _\mathrm{e}(\mathbf{r})$ and $\varPsi _{\,\mathrm i}(\mathbf{r})$:

Problem 2 E-wave, dielectric. Find the electric fields $\varPsi _\mathrm{e}(\mathbf{r})$ and $\varPsi _{\,\mathrm i}(\mathbf{r})$ that satisfy (8.41), (8.5), and (8.42).

Note that in dealing with an H-wave problem, the second line of (8.42) should include the refractive index n or permittivity $\varepsilon $ (see (8.100) in Appendix 1).

8.2.4.2 Modal Functions and Approximate Solutions

We need two sets of modal functions to solve the problem, one is for $\varPsi _\mathrm{e}(\mathbf{r})$ and another is for $\varPsi _{\,\mathrm i}(\mathbf{r})$. Let us call them the exterior and interior modal functions and represent them as $\{\varphi _{\mathrm{e}m}(\mathbf{r})\}$ and $\{\varphi _{{\,\mathrm i}m}(\mathbf{r})\}$. They should satisfy the requirements below, which are almost in common with the conditions from (M1) through (M3) given in Sect. 8.2.2.1

(MD1):: Each member of the set of exterior modal functions satisfies the Helmholtz equation in $\mathrm{S}_\mathrm{e}$ and meets the radiation condition at infinity.
(MD2):: Each member of the set of interior modal functions satisfies the Helmholtz equation in $\mathrm{S}_\mathrm{i}$.
(MD3):: The sets of boundary values $\{\varphi _{\mathrm{e}m}(s)\}$ and $\{\varphi _{{\,\mathrm i}m}(s)\}$, and the sets of normal derivatives $\{\partial \varphi _{\mathrm{e}m}(s)/\partial \nu \}$ and $\{\partial \varphi _{{\,\mathrm i}m}(s)/\partial \nu \}$ are all complete in the function space H.

Here, we take the sets of separated solutions again because they are familiar to many people working with boundary-value problems. Then, the exterior and interior modal functions are:

$$\begin{aligned} \left\{ \begin{array}{l} \varphi _{\mathrm{e}m}(\mathbf{r})=H_m^{(2)}(kr)\exp (\mathrm{i}m\theta ), \quad \varphi _{{\,\mathrm i}m}(\mathbf{r})=J_m(nkr)\exp (\mathrm{i}m\theta ) \\ \quad \quad \quad (m=0,\pm 1,\pm 2,\ldots ). \end{array}\right. \end{aligned}$$

(8.43)

Here, $J_m(nkr)$ stands for the Bessel function of order m. Then, we can define approximate solutions in $\mathrm{S}_\mathrm{e}$ and $\mathrm{S}_\mathrm{i}$ as

$$\begin{aligned} \left\{ \begin{array}{ll} \displaystyle \varPsi _{\mathrm{e}N}(\mathbf{r}) = \sum _{m=-N}^N A_{\mathrm{e}m}(M)\varphi _{\mathrm{e}m}(s) &{} \quad (\mathbf{r}\in \mathrm{S}_\mathrm{e}), \\ \displaystyle \varPsi _{\mathrm{i}N}(\mathbf{r}) = \sum _{m=-N}^N A_{\mathrm{i}m}(M)\varphi _{{\,\mathrm i}m}(s) &{} \quad (\mathbf{r}\in \mathrm{S}_\mathrm{i}). \end{array}\right. \end{aligned}$$

(8.44)

They satisfy the Helmholtz equation in each region and $\varPsi _{\mathrm{e}N}$ meets the radiation condition.

8.2.4.3 Error Estimation and Least-Squares Boundary Matching

After some analytical work we get error estimations similar to (8.18)^{Footnote 17}:

$$\begin{aligned} \begin{aligned} |\varPsi _{\mathrm{e}N}(\mathbf{r}) - \varPsi _\mathrm{e}(\mathbf{r})|\ \le&\; G_{\mathrm{e}1}(\mathrm{S}_{\mathrm{e}0}) \left\| \frac{\partial \varPsi _{\mathrm{e}N}}{\partial \nu } - \frac{\partial \varPsi _{\mathrm{i}N}}{\partial \nu } - g \right\| \\&+ G_{\mathrm{e}2}(\mathrm{S}_{\mathrm{e}0})\, \Vert \varPsi _{\mathrm{e}N}-\varPsi _{\mathrm{i}N} - f \Vert \quad (\mathbf{r}\in \mathrm{S}_{\mathrm{e}0} \subset \mathrm{S}_\mathrm{e}) \end{aligned} \end{aligned}$$

(8.45)

and

$$\begin{aligned} \begin{aligned} |\varPsi _{\mathrm{i}N}(\mathbf{r}) - \varPsi _{\,\mathrm i}(\mathbf{r})|\ \le&\; G_{\mathrm{i}1}(\mathrm{S}_{\mathrm{i}0}) \left\| \frac{\partial \varPsi _{\mathrm{e}N}}{\partial \nu } - \frac{\partial \varPsi _{\mathrm{i}N}}{\partial \nu } - g \right\| \\&+ G_{\mathrm{i}2}(\mathrm{S}_{\mathrm{i}0})\, \Vert \varPsi _{\mathrm{e}N}-\varPsi _{\mathrm{i}N} - f \Vert \quad (\mathbf{r}\in \mathrm{S}_{\mathrm{i}0} \subset \mathrm{S}_\mathrm{i}). \end{aligned} \end{aligned}$$

(8.46)

Here, $\mathrm{S}_{\mathrm{e}0}$ and $\mathrm{S}_{\mathrm{i}0}$ are arbitrary closed regions in $\mathrm{S}_\mathrm{e}$ and $\mathrm{S}_\mathrm{i}$, and $G_{\mathrm{p}q}$ (p = e, i; q = 1, 2) are positive constants depending on $\mathrm{S}_{\mathrm{e}0}$ and $\mathrm{S}_{\mathrm{i}0}$.

We can prove that: provided the sets of modal functions satisfy the requirement (MD3), there exists a sequence of pairs of approximate solutions

$$\begin{aligned} \left[ \begin{array}{c} \varPsi _{\mathrm{e}0}(\mathbf{r}) \\ \varPsi _{\mathrm{i}0}(\mathbf{r}) \end{array} \right] ,\; \left[ \begin{array}{c} \varPsi _{\mathrm{e}1}(\mathbf{r}) \\ \varPsi _{\mathrm{i}1}(\mathbf{r}) \end{array} \right] ,\; \ldots \;, \left[ \begin{array}{c} \varPsi _{\mathrm{e}N}(\mathbf{r}) \\ \varPsi _{\mathrm{i}N}(\mathbf{r}) \end{array} \right] ,\; \ldots \end{aligned}$$

(8.47)

whose boundary values and normal derivatives satisfy

$$\begin{aligned} E_N \equiv \frac{\big \Vert \varPsi _{\mathrm{e}N} - \varPsi _{\mathrm{i}N} - f \big \Vert ^2}{\Vert f \Vert ^2} + \frac{\big \Vert \partial \varPsi _{\mathrm{e}N}/\partial \nu - \partial \varPsi _{\mathrm{i}N}/\partial \nu - g \big \Vert }{\Vert g\Vert ^2} \rightarrow 0\ \quad (N \rightarrow \infty ). \end{aligned}$$

(8.48)

The sequence (8.47), hence, converges to the true solutions of the problem uniformly in wider sense in $\mathrm{S}_\mathrm{e}$ and $\mathrm{S}_\mathrm{i}$:

$$\begin{aligned} \varPsi _{\mathrm{p}N}(\mathbf{r}) \rightarrow \varPsi _\mathrm{p}(\mathbf{r})\quad (N \rightarrow \infty ;\ \mathrm{p} = 1,2;\; \text{ uniformly } \text{ in } \mathrm{S}_\mathrm{p0}\text{) }. \end{aligned}$$

(8.49)

Members of such a sequence can be found by solving the least-squares problem:

LSP 2: E-wave, dielectric. Find the modal coefficients $\{A_{\mathrm{p}m}(M):\; m=0,\pm 1, $ $\ldots , \pm N\}$ (p = e, i) that minimize the normalized mean-square error $E_N$ defined in (8.48).

It is worth to note the following matters: Because the convergence in (8.48) is a consequence of completeness of the four sets of boundary functions in a product space $\mathbf{H}\times \mathbf{H}$, the choice of denominators in (8.48), $\Vert f \Vert ^2$ and $\Vert g \Vert ^2$, is no more than a convention to get non-dimensional quantities or to unify the units.^{Footnote 18} Speaking from a computational point of view, however, the ratio $\Vert f \Vert ^2/\Vert g \Vert ^2$ may have an effect on the condition number of LSP 2 and, sometimes it is effective to introduce a parameter $\gamma \;(0< \gamma < 1)$ to modify the definition of $E_N$ as

$$\begin{aligned} E_N \equiv \gamma \, \frac{\big \Vert \varPsi _{\mathrm{e}N} - \varPsi _{\mathrm{i}N} - f \big \Vert ^2}{\Vert f \Vert ^2} + (1-\gamma )\, \frac{\big \Vert \partial \varPsi _{\mathrm{e}N}/\partial \nu - \partial \varPsi _{\mathrm{i}N}/\partial \nu - g \big \Vert ^2}{\Vert g \Vert ^2}. \end{aligned}$$

(8.50)

The parameter should be determined by optimization to get a permissible condition number.

8.2.4.4 Notes on the Method of Numerical Computation

In above formulation we have $2M = 2(2N + 1)$ unknowns. If we apply the rule in Sect. 8.3.1 (and also in Sect. 8.2.3.1), we need $2\times 2M = 4M$ linear equations. The number of sampling points required for the 4M equations, however, is 2M again. This is because we have two equations at each sampling point: the first and the second equation of (8.42).

Let us follow the method of discretization in Sect. 8.2.3.1. Locating $J\; (= M = 2(2N + 1))$ sampling points on C, we define J-dimensional vectors

$$\begin{aligned} \left\{ \begin{array}{l} \mathbf{f}=\left[ f(s_1)\ f(s_2)\ \cdots \ f(s_J)\right] ^\mathrm{T}, \\ \mathbf{g}=\left[ g(s_1)\ g(s_2)\ \cdots \ g(s_J)\right] ^\mathrm{T}, \end{array} \right. \end{aligned}$$

(8.51)

$$\begin{aligned} \left\{ \begin{array}{l} \varvec{\varphi }_{\mathrm{e}m} = \left[ \varphi _{\mathrm{e}m}(s_1)\;\varphi _{\mathrm{e}m}(s_2)\; \cdots \;\varphi _{\mathrm{e}m}(s_J) \right] ^\mathrm{T}, \\ \varvec{\varphi }_{\mathrm{i}m} = \left[ \varphi _{{\,\mathrm i}m}(s_1)\;\varphi _{{\,\mathrm i}m}(s_2)\; \cdots \;\varphi _{{\,\mathrm i}m}(s_J) \right] ^\mathrm{T} \end{array} \right. \end{aligned}$$

(8.52)

and

$$\begin{aligned} \left\{ \begin{array}{l} \partial _\nu \varvec{\varphi }_{\mathrm{e}m} = \left[ \partial _\nu \varphi _{\mathrm{e}m}(s_1)\; \partial _\nu \varphi _{\mathrm{e}m}(s_2)\;\cdots \;\partial _\nu \varphi _{\mathrm{e}m}(s_J) \right] ^\mathrm{T}, \\ \partial _\nu \varvec{\varphi }_{\mathrm{i}m} = \left[ \partial _\nu \varphi _{{\,\mathrm i}m}(s_1)\; \partial _\nu \varphi _{{\,\mathrm i}m}(s_2)\;\cdots \;\partial _\nu \varphi _{{\,\mathrm i}m}(s_J) \right] ^\mathrm{T}, \end{array} \right. \end{aligned}$$

(8.53)

where the mode-number m runs from $-N$ to N.

Next, we construct four $J \times M$ matrices

$$\begin{aligned} \left\{ \begin{array}{l} \varPhi _{11} = \left[ \varvec{\varphi }_{\mathrm{e},-N}\;\varvec{\varphi }_{\mathrm{e},-N+1}\;\cdots \; \varvec{\varphi }_{\mathrm{e},N}\right] , \\ \varPhi _{12} = \left[ \varvec{\varphi }_{\mathrm{i},-N}\;\varvec{\varphi }_{\mathrm{i},-N+1}\;\cdots \; \varvec{\varphi }_{\mathrm{i},N}\right] \end{array} \right. \end{aligned}$$

(8.54)

and

$$\begin{aligned} \left\{ \begin{array}{l} \varPhi _{21} = \left[ \partial _\nu \varvec{\varphi }_{\mathrm{e},-N}\;\partial _\nu \varvec{\varphi }_{\mathrm{e},-N+1}\;\cdots \; \partial _\nu \varvec{\varphi }_{\mathrm{e},N}\right] . \\ \varPhi _{22} = \left[ \partial _\nu \varvec{\varphi }_{\mathrm{i},-N}\;\partial _\nu \varvec{\varphi }_{\mathrm{i},-N+1}\;\cdots \; \partial _\nu \varvec{\varphi }_{\mathrm{i},N}\right] . \end{array}\right. \end{aligned}$$

(8.55)

Arranging the four matrices, we get a $2J \times 2M$ Jacobian matrix

$$\begin{aligned} \varPhi = \left[ \begin{array}{cc} p\varPhi _{11} &{} p\varPhi _{12} \\ q\varPhi _{21} &{} q\varPhi _{22} \end{array} \right] . \end{aligned}$$

(8.56)

Here,

$$\begin{aligned} p = \frac{\gamma }{\mathbf{f\,}^\dag \mathbf{f}}, \quad q = \frac{1-\gamma }{\mathbf{g}^\dag \mathbf{g}} \end{aligned}$$

(8.57)

are normalizing constants with the parameter $\gamma $ appeared in (8.50).^{Footnote 19} Finally, defining a 2M-dimensional solution vector

$$\begin{aligned} \mathbf{A}= \left[ \begin{array}{c} \mathbf{A}_\mathrm{e} \\ \mathbf{A}_\mathrm{i} \end{array} \right] , \end{aligned}$$

(8.58)

where

$$\begin{aligned} \left\{ \begin{array}{l} \mathbf{A}_\mathrm{e} = \left[ A_{\mathrm{e},-N}(M)\ A_{\mathrm{e},-N+1}(M)\;\cdots \ A_{\mathrm{e},N}(M) \right] ^\mathrm{T}, \\ \mathbf{A}_\mathrm{i} = \left[ A_{\mathrm{i},-N}(M)\ A_{\mathrm{i},-N+1}(M)\;\cdots \ A_{\mathrm{i},N}(M) \right] ^\mathrm{T} \end{array} \right. \end{aligned}$$

(8.59)

are $M\;(=2N+1)$ dimensional column vectors. Thus, we can state a discretized problem as:

DLSP 2: E-wave, dielectric. Find the solution vector A that minimizes the discretized form of normalized boundary residual

$$\begin{aligned} E_{NJ} = \left\| \varPhi \mathbf{A}- \left[ \begin{array}{c} p\,\mathbf{f} \\ q\,\mathbf{g} \end{array}\right] \right\| ^2 = \left\| \begin{array}{c} p\varPhi _{11}\mathbf{A}_\mathrm{e} + p\varPhi _{12}\mathbf{A}_\mathrm{i} -p\mathbf{f} \\ q\varPhi _{21}\mathbf{A}_\mathrm{e} + q\varPhi _{22}\mathbf{A}_\mathrm{i} -q\mathbf{g} \end{array} \right\| ^2. \end{aligned}$$

(8.60)

8.2.5 Application to Gratings

Here we consider the problem of plane-wave diffraction by a grating and state the points of difference from scattering by a cylindrical obstacle. The book edited by Petit [32] includes a nice introduction to Yasuura’s method applied to grating problems as of late 70s.

8.2.5.1 Diffraction by a PC Grating

Figure 8.3 shows the cross section of a grating, an incident wave, and the system of coordinates. The cross section C is periodic in X with a period d and the surface is uniform in Z. The semi-infinite region S over C is a vacuum and the region below C is occupied by a PC. We assume C is represented by a single-valued smooth function

$$\begin{aligned} \mathrm{C}:\ y = \eta (x), \end{aligned}$$

(8.61)

where $\eta (x)$ is periodic in x, $\eta (x+d)=\eta (x)$, and (x, y) denotes a point on C.

Let an electromagnetic wave having an electric field

$$\begin{aligned} \mathbf{u}_ZF(\mathbf{r}) = \mathbf{u}_Z \exp (-\mathrm{i}kX\sin \theta + \mathrm{i}kY\cos \theta ) \end{aligned}$$

(8.62)

is incident on the grating. This case of polarization is termed E-wave, TE wave, or s-polarization.^{Footnote 20} Here, $\mathbf{r}= (X, Y)$ is a point in S, $\mathbf{u}_Z$ is a unit vector in Z, and $\theta $ is the angle of incidence shown in Fig. 8.3. The diffracted electric field has only a Z-component, which we describe by $\varPsi (\mathbf{r})$. $\varPsi (\mathbf{r})$ is the solution of the following problem.

Problem 3 E-wave, PC grating. Find $\varPsi (\mathbf{r})$ that satisfies the conditions below:

(GD1):: The 2-D Helmholtz equation in S;
(GD2):: A radiation condition in Y that $\varPsi (\mathbf{r})$ propagates or attenuates in positive Y;
(GD3):: A periodicity condition
$$\begin{aligned} \varPsi (X+d, Y) = \exp (-\mathrm{i}kd\sin \theta )\,\varPsi (X,Y); \end{aligned}$$
(8.63)
(GD4):: The boundary condition
$$\begin{aligned} \varPsi (x, \eta (x)) = f(x) \equiv -F(x, \eta (x)). \end{aligned}$$
(8.64)

Conditions (GD1) and (GD4) are common to the case of cylindrical obstacle in Sect. 8.2.1, while (GD2) is quite different from (D2) and (GD3) is a new requirement. These differences come from the pseudo-periodic nature of the problem. Because a grating has a periodic structure, and because we have assumed a plane-wave incidence, the phenomena at (X, Y) and $(X+d, Y)$ are almost the same; the only discrepancy can be seen in the phase difference (8.63). Hence, if we divide S by vertical lines $X=0,\pm d, \pm 2d, \ldots $ as shown in Fig. 8.3, the diffracted fields in neighboring strip regions are the same except for the phase shift. This is a characteristic feature of a grating problem called quasi- or pseudo-periodicity and explains why the 1-D radiation condition appears in (GD2). In solving a grating problem, hence, we can assume that the observation point $\mathbf{r}= (X, Y)$ is inside the first strip region $\mathrm{S}_1$ $(0 < X \le d;\; Y \ge \eta (X))$ shown in Fig. 8.3.

8.2.5.2 Modal Functions, Approximate Solution, and Key Points in the Solution Method

Here again we choose separated solutions as modal functions. The separated solutions satisfying the periodicity are known as Floquet modes. We take the Floquet modes satisfying the radiation condition (GD2)

$$\begin{aligned} \varphi _m(\mathbf{r}) = \exp (-\mathrm{i}\alpha _mX - \mathrm{i}\beta _mY) \quad (m=0, \pm 1, \pm 2,\ldots ) \end{aligned}$$

(8.65)

as the set of modal functions, where

$$\begin{aligned} \alpha _m = k\sin \theta +\frac{2m\pi }{d}, \quad \beta _m = \sqrt{k^2 - \alpha _m^2} \quad (\text{ Re }\,\beta _m \ge 0,\ \text{ Im }\,\beta _m \le 0). \end{aligned}$$

(8.66)

The term $k\sin \theta $ in $\alpha _m$ is for the periodicity, the definition of $\beta _m$ implies the Helmholtz equation, and the sign of $\beta _m$ (positive or negative imaginary) is for the radiation condition.

We construct an approximate solution following the way we took in Sect. 8.2.2.2:

$$\begin{aligned} \varPsi _N(\mathbf{r}) = \sum _{m=-N}^N A_m^\mathrm{E}(M)\,\varphi _m(\mathbf{r}). \end{aligned}$$

(8.67)

This solution satisfies conditions (GD1), (GD2), and (GD3). Hence, the $A_m^\mathrm{E}$ coefficients^{Footnote 21} should be determined so that the solution satisfies the boundary condition (GD4) approximately. Let us see briefly the least-squares boundary matching works to yield a sequence of solutions converging to the true solution.

Some analysis starting from an assumption that $\mathbf{r}$ is inside a closed region $\text{ S }_{10}\; (\subset \text{ S }_1)$ leads us to an estimation

$$\begin{aligned} |\varPsi _N(\mathbf{r})-\varPsi (\mathbf{r})| \le G(\text{ S }_{10})\Vert \tilde{\varPsi }_N - \tilde{f}\Vert \quad (\mathbf{r}\in \text{ S }_{10} \subset \text{ S }_1). \end{aligned}$$

(8.68)

Here, G is a positive constant depending on the closed region $\text{ S }_{10}$ and the quantities with tildes, e.g. $\tilde{f}$, mean periodic functions derived from the pseudo-periodic functions:

$$\begin{aligned} \tilde{f}(s) = \exp (\mathrm{i}\alpha _{\,0} x)f(x,y) = -\exp (-\mathrm{i}\beta _{\,0} y), \end{aligned}$$

(8.69)

$$\begin{aligned} \tilde{\varPsi }_N(s)=\exp (\mathrm{i}\alpha _{\,0} x)\varPsi _N(x,y)= \sum _{m=-N}^N A_m^\mathrm{E}(M)\,\tilde{\varphi }_m(x,y), \end{aligned}$$

(8.70)

and

$$\begin{aligned} \tilde{\varphi }_m(x,y) = \exp (\mathrm{i}\alpha _{\,0} x)\varphi _m(x,y) = \exp \left( -\frac{2m\pi \mathrm{i}x}{d} - \mathrm{i}\beta _my\right) . \end{aligned}$$

(8.71)

The norm of a function g(s) defined on $\text{ C }_1$, the first period of C, is defined by

$$\begin{aligned} \Vert g \Vert = \left[ \int \limits _{s=0}^C |g(s)|^2 ds\right] ^{1/2}, \end{aligned}$$

(8.72)

where C denotes the length of $\text{ C }_1$. Thus we have a least-squares problem:

LSP 3: E-wave, PC grating. Find the $A_m^\mathrm{E}$ coefficients that minimize the numerator of the normalized mean-square error

$$\begin{aligned} E_N = \frac{\left\| \tilde{\varPsi }_N - \tilde{f} \right\| ^2}{\left\| \tilde{f} \right\| ^2}. \end{aligned}$$

(8.73)

The modification of the boundary values to define the periodic functions is the key point in the solution of grating problems. Introducing the modification, we can establish a correspondence between one period of the grating surface $\text{ C }_1$ and the cross section of a cylindrical obstacle C in Sect. 8.2.1.^{Footnote 22} The method of numerical solution for LSP 3 is similar to that in Sect. 8.2.3. To solve the problem of diffraction by a grating made of dielectric or metal we can combine the method in this section with that in Sect. 8.2.4. Guidance to the problem of conical diffraction can be found in Appendix 3.

8.3 Numerical Examples

In this section we show some results of numerical computations obtained by the methods in the last section. First, we examine the nature of the Jacobian matrices taking grating problems as examples to show the validity of the experimental rule (8.31). Meanwhile we add some comments that are useful in applying the method. Then we give the results of four problems of practical interest.

8.3.1 Rule on the Number of Sampling Points

We have solved the problem of diffraction by a grating made of PC and by one made of BK7 optical glass varying the number of sampling points or of linear equations. The results support our experimental rule. In addition, we have made a comparison between the two methods of locating the sampling points, (8.23) and (8.24), introduced in Sect. 8.2.3.1 and found little difference in the rage $J \ge 2M$ for the problem parameters employed in numerical analysis.

8.3.1.1 A PC Grating

We consider the grating shown in Fig. 8.3 and assume that the cross section C is given by^{Footnote 23}

$$\begin{aligned} \mathrm{C}: y = H\left( \cos \frac{2\pi x}{d} - 1\right) . \end{aligned}$$

(8.74)

We assume also that an E- or H-polarized plane wave is incident at $\theta = 0$ (normal incidence). Other physical parameters are: $d=556$ nm, $H/d=0.15$, and $\lambda = 500$ nm.^{Footnote 24} The computational parameters are: the number of truncation $N = 20$; the total number of modal functions $M = 41$; and the number of sampling points J is in the range $M \le J \le 4M$. This means that the number of unknown coefficients is M and the number of linear equations is between M and 4M.

The first example, Fig. 8.4, shows the convergence of the solution and related parameters in the E-wave. The curves in Fig. 8.4a includes the maximum and minimum singular value, the condition number $\text{ cond }(\varPhi )$, $E_{20\,J}$ of (8.30), and an error on the power balance

$$\begin{aligned} e_{NJ} = 1-\sum _\mathrm{prop} \rho _m = 1-\sum _{\beta _m > 0}\frac{\beta _m}{\beta _{\,0}}\, |A_m^\mathrm{E}(M,J)|^2, \end{aligned}$$

(8.75)

where $\sum _\mathrm{prop}$ and $\sum _{\beta _m > 0}$ mean the summation in respect to the propagating orders.^{Footnote 25} We observe these quantities are approaching final values with increasing J; and have converged for $J \ge 2M$.^{Footnote 26} Figure 8.4b illustrates the convergence of $A_0^\mathrm{E}(20,J)$ and $A_{10}^\mathrm{E}(20,J)$ coefficients. The former has converged before reaching $J=2M$; while the latter is with small ripples until $J=2.2M$. We, however, can neglect this oscillation in finding the diffracted wave because the mode with $m=10$ is evanescent and cannot be observed at a point apart from the grating surface.

The second set of figures, Fig. 8.5, displays the same thing for the H-wave. The curves in Fig. 8.5a show the max and min singular value, $\text{ cond }(\varPhi )$, $E_{20\,J}$, and $e_{20\,J}$. While in Fig. 8.5b we show the convergence of $A_0^\mathrm{H}(20, J)$ and $A_{10}^\mathrm{H}(20, J)$. We observe all the quantities have converged substantially in the range $J \ge 2M$.

The third example, Fig. 8.6, shows the convergence of solutions: N dependence of the normalized mean-square error $E_N$ and energy error $e_N$ of E- and H-wave solutions. The rule $J=2M$ is applied. Because the surface modulation is moderate in the problem, we get precise solutions with $10^{-6}$ or $10^{-4}$ percent energy error easily for both E- and H-wave problem.^{Footnote 27} It is worth to mention that a modal coefficient—e.g. $A_m^\mathrm{E}(M)$, as a function of M, converges to a final value: $A_m^\mathrm{E}(M) \rightarrow A_m^\mathrm{E}$ $(M \rightarrow \infty )$. This convergence, however, is not uniform with respect to m.

8.3.1.2 A BK7 Optical Glass Grating

Here we examine the case of a dielectric grating made of an optical glass BK7 whose refractive index is 1.5139 [1]. Other parameters are the same as in Sect. 8.3.1.1. In the present problem we have transmitted fields ($\mathbf{E}^\mathrm{t}$ and $\mathbf{H}^\mathrm{t}$) in the region $\text{ V }_2$ below the grating surface in addition to the reflected fields ($\mathbf{E}^\mathrm{r}$ and $\mathbf{H}^\mathrm{r}$) over the grating $\text{ V }_1$. We, hence, define approximate solutions following (8.44) in Sect. 8.2.4. That is, we employ Floquet modes in $\text{ V }_1$ and $\text{ V }_2$ and construct approximations of leading fields in each region in the form of finite linear combinations of the Floquet modes. Let the number of truncation be N. Then, we have $2(2N+1)=2M$ unknown coefficients in total.

Figure 8.7 shows the convergence of the solution and related parameters as functions of the number of sampling points J in the E-wave and $N=20$. We observe that all the errors and the parameters have converged in the range $J \ge 2M =2(2N+1)$ except for small ripples. Figure 8.8 shows the same thing in the H-wave. This means that the number of linear equations in the least-squares problem can be twice as many as the number of unknowns ($2 \times 2M = 4(2N+1)$; see Sect. 8.2.4.4).

Figure 8.9 illustrates the N dependence of the errors of the solutions. We get precise solutions with $10^{-5}$ percent energy error easily on a personal computer.

8.3.2 Scattering by Relatively Deep Gratings

Yasuura’s method, when combined with the partition of the groove region, can solve the problem of diffraction from a deep grating with a depth-to-period ratio beyond unity. In the conventional Yasuura’s method without partition, this ratio is said to be about 0.5 and a little less than 0.4 in the E- and H-wave cases, respectively. In the present subsection, some numerical results are given for the scattering by relatively deep gratings using a combination of up-and down-going Floquet modal functions [22].

The period and height of the sinusoidal profile are d and 2H, respectively, as shown in Fig. 8.10. At first we deal with a perfectly conducting grating as a fundamental problem where the electromagnetic fields exist only in the vacuum region. The semi-infinite region over the grating surface is divided into an upper half plane $U_0$ and a groove region a fictitious boundary (a horizontal line). The latter is further divided into shallow horizontal layers $\text {U}_1, \text {U}_2, \ldots , \text {U}_Q$ again by fictitious boundaries.

An approximate solution in $\text {U}_0$, that is $\varPsi _{0N}(\mathbf{r})$, is defined in a usual manner as (8.67), while the solutions in $\text {U}_q$ ($q = 1, 2, \ldots , Q$) include not only the up-going but also the down-going modal functions as

$$\begin{aligned} \varPsi _{qN}(\mathbf{r}) = \sum _{m=-N}^N \left[ A_{qm}^+(N)\, \varphi _m^+ (\mathbf{r}-\mathbf{u}_Y y_q) + A_{qm}^-(N)\, \varphi _m^- (\mathbf{r}-\mathbf{u}_Y y_{q-1}) \right] , \end{aligned}$$

(8.76)

where $\varphi _m^\pm (\mathbf{r}) = \exp (\mathrm{i}\alpha _m X \pm \mathrm{i}\beta _m Y)$, and the plane $Y = y_q$ is the boundary between $\text {U}_q$ and $\text {U}_{q+1}$. Thus the total number of unknown coefficients is $(2N + 1)(2Q + 1)$. These coefficients should be determined in order that the solutions meet the boundary condition (GD4) and an additional set of boundary conditions on the Q fictitious boundaries:

$$\begin{aligned} \left\{ \begin{array}{l} \displaystyle \left. \left( F\delta _{q0} + \varPsi _q \right) \right| _{Y=y_q+0} = \left. \varPsi _{q+1} \right| _{Y=y_q-0}, \\ \displaystyle \left. \frac{\partial \left( F\delta _{q0} + \varPsi _q \right) }{\partial Y} \right| _{Y=y_q-0} = \left. \frac{\partial \varPsi _{q+1}}{\partial Y} \right| _{Y=y_q-0}, \end{array} \right. \end{aligned}$$

(8.77)

where $\delta _{q0}$ is Kronecker’s delta. The mean-square error is defined in the same form as (8.73), but the integration range in the norm (8.72) must include not only the grating surface but also the fictitious boundaries.

Let us check the convergence of the results obtained by the present method. Figure 8.11 shows the variation of the normalized mean-square error and the energy error as functions of the number of truncation N for both E- and H-wave incidence. As is observed in these figures, the mean-square error decreases as N increases. An approximate solution with 0.1 percent energy error is accomplished at $N = 14$ for an E-wave. In the H-wave case convergence of solutions is not so fast as in the E-wave case. We attain to one percent energy error at $N = 23$ in that case of polarization.

Figure 8.12 shows comparison of reflection efficiency for a perfectly conducting grating as functions of the incident angle at E-wave incidence. The numbers (N, Q) are (15, 4), (15, 5), and (30, 20) as $H/d=0.31$, 0.4, and 1.066, respectively. The curves and symbols represent the present results and the results by the integral equation method [46]. We find good agreement between the results. For dielectric gratings, partition must be made not only in the vacuum region but also in the dielectric one. As a result, numbers of unknown modal coefficients and boundary conditions become doubled compared with the previous case.

Figure 8.13 shows comparison of transmission efficiency for a dielectric grating as functions of the incident angle at H-wave incidence. The numbers (N, Q) are (11,4). The curves and symbols represent the present results and the results by the finite element method [20]. We find that the results agree with each other except for the grazing limit.

Although there are a couple of methods that are capable of solving the problems of extremely deep gratings, the present results make sense because they show a limit of a conventional modal-expansion approach when using the Floquet modes as basis functions.

8.3.3 Plasmon Surface Waves Excited on a Metal Grating Placed in Conical Mounting

We show some numerical results in regard to plasmon surface waves excitation on a metal grating placed in conical mounting [29]. Conical mounting is an optical arrangement in which the plane of incidence is not perpendicular to grooves of a grating as shown in Fig. 8.14. Readers can find detailed description of problem in Appendix 3. We here illustrate the results obtained by the method explained there.

We deal with a sinusoidal silver grating whose surface profile is given by $z = H\sin (2\pi x/d)$. The upper region $\text{ V }_1$ over the grating surface is assumed to be vacuum with a refractive index $n_1 = 1$ and the grating is made of silver with a complex refractive index $n_2$. As an incident light we consider an electromagnetic plane wave, which is specified by the wavenumber in vacuum ($k = 2\pi /\lambda $), the polar angle ($\theta $) between the wavevector and the grating normal, and the azimuthal angle ($\phi $) between the X axis and the plane of incidence.

The diffracted fields in the conical mounting are decomposed into a TE and a TM component which mean that the relevant electric and magnetic field are perpendicular to the plane of incidence. The efficiency of the mth-order diffracted mode in $\text{ V }_1$, hence, is represented as $\rho _m = \rho _m^\mathrm{TE} + \rho _m^\mathrm{TM}$. Here, $\rho _m^\mathrm{TE}$ or $\rho _m^\mathrm{TM}$ is the efficiency of the TE- or TM-component of the mth-order diffracted mode.^{Footnote 28} In the numerical examples below we deal with a shallow grating made of silver with a period $d = 0.556\,\upmu $m and an amplitude $H = 0.0278\,\upmu $m. Yasuura’s method provides sufficiently reliable results for the problem of such a grating at the truncation number of the approximate solutions $N = 10$.

Figure 8.15 shows the efficiency of the 0th-order diffracted mode $\rho _0$ and the total diffraction efficiency $\rho ^\mathrm{Total}$ as functions of wavelength $\lambda $.^{Footnote 29} The incident light is in the TM incidence—a polarization angle $\delta = \pi /2$ in (8.114) of Appendix 3—where the magnetic field is perpendicular to the plane of incidence. The polar angle and the azimuthal angle are chosen as $\theta = 9.2^\circ $ and $\phi = 30^\circ $. As a complex refractive index of silver n we take the interpolated values for the experimental data in the literature [8]. In the figure we observe partial absorption of incident light at $\lambda = 0.515\,\upmu $m and $\lambda = 0.650\,\upmu $m as dips in the total efficiency curve.^{Footnote 30} As we will see later, the dips are associated with plasmon resonance absorption, which is caused by coupling of surface plasmons with an evanescent mode diffracted by the grating [21, 34].

Figure 8.16 shows the 0th-order efficiency $\rho _0$ and the TE and TM component $\rho _0^\mathrm{TM}$ and $\rho _0^\mathrm{TE}$ as functions of $\theta $ with a fixed azimuthal angle $\phi = 30^\circ $. The wavelength is chosen as $\lambda = 0.650\,\upmu $m and a refractive index is $n_2 = 0.07 - \mathrm{i}4.2$. Remaining parameters are the same as those of Fig. 8.15. We observe in Fig. 8.16 partial absorption of incident light at $\theta = 9.2^\circ $, we call it a resonance angle, as a dip in the $\rho _0$ curve. In addition we notice that $\rho _0^\mathrm{TM}$ takes a minimal value at the resonance angle, but $\rho _0^\mathrm{TE}$ increases there to the contrary. This illustrates the enhancement of TM-TE mode conversion [5] that a TM component of the incident light is strongly converted into a TE component of the 0th-order diffracted light when plasmon resonance absorption occurs in a metal grating in conical mounting.

In Fig. 8.17 we show the expansion coefficient of the $-1$st-order TM vector modal function $A_{1\,-1}^\mathrm{TM}$ defined in (8.120) of Appendix 3 as a function of $\theta $. The parameters in the figure are the same as those of Fig. 8.16 where the $-1$st-order mode is evanescent. The solid curve in Fig. 8.17 represents the real part of the expansion coefficient and the dashed curve is the imaginary part. From this result we observe the resonance property of the expansion coefficient $A_{1\,-1}^\mathrm{TM}$ at the angle of incidence $\theta = 9.2^\circ $ and confirm that the TM component of the $-1$st-order evanescent mode couples with surface plasmons at the resonance angle. We thus demonstrate that plasmon resonance absorption is associated with coupling of surface plasmons with an evanescent mode diffracted by a metal grating.

We note that the excitation of surface plasmons is largely affected by the azimuthal angle $\phi $. Figure 8.18 shows the plasmon resonance absorption for several $\phi $’s under the same parameters as those of Fig. 8.16. We observe that the resonance angle varies with $\phi $ as shown in Fig. 8.18. This means direction of propagation depends on $\phi $, the direction in which the plasmon surface wave propagates. The azimuthal angle $\phi $ has also large influence on the enhancement of TM-TE mode conversion through plasmon resonance absorption. For example, a TM component of the 0th-order diffracted mode almost vanishes at the resonance absorption at $\phi =45^\circ $, but a TE component becomes to be 0.7 there.

8.3.4 Scattering by a Metal Bigrating

In this subsection we deal with a 3-D problem: diffraction by a metal bigrating whose surface profile is periodically corrugated in two directions. We briefly describe the formulation of Yasuura’s method for solving the problem by a metal bigrating and then show numerical results of plasmon resonance absorption in the grating [16].

We consider a bisinusoidal metal grating shown in Fig. 8.19. The surface profile of the grating is given by

$$\begin{aligned} \eta (x, y) = H \left[ \sin \left( \frac{2\pi x}{d} \right) + \sin \left( \frac{2\pi y}{d} \right) \right] . \end{aligned}$$

(8.78)

The upper region $\text{ V }_1$ over the grating surface S$_0$ is vacuum with a refractive index $n_1 = 1$ and the region $\text{ V }_2$ below the grating surface consists of a lossy metal with a complex refractive index $n_2$. The permeability of the metal is assumed to be $\mu _0$.

The incident light is an electromagnetic plane wave

$$\begin{aligned} \left[ \begin{array}{c} \mathbf {E}^\mathrm{i} \\ \mathbf {H}^\mathrm{i} \end{array} \right] (\mathbf{r})=\left[ \begin{array}{c} \mathbf {e}^\mathrm{i} \\ \mathbf {h}^\mathrm{i} \end{array} \right] \exp ( -\mathrm{i}\mathbf{k}^\mathrm{i} \cdot \mathbf{r}). \end{aligned}$$

(8.79)

Here, r is the position vector for an observation point, $\mathbf{k}^\mathrm{i}$ is the wavevector of the incident wave, and $\mathbf{h}^\mathrm{i} = (1/\omega \mu _0)\, \mathbf{k}^\mathrm{i} \times \mathbf{e}^\mathrm{i}$. The wavevector is given by

$$\begin{aligned} \mathbf{k}^\mathrm{i} = (\alpha , \beta , -\gamma ) \end{aligned}$$

(8.80)

with $\alpha =n_1 k \sin \theta \cos \phi $, $\beta =n_1 k \sin \theta \sin \phi $, and $\gamma =n_1 k \cos \theta $. Here, $k\; (= 2\pi /\lambda )$ is the wavenumber in vacuum, and $\theta $ is the polar angle between the Z axis and the incident wavevector, and $\phi $ is the azimuthal angle between the X axis and the plane of incidence.

We denote the diffracted electric and magnetic fields by $\mathbf {E}_{\ell }^\mathrm{d}(\text{ P })$, $\mathbf {H}_{\ell }^\mathrm{d}(\text{ P })$ in the regions $\text{ V }_\ell $ $(\ell = 1, 2)$. Here we explain briefly Yasuura’s method for finding the diffracted fields. We first introduce TE and TM vector modal functions defined in the region $\text{ V }_\ell $ $(\ell = 1, 2)$:

$$\begin{aligned} \left\{ \begin{array}{ll} \varvec{\varphi }_{\ell mn}^\mathrm{TE}(\mathbf{r}) = \mathbf{e}_{\ell mn}^\mathrm{TE} \exp (-\mathrm{i}\mathbf{k}_{\ell mn} \cdot \mathbf{r}), &{} \quad \displaystyle \mathbf{e}_{\ell mn}^\mathrm{TE} = \frac{\mathbf{k}_{\ell mn} \times \mathbf{u}_Z}{\left| \mathbf{k}_{\ell mn} \times \mathbf{u}_Z \right| }, \\ \varvec{\varphi }_{\ell mn}^\mathrm{TM}(\mathbf{r}) = \mathbf{e}_{\ell mn}^\mathrm{TM} \exp (-\mathrm{i}\mathbf{k}_{\ell mn} \cdot \mathbf{r}), &{} \quad \displaystyle \mathbf{e}_{\ell mn}^\mathrm{TM} = \frac{\mathbf{e}_{\ell mn}^\mathrm{TE} \times \mathbf{k}_{\ell mn}}{\left| \mathbf{e}_{\ell mn}^\mathrm{TE} \times \mathbf{k}_{\ell mn} \right| } \\ \quad \quad \quad (m,\; n = 0, \pm 1, \pm 2, \ldots ). \end{array} \right. \end{aligned}$$

(8.81)

Here, $\mathbf{u}_Z$ is a unit vector in the Z-direction and $\mathbf{k}_{\ell mn}$ $(\ell = 1, 2)$ is the wavevector of the (m, n)th-order diffracted wave:

$$\begin{aligned} \mathbf{k}_{1mn} = (\alpha _m, \beta _n, \gamma _{1 mn}), \quad \mathbf{k}_{2mn} = (\alpha _m, \beta _n, -\gamma _{2 mn}) \end{aligned}$$

(8.82)

with

$$\begin{aligned} \left\{ \begin{array}{l} \displaystyle \alpha _m = \alpha + \frac{2m \pi }{d}, \quad \beta _n = \beta + \frac{2n \pi }{d}, \quad \gamma _{\ell m} = \sqrt{(n_\ell k)^2 - (\alpha _m^2+\beta _n^2)} \\ \quad \quad \quad (\mathrm{Re}\,\gamma _{\ell mn} \ge 0,\; \mathrm{Im}\,\gamma _{\ell mn} \le 0). \end{array} \right. \end{aligned}$$

(8.83)

We form approximate solutions for the diffracted electric and magnetic fields:

$$\begin{aligned} \left[ \begin{array}{c} \mathbf{E}_{\ell N}^\mathrm{d} \\ \mathbf{H}_{\ell N}^\mathrm{d} \end{array} \right] (\mathbf{r}) = \sum _{m,n=-N}^N A_{\ell mn}^\mathrm{TE} \left[ \begin{array}{c} \varvec{\varphi }_{\ell mn}^\mathrm{TE} \\ \varvec{\psi }_{\ell mn}^\mathrm{TE} \end{array} \right] (\mathbf{r}) + \sum _{m,n=-N}^N A_{\ell mn}^\mathrm{TM} \left[ \begin{array}{c} \varvec{\varphi }_{\ell mn}^\mathrm{TM} \\ \varvec{\psi }_{\ell mn}^\mathrm{TM} \end{array} \right] (\mathbf{r}) \quad (\ell = 1, 2) \end{aligned}$$

(8.84)

with

$$\begin{aligned} \varvec{\psi }_{\ell mn}^\mathrm{q}(\mathbf{r}) =\frac{1}{\omega \mu _0}\, \mathbf{k}_\mathrm{\ell mn} \times \varvec{\varphi }_{\ell mn}^\mathrm{q}(\mathbf{r}) \quad (\mathrm{q} = \mathrm{TE, TM}). \end{aligned}$$

(8.85)

The expansion coefficients $A_{\ell mn}^\mathrm{TE}$, $A_{\ell mn}^\mathrm{TM}$ are determined so that the approximate solutions $\mathbf{E}_{\ell N}^\mathrm{d}(\mathrm{P})$, $\mathbf{H}_{\ell N}^\mathrm{d}(\mathrm{P})$ satisfy the boundary conditions in a weighted least-squares sense. To do this, we minimize the mean-square error

$$\begin{aligned} \begin{aligned} E_N =&\int \limits _\mathrm{S} \left| \varvec{\nu }\times \left( \mathbf{E}_{1N}^\mathrm{d} + \mathbf{E}^\mathrm{i} - \mathbf{E}_{2N}^\mathrm{d} \right) (s) \right| ^2\, dS \\&+ Z_0^2 \int \limits _\mathrm{S} \left| \varvec{\nu }\times \left( \mathbf{H}_{1N}^\mathrm{d} + \mathbf{H}^\mathrm{i} - \mathbf{H}_{2N}^\mathrm{d} \right) (s) \right| ^2\, dS, \end{aligned} \end{aligned}$$

(8.86)

where S is one period cell of the grating surface S$_0$, $\varvec{\nu }$ is a unit normal vector to the grating surface, and $Z_0$ is an intrinsic impedance of the medium of $\text{ V }_1$.

The mean-square error $E_N$ is discretized by applying a two-dimensional trapezoidal rule where the number of divisions in the X- and Y-directions is chosen to be $J=2(2N+1)$. The discretized LSP with $24(2N+1)^2 \times 4(2N+1)^2$ Jacobian is solved by QRD.

The diffraction efficiency $\rho _{mn}$ of the (m, n)th-order mode $(\gamma _{1m}\ge 0)$ in $\text{ V }_1$ is given by

$$\begin{aligned} \rho _{mn} = \rho _{mn}^\mathrm{TE} + \rho _{mn}^\mathrm{TM}, \end{aligned}$$

(8.87)

where the efficiency of the (m, n)th-order TE or TM mode is given by

$$\begin{aligned} \rho _{mn}^\mathrm{TE} = \frac{\gamma _{1m}}{\gamma }\, |A_{1mn}^\mathrm{TE}|^2,\quad \rho _{mn}^\mathrm{TM} = \frac{\gamma _{1m}}{\gamma }\, |A_{1mn}^\mathrm{TM}|^2. \end{aligned}$$

(8.88)

We show the plasmon resonance absorption in a bisinusoidal grating made of silver [12]. We consider a shallow bisinusoidal grating with a corrugation depth $H = 0.0075\,\upmu $m and a period $d = 0.556\,\upmu $m. The wavelength of the incident light is chosen as $\lambda = 0.650\,\upmu $m where only the (0, 0)th-order diffracted mode propagates. We take $n_2 = 0.07 - \mathrm{i}4.2$ as the refractive index of silver at this wavelength.

Figure 8.20 shows the diffraction efficiency of the (0, 0)th-order diffracted mode $\rho _{00}$ as functions of the polar angle $\theta $ when the azimuthal angle $\phi = 30^\circ $ is fixed. In the efficiency curve we observe four dips A, B, C, and D at which incident light power is strongly absorbed by the grating. The dips are associated with absorption that is caused by the coupling of surface plasmons with an evanescent mode diffracted by a bisinusoidal silver grating. This is confirmed from Fig. 8.21 where the expansion coefficients (a) $A_{1\, -10}^\mathrm{TM}$ and (b) $A_{10\, -1}^\mathrm{TM}$ are plotted as functions of $\theta $ under the same parameters as in Fig. 8.20. The solid curves in Fig. 8.21 represent the real part of the expansion coefficient and the dashed curves are the imaginary part. In Fig. 8.21a, a resonance property of the $A_{1\,-10}^\mathrm{TM}$ curve at $\theta =9.5^\circ $, i.e., a dip A, illustrates that the TM component of the $(-1, 0)$ evanescent mode couples with surface plasmons at a dip A. From the resonant property of the $A_{10-1}^\mathrm{TM}$ curve in Fig. 8.21b we confirm that dips B and D are associated with the coupling of the $(0,-1)$ evanescent mode with surface plasmons. Similarly, we can show a dip C is caused by coupling of the $(-1,-1)$ evanescent mode.

When an incident light with $\phi = 45^\circ $ illuminates a bisinusoidal grating at the specific angle of $\theta $, i.e., the resonance angle, two surface plasmon waves are excited and propagate in directions symmetric with respect to the plane of incidence. The absorption associated with the two surface plasmon waves is called simultaneous resonance absorption [12]. Figure 8.22 shows an example of the simultaneous resonance absorption where the $(-1, 0)$th- and $(0, -1)$st-order evanescent modes couple simultaneously with two surface plasmon waves at the same polar angle $\theta = 12.2^\circ $. The two surface plasmon waves excited simultaneously on the grating surface interact with each other and the interference of the surface plasmon waves causes the standing wave in the vicinity of the grating surface. This is confirmed in Fig. 8.23, where the X and Y components of Poynting’s vector $\mathbf{S}$ on the surface 0.01d above the one-unit cell of the grating surface are plotted as the vector $(S_X, S_Y)$.

8.3.5 Scattering by Periodically Located Spheres

Some numerical results are given for the scattering by dielectric spheres located periodically in three directions [17]. This kind of structure is a fundamental model of photonic crystals having properties of electromagnetic or optical band gaps .

As shown in Fig. 8.24, the structure is composed by stacking cubic unit cell regions with a volume $d^3$, each of which includes a sphere with radius a and relative permittivity $\varepsilon _r$. The number of spheres is infinity along the both X and Y axes, and the two-dimensionally infinite periodic structures are stacked to compose finite Q layers in the Z direction. At present we limit ourselves to the case where either electric or magnetic field of the incident plane wave is perpendicular to the page, allowing us to use only one incident angle $\theta $.

In the upper and lower semi-infinite spaces, the approximate wave functions $(\mathbf{E}_{0N}(\mathbf{r}), \mathbf{H}_{0N}(\mathbf{r}))$ and $(\mathbf{E}_{Q+1\,N}(\mathbf{r}), \mathbf{H}_{Q+1\,N}(\mathbf{r}))$ are expressed in terms of modal coefficients $A_{0mn}^\mathrm{TE,TM} (N)$ and $A_{Q+1\,mn}^\mathrm{TE,TM} (N)$, respectively. The set of modal functions here is the same as that employed in Sect. 8.3.4 for the two-dimensional periodic structures. On the other hand, for the fields in the areas of periodically distributed spheres, a set of vector spherical wave functions $\left\{ \mathbf{m}_{mn}^\mathrm{e,h}(\mathbf{r}), \mathbf{n}_{mn}^\mathrm{e,h}(\mathbf{r}) \right\} $ is used to write the approximate wave functions. In the cube region of the layer $\sharp q$, they are expressed by

$$\begin{aligned} \left\{ \begin{array}{l} \displaystyle \left[ \begin{array}{c} \mathbf{E}_{qN}(\mathbf{r}) \\ Z_0 \mathbf{H}_{qN}(\mathbf{r}) \end{array} \right] = \sum _{n=1}^{3N} \sum _{m=-n}^n \left[ \begin{array}{cc} \mathbf{m}_{mn}^\mathrm{e}(\mathbf{r}_q) &{} \mathbf{n}_{mn}^\mathrm{e}(\mathbf{r}_q) \\ -\mathrm{i}\mathbf{n}_{mn}^\mathrm{h}(\mathbf{r}_q) &{} -\mathrm{i}\mathbf{m}_{mn}^\mathrm{h}(\mathbf{r}_q) \end{array} \right] \left[ \begin{array}{c} A_{qmn}^\mathrm{TE}(N) \\ A_{qmn}^\mathrm{TM}(N) \end{array} \right] \\ \quad \quad \quad (q = 1, 2, \ldots , Q), \end{array} \right. \end{aligned}$$

(8.89)

where $Z_0$ is the intrinsic impedance of vacuum and $\mathbf{r}_q = (r_q, \theta _q, \phi _q)$ is a position vector with its origin placed at the center of the qth sphere on the Z axis. Note that the truncation number is selected as 3N in order to maintain the balance with the half spaces from the viewpoint of the degree of approximation. The spherical wave functions $\left\{ \mathbf{m}_{mn}^\mathrm{e,h}(\mathbf{r}_q), \mathbf{n}_{mn}^\mathrm{e,h}(\mathbf{r}_q) \right\} $ are written by combination of the spherical Bessel functions of the nth order, the associated Legendre functions $P_n^{|m|}(\cos \theta _q)$, and the exponential (trigonometric) functions $\exp (\mathrm{i}m\phi _q)$.^{Footnote 31} The functions with respect to $\mathbf{r}_q$ are constructed beforehand so that they automatically satisfy the continuity conditions for $E_\theta $, $E_\phi $, $H_\theta $, and $H_\phi $ over the spherical surfaces $r_q = a$ [17]. As a result, the present problem is reduced to the determination of the modal coefficients such that the remaining boundary conditions on the horizontal planes

$$\begin{aligned} \left\{ \begin{array}{l} \mathbf{u}_Z \times \left( \mathbf{E}_q, \mathbf{H}_q \right) = \mathbf{u}_Z \times \left( \mathbf{E}_{q+1}, \mathbf{H}_{q+1} \right) \\ \quad \quad \quad (\text{ between } \text{ the } \text{ layers } \sharp q \text{ and } \sharp q+1;\; q = 0, 1, 2, \ldots , Q) \end{array} \right. \end{aligned}$$

(8.90)

and the periodicity conditions on the vertical planes

$$\begin{aligned} \left\{ \begin{array}{l} \left. \mathbf{u}_X \times \left( \mathbf{E}_q, \mathbf{H}_q \right) \exp (\mathrm{i}kd \sin \theta ) \right| _{X=-d/2+0} = \left. \mathbf{u}_X \times \left( \mathbf{E}_q, \mathbf{H}_q \right) \right| _{X= d/2-0}, \\ \left. \mathbf{u}_Y \times \left( \mathbf{E}_q, \mathbf{H}_q \right) \right| _{Y=-d/2+0} = \left. \mathbf{u}_Y \times \left( \mathbf{E}_q, \mathbf{H}_q \right) \right| _{Y= d/2-0} \\ \quad \quad \quad (q = 1, 2, \ldots , Q) \end{array} \right. \end{aligned}$$

(8.91)

should be satisfied on the faces of the unit cells in the sense of least-squares. In the boundary conditions (8.91), we count the upper and lower half spaces by the numbers $\sharp 0$ and $\sharp Q+1$, respectively.

Figure 8.25 shows the normalized mean-square error and energy error as functions of the truncation number N. We find that both errors decrease monotonically when N increases. The period d is 0.8 times as the wavelength of the incident wave $\lambda $ (${=}2\pi /k$). Since the wavelength in dielectric material is shorter than that in the air, we need large N for big spheres. However, even at $a/d = 0.3$, these errors become less than 1% if $N \ge 4$.

Figure 8.26 is drawn to observe the effect of increasing the layer number on the band of total transmission and total reflection. For the single layer at $Q = 1$, we find two reflection points at $d/\lambda \approx 0.77$ and 0.91. When the layer is increased, these points are changed to reflection bands.

Figure 8.27 presents the reflected power for each mode as a function of incident angle for a 4-layered structure. We observe the power is totally reflected when $\theta $ is less than about 40$^\circ $. This property disappears for larger $\theta $ due to the emergence of the $(-1,0)$th higher order modes having a cutoff angle $\theta = 46^\circ $.

We should note that introduction of sequential accumulation in the process of QR decomposition reduces the computation time from $O(Q^3)$ to $O(Q^1)$ and the memory requirement from $O(Q^2)$ to $O(Q^1)$, with Q being a number of sphere layers. See [17] for the detailed data.

8.4 Conclusions

Because of the reasons we have stated in Sect. 8.1, we reviewed Yasuura’s method of modal expansion attaching importance to the process of solution by the CYM: choice of modal functions; a finite-sum approximate solution; least-squares boundary matching; location and number of sampling points; and solution method for the LSP. In addition, we included guidances for handling dielectric obstacles and gratings placed in planer or conical mounting. Still more, we gave a comparison between separated solutions and monopole fields in approximation power.

As for applications to 3D, we have only two grating problems in Sects. 8.3.4 and 8.3.5. Because we have been working in diffraction gratings, we do not have appropriate examples that show the effectiveness of the CYM in 3D scattering problems. However, our former colleagues have solved the problems using the CYM and published their results [11, 13]. Speaking from a theoretical point of view, they have employed the set of multipole functions as the modal functions whose completeness has been proven by Calderón [6].

We hope that the contents of this chapter would be useful for researchers and engineers who need reliable methods for solving electromagnetic boundary-value problems.

Notes

1.
The reason why we set a limit “2-D” is that the SP, in the present form, is available only in 2-D problems. This is because we employ an indefinite integral to realize a low-pass spatial filter.
2.
It is also termed Transverse-Electric (TE) wave, which means the electric field is orthogonal to the xy-plane. While in the H-wave (or TM-wave) the magnetic field has the z-component alone.
3.
The component is called a leading field if it gives other nonzero components as in (8.3). Note that the derivation of $\mathbf{H}^\mathrm{s}$ by (8.3) is a proper procedure because the sequence of our approximate solutions converges to the true solution uniformly in wider sense in the exterior region $\mathrm{S}_\mathrm{e}$ as we will see later.
4.
We hope the readers consult a treatise on Functional Analysis, e.g. [14], in case of need.
5.
If C is a circle centered at the origin, it is apparent that the sets of boundary values and normal derivatives are both complete because the members of each set are nothing other than the Fourier bases . Even in case if C is not a circle, the sets are still complete because of Example 2: let L be a circle centered at the origin and take the Fourier bases for $f_m(t)$ in (8.8), then we get a set of separated solutions.
6.
This is not a strong exception because we can modify the contour L (and hence $\mathrm{D}_\mathrm{i}$) slightly to avoid the coincidence. Example 2 is a key theorem of generation of complete sets, which has been proven by Yasuura and Itakura [39] as an analogy of Runge’s (or Runge-Walsh’s) theorem known in Theory of Complex Functions.
7.
Unfortunately, separated solutions are not very efficient in a problem where C is strongly modulated from a circle (or, in general, a coordinate surface of the system of coordinates employed). As an example, we show a comparison between types of modal functions: the separated solutions (8.7) and monopole fields (8.9) in Appendix 4.
8.
This dependence is natural because the boundary values of modal functions, in general, do not form an orthogonal set in H. This type of summation is usually called a flexible summation. Note that the approximate solution is defined in a finite summation of modal functions. By considering a sequence of finite-sum solutions, we can avoid the constraint of the convergence area of an infinite series solution. Yasuura’s original papers [38,39,40] has been written from this point of view. Reference [9] includes an interpretation of the difference between series and sequence solutions.
9.
$G(\mathbf{r},\mathbf{r}')$ is a total electric field observed at $\mathbf{r}'\,(\not = \mathbf{r})$ when a unit line source is placed at $\mathbf{r}$ in Fig. 8.1. Note that employment of the Green function satisfying (8.12) is for convenience and is not essential: The whole theory has been established in [38,39,40], where the free-space Green function alone was used.
10.
This kind of convergence is called uniform convergence in wider sense in $\mathrm{S}_\mathrm{e}$.
11.
Until the middle of 80s we employed normal equations (NE) in solving LSP 1. Now we solve the problem using the method in Sect. 8.2.3.2. We state the reason why we stopped using the NE and attach some comments in Appendix 2.
12.
On the other hand, there is a possibility to make possible use of the weighting function accompanying the variable transformation. For example, a Schwarz–Christoffel-type transformation works to remove the singularity of Green’s function in a problem of an edged cross section [23].
13.
It is more reasonable to ask “How many linear equations do we need?” This is because (i) we get two equations at one sampling point in a 2-media problem (see Sect. 8.2.4); and (ii) we should understand (8.31) as a relation between the numbers of equations J and unknowns M.
14.
In addition, the solution by a QRD program, usually, is not inferior in accuracy to one by an SVD program. This may be because of the greater computational complexity of the SVD.
15.
The machine epsilon, EPS, is the minimum positive number that satisfies $1 + \mathrm{EPS} > 1$ in the floating-point system employed.
16.
To get the latter we set $\mathbf{u}_\nu \times (\mathbf{H}_\mathrm{e} - \mathbf{H}_\mathrm{i}) = 0$. Insertion of $\mathbf{H}_\mathrm{e} = (\mathrm{i}/\omega \mu )\nabla E_\mathrm{e}\times \mathbf{u}_z$ etc. finds the desired relation. Here, $E_\mathrm{e} = F + \varPsi _\mathrm{e}$ stands for the total electric field in $\mathrm{S}_\mathrm{e}$.
17.
We cannot include the derivation of equations from (8.45) through (8.49) because it takes much space. Interested readers can find the details in [35, 43,44,45]. The paper by Petit and Cadilhac [33] is also helpful.
18.
The use of intrinsic impedance is also possible and is widely employed. That is: find the coefficients by minimization of $|\text{ error } \text{ in } \text{ E }|^2 + Z_0^2 |\text{ error } \text{ in } \text{ H }|^2$. Here, $Z_0$ is the intrinsic impedance of vacuum or surrounding material. We use this formulation in Appendix 3.
19.
If the compensation by $\gamma $ is not necessary, we can set $p=1/ \mathbf{f\,}^\dag \mathbf{f}$ and $q=1/ \mathbf{g}^\dag \mathbf{g}$ or use the intrinsic impedance.
20.
s stands for senkrecht (German) , which means the electric field is perpendicular to the plane of incidence, the plane spanned by $\mathbf{u}_Y$ (grating normal) and the incident wavevector.
21.
The superscript E denotes that the coefficients concern the E-wave. Later we will also use the superscripts H, TE, and TM in accordance with polarizations.
22.
If we employ the SP, this correspondence is essentially important because we need periodicity of the functions defined on the boundary. In using Yasuura’s method without the SP, we can say the following points: (1) If we get the solution through the NE, this modification is not necessary because it is done automatically in calculating the inner products; (2) While if we employ the QRD or SVD: (2.i) The modification may accelerate the convergence of the solutions because the target function and the modal functions are periodically continuous after modification; (2.ii) And, a quadrature by parts (or rectangular-rule) approximation is equivalent to a trapezoidal-rule in numerical integrations.
23.
Note that the bias setting (we used ${-}1$ here) has an effect on the accuracy of numerical computation when the grating is deep.
24.
Although the use of normalization by wavelength (i.e., $kd = 2\pi d/\lambda $ etc.) is convenient in handling a problem with a PC obstacle, we employ real length here.
25.
$\rho _m$ is referred to as the (reflection) efficiency of the mth order.
26.
When the number of truncation is small (e.g., $N \le 10$), we sometimes observe a phenomenon that the condition number continues to decrease slightly beyond $J=2M$ due to tiny increment of $\sigma _\mathrm{min}$.
27.
We should notice, however, that the accuracy of an H-wave solution is lower than that of an E-wave solution by one or two digits. This is observed generally; and was Yasuura’s motivation of introducing the SP. His idea came from the fact that a Neumann problem for an electrostatic potential is equivalent to a Dirichlet problem for a stream function. The prototype of the SP, hence, was called an algorithm using the stream function in a wave field.
28.
The efficiencies are given by $\rho _{m}^\mathrm{TE}=\left( \gamma _{1m}/\gamma _{10} \right) |A_{1m}^\mathrm{TE}|^2$ and $\rho _{m}^\mathrm{TM}=\left( \gamma _{1m}/\gamma _{10} \right) |A_{1m}^\mathrm{TM}|^2$ where $\gamma _{1m}$ is the propagation constant in the Z-direction of the mth-order propagating mode $\left( \mathrm{Re} \left( \gamma _{1m} \right) \ge 0 \right) $ concerning the upper region $\text{ V }_1$, and $A_{1m}^\mathrm{TE}$ and $A_{1m}^\mathrm{TM}$ are the expansion coefficients of the approximate solutions defined in (8.120) of Appendix 3.
29.
$\rho ^\mathrm{Total}$ is a summation of $\rho _m$ over the propagating orders.
30.
$1-\rho ^\mathrm{Total}$ represents the ratio of the absorbed light power by a metal grating to the incident light power.
31.
The vector $\mathbf{m}_{mn}^\mathrm{e,h}(\mathbf{r}_q)$ is perpendicular to the $r_q$ axis, whereas $\mathbf{n}_{mn}^\mathrm{e,h}(\mathbf{r}_q)$ has an $r_q$ component. That is, the superscript TE (TM) in (8.89) means transverse electric (transverse magnetic) with respect to $r_q$.
32.
If $f \in \varPhi _N$, then $E_N = 0$. This, however, cannot occur in practice: For example, even in the case of scattering from a circular cylinder made of a PC, we need an infinite series to represent a rigorous solution because the boundary value has a form $\exp [-\mathrm{i}ka\cos (\theta _s - \iota )]$. In addition, note that $\varPhi _N$ is closed.
33.
We get (8.104) by setting $(\varphi _m, \varPsi _N-f) = 0$ $(m=0, \pm 1,\ldots , \pm N)$; or from $\partial E_N/\partial \overline{A}_m = 0$.
34.
Assume a PC surface-relief grating with a TE-wave incidence, for simplicity, and imagine the surface current induced. It apparently has a Z-oriented ingredient, which excites a TM-wave component.
35.
$\mathbf{e}^\mathrm{TE}$ is perpendicular to the plane of incidence; the fact that the magnetic field accompanying $\mathbf{e}^\mathrm{TM}$ is orthogonal to the plane can be seen by manipulation.
36.
We can use monopole fields also in the grating problems discussed in Sect. 8.2.5. A countably infinite set of monopoles located periodically in x—i.e., the location is given by $(x_1 + \ell d, y_1)$ $(0< x_1 \le d;\; y_1 < \eta (x_1);\; \ell = 0, \pm 1, \pm 2, \ldots )$—radiates a plane wave [4, 36] satisfying (GD1) and (GD2). If we let the monopoles be accompanied by phase factors $\exp (\mathrm{i}\ell kd\sin \theta )$, the plane wave meets the periodicity (GD3). Increasing the number of monopoles in the first strip region to M, i.e., $(x, y) = (x_1, y_1), (x_2, y_2), (x_3, y_3), \ldots , (x_M, y_M)$, and repeating the same procedure, we have a set of M plane waves, which is the desired set of modal functions [28, 37].
37.
Although the employment of polyphase wave functions is effective because of the periodicity, we do not use them for simplicity.
38.
According to the result of numerical computation, an optimum d was in the rage [0.85, 0.90] when the total number of poles was between 40 and 120. If we increased (or decreased) the number of poles, the optimum d approached 0.90 (or 0.85). Note, however, that the trends were observed in solving a particular problem with specific computational parameters and are no more than reference data.
39.
The result of sample calculation has shown that the use of $|\text{ BC }|^\alpha $ ($\alpha > 1$) instead of BC (i.e., further emphasis of the convex part in locating poles) gives better solutions.
40.
We have applied the rule $J=2M$ and have omitted J.
41.
The relation is referred to as the optical theorem, which implies energy conservation.
42.
We have employed the monopole fields and have seen their effectiveness [30]. It is worth noting that inclusion of a few dipoles located near the convex part of L in addition to the monopoles improves the efficiency greatly. This might be related to Cadilhac-Petit’s opinion [7] in locating the poles near an internal focus.
43.
We got the elements under the assumption that the length of C is 1. This is convenient in mathematical analysis and does not affect applications to obstacles made of a lossless material including PC. In dealing with a lossy material, in particular a metal in light frequency, the normalization should be accompanied by a law of similitude in time-dependent EM field [19] and, hence, the use of actual length might be appropriate.

References

M. Bass (ed.), Handbook of Optics; Volume II — Devices, Measurements, and Properties, 2nd edn. (McGraw-Hill, 1995)
Google Scholar
R.H.T. Bates, Analytic constraints on electromagnetic field computations. IEEE Trans. Microw. Theory Tech. MTT-23(8), 605–623 (1975)
Google Scholar
R.H.T. Bates, J.R. James, I.N.L. Gallett, R.F. Millar, An overview of point matching. Radio Electron. Eng. 43(3), 193–200 (1973)
Article Google Scholar
A. Boag, Y. Leviatan, A. Boag, Analysis of two-dimensional electromagnetic scattering from a periodic grating of cylinders using a hybrid current method. Radio Sci. 23(4), 612–624 (1988)
Article ADS Google Scholar
G.P. Bryan-Brown, J.R. Sambles, M.C. Hutley, Polarization conversion through the excitation of surface plasmons on a metallic grating. J. Modern Opt. 37(7), 1227–1232 (1990)
Article ADS Google Scholar
A.P. Calderón, The multipole expansion of radiation fields. J. Ration. Mech. Anal. (J. Math. Mech.) 3, 523–537 (1954)
Google Scholar
M. Cadilhac, R. Petit, On the diffraction problem in electromagnetic theory: a discussion based on concepts of functional analysis including an example of practical application, in Huygens’ Principle 1690–1990: Theory and Applications, Studies in Mathematical Physics, ed. by H. Blok, et al. (Elsevier, Amsterdam, 1992)
Google Scholar
G. Hass, L. Hardley, Optical properties of metal, in American Institute of Physics Handbook, ed. by D.E. Gray, 2nd ed. (McGraw-Hill, 1963), pp. 6–107
Google Scholar
J.P. Hugonin, R. Petit, M. Cadilhac, Plane-wave expansions used to describe the field diffracted by a grating. J. Opt. Soc. Am. 71(5), 593–598 (1981)
Article ADS Google Scholar
H. Ikuno, K. Yasuura, Numerical calculation of the scattered field from a periodic deformed cylinder using the smoothing process on the mode-matching method. Radio Sci. 13(6), 937–946 (1978)
Article ADS Google Scholar
H. Ikuno, M. Gondoh, M. Nishimoto, Numerical analysis of electromagnetic wave scattering from an indented body of revolution. Trans. IEICE Electron. E74-C(9), 2855–2863 (1991)
Google Scholar
T. Inagaki, J.P. Goudonnet, J.W. Little, E.T. Arakawa, Photoacoustic study of plasmon-resonance absorption in a bigrating. J. Opt. Soc. Am. B 2(3), 433–439 (1985)
Article ADS Google Scholar
M. Kawano, H. Ikuno, M. Nishimoto, Numerical analysis of 3-D scattering problems using the Yasuura method. Trans. IEICE Electron. E79-C(10), 1358–1363 (1996)
Google Scholar
A.N. Kolmogorov, S.V. Fomin, Elements of the Theory of Functions and Functional Analysis (Dover, New York, 1999)
Google Scholar
C.L. Lawson, R.J. Hanson, Solving Least Squares Problems (Prentice-Hall, New Jersey, 1974)
Google Scholar
T. Matsuda, D. Zhou, Y. Okuno, Numerical analysis of plasmon-resonance absorption in a bisinusoidal metal grating. J. Opt. Soc. Am. A 19(4), 695–701 (2002)
Google Scholar
A. Matsushima, Y. Momoka, M. Ohtsu, Y. Okuno, Efficient numerical approach to electromagnetic scattering from three-dimensional periodic array of dielectric spheres using sequential accumulation. Progr. Electromagn. Res. 69, 305–322 (2007)
Article Google Scholar
R.F. Millar, Rayleigh hypothesis and a related least-squares solution to scattering problems for periodic surfaces and other scatterers. Radio Sci. 8(8–9), 785–796 (1973)
Article ADS MathSciNet Google Scholar
H. Nakano, Frequency-independent antennas: spirals and log-periodics, in Modern Antenna Handbook, ed. by C.A. Balanis (Wiley, New Jersey, 2008), pp. 263–323
Google Scholar
Y. Nakata, M. Koshiba, M. Suzuki, Finite-element analysis of plane wave diffraction from dielectric gratings. Trans. IEICE Jpn. J69-C(12), 1503–1511 (1986)
Google Scholar
M. Neviér, The homogeneous problems, in Electromagnetic Theory of Gratings, ed. by R. Petit (Springer, Berlin, 1980), pp. 123–157
Google Scholar
M. Ohtsu, Y. Okuno, A. Matsushima, T. Suyama, A Combination of up- and down-going Floquet modal functions used to describe the field inside grooves of a deep grating. Progr. Electromagn. Res. 64, 293–316 (2006)
Article Google Scholar
Y. Okuno, A numerical method for solving edge-type scattering problems. Radio Sci. 22(6), 941–946 (1987)
Article ADS Google Scholar
Y. Okuno, The mode-matching method, in Analysis Methods in Electromagnetic Wave Problems, ed. by E. Yamashita (Artech House, 1990), pp. 107–138
Google Scholar
Y. Okuno, An introduction to the Yasuura method, in Analytical and Numerical Methods in Electromagnetic Wave Theory, ed. by M. Hashimoto, M. Idemen, O.A. Tretyakov (Science House, 1993), pp. 515–565
Google Scholar
Y. Okuno, H. Ikuno, Completeness of the boundary values of equivalent sources. Mem. Fac. Eng. Kumamoto Univ. 38(1), 1–8 (1993)
ADS Google Scholar
Y. Okuno, H. Ikuno, Yasuura’s method, its relation to the fictitious source methods, and its advancements in the solution of 2D problems, in Generalized Multipole Techniques for Electromagnetic and Light Scattering, ed. T. Wriedt (Elsevier, Amsterdam, 1999)
Google Scholar
Y. Okuno, T. Matsuda, T. Kuroki, Diffraction efficiency of a grating with deep grooves, in Proceedings of the 1995 Sino-Japanese Joint Meeting on Optical Fiber Science and Electromagnetic Theory (OFSET’95), vol. 1 (Tianjin, China, 1995), pp. 106–111
Google Scholar
Y. Okuno, T. Suyama, R. Hu, S. He, T. Matsuda, Excitation of surface plasmons on a metal grating and its application to an index sensor. Trans. IEICE Electron. E90-C(7), 1507–1514 (2007)
Google Scholar
Y. Okuno, H. Yamaguchi, The idea of equivalent sources in the Yasuura method, in Proceedings 1992 International Symposium on Antennas Propagat (ISAP’92), vol. 1E3-2 (Sapporo, Japan, 1992)
Google Scholar
Y. Okuno, K. Yasuura, Numerical algorithm based on the mode-matching method with a singular-smoothing procedure for analysing edge-type scattering problems. IEEE Trans. Antennas Propagat. 30(4), 580–587 (1982)
Article ADS MATH Google Scholar
R. Petit (ed.), Electromagnetic Theory of Gratings (Springer, Berlin, 1980)
Google Scholar
R. Petit, M. Cadilhac, Electromagnetic theory of gratings: some advances and some comments on the use of the operator formalism. J. Opt. Soc. Am. A 7(9), 1666–1674 (1990)
Article ADS MathSciNet Google Scholar
H. Raether, Surafce plasmon and roughness, in Surface Polaritons — Electromagnetic Waves at Surfaces and Interfaces, ed. by V.M. Agranovich, D.L. Mills (North Holland, 1982), pp. 331–403
Google Scholar
M. Tomita, K. Yasuura, The Rayleigh expansion theorem for the boundary value problem in two media. Kyushu Univ. Tech. Rep. 52(2), 142–154 (1979)
Google Scholar
J.R. Wait, Reflection from a wire grid parallel to a conducting plane. Can. J. Phys. 32, 571–579 (1954)
Article ADS MATH Google Scholar
X. Xu, B.W. Chen, R. Gong, M. Zheng, Use of auxiliary source fields in Yasuura’s method, in Proceedings of the 2017 IEEE International Conference on Computational Electromagnetics (ICCEM2017), vol. 2C1.2 (Kumamoto, Japan, 2017)
Google Scholar
K. Yasuura, T. Itakura, Approximation method for wave functions (I). Kyushu Univ. Tech. Rep. 38(1), 72–77 (1965)
Google Scholar
K. Yasuura, T. Itakura, Complete set of wave functions – approximation method for wave functions (II). Kyushu Univ. Tech. Rep. 38(4), 378–385 (1966)
Google Scholar
K. Yasuura, T. Itakura, Approximation algorithm by complete set of wave functions – approximation method for wave functions (III). Kyushu Univ. Tech. Rep. 39(1), 51–56 (1966)
Google Scholar
K. Yasuura, H. Ikuno, Smoothing process on the mode-matching method for solving two-dimensional scattering problems. Mem. Fac. Eng. Kyushu Univ. 37(4), 175–192 (1977)
Google Scholar
K. Yasuura, Y. Okuno, Singular-smoothing procedure on Fourier analysis. Mem. Fac. Eng. Kyushu Univ. 41(2), 123–141 (1981)
Google Scholar
K. Yasuura, M. Tomita, Convergency of approximate wave functions on the boundary – the case of inner domain. Kyushu Univ. Tech. Rep. 52(1), 79–86 (1979)
Google Scholar
K. Yasuura, M. Tomita, Convergency of approximate wave functions on the boundary – the case of outer domain. Kyushu Univ. Tech. Rep. 52(1), 87–93 (1979)
Google Scholar
K. Yasuura, M. Tomita, Numerical analysis of plane wave scattering from dielectric cylinders. Trans. IECE Jpn. 62-B(2), 132–139 (1979)
Google Scholar
K.A. Zaki, A.R. Neureuther, Scattering from a perfectly conducting surface with a sinusoidal height profile: TE polarization. IEEE Trans. Antennas Propagat. AP-19(2), 208–214 (1971)
Google Scholar

Download references

Acknowledgements

The authors thank Mr. BenWen Chen and Mr. Rui Gong, Centre for Optical and Electromagnetic Research, South China Academy of Advanced Optoelectronics, South China Normal University for preparing the figures in Sect. 8.3.1 including numerical computations.

One of the authors (A.M.) wish to express his thanks to Japan Society for Promotion of Science (JSPS) for partial support to the work in Sects. 8.3.2 and 8.3.5 under Grant Number JP15K06023 (KAKENHI).

Another one of the authors (Y.O.) is grateful to Prof. S. He, COER-SCNU, COER-ZJU, and JORSEP-KTH for his continuous help and encouragement.

Author information

Authors and Affiliations

Kumamoto University, Kumamoto, 860-8555, Japan
Akira Matsushima
National Institute of Technology, Kumamoto College, Kumamoto, 861-1102, Japan
Toyonori Matsuda
South China Normal University, Guangzhou, 510006, China
Yoichi Okuno

Authors

Akira Matsushima
View author publications
You can also search for this author in PubMed Google Scholar
Toyonori Matsuda
View author publications
You can also search for this author in PubMed Google Scholar
Yoichi Okuno
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Akira Matsushima .

Editor information

Editors and Affiliations

Leibniz-Institut für Werkstofforientierte Technologien—IWT, Bremen, Germany
Thomas Wriedt
Faculty of Computational Mathematics and Cybernetics, Lomonosov Moscow State University, Moscow, Russia
Yuri Eremin

Appendices

Appendix 1: H-Wave Scattering by a PC Cylinder

Let us consider a problem where an H-wave (TM-wave) is incident to the obstacle shown in Fig. 8.1. That is, the incident wave is polarized in the xy-plane so that the incident magnetic field has only a z-component

$$\begin{aligned} \mathbf{H}^\mathrm{i}(\mathbf{r}) = \mathbf{u}_z F(\mathbf{r}) = \mathbf{u}_z \exp [-\mathrm{i}kr \cos (\theta -\iota )]. \end{aligned}$$

(8.92)

The scattered magnetic field has only a z-component

$$\begin{aligned} \mathbf{H}^\mathrm{s}(\mathbf{r}) = \mathbf{u}_z \varPsi (\mathbf{r}) \end{aligned}$$

(8.93)

which is a leading field of the problem. Thus, we have

Problem 1’: H-wave, PC. Find $\varPsi (\mathbf{r})$ that satisfies:

(N1):

The 2-D Helmholtz equation in $\mathrm{S}_\mathrm{e}$;

(N2):

The 2-D radiation condition at infinity;

(N3):

The boundary condition

$$\begin{aligned} \partial _\nu \varPsi (s) = g(s) \equiv -\partial _\nu F(s) \quad (s \in \text{ C }). \end{aligned}$$

(8.94)

Here, $\partial _\nu $ denotes a normal derivative at s. Equation (8.94) is called Neumann’s or the second-kind boundary condition.

Employing the Green’s (or Neumann’s) function of this boundary-value problem satisfying a homogeneous boundary condition

$$\begin{aligned} \partial _\nu N(\mathbf{r}, s) = 0\quad (\mathbf{r}\in \mathrm{S}; s \in \mathrm{C}), \end{aligned}$$

(8.95)

we get a formal representation similar to (8.13)

$$\begin{aligned} \varPsi _N(\mathbf{r}) - \varPsi (\mathbf{r}) = -\int \limits _{s=0}^C N(\mathbf{r},s) \left[ \partial _\nu \varPsi _N(s) - g(s)\right] \,ds\quad (\mathbf{r}\in \mathrm{S}). \end{aligned}$$

(8.96)

Here, $\varPsi _N$ denotes an approximate solution defined by

$$\begin{aligned} \varPsi _N(\mathbf{r}) = \sum _{m=-N}^N B_m(M)\varphi _m(\mathbf{r}). \end{aligned}$$

(8.97)

After a discussion similar to that in Sect. 8.2.2.3, we have a least-squares problem for the H-wave problem:

LSM 1’: H-wave, PC. Find the coefficients $B_m(M)$ $(m=0, \pm 1, \ldots , \pm N)$ that minimize the mean-squares boundary residual

$$\begin{aligned} E_N = \frac{\left\| \partial _\nu \varPsi _N - g \right\| ^2}{\Vert g \Vert ^2} = \frac{1}{\Vert g \Vert ^2} \left\| \sum _{m=-N}^N B_m(M)\partial _\nu \varphi _m - g \right\| ^2. \end{aligned}$$

(8.98)

We can solve this problem on a computer following the procedure in Sect. 8.2.3. Approximations to other nonzero components can be found by

$$\begin{aligned} \mathbf{E}_N^\mathrm{s}(\mathbf{r}) = \frac{1}{\mathrm{i}\omega \varepsilon _0}\nabla \varPsi _N(\mathbf{r}) \times \mathbf{u}_z. \end{aligned}$$

(8.99)

It is worth noting that in an H-wave scattering from a dielectric obstacle, the boundary condition (8.42) should be altered slightly. Let $\mathbf{H}^\mathrm{s}(\mathbf{r})=\mathbf{u}_z\varPsi _\mathrm{e}(\mathbf{r})$, and $\mathbf{H}^\mathrm{t}(\mathbf{r})=\mathbf{u}_z\varPsi _{\,\mathrm i}(\mathbf{r})$, then we have

$$\begin{aligned} \left\{ \begin{array}{l} \varPsi _\mathrm{e}(s)-\varPsi _{\,\mathrm i}(s)=f(s)\equiv -F(s)\\ \partial _\nu \varPsi _\mathrm{e} - n^{-2} \partial _\nu \varPsi _{\,\mathrm i}(s) = g(s) \equiv -\partial _\nu F(s), \end{array}\right. \end{aligned}$$

(8.100)

where the second line means the electric-field continuity and $n^2 = \varepsilon /\varepsilon _0$.

Appendix 2: Solution of LSP 1 by a Normal Equation and Related Topics

Although we do not use a normal equation in numerical analysis, we look over the solution method by the equation because it is an important theoretical tool in working with a least-squares problem. Let us define an inner product between two functions in $\mathbf{H}= L^2(0, C)$ by

$$\begin{aligned} (f,g)=\int \limits _{s=0}^C \overline{f(s)}g(s)\,ds, \end{aligned}$$

(8.101)

then we find that $\Vert f \Vert = \sqrt{(f,f)}$. Employing these relations, we modify (8.22) to obtain

$$\begin{aligned} E_N = \!\! \sum _{m=-N}^N \sum _{n=-N}^N \overline{A_m}(\varphi _m,\varphi _n) A_n - \!\! \sum _{m=-N}^N \overline{A_m} (\varphi _m,f) - \!\! \sum _{n=-N}^N (f,\varphi _n)A_n + \Vert f \Vert ^2. \end{aligned}$$

(8.102)

The predictable M is not shown.

Now we define a subspace of $\mathbf{H}$, $\varPhi _N$, spanned by the boundary values of a finite number of modal functions $\{\varphi _0(s),\varphi _{\pm 1}, \ldots ,$ $\varphi _{\pm N}\}$. An element of $\varPhi _N$ can be represented as

$$\begin{aligned} \varPsi _N(s) = \sum _{n=-N}^N A_n\,\varphi _n(s). \end{aligned}$$

(8.103)

Apparently, there is a minimum value of $E_N$, which is a squared distance between f(s) and a point in $\varPhi _N$.^{Footnote 32} The minimum is achieved when (8.103) agrees with the foot of a perpendicular line from f(s) to the surface of $\varPhi _N$. The necessary and sufficient condition for this is that: The $A_m$ coefficients are the solutions of the set of linear equations

$$\begin{aligned} \sum _{n=-N}^N (\varphi _m,\varphi _n)\,A_n = (\varphi _m, f)\quad (m=0,\pm 1,\ldots ,\pm N). \end{aligned}$$

(8.104)

This is referred to as the normal equation (NE) of LSP 1 and is a formal solution to the problem.^{Footnote 33}

Next, let us consider the minimization from a computational point of view. That is, we try to find the $A_m$ coefficients using the sampled values of boundary functions; and the functions are represented by J-dimensional complex-valued vectors $\mathbf{f}$, $\varvec{\varphi }_m$, and $\varvec{\varPsi }_N$ as in Sect. 8.2.3. This leads us to DLSP 1. We know the orthogonal decompositions are useful tools for solving the problem. However, setting them aside, we here consider a NE based on DLSP 1. Because the Jacobian matrix $\varPhi $ is $J \times M$ $(J > M)$, the set of linear equations

$$\begin{aligned} \varPhi \mathbf{A} = \mathbf{f} \end{aligned}$$

(8.105)

is over-determined and does not have a usual solution. However, if we multiply (8.105) by $\varPhi ^\dag $ from the left, we have

$$\begin{aligned} \text{ H } \mathbf{A} = \mathbf{b}, \end{aligned}$$

(8.106)

where

$$\begin{aligned} \text{ H } = \varPhi ^\dag \varPhi \end{aligned}$$

(8.107)

is an $M \times M$ positive-definite Hermitian matrix provided $\varPhi $ is full rank. And,

$$\begin{aligned} \mathbf{b} = \varPhi ^\dag \mathbf{f} \end{aligned}$$

(8.108)

is an M-dimensional right-hand side. Usually, (8.106) is referred to as the NE of DLSP 1 and has been employed as a standard method of solution for a long time.

Obviously, (8.106) is an approximation of (8.104). For example, an (m, n)th element of the coefficient matrix of (8.104) can be represented as

$$\begin{aligned} (\varphi _m,\varphi _n) = \int \limits _{s=0}^C\overline{\varphi _m(s)}\varphi _n(s)\, ds \simeq \frac{C}{J}\sum _{j=1}^J\overline{\varphi _m(s_j)}\varphi _n(s_j) = \frac{C}{J}\,\varvec{\varphi }_m^\dag \varvec{\varphi }_n. \end{aligned}$$

(8.109)

The right-hand side of (8.109) is the (m, n)th entry of H multiplied by the line element C / J. Hence, (8.104) and (8.106) are essentially the same thing, and they have common weak points in numerical computations. Widely-accepted key observations are:

The NE is rigorous, in principle, and can be employed in theoretical considerations;
The NE combined with Gaussian elimination (diagonal pivoting assumed) is equivalent to the (modified) Schmidt QRD except for the next two items;
The NE may lose information in constructing $\mathrm{H} = \varPhi ^\dag \varPhi $, and this process is time consuming usually;
The NE is dominated by the condition number of H that is square of the original condition number: $\text{ cond }(\text{ H }) = [\text{ cond }(\varPhi )]^2$.

The last item means (8.104) and (8.106) are more sensitive to computational errors than LSP 1 and DLSP 1. Therefore, the NE’s are more difficult to solve on a computer than the original least-squares problems. We, hence, do not recommend the use of (8.104) or (8.106). Even if we are working in the case where the inner products in (8.104) can be calculated analytically, we should not employ (8.104) because of the last item.

Before closing this Appendix, we would like to state a couple of comments on (8.105). Apparently, J cannot be less than M because (8.105) is indeterminate for $J < M$. If we set $J = M$, we have a point-matching method (PMM) or a collocation method. The method is known to be effective if the contour C coincides with a part of a coordinate curve of a system of coordinates in which Helmholtz’s equation is separable; and that the modal functions are the separated solutions in that system. Convergence of the PMM solution is related to the validity of the Rayleigh hypothesis [2, 3, 18].

In Yasuura’s method we usually set $J = 2M$ as we see in Sect. 8.3.1. That is, we employ 2M linear equations to determine M unknown coefficients. This may be understood as a small device or improvement of the PMM. However, this produces good results such as proof of convergence, wide range of application, and so on with little increase of computational complexity as a reasonable cost.

Appendix 3: Conical Diffraction by a Grating

In Sect. 8.2.5 we dealt with diffraction by a grating, where all the field components were functions of two variables (X and Y) and two independent cases of polarization [E-wave (TE, s) and H-wave (TM, p)] existed. In addition, the directions of propagating diffraction-orders were parallel to the plane of incidence. These were possible because: (1) the grating surface was uniform in Z; and (2) the plane of incidence was in parallel to the direction of periodicity $\mathbf{u}_X$. Here, we concisely examine the problem of a lossless dielectric grating in which the second condition is not satisfied, i.e., the plane of incidence makes a nonzero angle $\phi $ with the positive X-direction as shown in Fig. 8.14a. We will see that

The field components are functions of X, Y, and Z, but the dependence on Y—the direction of uniformity—is limited;
The two cases of polarization are not independent, i.e., both TE and TM diffracted waves exist for TE (or TM) incidence^{Footnote 34};
The direction of propagating orders lie on the surface of a cone whose vertex agrees with the coordinate origin O; the direction of the zeroth mode is on the plane of incidence at the same time.

Because of the third characteristic, this arrangement ($\phi \not = 0$) is called conical mounting and the term conical diffraction is used. In this connection, the arrangement in Sect. 8.2.5 is termed planar mounting.

Let the incident wave be

$$\begin{aligned} \left[ \begin{array}{c} \mathbf{E}^\mathrm{i} \\ \mathbf{H}^\mathrm{i} \end{array} \right] (\mathbf{r}) = \left[ \begin{array}{c}{} \mathbf{e}^\mathrm{i}\\ \mathbf{h}^\mathrm{i} \end{array}\right] \exp (-\mathrm{i}\mathbf{k}^\mathrm{i} \cdot \mathbf{r}). \end{aligned}$$

(8.110)

Here, $\mathbf{e}^\mathrm{i}$ and $\mathbf{h}^\mathrm{i}$ are electric- and magnetic-field amplitude, which are related by

$$\begin{aligned} \mathbf{h}^\mathrm{i} = \frac{1}{\omega \mu _0}\, \mathbf{k}^\mathrm{i} \times \mathbf{e}^\mathrm{i} \end{aligned}$$

(8.111)

and $\mathbf{k}^\mathrm{i}$ is the incident wavevector defined by

$$\begin{aligned} \mathbf{k}^\mathrm{i} = (n_1 k\sin \theta \cos \phi , n_1 k\sin \theta \sin \phi , -n_1 k\cos \theta ) \equiv (\alpha , \beta , -\gamma ) \end{aligned}$$

(8.112)

with $\theta $ being the polar angle between the wavevector $\mathbf{k}^\mathrm{i}$ and the grating normal $\mathbf{u}_Z$. We decompose the incident wave into a TE(s)- and a TM(p)-component, where TE (or TM) means that the electric (or magnetic) field of the relevant incident wave is perpendicular to the plane of incidence. To do this, we define two unit vectors that span a plane orthogonal to the incident wavevector

$$\begin{aligned} \mathbf{e}^\mathrm{TE}=(\sin \phi ,-\cos \phi ,0), \quad \mathbf{e}^\mathrm{TM}=(\cos \theta \cos \phi ,\cos \theta \sin \phi ,\sin \theta ). \end{aligned}$$

(8.113)

They give the directions of the incident electric fields that are in the TE- and TM-polarization.^{Footnote 35} Thus the decomposition is

$$\begin{aligned} \mathbf{e}^\mathrm{i} = \mathbf{e}^\mathrm{TE}\cos \delta + \mathbf{e}^\mathrm{TM}\sin \delta , \end{aligned}$$

(8.114)

where $\delta $ is a polarization angle shown in Fig. 8.14b. $\delta = 0$ and $\pi /2$ mean TE- and TM-incidence. Hence, an incident wave has three angular parameters: $\phi $, $\theta $, and $\delta $.

We consider the problem to seek the diffracted electric and magnetic field in the semi-infinite regions $\text{ V }_1$ and $\text{ V }_2$ over and below the grating surface $\text{ S }_\mathrm{G}$.

Problem 4 conical, dielectric grating. Find the solutions that satisfy the following requirements:

(CD1):: The Helmholtz equation in $\text{ V }_1$ and $\text{ V }_2$;
(CD2):: Radiation conditions in the positive and negative Z-direction;
(CD3):: A periodicity condition that: the relation $f(X+d,Y,Z) = e^{\mathrm{i}\alpha d}f(X,Y,Z)$ holds for any component of the diffracted wave, and the phase constant in Y is $\beta $;
(CD4):: The total tangential component of electric and magnetic field must be continuous across the grating surface $\text{ S }_\mathrm{G}$.

Dealing with a problem of conical diffraction, we should keep in mind the unique nature of the problem. First, because every field component has a common phase constant $\beta $ in Y, it is sufficient to match the boundary condition on a cross section between the grating surface and a plane $Y = \text{ const }$. The conically-mounted gratings, hence, belong to the class of quasi-3-D structures. Second, because the TE- and TM-wave are not independent, we always need both TE and TM vector modal functions in constructing approximate solutions.

We define the modal functions satisfying (CD1)–(CD3) by

$$\begin{aligned} \left\{ \begin{array}{l} \varvec{\varphi }_{\ell m}^\mathrm{TE}(\mathbf{r}) = \mathbf{e}_{\ell m}^\mathrm{TE} \exp (-\mathrm{i}\mathbf{k}_{\ell m} \cdot \mathbf{r}), \quad \varvec{\varphi }_{\ell m}^\mathrm{TM}(\mathbf{r}) = \mathbf{e}_{\ell m}^\mathrm{TM} \exp (-\mathrm{i}\mathbf{k}_{\ell m} \cdot \mathbf{r}) \\ \quad \quad \quad (\ell = 1, 2;\; m = 0, \pm 1, \pm 2, \ldots ). \end{array} \right. \end{aligned}$$

(8.115)

Here,

$$\begin{aligned} \mathbf{e}_{\ell m}^\mathrm{TE} = \frac{ \mathbf{k}_{\ell m} \times \mathbf{u}_Z}{|\mathbf{k}_{\ell m} \times \mathbf{u}_Z|}, \quad \mathbf{e}_{\mathrm{p}m}^\mathrm{TM} = \frac{ \mathbf{e}_{\ell m}^\mathrm{TE} \times \mathbf{k}_{\ell m}}{|\mathbf{e}_{\ell m}^\mathrm{TE} \times \mathbf{k}_{\ell m}|} \quad (\ell = 1, 2), \end{aligned}$$

(8.116)

$$\begin{aligned} \mathbf{k}_{1m} = (\alpha _m, \beta , \gamma _{1m}),\quad \mathbf{k}_{2m} = (\alpha _m, \beta , -\gamma _{2m}), \end{aligned}$$

(8.117)

and

$$\begin{aligned} \left\{ \begin{array}{l} \displaystyle \alpha _m = \alpha + \frac{2m\pi }{d}, \quad \gamma _{\ell m} = \sqrt{(n_\ell k)^2 - (\alpha _m^2 + \beta ^2)} \\ \quad \quad \quad (\text{ Re }\,\gamma _{\ell m} \ge 0,\; \text{ Im }\,\gamma _{\ell m} \le 0). \end{array} \right. \end{aligned}$$

(8.118)

Note that the functions in (8.115) are for constructing electric fields. For the magnetic fields we get

$$\begin{aligned} \varvec{\psi }_{\ell m}^\mathrm{q}(\mathbf{r}) = \frac{1}{\omega \mu _0}\, \mathbf{k}_{\ell m} \times \varvec{\varphi }_{\ell m}^\mathrm{q}(\mathbf{r}) \quad (\ell = 1, 2;\; \mathrm{q}= \mathrm{TE, TM}) \end{aligned}$$

(8.119)

through Maxwell’s equations. Finite linear combinations of the modal functions define approximate solutions:

$$\begin{aligned} \left[ \begin{array}{c} \mathbf{E}_{\ell N} \\ \mathbf{H}_{\ell N} \end{array} \right] (\mathbf{r}) =\sum _{m=-N}^N A_{\ell m}^\mathrm{TE} \left[ \begin{array}{c} \varvec{\varphi }_{\ell m}^\mathrm{TE}\\ \varvec{\psi }_{\ell m}^\mathrm{TE}\end{array}\right] (\mathbf{r}) + \sum _{m=-N}^N A_{\ell m}^\mathrm{TM} \left[ \begin{array}{c} \varvec{\varphi }_{\ell m}^\mathrm{TM}\\ \varvec{\psi }_{\ell m}^\mathrm{TM} \end{array}\right] (\mathbf{r}) \quad (\ell = 1, 2) \end{aligned}$$

(8.120)

Here, the number of modal functions M is neglected.

The unknown coefficients in (8.120) should be determined in order that the solutions satisfy the boundary condition (CD4) approximately in the mean-squares sense. For this purpose we first consider the cross section C between the grating surface $\mathrm{S}_\mathrm{G}$ and a plane $Y = 0$. This is the same thing as the periodic curve C in Sect. 8.2.5. In a similar way to one in Sect. 8.2.5, we define the primary period $\mathrm{S}_1$, whose boundary $\mathrm{C}_1$ ($\subset \mathrm{C}$), the function space $\mathbf{H}$ consisting of all the square integrable functions on $\mathrm{C}_1$, and the norm $\Vert f \Vert $ of a function f(s). Then, we can state the least-squares problem that determines the unknown coefficients:

LSP 4: conical, dielectric grating. Find the coefficients $A_{\ell m}^\mathrm{TM}$ and $A_{\ell m}^\mathrm{TM}$ $(\ell = 1, 2;\; m=0, \pm 1, \ldots , \pm N)$ that minimize the mean-square error

$$\begin{aligned} E_N = \left\| \varvec{\nu }\times \left( \tilde{\mathbf{E}}_{1N} + \tilde{\mathbf{E}}^\mathrm{i} - \tilde{\mathbf{E}}_{2N} \right) \right\| ^2 + Z_0^2 \left\| \varvec{\nu }\times \left( \tilde{\mathbf{H}}_{1N} + \tilde{\mathbf{H}}^\mathrm{i} - \tilde{\mathbf{H}}_{2N} \right) \right\| ^2. \end{aligned}$$

(8.121)

Here, $Z_0$ denotes the intrinsic impedance of vacuum and $\tilde{\mathbf{E}}^\mathrm{i}$ etc. mean periodic functions with respect to x defined in the same way as one in (8.69)–(8.71). The method of discretization and the solution method are found in Sect. 8.2.4.

Appendix 4: Comparison of Modal Functions and Algorithm of the SP

Here we show some results of effectiveness comparison between three kinds of modal functions in solving a sample problem^{Footnote 36}: E-wave scattering from a PC cylinder whose cross section is given by^{Footnote 37}

$$\begin{aligned} \text{ C }: r_s = a(1+0.2 \cos 3\theta _s). \end{aligned}$$

(8.122)

Let us normalize every quantity having dimension of length by the total length of C. And, we assume the incident wave comes along the x-axis from the negative x-direction (i.e., $\iota =0$).

The modal functions considered here are: (a) the separated solutions, which we defined by (8.7) in Sect. 8.2.2; (b) monopole fields defined by (8.9); (c) monopole fields whose poles are located densely near the convex part of C. Because the separated solutions are known widely, we explain the monopole fields below:

(b) Equally spaced poles.:

Let L be a similar curve to C with the ratio of similitude $d\ (0< d < 1)$.^{Footnote 38} We arrange M poles on L at regular intervals. Then, the distance between two poles is L / M where L is the length of L.

(c) Concentration of poles near the convex parts of L.:

(i) First, we draw the similar curve L. (ii) Next,we calculate the curvature $\kappa (t)$ of L as a function of $t\;(\in \text{ L })$, and add some positive bias c in order that the biased curvature (BC) be no less than 0: $\tilde{\kappa }(t) = \kappa (t) + c\ (\ge 0)$. (iii) Thirdly, we define a probability density function by normalizing the BC.^{Footnote 39}

$$\begin{aligned} f(t) = \frac{\displaystyle \int \limits _0^t \tilde{\kappa }(t')\, dt'}{\displaystyle \int \limits _0^L \tilde{\kappa }(t')\, dt'}. \end{aligned}$$

(8.123)

Thus we get the number of poles between $t_1$ and $t_2$ by

$$\begin{aligned} n(t_1, t_2) = M\int \limits _{t_1}^{t_2} f(t)\, dt. \end{aligned}$$

(8.124)

We have solved the problem using the method explained in Sects. 8.2.2 and 8.2.3. We used three kinds of modal functions (a), (b), and (c); and tried at two frequencies: $ka = 10$ and 30. The parameter d was set to be 0.87. To see the accuracy of a solution we calculated two kinds of errors: the normalized mean-square error $E_M(\text{ m })$ and the error on energy balance (or on the optical theorem) $e_M(\text{ m })$. The former is the same thing as one defined in (8.22) and (8.30)^{Footnote 40} except that the subscript shows the total number of modal functions. The latter shows the deviation from a proportional relation between the forward scattering amplitude and total cross section.^{Footnote 41} The argument m shows the type of modal functions: m $=$ sep, esp, and pcc, which mean (a) separated solutions, (b) equally-spaced poles, and (c) poles concentrated near the convex parts.

Results at $ka=10$.:: Because the obstacle size is handy, the $E_M$ errors fall off rapidly: $E_{45}(\mathrm{sep})$, $E_{35}(\mathrm{esp})$, and $E_{31}(\mathrm{pcc})$ are below 1%. As for the $e_M$ errors of the solutions, the situation is different. The solutions with esp or pcc modal functions converge rapidly as $e_M(\mathrm{esp})$ and $e_M(\mathrm{pcc})$ are below 1% at $M \simeq 30$. On the other hand, $e_{31}(\mathrm{sep})$ is about 10%. Increasing M to 70, we have: $e_{70}(\mathrm{esp})=9\times 10^{-5}$%; $e_{70}(\mathrm{pcc})=1\times 10^{-5}$%; and $e_{71}(\mathrm{sep})=4$%.
Results at $ka=30$.:: The advantage of the monopole fields is clear in this range of frequency. Setting $M\simeq 100$, we have $E_{101}(\mathrm{sep})=4$%, $E_{100}(\mathrm{esp})=2\times 10^{-1}$%, $E_{100}(\mathrm{pcc})=2\times 10^{-3}$%, $e_{101}(\mathrm{sep})=7$%, $e_{100}(\mathrm{esp})=2\times 10^{-1}$%, and $e_{100}(\mathrm{pcc})=5\times 10^{-3} $%. The pcc modal functions seems to be the best choice in solving the problem. In fact, we can find an accurate solution with a $10^{-5}$% $e_M$ error by setting $\text{ m } = \text{ pcc }$ and $M=120$.

These results mean that the potential of a combination of separated solutions is not so strong in describing scattered fields from obstacles deformed strongly from a circle. We have two ways to cope with this issue: (i) employment of a set of modal functions other than the separated solutions^{Footnote 42}; and (ii) employment of the SP.

The Algorithm of the SP

Here we include a guidance how to apply the SP in the boundary-matching process based on Yasuura’s method of modal expansion for convenience. We start from DLSP 1, i.e., minimization of the numerator of (8.30), $\Vert \varPhi \mathbf{A}- \mathbf{f} \Vert ^2$. Instead of minimizing it directly, we force a constraint

$$\begin{aligned} (\mathbf{1}, \varPhi \mathbf{A}- \mathbf{f}) = 0 \end{aligned}$$

(8.125)

on the M-dimensional solution vector A, where the parentheses mean an inner product and $\mathbf{1}=[1\ 1\ \cdots \ 1]^\mathrm{T}$ is a J-dimensional constant vector.

An operator of the smoothing procedure (in a discretized form) is a $J\times J$ matrix given by

$$\begin{aligned} \mathrm{K}^{(p)} = \left[ K^{(p)}_{j\ell } \right] , \end{aligned}$$

(8.126)

where p means the order of the SP. The explicit forms of the matrix elements for $p=1, 2$, and 3 are^{Footnote 43}:

$$\begin{aligned} \left\{ \begin{array}{l} \displaystyle K^{(1)}_{j\ell } = u(j-\ell ) - \frac{j-\ell }{J} - \frac{1}{2}, \\ \displaystyle K^{(2)}_{j\ell } = -\frac{1}{2} \left[ \frac{(j-\ell )^2}{J^2} - \frac{|j-\ell |}{J} + \frac{1}{6} \right] , \\ \displaystyle K^{(3)}_{j\ell } = \frac{1}{6} \left[ \frac{|j-\ell |^3}{J^3} - \frac{3(j-\ell )^2}{2J^2} + \frac{|j-\ell |}{2J} + \frac{1}{6} \right] . \end{array} \right. \end{aligned}$$

(8.127)

Thus we can state a method of solution with the SP as follows:

DLSP 3: E-wave, PC, SP. Find the solution vector A that minimizes the discretized mean-square error

$$\begin{aligned} E_{MJ}= \frac{\Vert \mathrm{K}^{(p)}(\varPhi \mathbf{A}- \mathbf{f}) \Vert ^2}{\Vert \mathrm{K}^{(p)}{} \mathbf{f\,} \Vert ^2} \end{aligned}$$

(8.128)

under the constraint (8.125).

Two ways are possible to solve this conditioned least-squares problem: (i) employment of Lagrange’s multiplier; and (ii) elimination of a modal coefficient by using the constraint. Although (i) is a standard way in handling a constraint, we take (ii) because our constraint is simple and can eliminate one of the M unknowns to deduce a least-squares problem with $M-1$ unknowns.

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Matsushima, A., Matsuda, T., Okuno, Y. (2018). Introduction to Yasuura’s Method of Modal Expansion with Application to Grating Problems. In: Wriedt, T., Eremin, Y. (eds) The Generalized Multipole Technique for Light Scattering. Springer Series on Atomic, Optical, and Plasma Physics, vol 99. Springer, Cham. https://doi.org/10.1007/978-3-319-74890-0_8

Download citation

DOI: https://doi.org/10.1007/978-3-319-74890-0_8
Published: 10 March 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-74889-4
Online ISBN: 978-3-319-74890-0
eBook Packages: Physics and AstronomyPhysics and Astronomy (R0)

Publish with us

Policies and ethics