Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Miniaturized optical components are very popular nowadays. For example, lenses in cellular phones need to be small and light weight and shall be equipped with an optical zoom. Micro-optical components are employed also in sensors, for data transmission, and chemical analysis.

Miniaturization is a disadvantage for optical imaging because every optical component is an aperture in the optical path and every aperture results in diffraction. As shown on the left of Fig. 127, light due to its wave nature does not propagate as straight lines but shows diffraction. The light intensity far behind a narrow aperture illuminated by parallel light does not show a rectangular profile. For the same reason, the focus of a lens with diameter DL and focal length fB cannot be arbitrarily small but the intensity distribution IL is given by the following equation:

Fig. 127
figure 1_14

Optical diffraction of light waves at an aperture (left) and a lens (right)

$$\rm {I_{\rm{L}}} = {I_0}\;{\left[ {\frac{{\sin \left( {\pi\,\,x\left( {{{{{D_{\rm{L}}}}} \left/ {{\lambda\,\,{f_{\rm{B}}}}} \right.}} \right)} \right)}}{{\left( {\pi\,\,x\left( {{{{{D_{\rm{L}}}}} \left/ {{\lambda\,\,{f_{\rm{B}}}}} \right.}} \right)} \right)}}} \right]^2}. $$
(314)

In the above equation, λ, x, and I0 are the wavelength of the light used, distance from the optical axis, and intensity of the light illuminating the aperture, respectively. The intensity distribution calculated with (314) is shown on the right of Fig. 127. Obviously, a smaller diameter DL of the lens broadens the intensity distribution. As a consequence, an image projected with a lens becomes sharper when the lens diameter is enlarged and miniaturization of the lens results in a diffuse projection. Therefore, there are always other reasons for miniaturization of optical components which are more important for a certain application than the loss in sharpness, e.g., small mass, small size, and low cost.

If an optical spectrum is to be analyzed, usually a diffraction grating is employed. A diffraction grating is a large number of narrow optical apertures arranged next to each other. Typically, transparent or reflecting parallel slits are made on a surface which is opaque besides the slits.

To describe how an optical grating works, first only two transparent slits are considered – a so-called double slit. A plane wave arriving at the two slits can be considered behind the double slit as two point sources of light which are emitting light in phase as spherical waves (cf. Fig. 128a). At certain angles, the wave crests and wave troughs of these waves coincide. On a screen far behind the double slit, the light intensity shows maxima at these angles. The formation of maxima is called constructive interference, the formation of minima where the crest of one wave compensates the trough of another wave is called destructive interference.

Fig. 128
figure 2_14

Light interference behind a double slit (a, b) and a grating (c)

The angles β of maximum intensity are found where the optical path difference between the two waves is an integer multiple of the wavelength λ. Thus, as seen in Fig. 128b, intensity maxima are found where the following equation is fulfilled:

$$\rm m\,\,\lambda = g\sin \left( \beta \right) \Rightarrow \sin \left( \beta \right) \approx \beta = \frac{{m\,\lambda }}{g}. $$
(315)

In the above equation, g is the distance of the centers of the two slits. It is called the grid constant. m is an integer which is called the diffraction order.

If not only two but also several or many slits with a constant distance g are employed, the maxima of all the spherical waves propagating from these slits interfere with each other and generate a grid of narrow maxima of equal intensity on the screen (cf. Fig. 128c). Except the zero order maximum (m = 0), the angle of constructive interference β is a function of the wavelength. The higher the order of a maximum is, the more is the spectrum spread and the larger is the resolution for analyzing it. As a consequence, a grid can be used to separate the light into its colors and to display and analyze its spectrum.

If the light is not illuminating the grid perpendicular, the angle of incidence needs also to be taken into account when the position of the maxima are calculated. Figure 129 shows that the following equation needs to be satisfied for constructive interference:

Fig. 129
figure 3_14

Interference at a not perpendicularly illuminated grid

$$\rm m\,\,\lambda = g\left( {\sin \left( \alpha \right) + \sin \left( \beta \right)} \right) \Rightarrow \sin \left( \alpha \right) + \sin \left( \beta \right) = \frac{{m\,\lambda }}{g} \approx \alpha + \beta. $$
(316)

Note that α and β as shown in Fig. 129 are defined to be positive. One of them can be negative when it is on the opposite side of the dashed line normal to the grid. By choosing α the designer can control the position of, e.g., the first-order maximum of a certain wavelength.

Besides transmission grids, there are also reflection grids. As shown in Fig. 130a, at a reflection grid, the reflected light is subject to interference. The angle β in (316) is negative in most cases of a reflection gird.

Fig. 130
figure 4_14

Reflection grid (a) and step gird (b)

If a grid is to be used with low-intensity light, it is a disadvantage that roughly half of the light to be analyzed is absorbed. This problem is avoided when a step grid is employed as shown in Fig. 130b. The light reflected at a step is interfering with those reflected at the neighboring steps. As a result of the step, there is a phase difference between light reflected at different steps.

According to the reflection law, the angle α r of the reflected light is the same as the angle of incidence α (cf. Fig. 131):

Fig. 131
figure 5_14

Reflection at a mirror

$$\rm \alpha = {\alpha_{\rm{r}}}. $$
(317)

Due to the slope angle ΦSt of the steps, there is a wavelength for which the angles of reflection and constructive interference coincide. For this wavelength, the so-called blaze wavelength λ B, and, to some smaller extent, the neighboring wavelengths the intensity of the interference maximum is enhanced. This effect can be employed for the design of spectrometers. If the detector is less sensitive in a certain range of wavelengths, e.g., extreme blue light, this can be compensated partly with a blaze in this range. Another possibility is that a certain range of the spectrum is of high importance for analytical purposes.

In Fig. 132, it is seen that the reflection law results in:

Fig. 132
figure 6_14

Blaze angle at a step grid

$$\rm \alpha - {\Phi_{\rm{St}}} = {\Phi_{\rm{St}}} - \beta \Rightarrow {\Phi_{\rm{St}}} = \frac{{\alpha + \beta }}{2}. $$
(318)

Note that β in Fig. 132 is negative according to the definition of (316). If β is the blaze angle for the blaze wavelength λ B, both (318) and (316) need to be fulfilled, and the step angle ΦSt is derived as:

$$\rm {\Phi_{\rm{St}}} = \frac{{m\,\,{\lambda_{\rm{B}}}}}{{2\,g}}. $$
(319)

The blaze angle β at which constructive interference and reflection coincide is a function of the angle of incidence. It is found from the two above equations:

$$\rm \beta = \frac{{m\,\,{\lambda_{\rm{B}}}}}{g} - \alpha. $$
(320)

If the screen shown in Fig. 128b is far behind the double slit, the angles β of light emitted from the neighboring slits and interfering at a certain position on the screen are nearly the same. Therefore, (315) and (316) have been derived correctly. In microtechnique, however, the distance of the screen is never far from the gird. Therefore, the plain of the grid needs to be bent to achieve that light emitted at the same difference between incident angle α and diffraction angle β, i.e., the same phase difference, arrives at the same point. This problem is solved when the light source (or entrance slit of a spectrometer), the gird, and the screen (or detector of a spectrometer) are arranged on a circle (cf. Fig. 133a). Such a circle is called a Rowland circle. It is a geometrical fact that every triangle inside of a given circumcircle and with one side identical to a certain secant shows the same angle opposite to the secant (cf. Fig. 133b).

Fig. 133
figure 7_14

Rowland circle

The surfaces of optical components may show only small roughness, because even a small roughness will result in interference effects. Typically, the components need to be even within a margin which is a 20th of the wavelength.

If light is arriving at a mirror at an angle of incidence of 0°, the incident wave is interfering with the reflected one. As a consequence, the so-called standing wave is generated and the intensity of the light in front of the mirror shows maxima and minima (cf. Fig. 134). At the surface of the mirror, there is a minimum and the distance from one minimum to the next one is half of the wavelength of the light. This phenomenon is observed when a partly transparent screen is placed very near to the surface of the mirror and moved back and forth to the mirror. Standing light waves are also the reason for interference patterns in photo resists on a reflecting substrate which cause uneven side walls of resist patterns after development.

Fig. 134
figure 8_14

Standing light wave on a reflecting substrate (left) and resist edge after developing (right)

An optical component suitable for miniaturization which is often used for analytical purposes is the Fabry-Perot interferometer. It consists of two mirrors which are arranged in parallel to each other (cf. Fig. 135). At least one of these mirrors is partly transparent. Therefore, light can enter into the space between the mirrors and a small part of this light is escaping out of the interferometer again. The integer multiples of the wavelengths λ of the light leaving the interferometer equal the double distance d between the mirrors:

Fig. 135
figure 9_14

Fabry-Perot interferometer and standing light waves developing inside

$$\rm m\,\,\lambda = 2\,\,d. $$
(321)

The reason for this effect is that the standing waves developing on the surfaces of the two mirrors can only interfere constructively if their minima and maxima coincide. That is, the standing wave between the mirrors needs to have a minimum on the surfaces of both mirrors.

The path of light may also change its direction by refraction. Refraction occurs when the medium in which the light is propagating shows a change of its refractive index. The refractive index is the ratio of the velocity of light in vacuum and in the medium. So, the refractive index of the vacuum equals 1. In a medium, the velocity of light is a little bit smaller than in vacuum, and, therefore, the refractive index of media is a little bit larger than 1.

When light is propagating towards a down step in refraction index at an angel of incidence α larger than 0°, it is partly reflected at an angle α r according to (317), and partly transmitted at an angle α t into the medium with the smaller refraction index (see Fig. 136a). One side of each wave crest will arrive earlier at the interface than the other (cf. left side of wave crest in Fig. 136b). The transmitted part of the side arriving earlier will continue propagating with a larger velocity c1, while the other side for a small time t retains its former velocity c2. The distances c1t and c2t the light is traveling within the time t are not equal. Therefore, the direction of propagation will change to a larger angle α t.

Fig. 136
figure 10_14

Reflection and refraction of light

From the geometry in Fig. 136b, it is seen that:

$$\rm {c_1}t = z\cos \left( {90^\circ - {\alpha_t}} \right) = z\sin \left( {{\alpha_t}} \right)\;{\hbox{and}}\;{c_2}\,\,t = z\sin \left( \alpha \right) $$
$$\rm \Rightarrow \frac{{{c_1}\,\,t}}{{\sin \left( {{\alpha_t}} \right)}} = \frac{{{c_2}\,\,t}}{{\sin \left( \alpha \right)}} \Rightarrow \frac{{{c_1}}}{{{c_2}}} = \frac{{{n_2}}}{{{n_1}}} = \frac{{\sin \left( {{\alpha_t}} \right)}}{{\sin \left( \alpha \right)}}. $$
(322)

Equation (322) is Snell’s law with which the refraction angle can be calculated. Since n2 is larger than n1, at a certain angle of incidence α Tot, α t needs to become larger than 90°. This would mean that the light goes back into the medium where it comes from. As a consequence, for angels of incidence larger than α Tot, there is no refraction but only reflection according to the reflection law (317). α Tot is called the angle of total reflection. It can be calculated from (322) by choosing 90° for the angle αt of the transmitted light, i.e., the sine equals 1:

$$\rm \sin \left( {{\alpha_{\rm{Tot}}}} \right) = \frac{{{n_1}}}{{{n_2}}} \Rightarrow {\alpha_{\rm{Tot}}} = \arcsin \left( {\frac{{{n_1}}}{{{n_2}}}} \right). $$
(323)

If a layer with a larger refraction index n2 is surrounded by layers with a smaller one n1, it can be employed as an optical waveguide. Light propagating at an angle larger than the angle of total reflection is confined in the middle layer. Optical fibers are based on this principal, but light is also guided in a layer which is extending laterally and surrounded by layers with a smaller refractive index.

The angle of total reflection limits the maximum angle θ max at which light may enter or exit an optical waveguide (see Fig. 137). The refraction index n0 outside of the waveguide is smaller than inside, and the refraction angel α t is calculated by Snell’s law (322). From Fig. 137, it is seen that sin(α t) = cos(α Tot). Snell’s law is now:

Fig. 137
figure 11_14

Reflection and refraction of light

$$\rm {n_0}\sin \left( {{\theta_{\max }}} \right) = {n_2}\sin \left( {{\alpha_t}} \right) = {n_2}\cos \left( {{\alpha_{\rm{Tot}}}} \right). $$
(324)

The sum of the squares of sine and cosine of every angle α equals 1: sin2(α) + cos2(α) = 1. If the angle of total reflection is inserted into this equation and (323) is used for sin(α Tot), (324) yields:

$$\rm {n_0}\sin \left( {{\theta_{\max }}} \right) = {n_2}\cos \left( {{\alpha_{\rm{Tot}}}} \right) = n_2\sqrt {{1 - \frac{{{n_1^2}}}{{{n_2^2}}}}} = \sqrt {{{n_2^2} - {n_1^2}}} : = {\hbox{NA}}{.} $$
(325)

The quantity NA defined in the above equation is the numerical aperture of the optical waveguide which is a measure of the illumination angle which can be achieved. The numerical aperture is only a function of the refraction indices of the materials it is made of, while the maximum angel θ max at which light may enter into or exit from the waveguide is also a function of the refraction index of the medium outside the fiber.

In optical waveguides, the component of the light perpendicular to the mirrors needs to fulfill the condition of standing waves as in a Fabry-Perot interferometer (cf. 321) [78]. As a consequence, the following equation needs to be fulfilled by the light propagating in the waveguide [53]:

$$\rm m\,\,\lambda = 2\,\,d\cos \left( {{\alpha_{\rm{m}}}} \right) \Rightarrow {\alpha_{\rm{m}}} = {\rm arc} \,cos\left( {\frac{{m\,\,\lambda }}{{2\,\,d}}} \right). $$
(326)

In the above equation, α m is the angle of incidence inside of the optical fiber (Fig. 138). Obviously, light can propagate inside an optical fiber only at distinct angles α m. The light propagating at a certain angle is called a mode. The number of modes is limited because the angle of incidence cannot become larger than the angle of total reflection:

Fig. 138
figure 12_14

Propagation of light in a waveguide at mode m

$$\rm m\; \le \frac{{2\,\,d}}{\lambda }\;\cos \left( {{\alpha_{\rm{Tot}}}} \right). $$
(327)

If the height d of the waveguide is small, only a small number or even only one mode can propagate inside. The kernel of one mode waveguides typically is approximately 1 μm high, while multimode waveguides have a kernel, 100 μm or more in height. An advantage of one mode waveguides is that all the light is traveling the same distance when propagating through the guide. Light which is reflected at larger angles α m of incidence has a longer way to travel, and, therefore, arrives a bit later than the light of other modes. As a consequence, a data bit transmitted through several modes becomes wider, while propagating through an optical fiber. This effect limits the data rate which can be sent through the fiber. Therefore, one mode fibers allow larger data rates.

Equation (327) can be interpreted also in another way: There is a lower limit for the wavelength λ of light which can propagate through a thin waveguide. Thus, a waveguide works as a filter:

$$\rm \lambda \; \le \frac{{2\,\,d}}{m}\cos \left( {{\alpha_{\rm{Tot}}}} \right). $$
(328)

The light confined in an optical waveguide is not able to leave the middle layer according to classical theory. However, quantum mechanics is needed to describe all aspects of light propagation in waveguides. According to quantum mechanics, the wave which is employed to describe the light is a measure of the probability that a light quantum, a so-called photon, is found at a certain position. The wave function is not zero in the layer with the larger refraction index but extends into this layer with decreasing amplitude. The part of the wave function which is in the “forbidden” region is called the evanescent field (cf. Fig. 139). Thus, there is a little probability that photons exist in the layers with the larger refraction index and even outside of the entire waveguide. As a consequence, the light inside of the waveguide can be affected by media outside. Some sensors employ this effect.

Fig. 139
figure 13_14

Light tunneling from a waveguide to a neighboring one (a) and power in the two waveguides (b)

Due to the evanescent field, there is a way for the light in a waveguide to “tunnel” through the barrier of the lower refractive index into a neighboring region with higher refraction index. After some time and propagation length, the entire wave function has shifted to the other waveguide (cf. Fig. 139a), and after the same distance of propagation it has shifted back into the initial waveguide. A part of the light is absorbed during tunneling, and, therefore, the amplitude of the wave function is reducing.

This effect can be employed to distribute light from a single into two waveguides. The power as a function of the position x in waveguides 1 and 2, respectively, can be calculated with the following equations:

$$\rm {P_1}(x) = {\cos^2}\left( {\kappa {\rm x}} \right){{\hbox{e}}^{ - {\alpha_{\rm{op}}}x}}, $$
(329)
$$\rm {P_2}(x) = {\sin^2}\left( {\kappa {\rm x}} \right){{\hbox{e}}^{ - {\alpha_{\rm{op}}}x}}. $$
(330)

The above equations are plotted in Fig. 139b. The coupling constant κ and the damping constant α op are a function of the geometry of the waveguides and the modes of the light propagating in them.

When light is to be distributed from one waveguide into two, they need to be designed in parallel for the distance π/4κ, as seen from Fig. 139b and (329) and (330). Figure 140 shows such a design and as an alternative a so-called Y-coupler. The Y-coupler is easier to design, because nothing needs to be known about the wave function and modes in the waveguide, but the angle at the junction may not be larger than about 0.5° to avoid too much loss of the light where the required angle of incidence at the change in refraction index cannot be matched.

Fig. 140
figure 14_14

Distribution of light from one into two waveguides by (a) tunneling into a neighboring waveguide and (b) a Y-coupler

In some materials, the refractive index nop is a function of the electric field Eel applied to them. This is called the electro-optical effect. By employing this effect, it is possible to modulate the amplitude of light and to switch it on and off. In general, the refractive index can be described by:

$$\rm {n_{\rm{op}}} = {n_2} - \frac{1}{2}\;{r_{\rm{P}}}\,\,n_2^3\,\,{E_{\rm{el}}} - \frac{1}{2}{r_{\rm{K}}}\,\,n_2^3\,\,E_{\rm{el}}^2. $$
(331)

In the above equation, n2 is the refractive index without any electric field and rP and rK are material constants: the Pockels constant and the Kerr constant, respectively. Two kinds of the electro-optical effect are distinguished: the Pockels effect and the Kerr effect. The Pockels effect is linear with the electrical field. It is represented by the term in (331), which is linear with the electric field. The other electro-optical effect is the Kerr effect. With the Kerr effect, the refractive index of a material is changed with the square of the electric field [the last term in (331)]. The part of an electro-optical material in which the Pockels or the Kerr effect is employed is called a Pockels cell or a Kerr cell, respectively.

Tables 17 and 18 show some materials with comparatively large Pockels and Kerr constants, respectively.

Table 17 Pockels constants
Table 18 Kerr constants

As described above, the refractive index is the ratio of the velocity of light in a certain material and in vacuum. Therefore, decreasing the refractive index results in a smaller delay of the propagating light.

This is employed in a device called Mach-Zehnder interferometer (cf. Fig. 141). Light coming from the left is distributed into two waveguides and changed in one of them. After reunification of the waveguides, the two waves interfere resulting in a changed amplitude.

Fig. 141
figure 15_14

Mach-Zehnder interferometer with a Pockels cell (region between the electrodes) in one of its arms

If a Pockels cell is installed in one of the arms of a Mach-Zehnder interferometer, the delay of the wave in that arm can be manipulated by an electric signal and, this way, the amplitude of the outgoing light is controlled. The phase shift Δϕ in a Pockels cell with length LPZ of a wave which shows a wavelength of λ 0 in vacuum is according to (331):

$$\rm \Delta \varphi = \frac{{\pi\,\, {L_{\rm{PZ}}}}}{{{\lambda_0}}}{r_{\rm{P}}}\,\,n_2^3\,\,{E_{\rm{el}}}. $$
(332)

The light is switched off when the phase shift of the wave is Δϕ = π. Therefore, the above equation yields for switching off the light after a Mach-Zehnder interferometer with a Pockels cell:

$$\rm {E_{\rm{el}}} = \frac{{{\lambda_0}}}{{{L_{\rm{PZ}}}\,\,{r_{\rm{P}}}\,\,n_2^3}} \Rightarrow U = \frac{{{d_{\rm{C}}}\,\,{\lambda_0}}}{{{L_{\rm{PZ}}}\,\,{r_{\rm{P}}}\,\,n_2^3}}. $$
(333)

In the above equation, U and dC are the voltage applied to the electrodes and their distance, respectively. Light switching with a Pockels cell is very quick. Thus, light pulses, only a few nanoseconds in length, can be generated and sent as data bits into an optical fiber.

It is also possible to modulate the intensity of the light by changing the phase shift in a Pockels cell. If Ie and Ia denote the intensity of the light at the input and output of the cell, respectively, the modulation is described by:

$$\rm {I_{\rm{a}}} = {I_{\rm{e}}}\,\,{\cos^2}\left( {\frac{{\pi\,\, {L_{\rm{PZ}}}}}{{{\lambda_0}\,\,{b_{\rm{PZ}}}}}{r_{\rm{P}}}\,\,n_2^3\,\,U} \right). $$
(334)

The above equation is shown in Fig. 142. When light is to be modulated with a Pockels cell, the nearly linear part of the cos2-function around π/4 is used.

Fig. 142
figure 16_14

Linear part of the cos2-function marked by a dashed ellipse

There are also devices employing the effect that some materials turn the polarization angle of light (cf. Fig. 143). Before entering into the device, the light passes a polarization filter. At the end of the device, a second polarization filter is arranged adjusted for light polarized perpendicular to the first filter. Thus, only light with a polarization direction turned by 90° in the cell can pass it. This is also a way to generate data bits which are sent through an optical fiber.

Fig. 143
figure 17_14

Device employing turning of polarization direction of light for sending data bits into an optical fiber

Exercises

Problem 27

The resolution of an optical instrument is defined as the distance between neighboring points of the image which can be distinguished from each other. According to Lord Rayleigh, two points can be distinguished from each other, if the intensity maximum of one point lies in the minimum of the neighboring one.

  1. (a)

    Find a way to calculate the resolution of an ideal lens on the basis of (314) (page 177). Derive an equation for the calculation of the resolution limit.

  2. (b)

    It does not help to make the distance of the pixels of a CCD chip narrower than the resolution of the lens. The lens of the digital camera of a mobile phone has a diameter of 2 mm and a focus length of 28 mm. Calculate for the wavelength of visible light with the maximum intensity in the sun light (550 nm) the minimum distance of pixels which makes sense.

  3. (c)

    For comparison, calculate the resolution of the lens of a 35 mm camera at the same wavelength. Assume that the lens of that camera has a diameter of 30 mm and a focal length of 50 mm.