1 Introduction

End-point detector (EPD) is thin film measurement equipment developed to control the amount of etching thickness and the remaining time to stop the etching process, which is very critical in semiconductor industry to assure the quality and performance of the production line of wafers. Traditional EPD system widely used in plasma etcher called as dry etcher, measures the film thickness of the wafer sitting still in plasma gas without any motion or any chemicals on the wafer. However, in the spin etcher shown in Fig. 1 the wafer is etched by chemicals while it rotates at a high speed, so that the productivity is very good because of its etching speed and low cost (Quirk and Serda 2001). Especially in case of 12-inch wafer production line, the etching speed increases even more comparing to 8-inch wafer and the etching area almost doubles, so that the monitoring technique like EPD is vital to quality control and assurance. In addition, the uniformity of thin films is very important because the etching speed changes much in the radial direction of the 12-inch wafer (Kim 1999; Maynard and Hershkowitz 1995).

Fig. 1
figure 1

Configuration of 12-inch wafer spin etcher

End-point detector system for the spin etcher requires special capabilities very different from that for the plasma etcher in its functions, one of which is the real-time measurement to enable to monitor the remaining thickness of the thin film during etching process. The other one, which is more critical to measure the thin film thickness in the spin etcher, is the accuracy of measurement because the interference signal from the thin film would deteriorate due to the rotational motion of the wafer and fluid flow generated by the chemicals. Therefore it is necessary to know how the disturbances (vibration and chemical flow) affect the interference characteristics and the measurement accuracy of the film thickness.

In this paper a single layer model for EPD considering the vibration and chemical flow is proposed and the interference pattern of reflected lights from the model is described. A prototype interferometer with software program to measure the thin film thickness using mathematical model were developed and applied to thin films including silicon-oxide and poly-silicon to investigate its effectiveness and performance.

2 Theory of white light interferometry

Electric field of a light propagating in the direction of propagation vector k at a position of vector r with a phase angle ϕ and amplitude E0 is expressed by E(t, r) = E0·exp(it+k·r+ϕ)) and can be simplified in one-dimensional direction of x and in a medium of the refractive index n and the extinction coefficient κ as following (Xu et al. 1995)

$$E(t,x) = E_0 \cdot {\text {e}}^{- \kappa kx} \cdot {\text {e}}^{i(\omega t + nkx + \phi)}, $$
(1)

where k is the wave number of the light given by 2π/λ and λ is the wavelength of the light in the air. If the light is incident from the air of refractive index n to a transparent thin film deposited on the silicon substrate with the refractive index n1 and the thickness h as shown in Fig. 2, two reflected lights, U1 and U2, are produced from the interfaces of the layered substrate. From the path difference of U1 and U2 in Fig. 2, the phase difference Δφ between U1 and U2 is calculated easily from the geometric condition if the difference of the path length is Δx, such as

$$\Delta \phi = k \cdot \Delta x = k \cdot \left({\frac{{2n_1 h}}{{\cos \theta _1}} - 2n_0 h \cdot \sin \theta _0 \tan \theta _1}\right)$$
(2)

In Eq. 2 the first term of the parenthesis represents the path difference inside the film and the second in the air. Even though additional phase changes of U1 and U2 take place during the reflection and transmission at the interfaces, it is neglected in this paper because its contribution is very small for the materials considered in the paper comparing to the path difference. The superposition of both U1 and U2 yields a constructive and destructive interference according to the phase difference, which leads to the variation of the intensity of the total reflected light measured in the optical probe. If the reflected lights, U1 and U2, are expressed in the form of Eq. 1 without the oscillating term ω t, they becomes

$$U_1 = A_1 \cdot {\text{e}}^{i(nkx + \phi _1)} \quad {\text {and}}\; U_2 = A_2 \cdot {\text{e}}^{i(nkx + \phi _2)}, $$
(3)

where A1 and A2 are the amplitudes of the reflected lights U1 and U2, and φ1 and φ2 are the phase angle of lights, respectively. Then the intensity I of the total reflected light is given by (Wang et al. 1995)

$$\begin{aligned} \mbox {\boldmath$I$} &= \frac{1}{{2Z}}(U_1 + U_2) \cdot \overline {(U_1 + U_2)} = \mbox {\boldmath$I$}_1 + \mbox {\boldmath$I$}_2 + \frac{1}{{2Z}}(A_1 A_2 {\text{e}}^{i\Delta \phi} + A_1 A_2 {\text{e}}^{- i\Delta \phi})\\ & = \mbox {\boldmath$I$}_1 + \mbox {\boldmath$I$}_2 + \frac{{A_1 A_2}}{{2Z}}\cos (\Delta \phi), {\text{where }}\Delta \phi = \phi _1 - \phi _2 \\ & = \mbox {\boldmath$I$}_1 + \mbox {\boldmath$I$}_2 + \frac{{A_1 A_2}}{{2Z}}\cos [(2kh)(n_1 \sec \theta _1 - n_0 \sin \theta _0 \tan \theta _1)] \\ \end {aligned}$$
(4)

In Eq. 4 \(\mbox {\boldmath$I$}_1 \) and \(\mbox {\boldmath$I$}_2 \) are the intensities of U1 and U2 respectively and it gives the expression for the relationship between the thickness h of thin film and the intensity of total reflected light I over the range of wavelength (wave number) when using the white light. Interference pattern of SiO2 simulated in the computer by Eq. 4 for the normal incidence is shown for example in Fig. 3.

Fig. 2
figure 2

Interference of reflected lights from a thin film

Fig. 3
figure 3

Interference pattern of SiO2 film(500 nm)

However, in case that the wafer in the Fig. 2 rotates and vibrates, the light path and the incident angle θ 0 including refraction angles fluctuate with time due to the disturbance. Moreover the etching chemical flowing on the wafer with high speed may also affect the magnitude of reflected lights, A1 and A2, because of the attenuation of light energy in the chemical. This means that Eq. 4 becomes corrupted and noisy due to the vibration and the chemical flow. In order to include the effects in Eq. 4, two random variables are introduced, which are the thickness d of the chemical and the incident angle ɛ θ due to the vibration. Assuming that A1 and A2 are reduced only by the attenuation in the chemical of thickness d, the attenuated amplitudes of A1 and A2, \(\hat A_1 \) and \(\hat A_2, \) can be written using the extinction coefficient κ c of the chemical by,

$$\begin{aligned} \hat A_1 &= A_1 {\text e}^{- \kappa _c kd} \approx A_1 \cdot (1 - \kappa _c kd), \\ \hat A_2 & = A_2 {\text e}^{- \kappa _c kd} \approx A_2 \cdot (1 - \kappa _c kd)\;{\text{for\;small}}\;\kappa _c d \\ \end{aligned} $$
(5)

It is also assumed that the incident angle θ 0 is a random variable with a mean value \({\overline \theta} _0 \) and a random noise ɛ θ, that is, \(\theta _0 = \overline {\theta _0} + \varepsilon _\theta .\) Accordingly the refraction angle is expressed as the random noise ɛ θ from the Snell’s law, which is \(\theta _1 = n_0/n_1(\overline {\theta _0} + \varepsilon _\theta)\) for a small ɛ θ. Then the total intensity I in Eq. 4 can be expressed for small d and normal incidence \((\overline {\theta _0}) = 0\) as

$$\begin{aligned} \mbox {\boldmath$I$}(k) &= (1 - k \cdot \kappa _c d)^2 \left\{ \mbox {\boldmath$I$}_1 + \mbox {\boldmath$I$}_2 + \frac{{A_1 A_2}}{{2Z}}\cos [(2kh)(n_1 \sec \theta _1 - n_0 \sin \theta _0 \tan \theta _1)] \right\} \\ & \approx (1 - 2k \cdot \kappa _c d)\left\{ \mbox {\boldmath$I$}_1 + \mbox {\boldmath$I$}_2 + \frac{{A_1 A_2}}{{2Z}}\cos [2khn_1 (1 - \varepsilon _\theta ^2)] \right\}\\ \end{aligned} $$
(6)

Replacing the wave number k by ν for notational convenience, Eq. 6 becomes

$$\mbox {\boldmath$I$} (\nu) \approx (1 - 2\nu \cdot \kappa _c d) \left\{ \mbox {\boldmath$I$}_1 + \mbox {\boldmath$I$}_2 + \frac{{A_1 A_2}}{{2Z}}\cos [2\nu hn_1 (1 - \varepsilon _\theta ^2)] \right\} $$
(7)

In addition to the disturbances from the vibration and fluid flow, a random noise ηR due to the electrical noise and white noise from the environment corrupts the signal I as well, so that Eq. 7 is rewritten by

$$\begin{aligned} \mbox {\boldmath$I$}(\nu) &\approx (1 - 2\nu \cdot \kappa _c d)\left\{\mbox {\boldmath$I$}_1 + \mbox {\boldmath$I$}_2 + \frac{{A_1 A_2}}{{2Z}}\cos [2\nu hn_1 (1 - \varepsilon _\theta ^2)]\right\}+\eta _{\text{R}} \\ & \approx (\mbox {\boldmath$I$}_1 + \mbox {\boldmath$I$}_2) + \frac{{A_1 A_2}}{{2Z}}\cos (2\nu hn_1) - (2\nu \cdot \kappa _c d)\left\{\mbox {\boldmath$I$}_1 + \mbox {\boldmath$I$}_2 + \frac{{A_1 A_2}}{{2Z}}\cos (2\nu hn_1) \right\} \\ &\quad + \frac{{A_1 A_2}}{{2Z}}\sin (2\nu hn_1) \cdot \varepsilon _\theta ^2 + {\text{}}\eta _{\text{R}}\\ \end{aligned} $$
(8)

This equation indicates that the interference signal I in Eq. 4 is disturbed by the attenuation effect from the chemical thickness d, the vibration ɛ θ, and the random white noise ηR. The first and second terms are the interference signal when there are no vibration and no chemical flow. The third term in Eq. 8 describes the interference signal induced by the chemical flow and the fourth one due to the vibration. It is clearly seen that the chemical effect can be much more severe than the vibration effect because it is directly coupled with the total mean intensity I1+I2.

The intensity spectrum represented by Eq. 8 can be very different from the ideal interference pattern shown in Fig. 3 if κ c d of the chemical and vibration effect cannot be negligible. In Fig. 4, a distorted interference spectrum computed in the computer for an extreme case of \(\kappa _c \cdot (d/h)=0.1 \) and ɛ θ=0 is shown to describe the effect of chemical film on the spectrum signal, where the more distortion is made for the light of a shorter wavelength. The vibration effect on the spectrum of Eq. 8 is expected to act like a random noise such as ηR. This effect is represented in Fig. 5 when \(\kappa _c \cdot (d/h)=0\) and ɛ θ=0.1 (random noise between 0 and 0.1), where the vibration appears to be an additive noise and easier to handle than the attenuation effect of the chemical because the chemical effect is linearly dependent on the wavelength in Eq. 8.

Fig. 4
figure 4

Distortion of interference pattern from SiO2 film due to a thick chemical flow

Fig. 5
figure 5

Interference pattern due to vibration

3 Thickness measurement of thin film

Thickness of thin film can be calculated basically using Eq. 8 by measuring the spectral intensity I of reflected light from the thin film on the wafer although the vibration level ɛ θ and the chemical effect κ c d are unkown in general. If the extinction effect of chemical film and vibration are not small, the interference signal can be corrupted very much by the disturbances and may lead to an large error. However, in case that the attenuation of the light inside the film and the chemical is small and the vibration level is not high, the fourth and fifth terms of the interference pattern in Eq. 8 becomes small comparing with other terms. Fortunately the films and the chemical flow considered in this paper are so thin, hundreds of nanometers, that the extinction effect is very small. On these assumptions, the effects of the chemical and vibration can be treated as a noise with non-zero mean and a specific variance as can be seen from Eq. 8. If these noise-like effects are removed somehow by means of filtering technique and curve fitting, the film thickness h can be easily calculated from the filtered intensity signal \(\mbox {\boldmath${\hat I}$}(\nu)\) in Eq. 8 by the following relations (Wang et al. 1995).

  1. 1.

    In case that the film thickness h is very thick comparing to the wavelength λ, i.e., \(h/\lambda >>1\) or \(h\nu >>1.\) Multiple minimum and maximum peaks are obtained and the thickness of the film is calculated using any adjacent two ultimate peaks at υ m and υm+1 by

    $$h = \frac{{\pi m}}{{2n_1 \nu _m}},\;{\text{where\;the\;positive\;integer\;}}m = \frac{{\nu _{m + 1}}}{{\nu _{m + 1} - \nu _m}}$$
    (9)
  2. 2.

    In case that the film thickness h is as thick as the wavelength λ, i.e., h/λ ~ 1 or h ν ~ 1. Interference occurs one time. Either of minimum peak or maximum peak appears in the spectrum from which the thickness of the film is calculated by

    $$h = \frac{\pi}{{n_1 \upsilon _{\min}}}{\text{ for the min}}.{\text{ peak},} h = \frac{\pi}{{n_1 \nu _{\max}}}{\text{for the max}}.{\text{peak}}$$
    (10)
  3. 3.

    In case that the film thickness h is smaller than the wavelength λ, i.e., \(h/\lambda <<1\) or \(h\nu <<1.\) Interference peak cannot be seen from the spectrum because the argument of the cosine function becomes very small. The thickness of the film is obtained by

    $$h = \frac{1}{{4\pi n_1}}\sqrt {f(\nu)},\;{\text{where}}\;f(\upsilon ) = {{\frac{{\partial ^3 I(\nu)}}{{\partial \nu ^3}}} \mathord{\left/ {\vphantom {{\frac{{\partial ^3 I(\nu)}}{{\partial \nu ^3}}} {\frac{{\partial I(\nu)}}{{\partial \nu}}}}} \right. \kern-\nulldelimiterspace}{\frac{{\partial I(\nu)}}{{\partial \nu}}}}$$
    (11)

4 Experimental setup and signal processing

4.1 Configuration of interferometry system

Measurement system developed for the experiment is composed of three parts, which are a light source (Tungsten-Halogen lamp) generating a wide-band white light from UV to visible, a spectrometer (Ocean Optics S2000-UV-VIS, USA), and an optical probe detecting the reflected light from the wafer. The light signal is detected by an optical lens system placed 200 mm high above the wafer with a focusing lens for increasing the distance between the probe and the wafer without losing the reflected light power. The reason is that the probe may be damaged by the fumes produced by the chemicals and there should be an enough space to transport and handle the wafer. The optical signal measured and transferred through the optical fiber of 400 μm diameter to the spectrometer is A/D converted into 2,048 data with 2 MHz sampling rate in 20 ms as shown in Fig. 6. Thin films on 12-inch wafer specimens prepared for the thickness measurement include silicon oxide (100 nm/500 nm/600 nm in thickness) and poly-silicon (200 nm and 300 nm) fabricated and inspected very precisely in the factory. A simple proto-type of spin etcher shown in Fig. 7 is constructed to rotate the wafers and spray de-ionized (DI) water instead of the hazardous chemicals for the safety reason and connected with the measurement system (EPD) developed for this study. The use of DI water instead of the chemical does not make big difference in the experimental results since chemical solutions used for etching process in semiconductor manufacturing line contain only few percentages of toxic chemicals like HF and NNO3 in DI water. Rotating speed of the proto-type spin etcher is designed to control manually using DC-servo motor from 60 rpm to 3,600 rpm and the flow rate of DI water circulating in the closed loop by pump is also regulated.

Fig. 6
figure 6

Configuration of white light interferometry system

Fig. 7
figure 7

Photograph of experimental setup (a) interferometry system (b) test wafer

4.2 Noise filtering using adaptive algorithm

As indicated in the Sect. 3, the interference signal \(\mbox {\boldmath$I$}(\nu )\) contains not only the thickness information but also the disturbance signals from vibration and chemicals plus other noise sources like a statistical and electrical noise induced by light photons and measurement instruments in experiment. In order to determine the film thickness using Eqs. 9, 10 and 11 from this corrupted signal \(\mbox {\boldmath$I$}(\nu)\), adaptive filter is adopted in this study to remove the disturbing noises of different characteristics. Let S(ν) be the degraded noisy signal of the spectrum signal \(\mbox {\boldmath$I$}(\nu)\) by the additive noise n(ν) which includes all of the noises considered above, i.e.,

$$S(\nu)=\mbox {\boldmath$I$}(\nu)+ n(\nu),$$
(12)

where n(ν) is assumed as a random signal with non-zero mean and variance σ 2. Then the optimum estimator \(\mbox {\boldmath${\hat I}$}(\nu )\) of the signal \(\mbox {\boldmath$I$}(\nu )\) is computed from the adaptive algorithm by (Lee 1980).

$$\mbox {\boldmath${\hat I}$}(\nu) ={\overline S}(\nu)+ g(\nu)[S(\nu)-{\overline S}(\nu)], {\text {where the gain }}g(\nu) = \frac{{Q(\nu)}}{{Q(\nu) + \sigma ^2}},$$
(13)

where \({\overline S}(\nu)\) and Q(ν) are a-priori known mean and variance of S(ν) at a point ν. This adaptive filter acts like a high pass filter in a region where the variance of the signal S(ν) is large, i.e., the signal will be sharpened. Similarly, it behaves like a low-pass filter in a region where the variance of a signal is small; in this case the signal will be smoothed. Window size (frequency band or wavelength band) in this filter will greatly affect the quality of processed signal due to the local statistics in the window. If the window size is too small, the noise filtering algorithm is not effective. If the window is too large, subtle details of the signal will be lost in the filtering process. Therefore, it is necessary to optimize the effective band width for the signal from the preliminary experiments. The bandwidth of wavelength used for filtering in experiment is 20 nm for best performance.

4.3 Calculation of film thickness by curve fitting

Curve fitting is very important not only to reduce the random noise but also to determine with high accuracy the wavelength at which the maximum or minimum interference occurs in the spectrum. Even though the noise is reduced by the adaptive filtering, the wavelength of extreme peaks is difficult to find from the filtered spectrum because the spectrum near the peak points is very sensitive to the small noise which may cause an error in the film thickness. Thus it is necessary to find a smooth mathematical function that represents the measured interference pattern \(\mbox {\boldmath$I$}(\nu)\) as precisely as possible from the corrupted noisy signal due to the vibration and the chemical flow. Polynomial function is used for this curve fitting because the derivation and root-finding for the maximum and minimum can be performed very easily and fast. Several orders of the polynomial are tested and found to be almost same in performance if it is higher than third order. Extreme points of the polynomial function, which is expressed as ν m in Eq. 9, are calculated by finding the roots of the first derivative of the function using Newton-Rapson method. In order to do these successive procedures including the adaptive filtering and manipulation of the spectrum signal in easy manner, a Window based signal processing program written in C++ program is developed. The program controls the spectrometer, select the spectrum range of interest by the menu button on the screen, and perform the curve fitting and root finding automatically. One of the measurement results obtained by the program is shown in Fig. 8, where a noisy spectrum signal of oxide film is represented well by a fourth order polynomial function.

Fig. 8
figure 8

Curve fitting of a noisy interference spectrum for oxide film

5 Experimental results and discussion

Silicon-oxide and poly-silicon films widely used in the semiconductor manufacturing processes are tested in experiment to investigate the performance and feasibility of the white light interferometry system and the signal processing program developed in this study. Thin films are deposited on the 12-inch wafer in the factory using CVD and inspected by ellipsometry. A bare wafer without any films is also prepared to obtain a reference signal from the silicon substrate. The bare wafer is mounted first on the spinner of the spin etcher as shown in Fig. 7 without spinning to measure the reflection light signal from the silicon substrate using upright reflection probe. The signal is analyzed by the spectrometer and the spectrum of the signal is stored in the computer as a reference spectrum. This reference spectrum is used later to remove the background spectrum from the room environment and the characteristic spectrum of the light source. By doing this procedure, the noise and characteristic peaks from fluorescent lights and bulbs, or even the light source in the laboratory room can be canceled out from the interference signal. Then the wafer with thin film is placed and spin on the spinner at 1,200 rpm, at which speed the vibration level is detected about 50 μm in peak-to-peak around the outer rim of the wafer. DI water is supplied on the center of the rotating wafer through a small hose above the spinner at the flowing rate of 0.3 l/min as shown in Fig. 7. The water is then spread out into a very thin film with the relatively uniform thickness. Spectrums of the reflected light from the oxide and poly-silicon films in this operating condition are measured and one example for these spectrums of 500 nm silicon-oxide film is presented in Fig. 9. In this figure the lower spectrum is the raw spectrum (non scale) acquired from the film and the upper one is the interference spectrum obtained after subtracting the raw spectrum by the reference spectrum. In Fig. 9, it can be seen that a clear sinusoidal interference pattern is observed over the spectrum mainly in the ultraviolet region. This part of the interference spectrum is selected and zoomed using the cursor in the screen or automatically by the signal processing program. Figure 10 represents this example, in which the spectrum from 600 nm oxide film is enlarged and curve fitted using a fourth-order polynomial function. Another interference spectrum produced by the program after the curve fitting is described in Fig. 11 for 500 nm oxide film. This curve is used as the intensity function \(\mbox {\boldmath$I$}(\nu)\) of Eq. 9 through Eq. 11 to calculate the thickness of the films. Total time for the acquisition of the spectrum and signal processing for thickness measurement at a point on the wafer takes about 400 ms, which comes from mainly integration time to collect the enough intensity of the reflected light. Therefore the processing time can be reduced remarkably if more powerful light source is used.

Fig. 9
figure 9

Spectrum from a bare wafer and 500 nm oxide film

Fig. 10
figure 10

Polynomial curve fitting of the interference spectrum from 600 nm oxide film

Fig. 11
figure 11

Curve fitted interference spectrum for 500 nm oxide film

Table 1 summarizes the experimental results for the thin films, where a very good accuracy can be seen and the maximum error of 1.8% is achieved in the presence of vibration at 1,200 rpm and the water flow on the wafer. For both of silicon oxide and poly-silicon films, more accurate result is obtained when the thickness of the film increases. This reason is that the phase difference of the reflected light in the film becomes large if the film is thick comparing with the wavelength, so that SNR increases. Errors observed in case of the oxide films thicker than 500 nm is less than 0.5% at all measuring points of the wafer. Adaptive filtering and the curve-fitting algorithm presented in the paper are proved to work well with the interferometry sysem from the experimental results. Although the vibration itself produces more noise as the spinning speed increases, the noise is removed efficiently by the adaptive filter and the curve fitting. However, the programs do not work well at a low rotating speed below 500 rpm because the attenuation effect increases very much especially in UV band as the water film becomes thick at the low speed.

Table 1 Experimental results for thickness measurement

6 Conclusion

Mathematical model for a single layer wafer subject to vibration and chemical flow in spin etchers is proposed and the effects of vibration and chemical fluid film on the light interferometry are described. From the model, an explicit mathematical expression for the interference pattern corrupted by vibration and chemical film are derived. Vibration affects the interference spectrum in such way as it behaves like a random noise corrupting the interference signal. The vibration-induced noise increases over the entire spectrum range as the vibration level increases with rotating speed. Chemical flow makes little effects on the spectrum data if the rotating speed is high over 500 rpm, where the film get very thin. But when the film thickness increases as the rotating speed reduces, the chemical (DI water) distorts the interference spectrum pattern severely.

A prototype real-time thickness measurement system and signal processing program for spin etcher have been also developed to measure the thin film thickness of 12-inch wafers based on the mathematical model using the wide-band white light interferometry. Adaptive filtering and curve fitting technique are applied to determine the thickness of the silicon oxide and poly-silicon films on the wafer from the interference pattern over the wide range of wavelength. From the experimental results, the wide band white light from UV to visible range is proved to be very effective to reduce the measurement error due to the vibration and the chemical flow (DI water flow) in the spin etcher. Accuracy of the measurement achieved in experiment is approximately within 1.8% of film thickness at the rotating speed of 1,200 rpm and at the flow rate of 0.3 l/min. The accuracy of measurement decreases relatively as the film thickness decreases. The reason is partly because the SNR becomes low according to the small change of phase made in the very thin film and partly because of the error produced by curve fitting and differentiation.