Abstract
The method of disentangling of stellar spectra and its applications is briefly reviewed and its complementarity with the interpretation of other observational methods like photometry and interferometry is discussed. Special attention is given to the comparison with Doppler tomography of interacting binaries and to the Bayesian estimate of errors.
Access provided by Autonomous University of Puebla. Download chapter PDF
Similar content being viewed by others
Keywords
- Doppler Tomography
- Local Thermodynamic Equilibrium
- Component Spectrum
- Stellar Atmosphere
- Synthetic Spectrum
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
7.1 Introduction
Observations and the corresponding theory of binary and multiple stars provide us with a clue to the physics of stars and hence also to the universe beyond the Solar system:
-
From observations of a visual binary, we can determine its orbital period P, inclination i, eccentricity e and periastron longitude ω, the angular distance α between the components (and hence the ratio of the semi-major axis a to the distance d of the system) and light ratio L 1∕L 2 of the components.
-
From observations of an eclipsing binary , we can also assess P, i, with some limitations also e and ω, and in addition to L 1∕L 2 also the ratios of the components’ radii R 1, 2 to a.
-
From observations of Doppler shifts of lines in spectra of spectroscopic binaries we can also obtain P, e and ω. Moreover, we can get the semi-amplitudes of radial-velocity curves K 1, 2, i.e. for a reasonable estimate of i we know the orbital velocities v 1, 2 and we can calculate the absolute size of a and also the masses M 1, 2 of the component stars.
For spectroscopic binaries which are simultaneously eclipsing, we can thus determine the basic physical parameters of the stars – M 1, 2 and R 1, 2, and from colour photometry or spectroscopy also the temperatures T eff1,2 and hence the luminosities L 1, 2 of the component stars and the photometric distance d of the system. From these data, we can calculate the angular separation α, including for stars that we cannot resolve by direct imaging (e.g., α ≤ 0. 2 μ as for the first binary measured in M31 [15]).
However, the goal of indirect imaging which is usually dependent on some theoretical model of the object (e.g., in this case the model of components radiation and the interstellar absorption) is not to compete with observational techniques of direct imaging, not to say to replace them. To the contrary, the improvement of such techniques is even more desirable in connection with indirect imaging because they may provide a complementary information enabling to verify the theory and to improve the underlying assumptions. For instance, until recently, there were practically no binaries observed as both visual and spectroscopic because the former technique prefers wide orbits while the former close binaries with higher amplitudes of radial velocities . Interferometry, however, enables higher angular resolution, while improved spectroscopic instrumentation and data processing provides better spectral resolution, so that the overlap of these two sets of techniques increases. From a combination of interferometric and spectroscopic observations of (even non-eclipsing) binaries we can determine α, i, v 1, 2, a and hence also the geometric distance d (e.g., [23], which solved the Pleiades distance controversy caused by an error in Hipparcos results).
The observations of binaries can thus provide us with more information than those of single stars, but their interpretation is at the same time more difficult. In particular, the spectra (as well as the colours) of the component stars are blended and can only be distinguished by taking into account their variations with the orbital motion. While for single stars it is usually sufficient to fit their observed spectra with synthetic spectra parameterised with T eff, log g (the surface gravity) and abundances of elements X i , the physics of binaries is more complicated due to their interaction and proximity effects, such as tidal forces or reflection effects. More sophisticated theoretical tools and data-processing methods are thus needed to interpret the observational data. Such methods actually reveal structure within the binary systems and belong thus to the methods of indirect imaging.
7.2 Disentangling of Spectra of Multiple Stars
The classical treatment of observed spectra of spectroscopic binaries consists in measuring radial velocities (RVs), v j | j = 1 n, of the n component stars (Fig. 7.1). There are various techniques to measure RVs, but they all require some model of the component spectra, at least a simple assumption that the centres of some observed spectral lines correspond to the v j of a particular component j. The RV curves, i.e. the set of v j (t) at different times t, are then fitted to find the orbital parameters p of the system. A simultaneous solution of other observational data such as photometry or interferometry is advantageous because these data may better constrain some of the parameters and the set of solved p may be enriched also by some more physical parameters (M, R, T eff etc.). To relate the stellar parameters found with the spectra of individual stars, it is desirable to separate the spectrum F j of each component j from the observed spectra F of the whole system of n components. This is possible thanks to varying conditions in the blending of components in different exposures with different Doppler shifts (cf. e.g., [2]) or with different light ratios, e.g., during an eclipse (cf. [20]). The information on both the F j | j = 1 n and p is contained in the observed spectra, but it is entangled there because to retrieve one of them we need to know the other.
The basic idea of the method of disentangling spectra (cf. [5, 20]) is to solve simultaneously (or, actually, iteratively) the component spectra F j | j = 1 n and directly the orbital parameters p (instead of the intermediate step of RVs) by fitting all the observed spectra with a model of the form
where x ≡ c lnλ∕λ 0 is the logarithmic wavelength scale and the broadening function is given by a simple Doppler-shifted Dirac δ-function
This means that the separated component spectra F j appropriate to the observed object are also used for the treatment of RVs instead of a theoretical model or another template needed in the classical approach.
7.2.1 Fourier Disentangling
The model given by Eq. (7.1) is linear in F j and non-linear in p. A χ 2-fit of the observed spectra with respect to F j (x) thus yields linear condition equations of the (generally large) dimension given by the number of bins of the spectra times the number of component stars. Its solution can be further simplified using the Fourier transformation \(F_{j}(x) \rightarrow \tilde{F } _{j}(y)\) which converts the convolution in Eq. (7.1) into a product. The condition equation then reads
where σ l is the noise of exposure F(x, t l ) obtained at time t l and \(\tilde{F } (y,t_{l})\) is the y-mode of Fourier transform of F(x, t l ). Variation of the right-hand side with respect to \(\delta \tilde{F }_{k}^{{\ast}}(y)\) obviously yields a set of n linear equations for \(\tilde{F } _{j}(y)\), i.e. separated subsets of equations for individual Fourier modes
This makes the separation of spectra, i.e. the solution of the linear part of the disentangling, numerically easier and more efficient, compared to the solution in the wavelength domain. The optimisation with respect to δ p can be then performed by numerical minimisation of the right-hand side of Eq. (7.3), e.g., using the simplex method.
It is obvious that a necessary (but not sufficient) condition to get a non-singular matrix on the left-hand side of Eq. (7.4) is N ≥ n and the vectors \(\tilde{\varDelta }_{k}\) should differ for at least n different t l , i.e. the observations should sufficiently cover orbital phases with different \(\tilde{\varDelta }_{k}\). Regarding the intention to minimise the right-hand side of Eq. (7.3) with respect to p, it should be positive, i.e. Eq. (7.4) should be overdetermined and hence N > n. A significant over-determinacy is desirable because it can reduce the random noise in the disentangled F j (x) originating from the observational noise of F(x, t l ).
For a pure orbital Doppler shift, the broadening Δ j (x, t; p) is given by Eq. (7.2) and its Fourier transform can be calculated analytically
with s j (t) = 1. It can be seen from here that \(\tilde{\varDelta }_{j}(y = 0,t;p) = 1\) and, hence, the matrix on the left-hand side of Eq. (7.4) is singular for the zeroth Fourier mode y = 0. This corresponds to the obvious fact (which is also true for the separation of the spectra in the wavelength domain) that the continuum level is insensitive to the orbital motion and consequently it cannot be disentangled from spectroscopy alone. The disentangled component spectra F j (x) are thus defined up to additive constants, the sum of which (over j) should give the mean value of all observed F(x, t l ) in a particular spectral region. To get a completely separated spectra of each component like a spectrum of a single star, it is necessary to find proportions in which this sum has to be distributed between the components. This information on light ratios of the components can be in principle obtained, e.g., from interferometry (this is an example of complementarity of the direct and indirect imaging). In the case of eclipsing binaries the light ratio is usually obtained from photometry . The light ratio is wavelength dependent and is usually measured in broad wavelength regions. A normalisation of the disentangled spectra from the common continuum of the whole system to the proper continuum of each component should thus take into account the levels of continua influenced by the presence of spectral lines which are different for each component in each spectral region. Another possibility of interpretation of the disentangled spectra is to fit them by model spectra scaled in the intensity with a general unknown linear transformation.
The change of light ratio in the course of an eclipse and possibly also in other orbital phases of tidally distorted or mutually illuminated components of binaries also varies the strengths of lines of all components visible in the spectra. To get a better fit of the observed spectra in this case, line-strength factors s j (t) are introduced in Eq. (7.5) – cf. [6]. A set of linear equations for the values s j (t l ) | j = 1 n | l = 1 N can be obtained from Eq. (7.3) and solved iteratively with the separation of F j (x) – the model is bilinear in these two sets of unknown variables. The solution of s j yields photometric information from the spectra about the luminosity of one component relative to the other one. The decrement of light due to an eclipse depends on the limb darkening which in turn depends on the source function S ν (τ ν ). In the approximation of plane-parallel stellar atmosphere
which means that the dominant linear term (k = 1) in the dependence of the surface intensity I ν ∼ (1 − u + uμ) on the directional cosine μ is proportional to the first derivative of S ν with respect to the monochromatic optical depth τ ν . In the local thermodynamic equilibrium (LTE) approximation , S ν is given by the temperature structure of the atmosphere and dS ν ∕dτ ν is proportional to the vertical gradient (in geometric depth) of the temperature and inversely proportional to the monochromatic opacity . In contrary to the frequently adopted approximation that the change of limb darkening is negligible within the narrow line-profile and the value of u found from photometry (dominated by the continuum) can be fixed across the lines, u rapidly decreases from its value at the continuum toward the centre of line (cf. Fig. 7.2). Actually, the absorption lines are present in the spectra just owing to this change. In consequence, the signal contained in spectral lines normalised with respect to the overall continuum of the system may be enhanced not only for the component in foreground but also for the partly eclipsed component when only an edge of its disc is eclipsed which little contributes to the line. The disentangling of the line-strength factors s j (t) thus enables to map the surface and actually also the depth structure of the eclipsed star. It may also reduce errors in rectification of the observed spectra (i.e. their normalisation with respect to the continuum), which, however, limits the information gain about the structure of the atmosphere. The line-strength factors also enable to disentangle the telluric spectrum [6, 7].
7.2.2 Generalised Disentangling
Taking into account non-LTE or higher order terms in Eq. (7.6), the spectral lines may change during an eclipse, not only in strength, but also in shape. In the approximation of a thin stellar atmosphere, the observed spectral flux F(x, t) is given by an integral of the surface intensity I j over the visible surface s of all components j,
where μ and v j are known functions of s. If in analogy with Eq. (7.6) the intensity can be expressed in a form of superposition of several spectral functions F j k,
then the disentangling can be generalised from Eq. (7.1) to
where the broadening functions
are now dependent not only on the orbital parameters p, but also on other geometrical and physical parameters, as determined by the decomposition in Eq. (7.8).
In particular, if we treat eclipses of a rotating star, its line profile is rotationally broadened and its shape is distorted in the course of a partial eclipse. This so-called “rotational effect” [1, 8, 12, 14, 17, 18] is usually treated as a deviation of measured RVs during the eclipse from an exact Keplerian motion of the component’s centres of mass. However, its value depends on the method of measurement and definition of RV of an asymmetric line-profile. The line-profiles are usually modelled in the form of Eq. (7.1), i.e. as a convolution of the profiles of a non-rotating star with a rotational broadening function. The rotational broadening, however, depends on the limb darkening – it has a semi-circular shape for a rigidly rotating sphere with uniform intensity and a parabolic shape for intensity linearly darkened to zero at the disc edge (see Fig. 7.3). As can be seen in Fig. 7.2, the line-profile generally varies across the disc (instead of a simple scaling) and, following Eq. (7.6), it should be expressed as a sum of contributions with different limb darkenings. A generalisation from Eqs. (7.1) to (7.9) for the two or more modes k of the limb darkening can thus fit the observed spectra better. The eclipse then enables to map the variations of line-profiles and hence the atmospheric structure if the quality of the spectroscopic data (S/N, resolution, phase coverage) is sufficient. In addition to the synthetic spectra, more detailed results from model atmospheres are needed for interpretation of such results (as well as the other moments of I ν (μ)). In any case, even if the data are not sufficient to distinguish between different modes k, it is preferable to fit the data with a proper model of shape of Δ j than to modify v j only in Eq. (7.2) because we can get a better fit of the data and more reliable parameters from the eclipse (e.g. the radii, rotational velocities or even inclinations of rotational axes, cf. [1]).
In addition to the Doppler shift according to Eq. (7.2), the broadening functions in the generalised disentangling can thus also include intrinsic variations of line profiles caused by various reasons – either geometric (e.g., eclipses, tidal distortion, spots) or physical. A frequent example of the latter case is pulsation. In the case of radial pulsations, the broadening functions for individual limb-darkening modes can be calculated explicitly (as well as their Fourier transforms , cf. [11]). This enables to disentangle directly the pulsation velocities of Cepheids and other radial pulsators and to avoid the need of the so-called projection factor which relates the effective RV measured in a particular way to the pulsation velocity. The line-strength factors s j (t l ) introduced in Eq. (7.5) as the simplest generalisation of Eq. (7.2), which are different for different lines in Cepheids depending on the influence of the temperature variations during the pulsation period on the rate of a particular transition, can reveal the temperature. In combination with photometric observations of luminosity variations or interferometric observations of changes of angular diameter, the disentangling of spectra of Cepheids may thus provide primary method of photometric or geometric determination of distance known as the Baade–Wesselink calibration of the luminosity–period relation .
7.2.3 Constrained Disentangling
As mentioned in the introduction, the aim of separating the spectra of the components is to compare them with theoretical models and to find the parameters of the component’s atmospheres (cf. the “Fit of spectra” in Fig. 7.1). This comparison is usually performed by means of a least-squares fit similarly to the separation itself. Provided we have a grid of model spectra, which we believe correspond to some of the component stars, or a code for their synthesis, we can use them as template(s) by which the disentangling is constrained to a subspace of all possible solutions. The sum of the component spectra on the right-hand side of Eq. (7.1) by which we model the observed spectra F(x, t; p) then generally splits into the part with the unknown component spectra F j (x) | j = 1 m (where m < n), which are to be disentangled and with the components constrained by templates G j | j = m+1 n that are, however, still Doppler shifted and possibly also broadened (e.g., by some unknown rotation speed). For the separation of F j we thus solve a set of linear equations
of the order of m only, while the template-constrained components G j are subtracted from the observations on the right-hand side (they are the additional input to the disentangling in Fig. 7.1). The lower number of unknown spectra makes the solution less sensitive to errors on the observed data – on the other hand, the number of unknown parameters p in broadenings at both F j and G j usually increases a bit. For instance, the systemic velocity γ is uncertain for the standard disentangling and can only be determined when a template G j is accepted. The line strengths of G j are usually given relatively to the continuum level of the component j, while in the observed spectra F they are normalised by the sum of continua. Free line-strength factors s j are thus to be solved. They reduce at the same time the uncertainty in the levels of continua discussed in Sect. 7.2.1.
As it was already mentioned and shown in Fig. 7.1, it is advantageous also in the classical approach to treat the spectroscopy together with other types of data because these may better constrain some of the parameters. The simplest way to utilise such an additional information in the disentangling is to fix the values of parameters which we know well from the other data and to converge only those, which can better be determined from the spectroscopy. However, the constraint has often the form of a hypersurface f( p) = 0 in the space of parameters p. For instance, the times of primary and secondary eclipses in binaries depend on the time of periastron passage, eccentricity and periastron longitude (slightly also on the inclination), but cannot determine any of these parameters unless information about some of them is available from elsewhere, e.g., from the RV-curve. The minimising of χ 2 in the p-space should thus be constrained to the hypersurface or to the cross-section of the hypersurfaces if more different constraints f k ( p) = 0 are given by various observations. Such a minimisation can be performed using the method of Lagrange multipliers λ k , i.e. instead of Eq. (7.3), we minimise its more complex variant
If the constraints f k = 0 are observational, they are not sharp, but due to the observational error of a probabilistic nature only. The functions f k 2( p) can thus be chosen as (O − C)2, where C = C( p) is a model of the observations, O, and the multipliers λ k = σ k −2 are given by errors on these observations. The disentangling thus turns into a simultaneous solution, i.e. χ 2-fitting of all available data. From the point of view of accuracy and conservation of information, it is preferable to fit the data in their original form than to reduce them into some intermediate values, such as RVs instead of the source spectra or separations and positions angles instead of visibilities for spectroscopic or interferometric binaries. On the other hand, such extracted data preserved in the literature are often the only information available, hence it is still worth to enrich, e.g., the disentangling of some spectra by simultaneous solution of RV-curves from other exposures.
7.2.4 Numerical Representation
The observed and solved spectra as well as their Fourier transforms were treated in the previous sections as continuous functions of x or y, respectively. In practice, however, the spectra are usually observed and registered using a chip with discrete pixels and represented in the data-processing from the rough data to the final separated component spectra as a finite set of values in discrete bins. It means that the space of RVs in which the spectra and the broadening functions are represented is quantised. Nevertheless, the position of a smooth line stretched over several bins can be determined with a sub-pixel resolution from asymmetry of the signal in surrounding bins. A sub-pixel resolution in RVs enables to determine the orbital parameters with a higher precision, but this, however, requires representing with a similar resolution the operator δ(x − v j ) in Eq. (7.2) or in its Fourier transform Eq. (7.5). For an integer multiple of the bin size Δ x , the convolution with the shifted δ-function is given by multiplication with a unit off-diagonal matrix in the x-representation and by exp(iyv j ) in the y-representation, where the v j must be rounded to the integer multiple of Δ x to keep the periodicity in the finite set of y-values. The change of signal in a sub-pixel shift for v < Δ x can be found from its Taylor expansion, in which its derivatives are approximated from the differences in the neighbouring bins (cf. [9]), i.e.
and its Fourier transform
The Fourier disentangling with this sub-pixel resolution can be performed using the author’s code KOREL, which is now available as an on-line service of the Virtual Observatory (cf. [21]). A question frequently asked by its users is what signal-to-noise ratio (S/N) of the input spectra is needed for applicability of the method of disentangling. A general simple answer is “any, but the better is the input S/N, the better is the S/N of the disentangled component spectra and the better is the accuracy of the disentangled parameters”. However, the problem is more complex. The precision of the results also depends on the degree of the overdetermination, because the noise in disentangled spectra decreases as \(\sqrt{ N}\) with the number N of the input spectra. But other important parameters are the spectral resolution, phase coverage and information content of the input spectra. The useful signal for disentangling are the spectral lines, i.e. deviations of the spectrum from the continuum, while the S/N compares the noise with the continuum level. To clarify this problem, a more detailed analysis is presented in the Appendix.
7.3 Disentangling of Spectra of Interacting Binaries
The method of spectra disentangling assumes that the observed system consists of several discrete sources with some general unknown spectra, the changes of which can be parameterised by means of the broadening functions Δ j . In the case of interacting binaries with mass exchange, there is a significant contribution of radiation from the circumstellar matter continuously moving around the component stars with a complicated velocity field. The methods of Doppler tomography are used in these cases to map the distribution of the emission sources in the velocity space (cf. [13]). The intrinsic spectrum of an infinitesimal volume of the source is mostly supposed to be a Dirac δ-function in frequency at least in the vicinity of each line used for the Doppler imaging. This simplifying assumption is obviously false. (E.g., the presence of other lines or absorption features in P-Cygni profiles violate the mirror symmetry in the opposite phase supposed for the standard Doppler tomography.) It would thus be desirable to disentangle intrinsic spectra in each point of the velocity space (or even better in each point of the geometric space). This problem, however, does not have a unique solution, because a narrow line of source spread over a wide region of the velocity space results in the same spectra as a wide line from a narrow source.
Nevertheless, it is possible to approximate the smooth distribution of the circumstellar matter in the interacting binaries by several blobs of gas and to disentangle their spectra and (as a free parameter) also their positions in the velocity space. As an example, disentangling of Cyg X-1 is shown in Fig. 7.4 (cf. [22] for details). Although this source is variable on a wide range of time-scale, which violates the underlying assumptions of both the Doppler tomography as well as the disentangling, both techniques are able to reveal main features of the circumstellar matter (cf. [4, 19]).
Even in the cases when the assumptions of both these Doppler techniques are satisfied, the resolved distribution of the radiation sources in the velocity space does not yield their distribution in space and the consequent angular resolution. Generally there is no one-to-one correspondence between these spaces due to the non-uniformity of the velocity field. To get an information about the spatial structure of the source, the interpretation must be constrained by an appropriate theoretical model (based e.g. on radiation hydrodynamics, cf. [3]) of the source and its radiation. The method of synthetic Dopplerograms is promising for this purpose (cf. [4, 16]).
A direct comparison of the observations with synthetic spectra based on sophisticated models of the sources may be more precise because it may avoid discrepancies caused by the simplifying assumptions of the methods of indirect imaging. However, these techniques are useful for indicating the dominant features in the observed phenomena and their achievements as well as failures are intuitive in suggesting the way in which the theory should proceed.
Notes
- 1.
A typical example is the epoch of a periodic RV curve, where for each solution there is an infinite set of solutions differing by an integer multiple of the period. These solutions are usually taken as equivalent. The errors and correlation of the epoch with the period depend on which of these solutions we choose. The period itself may have several solutions due to aliasing in quasi-periodically sampled data or due to an interference of data obtained in two time-remote observational seasons.
- 2.
If a quantity x has a probability distribution exp(−(x − x 0)2∕(2α 2)) and quantity y a distribution exp(−(y − y 0)2∕(2β 2)) then the linear combination ax + by has a mean value 〈ax + by〉 = ax 0 + by 0 and the quadratic error 〈(a(x − x 0) + b(y − y 0))2〉 = a 2 α 2 + b 2 β 2. Similarly for a sum of more statistically independent quantities we get recursively 〈∑ i = 1 n a i x i 〉 = ∑ i = 1 n a i x i0 and 〈(∑ i = 1 n a i (x i − x i0))2〉 = ∑ i = 1 n a i 2 α i 2.
- 3.
In the discrete representation of N input spectra, the residual noise after disentangling of m components is given by the sum S 0 ≃ (N − m)Kσ 2. However, it should be noted that both the signal and its noise may be rescaled by the Fourier transform and hence (due to the use of FFT) the residuum on output from KOREL is KS 0.
References
Albrecht, S.: The long history of the Rossiter – McLaughlin effect and its recent applications. IAUS 282, 379–384 (2012)
Bagnuolo, W.G., Jr., Gies, D.R.: Tomographic separation of composite spectra – the components of the O-star spectroscopic binary AO cassiopeiae. ApJ 376, 266–271 (1991)
Čechura, J., Hadrava, P.: Stellar wind in state transitions of high-mass X-ray binaries. A&A 575, A5 (2015)
Čechura, J., Vrtilek, S.D., Hadrava, P.: Interpreting the X-ray state transitions of Cygnus X-1. MNRAS 450, 2410–2422 (2015)
Hadrava, P.: Orbital elements of multiple spectroscopic stars. A&AS 114, 393–396 (1995)
Hadrava, P.: Relative line photometry of eclipsing binaries. A&AS 122, 581–584 (1997)
Hadrava, P.: Disentangling telluric lines in stellar spectra. A&A 448, 1149–1152 (2006)
Hadrava, P.: Line profiles during eclipses of binary stars. ASPC 370, 164–168 (2007)
Hadrava, P.: Notes on the disentangling of spectra. I. Enhancement in precision. A&A 494, 399–402 (2009)
Hadrava, P., Kubát, J.: Limb darkening and line-profile variations in eclipsing binaries. ASPC 288, 149–152 (2003)
Hadrava, P., Šlechta, M., Škoda, P.: Notes on disentangling of spectra. II. Intrinsic line-profile variability due to cepheid pulsations. A&A 507, 397–404 (2009)
Holt, R.J.: Spectroscopic determination of stellar rotation. Astron. Astro-Phys. 12, 646 (1893)
Marsh, T.R., Horne, K.: Images of accretion discs. II – Doppler tomography. MNRAS 235, 269–286 (1988)
McLaughlin, D.B.: Some results of a spectrographic study of the Algol system. ApJ 60, 22–31 (1924)
Ribas, I., Jordi, C., Vilardell, F., et al.: First determination of the distance and fundamental properties of an eclipsing binary in the andromeda galaxy. ApJ 635, L37–L40 (2005)
Richards, M.T., Cocking, A.S., Fisher, J.G., et al.: Images of gravitational and magnetic phenomena derived from two-dimensional back-projection Doppler tomography of interacting binary stars. ApJ 795, 160 (2014)
Rossiter, R.A.: On the detection of an effect of rotation during eclipse in the velocity of the brigher component of beta Lyrae, and on the constancy of velocity of this system. ApJ 60, 15–21 (1924)
Schlesinger, F.: The Algol-variable δ Librae. Publ. Allegh. Obs. 1, 123–134 (1910)
Sharova, O.I., Agafonov, M.I., Karitskaya, E.A., et al.: Doppler tomography in 2D and 3D of the X-ray binary Cyg X-1 for June 2007. IAUS 282, 201–202 (2012)
Simon, K.P., Sturm, E.: Disentangling of composite spectra. A&A 281, 286 (1994)
Škoda, P., Hadrava, P.: Fourier disentangling using the technology of virtual observatory. ASPC 435, 71–75 (2010)
Yan, J.Z., Liu, Q.Z., Hadrava, P.: Optical spectroscopic observations of Cyg X-1 = HDE 226868. AJ 136, 631–640 (2008)
Zwahlen, N., North, P., Debernardi, Y., et al.: A purely geometric distance to the binary star Atlas, a member of the Pleiades. A&A 425, L45–L48 (2004)
Acknowledgements
This work has been supported by the Czech Science Foundation (GAČR 14-37086G) – Albert Einstein Center for Gravitation and Astrophysics.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix: Bayesian Estimation of Parameters Errors
Appendix: Bayesian Estimation of Parameters Errors
The errors of parameters of a stellar system determined by disentangling of spectra can be estimated using Bayesian statistics. The errors are caused mainly by the noise in the observed spectra but they are also influenced by the phase distribution of the observations and by the sensitivity of the available data to a particular parameter. Usually we “know” in advance only that the values of parameters p can be expected in some “reasonable” regions (e.g., hundreds of km/s for orbital RVs, days or from hours to months for the orbital period, the line strengths and widths corresponding to the spectral types and rotational broadening, etc.). If previous studies of the same system exist their results can be taken as a more specific limitation of the possible range of p (and a first estimate of the new solution), which we want to verify or improve using a set of new data. If the old data are available they can be included together with the new ones into the new solution, otherwise the reliability of the old results must be estimated and they can be included into the new solution as constraints to the solution in the space of parameters p. Having the set of new observations, i.e. the spectra F(x), we can find the “posterior” probability P( p | F) of p agreeing with the new data according to Bayes’ theorem
where P( p) is the “prior” probability of p (i.e. either a smooth characteristic of the wide range of “reasonable” values or the constraint resulting from the old data), P(F | p) is the “liability” that for a chosen value of p the particular shape of F(x) will be detected and P(F) is a general probability do detect the signal F(x). The best new solution p max can be then defined as the maximum of P( p | F) in the p-space or we can find a mean value
(if P( p | F) is normalised to unity, i.e., ∫ P( p | F)dp = 1). In both cases the result depends on our choice of measures in the p space. The errors of resulting parameters p can be estimated from the behaviour of P( p | F) around the found solution. Because p is generally multidimensional (and different dimensions can be of different nature), it is not sufficient to attribute some error bar to each component of p separately as it is a common habit. If P( p | F) is sufficiently smooth around its maximum, it can be expanded into a Taylor series up to quadratic terms in variation of δ p and the non-diagonal components of the corresponding quadratic form determine the correlation of the parameters. Generally, however, there may be several local maxima in the p-space, which may be treated as different solutions.Footnote 1 The errors of these different solutions must be then determined from a local behaviour of P( p | F). Generally, the information about the studied system is more completely characterised by mapping of P( p | F) than by a simple list of found values of p and their errors.
7.1.1 Errors of Line Strengths and Radial Velocities
We shall illustrate the calculation of P( p | F) first on a toy model of measurements of one spectrum of a single star. Let us assume that the observed spectrum F(x) is given by
where δ F(x) is a random observational noise. We fit F(x) by a model f(x, p) dependent on a parameter (or a set of parameters) p by minimising the residual error
integrated over the whole observed region D of x. The equation(s) for p thus reads
The random fluctuations δ F in the observed signal F(x) blur this condition and result in a deviation δ p of the solution from its correct value. These variations are related by the condition
For example, let the spectrum be rectified, i.e. normalised to the continuum, which means f = 1 −φ, where φ(x) corresponds to the spectral lines. We shall assume first that the only unknown free parameter p is the strength of lines, i.e., the model has the form
where φ 1 is a pattern of the line-profile(s) imprinted into the observed spectrum F(x) with an unknown amplitude (e.g., due to uncertainty in element abundances, due to a contamination of the signal by light of another star or due to instrumental error in subtracting dark signal). Equation (7.19) is then a simple linear equation with solution
and in agreement with Eq. (7.20) its variations are given by
The integrals in these equations are actually summations over K individual pixels (each one of size Δ x = D∕K) in real observations,
Let us assume that the probability distribution of the noise δ F of the signal F(x) in each pixel can be approximated as a Gaussian with standard deviation σ, i.e., the probability of its value δ F in one pixel x i is
and, hence, the statistical mean value is 〈δ F 2〉 = σ 2. In addition, the noise in different pixels is supposed to be statistically independent, i.e.,
The probability distribution of error δ p is then also GaussianFootnote 2 with
The squared error of p thus decreases inversely proportionally with the number of pixels covering the profile φ(x). It can be seen from Eqs. (7.22) and (7.23) that the strength p and its uncertainty δ p are predominantly determined by the parts of the spectrum where the lines φ 1 of the model are deep.
If the unknown free parameter p is the Doppler shift of the spectrum, the model has the form
where φ 0(x) is a model line-profile in laboratory wavelength scale. The residual noise
is then no more a simple quadratic function of p and Eq. (7.19), which reads now
may have several solutions for p corresponding to coincidences of some improperly identified observed lines with wrong lines in the model. The uncertainty of p in each of these solutions can be estimated from the depth and width of the local minimum of the residual noise. Equation (7.20) reads now
where we have skipped the term \(\frac{d^{2}\varphi _{ 0}} {dx^{2}} (\varphi _{0}-\varphi )\) which should vanish at the correct value p = p 0 where φ = φ 0 (this need not be true at a false minimum). This equation shows that the error δ p of RV is dominated by the noise δ F in the steepest parts of the line profile. Analogously to Eq. (7.28), the mean squared value of this error is now
The difference between Eqs. (7.28) and (7.33) is due to the fact that the fit of line strengths is most sensitive to the part of the spectrum where the line is deepest, while the Doppler shift is most sensitive to the wings where the line profile is steepest.
7.1.2 Errors in Multidimensional Space of Parameters
We can optimise the fit of the observed spectrum simultaneously with respect to the line-strength, Doppler shift and some additional parameters. Generally, if the model spectrum φ 0(x, p) is a function of several parameters p i | i = 1 m, we get for them a set of m generally non-linear equations for p
Linearising, we arrive at the set of equations
for variations δ p j caused by variations δ F(x) of the observed spectrum. The matrix M ij is given here by
and the right-hand side by
Depending on the form of the model φ 0, some variation δ F can be compensated by a change of different p i , so that their variations may be correlated
Assuming that the noise δ F(x) is statistically independent at different pixels x k , x l , we have
If the noise 〈δ F(x)2〉 = σ 2 is statistically the same in all pixels and if the second term in the integral in Eq. (7.36) is negligible compared to the first one (which is the case if the fit is good and F(x) ≃ 1 −φ 0(x, p)), then
and
As an example, let us investigate a fit by a simple Gaussian profile
Equation (7.34) then reads
where k = 0, 2, 1 in conditions for p 1, p 2, p 3, resp. The variation of φ 0 reads
so that neglecting the second term in Eq. (7.36), the matrix M has the form
and hence according to Eq. (7.41) the correlation matrix of parameter errors reads
It means that the errors of the depth p 1 and the width p 2 of the line-profile given by Eq. (7.42), which are due to the part of the perturbation δ F symmetric with respect to the centre of the line, are anti-correlated, while the error in the position p 3 of the line centre is not correlated with them and its squared value is half of that of the width.
Results of a Monte Carlo simulation of this example can be seen in Fig. 7.5. The red line shows the profile (cf. Eq. (7.42)) for p 1 = 0. 2, p 2 = 0. 2 and p 3 = 0. 0. The black line has added a randomly generated noise with σ = 0. 04 in 1024 pixels in the displayed interval of x ∈ (−1, +1), i.e., Δ x = 2−9 ≃ 0. 00195. The blue line gives the best fit to this particular simulated measurement, which was found for values of parameters p 1 ≃ 0. 2002, p 2 ≃ 0. 2085 and p 3 = 0. 0029. The figure included in the bottom left corner shows the scatter of p 1 and p 2 around their true values for 1000 different choices of δ F (the drawn parts of coordinates correspond to ± 0. 01). Similarly the histogram in the right corner shows the distribution of p 3 (the width of each histogram column is 0.001). If the noise σ is decreased then in agreement with Eq. (7.38) or its approximation (Eq. (7.41)) also the uncertainty 〈δ p j δ p k 〉 of the parameters p is reduced. This agrees with the Bayes theorem (Eq. (7.15)) according to which the probability P( p | F) of a larger difference of the true value of p from its best fit to particular data F(x) decreases with decreasing probability P(F | p) that the random noise will mimic a wrong spectrum.
For real observed spectra we do not know the exact value of σ (which can only be estimated from the level of signal integrated during the exposure), while we may have a rough estimate only of Δ x and, generally, we do not know the explicit form of the matrix M ij and its inversion. However, we can estimate these values from the value S 0 of the residuum S( p) in its minimum and from variations of S(p) with respect to variations of p. If the spectrum is correctly fitted by a proper model, the residuum S 0 should be given by the noiseFootnote 3 only, i.e.,
The value of Δ x and hence for known D also the number K of statistically independent pixels of noise can be estimated from auto-correlation of the residual noise. In our example, the numerical simulation results in S 0∕Δ x ≃ 1. 73, giving σ ≃ 0. 0411 in good agreement with the value 0.04 for which the noise was generated. The residuum S( p 3) is drawn by the green line in Fig. 7.5 as a function of p 3 for p 1, 2 fixed to their best values (the zero level is shifted to the bottom of the panel for S( p 3)). Its behaviour can be estimated substituting φ 0 from Eqs. (7.42) into (7.30) also for φ,
The residual S( p 3) thus increases with the square of δ p 3 around its minimum
but it approaches saturation at level \(S_{\infty } = S_{0} + \sqrt{2\pi }p_{1}^{2}p_{2}\) for large δ p 3 (in agreement with numerical results S ∞ ∕Δ x ≃ 12. 5 in our example). This saturation corresponds to the possibility that the line visible in the spectrum is actually a random fluctuation of the noise, while the real line is in the distant wavelength δ p 3, but it is hidden in another noise fluctuation. The probability of this “hidden” solution is proportional to the width D of the wavelength interval. This may be in principle infinite, so that the probability of this solution may be high, even for a small value of σ. The limitation of D to the interval (−1, +1) in our example has a character of the prior P( p 3) which is chosen equal to zero out of this interval.
Regarding the assumption of the Gaussian noise (7.26), the liability P(δ F | p) of the noise δ F(x i ) | i = 1 K in all pixels is
Following Bayes’ theorem (Eq. (7.15)), the probability of p for known F (and a ’flat’ prior P( p)) reads
We thus obtain from Eq. (7.49) the probability distribution of radial velocity δ p 3 of the “visible” line
which yields, according to the definition (Eq. (7.16)), a mean value of squared velocity shift
in agreement with component { j, k} = { 3, 3} of Eq. (7.46).
It is worth noting that the error \(\sqrt{ \langle \delta p_{3}^{2}\rangle }\) on the radial velocity p 3 is proportional to the ratio σ∕p 1 of the noise to the line depth. Regarding the problem of desirable S/N-ratio mentioned in Sect. 7.2.4, we can see from this relation that there is no limitation. However, the higher the S/N-ratio is, the more precise is the determination of the disentangled parameters. From the observational and instrumental point of views, the S/N of spectra refers to the ratio of the noise to the overall spectral flux, i.e. the signal means the continuum. However, it is obvious that the signal which yields an information about RV and other spectral features is the modulation of the flux by spectral lines. It is thus desirable to optimise in the observations the ratio of line depths to the (photon) noise. The RV error is also proportional to the square root of the line width p 2 which indicates that a higher precision can be achieved from sharper lines, although they are covered by smaller number of pixels and are thus more influenced by fluctuations in photon counts.
In the disentangling technique, we do not have explicitly given in Eq. (7.17) the template spectrum f(x, p), like in Eqs. (7.21) or (7.42), but a more complicated expression like the right-hand side of Eqs. (7.1) or (7.9). The component spectra F j (x) whose imprints we want to find in the observations are now also unknown parameters that can be retrieved with some errors only. In analogy to Eq. (7.28), this error is proportional to the noise σ of the input spectra. However, usually the separation of the component spectra is overdetermined, as explained in Sect. 7.2.1 and hence, following the relations in the Footnote 2, the amplitude of the noise of the disentangled spectra decreases inversely proportionally to the square root of the number of observed spectra. The dependence of the model f(x, p) on nonlinear parameters p in the broadening functions Δ j (x, t; p) is more complicated and generally cannot be expressed explicitly like in the above given toy models. However, it is possible to map the distribution of the residual noise numerically as a function of these parameters.
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Hadrava, P. (2016). Disentangling of Stellar Spectra. In: Boffin, H., Hussain, G., Berger, JP., Schmidtobreick, L. (eds) Astronomy at High Angular Resolution. Astrophysics and Space Science Library, vol 439. Springer, Cham. https://doi.org/10.1007/978-3-319-39739-9_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-39739-9_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-39737-5
Online ISBN: 978-3-319-39739-9
eBook Packages: Physics and AstronomyPhysics and Astronomy (R0)