Abstract
Model orbits have been fitted to 27 physical double stars listed in a 1922 catalog. A Markov Chain Monte Carlo technique was applied to estimate best-fitting values and associated uncertainties for the orbital parameters. Dynamical masses were calculated using parallaxes from the Hipparcos mission and are presented in this paper with the estimates of the orbital parameters for the 27 systems. The resulting mass estimates of the current study are in good agreement with a recently published study, as are comparisons with the orbital parameters listed by the Washington Double Star catalog, confirming the validity of the optimization methodology.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Double stars make observing targets which are popular for various reasons including an interest in the practicalities of obtaining valuable scientific data with small telescopes. Such measurements can indicate whether the two stars, observed to be close to the sky, are a physical double, as the two stars should slowly shift their relative positions with time as they orbit about each other. Binary stars are important to astronomy as they allow direct determination of stellar masses. Where the stars are not observed to be following such orbits, their proximity in the sky might mean that they are actually gravitationally remote from each other and simply in a similar line of sight from Earth (i.e., an optical double). In practice, it turns out that this situation is often not the case. In passing, we note that Argyle (2004), MacEvoy & Tirion (2015) and Argyle et al. (2019) provide useful background materials for observing and analyzing visual doubles.
A major goal of the current paper is to outline the testing of an algorithm based on Markov Chain Monte Carlo (MCMC) optimization. This note documents our final testing of a Bayesian-based methodology by comparing systems with known results and placing these results into the literature for later use by the double-star community. The rationale behind these tests is that agreement of our findings with literature results would lend confidence for later general applications of the method, such as for systems without known orbital solutions. A noteworthy point is that our method provides uncertainties for the derived parameters, something not provided for many orbital solutions in the literature.
The paper, therefore, outlines the automated estimation of values and uncertainties of orbital parameters to a selection of physical double binaries listed in Miller & Pitman (1922), and in particular from their Table 1 of ‘First Class’ systems that those authors considered to possess well determined orbital estimates (and therefore good systems for the planned testing). Miller & Pitman (1922) did not present the orbital solutions and so we make use of the parameter estimates adopted in the Washington Double Star (WDS) catalog (Mason et al. 2022). We sourced the positional data from the WDS, current to 2023. Our paper presents estimates for the orbital parameters for these systems using all the available data. These results are in agreement with the solutions given in the WDS with the advantage that single sigma uncertainties are presented for all the estimated parameters. This is reached through the use of an optimization technique based on Bayesian statistics, which is described in the following section.
2 Markov Chain Monte Carlo
A Markov Chain is a probabilistic model describing the likelihood of future states based on the currently observed state. It is a ‘memory-less’ process, typically based on a matrix giving the transition probabilities between one observed state to another. The current state of the process depends only on the immediately previous one. A chain is built up of repeated steps through this transition matrix.
Markov Chain Monte Carlo (MCMC) combines such chains with a Monte Carlo approach, or basically random probabilities (Privault 2013; Hogg & Foreman-Mackey 2018). This combination allows MCMC to explore and then characterize a distribution by randomly sampling that distribution without requiring knowledge of the distribution’s mathematical properties (van Ravenzwaaij et al. 2018). It is a Bayesian statistical technique where inferences about unknown quantities (such as model parameters or predictions) are made by combining prior ‘knowledge’ (often called ‘beliefs’ in the literature) about those quantities together with observations.
No u-turn sampler (NUTS) avoids the random walk behavior of more simple MCMC algorithms, making a faster exploration of possible model parameter sets and a faster convergence to an optimal set of parameter estimates. It handles multiple parameter models better than simpler techniques, which struggle with these higher dimensional problems. We used this Hamiltonian Monte Carlo (HMC) technique for these reasons.
We are not the first authors to apply an MCMC method to visual double star data, but the technique is not yet widely used in the field (see, e.g., Sahlmann et al. 2013; Lucy 2014; Mendez et al. 2017).
3 Analysis
The orbit of a (visual) binary star system can be described on the xy plane as (see, e.g., Ribas et al. 2002):
where a is the semi-major axis of the orbit, measured in arcsec. e is the orbital eccentricity. \(\nu \) is the true anomaly (or function of time) of the orbit of the stars about their barycenter. i is the inclination, the angle between the plane of projection and the orbital plane. Position angles were precessed to the year 2000.
We implemented this model as the fitting function in the stan programming language,Footnote 1 using the NUTS MCMC variant to perform the optimization. We note that if we were only interested in point estimates for the parameters, there are superior optimization techniques which can reach such estimates with less computational effort. Our key interest in using MCMC was to see how well-constrained the parameter estimates are rather than just the optimal estimates alone. The stan code was called from the R programming language (R Core Team 2021), where we handled data processing and additional analysis. The role of MCMC was to adjust the model parameters so that the predicted positions became close to the actual data. In other words, the optimizer trialed different estimates for the parameters in the model function, measuring how well the model based on this function fitted the observed data. The measure of fit employed the \(\chi ^2\) function (see Bevington 1969). The minimum chain length was 100,000 steps, with four chains being run simultaneously. Convergence about an optimal solution set was assessed through trace plots (charts plotting parameter estimates by step position along the chains), which should be statistically random about the optimal estimates, i.e., no trends should remain. We also used the R̂ statistic (Sinharay 2003) to assess convergence.
Best-fit solutions (and one standard deviation uncertainty) are listed in Table 1 for each modeled system. Figure 1 plots data for two example systems, along with the best-fit projected orbits based on the parameters given in Table 1.
The orbital solutions generally agree with those listed by WDS as the adopted solutions for that catalog, with the advantage that uncertainties for the parameters are given for all solutions. Not all WDS solutions have uncertainties provided for the parameter estimates, as seen in Figures 2 and 3. The NUTS-based uncertainties are generally larger than those given for the WDS solutions, even with our naive handling of errors. Eccentricity has the highest relative uncertainty out of the optimized parameters, followed by the argument of periastron. 42 Com Ber (BD \(+\) 18 2697) has an inclination close to 90\(^{\circ }\) and was difficult to model, leading to large uncertainties in parameter estimates for that system.
The dynamical (or combined stellar) mass \(M_d\) of such binary systems can be calculated if the parallax is known via an equation (Malkov et al. 2012) based on Kepler’s third law:
where both a and the parallax \(\pi \) are in milli-arcsec, P is in years, and \(M_d\) is in solar masses. Thirteen of the systems had Gaia DR3 parallaxes, and all 27 had Hipparcos parallaxes. The Hipparcos (ESA 1997; Perryman 2008) and Gaia (Gaia Collaboration 2016, 2022) parallaxes agreed well (regressing the Hipparcos parallaxes onto the corresponding Gaia values resulted in a slope of \(0.998 \pm 0.007\) assuming a zero intercept). We therefore used the Hipparcos parallaxes for comparison given the good correlation and also the fact that more systems had parallax estimates in the Hipparcos dataset than the Gaia one. We compared our estimates for the dynamical masses (Table 2) with those from Malkov et al. (2012), see Figure 4. We found good agreement, indicating that our methodology appears reasonable, with the advantage that confidence intervals are generated for the optimized parameters. However, we also note that Malkov et al. (2012) made use of the reduction by van Leeuwen (2007) of Hipparcos astrometric data which improved parallax accuracies by up to a factor of four times for stars brighter than \(H_p = 8\), as well as the later analyses of Al-Wardat et al. (2021) and Masda & Al-Wardat (2023) which demonstrated that the van Leeuwen (2007) parallaxes were superior to both the original Hipparcos and the DR3 Gaia estimates. Calculating dynamical masses using the van Leeuwen (2007) parallaxes (see Table 2) led to an improvement in the Pearson correlation coefficient from 0.905 to 0.960 (for the masses calculated in the current paper compared with those from Malkov et al. 2012), in line with the comments by van Leeuwen 2007, Al-Wardat et al. (2021) and Masda & Al-Wardat (2023). We therefore recommend using the mass estimates given in the column \(M_{VL}\) of Table 2 as the final estimates of the dynamical masses for the studied systems.
4 Discussion
This paper presents in Table 1 new estimates of the orbital elements and uncertainties for a selection of systems listed in Miller & Pitman (1922), based on MCMC optimization. We also calculate dynamical masses using Equation (1) plus original (ESA 1997; Perryman 2008) and refined (van Leeuwen 2007) Hipparcos parallaxes (see Table 2), which we show to be in good agreement with Malkov et al. (2012). Figure 4(b) shows the best comparison between our results and those of Malkov et al. (2012). Comparison of the orbital elements is made between those adopted by the WDS and those derived by our MCMC method (see Figures 2 and 3). The estimates agree well, with the NUTS-based uncertainties tending to be significantly larger than those in the WDS-adopted solutions (not all such solutions provide formal errors). This comparison of known systems gives us confidence that the HMC-based technique presented could be reliably applied to new systems without previously published solutions. Indeed, we have used an earlier version of this methodology to analyze a multiple star system (Erdem et al. 2022), with the astrometric analysis complementing and extending the spectroscopic and photometric analyses. We intend to use this methodology as we extend our survey of detailed studies of multiple systems (such as Erdem et al. 2022) and recommend it to other researchers interested in not only estimating the orbital parameters but gaining insight into the accuracy of these estimates. We also hope that the orbital parameters (and accompanying uncertainties) presented by this paper for the 27 systems involved in the testing will be of interest to double-star researchers and that the paper acts as a record of the careful testing made of the methodology before its use for systems with no published estimates for orbital parameters or dynamical masses.
Notes
For further details on STAN see https://github.com/stan-dev/stan and https://mc-stan.org/users/documentation/
References
Al-Wardat M. A., Hussein A. M., Al-Naimiy H. M., Barstow M. A. 2021, PASP, 38, e002. https://doi.org/10.1017/pasa.2020.50
Argyle B. 2004, Observing and Measuring Visual Double Stars (London: Springer-Verlag)
Argyle B., Swan M., James A. 2019, An Anthology of Visual Double Stars, Cambridge University Press
Bevington P. R. 1969, Data Reduction and Analysis for the Physical Sciences (New York: McGraw Hill)
Erdem A., Surgit D., Ozkardes B. et al. 2022, MNRAS, 515, 6151
ESA 1997, ESA Special Publication, 1200
Gaia Collaboration, Prusti T., de Bruijne T. H. J. et al. 2016, A &A, 595, 1
Gaia Collaboration, Vallenari A., Brown A. G. A. et al. 2022, Gaia Data Release 3: Summary of the content and survey properties, arXiv e-prints, arXiv:2208.00211
Hartkopf W. I., Mason B. D., Worley C. E. 2001, AJ, 122, 3472
Hogg D. W., Foreman-Mackey D. 2018, ApJSS, 236, 11
Lucy L. B. 2014, A &A, 563, 126
MacEvoy B., Tirion W. 2015, The Cambridge Double Star Atlas, 2nd edition. Cambridge University Press
Malkov O. Y., Tamaziani V. S., Docobol J. A., Chulkov D. A. 2012, A &A, 546, 69
Masda S., Al-Wardat M. 2023, Advances in Space Research, 72, 649
Mason B. D., Wycoff G. L., Hartkopf W. I. 2022, The Washington Double Star Catalog
Mendez R. A., Claveria R. M., Orchard M. E., Silva J. F. 2017, AJ, 154, 187
Miller J. A., Pitman J. H. 1922, AJ, 34, 127
Perryman M. 2008, Astronomical Applications of Astrometry: Ten Years of Exploitation of the Hipparcos Satellite Data, Cambridge University Press, ISBN 9780521514897, https://doi.org/10.1017/CBO9780511575242
Privault N. 2013, Understanding Markov Chains. Springer Undergraduate Mathematics Series, Springer Singapore, https://doi.org/10.1007/978-981-13-0659-4
R Core Team 2021, R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria
Ribas I., Arenou F., Guinan E. F. 2002, AJ, 123, 2033
Ricker G. R. Winn J. N., Vanderspek R. et al. 2014, Proc. SPIE, Vol. 9143, https://doi.org/10.1117/12.2063489
Sahlmann J., Lazorenko P. F., Ségransan D. et al. 2013, A &A, 556, A133
Sinharay S., 2003, Assessing Convergence of the Markov Chain Monte Carlo Algorithm: A Review, ETS Research Report Series, i-52
Stan Development Team 2021, RStan: the R interface to Stan, http://mc-stan.org/
van Leeuwen F. 2007, A &A, 474, 653
van Ravenzwaaij D., Cassey P., Brown S. D. 2018, Psychon. Bull. Rev., 25, 143. https://doi.org/10.3758/s13423-016-1015-8
Worley C. E., Heintz W. D. 1983, The Fourth Catalog of Orbits of Visual Binary Stars, Publ. US Nav. Obs., 2d Ser., 24, Part 7 (Washington: GPO)
Acknowledgements
This research has used the Washington Double Star (WDS) Catalog maintained at the US Naval Observatory. We thank Dr Rachel Matson for extracting data from the WDS for us. This work has made use of data from the European Space Agency (ESA) mission Gaia (https://www.cosmos.esa.int/gaia), processed by the Gaia Data Processing and Analysis Consortium (DPAC, https://www.cosmos.esa.int/web/gaia/dpac/consortium). Funding for the DPAC has been provided by national institutions, particularly the institutions participating in the Gaia Multilateral Agreement. We thank the University of Queensland for the collaboration software. We thank the anonymous referee for their helpful comments and guidance, which improved this paper.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Ersteniuk, M., Banks, T., Budding, E. et al. Markov Chain Monte Carlo optimization applied to double stars from Miller & Pitman research. J Astrophys Astron 45, 9 (2024). https://doi.org/10.1007/s12036-024-09997-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s12036-024-09997-5