Markov Chain Monte Carlo optimization applied to double stars from Miller & Pitman research

Ersteniuk, Maksym; Banks, Timothy; Budding, Edwin; Rhodes, Michael D.

doi:10.1007/s12036-024-09997-5

Markov Chain Monte Carlo optimization applied to double stars from Miller & Pitman research

Published: 05 March 2024

Volume 45, article number 9, (2024)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Journal of Astrophysics and Astronomy Aims and scope Submit manuscript

Markov Chain Monte Carlo optimization applied to double stars from Miller & Pitman research

Download PDF

Maksym Ersteniuk¹,
Timothy Banks ORCID: orcid.org/0000-0001-9445-4588^1,2,
Edwin Budding^3,4 &
…
Michael D. Rhodes⁵

126 Accesses
Explore all metrics

Abstract

Model orbits have been fitted to 27 physical double stars listed in a 1922 catalog. A Markov Chain Monte Carlo technique was applied to estimate best-fitting values and associated uncertainties for the orbital parameters. Dynamical masses were calculated using parallaxes from the Hipparcos mission and are presented in this paper with the estimates of the orbital parameters for the 27 systems. The resulting mass estimates of the current study are in good agreement with a recently published study, as are comparisons with the orbital parameters listed by the Washington Double Star catalog, confirming the validity of the optimization methodology.

Tutorial: Asteroseismic Data Analysis with DIAMONDS

Precise Masses, Ages, and Orbital Parameters of the Binary Systems HIP 11352, HIP 70973, and HIP 72479

Article 01 January 2021

An analytic algorithm to calculate the inclination, ascending node, and semimajor axis of spectroscopic binary orbits using a single speckle measurement and the parallax

Article 01 October 2014

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Double stars make observing targets which are popular for various reasons including an interest in the practicalities of obtaining valuable scientific data with small telescopes. Such measurements can indicate whether the two stars, observed to be close to the sky, are a physical double, as the two stars should slowly shift their relative positions with time as they orbit about each other. Binary stars are important to astronomy as they allow direct determination of stellar masses. Where the stars are not observed to be following such orbits, their proximity in the sky might mean that they are actually gravitationally remote from each other and simply in a similar line of sight from Earth (i.e., an optical double). In practice, it turns out that this situation is often not the case. In passing, we note that Argyle (2004), MacEvoy & Tirion (2015) and Argyle et al. (2019) provide useful background materials for observing and analyzing visual doubles.

Table 1 Parameter estimates from MCMC fitting to the Miller & Pitman systems. Uncertainties are single sigma (one standard deviation). See the text for the explanation of the symbols used as the column titles other than ‘Epoch’, which is the time of phase zero for the orbital ephemeris, the orbital period (P) in years, and‘WDS’, which gives the Washington Double Star catalog ID. The final column (G) on the right employs the orbit grading scheme of Worley & Heintz (1983) as outlined inHartkopf et al. (2001).

Full size table

A major goal of the current paper is to outline the testing of an algorithm based on Markov Chain Monte Carlo (MCMC) optimization. This note documents our final testing of a Bayesian-based methodology by comparing systems with known results and placing these results into the literature for later use by the double-star community. The rationale behind these tests is that agreement of our findings with literature results would lend confidence for later general applications of the method, such as for systems without known orbital solutions. A noteworthy point is that our method provides uncertainties for the derived parameters, something not provided for many orbital solutions in the literature.

The paper, therefore, outlines the automated estimation of values and uncertainties of orbital parameters to a selection of physical double binaries listed in Miller & Pitman (1922), and in particular from their Table 1 of ‘First Class’ systems that those authors considered to possess well determined orbital estimates (and therefore good systems for the planned testing). Miller & Pitman (1922) did not present the orbital solutions and so we make use of the parameter estimates adopted in the Washington Double Star (WDS) catalog (Mason et al. 2022). We sourced the positional data from the WDS, current to 2023. Our paper presents estimates for the orbital parameters for these systems using all the available data. These results are in agreement with the solutions given in the WDS with the advantage that single sigma uncertainties are presented for all the estimated parameters. This is reached through the use of an optimization technique based on Bayesian statistics, which is described in the following section.

2 Markov Chain Monte Carlo

A Markov Chain is a probabilistic model describing the likelihood of future states based on the currently observed state. It is a ‘memory-less’ process, typically based on a matrix giving the transition probabilities between one observed state to another. The current state of the process depends only on the immediately previous one. A chain is built up of repeated steps through this transition matrix.

Markov Chain Monte Carlo (MCMC) combines such chains with a Monte Carlo approach, or basically random probabilities (Privault 2013; Hogg & Foreman-Mackey 2018). This combination allows MCMC to explore and then characterize a distribution by randomly sampling that distribution without requiring knowledge of the distribution’s mathematical properties (van Ravenzwaaij et al. 2018). It is a Bayesian statistical technique where inferences about unknown quantities (such as model parameters or predictions) are made by combining prior ‘knowledge’ (often called ‘beliefs’ in the literature) about those quantities together with observations.

No u-turn sampler (NUTS) avoids the random walk behavior of more simple MCMC algorithms, making a faster exploration of possible model parameter sets and a faster convergence to an optimal set of parameter estimates. It handles multiple parameter models better than simpler techniques, which struggle with these higher dimensional problems. We used this Hamiltonian Monte Carlo (HMC) technique for these reasons.

We are not the first authors to apply an MCMC method to visual double star data, but the technique is not yet widely used in the field (see, e.g., Sahlmann et al. 2013; Lucy 2014; Mendez et al. 2017).

3 Analysis

Table 2 Dynamical solar masses ($M_D$) based on Hipparcos parallaxes (given in the column ‘Hipparcos’ as milli-arcsec) and parameter estimates from the MCMC orbital fitting. Column $M_M$ lists the dynamical masses given in Malkov et al. (2012). $M_{VL}$ lists the dynamical mass estimates using the van Leeuwen (2007) parallaxes (listed in column ‘van Leeuwen’ and units of milli-arcsec). Errors in $M_D$ and $M_{VL}$ are one standard deviation.

Full size table

The orbit of a (visual) binary star system can be described on the xy plane as (see, e.g., Ribas et al. 2002):

$$\begin{aligned} x&= \frac{a ( 1 - e^2 )}{1 + e \cos {\nu }} [ \cos { (\nu + \omega )} \sin {\Omega }\\&\quad \,\,+ \sin {(\nu + \omega )} \cos {\Omega } \cos {i}], \\ y&= \frac{a ( 1 - e^2 )}{1 + e \cos {\nu }} [ \cos {(\nu + \omega )} \cos {\Omega }\\&\quad \,\,- \sin {(\nu + \omega )} \sin {\Omega } \cos {i} ], \end{aligned}$$

where a is the semi-major axis of the orbit, measured in arcsec. e is the orbital eccentricity. $\nu $ is the true anomaly (or function of time) of the orbit of the stars about their barycenter. i is the inclination, the angle between the plane of projection and the orbital plane. Position angles were precessed to the year 2000.

We implemented this model as the fitting function in the stan programming language,^{Footnote 1} using the NUTS MCMC variant to perform the optimization. We note that if we were only interested in point estimates for the parameters, there are superior optimization techniques which can reach such estimates with less computational effort. Our key interest in using MCMC was to see how well-constrained the parameter estimates are rather than just the optimal estimates alone. The stan code was called from the R programming language (R Core Team 2021), where we handled data processing and additional analysis. The role of MCMC was to adjust the model parameters so that the predicted positions became close to the actual data. In other words, the optimizer trialed different estimates for the parameters in the model function, measuring how well the model based on this function fitted the observed data. The measure of fit employed the $\chi ^2$ function (see Bevington 1969). The minimum chain length was 100,000 steps, with four chains being run simultaneously. Convergence about an optimal solution set was assessed through trace plots (charts plotting parameter estimates by step position along the chains), which should be statistically random about the optimal estimates, i.e., no trends should remain. We also used the R̂ statistic (Sinharay 2003) to assess convergence.

Best-fit solutions (and one standard deviation uncertainty) are listed in Table 1 for each modeled system. Figure 1 plots data for two example systems, along with the best-fit projected orbits based on the parameters given in Table 1.

The orbital solutions generally agree with those listed by WDS as the adopted solutions for that catalog, with the advantage that uncertainties for the parameters are given for all solutions. Not all WDS solutions have uncertainties provided for the parameter estimates, as seen in Figures 2 and 3. The NUTS-based uncertainties are generally larger than those given for the WDS solutions, even with our naive handling of errors. Eccentricity has the highest relative uncertainty out of the optimized parameters, followed by the argument of periastron. 42 Com Ber (BD $+$ 18 2697) has an inclination close to 90$^{\circ }$ and was difficult to model, leading to large uncertainties in parameter estimates for that system.

The dynamical (or combined stellar) mass $M_d$ of such binary systems can be calculated if the parallax is known via an equation (Malkov et al. 2012) based on Kepler’s third law:

$$\begin{aligned} M_{d} = \frac{a^3}{\pi ^3 P^2}, \end{aligned}$$

(1)

where both a and the parallax $\pi $ are in milli-arcsec, P is in years, and $M_d$ is in solar masses. Thirteen of the systems had Gaia DR3 parallaxes, and all 27 had Hipparcos parallaxes. The Hipparcos (ESA 1997; Perryman 2008) and Gaia (Gaia Collaboration 2016, 2022) parallaxes agreed well (regressing the Hipparcos parallaxes onto the corresponding Gaia values resulted in a slope of $0.998 \pm 0.007$ assuming a zero intercept). We therefore used the Hipparcos parallaxes for comparison given the good correlation and also the fact that more systems had parallax estimates in the Hipparcos dataset than the Gaia one. We compared our estimates for the dynamical masses (Table 2) with those from Malkov et al. (2012), see Figure 4. We found good agreement, indicating that our methodology appears reasonable, with the advantage that confidence intervals are generated for the optimized parameters. However, we also note that Malkov et al. (2012) made use of the reduction by van Leeuwen (2007) of Hipparcos astrometric data which improved parallax accuracies by up to a factor of four times for stars brighter than $H_p = 8$, as well as the later analyses of Al-Wardat et al. (2021) and Masda & Al-Wardat (2023) which demonstrated that the van Leeuwen (2007) parallaxes were superior to both the original Hipparcos and the DR3 Gaia estimates. Calculating dynamical masses using the van Leeuwen (2007) parallaxes (see Table 2) led to an improvement in the Pearson correlation coefficient from 0.905 to 0.960 (for the masses calculated in the current paper compared with those from Malkov et al. 2012), in line with the comments by van Leeuwen 2007, Al-Wardat et al. (2021) and Masda & Al-Wardat (2023). We therefore recommend using the mass estimates given in the column $M_{VL}$ of Table 2 as the final estimates of the dynamical masses for the studied systems.

4 Discussion

This paper presents in Table 1 new estimates of the orbital elements and uncertainties for a selection of systems listed in Miller & Pitman (1922), based on MCMC optimization. We also calculate dynamical masses using Equation (1) plus original (ESA 1997; Perryman 2008) and refined (van Leeuwen 2007) Hipparcos parallaxes (see Table 2), which we show to be in good agreement with Malkov et al. (2012). Figure 4(b) shows the best comparison between our results and those of Malkov et al. (2012). Comparison of the orbital elements is made between those adopted by the WDS and those derived by our MCMC method (see Figures 2 and 3). The estimates agree well, with the NUTS-based uncertainties tending to be significantly larger than those in the WDS-adopted solutions (not all such solutions provide formal errors). This comparison of known systems gives us confidence that the HMC-based technique presented could be reliably applied to new systems without previously published solutions. Indeed, we have used an earlier version of this methodology to analyze a multiple star system (Erdem et al. 2022), with the astrometric analysis complementing and extending the spectroscopic and photometric analyses. We intend to use this methodology as we extend our survey of detailed studies of multiple systems (such as Erdem et al. 2022) and recommend it to other researchers interested in not only estimating the orbital parameters but gaining insight into the accuracy of these estimates. We also hope that the orbital parameters (and accompanying uncertainties) presented by this paper for the 27 systems involved in the testing will be of interest to double-star researchers and that the paper acts as a record of the careful testing made of the methodology before its use for systems with no published estimates for orbital parameters or dynamical masses.

Notes

For further details on STAN see https://github.com/stan-dev/stan and https://mc-stan.org/users/documentation/

References

Al-Wardat M. A., Hussein A. M., Al-Naimiy H. M., Barstow M. A. 2021, PASP, 38, e002. https://doi.org/10.1017/pasa.2020.50
Article ADS Google Scholar
Argyle B. 2004, Observing and Measuring Visual Double Stars (London: Springer-Verlag)
Argyle B., Swan M., James A. 2019, An Anthology of Visual Double Stars, Cambridge University Press
Bevington P. R. 1969, Data Reduction and Analysis for the Physical Sciences (New York: McGraw Hill)
Erdem A., Surgit D., Ozkardes B. et al. 2022, MNRAS, 515, 6151
Article ADS Google Scholar
ESA 1997, ESA Special Publication, 1200
Gaia Collaboration, Prusti T., de Bruijne T. H. J. et al. 2016, A &A, 595, 1
Gaia Collaboration, Vallenari A., Brown A. G. A. et al. 2022, Gaia Data Release 3: Summary of the content and survey properties, arXiv e-prints, arXiv:2208.00211
Hartkopf W. I., Mason B. D., Worley C. E. 2001, AJ, 122, 3472
Hogg D. W., Foreman-Mackey D. 2018, ApJSS, 236, 11
Lucy L. B. 2014, A &A, 563, 126
Google Scholar
MacEvoy B., Tirion W. 2015, The Cambridge Double Star Atlas, 2nd edition. Cambridge University Press
Malkov O. Y., Tamaziani V. S., Docobol J. A., Chulkov D. A. 2012, A &A, 546, 69
Google Scholar
Masda S., Al-Wardat M. 2023, Advances in Space Research, 72, 649
Article ADS Google Scholar
Mason B. D., Wycoff G. L., Hartkopf W. I. 2022, The Washington Double Star Catalog
Mendez R. A., Claveria R. M., Orchard M. E., Silva J. F. 2017, AJ, 154, 187
Article ADS Google Scholar
Miller J. A., Pitman J. H. 1922, AJ, 34, 127
Article ADS Google Scholar
Perryman M. 2008, Astronomical Applications of Astrometry: Ten Years of Exploitation of the Hipparcos Satellite Data, Cambridge University Press, ISBN 9780521514897, https://doi.org/10.1017/CBO9780511575242
Privault N. 2013, Understanding Markov Chains. Springer Undergraduate Mathematics Series, Springer Singapore, https://doi.org/10.1007/978-981-13-0659-4
R Core Team 2021, R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria
Ribas I., Arenou F., Guinan E. F. 2002, AJ, 123, 2033
Article ADS Google Scholar
Ricker G. R. Winn J. N., Vanderspek R. et al. 2014, Proc. SPIE, Vol. 9143, https://doi.org/10.1117/12.2063489
Sahlmann J., Lazorenko P. F., Ségransan D. et al. 2013, A &A, 556, A133
Google Scholar
Sinharay S., 2003, Assessing Convergence of the Markov Chain Monte Carlo Algorithm: A Review, ETS Research Report Series, i-52
Stan Development Team 2021, RStan: the R interface to Stan, http://mc-stan.org/
van Leeuwen F. 2007, A &A, 474, 653
ADS Google Scholar
van Ravenzwaaij D., Cassey P., Brown S. D. 2018, Psychon. Bull. Rev., 25, 143. https://doi.org/10.3758/s13423-016-1015-8
Article PubMed Google Scholar
Worley C. E., Heintz W. D. 1983, The Fourth Catalog of Orbits of Visual Binary Stars, Publ. US Nav. Obs., 2d Ser., 24, Part 7 (Washington: GPO)

Download references

Acknowledgements

This research has used the Washington Double Star (WDS) Catalog maintained at the US Naval Observatory. We thank Dr Rachel Matson for extracting data from the WDS for us. This work has made use of data from the European Space Agency (ESA) mission Gaia (https://www.cosmos.esa.int/gaia), processed by the Gaia Data Processing and Analysis Consortium (DPAC, https://www.cosmos.esa.int/web/gaia/dpac/consortium). Funding for the DPAC has been provided by national institutions, particularly the institutions participating in the Gaia Multilateral Agreement. We thank the University of Queensland for the collaboration software. We thank the anonymous referee for their helpful comments and guidance, which improved this paper.

Author information

Authors and Affiliations

Department of Physical Science and Engineering, Harper College, 1200 W Algonquin Rd, Palatine, IL, 60067, USA
Maksym Ersteniuk & Timothy Banks
Nielsen, 675 6th Ave, New York, 10011, USA
Timothy Banks
Carter Observatory, 40 Salamanca Road, Kelburn, Wellington, 6012, New Zealand
Edwin Budding
School of Chemical and Physical Sciences, Victoria University of Wellington, PO Box 600, Wellington, 6140, New Zealand
Edwin Budding
Brigham Young University, Provo, UT, 84602, USA
Michael D. Rhodes

Authors

Maksym Ersteniuk
View author publications
You can also search for this author in PubMed Google Scholar
Timothy Banks
View author publications
You can also search for this author in PubMed Google Scholar
Edwin Budding
View author publications
You can also search for this author in PubMed Google Scholar
Michael D. Rhodes
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Timothy Banks.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Ersteniuk, M., Banks, T., Budding, E. et al. Markov Chain Monte Carlo optimization applied to double stars from Miller & Pitman research. J Astrophys Astron 45, 9 (2024). https://doi.org/10.1007/s12036-024-09997-5

Download citation

Received: 28 November 2023
Accepted: 09 January 2024
Published: 05 March 2024
DOI: https://doi.org/10.1007/s12036-024-09997-5

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Markov Chain Monte Carlo optimization applied to double stars from Miller & Pitman research

Abstract

Similar content being viewed by others

Tutorial: Asteroseismic Data Analysis with DIAMONDS

Precise Masses, Ages, and Orbital Parameters of the Binary Systems HIP 11352, HIP 70973, and HIP 72479

An analytic algorithm to calculate the inclination, ascending node, and semimajor axis of spectroscopic binary orbits using a single speckle measurement and the parallax

1 Introduction

2 Markov Chain Monte Carlo

3 Analysis

4 Discussion

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Markov Chain Monte Carlo optimization applied to double stars from Miller & Pitman research

Abstract

Similar content being viewed by others

Tutorial: Asteroseismic Data Analysis with DIAMONDS

Precise Masses, Ages, and Orbital Parameters of the Binary Systems HIP 11352, HIP 70973, and HIP 72479

An analytic algorithm to calculate the inclination, ascending node, and semimajor axis of spectroscopic binary orbits using a single speckle measurement and the parallax

1 Introduction

2 Markov Chain Monte Carlo

3 Analysis

4 Discussion

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation