Abstract
The Bayesian approach to machine learning amounts to inferring posterior distributions of random variables from a probabilistic model of how the variables are related (that is, a prior distribution) and a set of observations of variables. There is a trend in machine learning towards expressing Bayesian models as probabilistic programs. As a foundation for this kind of programming, we propose a core functional calculus with primitives for sampling prior distributions and observing variables. We define combinators for measure transformers, based on theorems in measure theory, and use these to give a rigorous semantics to our core calculus. The original features of our semantics include its support for discrete, continuous, and hybrid measures, and, in particular, for observations of zero-probability events. We compile our core language to a small imperative language that has a straightforward semantics via factor graphs, data structures that enable many efficient inference algorithms. We use an existing inference engine for efficient approximate inference of posterior marginal distributions, processing thousands of observations per second for large instances of realistic models.
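To make the sample/observe idea concrete, here is a minimal sketch (not the paper's calculus, and only for discrete measures, so it cannot condition on zero-probability events): a measure is a list of weighted outcomes, and a hypothetical `observe` combinator conditions it by discarding outcomes that violate the observation and renormalizing.

```python
# A toy discrete-measure model: all names here (observe, prior, joint)
# are illustrative, not from the paper's core calculus.
from fractions import Fraction

def observe(dist, pred):
    """Condition a discrete measure (list of (value, weight) pairs)
    on a predicate by filtering and renormalizing the weights."""
    kept = [(v, w) for v, w in dist if pred(v)]
    total = sum(w for _, w in kept)
    return [(v, w / total) for v, w in kept]

# Uniform prior over a coin's bias.
prior = [(Fraction(1, 10), Fraction(1, 3)),
         (Fraction(1, 2),  Fraction(1, 3)),
         (Fraction(9, 10), Fraction(1, 3))]

# Joint measure over (bias, flip): the flip is heads with probability = bias.
joint = [((b, h), w * (b if h else 1 - b))
         for b, w in prior for h in (True, False)]

# Observing one heads flip shifts posterior mass toward the larger biases.
posterior = observe(joint, lambda bh: bh[1])
marginal = {b: w for (b, _), w in posterior}   # posterior over bias
# marginal[Fraction(9, 10)] == Fraction(3, 5)
```

Exact enumeration like this only scales to tiny discrete models; the paper's compilation to factor graphs is what makes inference efficient, and its measure-transformer semantics is what extends conditioning to continuous and hybrid measures.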
© 2011 Springer-Verlag Berlin Heidelberg
Borgström, J., Gordon, A.D., Greenberg, M., Margetson, J., Van Gael, J. (2011). Measure Transformer Semantics for Bayesian Machine Learning. In: Barthe, G. (eds) Programming Languages and Systems. ESOP 2011. Lecture Notes in Computer Science, vol 6602. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19718-5_5
Print ISBN: 978-3-642-19717-8
Online ISBN: 978-3-642-19718-5