Relevant Representations for the Inference of Rational Stochastic Tree Languages

Denis, François; Gilbert, Édouard; Habrard, Amaury; Ouardi, Faïssal; Tommasi, Marc

doi:10.1007/978-3-540-88009-7_5

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 5278)

Included in the following conference series:

International Colloquium on Grammatical Inference

Abstract

Recently, an algorithm called DEES was proposed for learning rational stochastic tree languages. Given a sample of trees drawn independently and identically according to a distribution defined by a rational stochastic language, DEES outputs a linear representation of a rational series that converges to the target. DEES can therefore be used to identify rational stochastic tree languages in the limit with probability one. However, when DEES deals with finite samples, it often outputs a rational tree series that does not define a stochastic language. Moreover, the linear representation cannot be used directly as a generative model. In this paper, we show that any representation of a rational stochastic tree language can be transformed into a reduced normalised representation that can be used to generate trees from the underlying distribution. We also study some consistency properties of rational stochastic tree languages and discuss their implications for inference. We finally consider the applicability of DEES to trees built over an unranked alphabet.
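
As a toy illustration of two notions used in the abstract (a weighted tree series that may or may not sum to one, and top-down generation once the weights are normalised), here is a minimal Python sketch. It is not taken from the paper. The alphabet (a leaf symbol `a` and a binary symbol `f`), the one-dimensional weights, and the function names `weight`, `total_mass` and `sample` are all assumptions chosen for illustration.

```python
import random

# Hypothetical ranked alphabet: 'a' has arity 0, 'f' has arity 2.
# Trees are nested tuples: ("a",) or ("f", left, right).
# Assumed one-dimensional weights: a leaf contributes p_leaf,
# an inner node contributes (1 - p_leaf).

def weight(tree, p_leaf):
    """Value r(t) of the toy series: product of the node weights of t."""
    if tree[0] == "a":
        return p_leaf
    _, left, right = tree
    return (1.0 - p_leaf) * weight(left, p_leaf) * weight(right, p_leaf)

def total_mass(p_leaf, iterations=200):
    """Sum of r(t) over all trees of height below `iterations`.

    Uses the recurrence m <- p_leaf + (1 - p_leaf) * m * m, started at 0.
    """
    m = 0.0
    for _ in range(iterations):
        m = p_leaf + (1.0 - p_leaf) * m * m
    return m

def sample(rng, p_leaf):
    """Draw a tree top-down: a leaf with probability p_leaf, else a binary node."""
    if rng.random() < p_leaf:
        return ("a",)
    return ("f", sample(rng, p_leaf), sample(rng, p_leaf))

if __name__ == "__main__":
    rng = random.Random(0)
    tree = sample(rng, 0.6)
    print(tree, weight(tree, 0.6))
    print(total_mass(0.6))  # close to 1: the weights define a distribution
    print(total_mass(0.4))  # close to 2/3: mass is lost to infinite derivations
```

In this toy case, the weights sum to one only when `p_leaf` is at least 0.5. Below that threshold the same local weights no longer define a probability distribution over finite trees, which is the kind of consistency question the abstract refers to.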

This work was partially supported by the Atash project ANR-05-RNTL00102 and the Marmota project ANR-05-MMSA-0016.

Author information

Authors and Affiliations

Laboratoire d’Informatique Fondamentale, CNRS, Aix-Marseille Université,
François Denis, Amaury Habrard & Faïssal Ouardi
Laboratoire d’Informatique Fondamentale de Lille (L.I.F.L.), INRIA and É.N.S. Cachan,
Édouard Gilbert & Marc Tommasi

Editor information

Alexander Clark, François Coste, Laurent Miclet

Cite this paper

Denis, F., Gilbert, É., Habrard, A., Ouardi, F., Tommasi, M. (2008). Relevant Representations for the Inference of Rational Stochastic Tree Languages. In: Clark, A., Coste, F., Miclet, L. (eds) Grammatical Inference: Algorithms and Applications. ICGI 2008. Lecture Notes in Computer Science, vol 5278. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88009-7_5

DOI: https://doi.org/10.1007/978-3-540-88009-7_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88008-0
Online ISBN: 978-3-540-88009-7
eBook Packages: Computer Science, Computer Science (R0)
