A Polynomial Algorithm for the Inference of Context Free Languages

Clark, Alexander; Eyraud, Rémi; Habrard, Amaury

doi:10.1007/978-3-540-88009-7_3

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 5278)

Included in the following conference series:

International Colloquium on Grammatical Inference

Abstract

We present a polynomial algorithm for the inductive inference of a large class of context free languages, which includes all regular languages. The algorithm uses a representation that we call Binary Feature Grammars, based on a set of features and capable of representing richly structured context free languages as well as some context sensitive languages. More precisely, we focus on a particular case of this representation in which the features correspond to contexts appearing in the language. Using the paradigm of positive data and a membership oracle, we establish that all context free languages that satisfy two constraints on the context distributions can be identified in the limit by this approach. The polynomial time algorithm we propose is based on a generalisation of distributional learning and uses the lattice of context occurrences. The formalism and the algorithm seem well suited to natural language, and in particular to the modelling of first language acquisition.
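The idea of using contexts as features can be illustrated with a small sketch. The code below is not the algorithm of the paper and does not build a Binary Feature Grammar; it only shows, under simplifying assumptions, how a finite positive sample yields a set of contexts (l, r) and how a membership oracle assigns to each substring the subset of those contexts in which it is accepted. The function names (`substrings`, `contexts`, `features`) and the toy oracle `anbn` for the language a^n b^n are hypothetical and chosen purely for illustration.

```python
from typing import Callable, Iterable, Set, Tuple

Context = Tuple[str, str]  # a context (l, r) wraps a substring u as l + u + r


def substrings(sample: Iterable[str]) -> Set[str]:
    """All non-empty substrings of the strings in the positive sample."""
    return {w[i:j] for w in sample
            for i in range(len(w))
            for j in range(i + 1, len(w) + 1)}


def contexts(sample: Iterable[str]) -> Set[Context]:
    """All contexts (l, r) obtained by cutting a (possibly empty) factor out of a sample string."""
    return {(w[:i], w[j:]) for w in sample
            for i in range(len(w) + 1)
            for j in range(i, len(w) + 1)}


def features(u: str, ctxs: Set[Context], oracle: Callable[[str], bool]) -> Set[Context]:
    """Contexts from ctxs in which u is accepted, according to the membership oracle."""
    return {(l, r) for (l, r) in ctxs if oracle(l + u + r)}


def anbn(w: str) -> bool:
    """Toy membership oracle (an assumption for this example): the language a^n b^n, n >= 1."""
    n = len(w) // 2
    return n >= 1 and w == "a" * n + "b" * n


sample = {"ab", "aabb"}          # positive data
ctxs = contexts(sample)          # candidate features
f_ab = features("ab", ctxs, anbn)
f_aabb = features("aabb", ctxs, anbn)
f_a = features("a", ctxs, anbn)
```

In this toy setting, "ab" and "aabb" receive the same feature set, while "a" receives a different one, which is the kind of distributional evidence a learner in this paradigm can exploit. Both the sample and the number of oracle calls are polynomial in the size of the data here, but nothing in this sketch reproduces the guarantees stated in the abstract.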

Author information

Authors and Affiliations

Alexander Clark: Department of Computer Science, Royal Holloway, University of London
Rémi Eyraud and Amaury Habrard: Laboratoire d’Informatique Fondamentale, University of Aix-Marseille, CNRS

Editor information

Alexander Clark, François Coste, Laurent Miclet

About this paper

Cite this paper

Clark, A., Eyraud, R., Habrard, A. (2008). A Polynomial Algorithm for the Inference of Context Free Languages. In: Clark, A., Coste, F., Miclet, L. (eds) Grammatical Inference: Algorithms and Applications. ICGI 2008. Lecture Notes in Computer Science, vol 5278. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88009-7_3

DOI: https://doi.org/10.1007/978-3-540-88009-7_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88008-0
Online ISBN: 978-3-540-88009-7
eBook Packages: Computer Science, Computer Science (R0)
