Planar Languages and Learnability

Clark, Alexander; Florêncio, Christophe Costa; Watkins, Chris; Serayet, Mariette

doi:10.1007/11872436_13

Alexander Clark²³,
Christophe Costa Florêncio²³,
Chris Watkins²³ &
…
Mariette Serayet²⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4201))

Included in the following conference series:

International Colloquium on Grammatical Inference

537 Accesses
4 Citations

Abstract

Strings can be mapped into Hilbert spaces using feature maps such as the Parikh map. Languages can then be defined as the pre-image of hyperplanes in the feature space, rather than using grammars or automata. These are the planar languages. In this paper we show that using techniques from kernel-based learning, we can represent and efficiently learn, from positive data alone, various linguistically interesting context-sensitive languages. In particular we show that the cross-serial dependencies in Swiss German, that established the non-context-freeness of natural language, are learnable using a standard kernel. We demonstrate the polynomial-time identifiability in the limit of these classes, and discuss some language theoretic properties of these classes, and their relationship to the choice of kernel/feature map.

Access provided by Autonomous University of Puebla. Download to read the full chapter text

Chapter PDF

The Strong, Weak, and Very Weak Finite Context and Kernel Properties

Learning Tree Languages

Distributional Models for Lexical Semantics: An Investigation of Different Representations for Natural Language Learning

References

Aizerman, M.A., Braverman, E.M., Rozonoer, L.I.: Theoretical foundations of the potential function method in pattern recognition. Automation and Remote Control 25, 821–837 (1964)
MathSciNet Google Scholar
Asveld, P.R.J.: Generating all permutations by context-free grammars in Chomsky normal form. Theoretical Computer Science (TCS) 354(1), 118–130 (2006)
Article MATH MathSciNet Google Scholar
Clark, A., Florêncio, C.C., Watkins, C.: Languages as hyperplanes: grammatical inference with string kernels. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 90–101. Springer, Heidelberg (2006)
Chapter Google Scholar
de la Higuera, C.: Characteristic sets for polynomial grammatical inference. In: Miclet, L., de la Higuera, C. (eds.) ICGI 1996. LNCS, vol. 1147. Springer, Heidelberg (1996)
Google Scholar
Mark Gold, E.: Language identification in the limit. Information and Control 10, 447–474 (1967)
Article MATH Google Scholar
Huybregts, R.: The weak inadequacy of context-free phrase structure grammars. In: de Haan, G.J., Trommelen, M., Zonneveld, W. (eds.) Van Periferie naar Kern, Foris, Dordrecht (1984)
Google Scholar
Joshi, A.K., Schabes, Y.: Tree-adjoining grammars. In: Rosenberg, G., Salomaa, A. (eds.) Handbook of Formal Languages, vol. 3, pp. 69–123. Springer, New York (1996)
Google Scholar
Kontorovich, L.: Learning linearly separable languages. Technical Report CMU-CALD-04-105, School of Computer Science, CMU (2004)
Google Scholar
Motoki, T., Shinohara, T., Wright, K.: The correct definition of finite elasticity: Corrigendum to identification of unions. In: The Fourth Workshop on Computational Learning Theory. Morgan Kaufmann, San Mateo, Calif (1991)
Google Scholar
Salomaa, A.: On languages defined by numerical parameters. Technical Report 663, Turku Centre for Computer Science (2005)
Google Scholar
Shieber, S.M.: Evidence against the context-freeness of natural language. Linguistics and Philosophy 8, 333–343 (1985)
Article Google Scholar
Shawe-Taylor, J., Christianini, N.: Kernel Methods for Pattern Analysis. Cambridge University Press, Cambridge (2004)
Google Scholar
Watkins, C.: Dynamic alignment kernels. Technical Report CSD-TR-98-11, Department of Computer Science, Royal Holloway College, University of London (1999)
Google Scholar
Wright, K.: Identification of unions of languages drawn from an identifiable class. In: The 1989 Workshop on Computational Learning Theory, pp. 328–333. Morgan Kaufmann, San Mateo (1989)
Google Scholar
Yokomori, T., Kobayashi, S.: Learning local languages and their application to DNA sequence analysis. IEEE Trans. Pattern Anal. Mach. Intell. 20(10), 1067–1079 (1998)
Article Google Scholar
Yokomori, T.: Polynomial-time learning of very simple grammars from positive data. In: Proceedings of the Fourth Annual Workshop on Computational Learning Theory, University of California, Santa Cruz, August 5–7, 1991, pp. 213–227. ACM Press, New York (1991)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of London, Royal Holloway, Egham, TW20 0EX, UK
Alexander Clark, Christophe Costa Florêncio & Chris Watkins
Faculté des Sciences et Techniques, Département Informatique, 23, Rue du Docteur Paul Michelon, 42023 Cedex 2, Saint-Etienne, France
Mariette Serayet

Authors

Alexander Clark
View author publications
You can also search for this author in PubMed Google Scholar
Christophe Costa Florêncio
View author publications
You can also search for this author in PubMed Google Scholar
Chris Watkins
View author publications
You can also search for this author in PubMed Google Scholar
Mariette Serayet
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Biosciences and Informatics, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, 223-8522, Yokohama, Japan
Yasubumi Sakakibara
Dept. of Computer Science, Kyoto Sangyo University, Kamigamo Motoyama, Kita-ku, Kyoto, Japan
Satoshi Kobayashi
Japan Biological Informatics Consortium, 10F TIME24 Building, 2-45 Aomi, Koto-ku, 135-8073, Tokyo, Japan
Kengo Sato
Department of Information and Communication Engineering, Graduate School of Electro-Communications, The University of Electro-Communications, 1-5-1 Chofugaoka, Chofu-shi, 182-8585, Tokyo, Japan
Tetsuro Nishino
Department of Information and Communication Engineering, Faculty of Electro-Communications, The University of Electro-Communications, Chofugaoka 1–5–1, Chofu, 182-8585, Tokyo, Japan
Etsuji Tomita

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Clark, A., Florêncio, C.C., Watkins, C., Serayet, M. (2006). Planar Languages and Learnability. In: Sakakibara, Y., Kobayashi, S., Sato, K., Nishino, T., Tomita, E. (eds) Grammatical Inference: Algorithms and Applications. ICGI 2006. Lecture Notes in Computer Science(), vol 4201. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11872436_13

Download citation

DOI: https://doi.org/10.1007/11872436_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45264-5
Online ISBN: 978-3-540-45265-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Planar Languages and Learnability

Abstract

Chapter PDF

Similar content being viewed by others

The Strong, Weak, and Very Weak Finite Context and Kernel Properties

Learning Tree Languages

Distributional Models for Lexical Semantics: An Investigation of Different Representations for Natural Language Learning

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Planar Languages and Learnability

Abstract

Chapter PDF

Similar content being viewed by others

The Strong, Weak, and Very Weak Finite Context and Kernel Properties

Learning Tree Languages

Distributional Models for Lexical Semantics: An Investigation of Different Representations for Natural Language Learning

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation