Abstract
The first part of this paper gives a general view of Knowledge Discovery in Data (KDD), emphasizing how much it differs from the fields it stems from and, in some cases, how much it opposes them.
The second part gives a definition of Knowledge Discovery in Texts (KDT), as opposed to what is presently known as information retrieval, information extraction, or knowledge extraction. I will provide an example of a real-life set of rules obtained by what I want to define as KDT techniques.
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
Cite this paper
Kodratoff, Y. (1999). Knowledge discovery in texts: A definition, and applications. In: Raś, Z.W., Skowron, A. (eds) Foundations of Intelligent Systems. ISMIS 1999. Lecture Notes in Computer Science, vol 1609. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0095087
DOI: https://doi.org/10.1007/BFb0095087
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65965-5
Online ISBN: 978-3-540-48828-6
eBook Packages: Springer Book Archive