Abstract
The advent of computing has exacerbated the problem of overwhelming information. Advanced information management strategies such as Information Extraction, Information Filtering, Information Retrieval, and Text Categorization are becoming important to manage the deluge of information. Information Extraction (IE) systems can be used to automatically extract relevant information from free-form text for update to databases or for report generation. This paper describes the major challenge of knowledge representation issues in an information extraction task-representing the meaning of the input text, the knowledge of the field of application (or domain application) and the knowledge about the target information to be extracted. In this research, we have chosen a directed graph structure to represent the input text meaning, a domain ontology to represent the domain application and a frame representation to capture the target information to be extracted. We discuss in this paper how these knowledge structures interplay to perform the task of information extraction.
Preview
Unable to display preview. Download preview PDF.
References
Appelt D E, J Bear, J R Hobbs, D Israel and M Tyson (1992). “SRI International FASTUS System” Proc. MUC-4, Morgan Kaufmann: 143–147.
Bobrow G. Daniel and Winograd Terry (1977). “An Overview of KRL, a Knowledge Representation Language”. Cognitive Science 1(1), 1977, 3–46.
Bobrow G. Daniel, R M Kaplan, M Kay, D A Norman, H Thompson and T Winograd (1977). “GUS, A Frame-Driven Dialog System” Artificial Intelligence, North-Holland Publishing Company 1977:155–173.
Brachman R.J. and Schmolze J.G. (1985). “An Overview of the KL-ONE Knowledge Representation System”. Cognitive Science 9: 171–216.
Charniak Eugene (1978). “On the use of framed knowledge in language comprehension”. Artificial Intelligence 11: 225–265
DARPA (1991). Proc. of Third Message Understanding Conference (MUC-3). Morgan Kaufmann Publishers Inc.
DARPA (1992). Proc. of Fourth Message Understanding Conference (MUC-4). Morgan Kaufmann Publishers Inc.
DARPA (1993). Proc. of Fifth Message Understanding Conference (MUC-5). Morgan Kaufmann Publishers Inc.
DARPA (1995). Proc. of Sixth Message Understanding Conference (MUC-6). Morgan Kaufmann Publishers Inc.
Jensen Karen, Heidorn E. George, Richardson D. Stephen 1993 Natural Language Processing: The PLNLP Approach. Kluwer Academic Publishers, Boston/Dordrecht/London. Chapter 16: 203–214. Chapter 21: 273–283.
Krupka G, P Jacobs, L Rau, and L Iwanska (1991). “The GE NLToolset System” Proc. MUC-3, Morgan Kaufmann.
Marco Costantino, Richard G. Morgan, Russell J. Collingham, Roberto Garigliano (1997). “Natural Language Processing and Information Extraction: Qualitative Analysis of Financial News Articles” Proc. of Conference on Computational Intelligence for Financial Engineering (CIFEr’97), New York City, March 23–25, 1997.
Nyberg H. Eric (1988). “The FrameKit User’s Guide Version 2.0”, Carnegie Mellon University, 1988.
Tan Sian Lip, Tong Loong Cheong (1993). “A statistical approach to automatic text extraction.” Asian Libraries, Vol. 3 No 1, Mar 1993: 46–54.
Tan Sian Lip, Aw Ai Ti (1993). “Domain specific information Extraction—a NLP-Enable application.” Proc. of the First Symposium on Intelligent Systems Applications (SISA ’93), Singapore, Nov 1993.
Tong Loong Cheong, Wee Li Kwang, Goh Ann Loo, Lee Chee Qwun, and Teo Pit Koon (dy1992). “A Telex Destination Identification System.” Proc. First Singapore Int. Conf. on Intelligent Systems (SPICIS 92), Sep 1992: 281–287.
Allen James (1987). “Natural Language Understanding”. University of Rochester. Menlo Park: The Benjamin/Cummings Publishing Company, Inc.
Wan Kwee Ngim, Tong Loong Cheong, Lynda Ang Seok Lay (1993). “Automatic Categorisation of Cargo Descriptions.” Proc. of the First Symposium on Intelligent Systems Applications (SISA ’93), Singapore, Nov 1993.
Tong Loong Cheong, Angela Wee Li Kwang, Augustina Gunawan, Goh Ann Loo, Lee Chee Qwun, and Shu Huey Leng (1994). “A Pragmatic Information Extraction Architecture for the Message Formatting Expert (MFE) System”. Proc. Second Singapore Int. Conf. on Intelligent Systems (SPICIS 94), Nov 1994: B371–377.
Wee Li Kwang Angela, Tong Loong Cheong, Chng Tiak Jung (1997). “DeNews—A Personalized News System.” Journal of Expert Systems with Applications, Vol. 13, 1997, Elsevier Science Ltd., UK 0957-4174/97.
Dolan C P, S R Goldman, T V Cuda and A M Nakamura (1991). “Hughes Trainable Text Skimmer” Proc. MUC-3, Morgan Kaufmann: 155–162.
Tong Loong Cheong, Low Poh Lian (1991). “Automatic Text Abstraction-Prospects and a Proposed R&D Plan” Information Technology. Journal of SCS, Vol 4 No 2, Sep 1991: 85–94.
Julian Kupiec, Jan O. Pedersen, Francine Chen (1995). “A Trainable Document Summarizer”. Proc. of the 18th ACM/SIGIR Conference, 1995: 68–73.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Angela, W.L.K., Cheong, T.L., Lim, T.C. (1998). Knowledge representation issues in information extraction. In: Lee, HY., Motoda, H. (eds) PRICAI’98: Topics in Artificial Intelligence. PRICAI 1998. Lecture Notes in Computer Science, vol 1531. Springer, Berlin, Heidelberg . https://doi.org/10.1007/BFb0095291
Download citation
DOI: https://doi.org/10.1007/BFb0095291
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65271-7
Online ISBN: 978-3-540-49461-4
eBook Packages: Springer Book Archive