Abstract
A knowledge structure identifies how people think and displays a macro view of human perception. By discovering the hidden structural relations of knowledge, significant reasoning patterns are retrieved to enhance further knowledge sharing and distribution. However, the utilization of such approaches is apt to be limited due to the lack of hierarchical features and the problem of information overload, which make it difficult to enhance comprehension and provide effective navigation. To address these critical issues, we propose a new approach to construct a tree-based knowledge structure from corpus which can reveal the significant relations among knowledge objects and enhance user comprehension. The effectiveness of the proposed method is demonstrated with two representative public data sets. The evaluation results show that the method presented in this work achieves remarkable consistency with the domain-specific knowledge structure, and is capable of reflecting appropriate similarities among knowledge objects along with hierarchical implications in the document classification task.
Article PDF
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
References
Bradley JH, Paul R, Seeman E (2006) Analyzingthe structure of expert knowledge. Inf Manag 43:77–91
Chen NS, Kinshuk Wei CW, Chen HJ (2008) Mining e-learning domain concept map from academic articles. Comput Educ 50:1009–1021
Tseng SS, Sue PC, Su JM, Weng JF, Tsai WN (2007) A new approach for constructing the concept map. Comput Educ 49:691–707
Eppler MJ, RA Burkhard (2007) Visual representations in knowledge management: framework and cases. J Knowl Manag 11:112–122
Murphy GL, Lassaline ME (1997) Hierarchical structure in concepts and the basic level of categorization. In: Lamberts K, Shanks D (eds) Knowledge, concepts and categories. MIT Press, Cambridge
Stumme G, Taouil R, Bastide Y, Pasquier N, Lakhal L (2002) Computing iceberg concept lattices with titanic. Data Knowl Eng 42:189–222
Rajapakse RK, Denham M (2006) Text retrieval with more realistic concept matching and reinforcement learning. Inform Process Manag 42:1260–1275
Belohlavek R, Dvorak J, Outrata J (2007) Fast factorization by similarity in formal concept analysis of data with fuzzy attributes. J Comput Syst Sci 73:1012–1022
Sparrow J (1998) Knowledge in organizations: access to thinking at work. Sage, London
Novak JD (1993) How do we learn our lesson? Taking students through the process. Sci Teach 60:50–55
Ruiz-Primo MA, Schultz SE, Li M, Shavelson RJ (2001) Comparison of the reliability and validity of sores from two concept-mapping techniques. J Res Sci Teach 38:260–278
Wang J (2003) A knowledge network constructed by integrating classification, thesaurus, and metadata in digital library. Int Inf Libr Rev 35:383–397
Xu JJ, Chen H (2005) Crimenet explorer: a framework for criminal network knowledge discovery. ACM Trans Inform Syst 23:201–226
Schvaneveldt RW (1990) Pathfinder associative networks: studies in organization. Albex Publishing, Norwood
Chen RC, Liang JY, Pan RH (2008) Using recursive art network to construction domain ontology based on term frequency and inverse document frequency. Expert Syst Appl 34:488–501
Fenza G, Loia V, Senatore S (2008) A hybrid approach to semantic web services matchmaking. Int J Approx Reason 48:808–828
Lee CS, Kao YF, Kuo YH, Wang MH (2007) Automated ontology construction for unstructured text documents. Data Knowl Eng 60:547–566
Reformat M, Ly C (2009) Ontological approach to development of computing with words based systems. Int J Approx Reason 50:72–91
Lee CS, Jian ZW, Huang LK (2005) A fuzzy ontology and its application to news summarization. IEEE Trans Syst Man Cybern, Part B, Cybern 35:859–880
Stumme G (2003) Off to new shores: conceptual knowledge discovery and processing. Int J Human-Comput Stud 59:287–325
Priss U (2006) Formal concept analysis in information science. In: Cronin B (ed) Annual review of information science and technology (arist), vol 40. Information Today Medford, New Jersey, pp 521–543
Ganter B, Wille R (1999) Formal concept analysis: mathematical foundations. Springer, New York
Tho QT, Hui SC, Fong, Cao TH (2006) Automatic fuzzy ontology generation for semantic web. IEEE Trans Knowl Data Eng 18:842–856
Chi YL (2007) Elicitation synergy of extracting conceptual tags and hierarchies. Expert Syst Appl 32:349–357
Formica A, Missikoff M (2004) Inheritance processing and conflicts in structural generalization hierarchies. ACM Comput Surv 36:263–290
Carpineto C, Romano G (2004) Exploiting the potential of concept lattices for information retrieval with credo. J Univ Comput Sci 10:985–1013
Everitt B (1993) Cluster analysis. Edward Arnold, London
Steinbach M, Karypis G, Kumar V (2000) A comparison of document clustering techniques. Paper presented at the KDD Workshop on Text Mining, Boston, MA, USA
Treeratpituk P, Callan J (2006) Automatically labeling hierarchical clusters. Paper presented at the proceedings of the 6th national conference on digital government research, San Diego, USA
Chung W, Chen H, Nunamaker JF Jr (2005) A visual framework for knowledge discovery on the web: an empirical study of business intelligence exploration. J Manag Inf Syst 21:57–84
Cimiano P, Hotho A, Staab S (2005) Learning concept hierarchies from text corpora using formal concept analysis. J Artif Intell Res 24:305–339
Lammari N, Metais E (2004) Building and maintaining ontologies: a set of algorithms. Data Knowl Eng 48:155–176
Sanderson M, Lawrie D (2000) Build, testing and applying concept hierarchies. In: Advances in information retrieval: recent research from the center for intelligent information retrieval, vol 7. Springer, New York, pp 235–266
Yates RB, Neto BR (1999) Modern information retrieval. ACM Press, New York
Glover E Pennock DM, Lawrence S, Krovetz R (2002) Inferring hierarchical descriptions. In: Proceedings of the 20th international conference on information and knowledge management (CIKM), McLean, Virginia, pp 507–514
Wu HC, Luk RWP, Wong KF, Kwok KL (2008) Interpreting TF-IDF term weights as making relevance decisions. ACM Trans Inf Syst 26:1–37
Rodríguez MA, Egenhofer MJ (2003) Determining semantic similarity among entity classes from different ontologies. IEEE Trans Knowl Data Eng 15:442–456
Tang J, Li J, Liang B, Huang X, Li Y, Wang K (2006) Using bayesian decision for ontology mapping. Web Semantics: Science, Services and Agents on the World Wide Web 4:243–262
Fellbaum C (1998) Wordnet: an electronic lexical database. MIT Press, Cambridge
Li Y, Bandar ZA, McLean D (2003) An approach for measuring semantic similarity between words using multiple information source. IEEE Trans Knowl Data Eng 15:871–882
Author information
Authors and Affiliations
Corresponding author
Additional information
The work of S.-T. Li is partly supported by National Science Council, Taiwan under contract NSC98-2410-H-006-007.
Rights and permissions
About this article
Cite this article
Li, ST., Tsai, FC. Constructing tree-based knowledge structures from text corpus. Appl Intell 33, 67–78 (2010). https://doi.org/10.1007/s10489-010-0243-2
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-010-0243-2