Abstract
Prior to mining data for knowledge, selecting a potentially useful set of target data is necessary. Mining with missing attribute values increases uncertainty and decreases discovery accuracy. We present an instance selection method that determines the mining usability of an instance based on knowledge about which attributes are missing and the relative significance of the various attributes as defined by a domain expert. Knowledge-based instance selection (KbIS) is an instance utility metric that incorporates domain knowledge into a multi-criteria decision-making technique for instance selection.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bosc, P. and Lietard, M. 1997. Quantified statements and some interpretations for the OWA operator. In The Ordered Weighted Averaging Operators: Theory and Application, R. R. Yager and J. Kacprzyk (Eds.). Kluwer, pp. 241–257.
Brachman, R. J. and T. Anand. 1996. The process of knowledge discovery in databases: A human-centered approach. In Advances in Knowledge Discovery and Data Mining, U. M. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy (Eds.). Menlo Park, CA: AAAI Press, pp. 37–57.
Cheeseman, P., J. Kelly, M. Self, J. Stutz, W. Taylor, and D. Freeman. 1988. AutoClass: A Bayesian classification system. Proc. 5th Int. Conf. on Machine Learning, pp. 54–64.
Famili, A. <Fazel.Famili©iit.nrc.ca>. Personal communication about 20% heuristic method. (3 November 1997).
Famili, A., W. M. Shen, R. Weber, and E. Simoudis. 1997. Data preprocessing and intelligent data analysis. Intelligent Data Analysis 1(1): <http://www.elsevier.com/locate/da> (21 January 1997).
Fayyad, U. M., G. Piatetsky-Shapiro, and P. Smyth. 1996. From data mining to knowledge discovery: An overview. In Advances in Knowledge Discovery and Data Mining, U. M. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy (Eds.). Menlo Park, CA: AAAI Press, pp. 1–34.
Grinstein, G. G. 1996. Harnessing the human in knowledge discovery. In KDD-96, Proc. 2nd Int. Conf. on Knowledge Discovery and Data Mining, Portland, Oregon, pp. 384–385.
Holsheimer, M. and A. Siebes. 1994. Data mining: The search for knowledge in databases. Amsterdam, The Netherlands: CWI, Report No. CS-R9406.
Huber, P. J. 1997. From large to huge: A statistician’s reactions to KDD and DM. KDD-97, Proc. 3rd Int. Conf. on Knowledge Discovery and Data Mining, Newport Beach, California., pp. 304–308.
Kacprzyk, J., M. Fedrizzi, and H. Nurmi. 1997. OWA operators in group decision making and consensus reaching under fuzzy preferences and fuzzy majority. In The Ordered Weighted Averaging Operators: Theory and Application, R. R. Yager and J. Kacprzyk (Eds.). Kluwer, pp.193–206.
Liu, B. and W. Hsu. 1996. Post-Analysis of learned rules. Proc. 13th Natl. Conf. on AI and 8th Innovative Appl. of AI, pp. 164–168.
Mitchell, T. M. 1997. Machine Learning. Boston, MA: McGraw-Hill.
Owrang, M. M. and F. H. Grupe. 1996. Using domain knowledge to guide database knowledge discovery. Exp. Syst. with Appl. 10(2):173–180.
Quinlan, J. R. 1993. C4.5: Programs for Machine Learning. San Mateo, CA: Morgan Kaufmann.
Weiss, S. M. and N. Indurkhya. 1997. Predictive Data Mining: A Practical Guide. San Francisco, CA: Morgan Kaufmann.
Yager, R. R. 1998. On ordered weighted averaging aggregation operators in multi-criteria decision making. IEEE Trans. on Syst., Man, and Cybern. 18:183–190.
Yager, R. R. 1993. On ordered weighted averaging aggregation operators. In Readings in Fuzzy Sets for Intelligent Systems, D. Dubois, H. Prade, and R. R. Yager (Eds.). San Mateo, CA: Morgan Kaufmann, pp. 80–87.
Yager, R. R. 1997a. Criteria importances in OWA aggregation. Proc. 6th Int. Conf. on Fuzzy Syst., Barcelona Spain, pp. 1677–1682.
Yager, R. R. 1997b. On the inclusion of importances in OWA aggregations. In The Ordered Weighted Averaging Operators: Theory and Application, R. R. Yager and J. Kacprzyk (Eds.). Kluwer, pp. 41–59.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Wright, P., Hodges, J. (2001). KBIS: Using Domain Knowledge to Guide Instance Selection. In: Liu, H., Motoda, H. (eds) Instance Selection and Construction for Data Mining. The Springer International Series in Engineering and Computer Science, vol 608. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-3359-4_15
Download citation
DOI: https://doi.org/10.1007/978-1-4757-3359-4_15
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-4861-8
Online ISBN: 978-1-4757-3359-4
eBook Packages: Springer Book Archive