KBIS: Using Domain Knowledge to Guide Instance Selection

Wright, Peggy; Hodges, Julia

doi:10.1007/978-1-4757-3359-4_15

Peggy Wright³ &
Julia Hodges⁴

Part of the book series: The Springer International Series in Engineering and Computer Science ((SECS,volume 608))

293 Accesses
1 Citations

Abstract

Prior to mining data for knowledge, selecting a potentially useful set of target data is necessary. Mining with missing attribute values increases uncertainty and decreases discovery accuracy. We present an instance selection method that determines the mining usability of an instance based on knowledge about which attributes are missing and the relative significance of the various attributes as defined by a domain expert. Knowledge-based instance selection (KbIS) is an instance utility metric that incorporates domain knowledge into a multi-criteria decision-making technique for instance selection.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

An Efficient Approach for Instance Selection

Information Selection and Data Compression RapidMiner Library

Instance Selection for the Nearest Neighbor Classifier: Connecting the Performance to the Underlying Data Structure

References

Bosc, P. and Lietard, M. 1997. Quantified statements and some interpretations for the OWA operator. In The Ordered Weighted Averaging Operators: Theory and Application, R. R. Yager and J. Kacprzyk (Eds.). Kluwer, pp. 241–257.
Chapter Google Scholar
Brachman, R. J. and T. Anand. 1996. The process of knowledge discovery in databases: A human-centered approach. In Advances in Knowledge Discovery and Data Mining, U. M. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy (Eds.). Menlo Park, CA: AAAI Press, pp. 37–57.
Google Scholar
Cheeseman, P., J. Kelly, M. Self, J. Stutz, W. Taylor, and D. Freeman. 1988. AutoClass: A Bayesian classification system. Proc. 5th Int. Conf. on Machine Learning, pp. 54–64.
Google Scholar
Famili, A. <Fazel.Famili©iit.nrc.ca>. Personal communication about 20% heuristic method. (3 November 1997).
Google Scholar
Famili, A., W. M. Shen, R. Weber, and E. Simoudis. 1997. Data preprocessing and intelligent data analysis. Intelligent Data Analysis 1(1): <http://www.elsevier.com/locate/da> (21 January 1997).
Google Scholar
Fayyad, U. M., G. Piatetsky-Shapiro, and P. Smyth. 1996. From data mining to knowledge discovery: An overview. In Advances in Knowledge Discovery and Data Mining, U. M. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy (Eds.). Menlo Park, CA: AAAI Press, pp. 1–34.
Google Scholar
Grinstein, G. G. 1996. Harnessing the human in knowledge discovery. In KDD-96, Proc. 2nd Int. Conf. on Knowledge Discovery and Data Mining, Portland, Oregon, pp. 384–385.
Google Scholar
Holsheimer, M. and A. Siebes. 1994. Data mining: The search for knowledge in databases. Amsterdam, The Netherlands: CWI, Report No. CS-R9406.
Google Scholar
Huber, P. J. 1997. From large to huge: A statistician’s reactions to KDD and DM. KDD-97, Proc. 3rd Int. Conf. on Knowledge Discovery and Data Mining, Newport Beach, California., pp. 304–308.
Google Scholar
Kacprzyk, J., M. Fedrizzi, and H. Nurmi. 1997. OWA operators in group decision making and consensus reaching under fuzzy preferences and fuzzy majority. In The Ordered Weighted Averaging Operators: Theory and Application, R. R. Yager and J. Kacprzyk (Eds.). Kluwer, pp.193–206.
Chapter Google Scholar
Liu, B. and W. Hsu. 1996. Post-Analysis of learned rules. Proc. 13th Natl. Conf. on AI and 8th Innovative Appl. of AI, pp. 164–168.
Google Scholar
Mitchell, T. M. 1997. Machine Learning. Boston, MA: McGraw-Hill.
MATH Google Scholar
Owrang, M. M. and F. H. Grupe. 1996. Using domain knowledge to guide database knowledge discovery. Exp. Syst. with Appl. 10(2):173–180.
Article Google Scholar
Quinlan, J. R. 1993. C4.5: Programs for Machine Learning. San Mateo, CA: Morgan Kaufmann.
Google Scholar
Weiss, S. M. and N. Indurkhya. 1997. Predictive Data Mining: A Practical Guide. San Francisco, CA: Morgan Kaufmann.
Google Scholar
Yager, R. R. 1998. On ordered weighted averaging aggregation operators in multi-criteria decision making. IEEE Trans. on Syst., Man, and Cybern. 18:183–190.
Article MathSciNet Google Scholar
Yager, R. R. 1993. On ordered weighted averaging aggregation operators. In Readings in Fuzzy Sets for Intelligent Systems, D. Dubois, H. Prade, and R. R. Yager (Eds.). San Mateo, CA: Morgan Kaufmann, pp. 80–87.
Google Scholar
Yager, R. R. 1997a. Criteria importances in OWA aggregation. Proc. 6th Int. Conf. on Fuzzy Syst., Barcelona Spain, pp. 1677–1682.
Google Scholar
Yager, R. R. 1997b. On the inclusion of importances in OWA aggregations. In The Ordered Weighted Averaging Operators: Theory and Application, R. R. Yager and J. Kacprzyk (Eds.). Kluwer, pp. 41–59.
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Engineer Research and Development Center, U.S. Army Corps of Engineers, Vicksburg, MS, USA
Peggy Wright
Department of Computer Science, Mississippi State University, Mississippi State, MS, USA
Julia Hodges

Authors

Peggy Wright
View author publications
You can also search for this author in PubMed Google Scholar
Julia Hodges
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Arizona State University, USA
Huan Liu
Osaka University, Japan
Hiroshi Motoda

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Wright, P., Hodges, J. (2001). KBIS: Using Domain Knowledge to Guide Instance Selection. In: Liu, H., Motoda, H. (eds) Instance Selection and Construction for Data Mining. The Springer International Series in Engineering and Computer Science, vol 608. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-3359-4_15

Download citation

DOI: https://doi.org/10.1007/978-1-4757-3359-4_15
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-4861-8
Online ISBN: 978-1-4757-3359-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

KBIS: Using Domain Knowledge to Guide Instance Selection

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

An Efficient Approach for Instance Selection

Information Selection and Data Compression RapidMiner Library

Instance Selection for the Nearest Neighbor Classifier: Connecting the Performance to the Underlying Data Structure

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

KBIS: Using Domain Knowledge to Guide Instance Selection

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

An Efficient Approach for Instance Selection

Information Selection and Data Compression RapidMiner Library

Instance Selection for the Nearest Neighbor Classifier: Connecting the Performance to the Underlying Data Structure

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation