Deriving High Confidence Rules from Spatial Data Using Peano Count Trees

Perrizo, William; Ding, Qin; Ding, Qiang; Roy, Amalendu

doi:10.1007/3-540-47714-4_9

William Perrizo⁷,
Qin Ding⁷,
Qiang Ding⁷ &
…
Amalendu Roy⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2118))

Included in the following conference series:

International Conference on Web-Age Information Management

350 Accesses
6 Citations

Abstract

The traditional task of association rule mining is to find all rules with high support and high confidence. In some applications, such as mining spatial datasets for natural resource location, the task is to find high confidence rules even though the support may be low. In still other applications, such as the identification of agricultural pest infestations, the task is to find high confidence rules preferably while the support is still very low. The basic Apriori algorithm cannot be used to solve these problems efficiently since it relies on first identifying all high support itemsets. In this paper, we propose a new model to derive high confidence rules for spatial data regardless of their support level. A new data structure, the Peano Count Tree (P-tree), is used in our model to represent all the information we need. P-trees represent spatial data bit-by-bit in a recursive quadrant-by-quadrant arrangement. Based on the P-tree, we build a special data cube, the Tuple Count Cube (T-cube), to derive high confidence rules. Our algorithm for deriving confident rules is fast and efficient. In addition, we discuss some strategies for avoiding over-fitting (removing redundant and misleading rules).

This work was partially supported by a U. S. - G. S. A. VAST grant. Patents are pending on the P-Tree Data Mining Technology.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Frequent Itemsets and Association Rules with a Certain Probability in Data Mining

Geospatial Dimension in Association Rule Mining: The Case Study of the Amazon Charcoal Tree

A novel linear assorted classification method based association rule mining with spatial data

Article 02 February 2021

References

R. Agrawal, T. Imielinski, A. Swami. Mining Association Rules Between Sets of Items in Large Database. ACM SIGMOD 1993.
Google Scholar
R. Agrawal, R. Srikant. Fast Algorithms for Mining Association Rules. VLDB 1994.
Google Scholar
R. Srikant, R. Agrawal. Mining Quantitative Association Rules in Large Relational Tables. ACM SIGMOD 1996.
Google Scholar
J. S. Park, M. Chen, P. S. Yu. An effective Hash-Based Algorithm for Mining Association Rules. ACM SIGMOD 1995.
Google Scholar
J. Han, J. Pei, Y. Yin. Mining Frequent Patterns without Candidate Generation. ACM SIGMOD 2000.
Google Scholar
R. J. Bayardo. Brute-Force Mining of High-Confidence Classification Rules. KDD 1997.
Google Scholar
E. Cohen, et al. Finding Interesting Associations without Support Pruning. VLDB 2000.
Google Scholar
K. Wang, S. Zhou, Y. He. Growing Decision Trees on Support-less Association Rules. KDD 2000.
Google Scholar
V. Gaede, O. Gunther. Multidimensional Access Methods. Computing Surveys, 30(2), 1998.
Google Scholar
H. Samet. The quadtree and related hierarchical data structure. ACM Computing Survey, 16, 2, 1984.
Google Scholar
H. Samet. Applications of Spatial Data Structures. Addison-Wesley, 1990.
Google Scholar
H. Samet. The Design and Analysis of Spatial Data Structures. Addison-Wesley, 1990.
Google Scholar
R. A. Finkel, J. L. Bentley. Quad trees: A data structure for retrieval of composite keys. Acta Informatica, 4, 1, 1974.
Google Scholar
HH-code. Available at http://www.statkart.no/nlhdb/iveher/hhtext.htm
B. Liu, W. Hsu, Y. Ma. Integrating classification and association rule mining. KDD 1998.
Google Scholar
J. Dong, W. Perrizo, Q. Ding and J. Zhou. The Application of Association Rule Mining on Remotely Sensed Data. ACM Symposium on Applied Computing, 2000.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, North Dakota State University, Fargo, ND, 58105-5164, USA
William Perrizo, Qin Ding, Qiang Ding & Amalendu Roy

Authors

William Perrizo
View author publications
You can also search for this author in PubMed Google Scholar
Qin Ding
View author publications
You can also search for this author in PubMed Google Scholar
Qiang Ding
View author publications
You can also search for this author in PubMed Google Scholar
Amalendu Roy
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Information and Software Engineering, George Mason University, Fairfax, VA, 22030-4444, USA
X. Sean Wang
Department of Computer Science and Engineering, Northeastern University, Shenyang, 110004, China
Ge Yu
Department of Computer Science, Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China
Hongjun Lu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Perrizo, W., Ding, Q., Ding, Q., Roy, A. (2001). Deriving High Confidence Rules from Spatial Data Using Peano Count Trees. In: Wang, X.S., Yu, G., Lu, H. (eds) Advances in Web-Age Information Management. WAIM 2001. Lecture Notes in Computer Science, vol 2118. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-47714-4_9

Download citation

DOI: https://doi.org/10.1007/3-540-47714-4_9
Published: 28 June 2001
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42298-3
Online ISBN: 978-3-540-47714-3
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Deriving High Confidence Rules from Spatial Data Using Peano Count Trees

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Frequent Itemsets and Association Rules with a Certain Probability in Data Mining

Geospatial Dimension in Association Rule Mining: The Case Study of the Amazon Charcoal Tree

A novel linear assorted classification method based association rule mining with spatial data

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Deriving High Confidence Rules from Spatial Data Using Peano Count Trees

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Frequent Itemsets and Association Rules with a Certain Probability in Data Mining

Geospatial Dimension in Association Rule Mining: The Case Study of the Amazon Charcoal Tree

A novel linear assorted classification method based association rule mining with spatial data

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation