Mining Association Rules from XML Data

Braga, Daniele; Campi, Alessandro; Klemettinen, Mika; Lanzi, PierLuca

doi:10.1007/3-540-46145-0_3

Daniele Braga⁷,
Alessandro Campi⁷,
Mika Klemettinen⁸ &
…
PierLuca Lanzi⁹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2454))

Included in the following conference series:

International Conference on Data Warehousing and Knowledge Discovery

1301 Accesses
16 Citations

Abstract

The eXtensible Markup Language (XML) rapidly emerged as a standard for representing and exchanging information. The fastgrowing amount of available XML data sets a pressing need for languages and tools to manage collections of XML documents, as well as to mine interesting information out of them. Although the data mining community has not yet rushed into the use of XML, there have been some proposals to exploit XML. However, in practice these proposals mainly rely on more or less traditional relational databases with an XML interface. In this paper, we introduce association rules from native XML documents and discuss the new challenges and opportunities that this topic sets to the data mining community. More specifically, we introduce an extension of XQuery for mining association rules. This extension is used throughout the paper to better define association rule mining within XML and to emphasize its implications in the XML context.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Clustering XML Documents Using Frequent Edge-Sets

Towards Linked Open Data Enabled Data Mining

Discovering Contextual Association Rules in Relational Databases

References

Rakesh Agrawal, Tomasz Imielinski, and Arun Swami. Mining association rules between sets of items in large databases. In P. Buneman and S. Jajodia, editors, SIGMOD93, pages 207–216, Washington, D.C., USA, May 1993.
Google Scholar
Rakesh Agrawal, Tomasz Imielinski, and Arun N. Swami. Mining association rules between sets of items in large databases. In Peter Buneman and Sushil Jajodia, editors, Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, pages 207–216, Washington, D.C., 26-28 1993.
Google Scholar
Rakesh Agrawal and Ramakrishnan Srikant. Mining sequential patterns. In Philip S. Yu and Arbee L. P. Chen, editors, Proc. 11th Int. Conf. Data Engineering, ICDE, pages 3–14. IEEE Press, 6-10 1995.
Google Scholar
Helena Ahonen, Oskari Heinonen, Mika Klemettinen, and A. Inkeri Verkamo. Mining in the phrasal frontier. In Principles of Data Mining and Knowledge Discovery, pages 343–350, 1997.
Google Scholar
J. Han and Y. Fu. Discovery of multiple-level association rules from large databases. In Proc. of 1995 Int’l Conf. on Very Large Data Bases (VLDB’95), Zürich, Switzerland, September 1995, pages 420–431, 1995.
Google Scholar
Jiawei Han and Micheline Kamber. Data Mining Concepts and Techniques. Morgan Kaufmann, San Francisco (CA).
Google Scholar
Tomasz Imielinski and Aashu Virmani. MSQL: A query language for database mining, 1999.
Google Scholar
Brian Lent, Arun N. Swami, and Jennifer Widom. Clustering association rules. In ICDE, pages 220–231, 1997.
Google Scholar
Heikki Mannila, Hannu Toivonen, and et al. Discovering frequent episodes in sequences (extended abstract), August 1995.
Google Scholar
Heikki Mannila, Hannu Toivonen, and A. Inkeri Verkamo. Efficient algorithms for discovering association rules. In Usama M. Fayyad and Ramasamy Uthurusamy, editors, AAAI Workshop on Knowledge Discovery in Databases (KDD-94), pages 181–192, Seattle, Washington, 1994. AAAI Press.
Google Scholar
Rosa Meo, Giuseppe Psaila, and Stefano Ceri. A new sql-like operator for mining association rules. In VLDB'96, September 3–6, 1996, Mumbai (Bombay), India, pages 122–133.
Google Scholar
Rosa Meo, Giuseppe Psaila, and Stefano Ceri. A tightly-coupled architecture for data mining. In ICDE, pages 316–323, Orlando, Florida, USA, February 1998.
Google Scholar
M. Rajman and R. Besanon. Text mining: Natural language techniques and text mining applications, 1997.
Google Scholar
Lisa Singh, Peter Scheuermann, and Bin Chen. Generating association rules from semi-structured documents using an extended concept hierarchy. In CIKM, pages 193–200, 1997.
Google Scholar
Ramakrishnan Srikant and Rakesh Agrawal. Mining generalized association rules. In The VLDB Journal, pages 407–419, 1995.
Google Scholar
Ramakrishnan Srikant and Rakesh Agrawal. Mining quantitative association rules in large relational tables. In H. V. Jagadish and Inderpal Singh Mumick, editors, Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data, pages 1–12, Montreal, Quebec, Canada, 4-6 1996.
Google Scholar
World Wide Web Consortium. Extensible Markup Language (XML) Version 1.0 W3C Recommendation. http://www.w3c.org/xml/, February 1998.
World Wide Web Consortium. XML Path Language (XPath) Version 1.0, W3C Recommendation. http://www.w3c.org/tr/xpath/, November 1999.
World Wide Web Consortium. XQuery 1.0: An XML Query Language W3C Working Draft. http://www.w3.org/TR/2001/WD-xquery-20010607, June 2001.

Download references

Author information

Authors and Affiliations

Dipartimento di Elettronica e Informazione, Politecnico di Milano, P.za L. da Vinci 32, I-20133, Milano, Italy
Daniele Braga & Alessandro Campi
Nokia Group, Nokia Research Center, P.O.Box 407, FIN-00045, Finland
Mika Klemettinen
Dipartimento di Elettronica e Informazione, Artificial Intelligence and Robotic Laboratory Politecnico di Milano, Italy
PierLuca Lanzi

Authors

Daniele Braga
View author publications
You can also search for this author in PubMed Google Scholar
Alessandro Campi
View author publications
You can also search for this author in PubMed Google Scholar
Mika Klemettinen
View author publications
You can also search for this author in PubMed Google Scholar
PierLuca Lanzi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Graduate School of Informatics, Kyoto University, Yoshida-Honmachi, Sakyo-ku, 606-8501, Kyoto, Japan
Yahiko Kambayashi
Institute for Computer Science and Business Informatics, University of Vienna, Liebiggasse 4, 1010, Vienna, Austria
Werner Winiwarter
Center for Spatial Information Science (CSIS), University of Tokyo, 4-6-1, Komaba, Meguro-ku, 153-8904, Tokyo, Japan
Masatoshi Arikawa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Braga, D., Campi, A., Klemettinen, M., Lanzi, P. (2002). Mining Association Rules from XML Data. In: Kambayashi, Y., Winiwarter, W., Arikawa, M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2002. Lecture Notes in Computer Science, vol 2454. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46145-0_3

Download citation

DOI: https://doi.org/10.1007/3-540-46145-0_3
Published: 02 September 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44123-6
Online ISBN: 978-3-540-46145-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Mining Association Rules from XML Data

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Clustering XML Documents Using Frequent Edge-Sets

Towards Linked Open Data Enabled Data Mining

Discovering Contextual Association Rules in Relational Databases

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Mining Association Rules from XML Data

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Clustering XML Documents Using Frequent Edge-Sets

Towards Linked Open Data Enabled Data Mining

Discovering Contextual Association Rules in Relational Databases

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation