Abstract
Functional dependencies are an integral part of database theory, and they form the basis for normalizing relational tables up to BCNF. With the increasing relevance of the data-centric aspects of XML, it is pertinent to study functional dependencies in the context of XML, which will form the basis for further studies into XML keys and normalization. In this work, we investigate the design of functional dependencies in XML databases. We propose FD_XML, a notation and DTD for representing functional dependencies in XML. We observe that many databases are hierarchical in nature and that the corresponding nested XML data may inevitably contain redundancy. We develop a model based on FD_XML to estimate the amount of data replication in XML data. We show how functional dependencies in XML can be verified with a single pass through the XML data, and present supporting experimental results. A platform-independent framework is also drawn up to demonstrate how the techniques proposed in this work can enrich the semantics of XML.
This work was done while the author was on a research scholarship from the National University of Singapore.
For this paper, XML data refers to data represented in XML. It is not to be confused with the W3C Note XML-Data.
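The single-pass verification mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm, and it does not use the FD_XML notation, which is not reproduced on this page. It assumes a hypothetical document of `supply` records and a hypothetical dependency {supplier, part} → price, and it streams the data once with Python's standard `xml.etree.ElementTree.iterparse`, remembering the first right-hand-side value seen for each left-hand-side combination.

```python
import io
import xml.etree.ElementTree as ET


def check_fd(xml_source, record_tag, lhs_tags, rhs_tag):
    """Check a functional dependency lhs_tags -> rhs_tag in one pass.

    Returns a list of (lhs_values, first_rhs, conflicting_rhs) tuples.
    Only direct children of each record element are inspected; all tag
    names are hypothetical and chosen for illustration.
    """
    seen = {}          # LHS value tuple -> first RHS value observed
    violations = []
    # "end" events fire once an element and all of its children are parsed.
    for _event, elem in ET.iterparse(xml_source, events=("end",)):
        if elem.tag != record_tag:
            continue
        lhs = tuple(elem.findtext(tag) for tag in lhs_tags)
        rhs = elem.findtext(rhs_tag)
        if lhs in seen and seen[lhs] != rhs:
            violations.append((lhs, seen[lhs], rhs))
        else:
            seen.setdefault(lhs, rhs)
        elem.clear()   # drop the record's children to limit memory use
    return violations


# Hypothetical sample data: the last two records disagree on the price
# of part P2 from supplier S1, which violates the assumed dependency.
SAMPLE = """<supplies>
  <supply><supplier>S1</supplier><part>P1</part><price>10</price></supply>
  <supply><supplier>S1</supplier><part>P1</part><price>10</price></supply>
  <supply><supplier>S1</supplier><part>P2</part><price>7</price></supply>
  <supply><supplier>S1</supplier><part>P2</part><price>9</price></supply>
</supplies>"""

violations = check_fd(
    io.BytesIO(SAMPLE.encode("utf-8")),
    record_tag="supply",
    lhs_tags=["supplier", "part"],
    rhs_tag="price",
)
print(violations)
```

The dictionary grows with the number of distinct left-hand-side combinations, so the sketch trades memory for a single scan. The duplicated (S1, P1, 10) record also shows the kind of replication in nested data that the abstract refers to.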
© 2002 Springer-Verlag Berlin Heidelberg
Lee, M.L., Ling, T.W., Low, W.L. (2002). Designing Functional Dependencies for XML. In: Jensen, C.S., et al. Advances in Database Technology — EDBT 2002. EDBT 2002. Lecture Notes in Computer Science, vol 2287. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45876-X_10
DOI: https://doi.org/10.1007/3-540-45876-X_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43324-8
Online ISBN: 978-3-540-45876-0
eBook Packages: Springer Book Archive