Abstract
This work addresses the automatic generation of conceptual models for XML-oriented databases, which in many cases have little or no support for schemata. Our techniques are based on both an incremental clustering algorithm, which groups together the incoming XML documents according to their structural similarities, and a schema inference method, which maintains dynamically the schema of each detected document cluster. Our proposal takes into consideration the schema evolution. For this purpose, we have adapted the Toodor document model that describes the temporal properties of the XML document types.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Aramburu, M.J., Berlanga, R.: A temporal object-oriented model for digital libraries of documents. Concurrency: Practice and Experience 13(11) (2001)
Chamberlin, D., Robie, J., Florescu, D.: Quilt: An XML query language for heterogeneous data sources. In: Suciu, D., Vossen, G. (eds.) WebDB 2000. LNCS, vol. 1997, pp. 53–62. Springer, Heidelberg (2001)
Cluet, S., Veltri, P., Vodislav, D.: Views in a large scale XML repository. In: VLDB 2001, pp. 271–280 (2001)
Hélide.: The G Web Applications Platform (2002), http://www.helide.com
Mena, E., Illarramendi, A., Kashyap, V., Sheth, A.P.: OBSERVER: An approach for query processing in global information systems based on interoperation across pre-existing ontologies. Distributed and Parallel Databases 8(2), 223–271 (2000)
W3C Consortium. XML schema (2002), http://www.w3.org/XML/Schema
W3C Consortium. XQuery 1.0: An XML Query Language (2002), http://www.w3.org/xquery
Zhang, K., Shasha, D.: Simple fast algorithms for the editing distance between trees and related problems. SIAM Journal of Computing 18(6), 1245–1262 (1989)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sanz, I., Pérez, J.M., Berlanga, R., Aramburu, M.J. (2003). XML Schemata Inference and Evolution. In: Mařík, V., Retschitzegger, W., Štěpánková, O. (eds) Database and Expert Systems Applications. DEXA 2003. Lecture Notes in Computer Science, vol 2736. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45227-0_12
Download citation
DOI: https://doi.org/10.1007/978-3-540-45227-0_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40806-2
Online ISBN: 978-3-540-45227-0
eBook Packages: Springer Book Archive