Abstract
Using database technology for the administration of digital libraries offers many advantages in a multi-user and distributed environment. However, conventional DBMS are not particularly suited to manage semi-structured data with heterogeneous, irregular, evolving structures as in the case of SGML documents found in digital libraries. To overcome the difficulties imposed by the rigid schema of conventional systems, several schema-less approaches have been proposed. Using instead unconstrained, extensible schemata offered by object-oriented semantic network systems, we are able both to map document specific structures as database classes, and to model the associated constraint information as integrated schema annotations. In this paper we present the benefits of this approach to create, access and process heterogeneous SGML documents, and in particular to exploit the shared semantics of evolving SGML structures. A respective application is currently being implemented in the context of the AQUARELLE project.
Work partially supported by European TELEMATICS Project AQUARELLE.
Preview
Unable to display preview. Download preview PDF.
References
Description de l'Architecture Générale du Projet GEODOC. Technical report, Grif S.A., 78053 St Quentin en Yvelines Cedex, December 1993.
The Extensible Markup Language. Internet Draft, 1997.Availiable at http://www.jtauber.com/xml/.
S. Abiteboul. Querying Semi-Structured Data. In Foto Afrati and Phokion Kolaitis, editors, Database Theory-ICDT'97, volume LNCS 1186 of Lecture Notes in Computer Science, pages 1–18, Delphes, Greece, January 1997. Springer Verlag.
S. Abiteboul, D. Quass, J. McHugh, J. Widom, and J. Wiener. The Lorel Query Language for Semi-Structured Data. Journal of Digital Libraries, 1(1):68–88, November 1997.
A. Analyti, P. Constantopoulos, and N. Spyratos. On the Definition of Semantic Networks Semantics. Technical Report ICS/TR-187, Institute Of Computer Science-FORTH, February 1997. Available at http://www.ics.forth.gr/proj/isst /Publications/TechnicalReports.html.
T. Arnold-Moore, M. Fuller, B. Lowe, J. Thom, and R. Wilkinson. The ELF Data Model and SGQL Query Language for Structured Documents. In Proc. of the Australian Database Conference, pages 17–26, Adelaid, Australia, January 1995.
D. Barnard, L. Burnard, and C. M. Sperberg-McQueen. Lessons Learned From Using sgml in the Text Encoding Initiative. Computer & Interface, 18:3–10, 1996.
L. Bielawski and J. Boyle. Electronic Document Management Systems: A User Centered Approach for Creating, Distributing and Managing Online Publications. Prentice Hall, 1997.
G. Blake, M. Consens, P. Kilpelainen, and P. Larson. Text/Relational Database Management Systems: Harmonizing SQL and SGML. In ADBA'94, pages 267–280, 1994.
K. Böhm, K. Aberer, and E. Neuhold. Administering Structured Documents in Digital Libraries. In Digital Libraries-Current Issues, DL'94, Newark, NJ, USA, 1995. LNCS 916, Springer Verlang.
M. W. Bright, A. R. Hurson, and S. H. Pakzad. A Taxonomy and Current Issues in Multidatabase Systems. IEEE Computer, 25(3):50–59, March 1992.
P. Buneman, S. Davidson, G. Hillebrand, and D. Sucie. A Query Language and Optimization Techniques for Unstructured Data. In SIGMOD'96, pages 505–516, Montreal, Quebec, Canada, June 1996.
OCLC Online Computer Library Center. Fred: The SGML Grammar Builder. Available at “http://www.ocle.org:80/fred/”, 1995.
V. Christophides, S. Abiteboul, S. Cluet, and M. Scholl. From Structured Documents to Novel Query Facilities. In SIGMOD'94, pages 313–324, Minneapolis, Minnesota, USA, May 1994.
V. Christophides, S. Cluet, and G. Moerkotte. Evaluating Queries with Generalized Path Expressions. In SIGMOD'96, pages 413–422, Montreal, Quebec, Canada, June 1996.
V. Christophides and A. Rizk. Querying Structured Documents with Hypertext Links using OODBMS. In ECHT'94, pages 186–197, Edinburgh, United Kingdom, September 1994. ACM.
P. Constantopoulos. Cultural Documentation: The CLIO System. Technical Report 115, Institute of Computer Science, FORTH, January 1994.
L. Elasry.SGML-DBOO Stockage et Manipulation de Documents Structurés. Master's thesis, Université SORBONE, September 1992.
Euroclid. Le Parseur SGML d'Euroclid. Internal document, Euroclid, 12, Avenue des Prés 78180 Montigny le Bretonneux, 1991.
P. Francois. Generalized SGML Repositories: Requirements and Modelling. Computer Standards & Interfaces, 18:11–24, 1996.
P. Futtersack and Q.N. Vuong. Modélisation et Stockage de Documents SGML. Collection de notes internes de la Direction des Études et Recherches 95N000039, EDF-DER, Service IPN. Département SID. 1 Av. du Général-de-Gaulle, 92141 Clamart Cedex, 1995.
C. Goldfarb. The SGML Handbook. Clarendon Press, Oxford, 1990.
R. Goldman and J. Widom. DataGuides: Enabling Query Formulation and Optimization in Semi-Structured databases. Stanford Technical Report, 1997.
Institute Of Computer Science (FORTH)-Hellas. SIS-Semantic Index System, version 2.1 edition, May 1997.
ISO. Information Processing-Text and Office Systems-Standard Generalized Markup Language (SGML). ISO 8879, 1986.
ISO/IEC. Information Technology-Hypermedia/Time-based Structuring Language (HyTime). ISO/IEC 10744, 1992.
P. Kilpeläinen and D. Wood. Exceptions in SGML Document Grammars. Submitted for publication, 1995.
R. Light. Getting a handle on Exhibition Catalogues: the Project OHIO DTD. Available at “http://www.cimi.org/cimi”, Consortium for Interchange of Museum Information, 1995.
J. Le Maitre, E. Murisasco, and M. Rolbert. SmlgQL un Langage d'Interrogation de Documents SGML. In BDA'95, pages 431–446, Nancy, France, August 1995.
J. Mylopoulos, A. Borgida, M. Jarke, and M. Koubarakis. Telos: Representing knowledge about Information Systems. ACM Transactions on Information Systems, 8(4), October 1990.
A. Nica and E. A. Rundensteiner. Uniform Structured Document Handling using a Constraint-based Object Approach. In Digital Libraries: Research and Technology Advances, ADL'95 Forum, pages 83–101, McLean, Virginia, USA, May 1996. LNCS 1082, Springer-Verlag.
D. Raggett.HyperText Markup Language Specification Version 3.0. Internet Draft, March 1995. Avaliable at http://www.w3.org/hypertext/WWW/MarkUp/html3/CoverPage.html.
A. Ramfos, N.J. Fiddian, and W.A. Gray. Object-oriented to relational interschema meta-translation. In Workshop on heterogeneous databases, December 1989.
D. Raymond, F. Tompa, and D. Wood. From Data Representation to Data Model: Meta-semantics Issues in the Evolution of SGML. Computer & Interface, 18:25–36, 1996.
A. Rizk, F. Malézieux, and M. Scholl.Analyse des éléments du Système d'Information: Définition SGML de la Struture des Dossiers de l'Inventaire. Convention de recherche n 295b212 0016008011, Euroclid, 1996.
J. F. Roddick. A Survey of Schema Versioning Issues for Database Systems. Information and Software Technology, 37(7):383–393, 1995.
R. Sacks-Davis, W. Wen, A. Kent, and K. Ramamohanarao. Complex Object Support for a Document Database System. In Thirteenth Australian Computer Science Conference, pages 322–333, Victoria, Australia, 1990. Monash University.
A. P. Sheth and J. A. Larson. Federated Database Systems for Managing Distributed Heterogeneous, and Autonomous Databases. ACM Computing Surveys, 22(3):183–236, September 1990.
J. Warmer and S. Egmond. The Implementation of the Amsterdam SGML Parser. Electronic Publishing, 2(2):65–90, July 1989.
D. Wood. Standard Generalized Markup Language: Mathematical and Philosophical Issues. In Computer Science Today: Recent Trends and Developments. LNCS 1000, 1995.
R. Zicari. A Framework for Schema Updates in an Object-Oriented Database system. In IEEE Data Engineering Conference, Kobe, Japan, 1991.
J. Zobel, J. A. Thom, and R. Sacks-Davis. Efficiency of Nesting Relational Document Database Systems. In VLDB'91, pages 91–102, Barcelona, Catalonia, Spain, September 1991.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Christophides, V., Dörr, M., Fundulaki, I. (1997). A semantic network approach to semi-structured documents repositories. In: Peters, C., Thanos, C. (eds) Research and Advanced Technology for Digital Libraries. ECDL 1997. Lecture Notes in Computer Science, vol 1324. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0026735
Download citation
DOI: https://doi.org/10.1007/BFb0026735
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63554-3
Online ISBN: 978-3-540-69597-4
eBook Packages: Springer Book Archive