Abstract
The role of data warehouse for business analytics cannot be undermined for any enterprise, irrespective of its size. But the growing dependence on web services has resulted in a situation where the enterprise data is managed by multiple, autonomous service providers. The goal of our work is to investigate and devise an approach to address the trade-off between scalability and adaptability in large scale integration with numerous ever-evolving web services. We present our prototype DaWeS (Data warehouse fed with Web Services) and explore how ETL using the mediation approach benefits this trade-off for enterprises with complex data warehousing requirements. The semantic web research community has proposed various standards like WSDL, WADL, hRESTS, SAWSDL for describing web service interface (API) the usage of which could have solved our requirement of automated integration. DaWeS looks to fill the current gap between the industry and research community by taking into account the key characteristics of the aforementioned description languages and using a declarative approach in order to reduce the manual effort. We also present to the semantic web research community the optimization heuristics (to reduce the API operation calls) and semantic challenges (auto-adaptability especially in the wake of an API change) devised while building DaWeS.
Chapter PDF
Similar content being viewed by others
Keywords
References
Benslimane, D., Dustdar, S., Sheth, A.P.: Services mashups: The new generation of web applications. IEEE Internet Computing 12(5), 13–15 (2008)
Bray, T., Paoli, J., Sperberg-McQueen, C.M., Maler, E., Yergeau, F.: Extensible markup language (xml). World Wide Web Journal 2(4), 27–66 (1997)
Calì, A., Calvanese, D., Lenzerini, M.: Data integration under integrity constraints. In: Seminal Contributions to Information Systems Engineering, pp. 335–352. Springer (2013)
Calì, A., Calvanese, D., Martinenghi, D.: Dynamic query optimization under access limitations and dependencies. J. UCS 15(1), 33–62 (2009)
Chawathe, S., Garcia-Molina, H., Hammer, J., Ireland, K., Papakonstantinou, Y., Ullman, J., Widom, J.: The tsimmis project: Integration of heterogeneous information sources. In: Proceedings of IPSJ Conference, pp. 7–18 (1994)
Coalition, D., Ankolekar, A., Burstein, M., Hobbs, J.R., Lassila, O., Martin, D., Mcdermott, D., Narayanan, S., Mcilraith, S.A., Paolucci, M., Payne, T., Sycara, K.: Daml-s: Web service description for the semantic web (2002)
Duschka, O.M., Genesereth, M.R.: Answering recursive queries using views. In: PODS, pp. 109–116 (1997)
Duschka, O.M., Genesereth, M.R., Levy, A.Y.: Recursive query plans for data integration. J. Log. Program. 43(1), 49–73 (2000)
Dustdar, S., Schreiner, W.: A survey on web services composition. International Journal on Web and Grid Services 1(1), 1–30 (2005)
Friedman, M., Levy, A.Y., Millstein, T.D.: Navigational plans for data integration. In: AAAI/IAAI, pp. 67–73. AAAI Press / The MIT Press (1999)
Gottlob, G., Schwentick, T.: Rewriting ontological queries into small nonrecursive datalog programs. In: KR. AAAI Press (2012)
Grahne, G., Kiricenko, V.: Towards an algebraic theory of information integration. Inf. Comput. 194(2), 79–100 (2004)
Group, S.W.: Semantic annotations for wsdl, w3c working draft. The World Wide Web Consortium, W3C (2006)
Hansen, M., Madnick, S.E., Siegel, M.: Data integration using web services. In: DIWeb, pp. 3–16. University of Toronto Press (2002)
IRIS: Integrated Rule Inference System - API and User Guide (2008), http://www.iris-reasoner.org/pages/user_guide.pdf
Kay, M.: Xsl transformations (xslt) version 2.0. W3C Recommendation 23 (2007)
Kopecký, J., Gomadam, K., Vitvar, T.: hrests: An html microformat for describing restful web services. In: WI-IAT 2008. IEEE Computer Society Press (2008)
Martin, D., Burstein, M., Hobbs, J., Lassila, O., McDermott, D., McIlraith, S., Narayanan, S., Paolucci, M., Parsia, B., Payne, T.: Owl-s: Semantic markup for web services. W3C Member Submission 22, 2007–4 (2004)
Noy, N.F.: Semantic integration: A survey of ontology-based approaches. SIGMOD Record 33 2004 (2004)
Roth, M.T., Schwarz, P.M.: Don’t scrap it, wrap it! a wrapper architecture for legacy data sources. In: VLDB 1997, pp. 266–275 (1997)
Truong, H.L., Dustdar, S.: On analyzing and specifying concerns for data as a service. In: Kirchberg, M., Hung, P.C.K., Carminati, B., Chi, C.H., Kanagasabai, R., Valle, E.D., Lan, K.C., Chen, L.J. (eds.) APSCC, pp. 87–94. IEEE (2009)
Ullman, J.D.: Information integration using logical views. Theor. Comput. Sci. 239(2), 189–210 (2000)
Vassiliadis, P., Simitsis, A.: Extraction, transformation, and loading. In: Encyclopedia of Database Systems, pp. 1095–1101. Springer (2009)
W3C: Web Service Description Language 1.1 (2001), http://www.w3.org/TR/wsdl
Wiederhold, G.: Mediators in the architecture of future information systems. Computer 25(3), 38–49 (1992)
Zhu, F., Turner, M., Kotsiopoulos, I.A., Bennett, K.H., Russell, M., Budgen, D., Brereton, P., Keane, J.A., Layzell, P.J., Rigby, M., Xu, J.: Dynamic data integration using web services. In: ICWS. IEEE Computer Society (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Samuel, J. (2014). Towards a Data Warehouse Fed with Web Services. In: Presutti, V., d’Amato, C., Gandon, F., d’Aquin, M., Staab, S., Tordai, A. (eds) The Semantic Web: Trends and Challenges. ESWC 2014. Lecture Notes in Computer Science, vol 8465. Springer, Cham. https://doi.org/10.1007/978-3-319-07443-6_61
Download citation
DOI: https://doi.org/10.1007/978-3-319-07443-6_61
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07442-9
Online ISBN: 978-3-319-07443-6
eBook Packages: Computer ScienceComputer Science (R0)