Abstract
In this paper, we propose an extensible architecture for database engines where satellite databases are used to scale out and implement additional functionality for a centralized database engine. The architecture uses a middleware layer that offers consistent views and a single system image over a cluster of machines with database engines. One of these engines acts as a master copy while the others are read-only snapshots which we call satellites. The satellites are lightweight DBMSs used for scalability and to provide functionality difficult or expensive to implement in the main engine. Our approach also supports the dynamic creation of satellites to be able to autonomously adapt to varying loads. The paper presents the architecture, discusses the research problems it raises, and validates its feasibility with extensive experimental results.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Altinel, M., Bornhövd, C., Krishnamurthy, S., Mohan, C., Pirahesh, H., Reinwald, B.: Cache tables: paving the way for an adaptive database cache. In: Proceedings of the 29th International Conference on Very Large Data Bases. Berlin, Germany, September 9–12, 2003
Amir, Y., Tutu, C.: From Total Order to Database Replication. Technical report, CNDS (2002). http://www.citeseer. nj.nec.com/amir02from.html
Amir, Y., Moser, L.E., Melliar-Smith, P.M., Agarwal, D.A., Ciarfella, P.: The totem single-ring ordering and membership protocol. ACM Trans. Comput. Syst. 13(4), 311–342 (1995). http://www.citeseer.nj.nec.com/amir95totem.html
Amiri, K., Park, S., Tewari, R., Padmanabhan, S.: DBProxy: A dynamic data cache for web applications. In: Proceedings of the 19th International Conference on Data Engineering, Bangalore, India, 5–8, March 2003
Amza, C., Cox, A.L., Zwaenepoel, W.: A comparative evaluation of transparent scaling techniques for dynamic content servers. In: ICDE ’05: Proceedings of the 21st International Conference on Data Engineering (ICDE’05), pp. 230–241 (2005)
Amza, C., Cox, A.L., Zwaenepoel, W.: Distributed versioning: Consistent replication for scaling back-end databases of dynamic content web sites. In: Middleware 2003, ACM/IFIP/USENIX International Middleware Conference, Rio de Janeiro, Brazil, June 16–20, Proceedings (2003)
Berenson, H., Bernstein, P., Gray, J., Melton, J., O’Neil, E., O’Neil, P.: A critique of ANSI SQL isolation levels. In: Proceedings of the SIGMOD International Conference on Management of Data, pp. 1–10 (1995)
Bernstein P.A., Hadzilacos V., Goodman N. (1987). Concurrency control and recovery in database systems. Addison-Wesley, Reading
Borzsonyi, S., Kossmann, D., Stocker, K.: The skyline operator. In: IEEE Conference on Data Engineering, pp 421–430. Heidelberg, Germany (2001)
Cecchet, E.: C-JDBC: a middleware framework for database clustering. IEEE Data Engineering Bulletin, vol. 27, no. 2 (2004)
Chomicki, J., Godfrey, P., Gryz, J., Liang, D.: Skyline with presorting. In: ICDE, pp 717–816 (2003)
Council, T.T.P.P.: TPC-W, a transactional web e-commerce benchmark. http://www.tpc.org/tpcw/
Daffodil Replicator: http://sourceforge.net/projects/daffodilreplica
Daudjee, K., Salem, K.: Lazy database replication with ordering guarantees. In: Proceedings of the 20th international conference on data engineering (ICDE 2004), Boston, MA, USA, pp 424–435, 30 March – 2 April 2004
Fekete, A.D.: Serialisability and snapshot isolation. In: Proceedings of the Australian Database Conference, pp. 210–210 (1999)
Fekete, A., Liarokapis, D., O’Neil, E., O’Neil, P., Sasha, D.: Making snapshot isolation serializable. http://www.cs.umb.edu/isotest/snaptest/snaptest.pdf
Härder, T., Bühmann, A.: Query processing in constraint-based database caches. IEEE Data Engineering Bulletin, vol. 27, no. 2 (2004)
Jiménez-Peris, R., Patiño-Martínez, M., Kemme, B., Alonso, G.: Improving the scalability of fault-tolerant database clusters. In: IEEE 22nd International Conference on Distributed Computing Systems, ICDCS’02, Vienna, Austria, pp. 477–484 (2002)
Kemme, B.: Database replication for clusters of workstations. Ph.D. thesis, Diss. ETH No. 13864, Department of Computer Science, Swiss Federal Institute of Technology Zurich (2000)
Kemme, B.: Implementing database replication based on group communication. In: Proceedings of the International Workshop on Future Directions in Distributed Computing (FuDiCo 2002), Bertinoro, Italy (2002). http://www.citeseer. nj.nec.com/pu91replica.html
Kemme, B., Alonso, G.: Don’t be lazy, be consistent: Postgres-R, a new way to implement database replication. In: Proceedings of the 26th International Conference on Very Large Databases, 2000
Larson, P.A., Goldstein, J., Zhou, J.: Transparent mid-tier database caching in SQL server. In: Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, pp 661–661. ACM Press, (2003). DOI http://www.doi.acm.org/10.1145/872757.872848
Lin, Y., Kemme, B., Patiño-Martínez, M., Jiménez-Peris,~R.: Middleware based data replication providing snapshot isolation. In: SIGMOD’05: Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data, pp.~419–430 (2005)
Lipasti, M.H.: Java TPC-W Implementation Distribution of Prof. Lipasti’s Fall 1999 ECE 902 Course. http://www.ece.wisc. edu/pharm/tpcw.shtml
Microsoft SQL Server 2005: http://www.microsoft.com/sql/ 2005
Milan-Franco, J.M., Jiménez-Peris, R., Patiño-Martínez, M., Kemme, B.: Adaptive distributed middleware for data replication. In: Middleware 2004, ACM/IFIP/USENIX 5th International Middleware Conference, Toronto, Canada, October 18–22, Proceedings (2004)
Oracle Database Documentation Library: Oracle Streams Concepts and Administration, Oracle Database Advanced Replication, 10g Release 1 (10.1) (2003). http://www.otn. oracle.com
Plattner, C., Alonso, G.: Ganymed: Scalable replication for transactional web applications. In: Middleware 2004, 5th ACM/IFIP/USENIX International Middleware Conference, Toronto, Canada, October 18–22, Proceedings (2004)
PostgreSQL Global Development Group. http://www.postgresql.org
Project, T.E.O.: Octopus, a simple Java-based Extraction, Transformation, and Loading (ETL) tool. http://www.octopus. objectweb.org/
Röhm, U., Böhm, K., Schek, H., Schuldt, H.: FAS – A freshness-sensitive coordination middleware for a cluster of OLAP components. In: Proceedings of the 28th International Conference on Very Large Data Bases (VLDB 2002), Hong Kong, China, pp. 754–765 (2002)
Schenkel, R., Weikum, G.: Integrating snapshot isolation into transactional federation. In: Cooperative Information Systems, 7th International Conference, CoopIS 2000, Eilat, Israel, Proceedings, 6–8 September 2000
Stolte, E., von Praun, C., Alonso, G., Gross, T.R.: Scientific data repositories: designing for a moving target. In: Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, pp. 349–360 (2003)
Weikum G., Vossen G. (2002). Transactional information systems. Morgan Kaufmann, San Francisco
Wu, S., Kemme, B.: Postgres-R(SI): combining replica control with concurrency control based on snapshot isolation. In: ICDE ’05: Proceedings of the 21st International Conference on Data Engineering (ICDE’05), pp. 422–433
Author information
Authors and Affiliations
Corresponding author
Additional information
Part of this work was done while Tamer Özsu was a visiting guest at ETH Zurich.
Rights and permissions
About this article
Cite this article
Plattner, C., Alonso, G. & Özsu, M.T. Extending DBMSs with satellite databases. The VLDB Journal 17, 657–682 (2008). https://doi.org/10.1007/s00778-006-0026-x
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00778-006-0026-x