Abstract
The following paper describes some common aspects of stream data processing systems. The paper consists of two main parts – first showing the short description, tests results and conclusions of an implemented system – the AGKPStream, while the second part focuses on proposed solutions, created upon experiences gained during development of mentioned system, as well as knowledge collected during learning about some concepts of a StreamAPAS system. The first discussed issue is a tuple construction – basic data representation. It concerns tuple time model, tuple schema and a tuple decorator. Afterwards, the stream query and scheduling problems are described.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
References
Abhirup, C., Ajit, S.: A Partition-based Approach to Support Streaming Updates over Persistent Data in an Active Data Warehouse. In: Proceedings of the 2009 IEEE International Symposium on Parallel & Distributed Processing, IPDPS 2009, pp. 1–11. IEEE Computer Society, Washington, DC (2009)
Gorawski, M.: Extended Cascaded Star Schema and ECOLAP Operations for Spatial Data Warehouse. In: Corchado, E., Yin, H. (eds.) IDEAL 2009. LNCS, vol. 5788, pp. 251–259. Springer, Heidelberg (2009)
Gorawski, M.: Time complexity of page filling algorithms in Materialized Aggregate List (MAL) and MAL/TRIGG materialization cost. Control and Cybernetics 38(1), 153–172 (2009)
Gorawski, M., Gorawski, M.: Balanced spatio-temporal data warehouse with RMVB, STCAT and BITMAP indexes. In: PARELEC 2006: International Symposium On Parallel Computing In Electrical Engineering, pp. 43–48 (2006)
Gorawski, M., Malczok, R.: Indexing Spatial Objects in Stream Data Warehouse. In: Nguyen, N.T., Katarzyniak, R., Chen, S.-M. (eds.) Advances in Intelligent Information and Database Systems. SCI, vol. 283, pp. 53–65. Springer, Heidelberg (2010)
Gorawski, M., Marks, P.: Checkpoint-based resumption in data warehouses. In: Software Engineering Techniques: Design for Quality. IFIP, vol. 227, pp. 313–323. Springer, US (2006)
Gorawski, M., Marks, P.: Resumption of data extraction process in parallel data warehouses. In: Wyrzykowski, R., Dongarra, J., Meyer, N., Waśniewski, J. (eds.) PPAM 2005. LNCS, vol. 3911, pp. 478–485. Springer, Heidelberg (2006)
Gorawski, M., Morzy, T., Wrembel, R.: Special Issue on: Techniques of Advanced Data Processing and Analysis Introduction. Control and Cybernetics 38(1) (2009)
Kozielski, S., Wrembel, R. (eds.): New Trends in Data Warehousing and Data Analysis. Annals of Information Systems, vol. 3. Springer, US (2009)
Morzy, T.: Extraction, Transformation, and Loading Processes. In: Data Warehouses and Olap: Concepts, Architectures and Solutions, pp. 88–110 (2007)
Brian, B., Shivnath, B., Mayur, D., Rajeev, M., Dilys, T.: Operator scheduling in data stream systems. VLDB J. 13(4), 333–353 (2004)
Gorawski, M.: Advanced Data Warehouses. Habilitation, Studia Informatica 30(3B). Pub. House of Silesian Univ. of Technology (2009)
Gorawski, M., Chrószcz, A.: Synchronization Modeling in Stream Processing. In: Morzy, T., Härder, T., Wrembel, R. (eds.) Advances in Databases and Information Systems. AISC, vol. 186, pp. 91–102. Springer, Heidelberg (2013)
Gorawski, M., Malczok, R.: Towards stream data parallel processing in spatial aggregating index. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Wasniewski, J. (eds.) PPAM 2007. LNCS, vol. 4967, pp. 209–218. Springer, Heidelberg (2008)
Gorawski, M., Malczok, R.: Answering Range-Aggregate Queries over Objects Generating Data Streams. In: Kitagawa, H., Ishikawa, Y., Li, Q., Watanabe, C. (eds.) DASFAA 2010. LNCS, vol. 5982, pp. 436–439. Springer, Heidelberg (2010)
Gorawski, M., Marks, P.: Distributed stream processing analysis in high availability context. In: Proceedings of the Second International Conference on Availability, Reliability and Security, ARES, pp. 61–68 (2007)
Roger, S.B., Jonathan, G., Mohamed, H.A., Hong, M.: Consistent Streaming Through Time: A Vision for Event Stream Processing. In: Third Biennial Conference on Innovative Data Systems Research, CIDR 2007, Asilomar, CA, USA (2007)
Gorawski, M.: Architecture of Parallel Spatial Data Warehouse: Balancing Algorithm and Resumption of Data Extraction. In: Proceedings of the 2005 conference on Software Engineering: Evolution and Emerging Technologies, pp. 49–59. IOS Press, Amsterdam (2005)
Gorawski, M., Chroszcz, A.: Optimization of operator partitions in stream data warehouse. In: Proceedings of the ACM 14th international workshop on Data Warehousing and OLAP, pp. 61–66. ACM, New York (2011)
Gorawski, M., Gorawski, M.: Modified R-MVB tree and BTV algorithm used in a distributed spatio-temporal data warehouse. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Wasniewski, J. (eds.) PPAM 2007. LNCS, vol. 4967, pp. 199–208. Springer, Heidelberg (2008)
Gorawski, M., Marks, P.: Towards reliability and fault-tolerance of distributed stream processing system. In: DEPCOS-RELCOMEX 2007 International Conference on Dependability of Computer Systems, pp. 246–253. IEEE Computer Society, Washington, DC (2007)
Gorawski, M., Marks, P., Gorawski, M.: Collecting data streams from a distributed radio-based measurement system. In: Haritsa, J.R., Kotagiri, R., Pudi, V. (eds.) DASFAA 2008. LNCS, vol. 4947, pp. 702–705. Springer, Heidelberg (2008)
Waas, F., Wrembel, R., Freudenreich, T., Theile, M., Koncilia, C., Furtado, P.: On-Demand ELT Architecture for Right-Time BI: Extending the Vision. International Journal on Data Warehousing and Mining (to appear, 2013)
Wrembel, R.: A Survey of Managing the Evolution of Data Warehouses. IJDWM 5(2), 24–56 (2009)
Gorawski, M., Chroszcz, A.: StreamAPAS: Query Language and Data Model. In: Proceedings of the Third International Conference of Complex, Intelligent and Software Intensive Systems, CISIS 2009, pp. 75–82. Springer, Heidelberg (2009)
Gorawski, M., Chrószcz, A.: Query Processing Using Negative and Temporal Tuples in Stream Query Engines. In: Szmuc, T., Szpyrka, M., Zendulka, J. (eds.) CEE-SET 2009. LNCS, vol. 7054, pp. 70–83. Springer, Heidelberg (2012)
Mohamed, A.S., Panos, K.C., Alexandros, L., Kirk, P.: Efficient scheduling of heterogeneous continuous queries. In: Proceedings of the 32nd International Conference on Very Large Data Bases, VLDB 2006, pp. 511–522. Endowment (2006)
Timothy, M.S., Bradford, P., Zhu, Y., Luping, D., Elke, A.R.: An Adaptive Multi-Objective Scheduling Selection Framework for Continuous Query Processing. In: Proceedings of the 9th International Database Engineering & Application Symposium, IDEAS 2005, pp. 445–454. IEEE Computer Society, Washington, DC (2005)
Jestratjew, A., Kwiecien, A.: Performance of HTTP Protocol in Networked Control Systems. IEEE Trans. Industrial Informatics 9(1), 271–276 (2013)
Patroumpas, K., Sellis, T.: Subsuming multiple sliding windows for shared stream computation. In: Eder, J., Bielikova, M., Tjoa, A.M. (eds.) ADBIS 2011. LNCS, vol. 6909, pp. 56–69. Springer, Heidelberg (2011)
Gorawski, M., Marks, P.: Fault-tolerant distributed stream processing system. In: International Workshop on Database and Expert Systems Applications – DEXA, pp. 395–399 (2006)
Gorawski, M., Malczok, R.: AEC Algorithm: A Heuristic Approach to Calculating Density-Based Clustering Eps Parameter. In: Yakhno, T., Neuhold, E.J. (eds.) ADVIS 2006. LNCS, vol. 4243, pp. 90–99. Springer, Heidelberg (2006)
Gorawski, M., Malczok, R.: Towards automatic Eps calculation in density-based clustering. In: Manolopoulos, Y., Pokorný, J., Sellis, T.K. (eds.) ADBIS 2006. LNCS, vol. 4152, pp. 313–328. Springer, Heidelberg (2006)
Gorawski, M., Marks, P.: Towards automated analysis of connections network in distributed stream processing system. In: Haritsa, J.R., Kotagiri, R., Pudi, V. (eds.) DASFAA 2008. LNCS, vol. 4947, pp. 670–677. Springer, Heidelberg (2008)
Gorawski, M., Lorek, M., Gorawska, A.: CUDA Powered User-Defined Types and Aggregates. In: International Workshop on Engineering Object-Oriented Parallel Software (IEEE AINA_EOOPS-2013). IEEE CS (to appear, 2013)
Jestratjew, A., Kwiecień, A.: Using Cloud Storage in Production Monitoring Systems. In: Kwiecień, A., Gaj, P., Stera, P. (eds.) CN 2010. CCIS, vol. 79, pp. 226–235. Springer, Heidelberg (2010)
Kwiecień, A., Sidzina, M.: Dual Bus as a Method for Data Interchange Transaction Acceleration in Distributed Real Time Systems. In: Kwiecień, A., Gaj, P., Stera, P. (eds.) CN 2009. CCIS, vol. 39, pp. 252–263. Springer, Heidelberg (2009)
Kwiecień, A., Opielka, K.: Industrial Networks in Explosive Atmospheres. In: Kwiecień, A., Gaj, P., Stera, P. (eds.) CN 2011. CCIS, vol. 160, pp. 367–378. Springer, Heidelberg (2011)
Skrzewski, M.: Analyzing Outbound Network Traffic. In: Kwiecień, A., Gaj, P., Stera, P. (eds.) CN 2011. CCIS, vol. 160, pp. 204–213. Springer, Heidelberg (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gorawski, M., Gorawska, A., Pasterak, K. (2013). Evaluation and Development Perspectives of Stream Data Processing Systems. In: Kwiecień, A., Gaj, P., Stera, P. (eds) Computer Networks. CN 2013. Communications in Computer and Information Science, vol 370. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38865-1_31
Download citation
DOI: https://doi.org/10.1007/978-3-642-38865-1_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38864-4
Online ISBN: 978-3-642-38865-1
eBook Packages: Computer ScienceComputer Science (R0)