New Challenges in Petascale Scientific Databases

Szalay, Alexander

doi:10.1007/978-3-540-69497-7_1

Alexander Szalay¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5069))

Included in the following conference series:

International Conference on Scientific and Statistical Database Management

1254 Accesses
5 Citations

Abstract

Scientific data is doubling every year. Virtual Observatories are established over every scale of the physical world: from elementary particles to materials, biological systems, environmental observatories, remote sensing, and the universe. These collaborations collect increasing amounts of data, often close to a rate of petabytes per year. Many scientists will soon obtain most of their data from large scientific repositories of data, often stored in the form of databases. The talk will discuss the different requirements for such databases, and discuss user behavior in a few concrete examples taken from astronomy, in particular from the 6 year usage of the Sloan Digital Sky Survey database. Interesting query patterns are emerging, where users create custom “crawlers” to break large queries into many repetitive ones. The trial-and-error behavior of many exploratory projects will be also discussed. The talk will also present various scalable alternatives to large scientific analysis facilities.

Access provided by Autonomous University of Puebla. Download to read the full chapter text

Chapter PDF

PanDA: Production and Distributed Analysis System

Article Open access 23 January 2024

Rucio: Scientific Data Management

Article Open access 09 August 2019

Large-scale data services for science: Present and future challenges

Article 04 September 2016

Author information

Authors and Affiliations

Department of Physics and Astronomy, The Johns Hopkins University, 3701 San Martin Drive, Baltimore, MD 21218
Alexander Szalay

Authors

Alexander Szalay
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Bertram Ludäscher Nikos Mamoulis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Szalay, A. (2008). New Challenges in Petascale Scientific Databases. In: Ludäscher, B., Mamoulis, N. (eds) Scientific and Statistical Database Management. SSDBM 2008. Lecture Notes in Computer Science, vol 5069. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69497-7_1

Download citation

DOI: https://doi.org/10.1007/978-3-540-69497-7_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69476-2
Online ISBN: 978-3-540-69497-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

New Challenges in Petascale Scientific Databases

Abstract

Chapter PDF

Similar content being viewed by others

PanDA: Production and Distributed Analysis System

Rucio: Scientific Data Management

Large-scale data services for science: Present and future challenges

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

New Challenges in Petascale Scientific Databases

Abstract

Chapter PDF

Similar content being viewed by others

PanDA: Production and Distributed Analysis System

Rucio: Scientific Data Management

Large-scale data services for science: Present and future challenges

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation