Abstract
This chapter discusses the importance of web archiving, briefly presents its history from the beginning with the Internet Archive in 1996 and exposes the challenges with archiving certain types of online data.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Archive Team (2021) Geocities. Archive Team. Available at: https://www.archiveteam.org/index.php?title=GeoCities. Accessed on 29 January 2021
Archive-it (2021) Human rights documentation initiative. Archive-it. Available at: https://archive-it.org/collections/1475. Accessed on 29 January 2021
Bruns A (2018) The library of congress twitter archive: a failure of historic proportions. Medium. Available at: https://medium.com/dmrc-at-large/the-library-of-congress-twitter-archive-a-failure-of-historic-proportions-6dc1c3bc9e2c. Accessed on 2 January 2021
Clement (2020) Number of social network users worldwide from 2010 to 2023. Statista. Available at: https://www.statista.com/statistics/278414/number-of-worldwide-social-network-users/. Accessed on 1 April 2020
Digital Preservation Coalition (2021a) About. Digital Preservation Coalition. Available at: https://www.dpconline.org/about. Accessed on 29 January 2021
Digital Preservation Coalition (2021b) Web-archiving. Digital Preservation Coalition. Available at: https://www.dpconline.org/handbook/content-specific-preservation/web-archiving. Accessed on 29 January 2021
Espenschied D (2016) Rhizome releases first public version of webrecorder. Rhizome. Available at: https://rhizome.org/editorial/2016/aug/09/rhizome-releases-first-public-version-of-webrecorder/. Accessed on 9 August 2020
FIAF-International Federation of Film Archives (2021) FIAF’s Mission. FIAF. Availble at: https://www.fiafnet.org/pages/Community/Mission-FIAF.html. Accessed on 29 January 2021
Film Foundation (2021) About us-film preservation. The Film Foundation. Available at: https://web.archive.org/web/20130312021638/http://www.film-foundation.org/common/11004/aboutAboutUs.cfm?clientID=11004&sid=2&ssid=5. Accessed on 29 January 2021
Fischer T (2020) How big is the web? Lifewire. Available at: https://www.lifewire.com/how-big-is-the-web-4065573. Accessed on 15 January 2020
Gilbertson S (2010) Geocities lives on a massive torrent download. Wired. Available at: https://www.wired.com/2010/11/geocities-lives-on-as-massive-torrent-download/. Accessed on 1 November 2020
IIPC-International Internet Preservation Consortium (2017) Tools and Software. Git Hub. Available at: https://github.com/iipc/iipc.github.io/wiki/Tools-and-Software. Accessed on 29 January 2021
IIPC-International Internet Preservation Consortium (2021) About IIPC. Net-Preserve. Available at: http://netpreserve.org/about-us/. Accessed on 29 January 2021
INA (2021) Dépôt légal radio, télé et web. Institut national de l’audiovisuel. Available at: https://institut.ina.fr/institut/statut-missions/depot-legal-radio-tele-et-web. Accessed on 29 January 2021
Internet Archive (2009) Geocities special collection 2009. Internet Archive. Available at: https://archive.org/web/geocities.php. Accessed on 29 January 2021
Internet Archive (2021) About the Internet Archive. Internet Archive. Available at: https://archive.org/about/. Accessed on 29 January 2021
Kenez P (2001) A history of Bezhin meadow. In: LaValley AJ, Scherr BP (eds) Eisenstein at 100: a reconsideration. Rutgers University Press, New Jersey
Lee HB, Nazareno F, Jung SH, Cho WS (2011) A vertical search engine for school information based on Heritrix and Lucene. In: Lee G, Howard D, Ślęzak D (eds) Convergence and hybrid information technology. Springer, Berlin
Leetaru K (2017) Are web archives failing the modern web: video, social media, dynamic pages and the mobile web. Forbes. Available at: https://www.forbes.com/sites/kalevleetaru/2017/02/24/are-web-archives-failing-the-modern-web-video-social-media-dynamic-pages-and-the-mobile-web/#53a22d3845b1. Accessed on 24 February 2020
Manning CD, Raghavan P, Schutze H (2008) Introduction to information retrieval. Cambridge University Press, Cambridge
Masanès J (2006) Web archiving. Springer, New York
Mohr G, Kimpton M, Stack M, Ranitovic I (2004) Introduction to heritrix, an archival quality web crawler. In: Proceedings of the 4th International Web Archiving Workshop IWAW’04
National Archives (2021) Twitter archives. National archives-UK Government. Available at: https://webarchive.nationalarchives.gov.uk/twitter/. Accessed on 29 January 2021
Niu J (2012) An overview of web archiving. D-Lib Mag 18(3–4). Available at: http://www.dlib.org/dlib/march12/niu/03niu1.html
Ohlheiser A (2013) Most of America’s silent films are lost forever. The Atlantic. Available at: https://www.theatlantic.com/culture/archive/2013/12/most-americas-silent-films-are-lost-forever/355775/. Accessed on 4 December 2020
Riley H, Crookston M (2015) Use of the NZ web archive: introduction and context. National Library of New Zealand. Available at: https://natlib.govt.nz/librarians/reports-and-research/use-of-the-nz-web-archive/introduction
Stone B (2010) Tweet preservation. Blog Twitter. Available at: https://blog.twitter.com/official/en_us/a/2010/tweet-preservation.html. Accessed on 14 April 2020
Wikimedia Foundation, Inc (2020) List of web archiving initiatives. https://en.wikipedia.org/wiki/List_of_Web_archiving_initiatives, last update on 21 April 2020. Accessed on 28 April 2020
World Wide Web Foundation (2021) History of the Web. Available at: https://webfoundation.org/about/vision/history-of-the-web/. Accessed on 29 January 2021
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Major, D., Gomes, D. (2021). Web Archives Preserve Our Digital Collective Memory. In: Gomes, D., Demidova, E., Winters, J., Risse, T. (eds) The Past Web. Springer, Cham. https://doi.org/10.1007/978-3-030-63291-5_2
Download citation
DOI: https://doi.org/10.1007/978-3-030-63291-5_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-63290-8
Online ISBN: 978-3-030-63291-5
eBook Packages: Computer ScienceComputer Science (R0)