Web Archive Processing
Title
Web Archive Processing
Subject
Web Archiving
Description
Web Archive Processing by Mike Smorul is a PDF document that describes different web archiving strategies and how they work. It covers how websites are indexed and how a large quantity of websites is managed within the archives. It details the manager components available to help manage the data and gives an example of how URLs are documented. It also explains how the management and storage systems are designed and what was used to make them efficient, describes how the process works, and gives statistics on how it performs.
Creator
Smorul, Mike
Date
2010, 09/28
Contributor
Jordan Lunsford
Type
Document
Bibliographic Citation
Smorul, Mike. “Web Archive Processing.” Library of Congress, August 18, 2010. http://www.digitalpreservation.gov/meetings/documents/othermeetings/Smorul-Web_Archive.pdf.
Local URL
http://www.digitalpreservation.gov/meetings/documents/othermeetings/Smorul-Web_Archive.pdf
Citation
Smorul, Mike, “Web Archive Processing,” Digital Archiving Resources, accessed January 6, 2025, https://dar.cah.ucf.edu/items/show/456.