Digital Archiving Resources

Web Archive Processing

Title

Web Archive Processing

Subject

Web Archiving

Description

Web Archive Processing by Mike Smorul is a PDF document that describes several web archiving strategies and how they work. It covers how websites are indexed and how a large number of sites is managed within the archives. It outlines the manager components available for handling the data and gives an example of how URLs are documented. It also explains how the management and storage system is designed and what was used to make it efficient, describes how the process works, and reports statistics on its performance.

Creator

Smorul, Mike

Date

2010, 09/28

Contributor

Jordan Lunsford

Type

Document

Bibliographic Citation

Smorul, Mike. “Web Archive Processing.” Library of Congress, August 18, 2010. http://www.digitalpreservation.gov/meetings/documents/othermeetings/Smorul-Web_Archive.pdf.

Local URL

http://www.digitalpreservation.gov/meetings/documents/othermeetings/Smorul-Web_Archive.pdf

Files

Web Archive Processing.jpg

Collection

Citation

Smorul, Mike, “Web Archive Processing,” Digital Archiving Resources, accessed April 27, 2024, https://dar.cah.ucf.edu/items/show/456.