Functionalities of Web Archives
Web archiving
The functionalities that are important to the users of web archives range from basic searching and browsing to advanced personalized and customized services, data mining, and website reconstruction. The author examined ten of the most established English language web archives to determine which functionalities each of the archives supported, and how they compared.
A functionality checklist was designed, based on use cases created by the International Internet Preservation Consortium (IIPC), and the findings of two related user studies. The functionality review was conducted, along with a comprehensive literature review of web archiving methods, in preparation for the development of a web archiving course for Library and Information School students. This paper describes the functionalities used in the checklist, the extent to which those functionalities are implemented by the various archives, and discusses the author's findings.
Niu, Jinfang
D-Lib Magazine
2012-03
Polk, Victoria
2012 Jinfang Niu
Online Journal
http://dlib.org/dlib/march12/niu/03niu2.html
Web Archiving
Web archiving
The PADI Web Archiving webpage provides an overview of global web archiving practices as well as resources on issues related to web archiving. It briefly defines different models for and approaches to web archiving before providing case studies of archiving approaches used in various countries, including Australia, Austria, Canada, the Czech Republic, Denmark, Finland, France, Germany, Japan, Lithuania, the Netherlands, New Zealand, Norway, Portugal, Sweden, the U.K., and the U.S.A. It then describes five “collaborative initiatives” in which multiple institutions and countries contribute towards cooperative web archiving projects. Next, the webpage lists web archiving resources from each country discussed, including example archives as well as papers and articles on archiving issues. Finally, it includes an extensive list of relevant resources from around the world, including articles, surveys, evaluations, bibliographies, discussion lists, and more.
PADI: Preserving Access to Digital Information
National Library of Australia
2004-05
Polk, Victoria
Website
http://pandora.nla.gov.au/pan/10691/20110824-1153/www.nla.gov.au/padi/topics/738.html
A Vision of the Role and Future of Web Archives
Web archiving
The history of the web and the record of its impact on society may never fully be realized if measures to record and preserve its content are not carefully and consistently maintained. Leetaru identifies the inconsistencies in web archiving by public institutions, such as the Library of Congress, and commercial enterprises, such as the New York Times. He explains current trends for limiting the size of the “crawl” (ingesting web content into the archive) and the web site’s rate of change may not promote discovery of patterns and insights for future scholars and historians. Leetaru proposes web archiving institutions solicit the users and data miners for selecting and presenting the web archive’s content and developing the protocol for ingesting web artefacts.
In addition to increasing the collaboration of a broad web archiving community, Leetaru suggests web archives should also increase the visibility of its holdings and provide sufficient contextual information for the different versions and replacements of web content. Like Wikipedia’s chronology of updates and editions for each page of content, Leetaru believes a web archive should reveal the source code as well as origins of its content. In response to copyright restrictions and rights to privacy, he recommends “snapshots” and limiting algorithms to “surface-level analyses.” By opening access to the intellectual content of the web artefacts and simultaneously adhering to both property and technical standards, preservation of the web archive and the potential for future research can be assured.
Leetaru, Kalev
University of Illinois
2012
Polk, Victoria
Document
http://netpreserve.org/sites/default/files/resources/VisionRoles.pdf
Historical Infrastructures for Web Archiving: Annotation of Ephemeral Collections for Researchers of Cultural Heritage Institutions
Web Archiving
Archivists, historians, and scholars agree that the accumulation of data circulating daily on the Web should be preserved. Yet, there are inconsistencies and gaps in the type of access to web archives created by various academic, public, and private institutions. Authors Dougherty and van den Heuvel explain web archives constructed by institutions such as the Internet Archive or the Library of Congress are designed primarily for retrieval of content in contrast to more specialized academic web archives that focus on data mining and discovery. The authors suggest all web archiving programs expand the annotations in the metadata to include multiple sources and digital ephemera (including the advertising, blogging, and tagging typically ignored by web crawlers). They refer to both Wikipedia and to a nineteenth century historian as models for recording origins, versions, and composing thick descriptions written for each item. By rendering this extensive metadata transparent and making it accessible for all types of users, web archives can provide the cyberstructure suitable for both e-research and e-heritage.
Dougherty, Meghan
Heuvel, Charles van den
MIT6
2009
Polk, Victoria
Document
http://web.mit.edu/comm-forum/mit6/papers/Dougherty_Heuvel.pdf
Archiving websites: a practical guide for information management professionals
Web archiving
Drawing on the author's experience of managing the National Archives' web archiving programme together with lessons learned from other international initiatives, this book offers a comprehensive overview of current best practice, together with practical guidance on establishing a web archiving programme.
Brown, Adrian
Facet Publishing
2006
Branch, Justin
CILIP: the Chartered Institute of Library and Information Professionals 2009-2015.
Book
ISBN: 9781856045537
Online Art Ephemera: Web Archiving at the National Museum of Women in the Arts
Web archiving
Abstract Artist and art subject file collections contain important primary source ephemera for art historical research—but what happens when the ephemera are online? The National Museum of Women in the Arts has been web archiving art-related online ephemera using the Internet Archive's Archive-It since November 2011. This case study presents the considerations and challenges of archiving such types of material and provides a foundation for arts institutions to begin more collaborative web archiving.
Slania, Heather
Art Documentation: Journal of the Art Libraries of North America
1 March 2013
Mulligan, Paige
Copyright 2013 The University of Chicago Press
Journal Article
Best of Both Worlds: Museums, Libraries, and Archives in the Digital Age
Archives
In this text Clough uses the example of the Smithsonian museum to ask and then answer the question, "How can we prepare ourselves to reach the generation of digital natives who bring a huge appetite-and aptitude-for the digital world?" His text discusses how the digital archiving of 2D materials for the Library of Congress and the National Archives have certainly paved the way for future digital archiving but how there are greater challenges for the museum and places attempting to put 3D objects into 2D Internet access. He explores how the impact of the digital world affects libraries and museums, specifically the Smithsonian. Clough emphasizes and attempts to provide answers for the difficulties of creating a digital world, such as making 3D objects 2D for online access and interaction.
Clough, Wayne G.
Smithsonian Books
2013
Polk, Victoria
E-Book
http://www.amazon.com/Best-Both-Worlds-Libraries-Archives-ebook/dp/B00EVT2XG0/ref=sr_1_2?s=books&ie=UTF8&qid=1430924416&sr=1-2&keywords=digital+archive
The Importance of Web Archives For Humanities
Web archiving
This article finds that though the web is a valuable resource for historical research, its information is significantly ephemeral. The text focuses mainly on web archiving and how its aim is to obtain, preserve, and provide access to the published online historical information. Essentially, what the article is emphasizing is how web archiving is very useful for preserving institutional and personal memories. In this text they provide examples of tools that enable historical research through web archives specifically highlighting web archives potential for the Humanities in general.
Gomes, Daniel
Costa, Miguel
Journal Of Humanities & Arts Computing: A Journal Of Digital Humanities
2014
Polk, Victoria
Edinburgh University Press 2014
Journal Article
Metadata For A Web Archive: PREMIS And XMP As Tools For The Task
Web archiving
In this article the standards of metadata and tools are explored and questioned to see if they are the best options for ensuring protection and permanence of web archiving objects, such as snapshots of websites. The text questions whether the evolution of web interfaces can even be documented. Through studies it explores the possibility of encoding Preservation Metadata: Implementation Strategies (PREMIS) within Adobe's eXtensible Metadata Platform (XMP). And through their findings they found it is in part impossible to encode PREMIS within XMP. However the article does seek to find best practices related to metadata standards and tools that are relevant and useful for web archiving.
Romaniuk, Laurentia
Library Philosophy & Practice
2014
Polk, Victoria
Journal Article
Oldies But Goodies: Archiving Web- Based Information
Web archiving
Phyllis Holman Weisbard discusses the ways of archiving web-based information. With so much former print versions of materials now available electronically, what she focuses on is how material that never had a print version (born digitals) are in the most danger of disappearing. Web domains lapse, e-zines lose funding and as a result their materials disappear. Weisbard focuses her attention on the Internet Archive and pays particular attention to its Wayback Machine. Wayback crawls through millions of websites (using Alexa software) and saves versions of these sites. She then turns her attention to web archiving projects that focus solely on women. She gives URLs for a blog resource on women’s voices, describes Aletta, Institute for Women’s History, and how the staff has created hundreds of items of women’s e-zines and newsletters, LOCKSS, and Portico (other initiatives dedicated to preserving the writings of women). This article has information on web archiving technology focusing on preserving women’s writings. Weisbard’s article shows visuals of each site so you can examine each interface. Her essay is a call to action for Women’s Studies scholars to be more proactive in preserving of these female voices by working in collaboration with librarians and archivists.
Weisbard, Phyllis Holman
2011
Elena Rogalle
Journal article