Upload
mary-elings
View
48
Download
0
Embed Size (px)
Citation preview
More Product, Less Process: Mass Digitization of Special Collections
Mary W. Elings
Archivist for Digital Collections
The Bancroft Library
University of California
Berkeley
Museum Computer Network Annual Conference - Chicago 2007
Comparing the Players...• Google
– all books– scanning off-site– human-assisted
scanners
– 3000 books/day– funded internally– some restrictions on
access to full library
• OCA– only public domain– scanning in-house
– human-assisted “scribe” scanners
– 1000 books/week– funded externally/partners– full, open access to entire
library
Contracts and Agreements
• Karen Coyle’s assessment of the University of Michigan and University of California contracts with Google:
http://kcoyle.blogspot.com/2006/08/dotted-line.html
How is this changing standards?
• JPEG2000 instead of TIFF as master file
• OCR plain text instead of TEI encoding
• PDF acceptable format
Note: these standards are also in use by other digitization efforts such as the National Digital Newspaper Program (NDNP)http://www.loc.gov/ndnp/
OCA: Sloan Funded Collections
• Bancroft Library of the University of California at Berkeley: Key primary texts documenting the California Gold Rush and Western expansion.
• Boston Public Library: The John Adams collection, which is the complete personal library of the Founding Father, lifelong book collector and second President of the United States.
• The Getty Research Institute: Major collection of books on art and architecture and an alternate collection on the performing arts.
• Johns Hopkins University Libraries: The James Birney Collection of Anti- Slavery materials
NASA and Internet Archive
• Space Act– to develop a massive online archive of photography,
film and video from NASA’s 50-year history
– five-year, non-exclusive agreement wherein the Internet Archive will digitize, host and manage still, moving and computer-generated imagery produced by NASA.
http://www.nasa.gov/home/hqnews/2007/aug/HQ_07178_Internet_Archive.html