Upload
others
View
0
Download
0
Embed Size (px)
Citation preview
Rosetta installed in March 2012
Production Environment Development / Testing Environment
Integrate with Library Workflows & Processes Custom Harvest Tool & Ingest Processes
Ingest various types of collections: University Academic records, many born digital New Library repositories Camera raw images University videos Audio Digitization project Unstructured folders of archival objects
Investigate alternative long term storage Amazon S3 Cloud Storage Hitachi - LG Data Storage (HLDS) Optical Archive System DPN – Rosetta Proof of Concept Project
Digital Commons (bepress) Digital Files: images, text, audio, video, office documents, etc. All university theses and dissertations Single and compound objects Rosetta Harvester - In process
University Records SharePoint Records Management system Harvest permanent & archival records into Rosetta In-office copy on M-Disc as requested
SharePoint 2013 interface (REST) Phase 1 in testing Archive permanent objects Configure files to archive by path / category / date
Create SIPS from SharePoint Export Export selected content to Rosetta server directory Create METS xml from metadata and objects
Ingest SIPS into Rosetta Ingest SIP Create Collection folders in Rosetta
SharePoint 2013 Administrative Records
University Photographer Archives Existing photos – 40 TB Add 500,000 images each year Canon Raw images, video, others
Library Media Projects Academic Lecture Videos – 5 TB Audio Digitization Project – 25 TB School of Music Performance archive – 5 TB
Historical, Institutional, Archival content Unstructured folders of archival objects
Browse a file path and record information Output metadata to a spreadsheet (csv, .xls, .xlsx) Archivists add additional information Create SIP from metadata & folders of objects Ingest folder path into Rosetta
Digital Preservation Network: Includes 60+ research libraries throughout the United States. Formed to ensure that the complete scholarly record is preserved. DPN is a dark archive with a federated approach to preservation. Harold B. Lee Library is a charter member. Rosetta Proof of Concept
Export Rosetta content Create bags with Bagit Ingest in DPN node
Local Servers. Virtual Data Center Tape archive. Offsite storage Millenniata M-disc archive
Additional copy on M-Disc DVD and Blu-ray Copies are cataloged and stored in Special Collections Resistant to heat, light, temperature, magnetism, bit rot
Amazon S3 Cloud Storage HLDS Optical Archive System
Setup for Rosetta Define Rosetta Storage and Rules Location of IEs: Local storage Location of Files: Stored on Amazon S3
Amazon S3 Cloud Storage HLDS Optical Archive System
Hitachi-LG Data Storage (HLDS), a joint venture between Hitachi and LG Enterprise Blu-ray Disc™ for Archival Storage Millenniata M-Discs (planned) Expandable / Unlimited Storage Available world-wide
HLDS Optical Archive System Full unit has a capacity of 1 PB Multiple units can be connected Cartridges can be stored off-line Offline & Remote Replication Lower total cost of ownership
Rosetta Test Configuration Each library has a capacity of 100 TB Currently using Blu-ray Disc™ 200 GB discs Test of Millenniata M-Discs Planned
Rosetta Testing Rosetta IEs: Stored on local server Rosetta Files : Stored on Optical Archive System Confirmed Write / Read from Rosetta Testing MD5 fixity check Testing off-line cartridges
Optical Archive System
Rosetta
Permanent (HDD)
Storage Buffer
Optical Library
Storage Control Server
File File
IEs Files
What we are looking for in storage . . . Sufficient capacity for our ever increasing content Reasonable long term cost
Lower total costs of ownership Reduced cost of refreshing or migration
Reliable and recoverable Archival media Industry Storage Partner Multiple copies, locations Secure storage
Accessible Rosetta Network
Questions?