20
A Digital Preservation Repository for Duke University Libraries Jim Coble Digital Repository Services Duke University Libraries [email protected] TRLN Annual Meeting 2013

A Digital Preservation Repository for Duke University Libraries

  • Upload
    alina

  • View
    69

  • Download
    3

Embed Size (px)

DESCRIPTION

A Digital Preservation Repository for Duke University Libraries. Jim Coble Digital Repository Services Duke University Libraries j [email protected]. Preservation Repository. Improve digital asset preservation processes First Use Case: Digital Collections Master Files - PowerPoint PPT Presentation

Citation preview

Page 1: A Digital Preservation Repository for Duke University Libraries

A Digital Preservation Repository for Duke University Libraries

Jim Coble

Digital Repository Services

Duke University Libraries

[email protected]

TRLN Annual Meeting 2013

Page 2: A Digital Preservation Repository for Duke University Libraries

Preservation Repository

Improve digital asset preservation processes First Use Case: Digital Collections Master Files

Digitized content, in-house and out-sourced 380,000 archival master files (~ 20 TB) Primarily still images, with some audio and

video

TRLN Annual Meeting 2013

Page 3: A Digital Preservation Repository for Duke University Libraries

Current Scenario (Typical)

Archival master files Produced by library’s Digital Production Center (DPC) Stored on filesystem ACE-AM for periodic checksum validation

Descriptive metadata Produced by Cataloging and Metadata Services

department Maintained in CONTENTdm (or elsewhere)

Technical metadata Generated and maintained by DPC

Nothing ties these elements together except local knowledge and a DPC identifier

TRLN Annual Meeting 2013

Page 4: A Digital Preservation Repository for Duke University Libraries

Initial Project Goal

TRLN Annual Meeting 2013

Descriptive Metadata

Preservation Repository

DPC Technical Metadata

Archival Master Files

Page 5: A Digital Preservation Repository for Duke University Libraries

Technology

Fedora Commons Repository Hydra Project Framework

Fedora (repository) Solr (index) Blacklight (discovery and access) Hydra-Head (object creation / management)

TRLN Annual Meeting 2013

Page 6: A Digital Preservation Repository for Duke University Libraries

Timeline

Spring 2012: Prototype using Fedora command line utilities and Django using “found time”

June 2012: Project formally launched February 2013: Initial pilot completed June 2013: Production preservation repository

launched with two collections ingested

TRLN Annual Meeting 2013

Page 7: A Digital Preservation Repository for Duke University Libraries

Ingest

Large amount of content to ingest 380,000 archival master files (~ 20 TB)

Batch ingest mechanism Reads content files from file system Pulls in corresponding descriptive and technical

metadata

Creates three PREMIS (Preservation) Event records for each ingested object Ingestion Ingest Validation Initial Fixity Check

TRLN Annual Meeting 2013

Page 8: A Digital Preservation Repository for Duke University Libraries

Validation PreservationEvent

In PreservationEvent eventMetadata datastream …

TRLN Annual Meeting 2013

Page 9: A Digital Preservation Repository for Duke University Libraries

Export Sets

Delivering archival master files to authorized patrons upon request

Current process is manual DPC staff locate master file(s) on filesystem Possibly create a zip file Place file(s) in pick-up location or copy onto CD, DVD,

etc., for delivery

TRLN Annual Meeting 2013

Page 10: A Digital Preservation Repository for Duke University Libraries

Export Sets

Built on bookmark functionality Staff member searches for content-bearing objects of

interest and bookmarks them Export set can be created from bookmark list

Content files are retrieved from the repository and bundled into a zip file Staff member can download and deliver to patron Zip file includes a README manifest listing the

content files with basic metadata

TRLN Annual Meeting 2013

Page 11: A Digital Preservation Repository for Duke University Libraries

TRLN Annual Meeting 2013

Screenshot

Walk-Through

Page 12: A Digital Preservation Repository for Duke University Libraries

Repository Home Page

TRLN Annual Meeting 2013

Page 13: A Digital Preservation Repository for Duke University Libraries

Collection Index

TRLN Annual Meeting 2013

Page 14: A Digital Preservation Repository for Duke University Libraries

Collection Content: Items

TRLN Annual Meeting 2013

Page 15: A Digital Preservation Repository for Duke University Libraries

Creating Export Set

TRLN Annual Meeting 2013

Page 16: A Digital Preservation Repository for Duke University Libraries

Creating Export Set

TRLN Annual Meeting 2013

Page 17: A Digital Preservation Repository for Duke University Libraries

Export Set Created

TRLN Annual Meeting 2013

Page 18: A Digital Preservation Repository for Duke University Libraries

Export Set Zip File

TRLN Annual Meeting 2013

Page 19: A Digital Preservation Repository for Duke University Libraries

Future Plans

Version 1.1 – By September 2013 Interface improvements Refactored batch ingest

Future enhancements Ingest (batch and individual) performed by library staff Editing capability

Future Use Cases Faculty scholarship, electronic theses and

dissertations Electronic records and other born-digital content Datasets Image library for teaching / learning

TRLN Annual Meeting 2013

Page 20: A Digital Preservation Repository for Duke University Libraries

Questions?

Jim Coble

[email protected]

Digital Repository Services

Duke University Libraries

Project

https://github.com/duke-libraries/dul-hydra

TRLN Annual Meeting 2013