A Digital Preservation Repository for Duke University Libraries

  • View
    57

  • Download
    3

Embed Size (px)

DESCRIPTION

A Digital Preservation Repository for Duke University Libraries. Jim Coble Digital Repository Services Duke University Libraries j im.coble@duke.edu. Preservation Repository. Improve digital asset preservation processes First Use Case: Digital Collections Master Files - PowerPoint PPT Presentation

Transcript

A Digital Preservation Repository for Duke University Libraries

A Digital Preservation Repository for Duke University LibrariesJim CobleDigital Repository ServicesDuke University Librariesjim.coble@duke.eduTRLN Annual Meeting 20131Preservation RepositoryImprove digital asset preservation processesFirst Use Case: Digital Collections Master FilesDigitized content, in-house and out-sourced380,000 archival master files (~ 20 TB)Primarily still images, with some audio and video

TRLN Annual Meeting 2013Current Scenario (Typical)Archival master filesProduced by librarys Digital Production Center (DPC)Stored on filesystemACE-AM for periodic checksum validationDescriptive metadataProduced by Cataloging and Metadata Services departmentMaintained in CONTENTdm (or elsewhere)Technical metadataGenerated and maintained by DPCNothing ties these elements together except local knowledge and a DPC identifierTRLN Annual Meeting 2013Initial Project GoalTRLN Annual Meeting 2013Descriptive MetadataPreservation RepositoryDPC Technical MetadataArchival Master FilesTechnologyFedora Commons RepositoryHydra Project FrameworkFedora (repository)Solr (index)Blacklight (discovery and access)Hydra-Head (object creation / management)TRLN Annual Meeting 2013TimelineSpring 2012: Prototype using Fedora command line utilities and Django using found timeJune 2012: Project formally launchedFebruary 2013: Initial pilot completedJune 2013: Production preservation repository launched with two collections ingested

TRLN Annual Meeting 2013IngestLarge amount of content to ingest380,000 archival master files (~ 20 TB)Batch ingest mechanismReads content files from file systemPulls in corresponding descriptive and technical metadataCreates three PREMIS (Preservation) Event records for each ingested objectIngestionIngest ValidationInitial Fixity Check

TRLN Annual Meeting 2013Validation PreservationEventIn PreservationEvent eventMetadata datastream

TRLN Annual Meeting 2013

Export SetsDelivering archival master files to authorized patrons upon requestCurrent process is manualDPC staff locate master file(s) on filesystemPossibly create a zip filePlace file(s) in pick-up location or copy onto CD, DVD, etc., for deliveryTRLN Annual Meeting 2013Export SetsBuilt on bookmark functionalityStaff member searches for content-bearing objects of interest and bookmarks themExport set can be created from bookmark listContent files are retrieved from the repository and bundled into a zip fileStaff member can download and deliver to patronZip file includes a README manifest listing the content files with basic metadata

TRLN Annual Meeting 2013TRLN Annual Meeting 2013ScreenshotWalk-ThroughRepository Home PageTRLN Annual Meeting 2013

Collection IndexTRLN Annual Meeting 2013

Collection Content: ItemsTRLN Annual Meeting 2013

Creating Export SetTRLN Annual Meeting 2013

Creating Export SetTRLN Annual Meeting 2013

Export Set CreatedTRLN Annual Meeting 2013

Export Set Zip FileTRLN Annual Meeting 2013

Future PlansVersion 1.1 By September 2013Interface improvementsRefactored batch ingestFuture enhancementsIngest (batch and individual) performed by library staffEditing capabilityFuture Use CasesFaculty scholarship, electronic theses and dissertationsElectronic records and other born-digital contentDatasetsImage library for teaching / learning

TRLN Annual Meeting 2013Questions?Jim Coblejim.coble@duke.eduDigital Repository ServicesDuke University Libraries

Projecthttps://github.com/duke-libraries/dul-hydra

TRLN Annual Meeting 2013