A Digital Preservation Repository for Duke University Libraries

  • Published on
    10-Jan-2016

DESCRIPTION

A Digital Preservation Repository for Duke University Libraries. Jim Coble, Digital Repository Services, Duke University Libraries, jim.coble@duke.edu. Preservation Repository: improve digital asset preservation processes. First use case: Digital Collections master files.

Transcript

A Digital Preservation Repository for Duke University Libraries
Jim Coble
Digital Repository Services
Duke University Libraries
jim.coble@duke.edu
TRLN Annual Meeting 2013

Preservation Repository

  • Improve digital asset preservation processes
  • First Use Case: Digital Collections Master Files
      • Digitized content, in-house and out-sourced
      • 380,000 archival master files (~ 20 TB)
      • Primarily still images, with some audio and video

Current Scenario (Typical)

  • Archival master files
      • Produced by library's Digital Production Center (DPC)
      • Stored on filesystem
      • ACE-AM for periodic checksum validation
  • Descriptive metadata
      • Produced by Cataloging and Metadata Services department
      • Maintained in CONTENTdm (or elsewhere)
  • Technical metadata
      • Generated and maintained by DPC
  • Nothing ties these elements together except local knowledge and a DPC identifier

Initial Project Goal

  [Diagram labels: Descriptive Metadata, Preservation Repository, DPC Technical Metadata, Archival Master Files]

Technology

  • Fedora Commons Repository
  • Hydra Project Framework
      • Fedora (repository)
      • Solr (index)
      • Blacklight (discovery and access)
      • Hydra-Head (object creation / management)

Timeline

  • Spring 2012: Prototype using Fedora command line utilities and Django using found time
  • June 2012: Project formally launched
  • February 2013: Initial pilot completed
  • June 2013: Production preservation repository launched with two collections ingested

Ingest

  • Large amount of content to ingest
      • 380,000 archival master files (~ 20 TB)
  • Batch ingest mechanism
      • Reads content files from file system
      • Pulls in corresponding descriptive and technical metadata
      • Creates three PREMIS (Preservation) Event records for each ingested object
          • Ingestion
          • Ingest Validation
          • Initial Fixity Check

Validation PreservationEvent

  • In PreservationEvent eventMetadata datastream

Export Sets

  • Delivering archival master files to authorized patrons upon request
  • Current process is manual
      • DPC staff locate master file(s) on filesystem
      • Possibly create a zip file
      • Place file(s) in pick-up location or copy onto CD, DVD, etc., for delivery

Export Sets

  • Built on bookmark functionality
  • Staff member searches for content-bearing objects of interest and bookmarks them
  • Export set can be created from bookmark list
  • Content files are retrieved from the repository and bundled into a zip file
  • Staff member can download and deliver to patron
  • Zip file includes a README manifest listing the content files with basic metadata

Screenshot Walk-Through

  • Repository Home Page
  • Collection Index
  • Collection Content: Items
  • Creating Export Set (two screenshots)
  • Export Set Created
  • Export Set Zip File

Future Plans

  • Version 1.1 By September 2013
      • Interface improvements
      • Refactored batch ingest
  • Future enhancements
      • Ingest (batch and individual) performed by library staff
      • Editing capability
  • Future Use Cases
      • Faculty scholarship, electronic theses and dissertations
      • Electronic records and other born-digital content
      • Datasets
      • Image library for teaching / learning

Questions?

Jim Coble
jim.coble@duke.edu
Digital Repository Services
Duke University Libraries
Project: https://github.com/duke-libraries/dul-hydra
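
The transcript describes two mechanical steps that a short example may make more concrete: the initial fixity check recorded at ingest, and the export set that bundles content files into a zip file with a README manifest. The sketch below is illustrative only and is not taken from the dul-hydra project, which is built on the Hydra framework rather than Python. The function names (`sha256_of`, `fixity_check`, `build_export_set`), the choice of SHA-256, the `README.txt` file name, and the tab-separated manifest layout are all assumptions made for this example.

```python
import hashlib
import zipfile
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def fixity_check(path, expected_digest):
    """Return True if the file's current checksum matches the recorded one.

    Hypothetical stand-in for the "Initial Fixity Check" event; the real
    repository records the outcome as a PREMIS event, which is not modeled here.
    """
    return sha256_of(path) == expected_digest


def build_export_set(items, zip_path):
    """Bundle content files into a zip file with a README manifest.

    `items` is a list of (file_path, title) pairs. The title stands in for
    the "basic metadata" mentioned in the slides (an assumption).
    Files are stored under their base names, so this sketch assumes the
    base names in one export set are unique.
    """
    manifest = ["Export set contents", ""]
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as archive:
        for file_path, title in items:
            name = Path(file_path).name
            archive.write(file_path, arcname=name)
            manifest.append(f"{name}\t{title}")
        archive.writestr("README.txt", "\n".join(manifest) + "\n")
    return zip_path
```

A staff-facing workflow would call `build_export_set` with the files behind a bookmark list and hand the resulting zip file to the patron; a periodic job could call `fixity_check` against the digest stored at ingest to detect silent corruption.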
