20
Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in crate looking up, photograph, ca. 1905. Harvard University Archives HUK 363 p (Fig. 8) ]

Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in

  • View
    216

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in

Keeping the pieces together: The Role of METS in the Preservation of Digital Content

Robin WendlerHarvard University LibraryJanuary 16, 2005

[Men in crate looking up, photograph, ca. 1905. Harvard University Archives HUK 363 p (Fig. 8) ]

Page 2: Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in

Standards everywhere, but nothing there for me…

• As of 2000– A plethora of descriptive metadata standards– Emerging standards for digital conversion, BUT– No open standard for representing a digital object

• Display and navigation• Archiving• Exchange / Transport

– Digital repositories managing files, not objects

Page 3: Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in

OAIS Functional Entities

Page 4: Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in

OAIS Information Model

Page 5: Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in

METS to the rescue

• Metadata Encoding and Transmission Standard– An XML schema for encoding descriptive,

administrative, and structural metadata about a digital object

– http://www.loc.gov/standards/mets/

– An initiative of the Digital Library Federation– Development: METS Editorial Board– Principle author: Jerome McDonough, NYU– Website maintained by: Library of Congress

Page 6: Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in

METS Basics

• METS provides a framework for– Content files– Metadata– Relationships

• Suitable for – Open Archival Information Systems

• Archival information package (AIP)• Submission information package (SIP)• Dissemination information package (DIP)

– Display and navigation of digital objects– Sharing of digital objects among libraries and archives

Page 7: Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in

Structure of a METS File

METS

metsHdr

structMap

admSec

dmdSec

fileSec

structLink

behaviorSec*

Inventory or manifest of component files

Header describing METS file itself

Descriptive metadata

Administrative metadata: -- technical, source, rights, provenance

Structure map: the heart of METS

Structural map linking, i.e., hyperlinks

Executable behaviors

* Not commonly used

Page 8: Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in

Structure Map

Title pagePreface page i page iiChapter 1 page 1 page 2…

<div LABEL=“Title page”><div LABEL=“title page” ORDER=“1” TYPE= <fptr FILEID=“A”></div>

<div LABEL=“Preface”><div LABEL= “page i” ORDER=“2” ORDERLABEL=“i”> <fptr FILEID=“B”></div><div LABEL= “page ii” ORDER=“3”> <fptr> FILEID=“C”></div>

<div LABEL=“Chapter 1”><div LABEL=“page 1” ORDER=“4”> <fptr FILEID=“D”></div><div LABEL=“page 2” ORDER=“5”> <fptr FILEID=“E”>…

Page 9: Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in
Page 10: Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in
Page 11: Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in

Referring to Metadata

METS

metsHdr

structMap

admSec

dmdSec

fileSec

structLink

behaviorSec

METS does not define descriptive or administrative metadata elements. dmdSec and admSec are buckets or sockets where externally-defined metadata can be supplied or referenced

Three techniques:

• In-line XML• Wrapped base-64 encoded data• Pointers to external information (e.g., URNs, handles)

METS Board endorses range of recommended “extension schemas”

Page 12: Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in
Page 13: Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in
Page 14: Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in

Book Chapter 1 page 1 page 2…

<div LABEL=“Reports of the president and treasurer” DMDID=“D1”> <div LABEL=“Chapter 1” DMDID=“CH1”>

<div LABEL=“page 1” ORDER=“3”><fptr FILEID=“D” >

<div LABEL=“page 2” ORDER=“4”><fptr FILEID=“E”>…

<dmdSec ID=“D1” ><mdWrap MDTYPE="MODS">   <xmlData>    <mods:mods xmlns:mods="http://www.loc.gov/mods/v3" xsi:schemaLocation=http://www.loc.gov/mods/v3 …>     <mods:name>      <mods:displayForm> Radcliffe College</mods:displayForm>     </mods:name>     <mods:titleInfo>      <mods:title> Reports of the president and treasurer for...</mods:title>     </mods:titleInfo>     </mods:mods>   </xmlData>  </mdWrap> <mdRef LOCTYPE=“URL” MDTYPE=“MARC” xlink:href=http://... BNI3165”/>

Use of MODS Extension Schema for Descriptive Metadata

Catalog record

Catalog record

Page 15: Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in

Referring to Content Files

METS

metsHdr

structMap

admSec

dmdSec

fileSec

structLink

behaviorSec

Digital Content can exist inside or outside a METS file.

Three techniques:

• In-line XML• Wrapped base-64 encoded data• Pointers to external information (e.g., URNs, handles)

Page 16: Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in

Structure Map

Chapter 1 page 1 page 2…

<div LABEL=“Chapter 1”><div LABEL=“page 1” ORDER=“3”>

<fptr FILEID=“D” ><div LABEL=“page 2” ORDER=“4”>

<fptr FILEID=“E”>…

<fileSec> <fileGrp> <file ID=“D" ADMID=“TMD01” MIMETYPE="image/jpg“>

<FLocat LOCTYPE="URL" xlink:href="http://nrs.harvard.edu..." />

</file>

<amdSec> <techMD ID=“TMD01”> <mdWrap MDTYPE="NISOIMG" LABEL="Service Copy Technical Metadata"> <xmlData> <mix:mix> <mix:BasicImageParameters> <mix:Format> <mix:MIMEType>image/jpg</mix:MIMEType> <mix:ByteOrder>big-endian</mix:ByteOrder>

<…

Use of MIX Extension Schema for Image Technical Metadata

Page 17: Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in

Profiles

• Challenge: – METS is very flexible– Flexibility allows variant practices– Variant practices undermine interoperability

• Response:– Create profiles: documented ways of using METS

• Profiles constrain practice (this is a good thing)

• Specify required structures, extension schemas, vocabularies, etc.

– Communities of interest develop and register shared profiles – XML schema for METS profiles

• Human-readable, not machine-actionable

Page 18: Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in

Adoption

• Library of Congress• California Digital Library• Harvard University Library• Oxford University• MIT/DSpace• National Library of Wales• Stanford University Library• Indiana University Library• University of California,

Berkeley

• University of Chicago Library• University of Graz, Austria• Florida Center for Library

Automation• Göttingen State and University

Library• OCLC Digital Archive• RLG Cultural Materials• Philadelphia Museum of Art• University of Alberta• … among others

Used By:

Page 19: Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in

Benefits

• Modeling the whole object, not just files

Open standard+ XML encoding + Growing base of tools----------------------------------= Manageable, sharable, preservable digital

objects

Page 20: Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in

Thank You!

Robin Wendler

[email protected]

Baker Library, Harvard Business School, Historical CollectionsTC6512.0001:6512