46
Case History: Library of Congress Audio-Visual Prototyping Project METS Opening Day (2003), Revised For the CUL Metadata Working Group July 22, 2004 Carl Fleischhauer

Case History: Library of Congress Audio-Visual Prototyping Project

  • Upload
    adem

  • View
    43

  • Download
    0

Embed Size (px)

DESCRIPTION

Case History: Library of Congress Audio-Visual Prototyping Project. METS Opening Day (2003), Revised For the CUL Metadata Working Group July 22, 2004 Carl Fleischhauer Office of Strategic Initiatives, Library of Congress [email protected]. The AV Project. - PowerPoint PPT Presentation

Citation preview

Page 1: Case History: Library of Congress  Audio-Visual Prototyping Project

Case History:Library of Congress

Audio-Visual Prototyping Project

METS Opening Day (2003), Revised

For the CUL Metadata Working Group

July 22, 2004

Carl Fleischhauer

Office of Strategic Initiatives, Library of Congress

[email protected]

Page 2: Case History: Library of Congress  Audio-Visual Prototyping Project

The AV Project

• Preservation, sense one: reformatting into digital-file form

• Preservation, sense two: sustaining digital objects

• Participation by Motion Picture, Broadcasting, and Recorded Sound Division (M/B/RS) and the American Folklife Center

Page 3: Case History: Library of Congress  Audio-Visual Prototyping Project

Reformatting Documentation

• About the source – original disc or tape being reformatted– <amdSec><sourceMD><AMD audio ext schema>

• About the process– how the copy file was made, what devices/tools– <amdSec><digiProvMD><PMD process ext schema>

• About the outcome– characteristics and features of the copy file– <amdSec><techMD><AMD audio ext schema>

Page 4: Case History: Library of Congress  Audio-Visual Prototyping Project

Diagram of Extension Schemas See also: http://lcweb.loc.gov/rr/mopic/avprot/metsmenu2.html

<mets>

<dmdSec> descriptive metadata <MODS> MODS standard as maintained by LC

<techMD> technical metadata <AMD> audio (file) metadata rolled our own, using data dictionary from AES <MIX> image (file) metadata standard as maintained by LC, data dictionary from NISO

<rightsMD> rights and access mgt metadata <RMD> access “category” metadata rolled our own, just tracking categories

<sourceMD> source metadata <AMD> audio (source) metadata same schema as AMD above <MIX> image (source) metadata same schema as MIX above

<digiProv> digital provenance metadata <PMD> digital provenance metadata rolled our own, data dictionary from AES, with some simplifications

<behaviorSec> behavior section did not use will METS profiles play this role?

<fileGrp> file group (inventory) from METS proper

<structMap> structural map from METS proper

</mets>

Page 5: Case History: Library of Congress  Audio-Visual Prototyping Project

PRODUCERS

ADMINISTRATION

DATAMANAGEMENT

ARCHIVALSTORAGE

INGEST ACCESS

CONSUMERS

PRESERVATION PLANNING

Reference Model for an Open Archival Information System (OAIS)

SIPs (Submission Information Packages) will be

produced by the AV preservation activity, ready to

submit to LC’s future digital repository.

Page 6: Case History: Library of Congress  Audio-Visual Prototyping Project
Page 7: Case History: Library of Congress  Audio-Visual Prototyping Project

AV Project Web Site Home Page http://lcweb.loc.gov/rr/mopic/avprot/

Page 8: Case History: Library of Congress  Audio-Visual Prototyping Project

AV Project Extension Schema Page http://lcweb.loc.gov/rr/mopic/avprot/metsmenu2.html

Page 9: Case History: Library of Congress  Audio-Visual Prototyping Project

AV Project Initial Data Capture System

MS-Access Database - Collation Input Screen

Page 10: Case History: Library of Congress  Audio-Visual Prototyping Project

Top level: work

Second level: sound recordings

Third level: disc sides

Fourth level: cuts

Page 11: Case History: Library of Congress  Audio-Visual Prototyping Project

Recorded Sound Processing Section

Content selected for reformatting

1. Initial creation or copying-in of metadata

Workflow Sidebar

Page 12: Case History: Library of Congress  Audio-Visual Prototyping Project

Recorded Sound Processing Section

Content selected for reformatting

1. Initial creation or copying-in of metadata

LC Recording Lab or offsite contractor

Scanning activity

2. Creation of second layer of metadata

Workflow Sidebar

Page 13: Case History: Library of Congress  Audio-Visual Prototyping Project

Recorded Sound Processing Section

Content selected for reformatting

1. Initial creation or copying-in of metadata

LC Recording Lab or offsite contractor

Scanning activity

2. Creation of second layer of metadata

3. Return loop to processing, edit and possible addition of third layer of metadata

Workflow Sidebar

Page 14: Case History: Library of Congress  Audio-Visual Prototyping Project

The AV METS System Today

Page 15: Case History: Library of Congress  Audio-Visual Prototyping Project

OUTCOME ONE: A VIRTUAL DIGITAL OBJECT (SIP)

Logical storage structure based in a UNIX filesystem

master -- family of logical directories where the master files are stored (there is a parallel set of “service” directories)

afc -- “owner” is the American Folklife Center

afc1941001 -- group or aggregate of items, often from an actual collection

sr05 -- item directory (at the level of the digital object, counterpart to a bib record or “line” in a finding aid)

sr05am.wav -- the master file for side A of this disc

sr05am.wav -- the master file for side B of this disc

Index of master/afc/afc1941001/sr05

Page 16: Case History: Library of Congress  Audio-Visual Prototyping Project

OUTCOME ONE: VIRTUAL DIGITAL OBJECT

The fileGrp segment of a METS instance “binds” the object

Includes logical pathnames for files, future switch to persistent names possible.

Page 17: Case History: Library of Congress  Audio-Visual Prototyping Project

OUTCOME 2: PRESENTATION OF OBJECT

Presentation in Browser

Page 18: Case History: Library of Congress  Audio-Visual Prototyping Project

Zoom on Image in Presentation

Page 19: Case History: Library of Congress  Audio-Visual Prototyping Project

Interim username/password access management

Page 20: Case History: Library of Congress  Audio-Visual Prototyping Project

In the Presentation: Metadata Map for the Dedicated

Page 21: Case History: Library of Congress  Audio-Visual Prototyping Project

sourceMD data from the Metadata Map

Extension schema content displayed as name-value pairs

Page 22: Case History: Library of Congress  Audio-Visual Prototyping Project

Generator takes data from the database and makes METS XML

Page 23: Case History: Library of Congress  Audio-Visual Prototyping Project

Snapshot of the database back end

Page 24: Case History: Library of Congress  Audio-Visual Prototyping Project

Selection from the database diagram: tables for METS id, agent information, and structMap data

Page 25: Case History: Library of Congress  Audio-Visual Prototyping Project

Selection from the database diagram: tables for extension schema data for image source, video source, and audio source

Page 26: Case History: Library of Congress  Audio-Visual Prototyping Project

Selection from the database diagram: tables for digiProv (“digitization process”) information

Page 27: Case History: Library of Congress  Audio-Visual Prototyping Project

Builder: the data-entry front end to the database

Page 28: Case History: Library of Congress  Audio-Visual Prototyping Project

Builder: template making tool

Page 29: Case History: Library of Congress  Audio-Visual Prototyping Project

Builder: tool to shape a structMap using indent, outdent, up, and down. May be used in both template and individual object modes.

Page 30: Case History: Library of Congress  Audio-Visual Prototyping Project

“Cut wizard” – a twenty more like this one tool

Page 31: Case History: Library of Congress  Audio-Visual Prototyping Project

Part of MODS descriptive data for a recorded interview with a former enslaved person.

Page 32: Case History: Library of Congress  Audio-Visual Prototyping Project

File Association Tool

Page 33: Case History: Library of Congress  Audio-Visual Prototyping Project

Tool to append a MODS record

Page 34: Case History: Library of Congress  Audio-Visual Prototyping Project

Two samples from the MODS entry and editing tool.

+ repeats the section

x and – delete sections or subsections

Page 35: Case History: Library of Congress  Audio-Visual Prototyping Project

Selection from the online data dictionary

Page 36: Case History: Library of Congress  Audio-Visual Prototyping Project

Some METS objects, by title

Page 37: Case History: Library of Congress  Audio-Visual Prototyping Project

Administration Tool Menu

Page 38: Case History: Library of Congress  Audio-Visual Prototyping Project

Example of data entry screen

Blue terms are used to select separate data entry screens

Page 39: Case History: Library of Congress  Audio-Visual Prototyping Project

Some Shortcomings

• Cumbersome data entry – many screens, many actions

• Bugs – hard to get them all fixed now that the contractor is gone

• Best if users understand METS and the structMap – barrier to entry for new team members

• Does not include tools for bulk compilation from pre-existing data

Page 40: Case History: Library of Congress  Audio-Visual Prototyping Project
Page 41: Case History: Library of Congress  Audio-Visual Prototyping Project

Distributed Data Entry

• Each team enters its own data in less cumbersome “local” tools

• Tool for descriptive data, especially copying in and out of the ILS

• Tool for data about the source item and certain technical aspects, copied in and out of MAVIS

• Tool for digiProv data, “the engineers’ form”• Tool or a MAVIS extension to encode the

structMap

Page 42: Case History: Library of Congress  Audio-Visual Prototyping Project

Supporting Tools

• Approach being discussed– Dispersed tools produce XML outputs– Centralized tool gathers and compiles the

various XML data units into a METS instance– Downstream facility to manage the METS

XML documents

Page 43: Case History: Library of Congress  Audio-Visual Prototyping Project

Supporting Concept• METS profiles

– LC implementation in early development by Morgan Cundiff

• Rationale– METS is very flexible, need to narrow use within

an organization or community– Profiles establish limits that make for more

efficient tool-building and more efficient work– Profile-governed objects will enhance

interoperability between repositories

Page 44: Case History: Library of Congress  Audio-Visual Prototyping Project

METS Profile for “simple phonodisc”

• Relatively simple object but profile with some detail

• May evolve into a more general profile for a wider range of phonodiscs

• For now: agnostic about administrative metadata

Page 45: Case History: Library of Congress  Audio-Visual Prototyping Project

Show examples

Page 46: Case History: Library of Congress  Audio-Visual Prototyping Project

METS Profile for “simple phonodisc”

• Limit to discs partly for management of semantics

• Apply judgment to find the right point between specific and general– not much experience yet

• History of reformatting may inhibit our imagination – we still are using terms that fit the source object and not the digital object