Metadata Reuse Workflows and Methods for DSpace Repositories

Maureen P. Walsh
Open Repositories 2013
Charlottetown, PEI, July 12, 2013

Metadata Repurposing Workflows

• Background / Context
• Metadata Services and Archiving Options

• Metadata Repurposing Overview
• MARC Catalog Metadata
• Embedded Image Metadata
• EAD Finding Aid Metadata
• Printed Text Metadata

The Ohio State University’s Institutional Repository

Knowledge Bank Mission: …to collect, preserve, and distribute the digitally formatted intellectual output of the University…

• 76 Communities
• 50,733 Items
• 108,440 Content Files

Knowledge Bank Archived Items

KB Metadata Application Profile

• Core set of metadata elements and Dublin Core Metadata Element Set mappings to
  • improve retrieval accuracy and resource discovery
  • facilitate multi-institutional interoperability and quality control
  • comply with the Open Archives Initiative Protocol for Metadata Harvesting
  • enable collection migration, import & export between the Knowledge Bank and other systems as necessary

KB Collection Core Element Set

Individual Item Submission

Customized Input Forms (Display)

Customized Input Forms (XML)

Customized Item Templates (Display)

Customized Item Templates (Dublin Core)

Importing Items in Batch

Batch Loading – DSpace Simple Archive Format

Batch Loading – Metadata CSV

Batch Loading – Dublin Core

DSpace Dublin Core XML

DSpace Item Record

Building the Simple Archive Format

Custom Perl Scripts
• Examples: http://hdl.handle.net/1811/46845

Stand-alone Java Tool
• Simple Archive Format Packager / SAFBuilder: https://wiki.duraspace.org/display/DSPACE/Simple+Archive+Format+Packager
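As a rough illustration of what these tools produce, the sketch below writes one Simple Archive Format item directory: a dublin_core.xml file, a contents file listing the bitstreams, and the files themselves. It is written in Python rather than Perl or Java and is not taken from the linked scripts; the function names, the item directory name, and the sample values are all hypothetical.

```python
from pathlib import Path
from xml.etree import ElementTree as ET


def dublin_core_xml(fields):
    """Serialize (element, qualifier, value) tuples as a dublin_core.xml body."""
    root = ET.Element("dublin_core")
    for element, qualifier, value in fields:
        # DSpace expects qualifier="none" when an element is unqualified.
        dcvalue = ET.SubElement(
            root, "dcvalue", element=element, qualifier=qualifier or "none"
        )
        dcvalue.text = value
    return ET.tostring(root, encoding="unicode")


def write_saf_item(archive_dir, item_name, fields, bitstreams):
    """Write one item directory (hypothetical helper, not a DSpace API).

    bitstreams maps each filename to its bytes.
    """
    item_dir = Path(archive_dir) / item_name
    item_dir.mkdir(parents=True, exist_ok=True)
    (item_dir / "dublin_core.xml").write_text(
        dublin_core_xml(fields), encoding="utf-8"
    )
    # The contents file lists one bitstream filename per line.
    (item_dir / "contents").write_text(
        "".join(f"{name}\n" for name in bitstreams), encoding="utf-8"
    )
    for name, data in bitstreams.items():
        (item_dir / name).write_bytes(data)
    return item_dir
```

Calling write_saf_item once per record, with item names such as item_000 and item_001, would yield a directory tree of the kind the DSpace batch importer reads.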

Repurposing MARC Metadata

MARC - Catalog

Dublin Core - IR

• XSLT Workflow
• Export Tab Delimited Records Workflow

XSLT Workflow

[Truncated] XSLT

MarcEdit by Terry Reese

Full example available at: http://hdl.handle.net/1811/47564

Export Tab Delimited Records Workflow

MarcEdit by Terry Reese

MarcEdit CSV Export

Batch Load CSV
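The step from a tab-delimited MARC export to a batch-loadable CSV can be sketched as follows. This is a minimal Python illustration under stated assumptions, not the workflow shown on the slides: the column headers (245$a and so on), the MARC-to-DC crosswalk, the ";" delimiter for repeated fields, and the collection handle are all hypothetical. The "+" in the id column and the "||" separator between repeated values follow the DSpace metadata CSV conventions for new items.

```python
import csv
import io

# Hypothetical crosswalk: export column header -> DSpace metadata field.
MARC_TO_DC = {
    "245$a": "dc.title",
    "100$a": "dc.creator",
    "260$c": "dc.date.issued",
    "650$a": "dc.subject",
}


def marc_tab_to_dspace_csv(tab_text, collection_handle, repeat_delimiter=";"):
    """Convert tab-delimited text to DSpace metadata CSV text."""
    reader = csv.DictReader(io.StringIO(tab_text), delimiter="\t")
    out = io.StringIO()
    fieldnames = ["id", "collection"] + list(MARC_TO_DC.values())
    writer = csv.DictWriter(out, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    for record in reader:
        # "+" asks DSpace to create a new item in the named collection.
        row = {"id": "+", "collection": collection_handle}
        for marc_col, dc_field in MARC_TO_DC.items():
            raw = record.get(marc_col) or ""
            values = [v.strip() for v in raw.split(repeat_delimiter) if v.strip()]
            # DSpace separates repeated values within one cell with "||".
            row[dc_field] = "||".join(values)
        writer.writerow(row)
    return out.getvalue()
```

Personal names and titles that contain commas are quoted by the csv module on output, so no manual escaping is needed before the batch load.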

Repurposing Embedded Image Metadata

Embedded Metadata Workflow

Adobe Photoshop

Extracting Metadata with ExifTool

ExifTool by Phil Harvey http://owl.phy.queensu.ca/~phil/exiftool/

Adobe Photoshop

ExifTool CSV Export

ExifTool CSV Export Mapping to DC
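One way to picture the mapping step is a small crosswalk over the exported rows. The Python sketch below assumes a CSV whose first column is SourceFile, as ExifTool's CSV output provides; the tag-to-DC mapping and the function name are hypothetical and are not the mapping used in the presentation. Columns with no mapping are reported rather than silently dropped, so the crosswalk can be reviewed before loading.

```python
import csv
import io

# Hypothetical mapping of embedded-metadata tag names to Dublin Core fields.
EXIF_TO_DC = {
    "Title": "dc.title",
    "Creator": "dc.creator",
    "Description": "dc.description",
    "Subject": "dc.subject",
    "Rights": "dc.rights",
}


def crosswalk_exiftool_rows(csv_text):
    """Return (rows, unmapped): DC-keyed rows and the ignored column names."""
    reader = csv.DictReader(io.StringIO(csv_text))
    unmapped = [
        column
        for column in (reader.fieldnames or [])
        if column != "SourceFile" and column not in EXIF_TO_DC
    ]
    rows = []
    for record in reader:
        # Keep the image path so each row can be matched to its file later.
        row = {"filename": record["SourceFile"]}
        for tag, dc_field in EXIF_TO_DC.items():
            value = (record.get(tag) or "").strip()
            if value:  # skip empty cells instead of writing blank fields
                row[dc_field] = value
        rows.append(row)
    return rows, unmapped
```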

ExifTool ‘Targeted’ CSV Export

Batch Load CSV

Simple Archive Format Packager

Repurposing EAD Finding Aid Metadata

Repurposing EAD Metadata

• XSLT Workflow
• xml2csv Workflow

EAD

Online Finding Aid

XSLT Workflow

<oXygen/> XML Editor

xml2csv Workflow

A7Soft xml2csv http://www.a7soft.com/xml2csv.html

Repurposing Printed Text Metadata

Delimited Text Workflow

PSPad

Delimited Result

Thank You

Maureen P. Walsh
Associate Professor
Institutional Repository Services Librarian
The Ohio State University Libraries
walsh.260@osu.edu