19
New Century, New Metadata Thomas Krichel http://openlib.org/home/krichel University of Surrey, Hitotsubashi University and Long Island University

New Century, New Metadata Thomas Krichel University of Surrey, Hitotsubashi University and Long Island University

Embed Size (px)

Citation preview

Page 1: New Century, New Metadata Thomas Krichel  University of Surrey, Hitotsubashi University and Long Island University

New Century, New Metadata

Thomas Krichelhttp://openlib.org/home/krichel

University of Surrey, Hitotsubashi University and Long Island University

Page 2: New Century, New Metadata Thomas Krichel  University of Surrey, Hitotsubashi University and Long Island University

Why Metadata

FunInformation retrievalSupport organization of social process

Page 3: New Century, New Metadata Thomas Krichel  University of Surrey, Hitotsubashi University and Long Island University

Crisis of Author Self-archiving

Formal archivingSmallMetadata poor

Informal archivingInformation retrieval difficultLack of support infrastructure

Page 4: New Century, New Metadata Thomas Krichel  University of Surrey, Hitotsubashi University and Long Island University

Improving formal archiving

Strengthen the metadata provisionBroaden the mission of archivingAllow usage of archived material in many

user servicesBetter report on archive material usageStrengthen the relationship with overlay

services

Page 5: New Century, New Metadata Thomas Krichel  University of Surrey, Hitotsubashi University and Long Island University

Improving Informal Archiving

Build standardized metadata supply format

Harvest that metadata into larger digital libraries

Offer archival backup for papers

Page 6: New Century, New Metadata Thomas Krichel  University of Surrey, Hitotsubashi University and Long Island University

Metadata to Support Self-archiving

Simple to composeIntuitive vocabulary that is specific to the

academic process, e.g. “author” instead of “creator”

Widely applicableAll disciplines and publication forms

High quality i.e. controlled

Page 7: New Century, New Metadata Thomas Krichel  University of Surrey, Hitotsubashi University and Long Island University

Metadata Control

Any processing that is done to the metadata before its inclusion in a user service.

Essential in a situation where metadata is harvested.

Page 8: New Century, New Metadata Thomas Krichel  University of Surrey, Hitotsubashi University and Long Island University

Types of Control

Syntactic controlRelational controlRetrieval controlIdentity controlVerity controlAccession control

Page 9: New Century, New Metadata Thomas Krichel  University of Surrey, Hitotsubashi University and Long Island University

Basic Model

Four different record typesDocumentGroupPersonOrganization

Page 10: New Century, New Metadata Thomas Krichel  University of Surrey, Hitotsubashi University and Long Island University

Group and document

There is only one document type.Groups are used to refine the status of

the document.Group construct meant to be defined by

librarians, publishers and other intermediaries.

Page 11: New Century, New Metadata Thomas Krichel  University of Surrey, Hitotsubashi University and Long Island University

Person and Institution

Person and institution admit very similar attributes

It is hoped that organizational information will be contributed by intermediaries.

Page 12: New Century, New Metadata Thomas Krichel  University of Surrey, Hitotsubashi University and Long Island University

Implementation of Basic Model

RePEc100000 documents100 groups (series)500 authors5000 institutions

Examplehttp://ideas.uqam.ca/EDIRC/data/frbgvus.html

Possible to do the same thing for ReLIS

Page 13: New Century, New Metadata Thomas Krichel  University of Surrey, Hitotsubashi University and Long Island University

Basic Grammar

XML syntaxThree groups of XML elements

Nouns: element for items describedAdjectives: elements that describe nounsVerbs: elements that relate nouns

Page 14: New Century, New Metadata Thomas Krichel  University of Surrey, Hitotsubashi University and Long Island University

Modular Design

<person><isauthorof>

<document><ispublishedby>

<organization><hasmember>

<person></person>

</hasmember></organization>

</ispublishedby></document>

</isauthorof></person>

Page 15: New Century, New Metadata Thomas Krichel  University of Surrey, Hitotsubashi University and Long Island University

Relational Design

<person id=“kmarxthered”><email> [email protected]</email> </person>

<document id=“kapital”> <title>Das Kapital</title><hasauthor> <person id=“kmarxthered”/> </hasauthor></document>

Page 16: New Century, New Metadata Thomas Krichel  University of Surrey, Hitotsubashi University and Long Island University

Other features

Lang qualifier to all elements, it ISO 639-1 if there are two letters and the bibliographic variant of ISO 639-2 if three letters.

Nouns have id.Verbs have startdate and enddate

qualifiers, and of course have id.Adjectives can have child elements.

Page 17: New Century, New Metadata Thomas Krichel  University of Surrey, Hitotsubashi University and Long Island University

Remaining Problems

Resolvability rules for identifiersDates and historySubject classification using the group

mechanismAliasing of element names

Page 18: New Century, New Metadata Thomas Krichel  University of Surrey, Hitotsubashi University and Long Island University

To be done…

Complete list of verbs and adjectivesSchema designParsing and validation software.Conversion with test collection ReLIS.

Page 19: New Century, New Metadata Thomas Krichel  University of Surrey, Hitotsubashi University and Long Island University

Collaboration is welcome

Thanks for listening.

Have a happy New Year.