View
12
Download
0
Category
Preview:
Citation preview
Jane Greenberg (janeg@email.unc.edu)
Metadata Research Center/School of Info. + Lib. Sci.
University of North Carolina at Chapel Hill
Beyond Zebra: Taking RDA
beyond MARC
ALA Annual Conference
June 25, 2011
Overview
1. MARC moment
2. Bibliographic universe world of information
output
– Visual aids
3. DRYAD a case study
– Introduce Dryad
– Why not MARC
– RDA potential
4. Concluding remarks
A MARC Moment
MARC MARC Authority Sources MARBI Concise MARC format
MARC Forum (listserv) MARC Relators MARC FAQ
Unicode-MARC Forum MARC-XML Understanding MARC
Bibliographic universe ≈??
Traditional Evolved/evolving+
World of information output
data, information, knowledge… World of recorded knowledge
Bibliographic entities Information objects
Books
Sound recordings
Images
Archives
Music
Bibliographic entities
People
Activities/events
Data
Relationships
Places
RDF graphs,
graph theory, information, links,
relationship,
context…
from IA3 (Adaptive
Information, Adaptive
Innovation, Adaptive
Infrastructure)
Mike Bergman http://www.mkbergman.com
Also from
IA3, Mike
Bergman
Questions…and the Dryad repository
Graph images are nice, but how to we get there?
Where does RDA fit in?
Consequences?
ALA Annual Conference 2011
Enter Dryad…
- What is Dryad?
- Why not MARC?
10
Data underlying peer-reviewed articles in the
basic and applied biosciences
As of Jun 25, 2011, Dryad contains 769 data packages and
1856 data files, associated with articles in 81 journals
ALA Annual Conference 2011
Why not MARC?
- automatic propagation of metadata
- author generated metadata (low burden)
- handshaking/linking and sharing metadata
- promoting data-reuse, and tracking it
~ versioning
From: managing.editor@molecol.com
Date: April 19, 2011 3:09:22 PM EDT
To: Author
Cc: journal-submit@datadryad.org
Subject: Dryad entry for MEC-11-0140.R1
Dear Author
Many thanks for agreeing to participate in the Dryad project. To upload your data, please click the link below- it will take you directly to your entry in the Dryad database.
http://datadryad.org/submit?journalID=MolEcol&manu=223330
<deleted text>
Once you have uploaded your data please include the Dryad identifier in your manuscript. Please let me know if you have any questions about this process.
All the best,
Tim Vines,
Managing Editor, Molecular Ecology
Pre-populated
metadata
field
DATA
FILE
DATA PACKAGE METADATA
ARTICLE METADATA
ARTICLE
GENBANK
OBJECT
TREEBASE
OBJECT
DRYAD NOT
DRYAD
Data file
identifier
Metadata describing data package
recorded here
Related article citation displayed on
package page
Genbank ID
Tree Base ID
URL
Metadata describing article
recorded here
Article identifier (usually DOI)
OTHER
OBJECT
Data package identifier
DATA FILE METADATA
metadata describing data
file recorded here
DATA
FILE
Data file
identifier
Data package identifier
DATA FILE METADATA
metadata describing data
file recorded here
ALA Annual Conference 2011
Results of a keyword search in Dryad
<?xml version="1.0" encoding="UTF-8" ?>
- <rdf:RDF xmlns="http://datadryad.org/" xmlns:rdf="http://www.w3.org/1999/02/22-
rdf-syntax-ns#" xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#">
- <rdf:Description rdf:about="http://hdl.handle.net/10255/dryad.82">
<dc:title>Data from: Hunting to extinction: biology and regional economy
influence extinction risk and the impact of hunting in artiodactyls</dc:title>
<dc:creator>Price, Samantha A.</dc:creator>
<dc:creator>Gittleman, John L.</dc:creator>
<dc:subject>phylogenetic comparative methods</dc:subject>
<snip>
<dc:description>Half of all artiodactyls (even-toed hoofed mammals) are…
<dc:publisher>Royal Society Publishing</dc:publisher>
<dc:date>2008-02-27T17:42:57Z</dc:date>
<dc:relation>Proceedings of the Royal Society</dc:relation>
<snip>
<dc:relation>doi:10.1098/rspb.2007.0505</dc:relation>
<dc:relation>http://purl.org/phylo/treebase/phylows/study/T
B2:S1271?format=html</dc:relation> </rdf:Description>
- <rdf:Description rdf:about="http://hdl.handle.net/10255/dryad.234">
<dc:title>Data from: Towards a worldwide wood economics spectrum</dc:title>
<dc:creator>Zanne, Amy E.</dc:creator>
<rdf:RDF
xmlns:dryadt="http://rio.cs.utep.edu/ciserver/ciprojects/s
data/DryadTypes.owl#"
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-
ns#" xmlns:owl="http://www.w3.org/2002/07/owl#"
xmlns:xsd="http://www.w3.org/2001/XMLSchema#"
xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" >
<rdf:Description rdf:nodeID="A0"> <wdo:hasOutput
rdf:resource="http://rio.cs.utep.edu/ciserver/ciprojects/CI
Miner/ciminer-workflow.owl#i20"/> <rdf:type
rdf:resource="http://rio.cs.utep.edu/ciserver/ciprojects/sd
ata/DryadTypes.owl#DCType"/> </rdf:Description>
EN"></rdfs:comment> </rdf:Description> </rdf:RDF> ….
Openlink Data Explorer (ODE) / LOD4DataONE
https://notebooks.dataone.org/lod4dataone/
Aida Gandara, Data One
Data reuse and data object
relationships
Equivalence Derivative
Whole-part
Sequential
A (=same
data set
on paper)
A (=data
set in
Excel)
A (=same
data set
in SAS)
A1 (=part 1
of a data set)
C
(=data set
A revised)
B (=data
set A annotated)
A
(=data set)
A (=data set)
A1 (=a subset
of A)
A2 (=part 2
of a data set)
DataCite, ver 2.1 (RDA vocabulary) http://www.datacite.org/schema/DataCite-MetadataKernel_v2.1.pdf
dcterms:relation
dcterms:conformsTo:
dcterms:isReferencedBy
dcterms:references
dcterms:isVersionOf
dcterms:hasVersion
dcterms:isFormatOf
dcterms:hasFormat
dcterms:isPartOf
dcterms:hasPart
dcterms:isReplacedBy dcterms:replaces
dcterms:source
RDA at play
- Data sets are
works
- Authors are
entities /
ORCID
ALA Annual Conference 2011
Back to…Why not MARC?
- automatic propagation of metadata
- author generated metadata (low burden)
- handshaking/linking and sharing metadata
- promoting data-reuse, and tracking it
~ versioning
Baker, T. (2007), Singapore Framework
Dryad DCAP (Dublin Core
Application Profile), ver. 3.0 https://www.nescent.org/wg/dryad/images/8/8b/Dryad3.0.pdf
bibo (The Bibliographic
Ontology)
dcterms (Dublin Core terms)
dryad (Dryad) (property:
Dryadstatus)
DwC (Darwin Core) Simple: automatic metadata gen;
heterogeneous datasets
Interoperable: harvesting, cross-system
searching
Semantic Web compatible: sustainable;
supporting machine processing
Data-package centric
2 pronged approach ~ DDpace
(Greenberg, et al, 2009)
Next steps: Alignment with Dryad-UK scheme
(Shotton, et al, 2011)
Map to DataCite; ORCID
Concluding remarks
Alignment of research and
implementation goals (more
immediate needs may not be
the most interesting,
vice/versa)
– Priorities, language barriers,
large team
Infrastructure not “fully”
there; planning for the
future
Synergy between implementation and research (a live lab)
Preparing for new potential
Seeing some benefits…
Intellectually exciting
Challenges
Pros, Benefits
Many people and organizations to acknowledge
Dryad Consortium Board, journal partners, and data authors: NESCent: Kevin Clarke, Hilmar Lapp, Heather Piwowar, Peggy Schaeffer, Ryan
Scherle, Todd Vision UNC-CH <Metadata Research Center>: Jose R. Pérez-Agüera, Sarah Carrier,
Elena Feinstein, Jane Greenberg, Lina Huang, Robert Losee, Hollie White, Craig Willis
U British Columbia: Michael Whitlock / NCSU Digital Libraries: Kristin Antelman HIVE: Library of Congress, USGS, and The Getty Research Institute; and
workshop hosts Yale/TreeBASE: Youjun Guo, Bill Piel DataONE: Rebecca Koskela, Bill Michener, Dave Veiglais, Aida Gandara, and
many others British Library: Lee-Ann Coleman, Adam Farquhar, Brian Hole Oxford University: David Shotton Atmire.com: Mark Diggory
http://datadryad.org http://blog.datadryad.org http://datadryad.org/wiki http://code.google.com/p/dryad Facebook: Dryad Twitter: @datadryad
Recommended