View
105
Download
1
Embed Size (px)
DESCRIPTION
Latest trends in Data Analysis for the Scholarly and Academic Publishing Community by Lee-Ann Coleman, PhD, Head of Science, Technology and Medicine, The British Library for the October 16, 2013 NISO Virtual Conference: Revolution or Evolution: The Organizational Impact of Electronic Content.
Citation preview
A National Library: Playing a role in data
Lee-Ann Coleman PhD
Head of Science, Technology & Medicine
@ScienceBL
www.bl.uk 2
Custodians of old books
www.bl.uk 3
National library of the UK
Here for everyone who wants to do research
Archiving since 1662
Provide access to 45k eJournals & newspapers, eBooks, datasets &
800 bibliographic databases
2M sound recordings, 4M maps, 5M reports, theses, conference papers, the world’s largest patents collection (c.50M) & 8M stamps
Legal deposit incl. non-print publications (from April 2013)
Print occupies > 600km shelving
300TB of data in the Digital Library
And we are embracing digital
www.bl.uk 4
Science team
Catering for contemporary science
Developing services Engaging and inspiring
Collaborations & PartnershipsResearch
Managing collectionsDelivering new content
www.bl.uk 5
Information lifecycle
www.bl.uk 6
The value of research data
• Data are a vital part of the scientific record
• But what is/should be/will be the role of libraries in this changing landscape?
• Data as a format is very different from traditional library content, so are libraries equipped with the knowledge, technology and capacity to deal with it?
• How should libraries prepare for this?
We examined the landscape of data and assessed the services that the British Library might offer
www.bl.uk 7
Testing dataset discovery
A service involving a ‘new’ material type raised questions about:
• Users• Selection• Metadata• Operational sustainability
Preliminary work:• Studies conducted on our behalf• Literature review of user behaviour• Internal scoping to define suitable
processes and systems
Lead to a pilot service, using existing systems
SD
AS
M A
rchi
ves.
Pub
lic D
omai
n V
ia F
lickr
www.bl.uk 8
Selection criteria
These considered:
Scope: Subject Value to research
Access: Restrictions Stability Copyright
Quality: Creators Publishers
www.bl.uk 9
Datasets discovery in Explore the British Library
9
>500 research datasetsEnvironmental ScienceTropical & Rare Diseases
www.bl.uk 10
Sub-disc
ipline
Publisher /
creator
Access
restr
ictions
Usage re
stricti
ons
Methodology
File fo
rmats
/ media
Tools / f
unctions
Geographic
cove
rage
Temporal c
overa
ge0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
Metadata for SEARCH
Results
• Usage statistics suggest the service was used to find research data
October
'10
November
December
January
'11
Febru
aryMarc
h
April '11
MayJune
July '11
August
Septem
ber
October
'11
November
December
January
'12
0.0
10.0
20.0
30.0
40.0
50.0
60.0
70.0
80.0
% conversion from dataset view to click through
• A wide variety of approaches were used to search
www.bl.uk 11
The benefits of citing data
• Checking facts
• Obtaining easier access to data
• Enabling re-use of data
• Providing acknowledgement to a wider group – the data centre, curators etc.
• Supporting openness and transparency
Reich NG, Perl TM, Cummings DAT, Lessler J (2011) Visualizing Clinical Evidence: Citation Networks for the Incubation Periods of Respiratory Viral Infections. PLoS ONE 6(4): e19496. doi:10.1371/journal.pone.0019496
www.bl.uk 12
Why finding and citing data is not easy
• No widely used method to identify datasets
• No widely used method to cite datasets
• No effective way to link between articles and datasets
• How can we solve these challenges?
www.bl.uk 13
The Digital Object Identifier is a persistent identifier that directs users to an online object, even if it changes location.
Why DOIs?
• Most widely used identifier for research articles
• Researchers and publishers already know how to use them
• Puts datasets on the same playing field as articles
• The DOI system offers an easy way to connect the article with the underlying data
Why DOIs?
www.bl.uk 14
DataCite
• Established in 2009 as a not-for-profit organisation
• A member of the International DOI Foundation
• A Registration Agency for DOI names
• 18 full members from Europe, North America, Asia and Australia (2m DOIs)
• Members work with data centres in their own countries
• Provide a shared infrastructure for minting DOIs
www.bl.uk 15
British Library's role in DataCite
International DOI Foundation
DataCite
MemberInstitution
Data CentreData CentreData Client
Member
• The British Library is one of 18 international members of DataCite
• We are an allocating agent
• We provide the DataCite infrastructure, enabling UK Data Centres to ‘mint’ DOIs for data
• While the aim is to support researchers, we do not work with individuals - they must deposit to a data centre/institution
www.bl.uk 17
Examples of UK data centres with DOIs
DOI:10.5285/1a91c7d1-ec44-4858-9af2-98d80f169bbd