Upload
others
View
0
Download
0
Embed Size (px)
Citation preview
Leslie Goh
TCI-TRF-Scopus Symposium
OPEN DATA, FAIR DATA
Where do we stand
|2
9 Billion Dollars Lost Overnight
|
|
|
|
Source: Scopus data as of July 9 2019, Scholix data extracted on May 5, 2019 – CAGR = Compound Annual Growth Rate
Research Data Management adoption is growing very fast worldwide
|
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
2016 2017 2018
How often researchers have made their data openly available
Never &rarely
Frequently &sometimes
0
10
20
30
40
50
60
70
80
20
06
20
07
20
08
20
09
20
10
20
11
20
12
20
13
20
14
20
15
20
16
20
17
20
18
Number of UK institutions adopting a data management
policy
Why is Research Data Management (RDM) “hot” now?
Three trends are coming together, leading to an acceleration of RDM practices
1) Computational:▪ Faster, easier, cheaper, more computational
methods of doing science▪ Coming of age of analytics yield new layers of
insight on same data
2) Funding bodies & institutions:▪ Funding agencies driving data sharing mandates
aiming to improve re-use of data and reproducibility of research
▪ Which is followed by institutions adopting data management policies and plans
3) Researchers:▪ a new generation of researchers, more focused
on data and data sharing▪ “Research Data is a first class citizen”
Source: http://www.dcc.ac.uk/resources/policy-and-legal/institutional-data-policies
http://v2.sherpa.ac.uk/juliet/
https://figshare.com/articles/The_State_of_Open_Data_Report_2018/7195058 (normalized per year bracket)
Example of trend 2
Examples of trend 3:
How researchers value data citations
Date of first peer reviewed publication
Respondents who value a data citation same or more than an article citation
Respondents who value a data citation less than an article citation
<1990 64% 36%
90s 65% 35%
00s 60% 40%
10s 72% 28%
|
Increasing Focus On Requiring Data
|
Data policies are increasingly being adopted
|
https://www.elsevier.com/authors/author-resources/research-data/data-guidelines
Growing Trends in Data Management Requirements
|11
https://www.nature.com/news/1-500-scientists-lift-the-lid-on-reproducibility-1.19970
Key take-away:“My-own” means that I (and anybody else)
could not re-use my own data
|
The benefits of sharing research data are clear…
Q: To better understand your attitudes towards research data access, please think about the research data that typically is not published (e.g. not summary charts, tables or images), and indicate how much you agree or disagree with the following statements.
Strongly agree/Agree
Neither agree nordisagree/Don’t know
Strongly disagree/Disagree
https://www.universiteitleiden.nl/en/research/research-output/social-and-behavioural-sciences/open-data-the-researcher-perspective
|
Sharing Data Works:
25% Higher Citation Impact
09.01.2020
https://arxiv.org/pdf/1907.02565.pdf
|
The impact of RDM best practices on publications
14Source: SciVal, data extracted on October 30, 2018
|
The impact of RDM best practices on publications
15Source: SciVal, data extracted on September, 2019
|
16
ELSEVIER’S RESEARCH MANAGEMENT DATA SOLUTION – MENDELEY DATA
https://www.elsevier.com/connect/6-insights-from-leading-universities-on-managing-research-data-effectively
Source:
|
For example: research data on
an institutional repository
For example: research data on
subject or domain repository
Private datasetsFor example: data shared privately
by researchers working on a project
Internal datasets
External datasets
The problem: finding the best way to keep track of datasets
|
Example: University of Virginia
18
LibraData data repository launched March 29, 2016: 149 datasets indexed as of 29 Sep 2019
Source: https://dataverse.org/blog/uva-library-launches-%E2%80%9Clibra-data%E2%80%9D%E2%80%94university-virginia-dataverse-repository
https://dataverse.lib.virginia.edu/dataverse/uva
|
19
LibraData
149
External
Repositories
578
This is not at all uncommon: we have found that in most universities more
than 80% of the datasets are published in external data repositories
At the University of Virginia, 80% of the datasets are published in 12 other repositories
|
Librarians are aware of this challenge
20
“[…] there is a broad spectrum of research data management (RDM) practices among our researchers. Some of the variation is associated with the multitude of data types that are the results of instrument-specific scientific output. Another factor contributing to RDM methods is a range of experimentation types and processes dependent upon discipline-specific practices.”
“We see the library playing a different role – one that focuses on aspects such as tracking archival practices of raw research data sets, assigning appropriate ontologies/metadata, providing appropriate linking and checking that the data has been stored in a secure location.”
Dr. Andrew C. White, Director of Library Information Services at Rensselaer Polytechnic Institute (RPI)
Source: https://www.elsevier.com/connect/6-insights-from-leading-universities-on-managing-research-data-effectively
data.mendeley.com
Data RepositoryStore results in a trusted data repository
Store up to 100 GB of data per
dataset in many formats
Describe how experiments
can be reproduced
Long-term storage: never lose data
Link back to protocols
Create DOI
for Citation
(or university prefix)
Keep track of
versions of dataset
On your S3
Or on DANS
On your (local) S3 or on Elsevier cloud
Metadata:
Dublin Core and Google Science Datasets markup
Open licences & indexed in OpenAire
• Achieve credibility, visibility and integrity of key research outputs
• Keep track of your data inside and outside your institution
• Maintain visibility of events in the research data management space
• Improve adoption of data sharing tools by researchers
• Communicate the value of data sharing to researchers during the
research process
Research
article
published
Share,
publish or
link data
Monitor
progress and
provide
guidance
Generate
dashboardsInitial
inquiry
about data
Data MonitorProactively engage with researchers in the RDM space
|
Enrichment PipelineMendeley
Data Search Index
API
DATACITE
DATACITEGEOR
OC RCSB-PDB
Core repositories• Deep-indexed• some don’t use
DataCite
OpenAIREScholix
Hub
CrossREF
Enrichment• Data – article relationship
Enrichment• Institutional affiliation• Author information
Filter by• Institution name / ID• Author name / ID• Object DOI / ID• Object type• Related publication
DOI• Repository name• Last updated date
range
How do we deal with key requirement?“I want to ingest data only into Pure” → we only ingest data repositories and also allow you to select the object type to ingest
Our solution: enrichment pipeline + API
|24
Multiple institution IDs supported:
Mendeley, SciVal, Scopus
A Flexible Integration
Multiple data repositories supported
Non-relevant datasets can be rejected
Data Monitor for inside and outside of your InstitutionRun adoption email campaigns and keep track of research data
Select data sharing options relevant
for researchers
Customize emails Track adoption
Monitor progress
Collect and edit metadata
|
26
For example: research data on
an institutional repository
For example: research data on
subject or domain repository
Private datasetsFor example: data shared privately
by researchers working on a project
Internal datasets
External datasets
The problem: helping researchers share data within their project when they collaborate
With Mendeley Data Manager,
researchers can:
• Share data privately in your research
group, or project
• Also works for collaborators outside
the institution (they can take part in
projects but not start new projects)
• Gather research data from all your
data sources as it’s generated,
including ELNs, instruments etc
• Annotate research data with detailed,
subject-specific metadata (helped by
automated annotation tools)
• Curate data according to project or
institutional workflows
• Prepare to publish data on your
repository of choice
• Open APIs allow: tailored upload
forms, automated workflows, and
workflows to download, analyse and
re-upload data files
Manager helps researchers move from raw files to datasets
Data ManagerActive research data collaboration and workflow tool, which enables research
groups to gather/organize, annotate and share data all in one place.
Note: leftmost active/external data column
will be completed before June 2018
28
IMPORTANCE OF RDM
29
“Monash researchers must comply with grant funder mandates for data management, sharing and reporting and we encourage them to use data management planning at the early stages of their research lifecycle. We are not yet required to report compliance with grant funders’ mandates, but we believe it is important to be ready to do so in the future.” – Andrew Harrison
https://www.youtube.com/watch?v=N2zK3s
Atr-4&feature=youtu.be
10’th Jan , TCI-TRF-Scopus Symposium
Priyanka Chatterjee
Solution Sales Manager, Digital Commons
Open science trends
and how they influence
the impact of your
journal
OPEN SCIENCE
Open science : Improving global access toknowledge
The Impact of open science : Outcomes & benefits
How does open science elevate journal’s visibility ?
OPEN SCIENCE
Open science : Improving global access toknowledge
The Impact of open science : Outcomes & benefits
How does open science elevate journal’s visibility ?
09.01.2020
Building a shared
definition of Open
Science*
i.e. there are many pillars,
but ultimately, Open
Science is just good
science
Open
Access
Open
Data
Open
Metrics
Research Integrity
and Reproducib
ility
Science
&
Society
Open Tools and Software
Open Science
*https://ec.europa.eu/digital-single-market/en/open-science
09.01.2020
What dynamics are driving the shift to Open Science?
Technology
Big Data
Social Web
Web
Government Policies Transparency
Desire for transparency/reliability
in research results
✓ Everyone is a
publisher
✓ Content access is
everywhere
✓ cheaper computing
power, tools
OPEN SCIENCE
1 2 3
Open Data Mandates
✓ Publicly funded
research should be
made available
✓ Easy access to
data above and
beyond research
The Challenge and opportunity for Journals
Commercial publishers and commercial journal databases only cover a fraction of scholarly publications
Tens of thousands of journals exist outside the scope of commercial publishers:• Regional / local language journals• Journals in niche and emerging fields• Journals in arts, humanities, social sciences• Journals with applied and general audiences
These journals have great value…. How to build them for global success?
• Founded in 1999 by Professors from University of California, Berkeley
• All-in-one journal publishing platform with hosting and editorial management tools
• Integrated with Modern Institutional Repository: publish and manage conferences, books, cultural heritage, multimedia, or anything your institution produces
Digital Commons
20 Years of
Journal Publishing
Key factors driving Journals success
09.01.2020
Professiona
l JournalsVisibility
Easy-to-
Use
Expert
Guidance
Easy-to-
Grow
Digital Commons
Publishing Community :
Strength in Numbers
Research Journals by DisciplineMost are Open Access Types of Journals
Deep dive into Digital Common’s Content
97%
3
%3%
Digital Commons journals are indexed in highly
regarded journal directories and databases
OPEN
SCIENC
E
Open science : Improving global access to
knowledge
The Impact of open science : Outcomes &
benefits
How does open science elevate journal’s
visibility ?
Open Science Success Stories
Open Science initiatives that
have created real-world impact
- The article has over 600 downloads, nearly 5 times
more than the average article in the journal receives
- Many readers were located in the region of the
outbreak, providing them with timely relevant access
to important health information
- Other readers included health and disease
policymakers: the US Center for Disease Control and
Kaiser Foundation Health Plan
- Because this journal was hosted in the IR, it had
critical societal impact
- Aurora Health Care is a healthcare system
provider in the US
- They started and publish an Open Access
journal called the “Journal of Patient-Centered
Research and Reviews”, which is hosted on
Aurora’s IR
- They published an article in 2016 about the
“Elizabethkingia” bacterial outbreak crisis that
caused more than 20 deaths in the US
Open Access Journals
- The dataset has been downloaded more than
900 times from the IR
- And the data has been featured by major
international media and publishing companies
- Because the data was in the IR, the impact
and visibility of the research output was
enhanced
- Thomas Crowther was an environmental
studies fellow at Yale University in 2015
- He researched global tree densities and
collected over 420,000 data points
- An article associated with the research and
data was published in Nature
- And the full dataset made available in Yale’s
IR
Open Data
NY Department of Environmental
Conservation draws on research
from College at Brockport and
Joseph Makarewicz’s student
research to develop their
watershed policy.
“Besides enhancing the reputation of the college,…it now
provides some information to people that I’m a leader in
the field.
It’s also surprisingly led to a number of counties coming
forth and suggesting that they may be willing to fund
some research and scholarship in these areas.”
- Prof. Joseph Makarewicz
Open Research
- The Claremont Colleges showcase their
2000+ senior theses in their repository,
Scholarship@Claremont.
- Among them is Kendyl Klein’s theses,
“Why don’t I look like her? The impact of
social media on female body image.”
46
- The thesis was cited by Elle Magazine and has now been downloaded nearly 200,000 times by readers in 193 countries.
- It was also picked up by Cosmopolitan Magazine and The Washington Post, bringing additional
- Kendyl has noticed that three different State Departments of Education show up as frequent readers, which inspired her to look into the possibility of pursuing the topic of body positivity and social media outreach in an educational capacity.
Open Theses
The Impact of Open Access
Open Metrics measure impact & encourage open
practices
Art History and Archeology Professor Justin Walsh discovered that his work was cited in a major new outlet though the use of open metrics in this Digital Commons author dashboard
Open Metrics on author dashboard new insights
OPEN
SCIENC
E
Open science : Improving global access to
knowledge
The Impact of open science : Outcomes &
benefits
How does open science elevate journal’s
visibility ?
Creating journal visibility – A strategic approach
09.01.2020
Intuitive
workflo
w
Visibility
Turn
Key
JOURNAL
1
2
3
4
5
Design & Branding
Professional Publishing
One stop shop for journal editing
SEO & google indexing
Scalability & Hosted
DIGITAL
COMMONS
72% of scholars start their search here
Discoverable on the open web ?
• Google referrer traffic across
Digital Commons sites increases
steadily
• Google Scholar referrer traffic across
all Digital Commons sites increases
steadily
Digital Commons Discoverability & Visibility
Discovery on the Open Web: Top rankings for journals
| 55
Search engine optimization in Digital Commons
✓ Sitemap XML and robots.txt files provided for search engine crawlers
✓ Logical site structure and navigation
✓ Indication of new content to crawlers
✓ High uptime (>99%) and quick page load
✓ Server supports If-Modified-Since http header for less resource-intensive crawling
✓ Metadata has machine-readable tags (“metatags”)
✓ Unique and informative page titles
✓ Full-text articles published in PDF by default
✓ Assets can be rendered by crawlers
✓ Secure site connections with valid https certificates
✓ Fields for introductory text for collections and publications
✓ Flexible options for accurate, descriptive, and appropriate metadata
✓ SEO configurations for controlling page title and search description at site, community, and publication levels
✓ …And more
Stuff DC gives you:
Stuff DC facilitates:
Stuff you can do:
❑ Upload early and regularly
❑ Link to repository content from your university website, library website, LibGuides, Wikipedia, SSRN, social media, etc.
❑ Provide accurate, rich, descriptive metadata
❑ Include abstracts
❑ Include introductory text for collections and publications
❑ Use your SEO configuration fields
❑ Monitor performance with Digital Commons reports
❑ Ask for a Discoverability Report
❑ Talk to your bepress Consultant!
✓ A product manager dedicated to tracking and increasing impact
✓ Monitoring of key SEO indicators
✓ Communication with specialized search engines such as Google Scholar to assure widespread coverage
✓ Regular enhancements stay current with best practices
✓ Troubleshooting, investigation, and remediation as needed
✓ Support and advice from a dedicated Consultant
✓ Shared wisdom from Digital Commons community
Average journal experience: Migrating to Digital
Commons generates significant visibility
• 250K+ total downloads
over 5 years:
• By Year 5:
• 500+ total articles
• 80K+ annual
downloads
• 12% increase per
year in annual
downloads0
50000
100000
150000
200000
250000
300000
Year 1 Year 2 Year 3 Year 4 Year 5
Ave
rag
e c
um
ula
tive
do
wn
loa
ds
Average journal experience: Per article visibility
improves quickly
0
20
40
60
80
100
120
140
160
Year 1 Year 2 Year 3 Year 4 Year 5
• ~150 downloads per
article by Year 5:
• 10%+ annual growth in
download/article from
Year 2 to Year 5
Do
wn
loa
ds p
er
art
icle
Before and After: Scopus Journals that move to
Digital Commons more than double their
average annual CiteScore growth
4% 9%
Compound average annual
CiteScore growth BEFORE Digital
Commons
Compound average annual
CiteScore growth AFTER Digital
Commons
09.01.2020
Supporting Scopus Compliance Category Criteria Facilitated by DC Advising with Premier
Basic Criteria
Peer-reviewed content ✔
English title, abstracts and keywords ✔
References in Roman script ✔
Regular publication with ISSN ✔
Public publishing ethics statement ✔
Advanced Criteria
Journal Policy
Convincing editorial policy ✔
Type of peer review ✔
Diversity in geographical distribution of editors ✔
Diversity in geographical distribution of authors ✔
Content
Academic contribution to the field ✔
Clarity of abstracts ✔
Quality of and conformity to the stated aims and scope of the
journal
Readability of articles ✔
Journal StandingCitedness of journal articles in Scopus
Editor standing
Publishing Regularity No delays or interruptions in the publication schedule ✔
Online Availability
Full journal content available online ✔
English language journal home page available ✔
Quality of journal home page ✔
Today over 600 institutions use Digital Commons to support over 2000 journals
How can we partner with you to enhance the journal value and visibility ?
Contact priyanka at [email protected] to know about successful journeys of many journals on board
Thank you