Upload
kimberly-eke
View
4.073
Download
1
Embed Size (px)
Citation preview
How libraries can help rescue data
9 February 2017
Downloadpdf for
reading!
ResearchQuality
Data
using workflow protocols to ensure quality & trust
Levels of library commitmentWe have Archive-It & cando deep web archiving &
document “uncrawlables”
We know of specific data our community needs;
we will rescue them.
2
3We can harvest “uncrawlables”
for an agency through a data rescue event or dedicated team
1We can survey our researchers to nominate data sets & raise awareness in our community
4
ResearchQuality
Data
using workflow protocols to ensure quality & trust
Levels of library commitmentWe have Archive-It & cando deep web archiving &
document “uncrawlables”
We know of specific data our community needs;
we will rescue them.
2
3We can harvest “uncrawlables”
of a program through a data rescue event or dedicated team
1We can survey our researchers to nominate data sets & raise awareness in our community
4
Survey researchers & raise awareness
Nomination Formto collect data they use & need
Ask researchers & use
1 ResearchQuality
Data
Survey researchers & raise awareness
Raise awareness by:attending & writing about a Data Refuge event
highlighting waysyour repositorycan preserve data
Nomination Formto collect data they use & need
Ask researchers & use
1 hold workshops & panels on data storytelling
ResearchQuality
Data
ResearchQuality
Data
using workflow protocols to ensure quality & trust
Levels of library commitmentWe have Archive-It & cando deep web archiving &
document “uncrawlables”
We know of specific data our community needs;
we will rescue them.
2
3We can harvest “uncrawlables”
of a program through a data rescue event or dedicated team
1We can survey our researchers to nominate data sets & raise awareness in our community
4
Web archive & document uncrawlables
2to see what agencies
can be claimed
US Agency Coordination Spreadsheet
Use the
ResearchQuality
Data
Web archive & document uncrawlables
2to see what agencies
can be claimed
US Agency Coordination Spreadsheet
Use theSubmit a request to
claim an agency& use your Archive-It
account to go as deeply as possible
ResearchQuality
Data
Web archive & document uncrawlables
2to see what agencies
can be claimed
US Agency Coordination Spreadsheet
Use theSubmit a request to
claim an agency& use your Archive-It
account to go as deeply as possible
ResearchQuality
Data
Document “uncrawlables”
as you run into archiving errors
Web archive & document uncrawlables
2to see what agencies
can be claimed
US Agency Coordination Spreadsheet
Use theSubmit a request to
claim an agency& use your Archive-It
account to go as deeply as possible
ResearchQuality
Data
Document “uncrawlables”
as you run into archiving errors
We need a Chrome plug-into launch an uncrawlables form to make this easy & consistent across all libs!
(similar to this one)
HELP!
ResearchQuality
Data
using workflow protocols to ensure quality & trust
Levels of library commitmentWe have Archive-It & cando deep web archiving &
document “uncrawlables”
We know of specific data our community needs;
we will rescue them.
2
3We can harvest “uncrawlables”
of a program through a data rescue event or dedicated team
1We can survey our researchers to nominate data sets & raise awareness in our community
4
Rescue community-specific dataIf you already know of high value, high priority data
sets for your community, get them!
ResearchQuality
Data3
Rescue community-specific data
Workflow Protocols
If you already know that there are data sets of high value & high priority for your community, get them!
Use your library’s preservation
github.com/datarefuge/workflow
ResearchQuality
Data3
in the spirit of
Rescue community-specific data
It’s only DataRefugeif it follows QA processes
for a trusted chain of custody
If you already know that there are data sets of high value & high priority for your community, get them!
ResearchQuality
Data3
Workflow Protocols
Use your library’s preservation
github.com/datarefuge/workflowin the spirit of
Rescue community-specific data
We are working on a ckan Registry to link to
downloaded files in your repository
It’s only DataRefugeif it follows QA processes
for a trusted chain of custody
If you already know that there are data sets of high value & high priority for your community, get them!
ResearchQuality
Data3
Workflow Protocols
Use your library’s preservation
github.com/datarefuge/workflowin the spirit of
3
Rescue community-specific data
It’s only DataRefuge if it follows QA processes
for a trusted chain of custody
If you already know that there are data sets of high value & high priority for your community, get them!
ResearchQuality
Data
Let us know when you’re done by claiming it on the US Agency
spreadsheet!
Workflow Protocols
Use your library’s preservation
github.com/datarefuge/workflowin the spirit of
We are working on a ckan Registry to link to
downloaded files in your repository
ResearchQuality
Data
using workflow protocols to ensure quality & trust
Levels of library commitmentWe have Archive-It & cando deep web archiving &
document “uncrawlables”
We know of specific data our community needs;
we will rescue them.
2
3We can harvest “uncrawlables”
of a program through a data rescue event or dedicated team
1We can survey our researchers to nominate data sets & raise awareness in our community
4
Harvest all uncrawlablesThis is the highest level of commitment. You are
dedicated to harvesting all of the data you can find.
ResearchQuality
Data4
Harvest all uncrawlables
Claim an Agency using the spreadsheet
This is the highest level of commitment. You are dedicated to harvesting all of the data you can find.
ResearchQuality
Data4
Harvest all uncrawlables
Host a Data Rescue Event
or designate an internal library team
This is the highest level of commitment. You are dedicated to harvesting all of the data you can find.
ResearchQuality
Data
Claim an Agency using the spreadsheet
4
Harvest all uncrawlablesThis is the highest level of commitment. You are
dedicated to harvesting all of the data you can find.
ResearchQuality
Data
Claim an Agency using the spreadsheet
4
Host a Data Rescue Event
or designate an internal library team
Use established library preservation workflow protocols to maintain
trusted chain of custody
Harvest all uncrawlables
Use established library preservation workflow protocols to maintain
trusted chain of custody
This is the highest level of commitment. You are dedicated to harvesting all of the data you can find.
ResearchQuality
Data
Claim an Agency using the spreadsheet
4We are working on a
ckan Registry to link to downloaded files in
your repository
Host a Data Rescue Event
or designate an internal library team
Repeat the cycle as needed
Claim an agency
Documentuncrawlables
Harvestuncrawlables
Verify/QAuncrawlables
Registeruncrawlables
Updatespreadsheet
ResearchQuality
Data
using workflow protocols to ensure quality & trust
Levels of library commitmentWe have Archive-It & cando deep web archiving &
document “uncrawlables”
We know of specific data our community needs;
we will rescue them.
2
3We can harvest “uncrawlables”
for an agency through a data rescue event or dedicated team
1We can survey our researchers to nominate data sets & raise awareness in our community
4
ResearchQuality
Data
using workflow protocols to ensure quality & trust
URLS2
3
1
4
Nominate data setsdocs.google.com/forms/d/e/1FAIpQLSd8JeRxvyVrBASrc4D42Z6nz8yIsQuu_cGVRtO5uWO9yjlBFw/viewform?c=0&w=1
US Agency spreadsheet
Claim an Agency form
docs.google.com/spreadsheets/d/1yIrhFrZkv2Yhdk48W_P5bd-C-jxLtbzJFcm2E5oq-ec/edit
docs.google.com/forms/d/e/1FAIpQLScE7iuLIEbEd0hkkP9_zB5skXwKqL8EeW9hVlB4JSIkvCvm6Q/viewform?c=0&w=1
Workflow protocols examplegithub.com/datarefuge/workflow
Chain of custodygithub.com/datarefuge/chain-of-custody
US Agency spreadsheet
Claim an Agency form
docs.google.com/spreadsheets/d/1yIrhFrZkv2Yhdk48W_P5bd-C-jxLtbzJFcm2E5oq-ec/edit
docs.google.com/forms/d/e/1FAIpQLScE7iuLIEbEd0hkkP9_zB5skXwKqL8EeW9hVlB4JSIkvCvm6Q/viewform?c=0&w=1
Workflow protocols examplegithub.com/datarefuge/workflow
Chain of custodygithub.com/datarefuge/chain-of-custody
librariesnetwork.org
Icons courtesy of iconmonstr.com