26
How libraries can help rescue data 9 February 2017 Download pdf for reading!

How libraries can help: Levels of data rescue

Embed Size (px)

Citation preview

Page 1: How libraries can help: Levels of data rescue

How libraries can help rescue data

9 February 2017

Downloadpdf for

reading!

Page 2: How libraries can help: Levels of data rescue

ResearchQuality

Data

using workflow protocols to ensure quality & trust

Levels of library commitmentWe have Archive-It & cando deep web archiving &

document “uncrawlables”

We know of specific data our community needs;

we will rescue them.

2

3We can harvest “uncrawlables”

for an agency through a data rescue event or dedicated team

1We can survey our researchers to nominate data sets & raise awareness in our community

4

Page 3: How libraries can help: Levels of data rescue

ResearchQuality

Data

using workflow protocols to ensure quality & trust

Levels of library commitmentWe have Archive-It & cando deep web archiving &

document “uncrawlables”

We know of specific data our community needs;

we will rescue them.

2

3We can harvest “uncrawlables”

of a program through a data rescue event or dedicated team

1We can survey our researchers to nominate data sets & raise awareness in our community

4

Page 4: How libraries can help: Levels of data rescue

Survey researchers & raise awareness

Nomination Formto collect data they use & need

Ask researchers & use

1 ResearchQuality

Data

Page 5: How libraries can help: Levels of data rescue

Survey researchers & raise awareness

Raise awareness by:attending & writing about a Data Refuge event

highlighting waysyour repositorycan preserve data

Nomination Formto collect data they use & need

Ask researchers & use

1 hold workshops & panels on data storytelling

ResearchQuality

Data

Page 6: How libraries can help: Levels of data rescue

ResearchQuality

Data

using workflow protocols to ensure quality & trust

Levels of library commitmentWe have Archive-It & cando deep web archiving &

document “uncrawlables”

We know of specific data our community needs;

we will rescue them.

2

3We can harvest “uncrawlables”

of a program through a data rescue event or dedicated team

1We can survey our researchers to nominate data sets & raise awareness in our community

4

Page 7: How libraries can help: Levels of data rescue

Web archive & document uncrawlables

2to see what agencies

can be claimed

US Agency Coordination Spreadsheet

Use the

ResearchQuality

Data

Page 8: How libraries can help: Levels of data rescue

Web archive & document uncrawlables

2to see what agencies

can be claimed

US Agency Coordination Spreadsheet

Use theSubmit a request to

claim an agency& use your Archive-It

account to go as deeply as possible

ResearchQuality

Data

Page 9: How libraries can help: Levels of data rescue

Web archive & document uncrawlables

2to see what agencies

can be claimed

US Agency Coordination Spreadsheet

Use theSubmit a request to

claim an agency& use your Archive-It

account to go as deeply as possible

ResearchQuality

Data

Document “uncrawlables”

as you run into archiving errors

Page 10: How libraries can help: Levels of data rescue

Web archive & document uncrawlables

2to see what agencies

can be claimed

US Agency Coordination Spreadsheet

Use theSubmit a request to

claim an agency& use your Archive-It

account to go as deeply as possible

ResearchQuality

Data

Document “uncrawlables”

as you run into archiving errors

We need a Chrome plug-into launch an uncrawlables form to make this easy & consistent across all libs!

(similar to this one)

HELP!

Page 11: How libraries can help: Levels of data rescue

ResearchQuality

Data

using workflow protocols to ensure quality & trust

Levels of library commitmentWe have Archive-It & cando deep web archiving &

document “uncrawlables”

We know of specific data our community needs;

we will rescue them.

2

3We can harvest “uncrawlables”

of a program through a data rescue event or dedicated team

1We can survey our researchers to nominate data sets & raise awareness in our community

4

Page 12: How libraries can help: Levels of data rescue

Rescue community-specific dataIf you already know of high value, high priority data

sets for your community, get them!

ResearchQuality

Data3

Page 13: How libraries can help: Levels of data rescue

Rescue community-specific data

Workflow Protocols

If you already know that there are data sets of high value & high priority for your community, get them!

Use your library’s preservation

github.com/datarefuge/workflow

ResearchQuality

Data3

in the spirit of

Page 14: How libraries can help: Levels of data rescue

Rescue community-specific data

It’s only DataRefugeif it follows QA processes

for a trusted chain of custody

If you already know that there are data sets of high value & high priority for your community, get them!

ResearchQuality

Data3

Workflow Protocols

Use your library’s preservation

github.com/datarefuge/workflowin the spirit of

Page 15: How libraries can help: Levels of data rescue

Rescue community-specific data

We are working on a ckan Registry to link to

downloaded files in your repository

It’s only DataRefugeif it follows QA processes

for a trusted chain of custody

If you already know that there are data sets of high value & high priority for your community, get them!

ResearchQuality

Data3

Workflow Protocols

Use your library’s preservation

github.com/datarefuge/workflowin the spirit of

Page 16: How libraries can help: Levels of data rescue

3

Rescue community-specific data

It’s only DataRefuge if it follows QA processes

for a trusted chain of custody

If you already know that there are data sets of high value & high priority for your community, get them!

ResearchQuality

Data

Let us know when you’re done by claiming it on the US Agency

spreadsheet!

Workflow Protocols

Use your library’s preservation

github.com/datarefuge/workflowin the spirit of

We are working on a ckan Registry to link to

downloaded files in your repository

Page 17: How libraries can help: Levels of data rescue

ResearchQuality

Data

using workflow protocols to ensure quality & trust

Levels of library commitmentWe have Archive-It & cando deep web archiving &

document “uncrawlables”

We know of specific data our community needs;

we will rescue them.

2

3We can harvest “uncrawlables”

of a program through a data rescue event or dedicated team

1We can survey our researchers to nominate data sets & raise awareness in our community

4

Page 18: How libraries can help: Levels of data rescue

Harvest all uncrawlablesThis is the highest level of commitment. You are

dedicated to harvesting all of the data you can find.

ResearchQuality

Data4

Page 19: How libraries can help: Levels of data rescue

Harvest all uncrawlables

Claim an Agency using the spreadsheet

This is the highest level of commitment. You are dedicated to harvesting all of the data you can find.

ResearchQuality

Data4

Page 20: How libraries can help: Levels of data rescue

Harvest all uncrawlables

Host a Data Rescue Event

or designate an internal library team

This is the highest level of commitment. You are dedicated to harvesting all of the data you can find.

ResearchQuality

Data

Claim an Agency using the spreadsheet

4

Page 21: How libraries can help: Levels of data rescue

Harvest all uncrawlablesThis is the highest level of commitment. You are

dedicated to harvesting all of the data you can find.

ResearchQuality

Data

Claim an Agency using the spreadsheet

4

Host a Data Rescue Event

or designate an internal library team

Use established library preservation workflow protocols to maintain

trusted chain of custody

Page 22: How libraries can help: Levels of data rescue

Harvest all uncrawlables

Use established library preservation workflow protocols to maintain

trusted chain of custody

This is the highest level of commitment. You are dedicated to harvesting all of the data you can find.

ResearchQuality

Data

Claim an Agency using the spreadsheet

4We are working on a

ckan Registry to link to downloaded files in

your repository

Host a Data Rescue Event

or designate an internal library team

Page 23: How libraries can help: Levels of data rescue

Repeat the cycle as needed

Claim an agency

Documentuncrawlables

Harvestuncrawlables

Verify/QAuncrawlables

Registeruncrawlables

Updatespreadsheet

Page 24: How libraries can help: Levels of data rescue

ResearchQuality

Data

using workflow protocols to ensure quality & trust

Levels of library commitmentWe have Archive-It & cando deep web archiving &

document “uncrawlables”

We know of specific data our community needs;

we will rescue them.

2

3We can harvest “uncrawlables”

for an agency through a data rescue event or dedicated team

1We can survey our researchers to nominate data sets & raise awareness in our community

4

Page 25: How libraries can help: Levels of data rescue

ResearchQuality

Data

using workflow protocols to ensure quality & trust

URLS2

3

1

4

Nominate data setsdocs.google.com/forms/d/e/1FAIpQLSd8JeRxvyVrBASrc4D42Z6nz8yIsQuu_cGVRtO5uWO9yjlBFw/viewform?c=0&w=1

US Agency spreadsheet

Claim an Agency form

docs.google.com/spreadsheets/d/1yIrhFrZkv2Yhdk48W_P5bd-C-jxLtbzJFcm2E5oq-ec/edit

docs.google.com/forms/d/e/1FAIpQLScE7iuLIEbEd0hkkP9_zB5skXwKqL8EeW9hVlB4JSIkvCvm6Q/viewform?c=0&w=1

Workflow protocols examplegithub.com/datarefuge/workflow

Chain of custodygithub.com/datarefuge/chain-of-custody

US Agency spreadsheet

Claim an Agency form

docs.google.com/spreadsheets/d/1yIrhFrZkv2Yhdk48W_P5bd-C-jxLtbzJFcm2E5oq-ec/edit

docs.google.com/forms/d/e/1FAIpQLScE7iuLIEbEd0hkkP9_zB5skXwKqL8EeW9hVlB4JSIkvCvm6Q/viewform?c=0&w=1

Workflow protocols examplegithub.com/datarefuge/workflow

Chain of custodygithub.com/datarefuge/chain-of-custody

Page 26: How libraries can help: Levels of data rescue

librariesnetwork.org

Icons courtesy of iconmonstr.com