How libraries can help: Levels of data rescue

How libraries can help rescue data

9 February 2017

Levels of library commitment

Research-quality data: using workflow protocols to ensure quality & trust

1. We can survey our researchers to nominate data sets & raise awareness in our community.
2. We have Archive-It & can do deep web archiving & document “uncrawlables”.
3. We know of specific data our community needs; we will rescue them.
4. We can harvest “uncrawlables” for an agency or program through a data rescue event or dedicated team.

1. Survey researchers & raise awareness

Ask researchers & use the Nomination Form to collect data they use & need.

Raise awareness by:
- attending & writing about a Data Refuge event
- highlighting ways your repository can preserve data
- holding workshops & panels on data storytelling
2. Web archive & document uncrawlables

Use the US Agency Coordination Spreadsheet to see what agencies can be claimed.

Submit a request to claim an agency & use your Archive-It account to go as deeply as possible.

Document “uncrawlables” as you run into archiving errors.

HELP! We need a Chrome plug-in to launch an uncrawlables form to make this easy & consistent across all libs! (similar to this one)

3. Rescue community-specific data

If you already know that there are data sets of high value & high priority for your community, get them!

Use your library’s preservation Workflow Protocols in the spirit of github.com/datarefuge/workflow

It’s only DataRefuge if it follows QA processes for a trusted chain of custody.

Let us know when you’re done by claiming it on the US Agency spreadsheet!

We are working on a CKAN Registry to link to downloaded files in your repository.
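
One way a library might support QA and a trusted chain of custody for rescued files is to record a checksum for each file at harvest time and re-check it later. The sketch below is illustrative only. The function names, manifest fields, and choice of SHA-256 are assumptions and are not taken from github.com/datarefuge/workflow.

```python
import hashlib
from datetime import datetime, timezone
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(data_dir: Path, source_url: str, harvested_by: str) -> dict:
    """Record the source, the harvester, a timestamp, and one checksum per file.

    The field names here are hypothetical, chosen for illustration.
    """
    files = {
        p.relative_to(data_dir).as_posix(): sha256_of(p)
        for p in sorted(data_dir.rglob("*"))
        if p.is_file()
    }
    return {
        "source_url": source_url,
        "harvested_by": harvested_by,
        "harvested_at": datetime.now(timezone.utc).isoformat(),
        "files": files,
    }


def verify_manifest(data_dir: Path, manifest: dict) -> list:
    """Return relative paths that are missing or whose checksum has changed."""
    mismatched = []
    for rel_path, expected in manifest["files"].items():
        file_path = data_dir / rel_path
        if not file_path.is_file() or sha256_of(file_path) != expected:
            mismatched.append(rel_path)
    return mismatched
```

A manifest like this could be saved next to the data when it is deposited in your repository. An empty list from `verify_manifest` means every listed file still matches the checksum recorded at harvest.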

4. Harvest all uncrawlables

This is the highest level of commitment. You are dedicated to harvesting all of the data you can find.

Claim an Agency using the spreadsheet.

Host a Data Rescue Event or designate an internal library team.

Use established library preservation workflow protocols to maintain trusted chain of custody.

We are working on a CKAN Registry to link to downloaded files in your repository.

Repeat the cycle as needed:
- Claim an agency
- Document uncrawlables
- Harvest uncrawlables
- Verify/QA uncrawlables
- Register uncrawlables
- Update spreadsheet
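
For a team tracking its progress through this cycle, the six steps can be modeled as an ordered sequence. This is a minimal sketch, not part of any DataRefuge tooling. The `Step` enum and `next_step` helper are hypothetical names. The sketch also assumes that repeating the cycle means returning to the first step, which the slide does not specify.

```python
from enum import Enum


class Step(Enum):
    """The cycle steps, in the order they appear on the slide."""
    CLAIM_AGENCY = "Claim an agency"
    DOCUMENT = "Document uncrawlables"
    HARVEST = "Harvest uncrawlables"
    VERIFY_QA = "Verify/QA uncrawlables"
    REGISTER = "Register uncrawlables"
    UPDATE_SPREADSHEET = "Update spreadsheet"


# Enum members iterate in definition order, so this list preserves the cycle order.
CYCLE = list(Step)


def next_step(current: Step) -> Step:
    """Return the step after `current`, wrapping to the first step (an assumption)."""
    index = CYCLE.index(current)
    return CYCLE[(index + 1) % len(CYCLE)]
```

For example, `next_step(Step.HARVEST)` returns `Step.VERIFY_QA`, so harvested files go through QA before they are registered.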

URLs

Nominate data sets: docs.google.com/forms/d/e/1FAIpQLSd8JeRxvyVrBASrc4D42Z6nz8yIsQuu_cGVRtO5uWO9yjlBFw/viewform?c=0&w=1

US Agency spreadsheet: docs.google.com/spreadsheets/d/1yIrhFrZkv2Yhdk48W_P5bd-C-jxLtbzJFcm2E5oq-ec/edit

Claim an Agency form: docs.google.com/forms/d/e/1FAIpQLScE7iuLIEbEd0hkkP9_zB5skXwKqL8EeW9hVlB4JSIkvCvm6Q/viewform?c=0&w=1

Workflow protocols example: github.com/datarefuge/workflow

Chain of custody: github.com/datarefuge/chain-of-custody

librariesnetwork.org

Icons courtesy of iconmonstr.com
