33
Minnesota Digital Library and HathiTrust Prototype an Image Preservation Archive 5 April 2011 5 April 2011 CNI Spring Task Force Meeting CNI Spring Task Force Meeting

Minnesota Digital Library and HathiTrust

  • Upload
    yuval

  • View
    38

  • Download
    0

Embed Size (px)

DESCRIPTION

Minnesota Digital Library and HathiTrust. Prototype an Image Preservation Archive. 5 April 2011 CNI Spring Task Force Meeting. John Butler, University of Minnesota John Weise, University of Michigan Eric Celeste, Consultant, MDL. - PowerPoint PPT Presentation

Citation preview

Page 1: Minnesota Digital Library and HathiTrust

Minnesota Digital Library and HathiTrust

Minnesota Digital Library and HathiTrust

Prototype an Image Preservation Archive Prototype an Image Preservation Archive

5 April 20115 April 2011CNI Spring Task Force MeetingCNI Spring Task Force Meeting

Page 2: Minnesota Digital Library and HathiTrust

John Butler, University of Minnesota

John Weise, University of Michigan

Eric Celeste, Consultant, MDL

John Butler, University of Minnesota

John Weise, University of Michigan

Eric Celeste, Consultant, MDL

Page 3: Minnesota Digital Library and HathiTrust

Minnesota Digital Library, Butler

HathiTrust’s Interest and Role, Weise

Prototype and Lessons Learned, Celeste

Minnesota Digital Library, Butler

HathiTrust’s Interest and Role, Weise

Prototype and Lessons Learned, Celeste

Page 4: Minnesota Digital Library and HathiTrust

Minnesota Digital Library Coalition

Minnesota Digital Library Coalition

•Conceived in 2001

•Early years – LSTA paycheck to paycheck

•Minitex (UMN) now the administrative home

•Signature project — Minnesota Reflections

•62k images, maps, documents

•120 cultural heritage institutions

•Conceived in 2001

•Early years – LSTA paycheck to paycheck

•Minitex (UMN) now the administrative home

•Signature project — Minnesota Reflections

•62k images, maps, documents

•120 cultural heritage institutions

Page 5: Minnesota Digital Library and HathiTrust

MDL AccessMDL Access•“...not possible without you”•“...not possible without you”

Page 6: Minnesota Digital Library and HathiTrust

MDL AccessMDL Access•“...we’ve got ours and we’re keeping it”•“...we’ve got ours and we’re keeping it”

Page 7: Minnesota Digital Library and HathiTrust

...something we can all agree upon

...something we can all agree upon

•Viewed as common infrastructureEconomies/imperatives of scaleAccess need not be affected•but might it be?Attracting broader interest• including public radio & televisionA gift from Minnesotans•Arts and Cultural Heritage FundingA digital MLAC for the state

•Viewed as common infrastructureEconomies/imperatives of scaleAccess need not be affected•but might it be?Attracting broader interest• including public radio & televisionA gift from Minnesotans•Arts and Cultural Heritage FundingA digital MLAC for the state

MDL Preservation

MDL Preservation

Page 8: Minnesota Digital Library and HathiTrust

MDL AspirationsMDL Aspirations•State-wide Digital Preservation Services•State-wide Digital Preservation Services

Page 9: Minnesota Digital Library and HathiTrust

Nesting ConsortiaNesting Consortia

Policies

Agreements

Standards

HathiTrustHathiTrust

Standards•File FormatsFile Formats•ProceduresProcedures

Formats•BooksJournalsBooksJournalsExperiment: Experiment: images & images & audioaudio

Policies•GovernanceGovernance•RightsRights•CostsCosts

Mission•AccessAccess•PreservationPreservation•Research fociResearch foci

Minnesota D

igital Library

Minnesota D

igital Library

Page 10: Minnesota Digital Library and HathiTrust

HathiTrust’s Interest and Role

HathiTrust’s Interest and Role

Page 11: Minnesota Digital Library and HathiTrust

Mission of HathiTrustMission of HathiTrust

•Contribute to the common good by…

• collecting,

• organizing,

• preserving,

• communicating,

• and sharing

•…the record of human knowledge.

•Contribute to the common good by…

• collecting,

• organizing,

• preserving,

• communicating,

• and sharing

•…the record of human knowledge.

Page 12: Minnesota Digital Library and HathiTrust

Preservation PhilosophyPreservation Philosophy

• Maximize…

• partner contributions

• use of available resources

• Maximize…

• partner contributions

• use of available resources

Page 13: Minnesota Digital Library and HathiTrust

Long Term Functional Objectives

Long Term Functional Objectives

• TRAC compliance

• Robust discovery mechanisms

• Open service definition (APIs)

• Support for formats beyond books & journals

• Data mining tools

• All functional objectives… http://www.hathitrust.org/objectives

• TRAC compliance

• Robust discovery mechanisms

• Open service definition (APIs)

• Support for formats beyond books & journals

• Data mining tools

• All functional objectives… http://www.hathitrust.org/objectives

Page 14: Minnesota Digital Library and HathiTrust

HathiTrust’s InterestHathiTrust’s Interest

• Help MDL find a solution

• Explore image support

• Leverage MDL resources

• Empower MDL in the process

• Draft ingest specifications

•Establish a viable model for support of MDL and similar preservation cooperatives.

• Help MDL find a solution

• Explore image support

• Leverage MDL resources

• Empower MDL in the process

• Draft ingest specifications

•Establish a viable model for support of MDL and similar preservation cooperatives.

Page 15: Minnesota Digital Library and HathiTrust

DesignDesign

• Operational logistics at the scale of HathiTrust have led to solutions that favor:

• consistency and standardization

• simplicity over complexity

• practicality over ideology

• Operational logistics at the scale of HathiTrust have led to solutions that favor:

• consistency and standardization

• simplicity over complexity

• practicality over ideology

Page 16: Minnesota Digital Library and HathiTrust

HathiTrust’s RoleHathiTrust’s Role

• Scoping and guidance

• Lower barriers

• Raise bars

• Learn, grow and eventually provide

• Object ingest specifications for images

• Object preparation tools, esp. validation

• Scoping and guidance

• Lower barriers

• Raise bars

• Learn, grow and eventually provide

• Object ingest specifications for images

• Object preparation tools, esp. validation

Page 17: Minnesota Digital Library and HathiTrust

All Together NowAll Together Now• HathiTrust really is a collaborative effort.

• MDL and Michigan worked together under HathiTrust governance.

• HathiTrust really is a collaborative effort.

• MDL and Michigan worked together under HathiTrust governance.

Page 18: Minnesota Digital Library and HathiTrust

What did MDL actually send to HathiTrust?

What did MDL actually send to HathiTrust?

Page 19: Minnesota Digital Library and HathiTrust

What MDL sent to HathiTrust

What MDL sent to HathiTrust

Packages

Reflections Simple Reflections Simple ContoneContone

22,186

Reflections Compound Reflections Compound ObjectsObjects

888

Minnesota Historical Minnesota Historical SocietySociety

6,860

Total 29,934

Page 20: Minnesota Digital Library and HathiTrust

Items GB

Simple JP2Simple JP2 22,186 429

Compound JP2Compound JP2 13,844 407

Compound Bitonal TIFFCompound Bitonal TIFF 13,272 1

JPEGJPEG 9,575 12

Total 49,302 849

What MDL sent to HathiTrust

What MDL sent to HathiTrust

Page 21: Minnesota Digital Library and HathiTrust

Number of items transferredNumber of items transferred Amount of data transferredAmount of data transferred

ObjectsObjects

What MDL sent to HathiTrust

What MDL sent to HathiTrust

Page 22: Minnesota Digital Library and HathiTrust

An “object” sent to An “object” sent to HathiTrust is a HathiTrust is a “Submission “Submission Information Information Package” and Package” and consists of many consists of many parts: a METS file parts: a METS file with a variety of with a variety of metadata, a set of metadata, a set of image files, and a image files, and a set of set of corresponding text corresponding text files. The image files. The image files are the files are the “items” sent.“items” sent.

What MDL sent to HathiTrust

What MDL sent to HathiTrust

Page 23: Minnesota Digital Library and HathiTrust

What did we learn from working together?

What did we learn from working together?

Page 24: Minnesota Digital Library and HathiTrust

Get the full report at...http://mndigital.org/projects/preservation/

Get the full report at...http://mndigital.org/projects/preservation/

Page 25: Minnesota Digital Library and HathiTrust

What is a master?What is a master?

Page 26: Minnesota Digital Library and HathiTrust

Where is the identifier?

Where is the identifier?

(C

C-B

Y-N

C-N

D)

Som

e r

igh

ts r

ese

rved

by M

art

in G

om

mel

(C

C-B

Y-N

C-N

D)

Som

e r

igh

ts r

ese

rved

by M

art

in G

om

mel

Page 27: Minnesota Digital Library and HathiTrust

Metadata madness!Metadata madness!

(CC

-BY-N

C)

Som

e r

igh

ts r

ese

rved

by S

alt

ate

mp

o(C

C-B

Y-N

C)

Som

e r

igh

ts r

ese

rved

by S

alt

ate

mp

o

Page 28: Minnesota Digital Library and HathiTrust

Trust us.Trust us.

(CC

-BY-N

C)

Som

e r

igh

ts r

ese

rved

by n

ick

see

(CC

-BY-N

C)

Som

e r

igh

ts r

ese

rved

by n

ick

see

Page 29: Minnesota Digital Library and HathiTrust

Who’s looking?Who’s looking?

(CC

-BY)

Som

e r

igh

ts r

ese

rved

by a

pd

k(C

C-B

Y)

Som

e r

igh

ts r

ese

rved

by a

pd

k

Page 30: Minnesota Digital Library and HathiTrust

No free lunch.No free lunch.

(CC

-BY-N

C)

Som

e r

igh

ts r

ese

rved

by f

reefo

tou

k(C

C-B

Y-N

C)

Som

e r

igh

ts r

ese

rved

by f

reefo

tou

k

Page 31: Minnesota Digital Library and HathiTrust

No free lunch.No free lunch.

(CC

-BY-N

C)

Som

e r

igh

ts r

ese

rved

by f

reefo

tou

k(C

C-B

Y-N

C)

Som

e r

igh

ts r

ese

rved

by f

reefo

tou

k

Time as Producer

Cost as Producer

Time as Aggregat

or

Cost as Aggregat

or

Programmer 672h 0m 0s

$8,000 2352h 0m 0s

$28,000

Metadata Assistant

100h 0m 0s

$2,000

Manager 40h 0m 0s $2,800 30h 0m 0s $2,100

Totals $12,800 $30,100

Page 32: Minnesota Digital Library and HathiTrust

Next Steps for MDL?Next Steps for MDL?

•Trials with MetaArchive and OCLC Digital Archive.

•Paper and phone evaluations of a few others.

•Decisions in early summer about how to proceed.

•Trials with MetaArchive and OCLC Digital Archive.

•Paper and phone evaluations of a few others.

•Decisions in early summer about how to proceed.

Page 33: Minnesota Digital Library and HathiTrust

Questions?Questions?