28
HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise noted, these slides and their contents are licensed under a Creative Commons Attribution Unported License .

HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

Embed Size (px)

Citation preview

Page 1: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

HATHI TRUST A Shared Digital Repository

Digital Repositories for Preservation and Access

Digital Directions 2013Jeremy YorkJuly 22 2013

Unless otherwise noted these slides and their contents are licensed under a Creative Commons Attribution Unported License

Digital repositories

bull Primary mission to preserve contentbull Performs actions to this end

Reasons to preserve content

bull For accessbull Guard against threats to contentndash Digitization accepted method of preservation

reformattingndash Digital deteriorates is fragile

Reasons to provide access

bull Meet needs of designated communitybull Check on integrity of contentbull Content that is accessible is more likely to be

valued and preserved in the future

Reasons access might not be offered

bull Copyrightbull Privacybull Licensingbull Needs of user communityndash Content available elsewhere

bull Technical limitationsndash Networking and storage requirements

A number of models

bull Full user access to preserved digital objectsbull No end-user access to digital objectsbull Delayed or triggered user access to digital

objectsbull Partial access to digital objects

Requirements to preserve content

bull OAISndash ldquoAn OAIS is an Archive consisting of an

organizationof people and systems that has accepted the responsibility to preserve information and make it available for a Designated Communityrdquo [does not imply unrestricted access]

OAIS

bull Support information modelndash Define target of preservation (content data and representation

information)

ndash Define metadata needed to preserve identify contextualize information (PDI)

bull Fulfill responsibilitiesndash Accept information from Producers

ndash Obtain control sufficient to preserve

ndash Ensure understandable to designated community

ndash Ensure preservation

ndash Make available to designated community with information supporting authenticity

Ensure preservation

bull Some strategiesndash Transformationndash Validationndash Checks on integrityndash Replicationndash Choice of formats ndash Migration

TRAC

bull Starts with ldquoa mission to provide reliable long-term access to managed digital resources to its designated community now and into the futurerdquo

bull Encompassesndash Organizational Infrastructurendash Digital Object Managementndash Technical Infrastructure

TRAC (2)

bull Borrows vocabulary from OAISbull Adapts ideas for applying criteria from nestor

and Digital Curation Centrendash Documentation (evidence)ndash Transparencyndash Adequacyndash Measurability

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preserve Content

Where does access come in

bull Some level of access is necessaryndash Management integrity

bull What is preserved may not be what is most useful to the end user

bull Implications across the repository

Content formats

bull Can the content you are preserving be delivered over the Webndash Will you be storing derivative files

ndash Is some kind of transformation needed

ndash Do the files offer consistent functionality

bull Implications for scale of repository access systems changes to services

bull In HathiTrustndash Limited to 3 formats largely uniform in technical characteristics

bull ITU G4 TIFFbull JPEG2000bull Unicode (with and without coordinates)

Storage of information about content

bull Is information about object adequately available for both preservation and accessndash Structural informationndash Preservation information with implications for

interfacebull HathiTrust uses METS as a wrapperndash Available for preservation and access

Content Package

imagesSource METStext

HTMETS

Zip

Architecture

imagesSource METStext

HTMETS

uc1pairtree_rootb3543486b34543486

b34543486zip

b34543486metsxml

Storage

bull Does the storage system support needs for ingest and access

bull In HathiTrustndash Need to have fast access to repository systems to

support services

Security

bull Data Integrityndash Checksum validation digital object provenance

bull Physical securityndash Biometric door systems locked racks

bull Network securityndash Firewalling vulnerability scanning

bull Application securityndash Developer best practices input validation

bull Access controlhellip

Differential access to content

bull Rights databasendash Ensures appropriate access

bull Holdings databasendash Facilitates lawful uses of materials

AuthenticationAuthorization

bull Mechanisms to enable differential access ensure security and appropriate use

User services

bull Bibliographic and full-text search indexesbull Collection-building capabilitiesbull User interfaces

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 2: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

Digital repositories

bull Primary mission to preserve contentbull Performs actions to this end

Reasons to preserve content

bull For accessbull Guard against threats to contentndash Digitization accepted method of preservation

reformattingndash Digital deteriorates is fragile

Reasons to provide access

bull Meet needs of designated communitybull Check on integrity of contentbull Content that is accessible is more likely to be

valued and preserved in the future

Reasons access might not be offered

bull Copyrightbull Privacybull Licensingbull Needs of user communityndash Content available elsewhere

bull Technical limitationsndash Networking and storage requirements

A number of models

bull Full user access to preserved digital objectsbull No end-user access to digital objectsbull Delayed or triggered user access to digital

objectsbull Partial access to digital objects

Requirements to preserve content

bull OAISndash ldquoAn OAIS is an Archive consisting of an

organizationof people and systems that has accepted the responsibility to preserve information and make it available for a Designated Communityrdquo [does not imply unrestricted access]

OAIS

bull Support information modelndash Define target of preservation (content data and representation

information)

ndash Define metadata needed to preserve identify contextualize information (PDI)

bull Fulfill responsibilitiesndash Accept information from Producers

ndash Obtain control sufficient to preserve

ndash Ensure understandable to designated community

ndash Ensure preservation

ndash Make available to designated community with information supporting authenticity

Ensure preservation

bull Some strategiesndash Transformationndash Validationndash Checks on integrityndash Replicationndash Choice of formats ndash Migration

TRAC

bull Starts with ldquoa mission to provide reliable long-term access to managed digital resources to its designated community now and into the futurerdquo

bull Encompassesndash Organizational Infrastructurendash Digital Object Managementndash Technical Infrastructure

TRAC (2)

bull Borrows vocabulary from OAISbull Adapts ideas for applying criteria from nestor

and Digital Curation Centrendash Documentation (evidence)ndash Transparencyndash Adequacyndash Measurability

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preserve Content

Where does access come in

bull Some level of access is necessaryndash Management integrity

bull What is preserved may not be what is most useful to the end user

bull Implications across the repository

Content formats

bull Can the content you are preserving be delivered over the Webndash Will you be storing derivative files

ndash Is some kind of transformation needed

ndash Do the files offer consistent functionality

bull Implications for scale of repository access systems changes to services

bull In HathiTrustndash Limited to 3 formats largely uniform in technical characteristics

bull ITU G4 TIFFbull JPEG2000bull Unicode (with and without coordinates)

Storage of information about content

bull Is information about object adequately available for both preservation and accessndash Structural informationndash Preservation information with implications for

interfacebull HathiTrust uses METS as a wrapperndash Available for preservation and access

Content Package

imagesSource METStext

HTMETS

Zip

Architecture

imagesSource METStext

HTMETS

uc1pairtree_rootb3543486b34543486

b34543486zip

b34543486metsxml

Storage

bull Does the storage system support needs for ingest and access

bull In HathiTrustndash Need to have fast access to repository systems to

support services

Security

bull Data Integrityndash Checksum validation digital object provenance

bull Physical securityndash Biometric door systems locked racks

bull Network securityndash Firewalling vulnerability scanning

bull Application securityndash Developer best practices input validation

bull Access controlhellip

Differential access to content

bull Rights databasendash Ensures appropriate access

bull Holdings databasendash Facilitates lawful uses of materials

AuthenticationAuthorization

bull Mechanisms to enable differential access ensure security and appropriate use

User services

bull Bibliographic and full-text search indexesbull Collection-building capabilitiesbull User interfaces

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 3: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

Reasons to preserve content

bull For accessbull Guard against threats to contentndash Digitization accepted method of preservation

reformattingndash Digital deteriorates is fragile

Reasons to provide access

bull Meet needs of designated communitybull Check on integrity of contentbull Content that is accessible is more likely to be

valued and preserved in the future

Reasons access might not be offered

bull Copyrightbull Privacybull Licensingbull Needs of user communityndash Content available elsewhere

bull Technical limitationsndash Networking and storage requirements

A number of models

bull Full user access to preserved digital objectsbull No end-user access to digital objectsbull Delayed or triggered user access to digital

objectsbull Partial access to digital objects

Requirements to preserve content

bull OAISndash ldquoAn OAIS is an Archive consisting of an

organizationof people and systems that has accepted the responsibility to preserve information and make it available for a Designated Communityrdquo [does not imply unrestricted access]

OAIS

bull Support information modelndash Define target of preservation (content data and representation

information)

ndash Define metadata needed to preserve identify contextualize information (PDI)

bull Fulfill responsibilitiesndash Accept information from Producers

ndash Obtain control sufficient to preserve

ndash Ensure understandable to designated community

ndash Ensure preservation

ndash Make available to designated community with information supporting authenticity

Ensure preservation

bull Some strategiesndash Transformationndash Validationndash Checks on integrityndash Replicationndash Choice of formats ndash Migration

TRAC

bull Starts with ldquoa mission to provide reliable long-term access to managed digital resources to its designated community now and into the futurerdquo

bull Encompassesndash Organizational Infrastructurendash Digital Object Managementndash Technical Infrastructure

TRAC (2)

bull Borrows vocabulary from OAISbull Adapts ideas for applying criteria from nestor

and Digital Curation Centrendash Documentation (evidence)ndash Transparencyndash Adequacyndash Measurability

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preserve Content

Where does access come in

bull Some level of access is necessaryndash Management integrity

bull What is preserved may not be what is most useful to the end user

bull Implications across the repository

Content formats

bull Can the content you are preserving be delivered over the Webndash Will you be storing derivative files

ndash Is some kind of transformation needed

ndash Do the files offer consistent functionality

bull Implications for scale of repository access systems changes to services

bull In HathiTrustndash Limited to 3 formats largely uniform in technical characteristics

bull ITU G4 TIFFbull JPEG2000bull Unicode (with and without coordinates)

Storage of information about content

bull Is information about object adequately available for both preservation and accessndash Structural informationndash Preservation information with implications for

interfacebull HathiTrust uses METS as a wrapperndash Available for preservation and access

Content Package

imagesSource METStext

HTMETS

Zip

Architecture

imagesSource METStext

HTMETS

uc1pairtree_rootb3543486b34543486

b34543486zip

b34543486metsxml

Storage

bull Does the storage system support needs for ingest and access

bull In HathiTrustndash Need to have fast access to repository systems to

support services

Security

bull Data Integrityndash Checksum validation digital object provenance

bull Physical securityndash Biometric door systems locked racks

bull Network securityndash Firewalling vulnerability scanning

bull Application securityndash Developer best practices input validation

bull Access controlhellip

Differential access to content

bull Rights databasendash Ensures appropriate access

bull Holdings databasendash Facilitates lawful uses of materials

AuthenticationAuthorization

bull Mechanisms to enable differential access ensure security and appropriate use

User services

bull Bibliographic and full-text search indexesbull Collection-building capabilitiesbull User interfaces

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 4: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

Reasons to provide access

bull Meet needs of designated communitybull Check on integrity of contentbull Content that is accessible is more likely to be

valued and preserved in the future

Reasons access might not be offered

bull Copyrightbull Privacybull Licensingbull Needs of user communityndash Content available elsewhere

bull Technical limitationsndash Networking and storage requirements

A number of models

bull Full user access to preserved digital objectsbull No end-user access to digital objectsbull Delayed or triggered user access to digital

objectsbull Partial access to digital objects

Requirements to preserve content

bull OAISndash ldquoAn OAIS is an Archive consisting of an

organizationof people and systems that has accepted the responsibility to preserve information and make it available for a Designated Communityrdquo [does not imply unrestricted access]

OAIS

bull Support information modelndash Define target of preservation (content data and representation

information)

ndash Define metadata needed to preserve identify contextualize information (PDI)

bull Fulfill responsibilitiesndash Accept information from Producers

ndash Obtain control sufficient to preserve

ndash Ensure understandable to designated community

ndash Ensure preservation

ndash Make available to designated community with information supporting authenticity

Ensure preservation

bull Some strategiesndash Transformationndash Validationndash Checks on integrityndash Replicationndash Choice of formats ndash Migration

TRAC

bull Starts with ldquoa mission to provide reliable long-term access to managed digital resources to its designated community now and into the futurerdquo

bull Encompassesndash Organizational Infrastructurendash Digital Object Managementndash Technical Infrastructure

TRAC (2)

bull Borrows vocabulary from OAISbull Adapts ideas for applying criteria from nestor

and Digital Curation Centrendash Documentation (evidence)ndash Transparencyndash Adequacyndash Measurability

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preserve Content

Where does access come in

bull Some level of access is necessaryndash Management integrity

bull What is preserved may not be what is most useful to the end user

bull Implications across the repository

Content formats

bull Can the content you are preserving be delivered over the Webndash Will you be storing derivative files

ndash Is some kind of transformation needed

ndash Do the files offer consistent functionality

bull Implications for scale of repository access systems changes to services

bull In HathiTrustndash Limited to 3 formats largely uniform in technical characteristics

bull ITU G4 TIFFbull JPEG2000bull Unicode (with and without coordinates)

Storage of information about content

bull Is information about object adequately available for both preservation and accessndash Structural informationndash Preservation information with implications for

interfacebull HathiTrust uses METS as a wrapperndash Available for preservation and access

Content Package

imagesSource METStext

HTMETS

Zip

Architecture

imagesSource METStext

HTMETS

uc1pairtree_rootb3543486b34543486

b34543486zip

b34543486metsxml

Storage

bull Does the storage system support needs for ingest and access

bull In HathiTrustndash Need to have fast access to repository systems to

support services

Security

bull Data Integrityndash Checksum validation digital object provenance

bull Physical securityndash Biometric door systems locked racks

bull Network securityndash Firewalling vulnerability scanning

bull Application securityndash Developer best practices input validation

bull Access controlhellip

Differential access to content

bull Rights databasendash Ensures appropriate access

bull Holdings databasendash Facilitates lawful uses of materials

AuthenticationAuthorization

bull Mechanisms to enable differential access ensure security and appropriate use

User services

bull Bibliographic and full-text search indexesbull Collection-building capabilitiesbull User interfaces

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 5: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

Reasons access might not be offered

bull Copyrightbull Privacybull Licensingbull Needs of user communityndash Content available elsewhere

bull Technical limitationsndash Networking and storage requirements

A number of models

bull Full user access to preserved digital objectsbull No end-user access to digital objectsbull Delayed or triggered user access to digital

objectsbull Partial access to digital objects

Requirements to preserve content

bull OAISndash ldquoAn OAIS is an Archive consisting of an

organizationof people and systems that has accepted the responsibility to preserve information and make it available for a Designated Communityrdquo [does not imply unrestricted access]

OAIS

bull Support information modelndash Define target of preservation (content data and representation

information)

ndash Define metadata needed to preserve identify contextualize information (PDI)

bull Fulfill responsibilitiesndash Accept information from Producers

ndash Obtain control sufficient to preserve

ndash Ensure understandable to designated community

ndash Ensure preservation

ndash Make available to designated community with information supporting authenticity

Ensure preservation

bull Some strategiesndash Transformationndash Validationndash Checks on integrityndash Replicationndash Choice of formats ndash Migration

TRAC

bull Starts with ldquoa mission to provide reliable long-term access to managed digital resources to its designated community now and into the futurerdquo

bull Encompassesndash Organizational Infrastructurendash Digital Object Managementndash Technical Infrastructure

TRAC (2)

bull Borrows vocabulary from OAISbull Adapts ideas for applying criteria from nestor

and Digital Curation Centrendash Documentation (evidence)ndash Transparencyndash Adequacyndash Measurability

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preserve Content

Where does access come in

bull Some level of access is necessaryndash Management integrity

bull What is preserved may not be what is most useful to the end user

bull Implications across the repository

Content formats

bull Can the content you are preserving be delivered over the Webndash Will you be storing derivative files

ndash Is some kind of transformation needed

ndash Do the files offer consistent functionality

bull Implications for scale of repository access systems changes to services

bull In HathiTrustndash Limited to 3 formats largely uniform in technical characteristics

bull ITU G4 TIFFbull JPEG2000bull Unicode (with and without coordinates)

Storage of information about content

bull Is information about object adequately available for both preservation and accessndash Structural informationndash Preservation information with implications for

interfacebull HathiTrust uses METS as a wrapperndash Available for preservation and access

Content Package

imagesSource METStext

HTMETS

Zip

Architecture

imagesSource METStext

HTMETS

uc1pairtree_rootb3543486b34543486

b34543486zip

b34543486metsxml

Storage

bull Does the storage system support needs for ingest and access

bull In HathiTrustndash Need to have fast access to repository systems to

support services

Security

bull Data Integrityndash Checksum validation digital object provenance

bull Physical securityndash Biometric door systems locked racks

bull Network securityndash Firewalling vulnerability scanning

bull Application securityndash Developer best practices input validation

bull Access controlhellip

Differential access to content

bull Rights databasendash Ensures appropriate access

bull Holdings databasendash Facilitates lawful uses of materials

AuthenticationAuthorization

bull Mechanisms to enable differential access ensure security and appropriate use

User services

bull Bibliographic and full-text search indexesbull Collection-building capabilitiesbull User interfaces

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 6: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

A number of models

bull Full user access to preserved digital objectsbull No end-user access to digital objectsbull Delayed or triggered user access to digital

objectsbull Partial access to digital objects

Requirements to preserve content

bull OAISndash ldquoAn OAIS is an Archive consisting of an

organizationof people and systems that has accepted the responsibility to preserve information and make it available for a Designated Communityrdquo [does not imply unrestricted access]

OAIS

bull Support information modelndash Define target of preservation (content data and representation

information)

ndash Define metadata needed to preserve identify contextualize information (PDI)

bull Fulfill responsibilitiesndash Accept information from Producers

ndash Obtain control sufficient to preserve

ndash Ensure understandable to designated community

ndash Ensure preservation

ndash Make available to designated community with information supporting authenticity

Ensure preservation

bull Some strategiesndash Transformationndash Validationndash Checks on integrityndash Replicationndash Choice of formats ndash Migration

TRAC

bull Starts with ldquoa mission to provide reliable long-term access to managed digital resources to its designated community now and into the futurerdquo

bull Encompassesndash Organizational Infrastructurendash Digital Object Managementndash Technical Infrastructure

TRAC (2)

bull Borrows vocabulary from OAISbull Adapts ideas for applying criteria from nestor

and Digital Curation Centrendash Documentation (evidence)ndash Transparencyndash Adequacyndash Measurability

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preserve Content

Where does access come in

bull Some level of access is necessaryndash Management integrity

bull What is preserved may not be what is most useful to the end user

bull Implications across the repository

Content formats

bull Can the content you are preserving be delivered over the Webndash Will you be storing derivative files

ndash Is some kind of transformation needed

ndash Do the files offer consistent functionality

bull Implications for scale of repository access systems changes to services

bull In HathiTrustndash Limited to 3 formats largely uniform in technical characteristics

bull ITU G4 TIFFbull JPEG2000bull Unicode (with and without coordinates)

Storage of information about content

bull Is information about object adequately available for both preservation and accessndash Structural informationndash Preservation information with implications for

interfacebull HathiTrust uses METS as a wrapperndash Available for preservation and access

Content Package

imagesSource METStext

HTMETS

Zip

Architecture

imagesSource METStext

HTMETS

uc1pairtree_rootb3543486b34543486

b34543486zip

b34543486metsxml

Storage

bull Does the storage system support needs for ingest and access

bull In HathiTrustndash Need to have fast access to repository systems to

support services

Security

bull Data Integrityndash Checksum validation digital object provenance

bull Physical securityndash Biometric door systems locked racks

bull Network securityndash Firewalling vulnerability scanning

bull Application securityndash Developer best practices input validation

bull Access controlhellip

Differential access to content

bull Rights databasendash Ensures appropriate access

bull Holdings databasendash Facilitates lawful uses of materials

AuthenticationAuthorization

bull Mechanisms to enable differential access ensure security and appropriate use

User services

bull Bibliographic and full-text search indexesbull Collection-building capabilitiesbull User interfaces

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 7: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

Requirements to preserve content

bull OAISndash ldquoAn OAIS is an Archive consisting of an

organizationof people and systems that has accepted the responsibility to preserve information and make it available for a Designated Communityrdquo [does not imply unrestricted access]

OAIS

bull Support information modelndash Define target of preservation (content data and representation

information)

ndash Define metadata needed to preserve identify contextualize information (PDI)

bull Fulfill responsibilitiesndash Accept information from Producers

ndash Obtain control sufficient to preserve

ndash Ensure understandable to designated community

ndash Ensure preservation

ndash Make available to designated community with information supporting authenticity

Ensure preservation

bull Some strategiesndash Transformationndash Validationndash Checks on integrityndash Replicationndash Choice of formats ndash Migration

TRAC

bull Starts with ldquoa mission to provide reliable long-term access to managed digital resources to its designated community now and into the futurerdquo

bull Encompassesndash Organizational Infrastructurendash Digital Object Managementndash Technical Infrastructure

TRAC (2)

bull Borrows vocabulary from OAISbull Adapts ideas for applying criteria from nestor

and Digital Curation Centrendash Documentation (evidence)ndash Transparencyndash Adequacyndash Measurability

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preserve Content

Where does access come in

bull Some level of access is necessaryndash Management integrity

bull What is preserved may not be what is most useful to the end user

bull Implications across the repository

Content formats

bull Can the content you are preserving be delivered over the Webndash Will you be storing derivative files

ndash Is some kind of transformation needed

ndash Do the files offer consistent functionality

bull Implications for scale of repository access systems changes to services

bull In HathiTrustndash Limited to 3 formats largely uniform in technical characteristics

bull ITU G4 TIFFbull JPEG2000bull Unicode (with and without coordinates)

Storage of information about content

bull Is information about object adequately available for both preservation and accessndash Structural informationndash Preservation information with implications for

interfacebull HathiTrust uses METS as a wrapperndash Available for preservation and access

Content Package

imagesSource METStext

HTMETS

Zip

Architecture

imagesSource METStext

HTMETS

uc1pairtree_rootb3543486b34543486

b34543486zip

b34543486metsxml

Storage

bull Does the storage system support needs for ingest and access

bull In HathiTrustndash Need to have fast access to repository systems to

support services

Security

bull Data Integrityndash Checksum validation digital object provenance

bull Physical securityndash Biometric door systems locked racks

bull Network securityndash Firewalling vulnerability scanning

bull Application securityndash Developer best practices input validation

bull Access controlhellip

Differential access to content

bull Rights databasendash Ensures appropriate access

bull Holdings databasendash Facilitates lawful uses of materials

AuthenticationAuthorization

bull Mechanisms to enable differential access ensure security and appropriate use

User services

bull Bibliographic and full-text search indexesbull Collection-building capabilitiesbull User interfaces

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 8: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

OAIS

bull Support information modelndash Define target of preservation (content data and representation

information)

ndash Define metadata needed to preserve identify contextualize information (PDI)

bull Fulfill responsibilitiesndash Accept information from Producers

ndash Obtain control sufficient to preserve

ndash Ensure understandable to designated community

ndash Ensure preservation

ndash Make available to designated community with information supporting authenticity

Ensure preservation

bull Some strategiesndash Transformationndash Validationndash Checks on integrityndash Replicationndash Choice of formats ndash Migration

TRAC

bull Starts with ldquoa mission to provide reliable long-term access to managed digital resources to its designated community now and into the futurerdquo

bull Encompassesndash Organizational Infrastructurendash Digital Object Managementndash Technical Infrastructure

TRAC (2)

bull Borrows vocabulary from OAISbull Adapts ideas for applying criteria from nestor

and Digital Curation Centrendash Documentation (evidence)ndash Transparencyndash Adequacyndash Measurability

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preserve Content

Where does access come in

bull Some level of access is necessaryndash Management integrity

bull What is preserved may not be what is most useful to the end user

bull Implications across the repository

Content formats

bull Can the content you are preserving be delivered over the Webndash Will you be storing derivative files

ndash Is some kind of transformation needed

ndash Do the files offer consistent functionality

bull Implications for scale of repository access systems changes to services

bull In HathiTrustndash Limited to 3 formats largely uniform in technical characteristics

bull ITU G4 TIFFbull JPEG2000bull Unicode (with and without coordinates)

Storage of information about content

bull Is information about object adequately available for both preservation and accessndash Structural informationndash Preservation information with implications for

interfacebull HathiTrust uses METS as a wrapperndash Available for preservation and access

Content Package

imagesSource METStext

HTMETS

Zip

Architecture

imagesSource METStext

HTMETS

uc1pairtree_rootb3543486b34543486

b34543486zip

b34543486metsxml

Storage

bull Does the storage system support needs for ingest and access

bull In HathiTrustndash Need to have fast access to repository systems to

support services

Security

bull Data Integrityndash Checksum validation digital object provenance

bull Physical securityndash Biometric door systems locked racks

bull Network securityndash Firewalling vulnerability scanning

bull Application securityndash Developer best practices input validation

bull Access controlhellip

Differential access to content

bull Rights databasendash Ensures appropriate access

bull Holdings databasendash Facilitates lawful uses of materials

AuthenticationAuthorization

bull Mechanisms to enable differential access ensure security and appropriate use

User services

bull Bibliographic and full-text search indexesbull Collection-building capabilitiesbull User interfaces

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 9: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

Ensure preservation

bull Some strategiesndash Transformationndash Validationndash Checks on integrityndash Replicationndash Choice of formats ndash Migration

TRAC

bull Starts with ldquoa mission to provide reliable long-term access to managed digital resources to its designated community now and into the futurerdquo

bull Encompassesndash Organizational Infrastructurendash Digital Object Managementndash Technical Infrastructure

TRAC (2)

bull Borrows vocabulary from OAISbull Adapts ideas for applying criteria from nestor

and Digital Curation Centrendash Documentation (evidence)ndash Transparencyndash Adequacyndash Measurability

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preserve Content

Where does access come in

bull Some level of access is necessaryndash Management integrity

bull What is preserved may not be what is most useful to the end user

bull Implications across the repository

Content formats

bull Can the content you are preserving be delivered over the Webndash Will you be storing derivative files

ndash Is some kind of transformation needed

ndash Do the files offer consistent functionality

bull Implications for scale of repository access systems changes to services

bull In HathiTrustndash Limited to 3 formats largely uniform in technical characteristics

bull ITU G4 TIFFbull JPEG2000bull Unicode (with and without coordinates)

Storage of information about content

bull Is information about object adequately available for both preservation and accessndash Structural informationndash Preservation information with implications for

interfacebull HathiTrust uses METS as a wrapperndash Available for preservation and access

Content Package

imagesSource METStext

HTMETS

Zip

Architecture

imagesSource METStext

HTMETS

uc1pairtree_rootb3543486b34543486

b34543486zip

b34543486metsxml

Storage

bull Does the storage system support needs for ingest and access

bull In HathiTrustndash Need to have fast access to repository systems to

support services

Security

bull Data Integrityndash Checksum validation digital object provenance

bull Physical securityndash Biometric door systems locked racks

bull Network securityndash Firewalling vulnerability scanning

bull Application securityndash Developer best practices input validation

bull Access controlhellip

Differential access to content

bull Rights databasendash Ensures appropriate access

bull Holdings databasendash Facilitates lawful uses of materials

AuthenticationAuthorization

bull Mechanisms to enable differential access ensure security and appropriate use

User services

bull Bibliographic and full-text search indexesbull Collection-building capabilitiesbull User interfaces

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 10: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

TRAC

bull Starts with ldquoa mission to provide reliable long-term access to managed digital resources to its designated community now and into the futurerdquo

bull Encompassesndash Organizational Infrastructurendash Digital Object Managementndash Technical Infrastructure

TRAC (2)

bull Borrows vocabulary from OAISbull Adapts ideas for applying criteria from nestor

and Digital Curation Centrendash Documentation (evidence)ndash Transparencyndash Adequacyndash Measurability

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preserve Content

Where does access come in

bull Some level of access is necessaryndash Management integrity

bull What is preserved may not be what is most useful to the end user

bull Implications across the repository

Content formats

bull Can the content you are preserving be delivered over the Webndash Will you be storing derivative files

ndash Is some kind of transformation needed

ndash Do the files offer consistent functionality

bull Implications for scale of repository access systems changes to services

bull In HathiTrustndash Limited to 3 formats largely uniform in technical characteristics

bull ITU G4 TIFFbull JPEG2000bull Unicode (with and without coordinates)

Storage of information about content

bull Is information about object adequately available for both preservation and accessndash Structural informationndash Preservation information with implications for

interfacebull HathiTrust uses METS as a wrapperndash Available for preservation and access

Content Package

imagesSource METStext

HTMETS

Zip

Architecture

imagesSource METStext

HTMETS

uc1pairtree_rootb3543486b34543486

b34543486zip

b34543486metsxml

Storage

bull Does the storage system support needs for ingest and access

bull In HathiTrustndash Need to have fast access to repository systems to

support services

Security

bull Data Integrityndash Checksum validation digital object provenance

bull Physical securityndash Biometric door systems locked racks

bull Network securityndash Firewalling vulnerability scanning

bull Application securityndash Developer best practices input validation

bull Access controlhellip

Differential access to content

bull Rights databasendash Ensures appropriate access

bull Holdings databasendash Facilitates lawful uses of materials

AuthenticationAuthorization

bull Mechanisms to enable differential access ensure security and appropriate use

User services

bull Bibliographic and full-text search indexesbull Collection-building capabilitiesbull User interfaces

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 11: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

TRAC (2)

bull Borrows vocabulary from OAISbull Adapts ideas for applying criteria from nestor

and Digital Curation Centrendash Documentation (evidence)ndash Transparencyndash Adequacyndash Measurability

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preserve Content

Where does access come in

bull Some level of access is necessaryndash Management integrity

bull What is preserved may not be what is most useful to the end user

bull Implications across the repository

Content formats

bull Can the content you are preserving be delivered over the Webndash Will you be storing derivative files

ndash Is some kind of transformation needed

ndash Do the files offer consistent functionality

bull Implications for scale of repository access systems changes to services

bull In HathiTrustndash Limited to 3 formats largely uniform in technical characteristics

bull ITU G4 TIFFbull JPEG2000bull Unicode (with and without coordinates)

Storage of information about content

bull Is information about object adequately available for both preservation and accessndash Structural informationndash Preservation information with implications for

interfacebull HathiTrust uses METS as a wrapperndash Available for preservation and access

Content Package

imagesSource METStext

HTMETS

Zip

Architecture

imagesSource METStext

HTMETS

uc1pairtree_rootb3543486b34543486

b34543486zip

b34543486metsxml

Storage

bull Does the storage system support needs for ingest and access

bull In HathiTrustndash Need to have fast access to repository systems to

support services

Security

bull Data Integrityndash Checksum validation digital object provenance

bull Physical securityndash Biometric door systems locked racks

bull Network securityndash Firewalling vulnerability scanning

bull Application securityndash Developer best practices input validation

bull Access controlhellip

Differential access to content

bull Rights databasendash Ensures appropriate access

bull Holdings databasendash Facilitates lawful uses of materials

AuthenticationAuthorization

bull Mechanisms to enable differential access ensure security and appropriate use

User services

bull Bibliographic and full-text search indexesbull Collection-building capabilitiesbull User interfaces

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 12: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preserve Content

Where does access come in

bull Some level of access is necessaryndash Management integrity

bull What is preserved may not be what is most useful to the end user

bull Implications across the repository

Content formats

bull Can the content you are preserving be delivered over the Webndash Will you be storing derivative files

ndash Is some kind of transformation needed

ndash Do the files offer consistent functionality

bull Implications for scale of repository access systems changes to services

bull In HathiTrustndash Limited to 3 formats largely uniform in technical characteristics

bull ITU G4 TIFFbull JPEG2000bull Unicode (with and without coordinates)

Storage of information about content

bull Is information about object adequately available for both preservation and accessndash Structural informationndash Preservation information with implications for

interfacebull HathiTrust uses METS as a wrapperndash Available for preservation and access

Content Package

imagesSource METStext

HTMETS

Zip

Architecture

imagesSource METStext

HTMETS

uc1pairtree_rootb3543486b34543486

b34543486zip

b34543486metsxml

Storage

bull Does the storage system support needs for ingest and access

bull In HathiTrustndash Need to have fast access to repository systems to

support services

Security

bull Data Integrityndash Checksum validation digital object provenance

bull Physical securityndash Biometric door systems locked racks

bull Network securityndash Firewalling vulnerability scanning

bull Application securityndash Developer best practices input validation

bull Access controlhellip

Differential access to content

bull Rights databasendash Ensures appropriate access

bull Holdings databasendash Facilitates lawful uses of materials

AuthenticationAuthorization

bull Mechanisms to enable differential access ensure security and appropriate use

User services

bull Bibliographic and full-text search indexesbull Collection-building capabilitiesbull User interfaces

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 13: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

Where does access come in

bull Some level of access is necessaryndash Management integrity

bull What is preserved may not be what is most useful to the end user

bull Implications across the repository

Content formats

bull Can the content you are preserving be delivered over the Webndash Will you be storing derivative files

ndash Is some kind of transformation needed

ndash Do the files offer consistent functionality

bull Implications for scale of repository access systems changes to services

bull In HathiTrustndash Limited to 3 formats largely uniform in technical characteristics

bull ITU G4 TIFFbull JPEG2000bull Unicode (with and without coordinates)

Storage of information about content

bull Is information about object adequately available for both preservation and accessndash Structural informationndash Preservation information with implications for

interfacebull HathiTrust uses METS as a wrapperndash Available for preservation and access

Content Package

imagesSource METStext

HTMETS

Zip

Architecture

imagesSource METStext

HTMETS

uc1pairtree_rootb3543486b34543486

b34543486zip

b34543486metsxml

Storage

bull Does the storage system support needs for ingest and access

bull In HathiTrustndash Need to have fast access to repository systems to

support services

Security

bull Data Integrityndash Checksum validation digital object provenance

bull Physical securityndash Biometric door systems locked racks

bull Network securityndash Firewalling vulnerability scanning

bull Application securityndash Developer best practices input validation

bull Access controlhellip

Differential access to content

bull Rights databasendash Ensures appropriate access

bull Holdings databasendash Facilitates lawful uses of materials

AuthenticationAuthorization

bull Mechanisms to enable differential access ensure security and appropriate use

User services

bull Bibliographic and full-text search indexesbull Collection-building capabilitiesbull User interfaces

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 14: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

Content formats

bull Can the content you are preserving be delivered over the Webndash Will you be storing derivative files

ndash Is some kind of transformation needed

ndash Do the files offer consistent functionality

bull Implications for scale of repository access systems changes to services

bull In HathiTrustndash Limited to 3 formats largely uniform in technical characteristics

bull ITU G4 TIFFbull JPEG2000bull Unicode (with and without coordinates)

Storage of information about content

bull Is information about object adequately available for both preservation and accessndash Structural informationndash Preservation information with implications for

interfacebull HathiTrust uses METS as a wrapperndash Available for preservation and access

Content Package

imagesSource METStext

HTMETS

Zip

Architecture

imagesSource METStext

HTMETS

uc1pairtree_rootb3543486b34543486

b34543486zip

b34543486metsxml

Storage

bull Does the storage system support needs for ingest and access

bull In HathiTrustndash Need to have fast access to repository systems to

support services

Security

bull Data Integrityndash Checksum validation digital object provenance

bull Physical securityndash Biometric door systems locked racks

bull Network securityndash Firewalling vulnerability scanning

bull Application securityndash Developer best practices input validation

bull Access controlhellip

Differential access to content

bull Rights databasendash Ensures appropriate access

bull Holdings databasendash Facilitates lawful uses of materials

AuthenticationAuthorization

bull Mechanisms to enable differential access ensure security and appropriate use

User services

bull Bibliographic and full-text search indexesbull Collection-building capabilitiesbull User interfaces

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 15: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

Storage of information about content

bull Is information about object adequately available for both preservation and accessndash Structural informationndash Preservation information with implications for

interfacebull HathiTrust uses METS as a wrapperndash Available for preservation and access

Content Package

imagesSource METStext

HTMETS

Zip

Architecture

imagesSource METStext

HTMETS

uc1pairtree_rootb3543486b34543486

b34543486zip

b34543486metsxml

Storage

bull Does the storage system support needs for ingest and access

bull In HathiTrustndash Need to have fast access to repository systems to

support services

Security

bull Data Integrityndash Checksum validation digital object provenance

bull Physical securityndash Biometric door systems locked racks

bull Network securityndash Firewalling vulnerability scanning

bull Application securityndash Developer best practices input validation

bull Access controlhellip

Differential access to content

bull Rights databasendash Ensures appropriate access

bull Holdings databasendash Facilitates lawful uses of materials

AuthenticationAuthorization

bull Mechanisms to enable differential access ensure security and appropriate use

User services

bull Bibliographic and full-text search indexesbull Collection-building capabilitiesbull User interfaces

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 16: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

Content Package

imagesSource METStext

HTMETS

Zip

Architecture

imagesSource METStext

HTMETS

uc1pairtree_rootb3543486b34543486

b34543486zip

b34543486metsxml

Storage

bull Does the storage system support needs for ingest and access

bull In HathiTrustndash Need to have fast access to repository systems to

support services

Security

bull Data Integrityndash Checksum validation digital object provenance

bull Physical securityndash Biometric door systems locked racks

bull Network securityndash Firewalling vulnerability scanning

bull Application securityndash Developer best practices input validation

bull Access controlhellip

Differential access to content

bull Rights databasendash Ensures appropriate access

bull Holdings databasendash Facilitates lawful uses of materials

AuthenticationAuthorization

bull Mechanisms to enable differential access ensure security and appropriate use

User services

bull Bibliographic and full-text search indexesbull Collection-building capabilitiesbull User interfaces

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 17: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

Architecture

imagesSource METStext

HTMETS

uc1pairtree_rootb3543486b34543486

b34543486zip

b34543486metsxml

Storage

bull Does the storage system support needs for ingest and access

bull In HathiTrustndash Need to have fast access to repository systems to

support services

Security

bull Data Integrityndash Checksum validation digital object provenance

bull Physical securityndash Biometric door systems locked racks

bull Network securityndash Firewalling vulnerability scanning

bull Application securityndash Developer best practices input validation

bull Access controlhellip

Differential access to content

bull Rights databasendash Ensures appropriate access

bull Holdings databasendash Facilitates lawful uses of materials

AuthenticationAuthorization

bull Mechanisms to enable differential access ensure security and appropriate use

User services

bull Bibliographic and full-text search indexesbull Collection-building capabilitiesbull User interfaces

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 18: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

Storage

bull Does the storage system support needs for ingest and access

bull In HathiTrustndash Need to have fast access to repository systems to

support services

Security

bull Data Integrityndash Checksum validation digital object provenance

bull Physical securityndash Biometric door systems locked racks

bull Network securityndash Firewalling vulnerability scanning

bull Application securityndash Developer best practices input validation

bull Access controlhellip

Differential access to content

bull Rights databasendash Ensures appropriate access

bull Holdings databasendash Facilitates lawful uses of materials

AuthenticationAuthorization

bull Mechanisms to enable differential access ensure security and appropriate use

User services

bull Bibliographic and full-text search indexesbull Collection-building capabilitiesbull User interfaces

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 19: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

Security

bull Data Integrityndash Checksum validation digital object provenance

bull Physical securityndash Biometric door systems locked racks

bull Network securityndash Firewalling vulnerability scanning

bull Application securityndash Developer best practices input validation

bull Access controlhellip

Differential access to content

bull Rights databasendash Ensures appropriate access

bull Holdings databasendash Facilitates lawful uses of materials

AuthenticationAuthorization

bull Mechanisms to enable differential access ensure security and appropriate use

User services

bull Bibliographic and full-text search indexesbull Collection-building capabilitiesbull User interfaces

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 20: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

Differential access to content

bull Rights databasendash Ensures appropriate access

bull Holdings databasendash Facilitates lawful uses of materials

AuthenticationAuthorization

bull Mechanisms to enable differential access ensure security and appropriate use

User services

bull Bibliographic and full-text search indexesbull Collection-building capabilitiesbull User interfaces

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 21: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

AuthenticationAuthorization

bull Mechanisms to enable differential access ensure security and appropriate use

User services

bull Bibliographic and full-text search indexesbull Collection-building capabilitiesbull User interfaces

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 22: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

User services

bull Bibliographic and full-text search indexesbull Collection-building capabilitiesbull User interfaces

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 23: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

APIs and Datasets

bull Data APIbull Bibliographic APIbull OAIbull ldquoHathifilesrdquobull Datasets

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 24: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

More

bull Qualitybull User Supportbull Correction

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 25: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

Provide Access

Content PackageContent Formats Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 26: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

Content PackageContent Formats

Architecture Storage

AuthenticationSecurity Authorization Differential Access

Services User InterfacesLawful Uses

APIs and Datasets

CopyrightAgreements

User Support

Indexes

CorrectionInformation Quality

OAIS TRAC

TransparencyDocumentation Adequacy Measurability

Provenance ContextReference Fixity

Access Rights

Designated Community

Mission

Organizational Infrastructure

Digital Object Management

Technical Infrastructure

Representation InformationContent Data Preservation

Actions

Authenticity ReliabilityIntegrity

Preservation

Access

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 27: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

Thank you

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more
Page 28: HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise

How to find out more

bull About httpwwwhathitrustorgaboutbull Twitter httptwittercomhathitrustbull Facebook httpwwwfacebookcomhathitrustbull Monthly newsletter ndash httpwwwhathitrustorgupdatesndash RSS httpwwwhathitrustorgupdates_rss

bull Contact us feedbackissueshathitrustorgbull Blogs httpwwwhathitrustorgblogsndash Large-scale Searchndash Perspectives from HathiTrust

  • Digital Repositories for Preservation and Access
  • Digital repositories
  • Reasons to preserve content
  • Reasons to provide access
  • Reasons access might not be offered
  • A number of models
  • Requirements to preserve content
  • OAIS
  • Ensure preservation
  • TRAC
  • TRAC (2)
  • Slide 12
  • Where does access come in
  • Content formats
  • Storage of information about content
  • Content Package
  • Architecture
  • Storage
  • Security
  • Differential access to content
  • AuthenticationAuthorization
  • User services
  • APIs and Datasets
  • More
  • Slide 25
  • Slide 26
  • Thank you
  • How to find out more