Digital Preservation of Geoscience Information
Smita Chandra, Librarian


Page 1: Digital preservation geoscinfo

Digital Preservation of Geoscience Information

Smita Chandra
Librarian

Page 2: Digital preservation geoscinfo


Page 3: Digital preservation geoscinfo

Importance of Digital Information Preservation

  • 1975: The USA sent two Viking space probes to Mars.
  • The data, generated by an unrepeatable mission, cost $1 billion.
  • The data recorded on magnetic tapes was corrupted or unidentifiable after two decades, despite being kept in a climate-controlled environment.
  • Scientists could not access the data because they were unable to decode the formats used.

Page 4: Digital preservation geoscinfo

Importance of Digital Information Preservation

  • The original format developers were no longer alive.
  • Eventually, old printouts were tracked down and retyped.
  • NASA is therefore the biggest supporter of digital preservation projects.
  • This illustrates the wide gap between information generation and its management.

Page 5: Digital preservation geoscinfo

Outline of Presentation

  • Digital information: forms and types
  • Geoscience information
  • Institutional Repositories (IR)
  • Digital Preservation (DP); strategies for DP
  • OAIS model & its implementation
  • Indian scenario
  • Research proposal & expected results

Page 6: Digital preservation geoscinfo

Digital Information

  • Information in digital form
    • Born Digital
    • Converted from Analog
  • Types of Digital Information
    • Electronic Publications
    • Organizational and Personal Records
    • Data
    • Learning Objects like articles, books
    • Software Tools
    • Unpublished Materials
    • Electronic Manuscripts
    • Entertainment Products
    • Images (digitally designed or digitized)
    • Websites

Page 7: Digital preservation geoscinfo

Threats

  • Media decay and failure
    • Massive storage failures, outdated media
  • Access Component Obsolescence
    • Outdated formats, applications & systems
  • Human and Software errors & External Events

Page 8: Digital preservation geoscinfo

Information Deluge: Present & Future Projections

  • Yawning gap between
    • Our ability to create digital information
    • Our infrastructure and capacity to manage and preserve it over time
  • Cumulative effect foreseen as future “digital dark ages”

Page 9: Digital preservation geoscinfo

Need for Digital Preservation

  • Preserving natural and cultural heritage
  • Promoting academic research
  • Enabling public access to legacy collections

Page 10: Digital preservation geoscinfo

Geoscience Information

  • Encompasses a complex human-natural system
  • Storehouse of massive heterogeneous data sets and a wide variety of content and data types, which reflect the features of the various research fields
  • Every content holder aims at the needs of its particular community and works independently, with only loose collaboration and integration
  • Every content holder has its own digital archive system with an individual data structure, management policy and search interface; however, these systems cannot transparently transform and integrate data with each other
  • Enabling and improving interoperability for heterogeneous collections is important

Source: Loudon, T.V. Geoscience after IT: Part A & Part B. Computers & Geosciences, 2000, 26(3A), A1-13.

Page 11: Digital preservation geoscinfo

Institutional Repositories (1)

Definition: An institute-based repository is a set of services that an academic institution offers to the members of its community for the management and dissemination of digital materials created by the institution and its community members.

Source: Clifford A. Lynch (February 2003), “Institutional Repositories: Essential Infrastructure for Scholarship in the Digital Age” ARL Bimonthly Report 226: 1-7. http://www.arl.org/newsltr/226/ir.html

Page 12: Digital preservation geoscinfo

Institutional Repositories (2)

Main Objectives

  • to create global visibility for an institution's scholarly research;
  • to collect content at a single location;
  • to provide open access to institutional research output by self-archiving it;
  • to store and preserve other institutional digital assets, including unpublished or otherwise easily lost ("grey") literature (e.g., theses or technical reports).

Page 13: Digital preservation geoscinfo

Institutional Repositories (3)

IR Software

  • DSpace (dspace.mit.edu)
  • Eprints.org

Subject Specific IRs

  • arXiv (www.arXiv.org)
  • RePEc (Research Papers in Economics) (www.repec.org)
  • CogPrints (www.cogprints.org)
  • NASA Technical Report Server (ntrs.nasa.gov)
  • Networked Computer Science Technical Reference Library (www.ncstrl.org)

Page 14: Digital preservation geoscinfo

Institutional Repositories (4)

  • An IR is a model for a preservation system
  • It requires “most essentially an organizational commitment to the stewardship of … digital materials, including long-term preservation where appropriate, as well as organization and access or distribution”
  • Attributes of a “Trusted Digital Repository”

“… an organisation that has responsibility for the long-term maintenance of digital resources, as well as making them available [through time and across changing technologies] to communities agreed on by the depositor and the repository.”

Research Libraries Group
http://www.rlg.org/longterm/attributes01.pdf

Page 15: Digital preservation geoscinfo

Definition: Digital Preservation

The maintenance of digital materials over the long term, with a view to ensuring their continued accessibility. It ensures that digital resources are stored correctly and maintained adequately in the online world, such that they are consistently available for use over time.

“Long-term” includes timescales of decades or even centuries

Page 16: Digital preservation geoscinfo

Preservation Strategies

  • Technology preservation
    • Keep the hardware alive
  • Technology emulation
    • Create an environment able to run the existing software
  • Data migration
    • Convert data to new formats to run in new applications
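As a rough illustration of the data migration strategy listed above, the following Python sketch converts fixed-width records into JSON. The record layout, field names and sample values are hypothetical, chosen only for illustration; they are not taken from any real geoscience archive.

```python
import json

# Hypothetical legacy layout (an assumption for illustration):
# station id in columns 0-7, latitude in 8-15, longitude in 16-23.
LEGACY_LAYOUT = {"station": (0, 8), "lat": (8, 16), "lon": (16, 24)}


def migrate_record(line):
    """Convert one fixed-width legacy record into a self-describing dict."""
    fields = {name: line[start:end].strip()
              for name, (start, end) in LEGACY_LAYOUT.items()}
    return {
        "station": fields["station"],
        "lat": float(fields["lat"]),
        "lon": float(fields["lon"]),
    }


def migrate(lines):
    """Migrate all non-empty legacy lines to a JSON document (a string)."""
    return json.dumps([migrate_record(l) for l in lines if l.strip()], indent=2)


# Made-up sample records in the hypothetical legacy layout.
legacy_lines = [
    "STN001   28.6139 77.2090",
    "STN002   19.0760 72.8777",
]

if __name__ == "__main__":
    print(migrate(legacy_lines))
```

The point of the sketch is that the migrated form names its own fields, so a future reader does not need the original layout documentation to interpret the values.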

Page 17: Digital preservation geoscinfo

Open Archival Information System (OAIS)

  • Published by the Consultative Committee for Space Data Systems (CCSDS) in 2002; ISO 14721:2003 standard
  • An archive consists of an organization of people and systems with responsibility to preserve information and make it available to users.

SIP = Submission Information Package
AIP = Archival Information Package
DIP = Dissemination Information Package
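The flow between the three package types can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not an OAIS implementation: the class fields and the `ingest` and `disseminate` function names are hypothetical, since OAIS is a reference model and prescribes no particular data structures.

```python
from dataclasses import dataclass, field


@dataclass
class SIP:
    """Submission Information Package: what a producer hands to the archive."""
    producer: str
    content: bytes


@dataclass
class AIP:
    """Archival Information Package: what the archive stores and preserves."""
    identifier: str
    content: bytes
    provenance: list = field(default_factory=list)


@dataclass
class DIP:
    """Dissemination Information Package: what a consumer receives."""
    identifier: str
    content: bytes


def ingest(sip, identifier):
    """Turn a submission into an archival package (hypothetical helper)."""
    return AIP(identifier=identifier,
               content=sip.content,
               provenance=[f"submitted by {sip.producer}"])


def disseminate(aip):
    """Derive a package for delivery to a consumer (hypothetical helper)."""
    return DIP(identifier=aip.identifier, content=aip.content)


if __name__ == "__main__":
    aip = ingest(SIP(producer="example-producer", content=b"survey data"), "aip-0001")
    print(disseminate(aip))
```

Keeping the three types separate reflects the model's distinction between what is submitted, what is preserved and what is delivered, even when the content bytes are identical.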

Page 18: Digital preservation geoscinfo

OAIS: Definitions

  • To define an Open Archival Information System
    • The term 'open' means that the document was developed in an open way, and does not imply that access to any OAIS should be unrestricted
    • An archive is defined as an "organization that intends to preserve information for access and use by a designated community." (p. 1-8)
    • While an OAIS itself need not be permanent, the information being maintained has been deemed to need "Long Term Preservation"
    • Long term = long enough for there to be a concern about the impact of changing technologies

Page 19: Digital preservation geoscinfo

OAIS: Purpose and Scope

  • Primary focus on digital information
  • Specific aims include:
    • A framework for the understanding and awareness of the archival concepts needed for long term preservation (access)
    • Terminology and concepts for describing and comparing:
      • Architectures and operations
      • Preservation strategies and techniques
      • Data models
    • Consensus on elements and processes for long term preservation
    • A foundation for other standards

Page 20: Digital preservation geoscinfo

OAIS: Applicability

  • Applicability:
    • Applicable to any archive, but mainly focused on organisations with responsibility for making information available for the long term
    • Of interest to those who create information
  • Conformance:
    • An OAIS must support the information model, but the standard does not specify any particular method of implementation
    • Mandatory responsibilities (section 3.1)

Page 21: Digital preservation geoscinfo

Implementing OAIS (1)

Summing up the fundamentals:

  • OAIS is a reference model (conceptual framework), NOT a blueprint for system design
  • It informs the design of system architectures, the development of systems and components
  • It provides common definitions of terms, a common language and means of making comparison
  • But it does NOT ensure consistency or interoperability between implementations

Page 22: Digital preservation geoscinfo

Implementing OAIS (2)

Page 23: Digital preservation geoscinfo

Implementing OAIS (3)

Page 24: Digital preservation geoscinfo

Implementing OAIS (4)

Page 25: Digital preservation geoscinfo

Summing Up: OAIS

  • The OAIS model is a foundation stone for current and future digital preservation efforts
  • It is already widely used to inform the development of preservation tools and repositories
  • It could be used in the future as a basis for conformance

Page 26: Digital preservation geoscinfo

Indian Scenario (1)

  • Open Digital Repository
    • Indian Institute of Science (http://etd.ncsi.ernet.in)
    • National Chemical Laboratory (http://dspace.ncl.res.in/dspace/index.jsp)
    • Indian Statistical Institute (http://library.isibang.ac.in:8080/dspace/index/jsp)
  • Social Science Data
    • The Census of India
    • M.S.Swaminathan Research Foundation
  • Museums and Art Galleries
    • Ministry of Culture, GOI
    • The National Archives

Page 27: Digital preservation geoscinfo

Indian Scenario (2)

Institute: Resource

  • Central Water Commission: Command area maps
  • National Bureau of Soil Survey and Soil Maps: Soil maps and land use data
  • Survey of India (SOI): Topographical maps, geodetic trigonometric and levelling data, gravity & geomagnetic data, GPS data, tidal data, repetitive geodetic & geophysical data
  • Geological Survey of India (GSI): Geological maps on various scales, geological and seismic data
  • National Remote Sensing Agency (NRSA): Satellite imagery, land use and wasteland maps on different scales
  • India Meteorological Department (IMD): Meteorological and seismic data
  • Ministry of Ocean Development (MOD): Oceanic data

Page 28: Digital preservation geoscinfo

Proposal for IRs in India

1. Providing adequate financial and technical resources for ensuring “digital preservation” in IRs
2. National Informatics Center (NIC) entrusted with framing guidelines and policy, or establishing a new agency, for handling digital preservation, for collaboration, sharing and avoiding duplication
3. Trusted Digital Repository for accurate and reliable information
4. Legally sustainable digital preservation policy
5. Joining the Digital Preservation Consortium
6. Attention to collection management of digital material in libraries
7. Amendment of the Delivery of Books Act and Press and Registration Act to cover the digital material
8. Training of manpower for the management and preservation of electronic records
9. Research in the area of digital preservation

Page 29: Digital preservation geoscinfo

Research Objectives

  • Testing a pilot IR in stand-alone mode
  • Implementing an OAIS-compliant layer on the IR, drawing upon best practices
  • Developing a preservation strategy and a custom-made model addressing issues like planning and policy for preservation, the role of different players in the process, IPR and copyright, etc.

Page 30: Digital preservation geoscinfo

Research Methodology

[Figure: flow diagram with the labels Analog Materials, Digitization Process, Digital Materials, Born, Converted, Material Selection Process, Institutional Repository, Digital Preservation, Short Term, Long Term]

Page 31: Digital preservation geoscinfo

Expected Results

This research would identify all the components necessary for the implementation of the OAIS model for a geoscience domain-specific institutional repository

Page 32: Digital preservation geoscinfo


Page 33: Digital preservation geoscinfo

Annexure 1

[Figure: OAIS information model diagram with the labels Preservation Description Information (Provenance, Context, Reference, Fixity), Content Data Object, Representation Information, Physical Object, Digital Object]
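The four Preservation Description Information components named in this annexure can be sketched as a small Python record, with Fixity represented by a checksum. This is a sketch under stated assumptions: the `PDI` class, the `describe` and `verify` helpers, the sample values and the choice of SHA-256 are all illustrative; OAIS does not mandate a particular fixity algorithm.

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class PDI:
    """Preservation Description Information for one content data object."""
    reference: str   # identifier by which the content is known
    provenance: str  # origin and custody history of the content
    context: str     # how the content relates to its environment
    fixity: str      # SHA-256 hex digest (algorithm choice is an assumption)


def describe(content, reference, provenance, context):
    """Build the PDI for some content, computing its fixity value."""
    digest = hashlib.sha256(content).hexdigest()
    return PDI(reference, provenance, context, digest)


def verify(content, pdi):
    """Return True if the content still matches the recorded fixity value."""
    return hashlib.sha256(content).hexdigest() == pdi.fixity


if __name__ == "__main__":
    data = b"example content"
    pdi = describe(data, "obj-0001", "deposited by example-producer",
                   "part of a sample collection")
    print(verify(data, pdi))         # unchanged content
    print(verify(data + b"x", pdi))  # altered content
```

Re-running `verify` at intervals is one simple way an archive could detect the kind of silent media corruption described earlier in the presentation.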

Page 34: Digital preservation geoscinfo

Annexure 2

OAIS Mandatory Responsibilities:

  • Negotiating and accepting information
  • Obtaining sufficient control of the information to ensure long-term preservation
  • Determining the "designated community"
  • Ensuring that information is "independently understandable"
  • Following documented policies and procedures
  • Making the preserved information available

Page 35: Digital preservation geoscinfo

Annexure 3