26
Collection and Preservation of At-Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head of Digital Library Initiatives NCSU Libraries Library of Congress Brown Bag Discussion Dec. 15, 2005

Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Embed Size (px)

Citation preview

Page 1: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Collection and Preservation of At-Risk Digital Geospatial Data:

North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve MorrisHead of Digital Library InitiativesNCSU Libraries

Library of Congress Brown Bag Discussion Dec. 15, 2005

Page 2: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 2

Project Context

Partnership between university library (NCSU) and state agency (NCCGIA)Focus on state and local geospatial content in North Carolina (state demonstration)Tied to NC OneMap initiative, which provides for seamless access to data, metadata, and inventory informationObjective: engage existing state/federal geospatial data infrastructures in preservation

Page 3: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 3

Targeted Content

Resource TypesGIS “vector” (point/line/polygon) dataDigital orthophotography Digital mapsTabular data (e.g. assessment data)

Content ProducersMostly state, local, regional agenciesSome university, not-for-profit, commercialSelected local federal projects

Page 4: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 4

Geospatial data types: Vector data

Page 5: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 5

Time series – vector dataParcel Boundary Changes 2001-2004, North Raleigh, NC

Page 6: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 6

Geospatial data types: Aerial imagery

Page 7: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 7

Geospatial data types: Aerial imagery

Page 8: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 8

Geospatial data types: Aerial imagery

Page 9: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 9

Time series – Ortho imageryVicinity of Raleigh-Durham International Airport 1993-2002

Page 10: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 10

Geospatial data types: Tabular data (w/vector)

Page 11: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 11

Today’s geospatial data as tomorrow’s cultural heritage

Page 12: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 12

Risks to Digital Geospatial Data

.shp

.mif

.gml

.e00

.dwg

.dgn

.bsb

.bil

.sid

Page 13: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 13

Risks to Digital Geospatial Data

Producer focus on current dataTime-versioned content generally not archives

Future support of data formats in questionVast range of data formats in use--complex

Shift to web services-based accessArchives have been a by-product of providing access

Preservation metadata requirementsDescriptive, administrative, technical, DRM

GeodatabasesComplex functionality

Page 14: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 14

Industry Shift to Web Services

Page 15: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 15

Page 16: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 16

Page 17: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 17

Page 18: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 18

Work plan in a Nutshell

Work from existing data inventories

NC OneMap Data Sharing Agreements as the “blanket”, individual agreements as the “quilt”

Partnership: work with existing geospatial data infrastructures (state and federal)

Technical approachMETS with FGDC, PREMIS?, GeoDRM?

Dspace now; re-ingest to different environment

Web services consumption for archival development

Page 19: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 19

Big Challenges

Format migration paths

Management of data versions over time

Preservation metadata

Harnessing geospatial web services

Preserving cartographic representation

Keeping content repository-agnostic

Preserving geodatabases

More …

Page 20: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 20

Vector Data Format OptionsOption A: use an open format and have a really unfortunate transformation and limited vendor support for the output objectOption B: use closed format but retain the original content and count on short- and medium-term vendor support. Option C: do both to buy time and look for an open, ASCII-based solution. (watch GML activity)

No sweet spot, just an evolving and changing mix offlawed options that are used in combination.

Page 21: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 21

Preservation Metadata Issues

FGDC MetadataMany flavors, incoming metadata needs processing

Cross-walk elements to PREMIS, MODS?

Metadata wrapper/Content packagingMETS (Metadata Encoding and Transmission Standard) vs. other industry solutions

Need a geospatial industry solution for the ‘METS-like problem’

GeoDRM a likely trigger—wrapper to enforce licensing (MPEG 21 references in OGIS Web Services 3)

Page 22: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 22

Metadata Availability

Page 23: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 23

Preserving Cartographic Representation

Page 24: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 24

Interest in how geospatial content interacts with widely available digital repository software

Focus on salient, domain-specific issues

Challenge: remain repository agnosticAvoid “imprinting” on repository software environment

Preservation package should not be the same as the ingest object of the first environment

Tension between exploiting repository software features vs. becoming software dependent

Repository Architecture Issues

Page 25: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 25

Project Status

Completing inventory analysis stage

Storage system and backup deployed

DSpace deployed to production

Metadata workflow finalized

Ingest workflow near finalization

Content migration workflow near finalization

Regional site visits planned for coming months

Wide range of outreach/collaboration: FGDC, ESRI, EDINA (JISC), USGS, OGC, TRB, etc.

Pilot project, georegistering digital archival geologic maps

Page 26: Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head

Note: Percentages based on the actual number of respondents to each question 26

Questions?

Contact:

Steve MorrisHead, Digital Library InitiativesNCSU [email protected]