Do Something Now: Why Perfect is the Enemy of Good (Enough) in Digital Preservation

Preview:

Citation preview

Do Something Now:Why Perfect is the Enemy of Good (Enough) in Digital Preservation

Starting Blocks at Vacant Starting Line Before Event, by tableatny: https://www.flickr.com/photos/53370644@N06/4976494944

Dan Gillean, MAS, MLISActing on Change

ConferenceLondon – November 30,

2016

Restating the Problem:

We have many tools, services, standards, models, and metrics designed to support digital preservation

and access!

Happy cat, by panli54 - https://www.flickr.com/photos/53911972@N03/4988877591

…And yet, many institutions and

organizations feel they do not have the capacity

or capability to begin seriously addressing digital preservation.

Restating the Problem:

Cat-Sad-Annoyed, by Robert Tortorelli - https://www.flickr.com/photos/39969232@N08/16760847493

2016

1998

2000 2002 2004 2006 2008 2010 2012 2014

Fedora

METS OAIS

PRONOMDSpace

DROID

LOCKSSJHOVE

Archive-It

PREMISPAIMAS

TRAC

AtoM

Archivist’s Toolkit

Archon

Hydra

Islandora

ArchivematicaBitCurator

Blacklight

ArchivesSpaceBagIt PCDM

DuraCloud ArchivesDirect

Perpetua

ACDPS

DPN

RosettaDataverse

PBCoreiRODS

OwnCloud

RODAPreservica ePADD

Exactly

veraPDF

Siegfried

Brunnhilde

QCTools

CollectiveAccess

EAD

Standards, Tools, and Services: A (highly selective)

timeline

WARCBase

Webrecorder

Avalon

MediaConch

COPTRCommunity Owned digital Preservation Tool

Registry

http://coptr.digipres.org/Category:Tools

What’s informing the capacity gap?

Colin Park, “View into chasm known as Huntsman Leap.” http://www.geograph.org.uk/photo/2154831

https://www.tenor.co/view/gif-3528480

Randen Pederson, “Not Sure.” https://www.flickr.com/photos/chefranden/300357762

The Deer in

Headlights Phenomena

https://kidhotsauce12.wordpress.com/2014/02/27/sheep-in-wolfs-clothing/

Imposter Syndrome

2016

1998

2000 2002 2004 2006 2008 2010 2012 2014

Fedora

METS OAIS

PRONOMDSpace

DROID

LOCKSSJHOVE

Archive-It

PREMISPAIMAS

TRAC

AtoM

Archivist’s Toolkit

Archon

Hydra

Islandora

ArchivematicaBitCurator

Blacklight

ArchivesSpaceBagIt PCDM

DuraCloud ArchivesDirect

Perpetua

ACDPS

DPN

RosettaDataverse

PBCoreiRODS

OwnCloud

RODAPreservica ePADD

Exactly

veraPDF

Siegfried

Brunnhilde

QCTools

CollectiveAccess

EAD

Standards, Tools, and Services: A (highly selective)

timeline

WARCBase

Webrecorder

Avalon

MediaConch

COPTRCommunity Owned digital Preservation Tool

Registry

http://coptr.digipres.org/Category:Tools

COPTR

http://coptr.digipres.org/Category:File_Format_Identification

https://en.wikipedia.org/wiki/Monolith_(Space_Odyssey)#/media/File:African_monolith_2001.jpg

The Black Box

http://it.harrypotter.wikia.com/wiki/Incantesimo_di_Disarmo

The Magic Wand

http://knowyourmeme.com/memes/cat-transcendence

Big Picture Paralysis

Alexey Kljatov, “Snowflake.” https://www.flickr.com/photos/chaoticmind75/10823897423/

The Special Snowflake Effect

https://commons.wikimedia.org/wiki/File:Toothbrush_x3_20050716_001.jpg

…aka the Toothbrush Principle

*with thanks to Cassie Findley

The 927 Problemhttps://xkcd.com/927/

Lachlan Donald, “Sharpest Tool in the Shed.” https://www.flickr.com/photos/lox/9408028555/

The Tools Fetishist

What next?http://www.desk7.net/wallpapers.aspx?typeid=8589

Know what you have

https://commons.wikimedia.org/wiki/File:Magnifying_glass_-_Faberge.jpg

Embrace Openness

Open SourceOpen StandardsOpen Formats

Open documentation

https://pixabay.com/en/key-keychain-close-up-123554/

Start an internal audit

Bryan Mason, “Monthly Check up.” https://www.flickr.com/photos/b-may/361018310

• Governance• Organizational structure• Staffing• Procedural accountability• Preservation policy framework• Documentation• Financial sustainability• Security

ISO 16363 Reminds us that much of digital

preservation readiness is not technical – it’s organizational

Level 1 (Protect) Level 2 (Know) Level 3 (Monitor) Level 4 (Repair)

Storage and Geographic

Location

• 2complete copies not collocated

• Get media off diverse storage media and into a system

• At least 3 complete copies• At least 1 in different

geographic location• Document storage system,

media, and what’s needed to use them

• At least 1 copy in location w different disaster threat

• Obsolescence monitoring process for storage system and media

• At least 3 copies in locations w different disaster threats

• Comprehensive plan to keep files and metadata on currently accessible media or systems

File Fixity and Data Integrity

• Fixity check on ingest if checksum provided w content

• Create fixity info if not provided on transfer

• Check fixity on all ingests• Use write-blockers w original

media• Virus check high-risk content

• Fixity checks at regular intervals

• Maintain fixity logs and supply audit on demand

• Virus check all content• Ability to detect corrupt

data

• Check fixity in response to specific events/activities

• Ability to replace/repair corrupted data

• Ensure no one has write access to all copies

Information Security

• Identify who has read, write, move, and delete authorizations

• Restrict who has those authorizations to individual files

• Document access restrictions for content

• Maintain logs of who performed what actions on files, incl. deletions and preservation actions

• Perform audit of logs

Metadata• Inventory of content and its

storage locations• Ensure backup and non-

collocation of inventory

• Store admin metadata• Store transformative

metadata and log events• Store standard technical

and descriptive metadata• Store standard preservation

metadata

File Formats• Encourage creators to use open

formats and codecs when possible

• Inventory of file formats in use • Monitor file format obsolescence issues

• Perform format migrations, emulation, etc. as needed

NDSA Levels of Preservation

Adapted from: http://ndsa.org/activities/levels-of-digital-preservation/

NDSA Levels of Preservation – Categories Quantity of NDSA Levels of Preservation Criteria

Quantity of related ISO 16363 Criteria

Storage and Geographic Location 9 34File Fixity and Data Integrity 12 29

Information Security 5 22Metadata 6 50

File Formats 4 32(Unmappable from ISO 16363) - 23

Blog post: https://www.avpreserve.com/papers-and-presentations/mapping-standards-for-richer-assessments-ndsa-levels-of-digital-preservation-and-iso-163632012/

Mappings: https://www.avpreserve.com/wp-content/uploads/2016/05/ISO-Requirements-by-NDSA-LoDP-Categories.xlsx

Slides: http://www.avpreserve.com/wp-content/uploads/2014/07/NDSA_ISO_Presentation_2014.pdf

AVPreserve – 16363/NDSA mappings

Drupal TRAC Review tool

https://wiki.archivematica.org/Internal_audit_tool

Drupal TRAC Review tool

https://wiki.archivematica.org/Internal_audit_tool

Drupal TRAC Review tool

https://wiki.archivematica.org/Internal_audit_tool

Drupal TRAC Review tool

https://wiki.archivematica.org/Internal_audit_tool

Drupal TRAC Review tool

https://wiki.archivematica.org/Internal_audit_tool

Digital Preservation Capability Maturity Model

http://www.securelyrooted.com/dpcmm

Pick a(ny) tool and play with

itBiser Todorov, “Tools.” https://commons.wikimedia.org/wiki/File:Rusty_tools.JPG

POWRR Tool Grid on COPTR

http://www.digipres.org/tools/

Participate and

contribute

https://en.wikipedia.org/wiki/File:Sheridan_classroom.jpg

Seek out stakeholders and build your case

Unique Hotels, “Board Room - Vihula Manor Country Club & Spa.” https://www.flickr.com/photos/62485988@N05/5692789910

DPC Digital Preservation Business

Case Toolkit

http://wiki.dpconline.org/index.php?title=Digital_Preservation_Business_Case_Toolkit

http://wiki.dpconline.org/index.php?title=Additional_resources

DPC Digital Preservation Business

Case Toolkit

Share your successes…

and your failures

https://commons.wikimedia.org/w/index.php?curid=31154812

Do Something Now

Starting Blocks at Vacant Starting Line Before Event, by tableatny: https://www.flickr.com/photos/53370644@N06/4976494944

info@artefactual.com

Recommended