ArchivesSpace-Archivematica-DSpace Workflow Integration

Preview:

Citation preview

Max Eckard (@max_eckard)Digital Preservation 2016

#digipres2016 #r1c.

ArchivesSpace-Archivematica-DSpace

WORKFLOW INTEGRATION

Ye Olde Days

1997-2009

● Highly manual procedures for born-digital content

● Very limited resources

2010-2011

● MeMail project (email preservation grant)● Additional staff and storage infrastructure● Developed more robust workflows (still

manual)

2011-2014 (and ongoing)

● Automation of key steps in workflow: AutoPro!

● Standardization of metadata creation/collection

Available Community Solutions in 2013 (and today!)

● Archival management system● Creates accession records, tracks locations, generates EAD

● Ingest tool● Produces AIPs, extensive technical and preservation metadata.

● Repository for preservation and access● Provides persistent URLs, secure/managed storage, access controls

GoalsFacilitate creation/reuse of metadata

Streamline the ingest and deposit of content in repository

Find solutions that meet Bentley needs but are flexible and scalable for others

● Modular so that institutions may adopt some, none or all

● Employ open standards so that other repository platforms could be used

Share code and documentation with archives and digital preservation communities

Key Development

Tasks*

● Appraisal Tab● ArchivesSpace Integration● DSpace Integration

*thank you, thank you, !

Appraisal Tab

Search the Backlog● Similar to searching in the

Ingest tab of current version● Among a number of new

features for managing a backlog

Characterize Content (File Formats)● Entire transfer, folder within

a transfer, or individual files● Toggle between report and

visualization ● See format information as

table or pie chart

Characterize Content (File Formats)

Examine Individual Files● Apply facets● Format facet populates File

List with files of that format● Browse and preview content

○ If browser has a viewer, it will appear

○ All files can be downloaded for viewing

Identify Sensitive Data● Examine Contents tab

displays bulk_extractor logs● Personably Identifiable

Information ● Credit Card numbers

Tag Content● Backlog, Analysis or File List

pane● Use cases

○ Tag for arrangement in a specific series or file

○ Tag for sensitive or restricted content

○ Tags as a simple aide-memoire--it’s like a virtual Post-it note!

ArchivesSpace Integration

Search/Browse ArchivesSpace Resources● ArchivesSpace configuration

set in Administration● Search by title or identifier● Browse relationships

Create/Update/Delete Archival Objects● Create/Update Archival

Objects with minimal metadata

○ Title○ Level○ General note○ Conditions governing

access note○ Start date, end date, date

expression● Delete Archival Objects● Written immediately to

ArchivesSpace via API

Create/Update/Delete Archival Objects

Associate Digital Objects with Archival Description● Drag and drop functionality● Folders or files● Once associated, digital

objects are struck through

Add PREMIS Rights Statements● Create Basis and Acts● Will be using to set access

profile in repository● Working with developers of

ArchivesSpace to expand Rights module

Add PREMIS Rights Statements

DSpace Integration

Deposit to DSpace● Tell Archivematica which

DSpace and which collection● AIP Repackaging

○ metadata.7z○ objects.7z

● Deposits to DSpace● Applies access restriction to

metadata● Newly minted handle is

associated with Digital Object in ArchivesSpace

Deposit to DSpace

Systems of RecordAKA Letting Each System Do It’s Thing

● Administrative, descriptive and rights metadata

● Technical and preservation metadata, reconstructing the AIP

● Manage content and enforce access restrictions

Create or Receive Appraisal & Selection Ingest Preservation Action Store Access, Use & Reuse Transform

Archivematica(Transfer)

ArchivesSpace(Accession)

Archivematica(Appraisal)

ArchivesSpace(Resource)

Archivematica(Ingest)

Archivematica (Storage Service)

DSpace(Item)

ArchivesSpace(Digital Object)

viaArchivesSpace REST API,PREMIS Rights Statements (forthcoming)

DCC Curation Lifecycle Map and Dataflow Diagram

viaArchivesSpaceREST API

viaSWORD v2,DSpace API

DLXS(EAD)

viaEncoded Archival Description (EAD) (export and import)

These Days● Wrapped up on October 31, 2016--still implementing locally● Released as part of Archivematica 1.6● Follow along at achival-integration.blogpost.com● “...initial foray that will improve as more institutions employ the Appraisal and

Arrangement tab and adapt it to local needs or integrate new functionality.”○ Treemap visualizations○ Brunnehilde integration○ Named Entity Recognition (NER), Natural Language Processing (NLP), topic modeling

Thanks!Questions?

archival-integration.blogspot.com

@UMBHLCuration

Recommended