31
www.apsr.edu.au Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader, Australian Partnership for Sustainable Repositories (APSR) Infrastructure and Services for Digital Collections: Automated Obsolescence Notification

Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

DigCCurr 2007Chapel Hill, April 2007

Adrian BurtonLeader,

Australian Partnership for Sustainable Repositories (APSR)

Infrastructure and Services for Digital Collections:

Automated Obsolescence Notification

Page 2: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

An Innovative Action Plan for the Future

The Systemic Infrastructure Initiative

The Australian National UniversityThe University of Sydney

The University of QueenslandThe National Library of Australia

The Australian Partnership for Advanced Computing

Supported by:

APSR Partners

Page 3: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

ObsolescenceNotification

Page 4: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Automated Obsolescence Notification System (AONS)

Aim: To make collection managers aware of files in their collections that require preservation action

Page 5: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

AONSAONS is a loosely-coupled group of services to identify, notify, and evaluate the risk of file format obsolescence.It is particularly designed for large repositories that are:

• Long-lived

• Heterogeneous

• Where creators are not responsible for stewardship

• Without options for preservation on ingest

Infrastructure& Services

Page 6: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Page 7: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Page 8: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Page 9: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Page 10: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Page 11: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Page 12: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Page 13: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Project Background

2005: PANIC architecture*2006: AONS 1, proof of concept system2007: AONS 2, enterprise software, risk analysis, and pilot service

*J.Hunter, S.Choudhury, “PANIC – An Integrated Approach to the Preservation of Complex Digital Objects using Semantic Web Services”, International Journal on Digital Libraries: Special Issue on Complex Digital Objects, January 2006

Page 14: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Page 15: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Format Information

PRONOMLOC – LCDFW (scraper)GDFR (Alpha 1)DCC RIF? NGDA?......

ISSUES

Standard ways to represent format informationUnique ID’s for formatsMetric for measuring obsolescenceOngoing relationships with information providers

Page 16: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Repository Information

Managers self-registerRepositories expose reportsAONS “gets” the reports

ISSUES

authorisation/ authentication for registrationscope of the servicescaling to meet the needs of (eg) Internet Archivethird party repository information aggregators

Page 17: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Standard XML Format Report

<collection><format name=”name1” version=”v1” puid=”fmt1” count=”200”/><format name=”name2” version=”v2” puid=”fmt2” count=”4000”/><format name=”name3” version=”v3” puid=”fmt3” count=”10”/><format name=”name4” version=”v4” puid=”fmt4” count=”300”/>

</collection>

Page 18: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Notification

Email/ RSS….

ISSUES

frequencygranularity of detail (perhaps just a link)

Page 19: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Risk Analysis

INFORM/ Deb WoodyardGeneral vs individual risk Stateful web interface (My AONS)

ISSUES

what digital curators need to know to take actiontesting the metricscommunity risk may be supplied upstreamother applications of the module

Page 20: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

Very High Risk of Obsolescence

High Risk of Obsolescence

Medium Risk of Obsolescence

Low Risk of Obsolescence

Version support end date known?

Y

N

Years to end date?

Version release date

known?Y

Years since release date?

N

New version(s) available?

YNumber of

later versions

Number of view paths available?

0-1

2-3

3-5

5+

<= 0

1-2

3-5

5+

7+

5-7

3-5

< 2

2+

2

1

N

N / Unk

Determine number of view paths available to render format (available support)

Determine number of new versions of format available

Determine likely creator / vendor support period 0

-2

-4

-6

0

-2

-4

0

-2

-4

-6

ScoreAdd the score to the starting risk of 10

+10

-2

Result

8 - 10

6 - 7

3 - 5

<3

Page 21: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Page 22: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

AONS Design

Loosely coupled SOAtarget independentModularRobust Java enterprise software

Page 23: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

http://www.itee.uq.edu.au/~eresearch/projects/panic/index.html

AONS

?

PANIC

Page 24: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Other Applications

metadata schema obsolescenceQA notification….

Page 25: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Page 26: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Page 27: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Layers 1

Page 28: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Layers 2

Page 29: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Layers 3

Page 30: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Layers 4

Page 31: Infrastructure and Services for Digital Collections · 2007. 5. 17. · Australian Partnership for Sustainable Repositories DigCCurr 2007 Chapel Hill, April 2007 Adrian Burton Leader,

www.apsr.edu.auAustralian Partnership for Sustainable Repositories

Layers 5