Long-term Preservation of Digital Scholarly Literature
Craig Van Dyck
NISO-NFAIS Virtual Conference: Making Certain Digital Content is Preserved
7 December 2016

Why Preservation Matters

• End users
• Libraries
• Publishers
• Grant funders
• Research institutes

Stakeholders

• Scholars rely on permanent access to digital materials
• The scholarly literature is long-lived
• Libraries as the stewards of preservation
• Libraries may not own copies of the digital literature
• Publisher-provided access can be unstable

Stakeholders, cont’d

• Funders want the output from their funding to remain available
• Research institutes need their faculty to have access to materials, and need to be sure that their faculty’s output will be accessible

How CLOCKSS Works

• Introduction
• Technology; LOCKSS
• Processes
• Governance
• Statistics
• Triggers
• Challenges
• Priorities

CLOCKSS: Controlled LOCKSS (Lots of Copies Keep Stuff Safe)

• Began operations in 2006
• Ensuring long-term access to scholarly literature for researchers
• A diverse, robust ecosystem of digital preservation solutions
• CLOCKSS preserves and archives on behalf of libraries
• Libraries have insisted that publishers archive their content

CLOCKSS -- Technology

• CLOCKSS uses the open source LOCKSS technology, with 12 library server nodes:
- NA: Indiana, OCLC, Rice, Stanford, Virginia, Alberta
- Europe: Edinburgh, Humboldt/Germany, Universita Cattolica/Italy
- APac: Hong Kong U, NII/Japan, Australia National U

CLOCKSS – Technology, cont’d

• CLOCKSS is certified as a Trusted Digital Repository by the Center for Research Libraries
• TRAC audit: perfect score for technology; see David Rosenthal's blog: http://blog.dshr.org/2014/07/trac-certification-of-clockss-archive.html

A word about LOCKSS

• From the Stanford University Library
• Unique technology solution: multiple servers constantly cross-checking each other, ensuring the preserved data is valid
• Many instances:
- Global LOCKSS Network: 150 nodes, each with their own collection; post-cancellation access
- 14 Private LOCKSS Networks, e.g. CLOCKSS, Public Knowledge Project, Canadian Government Information, CARINIANA (Brazil), ADPN, US government documents
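
The cross-checking idea on this slide can be illustrated with a small sketch. This is not the actual LOCKSS polling protocol, which is considerably more elaborate; it is a simplified majority-vote audit under stated assumptions. The function names (`digest`, `audit_and_repair`), the node names, and the choice of SHA-256 are all hypothetical and chosen only for illustration.

```python
import hashlib
from collections import Counter
from typing import Dict, List


def digest(content: bytes) -> str:
    """Return the SHA-256 hex digest of one node's copy (hash choice is an assumption)."""
    return hashlib.sha256(content).hexdigest()


def audit_and_repair(copies: Dict[str, bytes]) -> List[str]:
    """Compare all nodes' copies and overwrite any copy that disagrees with the majority.

    `copies` maps a (hypothetical) node name to the bytes that node holds.
    Returns the names of the nodes that were repaired, in dictionary order.
    Raises RuntimeError when no digest is held by a strict majority of nodes.
    """
    digests = {node: digest(data) for node, data in copies.items()}
    winner, votes = Counter(digests.values()).most_common(1)[0]
    if votes * 2 <= len(copies):
        raise RuntimeError("no strict majority; cannot decide which copy is valid")

    # Any node holding the winning digest can serve as the repair source.
    good_node = next(node for node, d in digests.items() if d == winner)

    repaired = []
    for node, d in digests.items():
        if d != winner:
            copies[node] = copies[good_node]
            repaired.append(node)
    return repaired


if __name__ == "__main__":
    holdings = {
        "node_a": b"article text",
        "node_b": b"article text",
        "node_c": b"article t\x00xt",  # simulated bit rot on one node
    }
    print(audit_and_repair(holdings))  # ['node_c']
```

The design point the sketch tries to capture is that no single node is treated as authoritative: validity is decided by agreement among independent copies, which is why more nodes make silent corruption easier to detect and repair.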

CLOCKSS -- Processes

• Content submission via file transfer or web harvest:
- https://www.clockss.org/clocksswiki/files/File_Transfer_Guidelines_-_CLOCKSS.pdf
- https://www.clockss.org/clocksswiki/files/Web_Harvest_Guidelines_-_CLOCKSS.pdf
• Web harvest is particularly useful with “long tail” publishers

CLOCKSS -- Governance

• CLOCKSS is a “dark” archive
• Triggered content is made available as open access
• What does “trigger” mean?
- When digital content ceases to be available to end users
- Access must be ensured, to support scholarship

CLOCKSS – Governance, cont’d

• Community governance: equal number of libraries and publishers on the Board of Directors
• Funded by publisher fees and voluntary library contributions
• Free-standing 501(c)(3) non-profit
• Financially stable

CLOCKSS -- Statistics

• 200 publisher participants, 750 library supporters
• 15 million journal articles and books, adding ~4 million/year
• 5 largest publishers = 70% of the content
• “long tail” publishers = 65% of the publishers

CLOCKSS – Triggering Content for Access

• Rigorous rules and practices
• Bylaws require 75% Board vote in favor, with no more than 2 voting against a trigger
• 29 triggered journals; 1 million downloads this year
• Triggered journals are open access via CLOCKSS, at Stanford and Edinburgh
• Creative Commons Attribution-Noncommercial-No Derivative Works License
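
The voting rule on this slide combines two conditions, and a short sketch may make their interaction easier to see. The slide does not say whether the 75% is measured over votes cast or over all Board seats, or how abstentions are handled; the sketch assumes votes cast, "at least 75%", and abstentions ignored. The function name `trigger_approved` is hypothetical.

```python
def trigger_approved(votes_for: int, votes_against: int) -> bool:
    """Check the two-part trigger rule from the slide.

    Assumptions (not confirmed by the slide): the 75% threshold is applied
    to votes cast, it is inclusive, and abstentions are simply not counted.
    """
    if votes_for < 0 or votes_against < 0:
        raise ValueError("vote counts must be non-negative")
    total = votes_for + votes_against
    if total == 0:
        return False  # no votes cast, nothing to approve

    # Integer arithmetic avoids floating-point rounding at the 75% boundary.
    supermajority = votes_for * 4 >= total * 3
    few_objections = votes_against <= 2
    return supermajority and few_objections


if __name__ == "__main__":
    print(trigger_approved(10, 2))  # True
    print(trigger_approved(9, 3))   # False
```

In the second call the vote is exactly 75% in favor, yet the trigger fails because three members voted against: on a larger board the cap on dissenting votes can be the stricter of the two conditions.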

Challenges: Two Asks

1. Preserving the “Long Tail”:
- Long tail journals are the most at-risk, and the hardest to find and work with
- We need libraries’ priorities for what to archive that is not yet archived

2. Financial support for a diverse and robust digital preservation environment

CLOCKSS – 2017 Priorities

• Investing in hardware and software: capacity, timeliness
• Adding more large backfiles
• New content types, e.g. datasets, video, databases
• Stronger transparency
• Increased outreach

CLOCKSS, concluded

CLOCKSS holdings are publicly reported in the Keepers Registry: https://thekeepers.org/

https://[email protected]
