29
Constance 1 Archives de France Centre des Archives Contemporaines CONSTANCE : Twenty years of data bases archiving ERPANET WORKSHOP Bern April 09-11 2003

ERPANET WORKSHOP Bern April 09-11 2003

  • Upload
    odin

  • View
    37

  • Download
    0

Embed Size (px)

DESCRIPTION

Archives de France Centre des Archives Contemporaines CONSTANCE : Twenty years of data bases archiving. ERPANET WORKSHOP Bern April 09-11 2003. Summary. 1- Institutional, structural and legal frames 2- General strategy and policy 3- Processes from appraisal to ingestion - PowerPoint PPT Presentation

Citation preview

Page 1: ERPANET WORKSHOP Bern April  09-11  2003

Constance 1

Archives de FranceCentre des Archives

Contemporaines

CONSTANCE :Twenty years

of data bases archiving

ERPANET WORKSHOPBern April 09-11 2003

Page 2: ERPANET WORKSHOP Bern April  09-11  2003

Constance 2

Summary

1- Institutional, structural and legal frames

2- General strategy and policy

3- Processes from appraisal to ingestion

4- Data base documentation and metadata

5- Access and re-use : several policies

6- Conclusions

Page 3: ERPANET WORKSHOP Bern April  09-11  2003

Constance 3

Preliminary question :

(for inquisitive People)

What is CONSTANCE ?

Page 4: ERPANET WORKSHOP Bern April  09-11  2003

Constance 4

CONSTANCE

• CONServation et Traitements des Archives Nouvelles Constituées par l’Electronique

• Preservation and Treatments of New Archives issued from computer processing (or IT)

• This acronym was formulated in 1979 by an Archivist.

Page 5: ERPANET WORKSHOP Bern April  09-11  2003

Constance 5

1- CONSTANCE :

Institutional, structural and legal frames

Page 6: ERPANET WORKSHOP Bern April  09-11  2003

Constance 6

Archives and French InstitutionsArchives and French InstitutionsState-

government

CentralAdministration

DepartmentalAdministration

DepartmentalInstitutions

Local institutions

D A F

A N

A D

A M

Page 7: ERPANET WORKSHOP Bern April  09-11  2003

Constance 7

A rchiv es départem enta les

A rch iv es m unic ipa les

O ther loca l A rch iv es

D irection des A rch ives de F rance

C H A N C A O M

C lass ic a l A rch ives CONSTANCE

A rchiv es contem p ora ines C A M T

Arch ives N ationa les

M in is try o f C u lture and C om m unication

Page 8: ERPANET WORKSHOP Bern April  09-11  2003

Constance 8

CONTEMPORARY ARCHIVES : the Network

C A C

Ministry A

ARCHIVIST

Ministry B

ARCHIVIST Ministry C

Ministry X

ARCHIVIST

ARCHIVIST

Institution Y

ARCHIVIST

A1 A2A3

C1 C2

Page 9: ERPANET WORKSHOP Bern April  09-11  2003

Constance 9

Legal frame : accessibility

• The 1979 law on the archives : all documents produced …. whatever their form and support…are an

archival material

• communication regulations :- all documents : 30 years without access, except on a derogatory basis

- all statistical nominative files : 100 years without derogation

• Public access : free for every citizen

Page 10: ERPANET WORKSHOP Bern April  09-11  2003

Constance 10

Legal frame : citizens protection

Since 1978 : CNIL (Commission Nationale de l’Informatique et des Libertés)

• Agency for the respect of the 1978 law on IT and Liberties

• Every nominative file must be declared (private or public)

• Any IT project involving nominative data must be submitted to the agency’s opinion

• When needed, the French National Archives should be mentioned as final recipient of the data as agreed by CNIL.

Page 11: ERPANET WORKSHOP Bern April  09-11  2003

Constance 11

2- CONSTANCE :

GENERAL STRATEGY AND POLICY

Page 12: ERPANET WORKSHOP Bern April  09-11  2003

Constance 12

A big Question :

• Why preserve on the very long term so many data bases at such a cost ?

• Because you cannot envision the 20th century society memory without taking into account the prominent place of the management, statistic and scientific data bases, and their intensive and unavoidable use.

Page 13: ERPANET WORKSHOP Bern April  09-11  2003

Constance

Constance : the chronological steps

• 1978-1983 :the birth• 1983-1986 :the first steps• 1986-1993 :a too fast growth• 1993-1995 : the transition• 1995-1997 :moulting and migration• 1998-2000 :a new vision• 2001-2003 :new fields to investigate

Page 14: ERPANET WORKSHOP Bern April  09-11  2003

Constance 14

Main options taken

• Data bases of historical value, at the national level are to be preserved indefinitely,

• The CAC is in charge of the Constance programme,• The Constance team will be the operational core,• A computer centre was set up in 1983 in the CAC,• The team is composed of archivists and IT

specialists,• The team can deal directly with the producers and

users ( end-users or data base managers) when needed

Page 15: ERPANET WORKSHOP Bern April  09-11  2003

Constance 15

Technical options • Data are extracted from data bases by the

producer in flat files• No software components or elements are

preserved• Data format is the Ascii character mode

(one Ascii character in a byte)• Metadata must be delivered with data files• Data base producers or managers comply

to the National Archives technical recommendations

Page 16: ERPANET WORKSHOP Bern April  09-11  2003

Constance 16

1-Management of technological

resources

2- Digital objects

Conservation management

3- Ingestion and integration

management

4-Help and expertise for

Appraisal and collect

5- Technological watch and advice

Awareness and education

CONSTANCE : Essential functionalities

Page 17: ERPANET WORKSHOP Bern April  09-11  2003

Constance 17

3- CONSTANCE :

From appraisal to ingestion

Page 18: ERPANET WORKSHOP Bern April  09-11  2003

Constance 18

Process from appraisal to ingestion• Archivists detect a data base • Appraise it• Ask IT staff for extraction of data in flat file(s)• Ask the relevant persons for metadata and

documents Valid the components before sending the bundle to the CAC

• Constance team ingests the documentation and the data files

• Archival updates in various finding aids are made

• Then the follow up cycle for preservation of accessibility and integrity begins.

Page 19: ERPANET WORKSHOP Bern April  09-11  2003

Constance 19

The main steps (1)• Archivist detects a data base of historical value in

a governmental organisation• A presentation of the functional, legal, technical

aspects is made by the owners and managers of the data base

• An informal protocol of acquisition is agreed between the two parties

• The data base managers extract the relevant data, following the National Archives technical specifications

• The data base managers gather the technical sets of metadata concerning each file provided for long term preservation

Page 20: ERPANET WORKSHOP Bern April  09-11  2003

Constance 20

The main steps (2)• The data base managers and owners gather and

articulate the general documents related to this data base : organisation, design, purposes and use.

• The archivist in charge appraises the documents and the metadata

• An output of a significant data set is produced and checked against the documentation

• An electronic notice is sent to the CAC for archival management purpose

• File (s), documentation and metadata are sent to the Constance team

• A final and more technical check is made before proceeding with file storing.

Page 21: ERPANET WORKSHOP Bern April  09-11  2003

Constance 21

Ingestion tasks• check and upgrade the documentation and

metadata• control the file (s)• scan different parts of documentation and

store them in TIF format in an ERMS • if in electronic form, store them also in an

ERMS

• copy the data files onto the conservation media (DLT tapes for the time being), twice for security.

• Create and store the first set of metadata related to the file(s) integrity and traceability on the long term

Page 22: ERPANET WORKSHOP Bern April  09-11  2003

Constance 22

4- CONSTANCE :

Data base Documentation and

Metadata

Page 23: ERPANET WORKSHOP Bern April  09-11  2003

Constance 23

The different sets of documents• A set of 16 technical metadata for each file

• The lay out (or design) of the file record (s)• The data and codes dictionary, in alphabetical order• An output of some records from the file to be

transferred • Sets of input screens or forms, notices and results• Bibliographical references of publications when

relevant• The report concerning the conception of the project• Manuals concerning the uses and technical

management of the data base • The legal documents (or references) attesting the

validity and rights of the procedure involving the data base, its content and uses

Page 24: ERPANET WORKSHOP Bern April  09-11  2003

Constance 24

A file metadata set (1)

• 1- Information system name• 2- File name• 3 Media references and characteristics• 4- Copy date of the file onto the media• 5- Operating system references• 6- Software name and version supporting the

database • 7- Dates (first and last) of the data in the file• 8- Records file format (v, f, string…) • 9- Size in bytes of the records file• 10- Number of records files• 11- Number of objects contained in the file

Page 25: ERPANET WORKSHOP Bern April  09-11  2003

Constance 25

A file metadata set (2)

• 12- number of different variables in the file• 13- total number of bytes • 14- sort keys• 15- names and references of the linked or

related files• 16- date and references of the completion of

this inventory form.

Page 26: ERPANET WORKSHOP Bern April  09-11  2003

Constance 26

5- CONSTANCE :

Data base Access and re-use

Page 27: ERPANET WORKSHOP Bern April  09-11  2003

Constance 27

Some figures :

• 1981 : 1 file• 1983 30 files• 1986 300 files• 1994 5000 files• 2002 6000 files

Page 28: ERPANET WORKSHOP Bern April  09-11  2003

Constance 28

Several possible policies :• Deliver a copy of the file and its relevant

metadata• provide in situ some expertise• provide some tools and computer resources• provide technical resources and experts’

help• provide results to some “clients”on request

But who pays what ?

Page 29: ERPANET WORKSHOP Bern April  09-11  2003

Constance 29

Some links :

[email protected]• http://www.archivesdefrance.gouv.fr• http://www.archivesnationale.gouv.fr• http://www.archivesnationales.gouv.fr/cac/fr/index.html