24
21/05/2012 1 A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat Unit B4

New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

21/05/2012 1

A corporate approach to processing microdata in

Eurostat

Pál JANCSÓK and Christine WIRTZ

Eurostat Unit B4

Page 2: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

2 21/05/2012

Agenda

Introduction

Generic SAS Tool (GSAST) architecture

– Microdata processing

– Architecture

Metadata in GSAST

– Structure

– Management

Possible extension of GSAST to data providers

– Current processing

– Self service

Conclusions

Page 3: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

4 21/05/2012

Introduction

Eurostat is the statistical office of the European

Union and a General Directorate of the

European Commission.

Eurostat is the central institution of the

European Statistical System (ESS) - a network

of National Statistical Institutes (NSI) from all

EU and EFTA Countries.

Eurostat's mission: to be the leading provider of

high quality statistics on Europe.

Page 4: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

5 21/05/2012

Introduction

Streamline the statistical production:

Internal efforts External efforts

Harmonisation, and

consolidation.

Modern, generic tools

for

data processing.

Internal Generic

SAS Tool

Cooperation with data

providers,

sharing infrastructure.

Remote access for

data quality.

Shared Generic

SAS Tool

Page 5: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

6 21/05/2012

Agenda

Introduction

Generic SAS Tool (GSAST) architecture

– Microdata processing

– Architecture

Metadata in GSAST

– Structure

– Management

Possible extension of GSAST to data providers

– Current processing

– Self service

Conclusions

Page 6: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

7 21/05/2012

Microdata Processing

Stage 1: Regular production chain

Receipt of data Country

processing Aggregation

Member States

Disseminate

EDAMIS

Page 7: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

8 21/05/2012

Microdata Processing

Stage 1: Regular production chain

Receipt of data Country

processing Aggregation Disseminate

Error and quality report

Data

revision,

justification

Data quality

checks

Revised data

Page 8: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

9 21/05/2012

Microdata Processing

Stage 1: Regular production chain

Receipt of data Country

processing Aggregation Disseminate

Calculation of European

statistics;

Disclosure control, etc.

Page 9: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

10 21/05/2012

Microdata Processing

Stage 1: Regular production chain

Receipt of data Country

processing Aggregation Dissemination

Data Explorer

www.ec.europa.eu/eurostat

Anonymised microdata for researchers

Page 10: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

11 21/05/2012

Microdata Processing

Stage 1: Regular production chain

Receipt of data Country

processing Aggregation Disseminate

Stage 2: Additional ad-hoc processing

further analyse the data

provide statistics to answer external requests

additional publications

Page 11: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

12 21/05/2012

GSAST, the corporate approach

Requirements:

– Use state-of-the-art technologies;

– Apply a centralised approach;

– Modular design including generic and reusable

modules for all data collections;

– Standard user interface;

– Possibility to perform additional computations on

the data (Stage 2);

– Easy to use and easy to learn by statisticians;

– Maintenance of the tool should be easy and

could preferably be performed by the

statisticians.

Page 12: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

13 21/05/2012

GSAST, the corporate approach

Implementation:

– SAS Business Intelligence platform: Server –

Client approach;

– Common server-based tool with central

development and support services;

– SAS Enterprise Guide (EG) as user interface

with stored processes;

– Full computational power of SAS is available for

additional processing;

– Easy to learn common user interfaces;

– Use of metadata to parameterise the production

processes.

Page 13: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

14 21/05/2012

GSAST Workflow example

Continuous Vocational Training Survey

Page 14: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

15 21/05/2012

GSAST in Use

GSAST is in use for 9 different surveys.

In SAS EG the user initiates the operation one by one

– Parameters are provided by users.

Feedback is provided about the operations.

Page 15: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

16 21/05/2012

Agenda

Introduction

Generic SAS Tool (GSAST) architecture

– Microdata processing

– Architecture

Metadata in GSAST

– Structure

– Management

Possible extension of GSAST to data providers

– Current processing

– Self service

Conclusions

Page 16: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

17 21/05/2012

Metadata in GSAST: Structure

Technical metadata

– Defines the computational environment

– SAS BI architecture

Process definition metadata

– Input parameters

– Process parameters

Structural metadata

– Datasets

– Variables

Statistical metadata

– Flags

– Footnotes

– Code lists

Page 17: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

18 21/05/2012

Metadata in GSAST: Management

Maintenance of the surveys mainly

by means of metadata updates

Complex metadata structure

– Integrity constraints

– Difficult to maintain manually

GSAST

Metadata Editor

Page 18: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

19 21/05/2012

Metadata in GSAST: Management

GSAST Metadata Editor the maintenance tool

– Integrated to the existing GSAST as an EG

add-in;

– User-friendly way to browse complex

metadata structures;

– Parametric and metadata driven;

– Metadata editing and validation to maintain

integrity;

– Centralised management of code lists by the

SDMX registry.

Page 19: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

20 21/05/2012

Agenda

Introduction

Generic SAS Tool (GSAST) architecture

– Microdata processing

– Architecture

Metadata in GSAST

– Structure

– Management

Possible extension of GSAST to data providers

– Current processing

– Self service

Conclusions

Page 20: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

21 21/05/2012

GSAST extension for data providers

– Well-established workflows for microdata collections

at NSIs.

– Data Quality checks at Eurostat at different levels.

– There is an overlap between the processing

workflows at the NSIs and at Eurostat.

– Provide remote access to NSIs for the Country

processing in the GSAST chain.

– Self-Service approach.

Page 21: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

22 21/05/2012

GSAST: Current processing

EDAMIS

1. Data Upload

2. Country Processing

6. Multiple Countries Processing

3. Feedback on the data

4. Data Upload

5. Country Processing

GSAST

Country Resposibility Eurostat Responsibility

Country Loop

Page 22: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

23 21/05/2012

GSAST: Proposed workflow

1. Data Upload

2. Country Processing

6. Multiple Countries Processing

Country Resposibility

Eurostat Responsibility

3. Feedback on the data

4. Data Upload

5. Country Processing

Country Loop

GSAST 1: Web Based Interface GSAST 2

Page 23: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

24 21/05/2012

Agenda

Introduction

Generic SAS Tool (GSAST) architecture

– Microdata processing

– Architecture

Metadata in GSAST

– Structure

– Management

Possible extension of GSAST to data providers

– Current processing

– Self service

Conclusions

Page 24: New A corporate approach to processing microdata in Eurostat · 2012. 6. 14. · A corporate approach to processing microdata in Eurostat Pál JANCSÓK and Christine WIRTZ Eurostat

25 21/05/2012

Conclusions:

– The GSAST platform is proven to be useful to process of

several microdata collections.

– GSAST Metadata Editor solved the problem of the

relatively heavy metadata for survey maintenance.

– Possible extension towards NSIs presents several

challenges.

– While the proposed architecture ensures secure treatment

of microdata, the adaptation of working methods presents

a major challenge which is, however, fully in line with the

joint strategy of the ESS.