USE OF ICR TECHNOLOGY FOR THE INDONESIA 2010 POPULATION CENSUS
BPS Statistics Indonesia, New York, February 2011

Page 1: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

BPS Statistics Indonesia
New York, February 2011

Page 2: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Background

• The population census in Indonesia is held every ten years.
• Indonesia has the fourth-largest population in the world and is the largest archipelago.
• History of data processing for the population census:
  o 1971: OMR technology, mainframe
  o 1980: data entry, mainframe
  o 1990: data entry, mainframe, distributed
  o 2000: OCR technology, PC clusters
  o 2010: ICR and mobile technology, PC clusters

Page 3: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Data Processing Centers

• Located in 33 Provincial Statistics Offices.
• Diagram label: VPN

Page 4: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Flow of Documents in the Fields

[Flow diagram; recoverable labels in reading order]
• FIELDS: ENUMERATOR, KORTIM
• Doc Pool: SP2010 L, KBC, RT, ART
• BPS DISTRICT (KEC, DESA): Drop Off; Receiving & Handling; Queuing; Unpack & Checking; Repack; Expedition; Entry SP2010-L
• BPS PROVINCE: Drop Off; Receiving & Handling; Queuing; Unpack; Repack; Expedition/Next Queuing; Entry; Coding

Page 5: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

DPC Personnel

RECEPTION AREA (box sorting in Queuing Room)
- 1) Unload boxes
- 2) Put boxes on the trolley
- 3) Input received data
- 4) Arrange boxes in the Queuing Room

CODING PICKUP OFFICER
- 5) Take box from the Queuing Room
- 6) Register the picked-up box
- 7) Deliver boxes to the Coding Editing Supervisor

CODING EDITING SUPERVISOR
- 8) Distribute boxes to the Coding Editing officers
- 11.2) Check and authorize any page discrepancies
- 13) Update data for boxes that have finished coding editing

CODING EDITING OFFICER
- 9) Open box
- 10) Unbind documents
- 11) Count pages
- 11.1) Report page discrepancies
- 12) Coding Editing

Other diagram labels: Box & Census Block database; Sorting boxes in Scanning Queue; PROVINCE

Page 6: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Flow of Processing Documents

[Flow diagram; recoverable labels in reading order]
• FIELDS
• DROP-OFF SERVICE: Drop Off; Receiving & Handling; Registering
• STAGING: Sorting
• DOC PREP: Unpack & Checking; Cutting
• SCANNING
• REPACKING
• FUMIGATION
• DOCUMENT STORAGE
• DPC: DOCUMENT PREPARATION; CORRECTION & COMPLETION; VALIDATION

Page 7: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Flow of Work in DPC: Scanning & Warehouse

Box Scanning Queue

SCANNING PICKUP OFFICER
- 1) Pick up box from the Scanning Queue
- 2) Register the picked-up box
- 3) Deliver box to the Scanning Supervisor

SCANNING OFFICER
- 5) Register box number
- 6) Scan documents

REPACKING OFFICER
- 7) Repack box
- 8) Register finished repack

STORAGE OFFICER
- 9) Take trolley from Repacking to Doc Storage
- 12) Place box according to the Put-Away

STORAGE ADMIN
- 10) Register box
- 11) Print the storage Put-Away (Cetak Put-Away Penyimpanan)

Other diagram labels: Box & Census Block database; DATA CAPTURE SERVER; PROVINCE

Page 8: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

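Flow of Data in DPC

[Data-flow diagram; recoverable labels in reading order]
• DPC, BPS, BPS Server
• INFORMATION TECHNOLOGY, CAPTURE SYSTEM, SUPPORT APPS
• Staging, Clean Data, Tabulation Data (Data Tabulasi)
• Correction & Completion, Validation (Validasi)
• Validation Data (Data Validasi), Staging Data (Data Staging), Image + data
• RECEPTION SERVICE, DOCUMENT STORAGE
• Box status (Status box), Box location (Lokasi box)
• Scanning, Image + data, RELEASE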

Page 9: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Batching System

• Document batch
  o 1 SP 2010 KBC
  o Consists of n SP 2010 RT
  o Each RT consists of n SP 2010 ART
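The batch hierarchy above can be pictured as nested records. The following is a minimal sketch in Python; the class names, field names, and identifiers (`KbcBatch`, `RtForm`, `ArtForm`, `kbc_id`, and so on) are hypothetical and do not come from the census system.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ArtForm:
    """One SP 2010 ART form (hypothetical fields)."""
    art_id: str


@dataclass
class RtForm:
    """One SP 2010 RT form holding n ART forms."""
    rt_id: str
    art_forms: List[ArtForm] = field(default_factory=list)


@dataclass
class KbcBatch:
    """One document batch: 1 SP 2010 KBC holding n RT forms."""
    kbc_id: str
    rt_forms: List[RtForm] = field(default_factory=list)

    def form_count(self) -> int:
        """Count the KBC itself, its RT forms, and their ART forms."""
        return 1 + sum(1 + len(rt.art_forms) for rt in self.rt_forms)


# Illustrative batch: one KBC, two RT forms, three ART forms in total.
batch = KbcBatch(
    kbc_id="KBC-001",
    rt_forms=[
        RtForm("RT-1", [ArtForm("ART-1"), ArtForm("ART-2")]),
        RtForm("RT-2", [ArtForm("ART-3")]),
    ],
)
print(batch.form_count())
```

Keeping the count on the batch object makes it easy to compare against a page or form total registered before scanning.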

Page 10: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Capture Process

• Fixed Form Approach
• High speed Auto classification & separation
• Accurate High Speed ICR engine
• Accurate High Speed OMR engine
• Consistency check capability
• Inter-page business rule validation
• Multipage business rules validation
• Low false positive & Tuning

Page 11: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Solution Components

Page 12: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Solution Components

• Guillotine
• PCs
• Server
• Scanner
• Software Data Capture
• Training & troubleshooting
• Template Development
• Distribution, installation & implementation in each DPC (33 locations)

Page 13: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Fujitsu Scanner fi-6800

• Scanner speed: 130 ppm at 300 dpi
• Duty cycle: 100,000 pages/day
• Resolution: 600 dpi
• Feeder capacity: 500 pages
• Paper size: up to A3
• Imprinter capability: pre and post
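As a rough reading of these specifications, the rated speed and the duty cycle can be related with simple arithmetic. The sketch below assumes continuous feeding at the rated speed with no pauses for reloading or jams, so it gives an upper bound rather than an observed figure; the variable names are illustrative.

```python
# Figures taken from the slide.
RATED_SPEED_PPM = 130        # pages per minute at 300 dpi
DUTY_CYCLE_PAGES = 100_000   # pages per day
FEEDER_CAPACITY = 500        # pages per feeder load

# Assumption: the scanner runs continuously at its rated speed.
pages_per_hour = RATED_SPEED_PPM * 60
hours_to_duty_cycle = DUTY_CYCLE_PAGES / pages_per_hour
feeder_loads_per_day = DUTY_CYCLE_PAGES // FEEDER_CAPACITY

print(f"Pages per hour at rated speed: {pages_per_hour}")
print(f"Hours of scanning to reach the duty cycle: {hours_to_duty_cycle:.1f}")
print(f"Full feeder loads per duty cycle: {feeder_loads_per_day}")
```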

Page 14: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Guillotine, workstation, scanner

Page 15: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Data Capture Server, Validation Server

Page 16: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Server Console, Server Racks

Page 17: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Scanner Allocation

 #  DPC - BPS Offices   No. of Docs  Scanner Allocation
 1  NAD                   4,350,904  1
 2  Sumatera Utara       12,336,180  2
 3  Sumatera Barat        4,413,328  1
 4  Riau                  5,738,708  1
 5  Jambi                 2,961,468  1
 6  Sumatera Selatan      7,534,780  1
 7  Bengkulu              1,785,968  1
 8  Lampung               7,841,096  1
 9  Kep Bangka Belitung   1,138,932  1
10  Kepulauan Riau        1,612,044  1
11  DKI Jakarta          10,508,444  2
12  Jawa Barat           37,095,756  6
13  Jawa Tengah          30,000,000  5
14  DI Yogyakarta         4,399,192  1
15  Jawa Timur           34,511,536  5
16  Banten               11,114,704  2

Page 18: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Scanner Allocation (continued)

 #  DPC - BPS Offices   No. of Docs  Scanner Allocation
17  Bali                  3,900,360  1
18  Nusa Tenggara Barat   5,485,668  1
19  Nusa Tenggara Timur   3,998,896  1
20  Kalimantan Barat      4,291,276  1
21  Kalimantan Tengah     2,393,316  1
22  Kalimantan Selatan    3,956,552  1
23  Kalimantan Timur      3,513,608  1
24  Sulawesi Utara        2,669,324  1
25  Sulawesi Tengah       2,301,920  1
26  Sulawesi Selatan      7,745,836  2
27  Sulawesi Tenggara     2,158,264  1
28  Gorontalo             1,142,544  1
29  Sulawesi Barat        1,006,684  1
30  Maluku                1,082,632  1
31  Maluku Utara            715,672  1
32  Papua Barat             837,400  1
33  Papua                 2,400,544  1
    Total               254,579,368  51

Page 19: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Server, PC Allocation

 #  DPC - BPS Offices   Server     PC
 1  NAD                      2     30
 2  Sumatera Utara           2     84
 3  Sumatera Barat           2     33
 4  Riau                     2     41
 5  Jambi                    2     22
 6  Sumatera Selatan         2     52
 7  Bengkulu                 2     14
 8  Lampung                  2     54
 9  Kep Bangka Belitung      2     11
10  Kepulauan Riau           2     14
11  DKI Jakarta              2     73
12  Jawa Barat               2    344
13  Jawa Tengah              4    237
14  DI Yogyakarta            4     33
15  Jawa Timur               2    284
16  Banten                   4     77

Page 20: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Server, PC Allocation (continued)

 #  DPC - BPS Offices   Server     PC
17  Bali                     2     28
18  Nusa Tenggara Barat      2     40
19  Nusa Tenggara Timur      2     28
20  Kalimantan Barat         2     30
21  Kalimantan Tengah        2     18
22  Kalimantan Selatan       2     28
23  Kalimantan Timur         2     25
24  Sulawesi Utara           2     20
25  Sulawesi Tengah          2     18
26  Sulawesi Selatan         2     53
27  Sulawesi Tenggara        2     17
28  Gorontalo                2     11
29  Sulawesi Barat           2     10
30  Maluku                   2     10
31  Maluku Utara             2      8
32  Papua Barat              2      9
33  Papua                    2     19
    Total                   76  1,775

Page 21: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Networking

 #  DPC - BPS Offices   Switch (48-node)  Cable (m)
 1  NAD                                1        380
 2  Sumatera Utara                     2        940
 3  Sumatera Barat                     1        410
 4  Riau                               1        490
 5  Jambi                              1        300
 6  Sumatera Selatan                   2        620
 7  Bengkulu                           1        220
 8  Lampung                            2        640
 9  Kep Bangka Belitung                1        190
10  Kepulauan Riau                     1        220
11  DKI Jakarta                        2        830
12  Jawa Barat                         8      3,780
13  Jawa Tengah                        6      2,650
14  DI Yogyakarta                      1        410
15  Jawa Timur                         7      3,140
16  Banten                             2        870

Page 22: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Networking (continued)

 #  DPC - BPS Offices   Switch (48-node)  Cable (m)
17  Bali                               1        360
18  Nusa Tenggara Barat                1        480
19  Nusa Tenggara Timur                1        360
20  Kalimantan Barat                   1        380
21  Kalimantan Tengah                  1        260
22  Kalimantan Selatan                 1        360
23  Kalimantan Timur                   1        330
24  Sulawesi Utara                     1        280
25  Sulawesi Tengah                    1        260
26  Sulawesi Selatan                   2        630
27  Sulawesi Tenggara                  1        250
28  Gorontalo                          1        190
29  Sulawesi Barat                     1        180
30  Maluku                             1        180
31  Maluku Utara                       1        160
32  Papua Barat                        1        170
33  Papua                              1        270
    Total                             39     21,400

Page 23: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Data Capture Software

KOFAX

Page 24: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Kofax Implementation Overview

[Diagram; stages shown: Scan, Recognition, Correction, Completion, Release]

Page 25: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Software Data Capture Implementation

Doc Template Management
o Template Registration
o Template Setting
  • Registration Point
  • Field Definition
  • Field Formatting
  • Multi-Engine Voting
  • Dictionary
  • Data Look-Up
  • Business Rules
  • Integrity among pages

Page 26: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Data Processing Context

[Context diagram; recoverable labels in reading order]
• Provincial Statistics Office: Scanning; Quality Check; Recognition; Correction; Validation; Release; Monitoring; Compiled Data; RBL
• Municipality Statistics Office: Data Entry; Quality Check; RBL; Listing
• Head Quarter
• Statistical Coordinator: Validate, Summarize, Send SMS
• KBC C1: Validate, Summarize, Send SMS
• Census Field Work

Page 27: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Capture Process Flow

[Flow diagram; recoverable labels in reading order]
• PC & Scanner
• Server Data Capture: Classification; Recognition
• PC Document Review
• PC Correction
• PC Completion
• PC Quality Control
• Server Database

Page 28: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Document Preparation

• Objective:
  – To cut the bound side of the forms booklet using a paper guillotine
  – To prepare the documents for the scanning process

Page 29: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Kofax - Module

• Scanning:
  – Scan the batch
  – Count the pages of the document batch during scanning
• QC:
  – The system ensures that the page count of the document batch matches the page total registered before scanning.
• Classification:
  o The system classifies documents based on the template.
• Document Review:
  o Unrecognized documents appear in this module.
  o The operator may rearrange, delete, and re-scan documents.
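The QC step described above amounts to comparing two page counts. A minimal sketch follows, assuming a hypothetical function name and status strings; it is not Kofax's API, only an illustration of the comparison.

```python
def qc_page_count(registered_pages: int, scanned_pages: int) -> str:
    """Compare the page total registered before scanning with the scanned total.

    Returns "pass" when the counts match and "review" otherwise.
    Both names and status strings are illustrative assumptions.
    """
    if registered_pages < 0 or scanned_pages < 0:
        raise ValueError("page counts must be non-negative")
    return "pass" if scanned_pages == registered_pages else "review"


# Example: a batch registered with 120 pages but scanned with 118.
print(qc_page_count(registered_pages=120, scanned_pages=118))
```

A mismatched batch would then be handed to an operator, for instance in Document Review, where pages can be re-scanned.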

Page 30: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Kofax - Module

• Recognition:
  – Extracts data from the processed forms
  – Passes unrecognized data to Correction & Completion
• Correction:
  – Corrects characters that the system recognized below a set confidence level. Correction is made field by field.
• Completion:
  – Completes all corrections for one set of documents in a document batch, according to the validation and business rules set in the system
• Release:
  – Exports images to a predefined folder and data to a predefined database
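The Correction step hinges on a confidence threshold: fields recognized below it go to an operator. The sketch below illustrates that routing under stated assumptions; the threshold value, the function name, and the field names are all hypothetical, and the slides do not give the actual confidence level used.

```python
from typing import Dict, List, Tuple

# Assumed threshold; the real confidence level is not stated in the source.
CONFIDENCE_THRESHOLD = 0.80


def fields_needing_correction(
    recognized: Dict[str, Tuple[str, float]],
    threshold: float = CONFIDENCE_THRESHOLD,
) -> List[str]:
    """Return the names of fields whose recognition confidence is below the threshold.

    `recognized` maps a field name to (recognized value, confidence in [0, 1]).
    """
    return [
        name
        for name, (_value, confidence) in recognized.items()
        if confidence < threshold
    ]


# Illustrative recognition result for one form.
form_result = {
    "age": ("34", 0.97),
    "sex": ("1", 0.99),
    "education": ("0?", 0.41),
}
print(fields_needing_correction(form_result))
```

Only the low-confidence field is queued, which matches the field-by-field correction described above.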

Page 31: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Kofax - Correction

• Sample screen

Page 32: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Kofax – Completion

• Sample screen
• Screen callout: ENTRY PANEL IN TABULAR FORMAT TO CATEGORISED FIELD

Page 33: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Kofax – Completion

• Screen callout: LOCATION ID CHECKING, DATA LOOKUP TO DATABASE

Page 34: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Kofax – Completion

Business Rules
• Verification: child age with biological mother
• Verification: child nationality vs biological father and/or mother
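The two business rules named on this slide can be illustrated with a small validation function. This is a sketch under assumptions: the slide does not state the actual rule definitions, so the minimum age gap, the nationality comparison, and every name below are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional

# Assumed minimum age gap between a biological mother and her child.
MIN_AGE_GAP_YEARS = 10


@dataclass
class Person:
    name: str
    age: int
    nationality: str


def check_child(child: Person, mother: Person, father: Optional[Person] = None) -> List[str]:
    """Return a list of rule violations for one child record (empty if none)."""
    issues: List[str] = []
    # Rule 1 (assumed form): the mother must be sufficiently older than the child.
    if mother.age - child.age < MIN_AGE_GAP_YEARS:
        issues.append("age gap with biological mother below minimum")
    # Rule 2 (assumed form): the child's nationality should match at least one parent.
    parent_nationalities = {p.nationality for p in (mother, father) if p is not None}
    if child.nationality not in parent_nationalities:
        issues.append("nationality differs from both parents")
    return issues


mother = Person("mother", 30, "Indonesian")
child = Person("child", 25, "Indonesian")
print(check_child(child, mother))
```

Returning a list of issues rather than a single flag lets a Completion operator see every failed rule for a record at once.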

Page 35: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Kofax - Release

• Objective:
  – Deliver images to a folder on the File Server
  – Deliver data to the BPS Staging database
• Scope:
  – Write data to the Staging database

RELEASE targets
• Database: SQL Server, Oracle, DB2, ODBC-compliant database
• Text file: CSV, XML
• Image file
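One of the release targets listed above is a CSV text file. The sketch below shows what writing released records to CSV could look like using Python's standard `csv` module; it does not use or imitate Kofax's release interface, and the function name and column names are hypothetical.

```python
import csv
import io
from typing import Dict, List


def release_to_csv(records: List[Dict[str, str]], fieldnames: List[str]) -> str:
    """Serialize released records to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


# Illustrative record; the column names are assumptions.
released = [{"batch_id": "KBC-001", "field": "age", "value": "34"}]
csv_text = release_to_csv(released, ["batch_id", "field", "value"])
print(csv_text)
```

Writing to an in-memory buffer keeps the sketch self-contained; a real release step would write to the predefined folder or database instead.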

Page 36: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Network Architecture of Data Center

Page 37: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Network Integration

[Network diagram; labels: PROPINSI (province), shown three times; BPS PUSAT (BPS headquarters)]

Page 38: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Population of Indonesia based on the Census, May 2010 (preliminary figures, released Aug 2010)

Male (000)   Female (000)   Male + Female (000)
119,508      118,048        237,556

Page 39: USE OF ICR TECHNOLOGY FOR  THE INDONESIA 2010 POPULATION CENSUS

Thank You
