Upload
landen
View
50
Download
0
Tags:
Embed Size (px)
DESCRIPTION
BPS Statistics Indonesia New York, February 2011. USE OF ICR TECHNOLOGY FOR THE INDONESIA 2010 POPULATION CENSUS. Background. Population census in Indonesia is held every ten year. Indonesia has the fourth largest population and the largest archipelago. - PowerPoint PPT Presentation
Citation preview
1
USE OF ICR TECHNOLOGY FOR THE INDONESIA 2010 POPULATION CENSUS
BPS Statistics IndonesiaNew York, February 2011
Background• Population census in Indonesia is held every ten year.• Indonesia has the fourth largest population and the largest
archipelago.• History of data processing for population census
o 1971 - OMR Technology, mainframeo 1980 – data entry, mainframeo 1990 – data entry, mainframe, distributedo 2000 – OCR technology, PC clusterso 2010 – ICR and mobile technology, PC clusters.
2
3
Data Processing Centers
• Located in 33 Provincial Statistics Offices.VPN
4
Flow of Documents in the Fields
FIELDSENUMERATOR KORTIM
BPS
DISTRIC
SP2010 L, KBC, RT, ART
Doc Pool Doc PoolSP2010 L,
KBC, RT, ART
BPS
KECDESA
Drop Off Receiving & Handling Queuing
Unpack &Checking
RepackExpedition Entry SP2010-L
PROVINCE
Drop Off Receiving & Handling Queuing
Unpack Repack
Expedition/Next Queuing Entry Coding
BPS
BPS
6
DPC Personnel
Box sorting In Queuing RoomRECEPTION AREA
- 1) Download Box- 2) Put Boxes in the trolley- 3) Input received Data - 4) Arrange box in Queuing Room
Database Box & Block Sensus
Database
CODING PICKUP OFFICER- 5) Take box from queuing room- 6) Registration of pick up box - 7) Deliver box es to Coding Editing Supervisor
CODING EDITING SUPERVISOR- 8) Boxes distribution ke petugas Coding
Editing- 11.2) Check & Authorization on any pages
discrepancies- 13) Update data box that finished coding
editing
CODING EDITING OFFICER- 9) Box opening- 10) Unbind documents- 11) Pages count- 12) Coding Editing
- 11.1) Reporting of discrepancies pages
Sorting Boxes inScanning
Queue
PROVINCE
7
Flow of Processing DocumentsFIELDS
DROP-OFF SERVICESTAGING
DOC PREP
REPACKINGFUMIGATION
DOCUMENT STORAGE
DPC
Drop Off Receiving & Handling Registering
Sorting
Unpack &Checking
Cutting
SCANNING
DOCUMENTPREPARATIONCORRECTION & COMPLETIONVALIDATION
8
Flow of Work in DPC :
Scanning & Warehouse
Box Scanning Queue
SCANNING PICKUP OFFICER- 1) Pickup box from Scanning Queue- 2) Pickup box registration- 3) Deliver box to Scanning Supervisor
Database Box & Block Sensus
Database
SCANNING OFFICER- 5) Register # box- 6) Scan docoments
REPACKING OFFICER- 7) Repacking box- 8)Register finished repack
STORAGE OFFICER- 9) Trolley from Repacking
to Doc Storage
STORAGE OFFICER- 12) Place box refer to
Put-Away
STORAGE ADMIN- 10) Register box- 11) Cetak Put-Away
Penyimpanan
DATA CAPTURESERVER
PROVINCE
9
Flow of Data in DPCDPCBPS
BPSServer
INFORMATION TECHNOLOGY
CAPTURE SYSTEM
SUPPORTAPPS
StagingCleanData
Data Tabulasi
Correction&
CompletionValidasi
Data Validasi
Data Staging
Image + data
RECEPTION SERVICE
DOCUMENT STORAGE
Status box
Lokasi box
Scanning
Image + data
RELEASE
10
Batching System• Document batch
o 1 SP 2010 KBCo Consist of = n SP 2010 RTo Each RT consist of = n SP 2010 ART
11
Capture Process• Fixed Form Approach• High speed Auto classification & separation• Accurate High Speed ICR engine• Accurate High Speed OMR engine• Consistency check capability• Inter-page business rule validation• Multipage business rules validation• Low false positive & Tuning
12
Solution Componen
ts
13
Solution Components• Guillotine• PCs• Server• Scanner• Software Data Capture• Training & troubleshooting• Template Development • Distribution, installation & implementation in
each DPC (33 locations)
14
Fujitsu Scanner Fi-6800• Scanner Speed : 130 ppm 300 dpi • Duty Cycle : 100.000 pages/ day• Resolution : 600 dpi• Feeder Capacity : 500 pages• Paper Size : up to A3• Imprinter capability : Pre and Post
15
Guillotine, workstation, scanner
16
Data Capture Server, Validation Server
17
Server Console, Server Racks
18
Scanner Allocation# DPC - BPS Offices No. of Docs Scanner
allocation1 NAD 4.350.904 1 2 Sumatera Utara 12.336.180 2 3 Sumatera Barat 4.413.328 1 4 Riau 5.738.708 1 5 Jambi 2.961.468 1 6 Sumatera Selatan 7.534.780 1 7 Bengkulu 1.785.968 1 8 Lampung 7.841.096 1 9 Kep Bangka Belitung 1.138.932 1
10 Kepulauan Riau 1.612.044 1 11 DKI Jakarta 10.508.444 2 12 Jawa Barat 37.095.756 6 13 Jawa tengah 30.000.000 5 14 DI Yogyakarta 4.399.192 1 15 Jawa Timur 34.511.536 5 16 Banten 11.114.704 2
19
Scanner Allocation# DPC - BPS Offices No. of Docs Scanner
Allocation17 Bali 3.900.360 1 18 Nusa Tenggara Barat 5.485.668 1 19 Nusa Tenggara Timur 3.998.896 1 20 Kalimantan Barat 4.291.276 1 21 Kalimantan Tengah 2.393.316 1 22 Kalimantan Selatan 3.956.552 1 23 Kalimantan Timur 3.513.608 1 24 Sulawesi Utara 2.669.324 1 25 Sulawesi Tengah 2.301.920 1 26 Sulawesi Selatan 7.745.836 2 27 Sulawesi Tenggara 2.158.264 1 28 Gorontalo 1.142.544 1 29 Sulawesi Barat 1.006.684 1 30 Maluku 1.082.632 1 31 Maluku Utara 715.672 1 32 Papua Barat 837.400 1 33 Papua 2.400.544 1
254.579.368 51
20
Server, PC Allocation# DPC - BPS Offices Server PC
1 NAD 2 302 Sumatera Utara 2 84 3 Sumatera Barat 2 33 4 Riau 2 41 5 Jambi 2 22 6 Sumatera Selatan 2 52 7 Bengkulu 2 14 8 Lampung 2 54 9 Kep Bangka Belitung 2 11
10 Kepulauan Riau 2 14 11 DKI Jakarta 2 73 12 Jawa Barat 2 344 13 Jawa tengah 4 237 14 DI Yogyakarta 4 33 15 Jawa Timur 2 284 16 Banten 4 77
21
Server, PC Allocation# DPC - BPS Offices Server PC
17 Bali 2 28 18 Nusa Tenggara Barat 2 40 19 Nusa Tenggara Timur 2 28 20 Kalimantan Barat 2 30 21 Kalimantan Tengah 2 18 22 Kalimantan Selatan 2 28 23 Kalimantan Timur 2 25 24 Sulawesi Utara 2 20 25 Sulawesi Tengah 2 18 26 Sulawesi Selatan 2 53 27 Sulawesi Tenggara 2 17 28 Gorontalo 2 11 29 Sulawesi Barat 2 10 30 Maluku 2 10 31 Maluku Utara 2 8 32 Papua Barat 2 9 33 Papua 2 19
76 1.775
22
Networking# DPC - BPS Offices Switch 48 node Cable (m)
1 NAD 1 3802 Sumatera Utara 2 940 3 Sumatera Barat 1 410 4 Riau 1 490 5 Jambi 1 300 6 Sumatera Selatan 2 620 7 Bengkulu 1 220 8 Lampung 2 640 9 Kep Bangka Belitung 1 190
10 Kepulauan Riau 1 220 11 DKI Jakarta 2 830 12 Jawa Barat 8 3,780 13 Jawa tengah 6 2,650 14 DI Yogyakarta 1 410 15 Jawa Timur 7 3,140 16 Banten 2 870
23
Networking# DPC - BPS Offices Switch 48 node Cable (m)
17 Bali 1 360 18 Nusa Tenggara Barat 1 480 19 Nusa Tenggara Timur 1 360 20 Kalimantan Barat 1 380 21 Kalimantan Tengah 1 260 22 Kalimantan Selatan 1 360 23 Kalimantan Timur 1 330 24 Sulawesi Utara 1 280 25 Sulawesi Tengah 1 260 26 Sulawesi Selatan 2 630 27 Sulawesi Tenggara 1 250 28 Gorontalo 1 190 29 Sulawesi Barat 1 180 30 Maluku 1 180 31 Maluku Utara 1 160 32 Papua Barat 1 170 33 Papua 1 270
39 21.400
24
Data Capture Software
KOFAX
25
Scan Recognition
Correction Completion Release
Kofax Implementation Overview
26
Software Data Capture Implementation
Doc Template Management o Template Registrationo Template Setting
• Registration Point• Field Definition• Field Formatting• Multi-Engine Voting• Dictionary• Data Look-Up• Business Rules• Integrity among pages
Correction Validation ReleaseRecognitionScanning Quality Check
Provincial Statistics Office
Monitoring
Compiled Data
RBL
Data Processing Context
27
Data Entry Quality Check
Municipality Statistics Office Head Quarter
RBL
ListingStatistical Coordinator
Validate, Summariz
eSend SMS
KBC C1
Validate, Summari
zeSend SMSCensus
Field Work
Capture Process Flow
• Classification• Recognition
PC Document Review
PC & Scanner
Server Data Capture
PC Correction
PC Completion
Server Database
PC QUALITYCONTROL
29
Document Preparation• Objective:
– To cut the side of forms booklet using paper guillotine
– Preparing docs for scanning process
30
Kofax - Module• Scanning :
– Scan batch– Page counting of document batch in scanning process
• QC:– System ensure that the pages of the doc batch match with the registered
sum of pages entry before scanning.
• Classification:o System will classify based on template
• Document Review:o Unrecognized doc will appear in this moduleo Operator may re-arrange, delete and re-scan the doc
31
Kofax - Module• Recognition:
– Data extraction from processed form– unrecognized Data for Correction & Completion
• Correction:–Character correction which un-recognized by system on below a set of
confidence level. Correction made field by field.
• Completion:– To complete all correction on one set of document in a document
batch refer to validation and business rules that have set in the system
•Release:– Exporting image to predefine folder and data to predefine database
32
Kofax - Correction• Sample Screen:
33
Kofax – Completion• Sample Screen
ENTRY PANEL IN TABULAR
FORMAT TO CATEGORISED
FIELD
34
Kofax – Completion
LOCATION ID CHECKING ,
DATA LOOKUP TO DATABASE
35
Kofax – Completion
VERIFICATIONCHILD AGE W/ BIOLOGICAL
MOTHER
VERIFICATION CHILD
NATIONALITY VS BIOLOGICAL FATHER &/ MOTHER
Business Rules
Business Rules
36
Kofax - Release• Objective:
– Deliver image to folder in the File Server– Deliver data to database Staging BPS
• Scope:– Write data to Database Staging RELEASE
Database:-SQL Server, Oracle, DB2,ODBC Compliant Database
Text File:- CSV, XML
Image File
37
Network Architecture of Data Center
38
Network IntegrationPROPINSI
PROPINSI
PROPINSI
BPS PUSAT
Population of Indonesia based on the Census, May 2010(preliminary figures, Released Aug 2010)
Male(000)
Female(000)
Male + Female(000)
119,508
118,048
237,556
39
Thank You
40