35
i How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi Survey Data Sets: Case Study of Lilongwe City DECLARATION I declare that this research project is the product of my own effort and has not been previously submitted for a degree at this University or any other University. Name: Lwitiko Mulwafu Signature: …………………………………. Date: ………………………………… Supervised by: Name: Mr. Edgar Malombe Signature: …………………………………… Date: ……………...............................

Project final version

Embed Size (px)

Citation preview

Page 1: Project final version

i How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

DECLARATION

I declare that this research project is the product of my own effort and has not been previously

submitted for a degree at this University or any other University.

Name: Lwitiko Mulwafu

Signature: …………………………………. Date: …………………………………

Supervised by:

Name: Mr. Edgar Malombe

Signature: …………………………………… Date: ……………...............................

Page 2: Project final version

ii How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

DEDICATION

I dedicate this work to my Mum and Dad for their invaluable love, support and advice they gave

me during my studies. I love you.

Page 3: Project final version

iii How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

ACKNOWLEDGEMENTS

Firstly, I would like to thank my supervisor, Mr. Edgar Malombe for giving me the opportunity to

undertake this project and for all his help throughout the project and for always being willing to

assist with all my queries.

I also wish to send my enormous gratitude to the current Northern Regional Surveyor General,

Masida Mbano for explaining to me significant details of the buffer analysis program that

facilitated the analysis procedure of the entire study.

To the entire Land Management staff, I really express my gratitude for the direction in my

academic life since my first level to the final level.

I also acknowledge my colleagues, most notably Gift Banda, Simon Tembo, Joseph Mikuwa,

Fredrick Nazombe, Precious Mzumara, Mark Chin’gombe, Grifton Gawani, Innocent

Kadammanja, Maggie Gondwe, Felister Geya, Chack Banda, Mwawi Kamanga, Collings

Kafunda, Beston Chisamile, SCOM members, and all friends for they were superb in academic

life, heartfelt advice and good humour. I will always remember you.

My final thanks goes to my family for boundless love and encouragement. It was indeed a long

journey. Remain blessed.

Page 4: Project final version

iv How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

LIST OF FIGURES

Figure 2.1: ITN data used for the comparison across London ………….4

Figure: 3.1 Study Area, Lilongwe City………………………………….8

Figure 3.2: Lilongwe Streets dataset……………………………………11

Figure: 3.3 Georeferenced OSM data of Lilongwe city ………….……13

Figure: 3.4 Georeferenced Street Guide ..........................................…...14

Figure: 3.5 Digitized OpenStreetMap…………………………………..15

Figure 3:6 Digitized Lilongwe Street Guide……………………………16

Figure 3.7: Overlapped OSM dataset on the reference dataset ………...17

Figure 4:1 Digitized Lilongwe Map showing roads with names……….1

Figure 4.2: Buffer Analysis…………………………………………….19

Figure 4.3: Intersection of the buffered dataset…………………………20

Figure 4.4: Digitized Lilongwe Map showing roads with names………22

Figure 4.5: Digitized Lilongwe Map showing Length Completes………24

Page 5: Project final version

v How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

LIST OF TABLES

Table 3.1: List of coordinate……………………………………….12

Table 4.1: Results of intersection after buffer analysis ……………21

Page 6: Project final version

vi How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

LIST OF ABBREVIATIONS

OSM= OpenStreetMap

GPS = Global Positioning System

VGI = Volunteered Geographic Information

Page 7: Project final version

vii How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

ABSTRACT

The past few years have seen an influx of open source websites due to advances in technology an

example of one such project in the is OpenStreetMap, a free editable map of the world created by

volunteers.

However, as with all open source projects questions are asked about the quality and validity of the

data provided by OpenStreetMap as the volunteers that contribute the data lack the sufficient

cartographic training and the quality cannot be guaranteed. This study was the carried out by

looking the positional accuracy of OpenStreetMap data by comparing with the Malawi Survey

dataset which is the official data provider.

This study carried out the positional accuracy through a buffer analysis, name Completeness was

assessed by comparing the names on the reference dataset and that of the tested dataset through

the visual comparison and ground verification. Lastly length completeness was calculated as the

percentage of the length of the tested dataset to the length of the reference dataset.

The results of this study found that positional accuracy of OpenStreetMap data is very good in

comparison to the Malawi Survey dataset more than 90% of the dataset was found within the 10

meters of the buffer zone. The completeness was found to be very good with a percentage above

85% in some areas it was found that the reference dataset has more data than the reference dataset.

It was further found out that 19 % of the total length covered in the OSM within the study area are

labelled which means that the total length of 81% are not labelled.

Page 8: Project final version

viii How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

CONTENTS

DECLARATION .......................................................................................................................................................... i

DEDICATION ............................................................................................................................................................. ii

ACKNOWLEDGEMENTS ....................................................................................................................................... iii

LIST OF FIGURES .................................................................................................................................................... iv

LIST OF TABLES ....................................................................................................................................................... v

LIST OF ABBREVIATIONS .................................................................................................................................... vi

ABSTRACT ............................................................................................................................................................... vii

CHAPTER 1: INTRODUCTION............................................................................................................................... 1

1.1 Background ........................................................................................................................................................ 1

1.2 Problem Statement ............................................................................................................................................ 2

1.3: Objective of Study ............................................................................................................................................ 3

1.3.1General Objective ........................................................................................................................................ 3

1.3.2 Specific Objective........................................................................................................................................ 3

1.4 Significance of the Study ................................................................................................................................... 3

CHAPTER 2: LITERATURE REVIEW .................................................................................................................. 4

2.1 Positional Accuracy ........................................................................................................................................... 4

2.2 Name Completeness ........................................................................................................................................... 6

2.3 Length Completeness ......................................................................................................................................... 6

CHAPTER 3: THE RESEARCH METHODOLOGY ............................................................................................. 8

3.1 The Study Site .................................................................................................................................................... 8

3.2 Preparation of the datasets ............................................................................................................................... 9

3.2.1 OSM dataset ................................................................................................................................................ 9

3.2.2. Malawi Survey Datasets .......................................................................................................................... 10

3.3 Data Collection Methods ................................................................................................................................. 12

CHAPTER 4: RESULTS ANALYSIS ..................................................................................................................... 18

4.1 Positional Accuracy ......................................................................................................................................... 18

4.2 Name Completeness ......................................................................................................................................... 22

4.3 Length Completeness ....................................................................................................................................... 23

CHAPTER 5: CONCLUSION AND RECOMMENDATIONS ............................................................................ 25

5.1 Conclusion ........................................................................................................................................................ 25

5.2 Recommendations ............................................................................................................................................ 25

Page 9: Project final version

ix How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

5.3 Future work ...................................................................................................................................................... 25

REFERENCES .......................................................................................................................................................... 26

Page 10: Project final version

1 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

CHAPTER 1: INTRODUCTION

1.1 BACKGROUND

OpenStreetMap (OSM) is the most popular Volunteered Geographic Information (VGI)

community. OSM was initiated by Stephen Coast in July 2004 at the University College

London. Since its establishment, OSM is expanding at a large scale to the extent that the number

of registered users grew rapidly so that by November 2010 there were more than 320,000 registered

users and more than 2.000 million tracked points in the database (OSM 2010a).

In the past, urban data management used to be performed by professional cartographers, public

authorities or commercial data providers. Nevertheless, in the last couple of years a new and

different trend for data collection has evolved, describing the collaborative and volunteered

collection of geographic data. (VGI) is a term used to describe the voluntary act of creating

geographic information, often by users who are largely un-trained with little or no formal

qualifications in the creation of such data. This is sometimes also referred to as “Neogeography”,

which can be defined as combining the complex techniques of GIS and cartography and placing

them within the reach of users and developers, and is essentially about people using and creating

their own maps, by combining elements of an existing toolset (Turner 2006).

This has also been described as the ‘wikification’ of GIS where the masses are quietly being

transformed from being passive consumers to active producers of geospatial information (Sui,

2008). Volunteered Geographic Information (VGI) comprises the effect that an ever-expanding

range of users creates, assembles and disseminates geographic and spatial data in a collaborative

and volunteered manner (Goodchild 2007a). This means an individual person or groups collects

and creates geographic data based on their personal measurements and their knowledge about their

surroundings and furthermore share that information with others through open web platforms.

Song & Sun (2010) alluded to that VGI can be considered as a new opportunity of systems and

sensors for monitoring urban and regional environments. Particularly in urban environment where

the coverage is very good, because many humans results in many potential sensors and therefore

the usage of VGI in urban management increases.

As an online map collaborative plan, provided voluntarily by individuals involving the capture,

processing and dissemination of geographic information, the project aims to create and distribute

vector data for the world because most maps thought of as free actually have legal or

technical restrictions on their use, holding back people from using them in creative, productive,

or unexpected ways.

OSM follows the peer production model (Haklay & Weber 2008) that created Wikipedia and aims

for the provision of free to use and editable map data. The data in OSM is created in different

ways, one of which being acquisition of original data, manually captured by users with GPS

Page 11: Project final version

2 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

devices. However, people can also contribute data based on aerial images (e.g. by Bing or Yahoo)

or by contributing their local knowledge about the region they live in.

1.2 PROBLEM STATEMENT

According to Benkler (2006) OpenStreetMap (OSM) project offers completely free data with an

open content license which eases data collection to user but still questions arises on the quality and

validity of the information delivered by the OSM. It is however unknown on whether the street

data provided by the volunteers to the OSM maintains its position in terms of the ground

coordinates and its names so as to serve its purpose in guidance. In august 2013 Malawi held the

SADC Summit which saw delegates from different countries attending the meeting and in these

cases OpenStreetMap could play a greater role as guidance tool in navigation so as to help in

finding hotels, filling stations, health facilities and many more important features. Hence a

requirement for an understanding on how good OSM data actually is, so as to identify areas of

improvement and determine the fitness of the map obtained.

Page 12: Project final version

3 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

1.3: OBJECTIVE OF STUDY

1.3.1GENERAL OBJECTIVE

The overall objective of this research will be to compare OpenStreetMap with Malawi survey data

sets. This objective will be achieved by specifically looking at the following objectives.

1.3.2 SPECIFIC OBJECTIVE

1. To establish data quality in terms of positional accuracy

2. To establish data quality in terms of name completeness

3. To establish data quality in terms of length completeness.

1.4 SIGNIFICANCE OF THE STUDY

The study is essential because it will discuss the major concern of the OSM data and measure

whether they are used in appropriate way considering their quality. Users’ needs maps to have

known accuracy and up-to-date so as to be reliable. If these aspects are identified on OSM then

the fitness of use and the quality of the user generated project shall be evaluated.

After gauging the level of positional accuracy of the OSM dataset, the next issue is the level of

completeness. While Steve Coast, the founder of OSM, stated it is important to let go of the concept

of completeness’ (GISPro, 2007, p.22), it is important to know which areas are well covered and

which are not otherwise, the data can be assumed to be unusable. Furthermore, the analysis of

completeness can reveal other characteristics that are relevant to other OSM projects.

Page 13: Project final version

4 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

CHAPTER 2: LITERATURE REVIEW

2.1 POSITIONAL ACCURACY

Initially, Haklay (2010), focused on the positional accuracy and length completeness of England

OSM data through comparison with the Ordinance Survey’s Meridian 2 dataset. The methodology

used to evaluate the positional accuracy was based on Goodchild and Hunter (1997) and

Hunter (1999). The comparison was carried out by using buffers to determine the percentage of

line from one dataset that is within a certain distance of the same feature in another dataset of

higher accuracy. The analysis showed that OSM information can be fairly accurate with average

within about 6 meters of the position recorded by the Ordnance Survey, and with approximately

80% overlap of motorway objects between the two datasets.

Following this work, Ather (2009) studied four areas of London in detail, using the highly accurate

Master Map Integrated Transport Network (ITN) from Ordnance Survey as shown in Figure 2.1

below.

Figure 2.1: ITN data used for the comparison across London (The Cartographic Journal Vol. 47

No. 4 pp. 315–322 November 2010)

Note that Meridian 2 is a generalized dataset, which its nodes are kept in their original position, as

was measured in high accuracy methods, the number of nodes in each road segment is reduced

through the application of a 20-metre buffer. ITN, on the other hand, records the centreline of

roads across the UK, based on information from field survey and through photogrammetry using

Page 14: Project final version

5 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

high-resolution imagery. The accuracy of ITN is expected to be less than 1 metre in urban areas,

where the study took place.

Overall, 109 different roads were examined covering over 328 km. The results of the buffer

analysis showed that on the whole the percentage overlap for all the roads were very high. Each

tile had a combined average percentage overlap above 80%, with the tile TQ37sw (South London)

having largest percentage overlap of 85.8%. Three of the roads examined had an exact 100%

overlap, the A4207, B412 and the B412. Between each tile, the average percentage overlap for the

A-roads ranged between 92.6% to 84.3% and for B-roads it ranged between 81.64% to 71.52%,

and the only motorway to measured, the M4, had a very high percentage overlap of 98.9%. A

pattern here is that the larger road type had better results, as for example the percentage overlap of

the motorway segment was higher than the average A-road results, which in turn was higher than

the average B-road results. This is most probably due to the nature of the buffer analysis where

larger buffer sizes were used for the wider road types, and hence there is a greater margin for error

for any data collected on wider roads. Regardless of this, the average percentage overlap for each

road type was still very high.

Looking closely at the distribution of the results, a large majority of the roads examined had

percentage overlaps above 90%, as illustrated by the histogram below (Figure 29). The histogram

clearly showed the results are skewed towards 100% overlap, with the cumulative percentage

overlap results increasing dramatically at two stages, once after 70% overlap and a second time

after 85% overlap. The last bin (95-100% overlap) contained the highest frequency of results, with

64 if the roads examined (57%) falling in the last three bins (85%-100% overlap).

According to Bhattacharya (2012.p.36) at least 59% of OSM road network was found to be within

2 meter buffer zone, and up to 122% of OSM road placed within 6 meter buffer zone of

corresponding TOP10NL road. This result indicates that there exist more than one roads of OSM

within the 6 meter buffer zone of corresponding TOP10NL road. As said 6 meter is a good

positional accuracy so OSM road network data in the Netherlands was considered to be as quite

compatible with TOP10NL.

Neis et al. (2010) compared the length of the mapped street network of commercial data and OSM

for the year 2010. They stated that there are nearly no differences in the overall street length

between both data sets, but found a difference of 40 percent of street segments capable for routing

applications.

Page 15: Project final version

6 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

2.2 NAME COMPLETENESS

Apart from the positional accuracy Ather (2009) conducted quality tests for the road name attribute

completeness and for the number of users per area. The results showed that the areas with higher

percentage overlap results also had greater levels of road name completeness. It was suggested

that analysis on other attribute information, such as road type would be useful since it was noticed

to be left blank in many cases. With regards to the user analysis the results showed a positive

correlation between road name attribute completeness and number of users per area. On the

other hand, the number of users and the positional accuracy did not appear to have any correlation

at all.

The results of the name completeness analysis showed that only 26% of roads within the study

area are labelled meaning that 84% of the roads are unlabeled roads. Comparing these results with

the results of the previous research made by Ather (2009) for the area of London, we can clearly

see that there is a significant, difference since the most incomplete area of London had only 31%

of roads without name.

Ather (2009 .p.62), results of the road name attribution analysis showed that out of the four tiles

in figure 2.3 above TQ28se (North/Central London) was the largest dataset with 98.53km of road

and had the greatest level of road name attribute completeness with only 5% of roads unlabeled.

Tq38se (East London), the second largest dataset with 94.3 km of road, was the most incomplete

with 31% of road name attributes missing.

2.3 LENGTH COMPLETENESS

Haklay (2008) commented that the completeness of OSM is the most significant aspect of VGI

quality. He calculated the difference of total length of all roads from the OSM and Meridian 2

dataset to check the completeness of the OSM dataset. It appeared that the total length of the OSM

dataset covers up to 69% of the Meridian 2 dataset.

The research further examined length completeness element on comparison between Meridian2

(reference dataset) and OSM (test dataset). The study area covered five OS tiles at 1:10,000

resolution, covering 113 square kilometres across England. England was considered the most

appropriated study area due to the fact that OSM started there. The evaluation of completeness

was undertaken for the whole of England. The results showed that OSM data cover 29% of

England and approximately 4% of these data lack complete set of attributes. Generally, major cities

are well covered but that is not the case as you move from the centers to the outskirts. Hence the

inconsistency of OSM data means that is not yet suitable for more sophisticated GIS analysis.

Bhattacharya (2012.p.30) found out that the road line data of OSM over all study areas covered

more than 72% of the TOP10NL road line data. It was further found out that in Delft the OSM

Page 16: Project final version

7 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

road line data was covered more than TOP10NL. Finally, it was concluded that though the OSM

data over the study areas was not yet 100% complete, they were well presented and the percentage

of completeness was different in different locations. From the high presence of some object classes

it may be presumed that in the near future the OSM data of these areas will be complete.

This brief literature review highlights, the clear need for understanding of OSM data quality in

order to provide an assessment of the level of accuracy of the current and provide

recommendations on OSM dataset.

Page 17: Project final version

8 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

CHAPTER 3: THE RESEARCH METHODOLOGY

3.1 THE STUDY SITE

A research was done in Lilongwe city, located in the central region of Malawi. Lilongwe city was

selected because it has received a lot of contributions on the OpenStreetMap hence being ideal for

the research site. Furthermore the study area includes the City Centre and some suburbs which

enablebled the researcher to compare the results between areas that have dense road network with

high mobility and other areas. The selected spatial extent of interest is shown in figure 3.1 below.

Figure 3.1: Study Area, Lilongwe City; Date of data acquisition: 28thJune, 2014. (Source:

www.openstreetmap.org)

Page 18: Project final version

9 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

3.2 PREPARATION OF THE DATASETS

3.2.1 OSM DATASET

As outlined in the literature review, all map data used to create OpenStreetMap has been collected

and uploaded by users who have registered on the OSM website. The collection is carried out

through the use of handheld GPS devices, or traced from Yahoo! Imagery or other free map

sources, and digitized using either the online or offline editing tools Potlatch and JOSM.

The issue of inconsistent results and differing levels of accuracy in the OSM dataset is also partially

due to the fact that different GPS devices are used by each user, and therefore the measurements

are of different levels of accuracy depending on the device used. The level of accuracy is also

highly dependent upon the nature of data collection. For a user collecting data whilst driving, the

number of points collected for any particular road would vary upon the speed of the car; driving

slowly allows more GPS points to be measured and would therefore lead to greater accuracy.

Therefore, users who collect data by cycling or by walking are more likely to obtain more accurate

results as they are travelling slower.

Numerous human errors can also occur during the digitization process. Users may accidently make

changes to the map whilst digitizing their own points, for example by moving road points created

by other users. This may certainly be the case for first time users using Potlatch, who may be

experimenting at first, although a practice option is provided where no edits to the map are saved.

Another problem is the ease of sabotage, as once some registers to OSM they are free to make any

changes they like to the OSM map.

Although there are specific guidelines about how the data must be edited, it does not exempt the

user from human errors which occurs while the map is being created which results in inconsistent

of OSM datasets. Another factor that makes the OSM inconsistent is that users may collect the

data from different sources, hence obtaining the measurements of different levels of accuracy.

Since the accuracy of OSM data cannot be assessed precisely due to the nature of the mapping

methods the only way to evaluate the quality of these OSM data will be by comparing the obtained

OSM with a higher level datasets which was obtained at digital and mapping section in the

Department of Surveys (DoS).

Page 19: Project final version

10 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

3.2.2. MALAWI SURVEY DATASETS

The Lilongwe street guide road was produced by the by the Department of Surveys (DoS) at the

Malawi Surveys. The Malawi Survey is the most official map provider that produces the

cartographic products which are widely used in the country, this is so because there are specialist

who are assigned to collect data and hence fourth produce the map at specified accuracy depending

on the use of the map. During preparation of the data it was found that the Lilongwe Street Guide

dataset was comprehensively updated in 2003 and partially updated in 2012 as Malawi was

preparing to hold SADC summit in 2013.

During the partial update, major roads which were connected to major facilities such as hotels,

hospitals were carefully retraced based on the comprehensive map which was produced in 2003.

Centered on its accuracy the researcher saw it a need to use the dataset as a reference dataset.

Figure 3.2 below shows the map produced by the Malawi survey.

Page 20: Project final version

11 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

Figure 3.2: Lilongwe Streets dataset, (Source: Department of Surveys (DoS), 2012).

Study site

Page 21: Project final version

12 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

3.3 DATA COLLECTION METHODS

The following methodology aims to assess the quality elements of OSM road network (tested

dataset) against the Lilongwe Street Guide dataset (reference dataset). The methodology focuses

on the analysis of each element and on the comparison between the results of those. The

length completeness analysis was conducted for road network of the dataset but the rest of the

analysis was carried out only for the primary, secondary and tertiary road class due to time

restrictions.

Primary data was collected through field work using the Trimble Nomad GPS. Coordinates in form

of Universal Transverse Mercator (UTM) was collected from the selected crucial points of the

roads so as to act as references. The Trimble Nomad GPS recorded two kinds of information

namely the waypoints and tracks that was useful for georeferencing maps both the OpenStreetMap

and the street guide obtained from Surveys Department in Lilongwe.

The researcher used kinematic positioning to obtain reference points which were then used in

georeferencing the OpenStreetMap and the street map guide which was obtained from digital and

mapping section in the Surveyor General’s office. The following coordinate were obtained in nine

stations, (A to I).

Station ID Easting Northing

A 579898.742 8454929.903

B 581753.265 8455023.972

C 583186.404 8453253.726

D 584966.892 8450889.034

E 587317.496 8454109.726

F 584799.083 8455816.772

G 584998.305 8457070.281

H 584898.962 8457642.575

I 581004.716 8458673.678

Table 3.1: List of coordinates

Page 22: Project final version

13 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

Figure 3.3: Georeferenced OSM data of Lilongwe city; Date of data acquisition: 13th September,

2014. (Source: Georeferenced by the researcher)

Page 23: Project final version

14 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

Figure 3.4: Georeferenced Street Guide; (Source: Department of Surveys (DoS++66) 2013)

The two georeferenced datasets were then digitized using the editor tool in ArcGIS10 software.

Figure 3.5 and 3.6 below shows the digitized data obtained from the two datasets.

Page 24: Project final version

15 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

Figure 3.5: Digitized OpenStreetMap (source; Digitized by the researcher)

Page 25: Project final version

16 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

Figure 3.6: Digitized Lilongwe Street Guide (source; Digitized by the researcher)

After digitizing the OSM and Lilongwe Street map Guide which in this case is a reference dataset,

the OSM dataset was overlaid on the referenced data such that impression of the similarities and

dissimilarities was found. Figure 3.7 bellow depicts the visualization of all object classes together

from OSM and Lilongwe datasets from the same area.

Study

Area

Page 26: Project final version

17 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

Figure 3.7: Overlapped OSM dataset on the reference dataset (source; Digitized by the researcher)

Page 27: Project final version

18 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

CHAPTER 4: RESULTS ANALYSIS

4.1 POSITIONAL ACCURACY

Positional accuracy was carried out through a buffer analysis as recommended by Goodchild and

Hunter (2004). The buffer method was used to know how far or close was the displacement

between two datasets namely the reference (Street Guide Road) and tested dataset (OSM) as shown

in figure 10 below. The results of this method were presented as percentage of the data

within the buffer zone of 10 meters as indicated by Ather, (2009 p; 18). A 100% overlap means

that the tested dataset are perfectly overlapped to the reference dataset within a buffer zone.

Figure 4.1: Buffer Method (source; Digitized by the researcher)

The buffer method was applied in the Main roads and Secondary roads within the study area as

indicated in figure 4.2 below:

Page 28: Project final version

19 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

Figure 4.2: Buffer Analysis (Source; Prepared by the researcher)

The intersection of two dataset was then performed using ArcGIS software such that the roads

which was not within 10 meters was clearly shown as indicated in figure 4.3 below where the red

colour indicates roads within the buffer zone while blue colour indicates roads which are not within

the 10 meters buffer zone.

The figure 4.3 below shows the results of intersection after the buffer analysis was carried out.

Page 29: Project final version

20 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

Figure 4.3: Intersection of the buffered dataset (Source; Prepared by the researcher)

The results was the of the roads within 10 meters buffer zone was then calculated as indicated by

table 4.l below

Page 30: Project final version

21 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

The details of road length within and out of the buffer zone is shown in the table 4.1 below

Names of ROADS Total length (Meters) OSM within 10m buffer zone (Meters) Percentage

Mzimba Street 5,281.85 5,281.85 100

Chidzanja Road 4,053.81 3,369.21 83

Youth Drive 3,142.10 2,896.50 92

Chayamba Road 3,700.17 3,674.55 99

Presidential Way 7,938.38 6,842.88 86

Kenyatta Road 4,155.71 4,155.71 100

Paul Kagame Road 3,547.60 3,547.60 100

Kamuzu Procession Road 10,642.91 9,314.73 88

Kaunda Road 4,231.63 3,062.66 72

Lilongwe Bypass ---------- ---------- -----

Queens Road 2,398.08 2,398.08 100

Sir Glyn jones Avenue 4,473.96 4,442.54 99

Table 4.1: Results of intersection after buffer analysis.

The overall results shows that 92% of roads in the OSM dataset are found within the 10 meters of

the buffer zone which indicates that there is high positional accuracy and this is in line with what

Kounadi, (2009.p.58) found. Since high accuracy in position is achieved then the OSM dataset can

be fairly used by different organizations as for instance in tourism as a navigational tool guide for

tourist and disaster management. In disaster management OSM dataset can be used to control

further spread of diseases such Ebola by indicating the positions of areas which are affected on the

map as a no goes on areas.

The study further found out that Lilongwe bypass is only available on the OSM dataset and not on

the street guide dataset which was used as the reference dataset. This is so because the Lilongwe

bypass is a newly constructed road and it was not there when the reference dataset was being

prepared.

Page 31: Project final version

22 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

4.2 NAME COMPLETENESS

Name Completeness was assessed by comparing the road names on the reference dataset and that

of the tested dataset. This was done through the visual and ground verification in terms of roads

attributes after the two datasets were matched.

Figure 4.4 below shows the road names of the two datasets.

Figure 4.4: Digitized Lilongwe Map showing roads with names (source; Digitized by the

researcher)

The study found that all road names which appears on the reference dataset was also present on

the tested dataset. It was further established that the tested dataset had more road name percentage

Page 32: Project final version

23 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

than the reference dataset as indicated by figure above where roads in blue are the road which have

names labeled on OSM dataset while missing the labels on the Lilongwe street guide.

Although the OSM datasets registered some confidence in presence of road names but it was

however found that the length completeness was very high as compared to name completeness.

This is so because many roads residential roads which appear in both dataset are not labelled and

it was calculated that only 19 % of the total length covered in the OSM within the study area are

labelled which means that the total length of 81% is not labelled.

It was further established that the reference dataset maintains some old names which was changed

as for instance Paul Kagame road is indicated as Chilambula road on the reference dataset although

changed on 4th September, 2007 after being upgraded into a highway. This is so because the

Lilongwe Street Guide (reference dataset) has the names which are not up- to-date,

4.3 LENGTH COMPLETENESS

Length completeness calculated as the percentage of the length of the tested dataset to the length

of the reference dataset. The calculations were carried out for on a sample grid square.

Page 33: Project final version

24 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

Figure 4.5: Digitized Lilongwe Map showing Length Completes (source; Digitized by the

researcher)

Generally, the completeness is very good with a percentage above 85%. The road that do not

appear in the OSM map dataset are residential roads as it was found that all major roads on the

referenced datasets are available on the tested datasets. It was found that some grids have low

percentage of the road completeness as for instance in south east grids in figure 11 above.

It was however found in grid A the total OSM road network was 8000.349 meters in length and

Lilongwe Street Guide road network is 7853.366 meters such that the OSM road network is 2%

more than the Lilongwe Street Guide road network.

Page 34: Project final version

25 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

CHAPTER 5: CONCLUSION AND RECOMMENDATIONS

5.1 CONCLUSION

The study managed to achieve its main objective and specific objectives as stated in the preamble

of the dissertation. Therefore, it can be fairly concluded that the position accuracy of OSM is quiet

high in the primary and secondary roads which means that the datasets can be used with reliability

for several cartographic purposes.

However there are still issues with regards to the name completeness of the data. The analysis

showed that 81% of roads within the study area are unlabeled. Furthermore, the study disclosed

that as volunteers keep the OSM data up-to-date by continuously providing information of new

changes in the real world, users of OSM receive more frequently updated information. On the

contrary, Lilongwe Street Guide users need to wait till the next version is available, which happens

when there is a need without known duration.

5.2 RECOMMENDATIONS

In connection to the findings of this research, the following recommendations have been put

forward:

Ground verification needs to be intensified so that the data which is collected on the OSM

can be labelled. This is due to the fact the road names can only be established by users who

have conducted ground surveys, as identifying road names from other map sources would

have some errors and it can be accompanied by restrictive copyright laws.

The OSM dataset can be fairly used in the production of sketch plans by Malawi Surveys

as its positional accuracy has proved to be high.

Malawi Survey dataset needs to be update frequently if it is to continue standing out as an

official dataset.

5.3 FUTURE WORK

As per the findings of this study, it is suggested that the following areas should be further studied:

The same study can be carried out on a larger sample covering even the position

accuracy of residential roads so as to reduce the limitations of generalisation.

A deeper statistical analysis should be carried out on the data completeness with another

official dataset.

Page 35: Project final version

26 How good is OpenStreetMap? A Comparative Study of OpenStreetMap and Malawi

Survey Data Sets: Case Study of Lilongwe City

REFERENCES

1. Ather, A, 2009.A Quality Analysis Of Openstreetmap Data. MSc, University College

London

2. Benkler, Y. and Nissenbaum, H. (2006) Commons-Based Peer Production and Virtue.

Journal of Political Philosophy, 14 (4): 394-419.

3. Goodchild, M. F. (2007), 'Citizens as Voluntary Sensors: Spatial Data Infrastructure in

the World of Web 2.0', International Journal of Spatial Data Infrastructures Research

Vol.2, 24-32

4. Haklay, M. (2010), 'How good is Volunteered Geographical Information? Acomparative

study of OpenStreetMap and Ordnance Surveydatasets', Environment and Planning B:

Planning and Design.

5. Kounadi, O. (2009), 'Assessing the quality of OpenStreetMap data', Master's thesis,

UniversityCollege of London Department of Civil, Environmental And Geomatic

Engineering.

6. Lee, M., 2010. A political economic critique of Google Maps and Google Earth.

Information,Communication & Society 13(6), 909-928.

7. Turner, A.J. 2006, Introduction to Neogeography. O'Reilly, [eBook]

http://shop.oreilly.com/product/9780596529956.do?CMP=OTCKW7501011010&ATT=

neogeography#.

8. Zulfiqar, N. (2008) A Study of the Quality of OpenStreetMap.org maps: A comparison of

OSM data and Ordnance Survey Data, MEng Thesis, London, University College London.