17
Emerging Trends in Data Visualization and Dissemination Internet mashups and Statistical Data as a Service 7 October 2010

Data as a service

Embed Size (px)

DESCRIPTION

Presentation to the UN Statistical Commission on mashups and cloud computing.

Citation preview

Page 1: Data as a service

Emerging Trends in Data Visualization and Dissemination

Internet mashups and Statistical Data as a Service

7 October 2010

Page 2: Data as a service

What do we have today?

Statistical organizations became competent and capable content providers and users of the Internet. Most NSOs and International Agencies have their websites with static or dynamic content and with interactive databases or downloadable datasets.

With the current approach we are distributing statistical data as goods but are there other ways to provide access to statistical data?

Page 3: Data as a service

What is a Mashup?

• A web application that A web application that combines data from more combines data from more than one source into a than one source into a single integrated toolsingle integrated tool

an example is the use of an example is the use of data from Google Maps data from Google Maps to add location to add location information to statistical information to statistical data, thereby creating a data, thereby creating a new and distinct web new and distinct web service that was not service that was not originally provided by originally provided by either sourceeither source

Page 4: Data as a service

What are the benefits mashups?

- Creation of new dynamic user experience- Gain valuable insights through information remix- Further promotion of our services and data- Minimized application data management- Reduced development effort- Get results faster by accessing information in

place- Ability to quickly assemble applications for new

situations

Page 5: Data as a service

How a mashup works?

User

Mashup website

User Request

Website 1 Website 2

API Call Data API Call Data

Data presentation

Data Manipulation

Page 6: Data as a service

APIs and web services

- API is an abbreviation for Application Program Interface, a set of routines, protocols, and tools for building software applications. A good API makes it easier to develop a program by providing all the building blocks. A programmer then puts the blocks together.

- Web services today are frequently just Application Programming Interfaces (API) or web APIs that can be accessed over a network, such as the Internet, and executed on a remote system hosting the requested services.

Page 7: Data as a service

How to plan for a mashup?

• Pick a subject A Mashup of What?

Map + Statistical data? Google Fusion tables for transformation + Statistical data?

More sources of data More complicated

• Decide your data sources Who is your data provider?

Google maps, Bing maps, etc.. Online data UNdata, Comtrade, Dallas etc..

Usually language agnostic Varying complexity

• Other concerns How much time do you have? Do you have a server to run it on? Which programming language?

Page 8: Data as a service

Providing Data as a Service

Data as Goods Data as Service

Bulk onetime download Dynamic access

Dated with the time of download

Always latest update

Need for storage Storage is provided

Analysts, researchers, data enthusiasts

Dynamic content providers, mashup creators

Page 9: Data as a service

Data dissemination – UNData

UNdata is unique initiative of the Statistics Division to bring statistical information together from various international and national sources and present it in an easily understandable and accessible format.

• Make UN databases freely available• Organize international databases to allow searchability and

open access

• Promote national data dissemination

• Build a global data dissemination infrastructure

Page 10: Data as a service

Data dissemination – UNData

Page 11: Data as a service

Data dissemination – UNData

Page 12: Data as a service

Data dissemination – UNData

Page 13: Data as a service

Global Data Dissemination Framework

UNDATA API project – an API version of the great data made available by the United Nations on the UNDATA site. The aim is to make this data accessible and reusable in a variety or ways so it can be easily mashed up and recombined into new applications or analysis.

Page 14: Data as a service

Mashups with UNdata

Page 15: Data as a service

Mashups with UNdata

Page 16: Data as a service

Data dissemination – Comtrade

• Data from over 150 countries processed into a standard format

• Data by partner country/commodity from 1962 covering about 90% of the world trade

• 1.5 billion statistical records, 0.5 Terabyte of data

• Free Web access to any record and paid subscription for use of download services

• 6 billion records downloaded since June 2003

Page 17: Data as a service

Data dissemination – Comtrade API

Web Server Database Server

Comtrade Web Services

United Nations

Comtrade

Tariff Line

Total Trade

Organization #1

PC XML Files

Organization #3

Database Server

Comtrade Tools

Other Tools

PC

Organization #2

Comtrade Tools

Other Tools

XML Files

Text Files

Internet

Element based XML

SDMX-ML

Http protocol

Http protocol

Trade Data Transfer Architecture