34
Overview of Open Data, Linked Data and Web Science Making Emergent Creativity Haklae Kim, PhD. , August 2012

Overview of Open Data, Linked Data and Web Science

Embed Size (px)

DESCRIPTION

This slide aims to describe general overview of open data and linked data, the relationships between two concepts with examples.

Citation preview

Page 1: Overview of Open Data, Linked Data and Web Science

Overview of Open Data, Linked Data and Web Science Making Emergent Creativity

Haklae Kim, PhD. , August 2012

Page 2: Overview of Open Data, Linked Data and Web Science

London 2012: Open Data Olympics Best Practices

2

Page 3: Overview of Open Data, Linked Data and Web Science

Case Studies The Semantic Web What We Will Do

This Presentation ..... Today

3

Conceptual overview

Page 4: Overview of Open Data, Linked Data and Web Science

4

Page 5: Overview of Open Data, Linked Data and Web Science

Big Data “data that becomes large enough that it cannot be processed using conventional methods”

Let’s Start

5

“Big Data is like Sex in High School–Lots of people are talking about it, but few are having it.”

-Eric Hansen, SiteSpect founder and CEO

Page 6: Overview of Open Data, Linked Data and Web Science

“Open”

material (data) is open if it can be freely used, reused and redistributed by anyone

“Government data”

data and information produced or commissioned by government or government controlled entities.

Source: Open Knowledge Foundation, 2010

6

What is Open (Government) Data? Definition

Page 7: Overview of Open Data, Linked Data and Web Science

•  Transparency •  Participation •  Collaboration

“My administration is committed to creating an unprecedented level of openness in Government.” – Barack Obama

“Memorandum for the Heads of Executive Departments and Agencies – Transparency and Open Government” Jan 2009

Page 8: Overview of Open Data, Linked Data and Web Science

Case Studies The Semantic Web What We Will Do

This Presentation ..... Today

8

Conceptual overview

Page 9: Overview of Open Data, Linked Data and Web Science

h"p://www.prac+calpar+cipa+on.co.uk/odi/wp-­‐content/uploads/2010/06/Open-­‐Data-­‐Impacts-­‐Timeline-­‐Dra@-­‐0.1.png  9  

Page 10: Overview of Open Data, Linked Data and Web Science

Where Does My Money Go PlanningAlerts.com

Top 10 Apps: Data.gov.uk Case Studies

10

OurProperty.co.uk OpenlyLocal.com

Source: Telegraph, 2010, http://www.telegraph.co.uk/technology/news/7044147/Data.gov.uk-Top-Ten-Apps-so-far.html

Page 11: Overview of Open Data, Linked Data and Web Science

Source: http://tinyurl.com/44rub56

The State of Open Government Data Public Sector Dataset

11

Page 12: Overview of Open Data, Linked Data and Web Science

“The application of the four types of instruments by the five countries is depicted – the larger the circle the more instruments are applied” – Huijboom & Van den Broek, 2011.

Open data instruments Open Data Strategies

12

DK DK

DK DK

US

ES ES

ES

AU

UK

UK ES

AU

US

UK US

AU

AU

UK

US

Education and training

Economic instruments

Voluntary approaches

Legislation and control

Page 13: Overview of Open Data, Linked Data and Web Science

Drivers and barries of open data policy implementation Critical factors

13

Strategies and experience in front runner countries 1

2

3

4

5

6

7

8

9

10

Political leadership

Regional initiatives

Citizen initiatives

Market initiatives

Emerging technologies

European legislation

Thought leaders

Possibility of monitoring government

Budgets cuts

Closed government culture

Privacy legislation

Limited quality of data

Limited user-friendliness/information overload

Lack of standardization of open data policy

Security threats

Existing charging models

Uncertain economic impact

Digital divide

Network overload

Source:  Huijboom  and  Van  den  Broek,  2011  

Page 14: Overview of Open Data, Linked Data and Web Science

Case Studies The Semantic Web What We Will Do

This Presentation ..... Today

14

Conceptual overview

Page 15: Overview of Open Data, Linked Data and Web Science

Web in Transition “a steady progression from a document-centric Web to one that is data-centric, including the mediation of semantics”

Let’s Start

15

(Source: Mike, 2007)

Page 16: Overview of Open Data, Linked Data and Web Science

“The Semantic Web isn't just about putting data on the web. It is about making links, so that a person or machine can explore the web of data.  With linked data, when you have some of it, you can find other, related, data” - TBL.

The Semantic Web & Linked Data Overview

16

5 Stars Open linked data

★★

★★★

★★★★

★★★★★

Make your stuff available on the Web

Make it available as structured data

Use open, standard formats (instead of excel)

Use a open data format – URLs, descriptions

Link your data to other people’s data

Page 17: Overview of Open Data, Linked Data and Web Science

… Linked Data provides the means to reach the goal of the Semantic Web – “the emergence of a Web of Data”

17

Growth of Interlinks Overview

2007-05-01 2007-10-08 2007-11-10 2008-02-28 2008-03-31

2008-09-18 2009-03-05 2009-03-27 2009-07-14 2010-09-22

Page 18: Overview of Open Data, Linked Data and Web Science

18 October, 2011 295 interlinked datasets, approximately 31 billions triples

DBpedia

Structured Wikipedia

BBC

Best Buy UK Gov

Multimedia Content

Commercial Product Government Data

Page 19: Overview of Open Data, Linked Data and Web Science

What is the Semantic Web for? Question

19

Search

Inference

Intelligence

Standards

Page 20: Overview of Open Data, Linked Data and Web Science

Google’s Semantic Search Case Studies

People should be able to ask questions and we should understand their meaning, or they should be able to talk about things at a conceptual level. ... A lot of people will turn to things like the semantic Web as a possible answer to that.“ - Google Vice President of Search Products & User Experience Marissa Mayer

20

an initiative launched on 2 June 2011 by Bing, Google and Yahoo! to "create and support a common set of schemas for structured data markup on web pages."

http://schema.org/docs/full.html

The Knowledge Graph is a collection of information sources that help discern a user’s specified intent with each individual query. The graph is actually an encyclopedia with structured information obtained from the web. (currently, 200 million entities)

Freebase is an open, Creative Commons licensed repository of structured data of almost 22 million entities. An entity is a single person, place, or thing connected by a graph.

Page 21: Overview of Open Data, Linked Data and Web Science

Apple’s Siri Case Studies

Ask Siri how Apple recorded the best quarter in history for a tech company, and her answer should be: "Me."

21

Siri (Speech Interpretation and Recognition Interface) is an intelligent personal assistant and knowledge navigator which works as an application for Apple's iOS. A Brief History - In December 2007 Siri, Inc. was formed by Dag Kittlaus (CEO), Adam Cheyer (VP Engineering), and Tom Gruber (CTO/VP Design). - Siri Inc. went after funding and by November 2009 it had secured $15.5 million investment, resulted in the creation of the first Siri application, which debuted on the iPhone 3GS in February 2010. - Siri acquired by Apple; iPhone becomes the Virtual Personal Assistant

Knowledge Navigator (1987) a concept described by former Apple Computer CEO John Sculley in his 1987 book, Odyssey.

(Source: http://www.youtube.com/watch?v=QRH8eimU_20)

Page 22: Overview of Open Data, Linked Data and Web Science

22

Active Ontology Case Studies

A processing formalism where distinct processing elements are arranged according to ontology notions; an execution environment.

Basic concepts * Ontology : A data structure - Formal representation for domain knowledge - Classes, attributes, relations * Active Ontology : A processing environment - Processing elements arranged according to ontology

notions - Communication channels movie

genre actor rating P P P

P

rule set

rule

condition

action

rule

condition

action

rule condition

action

(Baur et al., 2007)

Page 23: Overview of Open Data, Linked Data and Web Science

Linked Data and Open Government Data Why

23

Page 24: Overview of Open Data, Linked Data and Web Science

Linked  Data  life  cycles  

data

awarenessmodeling publishing discovery integration use cases

1 2 3 4 5 6

thedatahub LOD cloud

Neologism DataCube prefix.cc

Google Refine RDB2RDF

VoID DCAT Sindice CKAN

LATC 24/7 duke Sig.ma

datacatalogs data.gov data.gov.uk

Page 25: Overview of Open Data, Linked Data and Web Science

Case Studies The Semantic Web What We Will Do

This Presentation ..... Today

25

Conceptual overview

Page 26: Overview of Open Data, Linked Data and Web Science

Data.gov, along with a number of other data-related sites of the government such as USAspending.gov and Apps.gov, are slated to be shut down due to budget cuts. The current annual budget of $37 million will be reduced to $2 million. – (Guardian April 11)

Reality Check

Data.gov in crisis

26

Page 27: Overview of Open Data, Linked Data and Web Science

고려 사항 Reality Check in Korea

데이터 민감성: WikiLeaks vs Open Data

27

서비스 범위: Domestic vs International

데이터 플랫폼: 정부 vs 민간 vs 커뮤니티

데이터 내용: 통계/수치 데이터 vs 정보형 데이터

데이터 형식: human-readable vs machine-readable

정부의 역할: 시스템 구축 vs 생태계 구축 - 통제가 아닌 효율적인 서비스 지향 - 데이터 공개 및 연계를 위한 로드맵 수립

- 정부기관의 데이터 소유 인식 전환 필요 - 자발적인 참여와 소비를 촉진하는 전략 필요

- 데이터 범주에 따른 차별화된 공개 전략 - 데이터의 활용에 따른 최적화된 서비스 모델

- 서비스 범위에 따른 구축비용/운영 모델 - 국제 표준에 기반한 데이터 접근 서비스 제공

- 통계 기반 시각화에 한정된 모델 지양 - 데이터 특성에 맞는 기술 적용 모델 수립

- 지능적인 데이터 매쉬업 지원을 위한 데이터 모델링 검토

1

2

3

4

5

6

Page 28: Overview of Open Data, Linked Data and Web Science

28

Vision of Government Open Data Conceptual Architecture

“realise significant economic benefits by enabling businesses and non-profit organisations to build innovative applications and websites using public data.”

(Ding et al., 2012)

Page 29: Overview of Open Data, Linked Data and Web Science

29

Roadmap of linked open government data Conceptual Architecture

“the combination of machine power and human power and deliver higher-quality data to a wide range of data consumers via visualization, mashups, and more.”

(Ding et al., 2012)

Page 30: Overview of Open Data, Linked Data and Web Science

Data on the Web Summary

Data is information about things

30

Data is something machines can process

Data drives applications (e.g. web sites, mobile services)

Data is relations among things

Page 31: Overview of Open Data, Linked Data and Web Science

Open Data starts with making available the data that you already have, in whatever format.

•  Equal access for all •  Licensing, legal issues •  Transparency •  Changing the way government works

Open Data vs Linked Data Summary

Open Data

Linked Data •  URIs •  HTTPs •  RDF vocabularies •  Standards

31

Page 32: Overview of Open Data, Linked Data and Web Science

Difficult

Concluding Remarks Hope is not a strategy and the “change” has been change for the worse, and not better.

What We Will Do Interdisciplinary Collaboration

32

Page 33: Overview of Open Data, Linked Data and Web Science

- Charles Baur, Adam Cheyer, Didier Guzzoni, Active, a platform for building intelligent software - Noor Huijboom and Tijs Van den Broek, Open Data: an international comparison of strategies, European journal of ePractices, March/April 2011 - Li Ding, Vassilios Peristeras, and Michael Hausenblas, Linked Open Government Data, IEEE Intelligent Systems, May/June 2012 -  Page 1: http://www.w3.org/DesignIssues/diagrams/websci/Marius%20Watz%20-%20Web%20Science%20artwork.png -  Page 4: http://www.go-gulf.com/60seconds.jpg -  Page 9: http://cloud.frontpagemag.com/wp-content/uploads/2012/03/obama11.jpg -  Page 27: http://www.patentlyapple.com/.a/6a0120a5580826970c0168e5ccdd81970c-800wi -  Page 29: http://programminggeeks.com/wp-content/uploads/2010/05/Programming-Geeks-Web-Science.jpg -  Page 29: http://3.bp.blogspot.com/-C0Kyck90Djo/T4KZTg3k1XI/AAAAAAAAAsE/RUp165S0FCQ/s1600/Commitment.jpeg

Page 2 Case Studies -  http://www.guardian.co.uk/commentisfree/2012/aug/03/london-2012-olympics-open-data -  http://www.bbc.co.uk/news/uk-19050139 -  http://london2012.nytimes.com/results -  http://www.guardian.co.uk/sport/interactive/2012/jul/23/could-you-be-a-medallist -  http://www.guardian.co.uk/sport/datablog/2012/aug/13/olympics-2012-data-journalism -  http://www.guardian.co.uk/sport/datablog/interactive/2012/jul/26/london-2012-price-olympic-games-visualised

References

33

Page 34: Overview of Open Data, Linked Data and Web Science

For more information contact Haklae Kim via [email protected] Twitter: haklaekim Or read up on the sonagi blog at: http://blogweb.co.kr http://thedatahub.kr