Upload
haklae-kim
View
6.877
Download
0
Tags:
Embed Size (px)
DESCRIPTION
This slide aims to describe general overview of open data and linked data, the relationships between two concepts with examples.
Citation preview
Overview of Open Data, Linked Data and Web Science Making Emergent Creativity
Haklae Kim, PhD. , August 2012
London 2012: Open Data Olympics Best Practices
2
Case Studies The Semantic Web What We Will Do
This Presentation ..... Today
3
Conceptual overview
4
Big Data “data that becomes large enough that it cannot be processed using conventional methods”
Let’s Start
5
“Big Data is like Sex in High School–Lots of people are talking about it, but few are having it.”
-Eric Hansen, SiteSpect founder and CEO
“Open”
material (data) is open if it can be freely used, reused and redistributed by anyone
“Government data”
data and information produced or commissioned by government or government controlled entities.
Source: Open Knowledge Foundation, 2010
6
What is Open (Government) Data? Definition
• Transparency • Participation • Collaboration
“My administration is committed to creating an unprecedented level of openness in Government.” – Barack Obama
“Memorandum for the Heads of Executive Departments and Agencies – Transparency and Open Government” Jan 2009
Case Studies The Semantic Web What We Will Do
This Presentation ..... Today
8
Conceptual overview
h"p://www.prac+calpar+cipa+on.co.uk/odi/wp-‐content/uploads/2010/06/Open-‐Data-‐Impacts-‐Timeline-‐Dra@-‐0.1.png 9
Where Does My Money Go PlanningAlerts.com
Top 10 Apps: Data.gov.uk Case Studies
10
OurProperty.co.uk OpenlyLocal.com
Source: Telegraph, 2010, http://www.telegraph.co.uk/technology/news/7044147/Data.gov.uk-Top-Ten-Apps-so-far.html
Source: http://tinyurl.com/44rub56
The State of Open Government Data Public Sector Dataset
11
“The application of the four types of instruments by the five countries is depicted – the larger the circle the more instruments are applied” – Huijboom & Van den Broek, 2011.
Open data instruments Open Data Strategies
12
DK DK
DK DK
US
ES ES
ES
AU
UK
UK ES
AU
US
UK US
AU
AU
UK
US
Education and training
Economic instruments
Voluntary approaches
Legislation and control
Drivers and barries of open data policy implementation Critical factors
13
Strategies and experience in front runner countries 1
2
3
4
5
6
7
8
9
10
Political leadership
Regional initiatives
Citizen initiatives
Market initiatives
Emerging technologies
European legislation
Thought leaders
Possibility of monitoring government
Budgets cuts
Closed government culture
Privacy legislation
Limited quality of data
Limited user-friendliness/information overload
Lack of standardization of open data policy
Security threats
Existing charging models
Uncertain economic impact
Digital divide
Network overload
Source: Huijboom and Van den Broek, 2011
Case Studies The Semantic Web What We Will Do
This Presentation ..... Today
14
Conceptual overview
Web in Transition “a steady progression from a document-centric Web to one that is data-centric, including the mediation of semantics”
Let’s Start
15
(Source: Mike, 2007)
“The Semantic Web isn't just about putting data on the web. It is about making links, so that a person or machine can explore the web of data. With linked data, when you have some of it, you can find other, related, data” - TBL.
The Semantic Web & Linked Data Overview
16
5 Stars Open linked data
★★
★
★★★
★★★★
★★★★★
Make your stuff available on the Web
Make it available as structured data
Use open, standard formats (instead of excel)
Use a open data format – URLs, descriptions
Link your data to other people’s data
… Linked Data provides the means to reach the goal of the Semantic Web – “the emergence of a Web of Data”
17
Growth of Interlinks Overview
2007-05-01 2007-10-08 2007-11-10 2008-02-28 2008-03-31
2008-09-18 2009-03-05 2009-03-27 2009-07-14 2010-09-22
18 October, 2011 295 interlinked datasets, approximately 31 billions triples
DBpedia
Structured Wikipedia
BBC
Best Buy UK Gov
Multimedia Content
Commercial Product Government Data
What is the Semantic Web for? Question
19
Search
Inference
Intelligence
Standards
Google’s Semantic Search Case Studies
People should be able to ask questions and we should understand their meaning, or they should be able to talk about things at a conceptual level. ... A lot of people will turn to things like the semantic Web as a possible answer to that.“ - Google Vice President of Search Products & User Experience Marissa Mayer
20
an initiative launched on 2 June 2011 by Bing, Google and Yahoo! to "create and support a common set of schemas for structured data markup on web pages."
http://schema.org/docs/full.html
The Knowledge Graph is a collection of information sources that help discern a user’s specified intent with each individual query. The graph is actually an encyclopedia with structured information obtained from the web. (currently, 200 million entities)
Freebase is an open, Creative Commons licensed repository of structured data of almost 22 million entities. An entity is a single person, place, or thing connected by a graph.
Apple’s Siri Case Studies
Ask Siri how Apple recorded the best quarter in history for a tech company, and her answer should be: "Me."
21
Siri (Speech Interpretation and Recognition Interface) is an intelligent personal assistant and knowledge navigator which works as an application for Apple's iOS. A Brief History - In December 2007 Siri, Inc. was formed by Dag Kittlaus (CEO), Adam Cheyer (VP Engineering), and Tom Gruber (CTO/VP Design). - Siri Inc. went after funding and by November 2009 it had secured $15.5 million investment, resulted in the creation of the first Siri application, which debuted on the iPhone 3GS in February 2010. - Siri acquired by Apple; iPhone becomes the Virtual Personal Assistant
Knowledge Navigator (1987) a concept described by former Apple Computer CEO John Sculley in his 1987 book, Odyssey.
(Source: http://www.youtube.com/watch?v=QRH8eimU_20)
22
Active Ontology Case Studies
A processing formalism where distinct processing elements are arranged according to ontology notions; an execution environment.
Basic concepts * Ontology : A data structure - Formal representation for domain knowledge - Classes, attributes, relations * Active Ontology : A processing environment - Processing elements arranged according to ontology
notions - Communication channels movie
genre actor rating P P P
P
rule set
rule
condition
action
rule
condition
action
rule condition
action
(Baur et al., 2007)
Linked Data and Open Government Data Why
23
Linked Data life cycles
data
awarenessmodeling publishing discovery integration use cases
1 2 3 4 5 6
thedatahub LOD cloud
Neologism DataCube prefix.cc
Google Refine RDB2RDF
VoID DCAT Sindice CKAN
LATC 24/7 duke Sig.ma
datacatalogs data.gov data.gov.uk
Case Studies The Semantic Web What We Will Do
This Presentation ..... Today
25
Conceptual overview
Data.gov, along with a number of other data-related sites of the government such as USAspending.gov and Apps.gov, are slated to be shut down due to budget cuts. The current annual budget of $37 million will be reduced to $2 million. – (Guardian April 11)
Reality Check
Data.gov in crisis
26
고려 사항 Reality Check in Korea
데이터 민감성: WikiLeaks vs Open Data
27
서비스 범위: Domestic vs International
데이터 플랫폼: 정부 vs 민간 vs 커뮤니티
데이터 내용: 통계/수치 데이터 vs 정보형 데이터
데이터 형식: human-readable vs machine-readable
정부의 역할: 시스템 구축 vs 생태계 구축 - 통제가 아닌 효율적인 서비스 지향 - 데이터 공개 및 연계를 위한 로드맵 수립
- 정부기관의 데이터 소유 인식 전환 필요 - 자발적인 참여와 소비를 촉진하는 전략 필요
- 데이터 범주에 따른 차별화된 공개 전략 - 데이터의 활용에 따른 최적화된 서비스 모델
- 서비스 범위에 따른 구축비용/운영 모델 - 국제 표준에 기반한 데이터 접근 서비스 제공
- 통계 기반 시각화에 한정된 모델 지양 - 데이터 특성에 맞는 기술 적용 모델 수립
- 지능적인 데이터 매쉬업 지원을 위한 데이터 모델링 검토
1
2
3
4
5
6
28
Vision of Government Open Data Conceptual Architecture
“realise significant economic benefits by enabling businesses and non-profit organisations to build innovative applications and websites using public data.”
(Ding et al., 2012)
29
Roadmap of linked open government data Conceptual Architecture
“the combination of machine power and human power and deliver higher-quality data to a wide range of data consumers via visualization, mashups, and more.”
(Ding et al., 2012)
Data on the Web Summary
Data is information about things
30
Data is something machines can process
Data drives applications (e.g. web sites, mobile services)
Data is relations among things
Open Data starts with making available the data that you already have, in whatever format.
• Equal access for all • Licensing, legal issues • Transparency • Changing the way government works
Open Data vs Linked Data Summary
Open Data
Linked Data • URIs • HTTPs • RDF vocabularies • Standards
31
Difficult
Concluding Remarks Hope is not a strategy and the “change” has been change for the worse, and not better.
What We Will Do Interdisciplinary Collaboration
32
- Charles Baur, Adam Cheyer, Didier Guzzoni, Active, a platform for building intelligent software - Noor Huijboom and Tijs Van den Broek, Open Data: an international comparison of strategies, European journal of ePractices, March/April 2011 - Li Ding, Vassilios Peristeras, and Michael Hausenblas, Linked Open Government Data, IEEE Intelligent Systems, May/June 2012 - Page 1: http://www.w3.org/DesignIssues/diagrams/websci/Marius%20Watz%20-%20Web%20Science%20artwork.png - Page 4: http://www.go-gulf.com/60seconds.jpg - Page 9: http://cloud.frontpagemag.com/wp-content/uploads/2012/03/obama11.jpg - Page 27: http://www.patentlyapple.com/.a/6a0120a5580826970c0168e5ccdd81970c-800wi - Page 29: http://programminggeeks.com/wp-content/uploads/2010/05/Programming-Geeks-Web-Science.jpg - Page 29: http://3.bp.blogspot.com/-C0Kyck90Djo/T4KZTg3k1XI/AAAAAAAAAsE/RUp165S0FCQ/s1600/Commitment.jpeg
Page 2 Case Studies - http://www.guardian.co.uk/commentisfree/2012/aug/03/london-2012-olympics-open-data - http://www.bbc.co.uk/news/uk-19050139 - http://london2012.nytimes.com/results - http://www.guardian.co.uk/sport/interactive/2012/jul/23/could-you-be-a-medallist - http://www.guardian.co.uk/sport/datablog/2012/aug/13/olympics-2012-data-journalism - http://www.guardian.co.uk/sport/datablog/interactive/2012/jul/26/london-2012-price-olympic-games-visualised
References
33
For more information contact Haklae Kim via [email protected] Twitter: haklaekim Or read up on the sonagi blog at: http://blogweb.co.kr http://thedatahub.kr