Linked Data and Semantic Web Technologies for Aid Transparency Aman Shakya PhD (National Institute...

Preview:

Citation preview

Linked Data and Semantic Web Technologies for Aid Transparency

Aman ShakyaPhD (National Institute of Informatics, Japan)

Asst. Professor (Institute of Engineering, Nepal)

IT Consultant

http://nepalaid.yipl.com.np/

2

OutlineIntroduction

◦Semantic Web◦Linked Open Data

Aid Transparency Open Data◦IATI standard XML data◦Applications

IATI as Linked Data and Applications

Tools and Platforms

3

Introduction

WWW - World Wide Web page

?

URL: http://nepalaid.yipl.com.np

WWW –Web of Documents

5

Hyperlinks

Hyperlinks

Hyperlinks

URL-http://www.aidtransparency.net/

URL- http://nepalaid.yipl.com.np

6

Semantic WebSir Tim Berners-Lee (inventor of the

WWW)“.. an extension of the current web in which

information is given well-defined meaning, better enabling computers and people to work in cooperation..”Scientific American

(2001)

• Allows data to be shared and reused across application, enterprise, and community boundaries

• Collaborative effort led by W3C with participation from a large number of researchers and industrial partners.

7

The Semantic Web

1. Web of Structured Data

2. Machine understandable semanticsSemantics = Meaning

3. Data modeling and Knowledge representation

4. Interoperable data standards

7

Web of Data

8

Meaningful Data Nodes & Data Links

GGG (Giant Global Graph)Source: http://www.w3.org/2006/Talks/0123-sb-W3C-

ThingsWeb/

9

URI – Uniform Resource Identifier

URI: http://www.china.org/geography/rivers#Yangtze Yangtz

e

China

River

Beijing

6300 km

Located in

Has capital

Is a

Has length

URI: http://dbpedia.org/resource/China

URI: http://dbpedia.org/resource/Beijing

10

Triples (subject, predicate, object)

or (resource, property, value)

Graph data model

RDF - Resource Description Format

Subject ObjectPredicate

11

Syntax (computer format) for RDFRDF/XMLN-triples, turtle, etc.

Embedded formats◦RDFa, microformats, HTML5

JSON (lightweight structured format)

GRDDL

RDF/XML Example<?xml version="1.0"?><rdf:Description rdf:about="http://www.china.org/geography/rivers#Yangtze" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns="http://www.geodesy.org/river#"> <rdf:type rdf:resource="http://www.geodesy.org/river#River"/> <length>6300 kilometers</length> <startingLocation>western China's Qinghai-Tibet Plateau</startingLocation> <endingLocation>East China Sea</endingLocation></rdf:Description>

13

RDFa Example <HTML> file

Semantic Web (explained)Machine understandable Semantics by

◦Data Modeling & Knowledge Representation

Ontology“.. an explicit specification of a conceptualization”

◦Modeling of Concepts and Relationships that exist in the area of interest

14

Gruber (1993)

Example – Activities Ontology

http://intelleo.eu/ontologies/activities/spec/

16

Travel Ontology

17

Ontology LanguagesRDF Schema

◦A vocabulary description language

OWL - Web Ontology Language◦A richer ontology language◦Based on Description Logic

18

OntologiesDomain ontology

◦Open Biomedical Ontologies (OBO)◦CIDOC CRM - an ontology for cultural

heritage information … etc.

Lightweight ontology◦Vocabularies◦Taxonomy◦Thesaurus, etc.

19

SKOSSimple Knowledge Organization

System◦Used for Taxonomies, Thesaurus

Semantic Web Layers Cake

20

The Semantic Web Cake

21

Linked Data

(Web of Data on the Web)

Linked Data

Publishing and interlinking structured data on the Semantic Web

4 basic principles of Linked Data:1. Use URIs to identify things.

2. Use HTTP URIs so that we can locate and look up (dereference) these things.

3. Provide useful information when its URI is dereferenced.

4. Include links to related URIs to improve information discovery on the Web.

22

Linked Open Data Sources on the Web

2007

Open

2009

25

2010

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

26

Querying Semantic Web DataSPARQL RDF Query Language

SPARQL endpoint – query interface

Example SPARQL query

PREFIX foaf: <http://xmlns.com/foaf/0.1/> SELECT ?name ?email WHERE {

?person a foaf:Person. ?person foaf:name ?name. ?person foaf:mbox ?email.

}

Beautiful Applications over Raw Data

27Source: Berners-Lee, TED 2009

http://www.data.gov

28

29APPS SHOWCASE

Employment Market Explorer

31

Linked Open Data Rating

- Available on the web (whatever format), but with an open licence

- Available as machine-readable structured data

(e.g. excel instead of image scan of a table)

- Non-proprietary format (e.g. CSV instead of excel)

- All the above plus, Use open standards from W3C (RDF and SPARQL) to identify things

- All the above, plus: Link your data to other people’s data

(Berners-Lee, 2010)

★★

★★★

★★★★

★★★★★

32

Aid Transparency Open Data

33

Aid Transparency Open Data

The IATI Standard◦Activity Standard◦Organization Standard◦IATI Codelists

Available as IATI XML schemas

34

IATI Activity StandardBasic activity informationParticipating organizationsGeopolitical informationClassificationsFinancial detailsRelated documents, activitiesPerformance

35

IATI Organization Standard

36

IATI Codelists

37

IATI Data PublishingOrganisations publish IATI standard files

on their own website

Submit a link to the central IATI Registry

Use IATI Registry to find, use and stay updated on data from different publishers

Whenever an organisation updates/adds IATI files, they let the registry know

IATI Registry

39

IATI Data of Nepal

40

Making aid more transparent◦high quality, timely information on

Aid flow

Actively involved in supporting the IATI 

Map of Aid Activities

43

IATI Aid Activity Tables

44

Data ExplorerCSV TransformationsQuery BuilderXPath EndpointXSLT TransformationsLists serviceXQuery service

45

Data Explorer – Table View

Faceted Search / Browsing

Data Explorer – Timeline View

Developed using …SIMILE project (MIT) http://simile.mit.edu/

http://iatidata.heroku.com/

51

IATI Data as Linked Data

52

IATI Open Data Rating

★★★

Machine-readable datain XML format

Towards RDF Linked Data

★★★★

53

IATI Data as Linked DataWhat is happening Already?

◦Code lists as Linked Data- API access to the code-lists- Return RDF linked data, SKOS concepts

◦Sketching Linked Data Vocabularies◦Support for linked data in the IATI

standard

ref: http://www.aidinfolabs.org/archives/384 (Tim Davies)

54

Tim Davies

http://beta.kasabi.com/dataset/iati

Blog: http://www.aidinfolabs.org/archives/582

XSLT- XML Stylesheet Transformation

Source: http://www.w3.org/Consortium/Offices/Presentations/XSLT_XPATH/

56

Further Possibilities

57

Further possibilitiesMix-and-match existing vocabulariesPerhaps

◦Payment Ontology◦Business Ontologies, Semantic XBRL◦Academic Research Project Funding

Ontology (ARPFO)◦DOAP (Description of a Project)◦…….

Generalized ‘Resource Flow’ ontology

Payments Ontology

http://data.gov.uk/resources/payments

59

XML to RDF Linked DataXML Semantics Reuse

Methodology (Roberto García)

◦XML Schema to OWL◦XML to RDF

ReDeFer◦Convert XML to RDF/OWL

60

GRDDLGleaning Resource Descriptions

from Dialects of Languages

Extract RDF data from XML formats via transformations identified by URIs and typically expressed in XSLT

61

Linking IATI to existing Linked Data

Government Data◦data.gov, data.gov.uk.. etc.

Semantic XBRL◦Financial Reports

Research/Project Data (funded by the donor)

Dbpedia (Wikipedia data)Geo Linked Data……… so on

62

Linked data from data.gov.ukhttp://linked4.org/lsd/explore.html

http://wheredoesmymoneygo.org/

Open Knowledge Foundation

68

http://data.gov.uk/apps

69

For the Developers

70

Semantic Web Platforms and ToolsSemantic Web Frameworks

◦ Triple Stores◦ Parsers and APIs◦ SPARQL query processor

Examples◦ Java – Jena, Sesame◦ Jena .NET◦ PHP – ARC, Redland, RAP◦ Ruby - ActiveRDF◦ Virtuoso, Redland, 3store, Talis platform, RDF Pro,

etc.Ontology editors

◦ Protégé, OntoStudio, TopBraid composer, etc.

71

Linked Data Platforms and Tools

Linked Data Publishing Platforms◦D2R server◦Triplify◦Talis platform◦Kasabi◦Pubby◦Paget … etc.

Linked Data/RDF Editors and Validators◦Hyena◦Vapour

72

Semantic Web Platforms and ToolsSemantic Web CMS and frameworks

◦Semantic Drupal◦Paggr◦ConStruct◦Structured Dynamics

PoolParty◦Thesaurus management and SKOS editor

Rhizomer◦publishing, querying, browsing, editing

and interacting with semantic data

73

Social Semantic Web ToolsSemantic Wikis

◦Semantic Media Wiki◦OntoWiki◦KIWI: Knowledge in a Wiki … etc.

Semantic blogging, tagging… etc.

74

Linked Data VisualizationTools

SIMILE project (MIT) http://simile.mit.edu/

◦Exhibit◦Timeline◦Longwell (faceted browser)

Many eyes◦Open platform for visualization

http://www-958.ibm.com/software/data/cognos/manyeyes/

76

ConclusionSemantic Web

◦A global Linked Open Data commonsIATI standard dataIATI as Linked Open Data

◦Ongoing work◦Further possibilities

Applications and Tools

77

Thank you!Semantic Questions?

Contact: amanshakya@yahoo.com

Recommended