Supporting the use of data: From data repositories to service discovery

Preview:

Citation preview

Supporting the use of data:From data repositories to service discovery

Mathieu d’AquinKnowledge Media Institute, The Open University

mathieu.daquin@open.ac.uk, @mdaquin

Data repositories are becoming more and more important

In education

In general

For Smart Cities

The Key Question is: How do we support the exploitation of

those data

ExamplE: The MK Data Hub

MK Data HubAnalytics

Integration

Curation

Storage

Import

Sensor Data

Local Stats

Gov. Open Data

...

Mobile Apps

Dashboards

Business Intelligence

Social Web Apps

...

Example: The MK Data Hub

MK Data HubAnalytics

Integration

Curation

Storage

Import

Find

Share Develop

GetCreate

Get Data

Create Applications with Data

Find Data

Applications (Examples)

From A Data Platform to an Experiment Platform

From A Data Platform to an Experiment Platform

From A Data Platform to an Experiment Platform

Data Services

Available Data

Ontological representation

of service capability

Ontological representation

of data characteristics

Exploitability

Applicability

Ongoing work on Applicability

… … … …

Ongoing work on Exploitability

Ontological representation

of data characteristics

Exploitability

Characteristics of the data that needs to be taken into account for exploitability:

- Validity, quality, coverage- Nature (time series, statistics, geo, etc)- Scale- Connection, complementarity to others,

uniqueness- Policies and rights

Data Rights and Policies in Data Flows

Daga et al (2015), Propagation of policies in rich data flows

Given a data flow (the application of data services), how do the licence statements propagate across the data flows?

Dataflow representation

Input Data licence

representation

Policy propagation

rules

Output Data licence

representation

Representation of policies

Data Rights and Policies in Data Flows

Daga et al (2015), Propagation of policies in rich data flows

Given a data flow (the application of data services), how do the licence statements propagate across the data flows?

Dataflow representation

Input Data licence

representation

Policy propagation

rules

Output Data licence

representation

Representation of DatAflows

Many options for the representation of workflows (including ProvO) to represent what processes are applied to data.

DataNode ontology to represent what in which way these processes actually affect data: Representation of relationships between the data artefact in input and output of the processes.

Conclusion

Service discovery requires not only an adequate representation of the data services (which is not solved) but also the adequate representation of the available data.

Including right, policy and licence aspects...

Thank You!mathieu@daquin.net

@mdaquin

Alessandro Adamou

Enrico Daga

Carlo Allocca

Shuangyan Liu

Keerthi Thomas

Ilaria Tiddi

Emanuele Bastianelli

Need to manage data in more thoroughly

Recommended