27
Interpreting Data Mining Results with Linked Data for Learning Analytics: Motivation, Case Study and Directions Mathieu d’Aquin Knowledge Media Institute, The Open University mdaquin.net - @mdaquin [email protected] Nicolas Jay Université de Lorraine, LORIA, [email protected]

Interpreting Data Mining Results with Linked Data for Learning Analytics

Embed Size (px)

DESCRIPTION

Interpreting Data Mining Results with Linked Data for Learning Analytics: Motivation, Case Study and Directions Presentation at the LAK 2013 conference - 10-04-2013

Citation preview

Page 1: Interpreting Data Mining Results with Linked Data for Learning Analytics

Interpreting Data Mining Results with Linked Data for Learning

Analytics:Motivation, Case Study and

DirectionsMathieu d’Aquin

Knowledge Media Institute, The Open University mdaquin.net - @mdaquin

[email protected]

Nicolas JayUniversité de Lorraine, LORIA,

[email protected]

Page 2: Interpreting Data Mining Results with Linked Data for Learning Analytics

My super naïve view of learning analytics

Data (from some education

related system)

Some kind of data processing Visualisation

Insight!

Tada!

Page 3: Interpreting Data Mining Results with Linked Data for Learning Analytics

But actually…

Data (from some education

related system)

Some kind of data processing Visualisation

Insight!

Tadada!

Interpretation

Page 4: Interpreting Data Mining Results with Linked Data for Learning Analytics

Needs more data/information

Data (from some education

related system)

Some kind of data processing Visualisation

Insight!

Tadada dou!

Interpretation Background knowledge

Page 5: Interpreting Data Mining Results with Linked Data for Learning Analytics

The challenge for learning analytics

Most of the time, background knowledge needs to be in the head of the people looking at the analytics.

How to find/obtain background information for interpretation to support him/her considering that:

– The data we are analysing and insight we are trying to obtain can cover a wide range of things, topics, domains, subjects…

– We might not know in advance we background information is needed for interpretation

Our approach: Integrate linked data sources at the time of interpretation

Page 6: Interpreting Data Mining Results with Linked Data for Learning Analytics

What’s linked data

See the “Using Linked Data in Learning Analytics” tutorial yesterdayhttp://linkedu.eu/event/lak2013-linkeddata-tutorial/

Page 7: Interpreting Data Mining Results with Linked Data for Learning Analytics

Linked Data

Open University Website

Open UniversityVLE

KMi Website

Mathieu’s Homepage

Mathieu’s List of

PublicationsMathieu’s

Twitter

The Web

M366 Coursepage

Person: Mathieu

Publication: Pub1

Organisation:The Open University

Course: M366

Country: Belgium

Book: Mechatronics

author

workFor

availableIn

offers

setBook

The Web of Linked Data

Page 8: Interpreting Data Mining Results with Linked Data for Learning Analytics

Gene Ontology

FMA OntologyLODE

BIBO

Geo Ontology

DBPedia Ontology

Dublin Core

FOAF

DOAP

SIOC

Music Ontology

Media Ontology

rNews

Page 9: Interpreting Data Mining Results with Linked Data for Learning Analytics

Example: data.open.ac.uk

Page 10: Interpreting Data Mining Results with Linked Data for Learning Analytics

Use case: student enrolment data

From the Open University’s Course Profile Facebook Application:

Who enrolled to what course at what time

Student ID Course Code Status Date112 dse212 Studying 2007

112 d315 Intend to study 2008

109 a207 Completed 2005

Examples:

Page 11: Interpreting Data Mining Results with Linked Data for Learning Analytics

Sequence mining

We can represent each student’s trajectory by a sequence of courses, e.g.

(DD100) (D203, S180) (S283)

Applying sequence mining makes it possible to find frequent patterns in these sequences, i.e., courses often taken together in a certain order.

Page 12: Interpreting Data Mining Results with Linked Data for Learning Analytics

The results(and again, why they need background knowledge for interpretation)

Out of 8,806 sequences (students), we obtained 126 different sequential patterns with a support threshold of 100*i.e. filtering out patterns included in less than 100 sequences.

How to know what that means?We need background information about the courses (DD100, DSE212, ED209 ,etc.)

Sequential pattern Support

(DD100) (DSE212) 232

(DSE212) (ED209) (DD303) 150

(B120) (B201) 122

Examples:

Page 13: Interpreting Data Mining Results with Linked Data for Learning Analytics

The approach to interpretation:

Building a navigation structure in the patterns using dimensions obtained in linked data

Page 14: Interpreting Data Mining Results with Linked Data for Learning Analytics

Making the results linked data compliantUse a simple ontology of sequences to represent the patterns

And use linked data URIs to represent the items, e.g. DSE212 http://data.open.ac.uk/course/dse212

Page 15: Interpreting Data Mining Results with Linked Data for Learning Analytics
Page 16: Interpreting Data Mining Results with Linked Data for Learning Analytics

Selecting a dimension in linked dataPropose relations that apply to the items of the patterns

Then relations that apply to the objects of these relations

Etc.

i.e. follow the links to build a chain of relationships.

Page 17: Interpreting Data Mining Results with Linked Data for Learning Analytics

Building a hierarchy of patterns

The end-values of the chain of relations built out of following links of linked data form attributes of the patterns

Build a lattice (hierarchy) of concepts representing groupings of these attributes, using formal concept analysis

Page 18: Interpreting Data Mining Results with Linked Data for Learning Analytics

Exploring the hierarchy

Page 19: Interpreting Data Mining Results with Linked Data for Learning Analytics
Page 20: Interpreting Data Mining Results with Linked Data for Learning Analytics

Benefits (see following examples)

Provides an overview of the patterns obtained along a custom dimension

Helps identifying gaps and issues in the original data/process

Helps identifying areas in need of further exploration

Generic: can be straightforwardly applied to other source data, other linked data and other mining methods

Page 21: Interpreting Data Mining Results with Linked Data for Learning Analytics

Generalisation of the subjects

Page 22: Interpreting Data Mining Results with Linked Data for Learning Analytics

Examples

• Subjects of books

Subjects of related course material

Page 23: Interpreting Data Mining Results with Linked Data for Learning Analytics

Examples

Assessment method

Page 24: Interpreting Data Mining Results with Linked Data for Learning Analytics

DiscussionLimitations of the approach:

– Requires the results to be linked data and the items to connect to linked data

– Sources of linked data needs to be available to support interpretation)

http://data.linkededucation.org/linkedup/catalog

Page 25: Interpreting Data Mining Results with Linked Data for Learning Analytics

Discussion: It’s a loop

Data (from some education

related system)

mining

Interpretation Background knowledge

Views and dimensionsData selection

Page 26: Interpreting Data Mining Results with Linked Data for Learning Analytics

Conclusion

Linked data can be used to enrich and bring some meaningful structure to the patterns from an analytics/mining process

Introducing linked data not only in input of the process, but also in support of more analytical tasks

Promising, considering the growth of education-related linked data

Should become part of an iterative process, where patterns and data get refined through interpretation and the introduction of background information from linked data

Page 27: Interpreting Data Mining Results with Linked Data for Learning Analytics

Thank you!

More info at:http://mdaquin.net @mdaquin

http://linkedup-project.eu http://linkedup-challenge.org

http://linkedu.eu/event/lak2013-linkeddata-tutorial/