Joining the docs: Back to the Future with Taxonomy and the New Semantics

Preview:

DESCRIPTION

An introduction to a web system that automatically works out 'you may also be interested in.." style links, and performs semantic clustering on folksonomies.

Citation preview

Joining the Docs: Back to the Future with

‘Taxonomy and the New Semantics’

Russell Blakeborough Director – Brightonart

boblists@brightonart.org

Creative Technology Brightonart The Guardian 09.11.2010

Introduction

• The problem

• Research

• Solution

• Application

• The future

Brightonart The Guardian 09.11.2010

The Problem

• People searching for Music and not finding

Piano teachers

• Freetagging vocabularies

• Diverging folksonomies

Brightonart The Guardian 09.11.2010

Research

• 1. Collaborative Creation of Communal Hierarchical

Taxonomies in Social Tagging Systems

Paul Heymann and Hector Garcia-Molina

Computer Science Department, Stanford University

• 2. The Structure of Collaborative Tagging Systems

Scott A. Golder and Bernardo A. Huberman

Information Dynamics Lab, HP Labs

• 3. Clustering Tags in Enterprise and Web Folksonomies

Edwin Simpson

HP Labs

Brightonart The Guardian 09.11.2010

http://www.hpl.hp.com/techreports/2008/HPL-2008-18.pdf

Solution

• NCO

• Normalised Co-Occurrence

• Inference to semantic distance

• Actually co-interest

• Useful community-specific data

Brightonart The Guardian 09.11.2010

NCO – intersection / union

Brightonart The Guardian 09.11.2010

NCO – intersection / union

Brightonart The Guardian 09.11.2010

A B

NCO – intersection / union

Brightonart The Guardian 09.11.2010

A B

Sample Data

Brightonart The Guardian 09.11.2010

Term A Term B NCO

Driving Tai Chi 0

Tai Chi martial arts 0.04

National Standards Cycling Test cycling 0.23

Application

• School of Everything

– schoolofeverything.com

• Netsounds

– netsoundsproject.eu

Brightonart The Guardian 09.11.2010

School of Everything is a social

learning network that connects

people who can teach with

people who want to learn

Web 2.0 system with a strong, fast growing community.

• Driven by the community itself

• Google food – web presence

• Developing new technologies for web learning

http://schoolofeverything.com Brightonart The Guardian 09.11.2010

Brightonart The Guardian 09.11.2010

Brightonart The Guardian 09.11.2010

Brightonart The Guardian 09.11.2010

Brightonart The Guardian 09.11.2010

Relationships

Brightonart The Guardian 09.11.2010

Relationships

Brightonart The Guardian 09.11.2010

Tai Chi

• Showing members tagged as Tai Chi

• Also showing:

Brightonart The Guardian 09.11.2010

Tai chi lessons

tai chi classes

Tai Chi Teacher

Qigong

chi kung

Taiji

qi gong

kung fu

Tai Chi Chuan

Meditation

Taijiquan

martial arts

Tao Yin

Taoist Yoga

RDF SKOS - scheme

<div xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-

ns#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"

xmlns:skos="http://www.w3.org/2004/02/skos/core#">

<div about="http://schoolofeverything.com/subject/tai-

chi/all/teaching" typeof="skos:Concept">

<span property="skos:prefLabel" content="Tai Chi">

<span rel="skos:inScheme"

resource="http://schoolofeverything.com/subject">

<span property="skos:prefLabel" content="School of

Everything Subjects"></span>

Brightonart The Guardian 09.11.2010

RDF SKOS - related

<span rel="skos:related"

resource="http://schoolofeverything.com/subjec

t/qigong/all/teaching">

<span property="skos:prefLabel"

content="Qigong"></span>

Brightonart The Guardian 09.11.2010

Brightonart The Guardian 09.11.2010

Brightonart The Guardian 09.11.2010

London subjects cluster

Brightonart The Guardian 09.11.2010

Languages in London

Showing members tagged as Languages near

London.

Also showing:

Brightonart The Guardian 09.11.2010

• French

• Italian

• spanish

• russian

• Chinese

• japanese

• English

• german language

Literature & Culture

• portuguese

Clustering

• Navigation

• Zoom in

• Popular subjects bubble up automatically as

the community’s interests change

• Like pulling a hierarchical vocabulary out of a

freetagging vocabulary: magic

Brightonart The Guardian 09.11.2010

School of Everything / Netsounds

• Collaborative project

• Peer to peer learning

• Data feed

• Taxonomy

Brightonart The Guardian 09.11.2010

http://netsoundsproject.eu/community/soe_network Brightonart The Guardian 09.11.2010

Brightonart The Guardian 09.11.2010

Brightonart The Guardian 09.11.2010

Brightonart The Guardian 09.11.2010

Brightonart The Guardian 09.11.2010

Brightonart The Guardian 09.11.2010

Brightonart The Guardian 09.11.2010

Matchy Matchy

• Module: taxonomy_nco

• AKA ‘Matchy Matchy’

Brightonart The Guardian 09.11.2010

http://drupal.org/project/taxonomy_nco Brightonart The Guardian 09.11.2010

The Future

• RDF SKOS

• Interaction with results from Open Calais

• Clustering

• Visualisation

• Linking in to established ontologies

Brightonart The Guardian 09.11.2010

Summary

• Drupal contrib. module: taxonomy_nco

• ‘you may also be interested in’

– related subjects / interests

• Clustering

– the big issues

• Community

Brightonart The Guardian 09.11.2010 http://drupal.org/project/taxonomy_nco

Brightonart The Guardian 09.11.2010

Creative Technology

Recommended