A Distributional Approach for Terminological Semantic Search on the Linked Data Web

DESCRIPTION

The process of searching and understanding existing vocabularies (terminological artifacts) on the Linked Data Web is an intrinsic activity to the consumption and production of Linked Data. Data consumers trying to find and understand the vocabularies behind datasets in order to query them, or data producers searching for existing resources to describe their data, face the challenge of semantically searching existing concepts in vocabularies. Traditional search mechanisms do not address the level of semantic matching necessary to match users’ information needs to vocabulary elements, bringing an additional barrier to the consumption and production of Linked Data on the Web. This work describes a terminological search mechanism which uses a distributional semantic model to provide a best-effort semantic matching solution. The distributional semantic model leverages the semantic information present in large volumes of unstructured text to improve the semantic matching capabilities of the search process. A quantitative evaluation of the quality of the search results shows that the approach provides an effective semantic matching mechanism for terminological search.
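The idea of matching a query to vocabulary terms through a concept space built from text can be illustrated with a small sketch. It is not the authors' implementation: the toy corpus `CONCEPTS`, the plain-word labels in `VOCABULARY`, and the raw-count weighting are all assumptions made for illustration (ESA itself is usually described with TF-IDF weights over Wikipedia articles). The sketch only shows how "airplane" can reach a term labelled "aircraft" without any string overlap.

```python
import math
from collections import Counter

# Hypothetical toy corpus: each entry stands in for one concept (e.g. one article).
CONCEPTS = {
    "Aviation": "airplane aircraft pilot flying wing airport",
    "Monarchy": "king queen monarch kingdom throne crown",
    "Law": "justice judge court law trial",
}

# Hypothetical index: vocabulary term -> plain-word label used for matching.
VOCABULARY = {
    "dbpedia-ontology:Aircraft": "aircraft",
    "dbpedia-ontology:Monarch": "monarch",
    "dbpedia-ontology:Judge": "judge",
}


def concept_vector(text):
    """Represent a text as weights over concepts (raw word counts, an assumption)."""
    words = text.lower().split()
    vector = {}
    for name, document in CONCEPTS.items():
        counts = Counter(document.split())
        weight = sum(counts[word] for word in words)
        if weight:
            vector[name] = float(weight)
    return vector


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(value * v.get(key, 0.0) for key, value in u.items())
    norm_u = math.sqrt(sum(value * value for value in u.values()))
    norm_v = math.sqrt(sum(value * value for value in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def search(query, vocabulary):
    """Rank vocabulary terms by similarity to the query in the concept space."""
    query_vector = concept_vector(query)
    scored = [
        (term, cosine(query_vector, concept_vector(label)))
        for term, label in vocabulary.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for term, score in search("airplane", VOCABULARY):
        print(f"{term} (score = {score:.4f})")
```

With a realistic corpus the scores would be graded rather than all-or-nothing, which is what makes a ranked, best-effort result list possible.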

Copyright 2009 Digital Enterprise Research Institute. All rights reserved.

Figure labels: ESA concept basis, term basis, indexed vocabulary term.

Query: [airplane]

dbpedia-ontology:Aircraft (score = 0.1146)

dbpedia-ontology:aircraftAttack (score = 0.1097)

dbpedia-ontology:flyingHours (score = 0.1008)

dbpedia-ontology:aircraftTransport (score = 0.0876)

dbpedia-ontology:aircraftPatrol (score = 0.0738)

dbpedia-ontology:aircraftHelicopterTransport (score = 0.0709)

dbpedia-ontology:aircraftHelicopter (score = 0.0706)

Query: [gun]

dbpedia-ontology:Weapon (score = 0.0910)

dbpedia-ontology:shipBeam (score = 0.0662)

dbpedia-ontology:Ship (score = 0.0562)

dbpedia-ontology:militaryCommand (score = 0.0542)

dbpedia-ontology:Aircraft (score = 0.0532)

dbpedia-ontology:field (score = 0.0523)

dbpedia-ontology:aircraftAttack (score = 0.0509)

dbpedia-ontology:aircraftTransport (score = 0.0470)

Query: [king]

dbpedia-ontology:Monarch (score = 0.216)

dbpedia-ontology:monarch (score = 0.2162)

dbpedia-ontology:kingdom (score = 0.0786)

dbpedia-ontology:title (score = 0.0539)

dbpedia-ontology:headteacher (score = 0.0490)

dbpedia-ontology:actingHeadteacher (score = 0.0465)

dbpedia-ontology:appointer (score = 0.0459)

dbpedia-ontology:executiveHeadteacher (score = 0.04357)

Query: [justice]

dbpedia-ontology:SupremeCourtOfTheUnitedStatesCase (score = 0.329)

dbpedia-ontology:Judge (score = 0.129)

dbpedia-ontology:showJudge (score = 0.1277)

dbpedia-ontology:chiefEditor (score = 0.0934)

dbpedia-ontology:appointer (score = 0.0868)

dbpedia-ontology:Criminal (score = 0.0850)

dbpedia-ontology:department (score = 0.0691)

dbpedia-ontology:retired (score = 0.0578)

Query: [engine]

dbpedia-ontology:engineer (score = 1.0)

dbpedia-ontology:engine (score = 1.0)

dbpedia-ontology:engineType (score = 0.7174)

dbpedia-ontology:gameEngine (score = 0.6452)

dbpedia-ontology:principalEngineer (score = 0.5306)

dbpedia-ontology:cylinderCount (score = 0.1784)

dbpedia-ontology:cylinderBore (score = 0.17584)

dbpedia-ontology:AutomobileEngine/cylinderBore (score = 0.17584)

dbpedia-ontology:pistonStroke (score = 0.12070)

dbpedia-ontology:AutomobileEngine/pistonStroke (score = 0.12070)

dbpedia-ontology:AutomobileEngine (score = 0.10195)

Query: [car engine]

dbpedia-ontology:carNumber (score = 0.38506)

dbpedia-ontology:AutomobileEngine (score = 0.15942)

dbpedia-ontology:layout (score = 0.12297)

dbpedia-ontology:cylinderBore (score = 0.10015)

dbpedia-ontology:AutomobileEngine/cylinderBore (score = 0.10015)

dbpedia-ontology:cylinderCount (score = 0.0965)

dbpedia-ontology:engineer (score = 0.0944)

dbpedia-ontology:engine (score = 0.0944)

dbpedia-ontology:displacement (score = 0.0921)

dbpedia-ontology:AutomobileEngine/displacement (score = 0.0921)

dbpedia-ontology:secondDriver (score = 0.0876)
