DBtrends Exploring Query Logs for Ranking RDF Data · 2019-09-19 · Edgard Marx, Gustavo Publio...

CACAOConditional Spread Activationfor Keyword Query Interpretation

15th Internation Conference on Semantic WebKarlsruhe 2019

Edgard Marx, Gustavo Publio & Thomas Riechert

LiberAI

openQA

*https://github.com/AKSW/openQA

Previous Works

LiberAI

Is the Knowledge

Graph in KBox?

NOResolve

User provides SPARQLquery and the target Knowledge Graph URI

ExecuteSPARQL Query

User/Application

SELECT ?s ?p ?o ...

+http://knowledgegraph.uri

Dereference

*https://github.com/AKSW/KBoxLiberAI

Previous Works

*https://github.com/AKSW/NSpMLiberAI

Previous Works

*https://github.com/AKSW/DBNQA

https://wikipedia.org/wiki/List_of_datasets_for_machine-learning_research

Previous Works

LiberAI

Outline

• Motivation

• Problem Statment

• Background

• Approach

• Evaluation

• Counclusion & Future Works

6LiberAI

Resource Description

Framework (RDF)

Unstructured

Linked Data

Structured

Linked Data

LiberAI

Motivation

• Entity Retrieval• Question Answering• Entity Linking

Linked Data

Framework (RDF)Access (the information needed)

LiberAI

Motivation

Framework (RDF)Entity Retrieval

Formally, a top-k entity retrieval takes a keyword query Q,

an integer 0 < k, a set of entities E={e1, e2, . . . , e|E|},

and returns the top-k entities based on

a scoring function S(Q, e)

LiberAI

Problem statement

Framework (RDF)Entity Retrieval

Select * where {

?s dbo:birthPlace dbpedia:Leipzig

LiberAI

Problem statement

TF-IDF

𝑤𝑡, 𝑑= 𝑡𝑓𝑡, 𝑑 ∙ 𝑙𝑜𝑔

|𝐷|

| 𝑑′ ∈ 𝐷 𝑡 ∈ 𝑑′

LiberAI

Background

"This can opener can open any can that a can opener can and if this can opener cannot open any can that a can opener can, then this can opener is free"

“If this can opener does not open any can, then you can take it for free”

LiberAI

Background

13LiberAI

Background

14LiberAI

Background

Field Weighted

15LiberAI

weight(p1) > weight(p2) > weight(p3) …

Background

Learning to Rank

16LiberAI

Background

17LiberAI

Entity Link in Queries

Entity Linking

Entity Candidate Generation

Ranking

Entities

Query Expansion

Linked Entities

Entity Retrieval

Background

General IR Architecture

Entity Retrieval, Entity Linking, Question Answering

Ranking

Result

Query Expansion

LiberAI

Background

Interaction

Entity Retrieval Entity Linking Question Answering

Ranking

Entities

Query Expansion

Ranking

Linked Entities

Query Expansion

Ranking

Answer

Query Expansion

LiberAI

Background

Lastest Architecture

Entity Retrieval Entity Linking

Ranking

Entities

Query Expansion

Ranking

Linked Entities

Query Expansion

LiberAI

Background

Lastest Architecture

21LiberAI

Background

Current Scenario

Ranking

Entities

Query Expansion

Ranking

Linked Entities

Query Expansion

Ranking

Answer

Query Expansion

LiberAI

Background

Return linked resources

Ranking

Linked Resources

Query Expansion

Ranking

Linked Entities

Query Expansion

Ranking

Answer

Query Expansion

LiberAI

Proposal

Return the answer

Ranking

Linked Resources +

Answer

Query Expansion

Ranking

Linked Entities

Query Expansion

Ranking

Answer

Query Expansion

LiberAI

Proposal

Challenge 1: *Local Term Frequency

25LiberAI

*not to confuse with global

dbr:Albert_Leadbr:Albert_Lea_Public_Library …

“Albert Lea …”

“Albert Lea …” …

Q: “Albert Lea”

Challenge 2: Cohesion

26LiberAI

Q: “Albert Lea”

27LiberAI

Q: “Albert Lea”

Preposition 1 [1][2]: maximize the number of tokens and reduce the number of mapped entities

[1] SWJ, Shekarpour et al. 2015, [2] NAACL, Sakor et al. 2019

28LiberAI

Q: “Albert Lea”

dbr:Albert_Leadbr:Albert_Lea_Public_Library

18“Albert Lea …”

Proposition 1 [1][2]: maximize the number of tokens and reduce the number of mapped entities

Does NOT satisfy: Either A and B comply with the proposition 1

[1] SWJ, Shekarpour et al. 2015, [2] NAACL, Sakor et al. 2019

29LiberAI

Q: “Albert Lea”

Proposition 2 [3][4]: use the context

[3] ISWC, Usbeck et al. 2014, [4] KCap, Moussallem et al. 2017

30LiberAI

Q: “Albert Lea”

Proposition 2 [3][4]: use the context

Does NOT satisfy: Either A and B comply with preposition 2

[3] ISWC, Usbeck et al. 2014, [4] KCap, Moussallem et al. 2017

31LiberAI

Q: “strawberry ice cream”

Proposition 3 [5][6]: Max Similarity (Levenshtein and Jaccard)

dbr:Strawberry_ice_cream

“Strawberry_ice_cream”A

dbr:Banana_split

“strawberry”B

“ice cream”

[5] AAAI, Zhang et al. 2016, [6] VLDB, Zheng et al. 2019

32LiberAI

Q: “strawberry ice cream”

Proposition 3 [5][6]: Max Similarity (Levenshtein and Jaccard)

Does NOT satisfy: Either A and B comply with preposition 3

[5] AAAI, Zhang et al. 2016, [6] VLDB, Zheng et al. 2019

dbr:Strawberry_ice_cream

“Strawberry_ice_cream”A

dbr:Banana_split

“strawberry”B

“ice cream”

Challenge 3: Scoring Function

33LiberAI

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

Where is the answer?

Approach

34LiberAI

Can we do better?

Conditional Spread Activation

35LiberAI

s = 0s > 0

Q: “Albert Lea”Challenge 1: Local Term Frequency

Approach

Field Weight

36LiberAI

s > 0 s > 0

Q: “Albert Lea”Challenge 1: Local Term FrequencyChallenge 2: Cohesion

weight(ptype) > weight(plabel) > weight(pothers)

Approach

Field Weight

37LiberAI

𝑆 𝑄𝑖, 𝐿𝑖 = ∑𝑄𝑖𝐿𝑖 if ∑𝑄𝑖𝐿𝑖 = 1

Approach

Field Weight

38LiberAI

1 > j(𝑄𝑖, 𝐿𝑖)

𝑆(𝑄𝑖, 𝐿𝑖) = ∑𝑄𝑖𝐿𝑖

22 = 4

𝑆 𝑄𝑖, 𝐿𝑖 = ∑𝑄𝑖𝐿𝑖 if ∑𝑄𝑖𝐿𝑖 = 1

Approach

Field Weight

39LiberAI

Challenge 1: Local Term FrequencyChallenge 2: CohesionChallenge 3: Scoring Function

1 2 3 4 5 6

*PThe answer!!

Approach

Pipeline

Ranking

Result

Query Expansion

LiberAI

Evaluation

Benchmark

41LiberAI

Evaluation

DBpedia-Entity

42LiberAI

Evaluation

QALD-4

43LiberAI

Evaluation

Conclusion & Future Works

44LiberAI

• We have shown a promising algorithm that outperform state-of-the-art ER engines on standard benchmarks by addressing:

o Local Term Frequency;o Cohesion, and;o Score Function problems

• It is still too soon to trace conclusions (law of small numbers)achieves that• Improve the algorithm runtime so it can be used at scale

Acknowledgements

LiberAI

@theLiberAI @HTWKLeipzig @edgardmarx @akswgroup

Acknowledgements

@theLiberAI @HTWKLeipzig @edgardmarx @akswgroup

Thank you!!!* 100+ GitHub stars

* 40+ forks

DBtrends Exploring Query Logs for Ranking RDF Data · 2019-09-19 · Edgard Marx, Gustavo Publio...

Documents

RDF Semantic Graph「RDF 超入門」

ISTITUTO COMPRENSIVO “PUBLIO VIBIO MARIANO”

Practical RDF Chapter 10. Querying RDF: RDF as Data

Graphically Querying RDF Using RDF-GL

RDF-Anwendungen: Photo RDF

Estacio, Publio Papinio - La Tebaida

Student Project: Smart Picture Frame · Student Project: Smart Picture Frame Group 3: Gigi Ho < ho@campus.tuberlin.de > LarsErik Riechert < riechert@mailbox.tuberlin.de >

Practical RDF Ch.10 Querying RDF: RDF as Data

MACROECONOMIA-Cuestionario 1- Diana Enriquez- Publio Castro

Rdf And Rdf Schema For Ontology Specification

Practical RDF Chapter 2. RDF: Heart and Soul

RDF in P2P-Netzen Ting Li 09.02.2004. RDF in P2P-Netzen2 Gliederung 1. Einleitung 2. RDF/RDF Schema 3. Edutella/RDF-basierte P2P-Netze 4. Implementierung

RDF as a Universal Healthcare Exchange Languagedbooth.org/2014/rdf-as-univ/rdf-as-univ-slides.pdf · 10 Yosemite Manifesto on RDF as a Universal Healthcare Exchange Language 1. RDF

6. RDF(S) : RDF 와 RDF Schema

Carta de Publio Lentulus

Internationales SEO von André Riechert

Acuñaciones de Publio Carisio

Anne Kjaer Riechert: Hack the world a better place

Une classi cation exp erimentale multi-crit eres des ...RDF essentials RDF is a w3c standard An RDF graph is a set of RDF triples An RDF triple has three components: ... rdf stores

III Certamen de investigación Publio Hurtado