Text-enhanced Representation Learning for Knowledge Graph


Reporter: Zhigang WANG

Authors: Zhigang WANG, Juanzi LI, Zhiyuan LIU, Jie TANG

Tsinghua University

To appear in IJCAI 2016

2016-04-17


Outline

Introduction

Problem Definition

Our Proposed Approach

Experiments and Analysis

Conclusion


Representation Learning for KG

Input: a knowledge graph $\mathcal{KG} = \{(h, r, t)\}$

Target: learn one embedding (a $k$-dimensional vector) for each entity, $h \to \mathbf{h}$ and $t \to \mathbf{t}$, where $\mathbf{h}, \mathbf{t} \in \mathbb{R}^k$

Example triple (h, r, t): ( Avatar, /film/film/directed_by, James Cameron )


Translation-based Methods

TransE: for each triple (head, relation, tail), treat the relation as a translation from the head to the tail.

Simple, effective, and achieves state-of-the-art performance.

Bordes, et al. (2013). Translating embeddings for modeling multi-relational data. NIPS.

Idea: Avatar + _directed_by ≈ James Cameron, and likewise Titanic + _directed_by ≈ James Cameron (the same relation vector translates different heads to the shared tail).

Issues when modelling 1-to-N, N-to-1, N-to-N relations


Translation-based Methods

TransH and TransR: build relation-specific entity embeddings.

[Figure: TransH (projection onto a relation-specific hyperplane) vs. TransR (projection into a relation-specific space)]

Wang, et al. (2014). Knowledge graph embedding by translating on hyperplanes. AAAI.
Lin, et al. (2015). Learning entity and relation embeddings for knowledge graph completion. AAAI.

Motivation 1: low performance on 1-to-N, N-to-1 and N-to-N relations


Translation-based Methods

Existing methods learn embeddings directly from the graph structure, but the graph is often sparse, especially in domain-specific and non-English settings.

[Figure: a sparse KG neighbourhood around Avatar: _directed_by → James Cameron, with only a few other links (_country → Canada, _distributed_by → 20th Century Fox, _language → English)]

Motivation 2: limited performance caused by the structural sparseness of the KG


Our Idea

Text-enhanced Representation Learning for KG: go back to traditional relation extraction, inspired by distant supervision.

Triple: ( Avatar, /film/film/directed_by, James Cameron )

Text:
• James Francis Cameron, the famous director of the movie Avatar, is an …
• The fiction film Avatar directed by J. Cameron was nominated by …
• In 1994 director James Cameron wrote an 80-page treatment for Avatar …

Context: {film, movie, directed, ...} for Avatar, {director, ...} for James Cameron, {direct} for the entity pair

Contributions:
• [Motivation 1] Enable each relation to have different representations for different head and tail entities.
• [Motivation 2] Incorporate the textual contexts into each entity and relation.


Outline

Introduction

Problem Definition

Our Proposed Approach

Experiments and Analysis

Conclusion


Problem Definition

Input
• Knowledge Graph: $\mathcal{KG} = \{(h, r, t)\}$
• Text Corpus: $\mathcal{D} = w_1 \dots w_i \dots w_m$

Text-enhanced Knowledge Embedding (TEKE)
Learn the entity embeddings $h \to \mathbf{h} \in \mathbb{R}^k$, $t \to \mathbf{t} \in \mathbb{R}^k$ and the relation embedding $r \to \mathbf{r} \in \mathbb{R}^k$ for each triple $(h, r, t)$, utilizing the rich text information in $\mathcal{D}$ to deal with
• low performance on 1-to-N, N-to-1, N-to-N relations
• knowledge graph sparseness


Outline

Introduction

Problem Definition

Our Proposed Approach

Experiments and Analysis

Conclusion


The Proposed Approach

Triple: ( Avatar, /film/film/directed_by, James Cameron )

Entity Annotation (label entity mentions in the text):
• James Francis Cameron, the famous director of the movie Avatar, is an …
• The fiction film Avatar directed by J. Cameron was nominated by …
• In 1994 director James Cameron wrote an 80-page treatment for Avatar …

Textual Context Embedding: {film, movie, directed, ...} for Avatar, {director, ...} for James Cameron, {direct} for the entity pair

Pipeline: Entity Annotation → Textual Context Embedding → Entity/Relation Representation Modelling → Representation Training


The Proposed Approach

Entity Annotation: given the text corpus $\mathcal{D} = w_1 \dots w_i \dots w_m$, use an entity linking tool to automatically label the entities of $\mathcal{KG}$ in the text, yielding an entity-annotated corpus $\mathcal{D}' = X_1 \dots X_i \dots X_{m'}$.

Textual Context Embedding: build a co-occurrence network $\mathcal{G} = (\mathcal{X}, \mathcal{Y})$
• $x_i \in \mathcal{X}$: a node (a word or an entity)
• $y_{ij} \in \mathcal{Y}$: the co-occurrence frequency between $x_i$ and $x_j$
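A minimal sketch of how such a co-occurrence network could be built (the sliding-window counting scheme and the window size are assumptions for illustration, not details taken from the slides):

```python
from collections import defaultdict

def build_cooccurrence_network(annotated_corpus, window=5):
    """Count how often two nodes (words or linked entities) co-occur
    within a sliding window over the entity-annotated corpus.

    annotated_corpus: list of token lists, where a linked entity appears
    as a single token (e.g. "James_Cameron").
    Returns y such that y[(xi, xj)] is the symmetric co-occurrence frequency.
    """
    y = defaultdict(int)
    for sentence in annotated_corpus:
        for i, xi in enumerate(sentence):
            for xj in sentence[i + 1 : i + 1 + window]:
                if xi != xj:
                    y[(xi, xj)] += 1
                    y[(xj, xi)] += 1  # keep the network undirected
    return y
```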


The Proposed Approach

Textual Context Embedding

Pointwise textual context:
$n(x_i) = \{\, x_j \mid y_{ij} > \theta \,\}$
e.g. n(Avatar) = {film, movie, directed, ...}, n(James_Cameron) = {director, ...}

Pairwise textual context:
$n(x_i, x_j) = \{\, x_k \mid x_k \in n(x_i) \cap n(x_j) \,\}$
e.g. n(Avatar, James_Cameron) = {direct, ...}
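A small sketch of these two context sets on top of the co-occurrence counts built above (the frequency threshold θ is a hyperparameter; the default value below is purely illustrative):

```python
def pointwise_context(y, xi, theta=2):
    """n(xi): all nodes whose co-occurrence frequency with xi exceeds theta."""
    return {xj for (a, xj), freq in y.items() if a == xi and freq > theta}

def pairwise_context(y, xi, xj, theta=2):
    """n(xi, xj): nodes shared by the pointwise contexts of xi and xj."""
    return pointwise_context(y, xi, theta) & pointwise_context(y, xj, theta)
```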


The Proposed Approach

Textual Context Embedding

Word embedding learning: $x_i \to \mathbf{x}_i$

Pointwise textual context embedding of $x_i$:
$$\mathbf{n}(x_i) = \frac{1}{\sum_{x_j \in n(x_i)} y_{ij}} \sum_{x_j \in n(x_i)} y_{ij} \cdot \mathbf{x}_j$$

Pairwise textual context embedding of $x_i$ and $x_j$:
$$\mathbf{n}(x_i, x_j) = \frac{1}{Z} \sum_{x_k \in n(x_i, x_j)} \min(y_{ik}, y_{jk}) \cdot \mathbf{x}_k$$
where $Z$ is a normalization factor.
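A sketch of the two weighted averages, reusing the neighbour-set helpers above and assuming pre-trained word/entity vectors (e.g. from word2vec run on the annotated corpus) are available in a dict; treating $Z$ as the sum of the min-frequency weights is an assumption made here so that both cases are plain weighted means:

```python
import numpy as np

def pointwise_embedding(xi, y, word_vec, theta=2):
    """n(xi): frequency-weighted average of the neighbours' vectors."""
    neighbours = list(pointwise_context(y, xi, theta))
    if not neighbours:
        return np.zeros_like(next(iter(word_vec.values())))
    weights = np.array([y[(xi, xj)] for xj in neighbours], dtype=float)
    vectors = np.stack([word_vec[xj] for xj in neighbours])
    return weights @ vectors / weights.sum()

def pairwise_embedding(xi, xj, y, word_vec, theta=2):
    """n(xi, xj): min-frequency-weighted average over the shared context."""
    shared = list(pairwise_context(y, xi, xj, theta))
    if not shared:
        return np.zeros_like(next(iter(word_vec.values())))
    weights = np.array([min(y[(xi, xk)], y[(xj, xk)]) for xk in shared], dtype=float)
    vectors = np.stack([word_vec[xk] for xk in shared])
    return weights @ vectors / weights.sum()
```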


The Proposed Approach

Entity/Relation Representation Modelling: incorporate the textual context information into representation learning on the knowledge graph

$\hat{\mathbf{h}} = \mathbf{n}(h)\mathbf{A} + \mathbf{h}$
$\hat{\mathbf{t}} = \mathbf{n}(t)\mathbf{A} + \mathbf{t}$
$\hat{\mathbf{r}} = \mathbf{n}(h, t)\mathbf{B} + \mathbf{r}$

$f(h, r, t) = \|\hat{\mathbf{h}} + \hat{\mathbf{r}} - \hat{\mathbf{t}}\|_2^2$

Linear transformation of the textual context information:
• given a relation, different textual context embeddings for different (head, tail) entity pairs
• to better handle 1-to-N, N-to-1, and N-to-N relations

Incorporating textual context information into the KG:
• more background information
• to deal with knowledge graph sparseness
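A sketch of this score in code (A and B are the learned linear-transformation matrices; all arguments are numpy arrays, with the context embeddings computed as above; the TEKE_H / TEKE_R variants would add their respective projections on top of the same pattern):

```python
def teke_score(h, r, t, n_h, n_t, n_ht, A, B):
    """f(h, r, t) = || h_hat + r_hat - t_hat ||_2^2 (lower = more plausible triple)."""
    h_hat = n_h @ A + h    # text-enhanced head:      n(h)A + h
    t_hat = n_t @ A + t    # text-enhanced tail:      n(t)A + t
    r_hat = n_ht @ B + r   # pair-specific relation:  n(h, t)B + r
    diff = h_hat + r_hat - t_hat
    return float(diff @ diff)
```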


The Proposed Approach

Representation Training

Margin-based score function

Stochastic gradient descent (SGD)
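A standard margin-based ranking objective over corrupted triples, common throughout the TransE family, is a reasonable reading of this training setup ($\gamma$ is the margin and $\mathcal{KG}^{-}$ a set of negative triples with head or tail replaced; the exact negative-sampling scheme is an assumption here):

```latex
\mathcal{L} \;=\; \sum_{(h,r,t) \in \mathcal{KG}} \;\sum_{(h',r,t') \in \mathcal{KG}^{-}}
\max\bigl(0,\; \gamma + f(h,r,t) - f(h',r,t')\bigr)
```

Minimizing $\mathcal{L}$ with SGD pushes correct triples to score at least $\gamma$ lower than their corrupted counterparts.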


Outline

Introduction

Problem Definition

Our Proposed Approach

Experiments and Analysis

Conclusion


Experiments and Analysis

Datasets: 4 benchmark knowledge graphs

Entity-annotated Wikipedia corpora


Experiments and Analysis

Evaluation example: (China, /location/location/adjoin, North_Korea)

Link Prediction
• Raw setting: Mean Rank 11, Hits@10 0%
• Filtered setting: Mean Rank 9, Hits@10 100%

Tail ranking for head = China, relation = /location/location/adjoin:
1 Japan, 2 Taiwan, 3 Israel, 4 South_Korea, 5 Argentina, 6 France, 7 Philippines, 8 Hungary, 9 Germany, 10 USA, 11 North_Korea

Triple Classification
• a binary classification task (is a given triple correct or not?)
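A sketch of how these metrics could be computed for one test triple under this protocol (rank every candidate tail by the score function; the filtered setting would additionally drop candidates that form other true triples before ranking):

```python
def rank_tail(h, r, true_t, candidate_tails, score_fn):
    """Position of the gold tail when all candidates are sorted by score (ascending)."""
    ranked = sorted(candidate_tails, key=lambda t: score_fn(h, r, t))
    return ranked.index(true_t) + 1

def mean_rank_and_hits(ranks, k=10):
    """Aggregate per-triple ranks into Mean Rank and Hits@k."""
    mean_rank = sum(ranks) / len(ranks)
    hits_at_k = sum(1 for rank in ranks if rank <= k) / len(ranks)
    return mean_rank, hits_at_k
```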


Link Prediction

TEKE compared with baselines (a lower Mean Rank is better; a higher Hits@10 is better)

Mean Rank
• TEKE methods perform much better than the baselines on WN18.
• Not much improvement is observed on FB15K.

Hits@10
• TEKE methods outperform the other baselines significantly and consistently.


Link Prediction

Capability to handle 1-to-N, N-to-1 and N-to-N relations
FB15K relation distribution: 1-to-1 24.2%, 1-to-N 22.9%, N-to-1 28.9%, N-to-N 24.0%

• TEKE methods significantly outperform the baselines when predicting on the side where multiple entities can be correct.
• TEKE methods show little advantage when predicting on the side where only one entity is correct.


Link Prediction

Capability to handle knowledge graph sparseness
• Rank 3,000 entities for 2,238 triples on all three datasets.
• As the graph density gets higher, both TransE and TEKE_E perform better.
• TEKE_E achieves the largest improvement on the sparsest dataset, FB3K.


Triple Classification

TEKE compared with baselines
• TEKE_E and TEKE_H consistently outperform the comparison methods, especially on WN11.
• TEKE_R (unif) on WN11 and TEKE_R (bern) on FB13 perform better than TransR, while the other settings perform slightly worse.


Outline

Introduction

Problem Definition

Our Proposed Approach

Experiments and Analysis

Conclusion


Conclusion and Future Work

A novel text-enhanced knowledge embedding method, TEKE, for knowledge graph representation learning, which deals with
• low performance on 1-to-N, N-to-1 and N-to-N relations
• limited performance caused by the structural sparseness of the KG

Future Work
• Improve performance on 1-to-1 relations
• Experimentally analyze the influence of entity annotation
• Use different text corpora
• Incorporate knowledge reasoning


Thanks!

Zhigang WANG

wangzigo@gmail.com

http://xlore.org/


TransE

Idea: vec('Paris') - vec('France') ≅ vec('Rome') - vec('Italy')

Treat each relation as one unique vector: the same vec('_has_capital') is shared by (France, Paris) and (Italy, Rome), so it is reasonable that

vec('Paris') - vec('France') ≅ vec('Rome') - vec('Italy') ≅ vec('_has_capital')

[Figure: two translation triangles h + r ≈ t, one per country-capital pair, sharing the same _has_capital relation vector]

Assumption: $\mathbf{t} - \mathbf{h} = \mathbf{r}$
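A toy numeric illustration of this assumption (the vectors are made up purely to show the arithmetic, not learned values):

```python
import numpy as np

paris, france = np.array([0.9, 0.2]), np.array([0.1, 0.1])
rome, italy = np.array([0.85, 0.3]), np.array([0.05, 0.2])
has_capital = np.array([0.8, 0.1])

# For true triples, t - h should be close to the relation vector r.
print(np.linalg.norm((paris - france) - has_capital))  # small residual
print(np.linalg.norm((rome - italy) - has_capital))    # small residual
```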
