63
Adaptive Semantic Annotation of Entity and Concept Mentions in Text Pablo N. Mendes PhD dissertation defense Ohio Center of Excellence in Knowledge-enabled Computing (kno.e.sis) Wright State University Dayton, OH

Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Embed Size (px)

DESCRIPTION

PhD defense held at Kno.e.sis Center, Wright State University, December 03, 2013.

Citation preview

Page 1: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Pablo N. MendesPhD dissertation defense

Ohio Center of Excellence in Knowledge-enabled Computing (kno.e.sis)

Wright State UniversityDayton, OH

Page 2: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Introductions and Thank you!

Page 3: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Outline

● Introduction, Motivation, Background– KB Tagging, Annotation as a Service

● Conceptual Model● Knowledge Base: DBpedia● System: DBpedia Spotlight● Core Evaluations● Case Studies

– tweets, audio transcripts, educational material

Page 4: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Outline

● Introduction, Motivation, Background– KBT: Knowledge Base Tagging of Text

– AaaS: Annotation as a Service

– Adaptability

● Conceptual Model● Knowledge Base: DBpedia● System: DBpedia Spotlight● Core Evaluations● Case Studies

Page 5: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

KBT, informally

● Knowledge Base Tagging (KBT)● A developer needs to

– “extract entities”,

– “identify what is mentioned”,

– “connect to knowledge bases”.

● He/she is not an NLP or IE expert● Would like to reuse as much as possible● May have limited computational resources

→ Annotation as a Service (AaaS)5

Page 6: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

arrivals hallsparkspassenger terminal ceilingfire

fire

LOCATION LOCATION

On Thursday, April 11, 1996, a fire in an occupied passenger terminal at the airport in Düsseldorf, Germany, killed 17 people and injured 62. The fire began at approximately 3:31 p.m., about the time someone reported seeing sparks falling from the ceiling in the vicinity of a flower shop at the east end of the arrivals hall on the first floor.

On Thursday, April 11, 1996, a fire in an occupied passenger terminal at the airport in Düsseldorf, Germany, killed 17 people and injured 62. The fire began at approximately 3:31 p.m., about the time someone reported seeing sparks falling from the ceiling in the vicinity of a flower shop at the east end of the arrivals hall on the first floor.

DATE TIMENamed Entity Recognition (NER)

Keyphrase Extraction (KE)

airport Düsseldorf, Germany

On Thursday, April 11, 1996, a fire in an occupied passenger terminal at the airport in Düsseldorf, Germany, killed 17 people and injured 62. The fire began at approximately 3:31 p.m., about the time someone reported seeing sparks falling from the ceiling in the vicinity of a flower shop at the east end of the arrivals hall on the first floor.

Automatic Term Recognition (ATR)

On Thursday, April 11, 1996, a fire in an occupied passenger terminal at the airport in Düsseldorf, Germany, killed 17 people and injured 62. The fire began at approximately 3:31 p.m., about the time someone reported seeing sparks falling from the ceiling in the vicinity of a flower shop at the east end of the arrivals hall on the first floor.

Wikification (WKF)

Entity Linking (EL)Düsseldorf

LOCATION

ID:4213421 6

Page 7: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Related Work

Voquette SCORESemagix Freedom

SemTag

My work

Domain-specific Web contentAuto-extracted facts

Community generated Cross-domain Multilingual

Syntactic

Semantic

7

NER

KEATR Wikification

Illinois Wikifier

TagMe

AIDA / Yago

Page 8: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Related Work (commercial)

Page 9: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Adaptability

News

Scientific literature

Tweets

Audio transcripts

Query keywords

New terms

Named Entities

Important phrases

Concepts related to an objective

● Each developer may have a different application in mind– different input and output– “get key topics for summarization?”– “exhaustive tagging for semantic search?”

● There is no one-size-fits all.● But can we support adaptation to different “fits”?

Page 10: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Requirements

● Transparent process– Clear understanding of where things are working or

failing

● Adaptable process– Ability to exchange individual components in order

to achieve different goals

– Ability to modify the behavior of existing components

● Adaptable to different inputs

10

Page 11: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Outline

● Introduction, Motivation, Background● Conceptual Model● Knowledge Base: DBpedia● System: DBpedia Spotlight● Core Evaluations● Case Studies● Conclusion

Page 12: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

KB

On Thursday, April 11, 1996, a fire in an occupied passenger terminal at the airport in Düsseldorf, Germany, killed 17 people and injured 62. The fire began at approximately 3:31 p.m., about the time someone reported seeing sparks falling from the ceiling in the vicinity of a flower shop at the east end of the arrivals hall on the first floor.

LOCATION LOCATIONDATE

User (Creator)

User (Consumer)

Objective

System

Phrase Recognition

Candidate Selection

Disambiguation

Tagging

A Conceptual Model of KBT

Editor

12Annotations

feedback

Page 13: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Spark_(fire)

(0.87)

KBT and Related Tasks

On Thursday, April 11, 1996, a fire in an occupied passenger terminal at the airport in Düsseldorf, Germany, killed 17 people and injured 62. The fire began at approximately 3:31 p.m., about the time someone reported seeing sparks falling from the ceiling in the vicinity of a flower shop at the east end of the arrivals hall on the first floor.

LOCATION LOCATIONDATE

13

Extraction Task Outcome

Recognize known terms

Recognize new terms

Classify ontological type

Resolve ambiguity

Measure importance/relevance

Tag each occurrence

KE

x

x (to text)

NER

x

x

x

EL

x (NIL)

/

x

x

WSD

x

WKF

x

x

x

ATR

x

x (to domain)

KBT

x

x

x

x

x

x

Page 14: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Novelty in the model

● Users and objective are explicit in the model– Knowledge about content creators provides context

for new types of KBT

– Knowledge about consumer and objective for customizing output

– Using feedback to learn from mistakes

Page 15: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Outline

● Introduction, Motivation, Background● Conceptual Model● Knowledge Base: DBpedia● System: DBpedia Spotlight● Core Evaluations● Case Studies

Page 16: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Wikipedia Extraction

Page 17: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Knowledge Base

● DBpedia is a cross-domain KB extracted from Wikipedia [Auer et al. 2007, Bizer et al. 2009]

– Describes 3.7M things through 400M facts

– Use an ontology of 320 classes and 1,650 properties

● DBpedia Live keeps DBpedia up-to-date with Wikipedia changes

● A whole ecosystem with an active community

[Lehmann et al. 2013]

[Hellmann et al., 2009][Morsey et al., 2012]

17

Page 18: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

DBpedia Extraction Framework

[with Lehmann et al. @ SWJ 2013]

Added new extractors to support KBT:- Thematic Concepts- Topical signatures- Distributional Semantic Model statistics for semantic relatedness

18

Page 19: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Outline

● Introduction, Motivation, Background● Conceptual Model● Knowledge Base: DBpedia● System: DBpedia Spotlight● Core Evaluations● Case Studies● Conclusion

Page 20: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

System: default workflow

● Phrase Recognition: – mention recognition (e.g. NER)

● Candidate Selection: – detecting possible senses for a surface form

● Disambiguation: – choosing (ranking/classifying) one sense for a

mention

● Tagging: – deciding if should annotate: to account for entities

not in the KB, or uninformative annotations.20

Page 21: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

(…) Upon their return, Lennon and McCartney went to New York to announce the formation of Apple Corps.

New York (magazine) New York Manhattan Province of New York New York City New York, New York (film) New York metropolitan areaWest New York, New JerseyRoman Catholic Archdiocese of New YorkPennsylvania Station (New York City)

Phrase Recognition

Candidate Selection

DisambiguationNew York City

ContextualRelatedness

0.100.340.220.230.670.450.560.010.330.07

Tagging21

“New York”type: city, pos: 78relevance: 0.67, ...

Page 22: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

22

A quick example

22

Page 23: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

23

LSU_Tigers

Louisiana State University

Show Top-K Candidates

23

Page 24: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Virtuous Cycle

[with Héder @ WWW'2012]

Through Sztakipedia toolbar,

- DBpedia Spotlight suggests links

- to Wikipedia Editors

- catalyzes evolution of the knowledge source.

/feedback service

- allows users to submit judgements

- enables system evolution with feedback

- also on blogs, etc. with RDFaCE [Khalili]

24

Page 25: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Contextual relatedness score: TF*ICF

25

“Washington”

ICF(“Washington”,”USA”) < ICF(“Washington”,”Seattle”)

Washington, DC W={“capital”,”USA”,...}

George Washington W={“president”,”USA”,...}

Washington State W={“Seattle”,”USA”,...}

[Mendes et al. @ ISEM2011]

TF*IDF (Term Freq. * Inverse Doc. Freq.)

TF: relevance of a word in the context of a DBpedia ResourceIDF: words that are too common are less useful

ICF: Inverse Candidate Frequency

Entropy-inspiredICF is the rarity of a word with relation to the possible senses

Page 26: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Outline

● Introduction, Motivation, Background● Conceptual Model● Knowledge Base● System: DBpedia Spotlight● Core Evaluations● Case Studies● Conclusion

Page 27: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

(…) Upon their return, Lennon and McCartney went to New York to announce the formation of Apple Corps.

New York (magazine) New York Manhattan Province of New York New York City New York, New York (film) New York metropolitan areaWest New York, New JerseyRoman Catholic Archdiocese of New YorkPennsylvania Station (New York City)

PhraseRecognition

Candidate Selection

DisambiguationNew York City

ContextualRelatedness

0.100.340.220.230.670.450.560.010.330.07

Tagging

Core Evaluations

27

“New York”type: city, pos: 78relevance: 0.67, ...

Page 28: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Phrase Recognition Results

At LREC'2012.

(L) Lexicon-based

(LNP*

) Lexicon-based with at least one noun

(NPL) Noun Phrases, lexicon-lookup (bloom filter)

(CW) Lexicon-based removing common words

(Kea) Keyphrases

(NER) Named Entities Only

(NER U NP) N-Grams within Noun Phrases and NEs

PoliciesS = { s | p(s) > cutoff_S }

Different spotting strategies with CSAW datasetTake home- It is not only about importance / relevance- Precision is not important: taken care in steps downstream- Recall is key: a missing phrase at this stage is an overall fail- Simple methods work quite well- Combinations of techniques improve results 28

Page 29: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Context-Independent Strategies

● NAÏVE

– Use surface form to build URI: “berlin” → dbpedia:Berlin

● PROMINENCE

– P(u): n(u) / N (what is the ‘popularity’/importance of this URL)● n(u): number of times URI u occurred● N: total number of occurrences

– Intuition: URIs that have appeared a lot are more likely to appear again

● DEFAULT SENSE

– P(u|s): n(u,s) / n(s)● n(u,s): number of times URI u occurred with surface form s

– Intuition: some surface forms are strongly associated to some specific URIs

29

Page 30: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Disambiguation

● Preliminary results:

● With 155,000 randomly selected wikilink samples● Balance of common and less prominent concepts(default sense: 55.12%)● Highly ambiguous (random: 17.77%)

At I-Semantics 2011.30

Page 31: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Disambiguation+NIL

● Named Entities Only

At TAC KBP 2011.

TACKBP2010DefaultSense: 79.91%Random: 62.00%Unambiguous: 30.36%

NIL accuracy = 79.27 %Non-NIL accuracy = 87.88 %Overall accuracy = 82.71 %

31

Page 32: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Disambiguation Difficulty● Geopolitical entities KB: 830K entities● 311 blog posts, 790 annotations

32

Page 33: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Geolocation Disamb. Eval. results

33

- Validates our measure of “difficulty” (performance degrades)- Shows that our system is more robust for disambiguating low dominance entities

Page 34: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Dominance Analysis

34

Page 35: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Dominance Analysis

35

Page 36: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Tagging

• Decide which spots to annotate with links to the disambiguated resources

• Different use cases have different needs– Only annotate prominent resources?

– Only if you’re sure disambiguation is correct?

– Only people?

– Only things related to Berlin?

36

Page 37: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Tagging in DBpedia Spotlight

• Tagging needs are application/user-specific

• Can be configured based on:– Thresholds

• Confidence

• Prominence (support)

– Whitelist or Blacklist of types• Hide all people, Show only organizations

– Complex definition of a “type” through a SPARQL query.

37

Page 38: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Tagging Evaluation (News)

● Preliminary results

Able to approximate best precision and best recall

Varying parameters allows to cover a wide range of the P/R trade-off

38

Page 39: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Tagging (Take home)

● Combines features from spotting, candidate selection and disambiguation

● More informed to make decisions● Can avoid/fix some mistakes from previous

steps● Offers a chance to adapt to users' needs

39

Page 40: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Outline

● Introduction, Motivation, Background● Conceptual Model● Knowledge Base● System: DBpedia Spotlight● Core Evaluations● Case Studies● Conclusion

Page 41: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Case Study: Audio Tagginghttp://www.bbc.co.uk/programmes

41

Page 42: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Example: Audio Transcript

whirlpool or not the b. b. c. witnessed when the jam and capital but then fell to the advancing bad timing in maine nineteen forty five the civilians living there feared to sing and violence steve athens hughes from one woman custom finds that tying it sit beginning of may nineteen forty five but then is being squeezed between the british americans from the west in the russian army from the east but sides fighting for every inch of land and forgets to this city is being pulverized ...

● BBC Audio Archive tag suggestion

Raimond & Lowis, LDOW2012.

German capitalMay 1945

● Tags: Berlin, World War II, Russian Army, etc.

42

Page 43: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Scenario: Audio Transcript Tagging

Textual Content Creator(System)

Audio Creator

audio

transcript

No punctuation or capitalizationHigh token transcription error rates

Editor

editorial tags

System

Phrase Recognition

Candidate Selection

Disambiguation

Tagging

1. Contextual relatedness

2. Mention detection (dictionary-based)

3. Entity type preference-based reranking

KB

automated tags

AdaptedWorkflow

43

Page 44: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Tagging Audio Transcripts

● Traditional NER features are missing

– Sentence boundaries, POS tags, 50% token error, etc.● Lexicon-based lookup is also difficult

– “big date” → big data● Our approach:

– On-the-fly adaptation

– Skip spotting, focus on named entities

– Preliminary results:

– TopN = 0.19 – 0.21

44

Page 45: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Case Study: Tweet NER

Creator

tweet

KBT System

Phrase Recognition

Candidate Selection

Disambiguation

Tagging

KB

tags as features45

Retrained CRF recognizer

● NER challenges

– informal text, faulty grammar, misspellings, short text, irregular capitalization, etc.

– Segmentation harder than classification

● Our approach:

– distant supervision from DBpedia

– DBpedia Spotlight tagging used as features

Entity mentions

Page 46: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

46

Tweet NER Results

● KBT tags added as features to a Linear chain CRF tagger

● NER improves with distant supervision from KBT

Page 47: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

47

Educational Material

● Emergency Management Training

“tags that summarize what happened”“configuration parameters allowed removing tags that were 'too general'”

Page 48: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

48

Knowledge Base

Some User @someuser 4 NovAt home I have an IPad and my bro has a Microsoft Surface.

Case Study: Smart Filtering

Another User @anotheruser 5 NovThe Asus Transformer Infinity is actually quite nifty.

https://twitter.com/someuser/status/123

https://twitter.com/anotheruser/status/456

category:Wi-Fi

category:Touchscreen

IPad

Microsoft Surface

Asus Transformer Infinity

mentions

mentions

Annotations

SELECT ?tweet ?product ?category IPadmentions belongs belongs

SMART FILTERING

[Mendes et al. WI'2010 and Triplification Challenge 2010]

How to look for competitors?

Microposts mentioning competitors

48

Page 49: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

49

Case Study: Website Tagging

Consumer

KBT System

KB

WebsiteSimilarity

Objective

Evaluation: retrieving similar sites

Page 50: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Outline

● Introduction, Motivation, Background● Conceptual Model● Knowledge Base● System: DBpedia Spotlight● Core Evaluations● Case Studies● Conclusion

Page 51: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Conclusion

● Model enables cross-task evaluations– KE, NER, etc. can be reused for KBT but

individually often do not suffice

● Model enables deeper evaluations (beyond “black box”)– Prescribes modularized evaluation to identify steps

that need improvement

– Introduces and validates a measure of “difficulty to disambiguate”

● System adapts well to very distinct use cases

51

Page 52: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Limitations● What the proposed model is not:

– A silver bullet for all problems

– A substitute for machine learning or expert knowledge or linguistics research

52

Page 53: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Extensions to DBpedia

● We extended DBpedia to enable KBT● Created new extractors for necessary data /

statistics● Multilinguality: community process to maintain

international chapters● Results:

– Data to power the computation of features necessary for adaptive KBT

– Prominence, relevance, pertinence, types, etc.

– All reusable to other systems that use DBpedia53

Page 54: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

54

Demo:

– http://spotlight.dbpedia.org/demo/ Web Service:

- http://spotlight.dbpedia.org/rest/{component}

Components are exposed as services:

– Phrase Recognition (/spot),

– Disambiguation (/disambiguation)

– Top K disambiguations (/candidates)

– Relatedness (/related)

– Annotation (/annotation)

Source code:https://github.com/dbpedia-spotlight/dbpedia-spotlight/

Apache V2 License

DBpedia Spotlight

Page 55: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

My Ph.D. in retrospect

Complex Entity Recognition and Relationship Extraction EKAW'08 WI'08

Knowledge-driven Text Exploration Scooner ACMSE'10 BIBM'10 IESD@HT'13

Evolution WebSci'10 WWW'12a

Cross-domain Entity Recognition and Linking

Knowledge base tagging

ISEM'11 TAC'11

LREC'12a ISEM'13KCAP'11

LREC'12b

MSM'13 CIKM'12

Knowledge-driven Query Formulation CuadroCuebeeTcruziKBICSC'08

Real-time Information Exploration / Filtering

Twarql WI'10 TwitrisSWC'10SFSW@ESWC'10

ISEM'10

SWJ'13Linked Data EvoDyn'12WWW'12b ISWC'12 EDBT'12Sieve

55

Genome databases TcruziDB Garsa ProtozoaDBNAR'08NAR'06 Bioinformatics'05

This dissertation

Page 56: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

More thanks!

… and other mentors and collaborators(too many great people for one slide!)

Page 57: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

References

Page 58: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text
Page 59: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text
Page 60: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text
Page 61: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Other publications● Bioinformatics IE & Querying

– 1 Bioinformatics Journal

– 2 Nucleic Acid Research Journal

– 1 IEEE ICSC, 1 EKAW, 1 Web Intelligence

● Linked Data Quality and Fusion– 1 LWDM 2012 @ EDBT

● Book chapters– Semantic Search on the Web, with Bizer et al.

– The People’s Web Meets NLP, with OKF OWLG

61

Page 62: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Impact of my research● scholar.google.com: 480+ citations, h-index=12

● Best paper award at I-Semantics 2011

– 174 citations (according to scholar.google.com)

– 4+2 students on Google Summer of Code 2012+2013

– About 6 open sourced third-party clients

● Awarded first prize on:

– Triplification Challenge 2010

– Scripting for Semantic Web Challenge 2010

● 37 publications

– 9 conferences, 5 workshops/posters, 3 magazines (bioinfo)

– 2 book chapters

– 3 workshop proceedings

Page 63: Pablo Mendes' Defense: Adaptive Semantic Annotation of Entity and Concept Mentions in Text

Leadership and Community Involvement● Co-organizer of Web of Linked Entities workshop series

– ISWC2012 and WWW2013

● Founder of the DBpedia Portuguese initiative, involving volunteers from 5 Brazilian universities

● Maintainer of 3 open source projects– Cuebee: query formulation for RDF

– Twarql: streaming annotated microposts

– DBpedia Spotlight: adaptive semantic annotation

● PC member in several conferences and workshops: ISWC, ESWC, LREC, LDOW, IJSWIS, LDL'2012, JWS, SWJ, etc.

● EU projects– leading FUB's participation in PlanetData (FP7 Network of Excellence).

– research on LOD2 (FP7 IP). and BIG Public-Private Forum