26
The Sketch Engine Semantic Tagging Semantic Tags in Sketch Engine Comparison to FrameNet Conclusions References Semantic Word Sketches Diana McCarthy, Adam Kilgarriff, 3 Miloˇ s Jakub´ ıˇ cek, 3Siva Reddy ? DTAL University of Cambridge , Lexical Computing 3 , University of Edinburgh ? , Masaryk University July 2015 Semantic Word Sketches

Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

The Sketch EngineSemantic Tagging

Semantic Tags in Sketch EngineComparison to FrameNet

ConclusionsReferences

Semantic Word Sketches

Diana McCarthy,† Adam Kilgarriff,3

Milos Jakubıcek,3‡ Siva Reddy?

DTAL University of Cambridge†, Lexical Computing3,University of Edinburgh?, Masaryk University‡

July 2015

Semantic Word Sketches

Page 2: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

Outline

1 The Sketch EngineConcordancesWord Sketches

2 Semantic TaggingSuper Sense Tagger (sst)sst Supersenses

3 Semantic Tags in Sketch EngineIn the ConcordanceSemantic Word SketchesOther Possibilities from sst Output

4 Comparison to FrameNet

5 Conclusions

Page 3: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

The Sketch EngineSemantic Tagging

Semantic Tags in Sketch EngineComparison to FrameNet

ConclusionsReferences

ConcordancesWord Sketches

The Sketch Engine

concordances, word lists, collocations

word sketches

create and examine syntactic profiles and collocations of wordsinput automatic part-of-speech tags and a bespoke ‘sketchgrammar’

automatic thesauruses: which other words have similarprofiles?

sketch differences between words

Semantic Word Sketches

Page 4: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

The Sketch EngineSemantic Tagging

Semantic Tags in Sketch EngineComparison to FrameNet

ConclusionsReferences

ConcordancesWord Sketches

The Sketch Enginefor viewing corpora

Semantic Word Sketches

Page 5: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

The Sketch EngineSemantic Tagging

Semantic Tags in Sketch EngineComparison to FrameNet

ConclusionsReferences

ConcordancesWord Sketches

The Sketch EngineWord Sketches: syntactic profiles

Semantic Word Sketches

Page 6: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

The Sketch EngineSemantic Tagging

Semantic Tags in Sketch EngineComparison to FrameNet

ConclusionsReferences

ConcordancesWord Sketches

Sketch GrammarsUnder the hood

Definitions: define(‘any noun’,‘”N..”’). . .

Relations=subject/subject of

2:any noun rel start? adv aux string incl be 1:verb not pp2:any noun rel start? adv aux string incl be aux have adv string 1:past part

1:past part adv string [word=”by”] long np

Semantic Word Sketches

Page 7: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

The Sketch EngineSemantic Tagging

Semantic Tags in Sketch EngineComparison to FrameNet

ConclusionsReferences

Super Sense Tagger (sst)sst Supersenses

Semantic Class Tagging

aim to build word sketches on syntactic and semanticinformation

automatic ‘superclass’ tagging technology

superclass: a coarse grained semantic class that is applicableto multiple words (e.g. animal for cat, fly, hare, pig etc. . .

allow search and analysis with these classes and

semantic word sketches: basic semantic frame with semanticpreferences for arguments

Semantic Word Sketches

Page 8: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

The Sketch EngineSemantic Tagging

Semantic Tags in Sketch EngineComparison to FrameNet

ConclusionsReferences

Super Sense Tagger (sst)sst Supersenses

Semantic Class Tagging

Super Sense Tagger (sst) Ciaramita and Altun (2006)(http://sourceforge.net/projects/supersensetag/)

semantic tags are WordNet Fellbaum (1998) lexicographerclasses

supervised word sense disambiguation (i.e. it requires handlabelled data for training) using a Hidden Markov Modele.g. labels mouse as animal, artifact)

SemCor (Landes et al., 1998) used as training data

Named Entity Recognitione.g. < RHM Technology Ltd.> organization

Multiword tagging using multiwords from WordNete.g. couch potato

Semantic Word Sketches

Page 9: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

The Sketch EngineSemantic Tagging

Semantic Tags in Sketch EngineComparison to FrameNet

ConclusionsReferences

Super Sense Tagger (sst)sst Supersenses

sst WordNet Noun Classes (25)

act acts or actionsobject natural objects (not man-made)animal animalsquantity quantities and units of measureartifact man-made objectsphenomenon natural phenomenaattribute attributes of people and objects plant plantsfood food and drinks. . . . . .

Semantic Word Sketches

Page 10: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

The Sketch EngineSemantic Tagging

Semantic Tags in Sketch EngineComparison to FrameNet

ConclusionsReferences

Super Sense Tagger (sst)sst Supersenses

sst WordNet Verb Classes (15)

body grooming, dressing and bodily careemotion feelingchange size, temperature change, intensifyingmotion walking, flying, swimmingcognition thinking, judging, analyzing, doubtingperception seeing, hearing, feelingcommunication telling, asking, ordering, singingpossession buying, selling, owningcreation sewing, baking, painting, performing. . . . . .

Semantic Word Sketches

Page 11: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

The Sketch EngineSemantic Tagging

Semantic Tags in Sketch EngineComparison to FrameNet

ConclusionsReferences

In the ConcordanceSemantic Word SketchesOther Possibilities from sst Output

Experiments

just over 25% of the UKWaC Ferraresi et al. (2008)

sst tagged with

part-of-speech tags (Penn TreeBank)supersenses (WordNet labels)Named Entity LabelsWordNet multiwords

Semantic Word Sketches

Page 12: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

Semantic Tags in the Concordance

Page 13: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

Semantic Tags in the Word Sketch (selected)

Page 14: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

Semantic Tags in the Word Sketch (selected)

Page 15: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

The Sketch EngineSemantic Tagging

Semantic Tags in Sketch EngineComparison to FrameNet

ConclusionsReferences

In the ConcordanceSemantic Word SketchesOther Possibilities from sst Output

Semantic Word Sketch GrammarAn example for the intransitive frame

=intransframe*COLLOC “%(2.sense) *%(1.sense)-x”

2:any noun rel start? adv aux string incl be 1:verb not ppnot np start

2:any noun rel start? adv aux string incl be aux haveadv string 1:past part not np start

Semantic Word Sketches

Page 16: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

MWEs: detected by sst

Page 17: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

MWEs: Sketch Diff chip (green) vs chips (red)

Page 18: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

Portion of Sketch Diff laugh (green) vs cry (red)

Page 19: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

Semantic Word Lists: CQL + Word Frequency(Communication Verbs)

Page 20: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

Semantic Word Lists: FindX (communication verbs)

Page 21: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

Comparing to FrameNet (Ruppenhofer et al., 2010)

FrameNet contains lots of useful information e.g. [FRAMEemploying:Frame Elements: Employer Employee Position TasksCompensation . . .Definition: An Employer employs an Employee whose Positionentails that the Employee perform certain Tasks in exchangefor Compensation

lots of other information

lexical units employ.v commision.v staff.n employment.nprecedes frame firingwith corpus examples, I employed him as Chief Gardener forten years

but manually produced so low coverage

Semantic word sketches can provide additional informationand high coverage

Page 22: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

The Sketch EngineSemantic Tagging

Semantic Tags in Sketch EngineComparison to FrameNet

ConclusionsReferences

Summary

semantic tagging alongside part-of-speech for semantic wordsketches

provide syntactic and semantic profiling for

semantic queries and word listssemantic and syntactic profiling in the word sketchcomparing words by the profiles

Semantic Word Sketches

Page 23: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

The Sketch EngineSemantic Tagging

Semantic Tags in Sketch EngineComparison to FrameNet

ConclusionsReferences

Future Possibilities

try other semantic tagsets, taggers and tools

sketch grammar could be developed further

no identification of semantic roles as yet in contrast toFrameNet (Ruppenhofer et al., 2010), Propbank (Palmeret al., 2005) and VerbNet (Kipper-Schuler, 2005)

Semantic word sketches could be used to provide selectionalpreferences and corpus information to such resources

Semantic Word Sketches

Page 24: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

Thank You

Page 25: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

The Sketch EngineSemantic Tagging

Semantic Tags in Sketch EngineComparison to FrameNet

ConclusionsReferences

Ciaramita, M. and Altun, Y. (2006). Broad-coverage sensedisambiguation and information extraction with a supersensesequence tagger. In Proceedings of the 2006 Conference onEmpirical Methods in Natural Language Processing, pages594–602, Sydney, Australia. Association for ComputationalLinguistics.

Fellbaum, C., editor (1998). WordNet, An Electronic LexicalDatabase. The MIT Press, Cambridge, MA.

Ferraresi, A., Zanchetta, E., Baroni, M., and Bernardini, S. (2008).Introducing and evaluating ukwac, a very large web-derivedcorpus of english. In Proceedings of the Sixth InternationalConference on Language Resources and Evaluation (LREC2008), Marrakech, Morocco.

Kipper-Schuler, K. (2005). VerbNet: A broad-coverage,comprehensive verb lexicon. PhD thesis, Computer and

Semantic Word Sketches

Page 26: Semantic Word Sketches...Semantic word sketches could be used to provide selectional preferences and corpus information to such resources Semantic Word Sketches. Thank You. The Sketch

The Sketch EngineSemantic Tagging

Semantic Tags in Sketch EngineComparison to FrameNet

ConclusionsReferences

Information Science Dept., University of Pennsylvania.Philadelphia, PA.

Landes, S., Leacock, C., and Randee, I. T. (1998). Buildingsemantic concordances. In Fellbaum, C., editor, WordNet: anElectronic Lexical Database, pages 199–237. MIT Press.

Palmer, M., Gildea, D., and Kingsbury, P. (2005). The propositionbank: A corpus annotated with semantic roles. ComputationalLinguistics, 31(1):71–106.

Ruppenhofer, J., Ellsworth, M., Petruck, M. R. L., Johnson, C. R.,and Scheffczyk, J. (2010). FrameNet II: Extended theory andpractice. Technical report, International Computer ScienceInstitute, Berkeley. http://framenet.icsi.berkeley.edu/.

Semantic Word Sketches