Accessing Relevant Information Over Language Borders

Embed Size (px)

Citation preview

  • 8/13/2019 Accessing Relevant Information Over Language Borders

    1/17

    Semantic Resourcesand Machine Learning

    for

    Quality, Efficiency andPersonalisation ofAccessing Relevant Information

    over Language Borders

    (different languages anddifferent uses of a same language

    Multilingual!e", Lu#em"ourg, $%thof March, &'$& Results of a theme session on

  • 8/13/2019 Accessing Relevant Information Over Language Borders

    2/17

    Partici)ants

    *imo +onela, Aalto -niversity (ra))orteur

    Peter Schmit., Pu"lications /ffice of the E-

    Elena Montanes, /viedo -niversity

    *asos 0outoumanos, Agro0no1 *ech2, 3reece

    4orinne 5ra))art, Pu"lications /ffice of the E-

    Poul Andersen, !EB translation unit, E- 4ommission

    3hassan +addad, 5ace"oo

    S)yridon Pilos, Language a))lications, Euro)ean 4ommission

    6ose Emilio La"ra 3ayo, -niversity of /viedo, S)ain

    Maria Pia Montoro, Intrasoft International, Lu#em"ourg

    7aaniel 3arcia Magarinos, Euro)ean 4entral Ban

  • 8/13/2019 Accessing Relevant Information Over Language Borders

    3/17

    Quality and consistency versus accessi"ility andconte#tual a))ro)riateness of terminology

    *erms good for e#)erts in different domains versuslay)ersons

    4ase 8mem"er state9 versus 8E- country9

    4ase 8human trafficing9 versus 8modern slavery9 4ase Ban note security features

    A thesaurus 1as created as a ma))ing from technical termsto collo:uial language

    (8iridescent stri)e9 to 8glossy stri)e9 4ase legislation (Asturias region in S)ain ma))ing of

    collo:uial terms to official terms, ne1 )ro;ect li"rary ofcongress in 4hile

  • 8/13/2019 Accessing Relevant Information Over Language Borders

    4/17

    Quality and consistency versus accessi"ility andconte#tual a))ro)riateness of terminology

    4onvergent and divergent )rocesses inlanguage use

    /ntologies carefully crafted resources that re:uire

    considera"le resources for im)lementation and use 5olsonomies resources that )rovide information

    on the variation and are constructed "y the cro1ds

    < Possi"ility to model the cro1dsourced data usingmachine learning techni:ues

  • 8/13/2019 Accessing Relevant Information Over Language Borders

    5/17

    Multilingual contents and thesauritrust and :uality

    -se of E-=generated resources such as

    Eurovoc

    6R4=>ames

    Im)ortance of lined o)en data (L/7 4hoosing ey1ords from a controlled voca"ulary

    4onnecting different term versions 1ith an ontology (orfolsonomy

    7etermining a )ro)er conte#ts using L/7

    Multilingual content )rovenance of data

    Quality assurance of L/7

  • 8/13/2019 Accessing Relevant Information Over Language Borders

    6/17

    Effect of conte#t in translationneed for conte#t=rich re)resentations

    /ften the variation in translation of terminologystems from conte#tual factors

    It 1ould "e im)ortant to store enough

    conte#tual information in order to facilitatea))ro)riate choices

  • 8/13/2019 Accessing Relevant Information Over Language Borders

    7/17

    Social and cognitive levelsof language use

    Push and )ull of terminology

    Regulation and maret economy of language

    7ifferent levels of e#)ertise

    E#)erts in different domains versus lay)ersons

    *ae home messages

    ?ariation among language in

    conce)tual structures(challenges for ontology translation

    Semantic variation among languge users

  • 8/13/2019 Accessing Relevant Information Over Language Borders

    8/17

    Melissa Bowerman

    Max Planck Institute forPsycholinguistics

    Space under Construction

    Language-Specific SpatialCategorization

    In irst Language !c"uisition

    Lund -niversity 4ognitive Science&''@

  • 8/13/2019 Accessing Relevant Information Over Language Borders

    9/17

    DUTCH

    INOP AAN INOP AAN

    Categorization of %opening& in English and Korean

  • 8/13/2019 Accessing Relevant Information Over Language Borders

    10/17

    OPEN

    open#oxopendooropen#agopen

    en$elope

    open

    mouthopen clamshell

    open pair of

    shutters

    openlatched

    drawer open hand

    open #ook

    eyes open

    openfan

    Categorization of opening in English and Korean'

    (tear awayfrom #ase(

    YELTA

    (remo$e #arriertointerior space(

    PPAYTA

    )unfit&

    TTUTA

    )rise&

    PELLITA

    (separate two parts

    symmetrically(take offwallpaper

    unwrappackage

    spreadlegs apart

    take offring

    take cassetteout of case

    sun rises

    spread #lanket outpeacock spreads tail

    (spread out flat thing(

    TTUTA

    PHYELCHITA

  • 8/13/2019 Accessing Relevant Information Over Language Borders

    11/17

    *Pye +,,. +,,/0

    PLATE STICK ROPE CLOTHES

    pu pudun

    *long rigidthing0

    M!12!3I1 pu

    !"upi#$*other hard

    thing0

    ra%h"a!i$ *4tear50

    &'!opi"$*long. flexi#le

    thing0

    pa(i#$*rock. glass.

    clay thing0

    6&IC78&M!9!1

    &ear) rip*rea+81:LIS7 *rea+*rea+

    htt)1112m)i2nl)eo)le"o1erman=melissa

    htt)1112m)i2nl)eo)le"o1erman=melissa)u"lications

  • 8/13/2019 Accessing Relevant Information Over Language Borders

    12/17

    -ser=s)ecific difficulty measure

    Paueri, /lliainen +onela, su"mitted

  • 8/13/2019 Accessing Relevant Information Over Language Borders

    13/17

    3I4A analysis !ord ChealthCin State of the -nion Addresses

    *imo +onela, 6uha Raitio,0rista Lagus, Ilari *2 >ieminen,>ina +onela, and Mia Pant.ar2

    Subjects, objects andcontexts: Using GICA methodto quantify epistemologicalsubjectivity2 In Proceedings ofIJCNN 2012, International JoinConference on Neural Networks,to a))ear2

    3I4A 3roundedIntersu";ectivity4once)t Analysis

  • 8/13/2019 Accessing Relevant Information Over Language Borders

    14/17

    4ore of 3I4ASu";ect=/";ect=4onte#t *ensors

    *imo +onela, >ina 6anasi, 0rista Lagus, *iina Lindh=0nuutila, Mia Pant.ar, and 6uha Raitio23I4A 3rounded intersu";ective conce)t analysis = a method for enhancing mutual understanding and)artici)ation2 *echnical Re)ort *00=I4S=RD$, AAL*/=I4S, ESP//, 7ecem"er &'$'2

    htt)users2ics2t2fitho)u"lications2shtmlhtt)users2ics2t2fithoinfo*00=I4S=RD$2shtml

  • 8/13/2019 Accessing Relevant Information Over Language Borders

    15/17

    3uidelines are needed on ho1 to)u"lish data in multi)le languages

    7ifferent versions in different languages

    Alternative language versions

    A standard 1ay of descri"ing ho1 ho1 differentversions are related to each other

    4ase 5A/ *ranslations should refer "ac to theoriginal documents

  • 8/13/2019 Accessing Relevant Information Over Language Borders

    16/17

    !EB

  • 8/13/2019 Accessing Relevant Information Over Language Borders

    17/17

    Lin)ort has related o";ectives