80
Motivation isiZulu intro isiZulu NLG Discussion Conclusions Toward isiZulu Natural Language Generation C. Maria Keet 1 Department of Computer Science University of Cape Town, South Africa [email protected] CS Colloquium @UCT, 12 June 2014 1 Joint work with Dr. Langa Khumalo, Linguistics program and Director of the University Language Planning and Development Office, University of KwaZulu-Natal 1 / 80

Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

  • Upload
    others

  • View
    10

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Toward isiZulu Natural Language Generation

C. Maria Keet1

Department of Computer ScienceUniversity of Cape Town, South Africa

[email protected]

CS Colloquium @UCT, 12 June 2014

1Joint work with Dr. Langa Khumalo, Linguistics program and Director ofthe University Language Planning and Development Office, University ofKwaZulu-Natal

1 / 80

Page 2: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Outline

1 MotivationA few application scenariosNLG and knowledge management

2 isiZulu intro

3 isiZulu NLGPatterns and optionsSurvey resultsAlgorithms for selected constructs

4 Discussion

5 Conclusions

2 / 80

Page 3: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Outline

1 MotivationA few application scenariosNLG and knowledge management

2 isiZulu intro

3 isiZulu NLGPatterns and optionsSurvey resultsAlgorithms for selected constructs

4 Discussion

5 Conclusions

3 / 80

Page 4: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Natural language interfaces with some NLG

Many tools, webpages, etc. with some natural languagecomponent

Querying of information in natural language (cf. a querylanguage SQL, SPARQL)

Business rules typically specified in a natural language

etc.

4 / 80

Page 5: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Example: iCal calendar entry with canned text

5 / 80

Page 6: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Example: Saadiq Moolla’s mobile healthcare app

6 / 80

Page 7: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Example: Query formulation with Quelo[Franconi et al.(2010)]

Pictures from: Quelo @ The IESD Challenge 2012

Demo at: http://krdbapp.inf.unibz.it:8080/quelo/

7 / 80

Page 8: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Example: Business rules and conceptual data models

Course Professor

is taught by / teaches

1..*1..*Course Professorteaches is

taught by

Each Course is taught by at least one ProfessorEach Professor teaches at least one Course

8 / 80

Page 9: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

NLG, principal approaches

Canned text

Templates

Notably for English [Fuchs et al.(2010), Schwitter et al.(2008),Third et al.(2011), Curland and Halpin(2007)],but also other languages [Jarrar et al.(2006)]

Grammar engines, such as [Kuhn(2013)], GrammaticalFramework (http://www.grammaticalframework.org/)

⇒ Controlled Natural Language

9 / 80

Page 10: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

NLG, principal approaches

Canned text

Templates

Notably for English [Fuchs et al.(2010), Schwitter et al.(2008),Third et al.(2011), Curland and Halpin(2007)],but also other languages [Jarrar et al.(2006)]

Grammar engines, such as [Kuhn(2013)], GrammaticalFramework (http://www.grammaticalframework.org/)

⇒ Controlled Natural Language

10 / 80

Page 11: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Business rules/conceptual data models and logicreconstruction

BR: Each Course is taught by at least one Professor

FOL: ∀x (Course(x) → ∃y (is taught by(x , y) ∧ Professor(y)))

DL: Course v ∃ is taught by.Professor

11 / 80

Page 12: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Example of templates

for a large fragment of ORM, and 11 languages [Jarrar et al.(2006)]

12 / 80

Page 13: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Example of templates

for a large fragment of ORM, and 11 languages [Jarrar et al.(2006)]

13 / 80

Page 14: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Example of templates

for a large fragment of ORM, and 11 languages [Jarrar et al.(2006)]

14 / 80

Page 15: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Example of templates

for a large fragment of ORM, and 11 languages [Jarrar et al.(2006)]

15 / 80

Page 16: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

NL Grammars, illustration

Sentence −→ NounPhrase | VerbPhraseNounPhrase −→ Adjective | NounPhraseNounPhrase −→ Noun

. . .

Noun −→ car | trainAdjective −→ big | broken

. . .(and complexity of the grammar)

16 / 80

Page 17: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Question

Can the template-based approach be used also for isiZuluNLG?

If so, create those templatesIf not, start with basics for a grammar engine

Use a practically useful language to benefit both ICT andlinguists and, possibly, some subject domain (e.g., medicine,NRS [Alberts et al.(2012)])

Details in[Keet and Khumalo(2014b), Keet and Khumalo(2014a)]

17 / 80

Page 18: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Question

Can the template-based approach be used also for isiZuluNLG?

If so, create those templatesIf not, start with basics for a grammar engine

Use a practically useful language to benefit both ICT andlinguists and, possibly, some subject domain (e.g., medicine,NRS [Alberts et al.(2012)])

Details in[Keet and Khumalo(2014b), Keet and Khumalo(2014a)]

18 / 80

Page 19: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Question

Can the template-based approach be used also for isiZuluNLG?

If so, create those templatesIf not, start with basics for a grammar engine

Use a practically useful language to benefit both ICT andlinguists and, possibly, some subject domain (e.g., medicine,NRS [Alberts et al.(2012)])

Details in[Keet and Khumalo(2014b), Keet and Khumalo(2014a)]

19 / 80

Page 20: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Outline

1 MotivationA few application scenariosNLG and knowledge management

2 isiZulu intro

3 isiZulu NLGPatterns and optionsSurvey resultsAlgorithms for selected constructs

4 Discussion

5 Conclusions

20 / 80

Page 21: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

A few features of isiZulu

Most populous language in SA, first (home) language of±23% (≥ 10 million)

Member of the Bantu language group, spoken by some 300million people

Bantu languages have characteristically agglutinatingmorphology

System of noun classes, controls the concordance of all wordsin a sentence

Abafana abancane bazozithenga izincwadi ezinkuluaba-fana aba-ncane ba- zo- zi- thenga izi-ncwadi e-zi-nkulu2.boy 2.small 2.SUBJ-FUT-10.OBJ-buy 10.book REL-10.big

‘The little boys will buy the big books’

21 / 80

Page 22: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

NC AU PRE Stem (ex-ample)

Meaning Example

1 u- m(u)- -fana humans and other umfana boy2 a- ba- -fana animates abafana boys1a u- - -baba kinship terms and proper ubaba father2a o- - -baba names obaba fathers3a u- - -shizi nonhuman ushizi cheese(2a) o- - -shizi oshizi cheeses3 u- m(u)- -fula trees, plants, non-paired umfula river4 i- mi- -fula body parts imifula rivers5 i- (li)- -gama fruits, paired body parts, igama name6 a- ma- -gama and natural phenomena amagama names7 i- si- -hlalo inanimates and manner/ isihlalo chair8 i- zi- -hlalo style izihlalo chairs9a i- - -rabha nonhuman irabha rubber(6) a- ma- -rabha amarabha rubbers9 i(n)- - -ja animals inja dog10 i- zi(n)- -ja izinja dogs11 u- (lu)- -thi inanimates and long thin uthi stick(10) i- zi(n)- -thi objects izinthi sticks14 u- bu- -hle abstract nouns ubuhle beauty15 u- ku- -cula infinitives ukucula to sing17 ku- locatives, remote/ general locative

22 / 80

Page 23: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Outline

1 MotivationA few application scenariosNLG and knowledge management

2 isiZulu intro

3 isiZulu NLGPatterns and optionsSurvey resultsAlgorithms for selected constructs

4 Discussion

5 Conclusions

23 / 80

Page 24: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Logic foundation for isiZulu NLG

Roughly OWL 2 EL

OWL 2 EL is a W3C-standardised profile of OWL 2

Tools, ontologies in OWL 2 (notably SNOMED CT)

On the ‘roughly’: minus transitivity, but with negation,amounting to ALC

of that, we have patterns for universal and existentialquantification, subsumption, negation (disjointness), andconjunctionunion not yet covered explicitly, but note C tD ≡ ¬(¬C u¬D)more detail on the languages: see the Description LogicsHandbook [Baader et al.(2008)] and OWL 2 Standard

24 / 80

Page 25: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Logic foundation for isiZulu NLG

Roughly OWL 2 EL

OWL 2 EL is a W3C-standardised profile of OWL 2

Tools, ontologies in OWL 2 (notably SNOMED CT)

On the ‘roughly’: minus transitivity, but with negation,amounting to ALC

of that, we have patterns for universal and existentialquantification, subsumption, negation (disjointness), andconjunctionunion not yet covered explicitly, but note C tD ≡ ¬(¬C u¬D)more detail on the languages: see the Description LogicsHandbook [Baader et al.(2008)] and OWL 2 Standard

25 / 80

Page 26: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

ALC syntax

Concepts denoting entity types/classes/unarypredicates/universals, including top > and bottom ⊥;

Roles denoting relationships/associations/n-arypredicates/properties;

Constructors: and u, or t, and not ¬; quantifications forall ∀and exists ∃Complex concepts using constructors: Let C and D beconcept names, R a role name, then

¬C , C u D, and C t D are concepts, and∀R.C and ∃R.C are concepts

Individuals

26 / 80

Page 27: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

ALC semantics

domain of interpretation, and an interpretation, where:

Domain ∆ is a non-empty set of objectsInterpretation: ·I is the interpretation function, domain ∆I

·I maps every concept name A to a subset AI ⊆ ∆I

·I maps every role name R to a subset RI ⊆ ∆I ×∆I

·I maps every individual name a to elements of ∆I : aI ∈ ∆I

Note: >I = ∆I and ⊥I = ∅(¬C )I = ∆I\CI

(C u D)I = CI ∩ DI

(C t D)I = CI ∪ DI

(∀R.C )I = {x | ∀y .RI(x , y)→ CI(y)}(∃R.C )I = {x | ∃y .RI(x , y) ∧ CI(y)}

27 / 80

Page 28: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

A few constructors, their typical verbalization in English,and the basic options in isiZulu

DL sym-bol

Sample verbalizationEnglish

Sample verbalization in isiZulu(see text for additional rules)

v ... is a ... Depends on what is on the rhs of v and desideratum:A) semantic distinction

i) yi/ongu/uyi/ngu (living thing)ii) iyi (non-living thing)

B) syntactic distinctioniii) ng (nouns commencing with a, o, or u)iv) y (nouns commencing with i)

u ... and ... Depends on the use of the u:i) ... na/ne/no ... (list of things)ii) 1) ... futhi ... (connective)

2) ... kanye ... (connective)¬ not ... angi/akusiso/akusona/akubona/akulona/asibona/ akalona/akuyona∃ 1) some ...

2) there exists ...3) at least one ...

Depends on position in axiom:I. quantified over class, depends on meaning of class:

i) kuno (living thing)ii) kune (non-living thing)

II. includes relation (preposition issue omitted):1) ... [concords]dwa2) ... noma [copulative + concord]phi ...3) thize

∀ 1) for all ...2) each ...

Depends on what it is quantified over:A) semantic distinction

i) wonke/bonke/sonke/zonke (living thing)ii) onke/konke/lonke/yonke (non-living thing)

B) another semantic distinctioni) use noun class (see Table 8)

28 / 80

Page 29: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Universal Quantification

Consider here only the universal quantification at the start ofthe concept inclusion axiom (nominal head)

‘all’/‘each’ uses -onke, prefixed with the oral prefix of the nounclass of that first noun (OWL class/DL concept) on lhs of v

(U1) Boy v ...

wonke umfana ... (‘each boy...’; u- + -onke)

bonke abafana ... (‘all boys...’; ba- + -onke)

(U2) Phone v ...

lonke ifoni ... (‘each phone...’; li- + -onke)

onke amafoni ... (‘all phones...’; a- + -onke)

29 / 80

Page 30: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

30 / 80

Page 31: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

31 / 80

Page 32: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

32 / 80

Page 33: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Subsumption

Two different ways of carving up the nouns to determinewhich rules apply: semantic and syntactic

Need to choose between

singular and pluralwith or without the universal quantification voicedgeneric or determinate

(S1) MedicinalHerb v Plant

ikhambi ngumuthi (‘medicinal herb is a plant’)

amakhambi yimithi (‘medicinal herbs are plants’)

wonke amakhambi ngumuthi (‘all medicinal herbs are a plant’)

(S2) Giraffes v Animals

izindlulamithi yizilwane (‘giraffes are animals’; generic)

(S3) Cellphone v Phone

Umakhalekhukhwini uyifoni (‘cellphone is a phone’; determ.)

33 / 80

Page 34: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Possible subsumption patterns

a. N1 <copulative ng/y depending on first letter of N2>N2.

b. <plural of N1> <copulative ng/y depending on first letter ofplural of N2><plural of N2>.

c. <All-concord for NCx>onke <plural of N1, being of NCx><copulative ng/y depending on first letter of N2>N2.

34 / 80

Page 35: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Subsumption: adding negation

Need to choose between

singular and plural, and with or without the universalquantification voiced

Copulative is omitted

Combines the negative subject concord (NEG SC) of the nounclass of the first noun (aku-) with the pronomial (PRON) ofthe noun class of second noun (-yona)

(SN1) Cup v ¬Glassindebe akuyona ingilazi (‘cup not a glass’)

zonke izindebe aziyona ingilazi (‘all cups not a glass’)

35 / 80

Page 36: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

36 / 80

Page 37: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

37 / 80

Page 38: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

38 / 80

Page 39: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Possible negation (disjointness) patterns

a. <N1 of NCx> <NEG SC of NCx><PRON of NCy> <N2 ofNCy>.

b. <All-concord for NCx>onke <plural N1, being of NCx><NEG SC of NCx><PRON of NCy> <N2 with NCy>.

39 / 80

Page 40: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Conjunction

Conjunction as enumeration uses na

Changes into (a + i =) ne or (a + u =) no, depending on thefirst letter of the second noun

Prefixed to the second noun that drops its first letter

Conjunction as connective of clauses: kanye or futhi

(C1) Milk u Butter

Ubisi nebhotela (Ubisi + na + Ibhotela)

(C2) Butter u Milk

Ibhotela nobisi (Ibhotela + na + Ubisi)

(C3) . . .∃has filling.Cream u ∃has Icing.Lemon flavour . . .

...kune zigcwalisa ukhilimu kanye nezinye uqweqweolunambitheka ulamula...

...kune zigcwalisa ukhilimu futhi nezinye uqweqweolunambitheka ulamula...

40 / 80

Page 41: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Existential Quantification

Different context: Option I in Table 1 for type (E0) Option IIto axioms of type (E1)

(E0) Ezulwini kune zingilosi (‘in heaven there exist angels’)

(E1) Giraffe v ∃eats.Twig

yonke indlulamithi idla ihlamvana elilodwa (‘each giraffe eats at least one twig’)

zonke izindlulamithi zidla ihlamvana elilodwa (‘all giraffes eat at least one twig’)

yonke indlulamithi idla noma yiliphi ihlamvana (‘each giraffe eats some twig’)

zonke izindlulamithi zidla noma yiliphi ihlamvana (‘all giraffes eat some twig’)

yonke indlulamithi idla ihlamvanathize (‘each giraffe eats some twig’)

41 / 80

Page 42: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Beakdown–examples

noun NC RC QC QSuffix copulative EP ESuffixihlamvana (‘twig’) class 5 eli- -lo- -dwaisifundo (‘module’) class 7 esi- -so- -dwaushizi (‘cheese’) class 3a o- -ye- -dwaihlamvana (‘twig’) class 5 yi- -li- -phiisifundo (‘module’) class 7 yi- -si- -phiushizi (‘cheese’) class 3a ngu- -mu- -phi

42 / 80

Page 43: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

43 / 80

Page 44: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

44 / 80

Page 45: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

45 / 80

Page 46: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

46 / 80

Page 47: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Possible patterns for existential quantification

a. <All-concord for NCx>onke <pl. N1, is in NCx><conjugated verb> <N2 of NCy> <RC for NCy><QC forNCy>dwa.

b. <All-concord for NCx>onke <pl. N1, is in NCx><conjugated verb> noma <copulative ng/y adjusted to firstletter of N2><EP of NCy>phi <N2>.

c. <All-concord for NCx>onke <N1 in NCx> <conjugatedverb> <N2>thize;

47 / 80

Page 48: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Which options to choose?

Survey, asking linguists and non-linguists for their preferences

10 questions pitting the patterns against each other

Online, with isiZulu-localised version of Limesurvey (createdas part of COMMUTERM project)

i.e., all text, buttons, autotext and error messages in isiZulu

Analyse results in MS Excel

48 / 80

Page 49: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Question 1 (screenshot)

http://limesurvey.cs.ukzn.ac.za/index.php?sid=25965&lang=zu

49 / 80

Page 50: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Results

25 invited: students, academics (linguists), and non-linguists(such as administrators)

12 respondents: 5 linguists, 7 non-linguists (survey is stillopen)

more agreement among linguists

some differences possibly due to dialect

preference for singular in subsumption

other times plural

other times also with universal quantification in theverbalization

clear preference for the -dwa option

50 / 80

Page 51: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Results

25 invited: students, academics (linguists), and non-linguists(such as administrators)

12 respondents: 5 linguists, 7 non-linguists (survey is stillopen)

more agreement among linguists

some differences possibly due to dialect

preference for singular in subsumption

other times plural

other times also with universal quantification in theverbalization

clear preference for the -dwa option

51 / 80

Page 52: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Results

Question Respondent Question RespondentLing. Non-

Ling.Total Ling. Non-

Ling.Total

1. isa

sing. 80 0 33

6. exists

sing.+noma-phi 0 29 17pl. 0 43 25 pl.+noma-phi 0 0 0all+pl. 0 0 0 either 20 0 8either 20 57 42 neither 80 71 75neither 0 0 0

2. isa

sing. 80 86 83

7. exists

sing.+-dwa 20 14 17pl. 0 0 0 pl.+-dwa 20 57 42all+pl. 0 0 0 either 40 0 17either 0 14 8 neither 20 29 25neither 20 0 8

3. disj.

sing. 40 29 33

8. exists

sing.+-dwa 0 14 8all+pl. 0 14 8 sing.+noma-phi 20 0 8either 40 14 25 pl.+noma-phi 80 57 67neither 20 43 33 either 0 0 0

neither 0 29 17

4. disj.

sing. 40 71 58

9. exists

pl.+noma-phi 40 14 25pl. 0 0 0 pl.+-thize 0 29 17either 20 0 8 either 40 43 42neither 40 29 33 neither 20 14 16

5. exists

pl.+-dwa 100 57 75

10. and

kanye 0 0 0pl.+noma-phi 0 14 8 futhi 0 14 8either 0 0 0 either 20 0 8neither 0 29 17 neither 80 86 83

52 / 80

Page 53: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

53 / 80

Page 54: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

retrieve class and get its noun class

54 / 80

Page 55: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

'simple' ISA

55 / 80

Page 56: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

negation (disjointness)

56 / 80

Page 57: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

57 / 80

Page 58: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

enum-and or conn-and?

58 / 80

Page 59: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

connective-and

59 / 80

Page 60: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

enumerative-and

60 / 80

Page 61: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

61 / 80

Page 62: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

to be done...

62 / 80

Page 63: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Example

∀x (Professor(x) → ∃y (teaches(x , y) ∧ Course(y)))

Professor v ∃ teaches.Course

Each Professor teaches at least one Course

∀x (uSolwazi(x) → ∃y (ufundisa(x , y) ∧ Isifundo(y)))

uSolwazi v ∃ ufundisa.Isifundo

?

63 / 80

Page 64: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Example

∀x (Professor(x) → ∃y (teaches(x , y) ∧ Course(y)))

Professor v ∃ teaches.Course

Each Professor teaches at least one Course

∀x (uSolwazi(x) → ∃y (ufundisa(x , y) ∧ Isifundo(y)))

uSolwazi v ∃ ufundisa.Isifundo

?

64 / 80

Page 65: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

65 / 80

Page 66: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

text

look-up NCpluralise

for-all

Bonke oSolwazi 66 / 80

Page 67: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

text

AlgoConjugate

... for relevant NC. Here:ngi-u-u-si-ni-ba-

Bonke oSolwazi bafundisa 67 / 80

Page 68: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Bonke oSolwazi bafundisa Isifundo68 / 80

Page 69: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

text

Bonke oSolwazi bafundisa Isifundo esisodwa

look-up NC

get RC

get QC

add -dwa

69 / 80

Page 70: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Outline

1 MotivationA few application scenariosNLG and knowledge management

2 isiZulu intro

3 isiZulu NLGPatterns and optionsSurvey resultsAlgorithms for selected constructs

4 Discussion

5 Conclusions

70 / 80

Page 71: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Discussion

Template-based approach is not applicable to isiZulu (and,more generally: Bantu languages that have noun classes)

Or: grammar engine needed

Devising the patterns hampered by outdated literature

Several preferences for patterns

Algorithms nontrivial; covering:

‘simple’ existential and universal quantificationtaxonomic subsumptionnegation (class disjointness)conjunction

71 / 80

Page 72: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Discussion

Template-based approach is not applicable to isiZulu (and,more generally: Bantu languages that have noun classes)

Or: grammar engine needed

Devising the patterns hampered by outdated literature

Several preferences for patterns

Algorithms nontrivial; covering:

‘simple’ existential and universal quantificationtaxonomic subsumptionnegation (class disjointness)conjunction

72 / 80

Page 73: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Some other potential use: machine translation

Google Translate English-isiZulu translates, e.g., “mix thesugar and milk and butter” as “hlanganisa ushukela nobisiibhotela” (translation d.d. 14-1-2014)

Misses the second conjunction in the enumerationushukela u ubisi u ibhotela with Algorithm for conjunctionobtains correct verbalisation/translation: ushukela nobisinebhotela

Google’s “all giraffes eat twigs” is translated as “yonkeizindlulamithi udle amahlumela” (translation d.d. 14-1-2014)

But izindlulamithi is in noun class 10, not 9, so it goes withzonkeThis can be correctly verbalised following Algorithmsubsumption verbalization (line 9).

73 / 80

Page 74: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Outline

1 MotivationA few application scenariosNLG and knowledge management

2 isiZulu intro

3 isiZulu NLGPatterns and optionsSurvey resultsAlgorithms for selected constructs

4 Discussion

5 Conclusions

74 / 80

Page 75: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Conclusions

Novel verbalization patterns and algorithms for simplesubsumption, disjoint classes, conjunction, and basic optionswith quantification

Verbalizing formally represented knowledge in isiZulu requiresa grammar engine even for the relatively basic languageconstructs

Due to, principally: i) the system of noun classes, ii) thesystem of complex agreement, iii) phonological conditionedcopulatives, and iv) verb conjugation

The survey on verbalization pattern preference showed a clearpreference for the -dwa option, and more variation inpreference by the non-linguists

75 / 80

Page 76: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Future work

To be done for ‘full’ OWL 2 EL and ALC, mainly:

TransitivityMore elaborate axioms, such as ∀R.C v ∃S .(D u E )Negation in other casesUnion

Conjugation of verbs present and past tense, and theprepositions (taught by, works for)

Preference of patterns vs understandability

Living vs. non-living thing distinction

Interaction with multilingual ontologies (e.g., extendingLemon [McCrae et al.(2012)])

Implement it

76 / 80

Page 77: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

References I

Ronell Alberts, Thomas Fogwill, and C. Maria Keet.

Several required OWL features for indigenous knowledge management systems.In P. Klinov and M. Horridge, editors, 7th Workshop on OWL: Experiences and Directions (OWLED 2012),volume 849 of CEUR-WS, page 12p, 2012.27-28 May, Heraklion, Crete, Greece.

F. Baader, D. Calvanese, D. L. McGuinness, D. Nardi, and P. F. Patel-Schneider, editors.

The Description Logics Handbook – Theory and Applications.Cambridge University Press, 2 edition, 2008.

M. Curland and T. Halpin.

Model driven development with NORMA.In Proceedings of the 40th International Conference on System Sciences (HICSS-40), pages 286a–286a.IEEE Computer Society, 2007.Los Alamitos, Hawaii.

Enrico Franconi, Paolo Guagliardo, and Marco Trevisan.

An intelligent query interface based on ontology navigation.In Workshop on Visual Interfaces to the Social and Semantic Web (VISSW’10), 2010.Hong Kong, February 2010.

Norbert E. Fuchs, Kaarel Kaljurand, and Tobias Kuhn.

Discourse Representation Structures for ACE 6.6.Technical Report ifi-2010.0010, Dept of Informatics, University of Zurich, Switzerland, 2010.

Mustafa Jarrar, C. Maria Keet, and Paolo Dongilli.

Multilingual verbalization of ORM conceptual models and axiomatized ontologies.Starlab technical report, Vrije Universiteit Brussel, Belgium, February 2006.

77 / 80

Page 78: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

References II

C.M. Keet and L. Khumalo.

Toward verbalizing logical theories in isiZulu.In Proceedings of the 4th Workshop on Controlled Natural Language (CNL’14), LNAI, page (accepted).Springer, 2014a.20-22 August 2014, Galway, Ireland.

C.M. Keet and L. Khumalo.

Basics for a grammar engine to verbalize logical theories in isiZulu.In Proceedings of the 8th International Web Rule Symposium (RuleML’14), LNCS, page (accepted).Springer, 2014b.August 18-20, 2014, Prague, Czech Republic.

Tobias Kuhn.

A principled approach to grammars for controlled natural languages and predictive editors.Journal of Logic, Language and Information, 22(1):33–70, 2013.

John McCrae, Guadalupe Aguado de Cea, Paul Buitelaar, Philipp Cimiano, Thierry Declerck, Asuncion

Gomez-Perez, Jorge Gracia, Laura Hollink, Elena Montiel-Ponsoda, Dennis Spohr, and Tobias Wunner.The lemon cookbook.Technical report, Monnet Project, June 2012.www.lemon-model.net.

R. Schwitter, K. Kaljurand, A. Cregan, C. Dolbear, and G. Hart.

A comparison of three controlled natural languages for OWL 1.1.In Proceedings of OWL: Experiences and Directions (OWLED’08 DC), 2008.Washington, DC, USA metropolitan area, on 1-2 April 2008.

78 / 80

Page 79: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

References III

Allan Third, Sandra Williams, and Richard Power.

OWL to English: a tool for generating organised easily-navigated hypertexts from ontologies.poster/demo paper, 2011.10th International Semantic Web Conference (ISWC’11), 23-27 Oct 2011, Bonn, Germany.

79 / 80

Page 80: Toward isiZulu Natural Language Generation › files › zuluNLGtalkUCT.pdf · 2015-10-15 · MotivationisiZulu intro isiZulu NLGDiscussionConclusions Toward isiZulu Natural Language

Motivation isiZulu intro isiZulu NLG Discussion Conclusions

Thank you!

80 / 80