33
Emergence of semantic Emergence of semantic and syntactic complexity and syntactic complexity in language in language Wolfgang Wildgen Wolfgang Wildgen LANGUAGE CULTURE AND MIND CONFERENCE (LCM 2) LANGUAGE CULTURE AND MIND CONFERENCE (LCM 2) INTEGRATING PERSPECTIVES AND METHODOLOGIES INTEGRATING PERSPECTIVES AND METHODOLOGIES IN THE STUDY OF LANGUAGE , PARIS 17-20 JULY IN THE STUDY OF LANGUAGE , PARIS 17-20 JULY 2006 2006

Emergence of semantic and syntactic complexity in language Wolfgang Wildgen LANGUAGE CULTURE AND MIND CONFERENCE (LCM 2) INTEGRATING PERSPECTIVES AND METHODOLOGIES

  • View
    224

  • Download
    0

Embed Size (px)

Citation preview

Emergence of semantic and Emergence of semantic and syntactic complexity in syntactic complexity in

languagelanguage

Wolfgang WildgenWolfgang Wildgen

LANGUAGE CULTURE AND MIND CONFERENCE (LCM LANGUAGE CULTURE AND MIND CONFERENCE (LCM 2)2)

INTEGRATING PERSPECTIVES AND METHODOLOGIES INTEGRATING PERSPECTIVES AND METHODOLOGIES IN THE STUDY OF LANGUAGE , PARIS 17-20 JULY IN THE STUDY OF LANGUAGE , PARIS 17-20 JULY

20062006

LCM 2 Paris 2006 2

Structure of the argumentStructure of the argument

► IntroductionIntroduction►1.The perspective of information theory1.The perspective of information theory►2.Syntactic complexity measures 2.Syntactic complexity measures

(Chomsky)(Chomsky)►3.Geometrical deformations as memory 3.Geometrical deformations as memory

(Leyton)(Leyton)

►4.Semantic archetypes (Thom)4.Semantic archetypes (Thom)

►ConclusionsConclusions

LCM 2 Paris 2006 3

ComplexityComplexity , change in complexity, , change in complexity,

barriers of complexitybarriers of complexity ► Several proposals for a complexity measure Several proposals for a complexity measure

have been made and have played a role in at have been made and have played a role in at least one of the three fields: phylogeny, least one of the three fields: phylogeny, ontogeny and cultural dynamics. The paper ontogeny and cultural dynamics. The paper will try to evaluate four of them and ask how will try to evaluate four of them and ask how they are analyzed in the cases easier to they are analyzed in the cases easier to access: ontogeny of language and cultural access: ontogeny of language and cultural evolution, and how they may be applied to evolution, and how they may be applied to restrict models of linguistic (semiotic) restrict models of linguistic (semiotic) evolution, to which an empirical access seems evolution, to which an empirical access seems to be almost impossible.to be almost impossible.

LCM 2 Paris 2006 4

The four measures are:The four measures are:► Increase of information / informational complexity Increase of information / informational complexity

or negentropy (decrease of entropy). This notion or negentropy (decrease of entropy). This notion rests on principles of thermodynamics (entropy) rests on principles of thermodynamics (entropy) and has been elaborated in the mathematical and has been elaborated in the mathematical theory of information (Shannon and Weaver, 1949).theory of information (Shannon and Weaver, 1949).

► The Chomsky hierarchy of formal languages (finite The Chomsky hierarchy of formal languages (finite state grammars, context-free and context sensitive state grammars, context-free and context sensitive grammars, transformational grammars; cf. grammars, transformational grammars; cf. Chomsky, 1957).Chomsky, 1957).

► A geometrically based transformational measure of A geometrically based transformational measure of complexity (memory) has been proposed by Leyton complexity (memory) has been proposed by Leyton (2001); it may be compared to Chomsky’s proposal (2001); it may be compared to Chomsky’s proposal and to Thom’s hierarchy. and to Thom’s hierarchy.

► The abutment hierarchy of elementary catastrophes The abutment hierarchy of elementary catastrophes and the hierarchy of semantic archetypes derived and the hierarchy of semantic archetypes derived from it (Thom, 1972 and Wildgen, 1994).from it (Thom, 1972 and Wildgen, 1994).

LCM 2 Paris 2006 5

Are all languages equally Are all languages equally complex?complex?

► In the case of the evolution of language In the case of the evolution of language capacity since the LCA (last common capacity since the LCA (last common ancestor) changes in complexity cannot be ancestor) changes in complexity cannot be dismissed.dismissed.

► This is also the case (on an smaller scale) This is also the case (on an smaller scale) for functionally deprived languages like for functionally deprived languages like pidgins and jargons.pidgins and jargons.

► Different kinds of diminished functionality Different kinds of diminished functionality must be assumed for mentally retarded, must be assumed for mentally retarded, autistic, aphasic persons.autistic, aphasic persons.

LCM 2 Paris 2006 6

Different presuppositionsDifferent presuppositions► The four measures I will discuss presuppose The four measures I will discuss presuppose

different views on semiotic systems and refer different views on semiotic systems and refer to different models of human language and to different models of human language and communication.communication.

► For this reason the search for an adequate For this reason the search for an adequate measure of complexity in semiotic systems is measure of complexity in semiotic systems is also a search for a general and adequate also a search for a general and adequate model of human language and semiosis in model of human language and semiosis in general.general.

► Moreover the measures reflect different Moreover the measures reflect different analytic strategies: global characterization of analytic strategies: global characterization of a system (its grammar) versus local a system (its grammar) versus local characterization (the morphogenetic germs characterization (the morphogenetic germs or singularities of the system)or singularities of the system)

LCM 2 Paris 2006 7

1.The perspective of information 1.The perspective of information theorytheory

► The major concern of information theory is The major concern of information theory is the amount and quality of signs transmitted the amount and quality of signs transmitted through a given channel (in a time slot).through a given channel (in a time slot).

► The background is a statistical one, i.e. the The background is a statistical one, i.e. the probability distribution of states of the probability distribution of states of the world(s) which are referred to by signs.world(s) which are referred to by signs.

► Very rare events have high information (cf. Very rare events have high information (cf. the logarithmic measure of the inverse of the logarithmic measure of the inverse of the probability (p) of an event (e): inf(e) = ld the probability (p) of an event (e): inf(e) = ld 1/p(e)1/p(e)

LCM 2 Paris 2006 8

Application to semiotic Application to semiotic systemssystems

► Transition probabilities in Markov-chains Transition probabilities in Markov-chains diminish, if the set of lexical choices available diminish, if the set of lexical choices available at each step increases (if the probability of at each step increases (if the probability of each single choice decreases).each single choice decreases).

► If the (generative) base syntax allows for more If the (generative) base syntax allows for more alternative structures at each step the alternative structures at each step the probability of each point with structural probability of each point with structural alternatives decreases (cf. grammars with alternatives decreases (cf. grammars with variables rules or probabilistic grammars as variables rules or probabilistic grammars as applied in sociolinguistics (HDP 1975, Klein & applied in sociolinguistics (HDP 1975, Klein & Dittmar, Developing grammars, 1979)).Dittmar, Developing grammars, 1979)).

LCM 2 Paris 2006 9

Semantic informationSemantic information► Applying the measure of semantic information, i.e. Applying the measure of semantic information, i.e.

the amount of specification of a semantic universe the amount of specification of a semantic universe proposed by Carnap and Bar-Hillel (1952), I was able proposed by Carnap and Bar-Hillel (1952), I was able to show that the code-hypothesis of Bernstein may to show that the code-hypothesis of Bernstein may be properly opera-tionalized by a measure of be properly opera-tionalized by a measure of semantic information (cf. Wildgen, 1977a,b).semantic information (cf. Wildgen, 1977a,b).

► A given filler of a syntactic slot has higher A given filler of a syntactic slot has higher information, if the semantic universe of the possible information, if the semantic universe of the possible (or experimentally elicited) fillers is more restricted. (or experimentally elicited) fillers is more restricted. Example:Example:

► Given three independent predicates appearing as Given three independent predicates appearing as expressions or its components in a given syntactic expressions or its components in a given syntactic slot: He, a person, a child, a girl, we may assume 2slot: He, a person, a child, a girl, we may assume 23 3

=8 states of the universe which are possible.=8 states of the universe which are possible.► If all predicates are (positively or negatively) If all predicates are (positively or negatively)

specified the probability of the filler is 1/8; specified the probability of the filler is 1/8; information = ld 8= 3.information = ld 8= 3.

LCM 2 Paris 2006 10

2.Syntactic complexity 2.Syntactic complexity measuresmeasures

In his booklet of 1957 Chomsky argued In his booklet of 1957 Chomsky argued against linear grammars and probabilistic against linear grammars and probabilistic considerations in grammar. Instead he considerations in grammar. Instead he proposed a hierarchy of grammar formats:proposed a hierarchy of grammar formats:

► Type 3 corresponds to transitions networks Type 3 corresponds to transitions networks as those considered in information theory as those considered in information theory (Markov-chains).(Markov-chains).

► Chomsky’s hierarchy (or similar ones) Chomsky’s hierarchy (or similar ones) introduce a complexity measure on introduce a complexity measure on syntactic (generative) rules in the syntactic (generative) rules in the framework of production systems.framework of production systems.

LCM 2 Paris 2006 11

►Type 2 grammars are called context free; Type 2 grammars are called context free; they allow for the construction of they allow for the construction of abstract structures without direct abstract structures without direct reference to terminal elements (at each reference to terminal elements (at each step).step).

►Type 1 grammars consider (left, right) Type 1 grammars consider (left, right) contexts of rule applications.contexts of rule applications.

►Type 0 grammars allow for Type 0 grammars allow for transformations of structures (partial transformations of structures (partial trees) with deletion and other rather trees) with deletion and other rather unrestricted operations (e.g. move-unrestricted operations (e.g. move-operations).operations).

LCM 2 Paris 2006 12

► The measure first concerns the computational The measure first concerns the computational strategy of sentence generation / analysis, secondly it strategy of sentence generation / analysis, secondly it involves dependencies between linguistic events with involves dependencies between linguistic events with different scope, i.e. local (step by step from left to different scope, i.e. local (step by step from left to right) for type 3, global for type 0 grammars. Barriers right) for type 3, global for type 0 grammars. Barriers may further restrict the scope of move-operations.may further restrict the scope of move-operations.

► Practically the computational grammars proposed Practically the computational grammars proposed after 1957 tended to solve the problem of sentence after 1957 tended to solve the problem of sentence generation at lower levels (type 3; cf. ATN-networks generation at lower levels (type 3; cf. ATN-networks or type 2).or type 2).

► As the measure is restricted to formal grammars of a As the measure is restricted to formal grammars of a certain type, its relevance for human language is not certain type, its relevance for human language is not evident (formal language is not evident (formal language is not natural language). natural language).

► One may assume that a protolanguage with a very One may assume that a protolanguage with a very small lexicon (below 50 items) and simple syntactic small lexicon (below 50 items) and simple syntactic rules at the level of two word utterances (no or rules at the level of two word utterances (no or primitive morphology) may easily be generated by primitive morphology) may easily be generated by type 3 grammars (this stage corresponds roughly to type 3 grammars (this stage corresponds roughly to the competence of a two year old human child).the competence of a two year old human child).

LCM 2 Paris 2006 13

CriticismCriticism► The Chomsky hierarchy is only a segment of a The Chomsky hierarchy is only a segment of a

net of different restrictions on replacement rules net of different restrictions on replacement rules (in the semi-Thue-format).(in the semi-Thue-format).

► The rule formats are highly dependent on The rule formats are highly dependent on conceptions of grammar writing and have been conceptions of grammar writing and have been changing rather quickly (since Chomsky’s changing rather quickly (since Chomsky’s proposals in 1957). Therefore they are rather proposals in 1957). Therefore they are rather technical features of grammars (therefore technical features of grammars (therefore artifacts) than a characteristic for real artifacts) than a characteristic for real languages. languages.

► Some of the features asking for more complex Some of the features asking for more complex grammars are rather infrequent (not necessarily grammars are rather infrequent (not necessarily relevant for all language users; cf. Chomsky’s relevant for all language users; cf. Chomsky’s mirror-language argument in Chomsky 1957).mirror-language argument in Chomsky 1957).

LCM 2 Paris 2006 14

3.Geometrical deformations as 3.Geometrical deformations as memory (complexity =degree of memory (complexity =degree of

deformation)deformation)

Deformation of a circle by forces () and resultant indentations (cf. Leyton, 1997: P2).

LCM 2 Paris 2006 15

Deformations of complexityDeformations of complexity 33

This process may be continued to any level of complexity

LCM 2 Paris 2006 16

The patterns of deformation (active forces The patterns of deformation (active forces and indentations) have different effects:and indentations) have different effects:

1.1. They refer to some ideal form; e.g. a They refer to some ideal form; e.g. a circle,circle,

2.2. They are an index of forces which apply They are an index of forces which apply to the ideal form,to the ideal form,

3.3. They are a memory of the deformations.They are a memory of the deformations.

The amount of deformations (transforma-The amount of deformations (transforma-tions) may serve as a complexity tions) may serve as a complexity measure (of shape memory)measure (of shape memory)

LCM 2 Paris 2006 17

Early object-symbols as precursors of writing

One can see some geometrically ideal forms like spheres, disks, pyramids and deformations, the composition with lines and holes etc.

LCM 2 Paris 2006 18

Application to symbolic formsApplication to symbolic forms

► The (shape) is the number of deformations The (shape) is the number of deformations applied to it; the asymmetries added (by applied to it; the asymmetries added (by symmetry breaking).symmetry breaking).

► As the deformations are the traces of processes As the deformations are the traces of processes which have acted on the (ideal) figure at the which have acted on the (ideal) figure at the start, they constitute a memory inscribed in the start, they constitute a memory inscribed in the figure.figure.

► The forces of deformation may be power fields in The forces of deformation may be power fields in different medias (brushes in a painting, strokes in different medias (brushes in a painting, strokes in a sculpture, physical forces in an architecture).a sculpture, physical forces in an architecture).

► In phonation a basic vowel (e.g. [ə] may be In phonation a basic vowel (e.g. [ə] may be deformed to obtain the field of vowel phonemes, deformed to obtain the field of vowel phonemes, a regular r pace may give a rhythm, etc.a regular r pace may give a rhythm, etc.

LCM 2 Paris 2006 19

4.Semantic archetypes 4.Semantic archetypes (Thom)(Thom)

► The set of semantic archetypes proposed by The set of semantic archetypes proposed by Thom (1972) and elaborated by Wildgen Thom (1972) and elaborated by Wildgen (1982) are abstract pictures of possible (1982) are abstract pictures of possible scenes scenes

► The basic semantics of sentences can be The basic semantics of sentences can be derived as specific applications derived as specific applications (interpretations) of these archetypes.(interpretations) of these archetypes.

► The abutment hierarchy of elementary The abutment hierarchy of elementary catastro-phes is therefore a mathematically catastro-phes is therefore a mathematically founded complexity measure for schematic founded complexity measure for schematic pictures/actions/sentences.pictures/actions/sentences.

LCM 2 Paris 2006 20

The abutment hierarchy of The abutment hierarchy of catastrophe theorycatastrophe theory

Our examples referred to A3 (complexity 2), A5 (complexity 3) and D4 (compactified in X9; cf. Wildgen, 1985)

LCM 2 Paris 2006 21

► The action-concept involves two stable The action-concept involves two stable entities: the body (the hand) and the object. entities: the body (the hand) and the object. Every point on the lines is an attractor.Every point on the lines is an attractor.

The book

The father

Catastrophe of capture

The father (having the book)

Valence (complexity) 2

Derived from the cusp (A+3)

LCM 2 Paris 2006 22

Valence (complexity) 3

Derived from the butterfly (A+5)

LCM 2 Paris 2006 23

Valence (complexity) 4

Agent

Instrument

Object (pebble)

Substracted piece

Schema of shaping an object with an instrument via subtraction.

Derived from the parabolic umbilic (D5) (compactified)

LCM 2 Paris 2006 24

Schema complexity and the Schema complexity and the evolution of symbolic behaviorevolution of symbolic behavior

The complexity of schemata may be applied to The complexity of schemata may be applied to different symbolic forms (Cassirer):different symbolic forms (Cassirer):

► Lithic industries and the process of Lithic industries and the process of manufacturing tools (cf. Wildgen, 2004)manufacturing tools (cf. Wildgen, 2004)

► Art, e.g. the composition of a figure by manual Art, e.g. the composition of a figure by manual traits or the structure of abstract signstraits or the structure of abstract signs

► Mythical structures, e.g. action schemata of Mythical structures, e.g. action schemata of heroes/ghosts/gods on objects and other actorsheroes/ghosts/gods on objects and other actors

► Valence schemata of verbs in a protolanguage Valence schemata of verbs in a protolanguage (cf. Wildgen, 2004)(cf. Wildgen, 2004)

LCM 2 Paris 2006 25

In order to produce such an hand axe in the early Paleolithic, on had to use both hands and systematically subtract pieces (which could be tools themselves); as a consequence positive tools and negative ones may be distinguished

Abbévillien= 600.000-350.000, second glacial period;

Acheuléen= 350.000-100.000; third glacial period

(above) Abbévillien-Biface (Le Stade) Le Champs de Mars(below) Middle Acheuléen (Saint Acheul) (cf. Weiner, 1972: 130)

LCM 2 Paris 2006 26

Handmotion and basic drawing techniquesHandmotion and basic drawing techniques

Monochro-me drawing of a horse (Peña de Candamo)

The drawing may be decomposed into a number of traits:

Broad curves (semicircular), lines, small strokes, points, etc.

The compositionality principle is a Gestaltist one

LCM 2 Paris 2006 27

Handaxe in the early Paleolithicum

(above)

Abbévillien-

Biface (Le Stade)

Le Champs de Mars

(below)

Middle

Acheuléen

(Saint Acheul)

(cf. Weiner, 1972: 130)

Abbévillien= 600.000-350.000, second glacial period; Acheuléen= 350.000-100.000; third glacial period

LCM 2 Paris 2006 28

Drawing techniquesDrawing techniques

Monochrome drawing of a horse (Peña de Candamo)

LCM 2 Paris 2006 29

Evolution as transition to higher Evolution as transition to higher levels of complexity levels of complexity

(=emergence)(=emergence)► The complexity measure must first allow for The complexity measure must first allow for

levels of complexity, i.e. there must be levels of complexity, i.e. there must be barriers to transition, which define plateaus.barriers to transition, which define plateaus.

► The nature of the barrier must be such its The nature of the barrier must be such its transition has costs, which must be transition has costs, which must be evaluated in a Darwinian context.evaluated in a Darwinian context.

► If we assume parallel channels of symbolic If we assume parallel channels of symbolic processes, a level of complexity may be processes, a level of complexity may be reached in one symbolic form and possibly reached in one symbolic form and possibly enable the transition (at lower costs) in enable the transition (at lower costs) in another one.another one.

LCM 2 Paris 2006 30

Strategic conclusionStrategic conclusion► As a complexity measure is crucial for a whole family of As a complexity measure is crucial for a whole family of

models in language and cognition, one should not mix it models in language and cognition, one should not mix it with spurious complexity introduced by the with spurious complexity introduced by the technicalities of specific descriptive devices. Thus basic technicalities of specific descriptive devices. Thus basic intuitions of the Chomsky hierarchy should be freed intuitions of the Chomsky hierarchy should be freed from their restriction to generative grammars.from their restriction to generative grammars.

► The complexity measure must allow a translation to The complexity measure must allow a translation to measures applicable to natural systems (i.e. in physics, measures applicable to natural systems (i.e. in physics, chemistry, biology). This criterion is only fulfilled by the chemistry, biology). This criterion is only fulfilled by the information measure and the catastrophe hierarchy.information measure and the catastrophe hierarchy.

► The concept of emergence should be defined either in The concept of emergence should be defined either in the context of a morphogenetic theory or in the the context of a morphogenetic theory or in the framework of a theory of (stochastic) self-organization.framework of a theory of (stochastic) self-organization.

LCM 2 Paris 2006 31

ConclusionsConclusions

► The proposal of Leyton may be easily applied to The proposal of Leyton may be easily applied to visual semiotics; its application to semantics visual semiotics; its application to semantics asks for a kind of morphodynamic theory, as asks for a kind of morphodynamic theory, as developed by Thom.developed by Thom.

► Only catastrophe theory allows for a natural Only catastrophe theory allows for a natural restriction to valence 3 (4 with special restriction to valence 3 (4 with special conditions).conditions).

► The abutment hierarchy imposes a very general The abutment hierarchy imposes a very general restriction on model complexity; a feature which restriction on model complexity; a feature which Leyton’s theory is lacking and which is totally Leyton’s theory is lacking and which is totally absent in the algebraic/logical tradition (with the absent in the algebraic/logical tradition (with the exception of Peirce’s diagrammatic logic, but he exception of Peirce’s diagrammatic logic, but he could not derive the restriction logically).could not derive the restriction logically).

LCM 2 Paris 2006 32

References:References:

► Chomsky, Noam, 1957. Syntactic Structures, Mouton, Chomsky, Noam, 1957. Syntactic Structures, Mouton, Den Haag.Den Haag.

► Cumming, Graeme S. and John Collier, 2005. Change Cumming, Graeme S. and John Collier, 2005. Change and Identity in Complex Systems, in: Ecology and and Identity in Complex Systems, in: Ecology and Society 10 (1); online: Society 10 (1); online: www.ecologyandsociety.org/vol10/iss1/art29www.ecologyandsociety.org/vol10/iss1/art29

► Leyton, Michael. 2001. A Generative Theory of Leyton, Michael. 2001. A Generative Theory of Shape, Springer, Heidelberg.Shape, Springer, Heidelberg.

► Shannon C.E. and Weaver, W., 1949. The Shannon C.E. and Weaver, W., 1949. The Mathematical Theory of Information, University of Mathematical Theory of Information, University of Illinois Press, Urbana.Illinois Press, Urbana.

► Thom, R., 1972. Stabilité structurelle et Thom, R., 1972. Stabilité structurelle et morphogenèse (2e édition, revue et corrigée, 1977), morphogenèse (2e édition, revue et corrigée, 1977), Interéditions, Paris.Interéditions, Paris.

LCM 2 Paris 2006 33

► Wildgen, Wolfgang, 1977a. Differentielle Linguistik. Wildgen, Wolfgang, 1977a. Differentielle Linguistik. Entwurf eines Modells zur Messung von semantisch-Entwurf eines Modells zur Messung von semantisch-pragmatischer Variation, Niemeyer, Tübingen.pragmatischer Variation, Niemeyer, Tübingen.

► Wildgen, Wolfgang, 1977b. Kommunikativer Stil und Wildgen, Wolfgang, 1977b. Kommunikativer Stil und Sozialisation. Eine empirische Unter suchung, Niemeyer, Sozialisation. Eine empirische Unter suchung, Niemeyer, TübingenTübingen

► Wildgen, WolfgangWildgen, Wolfgang,1982. Catastrophe Theoretic ,1982. Catastrophe Theoretic Semantics. An Elaboration and Application of René Semantics. An Elaboration and Application of René Thom's Theory, Benjamins, Amsterdam. Thom's Theory, Benjamins, Amsterdam.

► Wildgen, Wolfgang, 1994. Process, Image, and Wildgen, Wolfgang, 1994. Process, Image, and Meaning. A Realistic Model of the Meanings of Meaning. A Realistic Model of the Meanings of Sentences and Narrative Texts, Benjamins, Sentences and Narrative Texts, Benjamins, Amsterdam.Amsterdam.

► Wildgen, Wolfgang, 1999. De la grammaire au discours. Wildgen, Wolfgang, 1999. De la grammaire au discours. Une approche morphodynamique, Reihe: Sémiotique Une approche morphodynamique, Reihe: Sémiotique européenne, Lang Verlag, Bern.européenne, Lang Verlag, Bern.

► Wildgen, Wolfgang, 2004. The Evolution of Human Wildgen, Wolfgang, 2004. The Evolution of Human Languages. Scenarios, Principles, and Cultural Languages. Scenarios, Principles, and Cultural Dynamics, Benjamins, Amsterdam.Dynamics, Benjamins, Amsterdam.