Ariel Rubinstein Economics and Language Five Essays 2000

Embed Size (px)

Citation preview

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    1/138

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    2/138

    This page intentionally left blank

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    3/138

    Arising out of the authors lifetime fascination with the links

    between the formal language of mathematical models and

    natural language, this short book comprises five essays investi-

    gating both the economics of language and the language of eco-nomics. Ariel Rubinstein touches on the structure imposed on

    binary relations in daily language, the evolutionary develop-

    ment of the meaning of words, game-theoretical considerations

    of pragmatics, the language of economic agents, and the rheto-

    ric of game theory. These short essays are full of challenging

    ideas for social scientists that should help to encourage a funda-

    mental rethinking of many of the underlying assumptions ineconomic theory and game theory. A postscript contains com-

    ments by a logician, Johan van Benthem (University of

    Amsterdam, Institute for Logic, Language and Computation

    and Stanford University, Center for the Study of Language and

    Information) and two economists, Tilman Brgers (University

    College, London) and Barton Lipman (University of Wisconsin,

    Madison).

    AR I EL R U BI N ST E IN is Professor of Economics at Tel Aviv

    University and Princeton University. His recent publications

    include Modeling Bounded Rationality(1998), A Course in

    Game Theory (with M. Osborne, 1994) and Bargaining and

    Markets (with M. Osborne, 1990).

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    4/138

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    5/138

    Economics and Language

    Five Essays

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    6/138

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    7/138

    THE CHURCHILL LECTURES IN ECONOMIC THEORY

    Economics andLanguage

    Five Essays

    A RIEL RUBINS TEINTel Aviv University

    Princeton University

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    8/138

    The Pitt Building, Trumpington Street, Cambridge, United Kingdom

    The Edinburgh Building, Cambridge CB2 2RU, UK40 West 20th Street, New York, NY 10011-4211, USA477 Williamstown Road, Port Melbourne, VIC 3207, AustraliaRuiz de Alarcn 13, 28014 Madrid, SpainDock House, The Waterfront, Cape Town 8001, South Africa

    http://www.cambridge.org

    First published in printed format

    ISBN 0-521-59306-9 hardbackISBN 0-521-78990-7 paperback

    ISBN 0-511-03913-1 eBook

    Ariel Rubinstein 2004

    2000

    (Adobe Reader)

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    9/138

    CONTENTS

    0 Acknowledgments page viii

    PART 1 E CON OM IC S O F L AN GU AGE

    0 Economics and language 3

    1 Choosing the semantic properties of language 9

    2 Evolution gives meaning to language 25

    3 Strategic considerations in pragmatics 37

    PART 2 L ANG UA GE O F E CO NOM IC S

    4 Decision making and language 55

    5 On the rhetoric of game theory 71

    Concluding remarks 89

    PART 3 COMMENTS

    0 Johan van Benthem 93

    0 Tilman Brgers 108

    0 Barton Lipman 114

    0 Index 125

    vii

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    10/138

    ACKNOW LEDGM ENTS

    This book emerged from the kind invitation of the fellows

    of Churchill College to deliver the Churchill Lectures of1996. It is a pleasure for me to use this opportunity tothank Frank Hahn, for his encouragement during myentire career.

    The book is an extended version of my ChurchillLectures delivered in Cambridge, UK in May 1996. Part ofthe material overlaps with the Schwartz Lecture deliveredat Northwestern University in May 1998.

    Three people agreed on very short notice to provide theirwritten comments and to publish them in the book: Johanvan Benthem, Tilman Brgers and Bart Lipman. I thankthem wholeheartedly for their willingness to do so. Theidea to add comments to the book should be credited to

    Chris Harrison, the Cambridge University Press editor ofthis book.Several people have commented on parts of the text in

    its various stages. In particular, I would like to thank BartLipman, Martin Osborne and Rani Spiegler, who werealways ready to help, and Mark Voorneveld, who providedcomments on the final draft of the manuscript. My special

    thanks to my partner in life, Yael. As so much else, the ideaof the cover is hers.

    My thanks to Nina Reshef and Sharon Simmer, whohelped me with language editing; the remaining errors aremy own.

    Finally, I would like to acknowledge the financial supportgiven me by the Israeli Science Foundation (grant no.

    06-1011-0673). I am also grateful to The Russell SageFoundation and New York University, which hosted meduring the period when much of the writing was completed.

    viii

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    11/138

    P A R T 1

    ECONOM ICS OF LANGUAGE

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    12/138

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    13/138

    C H A P T E R 0

    ECONOM ICS AND LANGUAGE

    The psychologist Joel Davitz once wrote: I suspect that

    most research in the social sciences has roots somewherein the personal life of the researcher, though these roots arerarely reported in published papers (Davitz, 1976). Thefirst part of this statement definitely applies to this book.Though I am involved in several fields of economics andgame theory, all my academic research has been motivatedby my childhood desire to understand the way that peopleargue. In high school, I wanted to study logic, which Ithought would be useful in political debates or in legalbattles against evil once I fulfilled my dream of becoming asolicitor. Unfortunately, I became neither a lawyer nor apolitician, and I have since come to understand that logicis not a very useful tool in these areas in any case.

    Nonetheless, I continued to explore formal models of gametheory and economic theory, though not in the hope of pre-dicting human behavior, not in anticipation of predictingthe stock market prices, and without any illusion aboutthe ability of capturing all of reality in one simple model. Iam simply interested in the reasoning behind decisionmaking and in the arguments people bring in debates. I am

    still puzzled, and even fascinated, by the magic of the linksbetween the formal language of mathematical models andnatural language. This brings me to the subject of thislecture Economics and Language.

    0.1 Economics and language

    The title of these lectures may be misleading. Althoughthe caption Economics and Language is a catchy title, itis too vague. It encompasses numerous subjects, most of

    3

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    14/138

    which will not be touched on here. This series of lectureswill briefly address five issues which fall under this generalheading. The issues can be presented in the form of five

    questions: Why do we tend to arrange things on a line and not in a

    circle? How is it that the utterance be careful is understood

    by the listener as a warning and not as an invitation to adance?

    How is it that the statement it is not raining very hardis understood to mean it is raining but not very hard? Does the textbook utility function log (x11)x2 make

    sense? Is the use of the word strategy in game theory rhetori-

    cal?

    All the issues discussed in these lectures lie somewherebetween economic theory and the study of language. Twoquestions spring to mind:

    Why would economic theory be relevant to linguisticissues? Economic theory is an attempt to explain regu-larities in human interaction and the most fundamentalnonphysical regularity in human interaction is natural

    language. Economic theory carefully analyzes the designof social systems; language is, in part, a mechanism ofcommunication. Economics attempts to explain socialinstitutions as regularities deriving from the optimiza-tion of certain functions; this may be applicable to lan-guage as well. In these lectures I will try to demonstratethe relevance of economic thought to the study of lan-guage by presenting several economic-like analyses toaddress linguistic issues.

    Why would economic theory be a relevant subject ofresearch from the point of view of language? Becauseeconomic agents are human beings for whom language isa central tool in the process of making decisions and

    forming judgments. And because the other importantplayers in Economic Theory namely ourselves, theeconomic theorists use formal models but these are

    Economics of language

    4

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    15/138

    not simply mathematical models; their significancederives from their interpretation, which is expressedusing daily language.

    0.2 Outline of the lectures

    The book deals with five independent issues organizedinto two groups:

    Part 1 is entitled Economics of Language and com-prises the core of this book. In Part 1, methods taken from

    economic theory are used to address questions regardingnatural language. The basic approach is that languageserves certain functions from which the properties of lan-guage are derived.

    In chapter 1, I assume that language is the product of afictitious optimizer who operates behind a veil of ignor-ance. The substantive issue studied in this chapter is thestructure imposed on binary relations in daily language.The designer chooses properties of binary relations thatwill serve the users of the language. The three parts of thechapter discuss three distinct targets of binary relations:

    (1) To enable the user of the relation to point out namelesselements.

    (2) To improve the accuracy with which the vocabularyspanned by the relation approximates the actual termsto which the user of the language is referring.

    (3) To facilitate the description of the relation by means ofexamples.

    It will be shown that optimization with respect to these

    three targets explains the popularity of linear orderings innatural language.

    In chapter 2, we discuss the evolutionary developmentof the meaning of words. The analytical tool used is avariant of the game-theoretic notion of evolutionary stablestrategy. Complexity considerations are added to the stan-dard notion of evolutionary stable equilibrium as an addi-

    tional evolutionary factor.In chapter 3, I touch on pragmatics, the topic furthest

    from the traditional economic issues that are discussed in

    Economics and language

    5

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    16/138

    these essays. Pragmatics searches for rules that explain thedifference in meaning between a statement made in a con-versation and the same statement when it is stated in iso-

    lation. Grice examined such rules in the framework of aconversation in which the participants are assumed to becooperative. Here, game-theoretical analysis will be usedto explain a certain phenomenon found in debates.

    Part 2 is entitled Language of Economics and includestwo essays.

    Chapter 4 deals with the Language of Economic Agents.

    The starting point of the discussion is that decisionmakers, when making deliberate choices, often verbalizetheir deliberations. This assumption is especially fittingwhen the decision maker is a collective but also hasappeal when the decision maker is an individual. Tools ofmathematical logic are used to formalize the assumption.The objective is to analyze the constraints on the set ofpreferences which arise from natural restrictions on thelanguage used by the decision maker to verbalize his pref-erences. I demonstrate in two different contexts that thedefinability constraint severely restricts the set of admis-sible preferences.

    Chapter 5 focuses on the rhetoric of game theory. Much

    has been written on the rhetoric of economics in general;little, however, has been written on the rhetoric of gametheory. The starting point of the discussion is that an eco-nomic model is a combination of a formal model and itsinterpretation. Using the Nash bargaining solution as anillustration, I first make the obvious claim that differencesin models which seem equivalent result in significant dif-

    ferences in the interpretation of their results. The mainargument of the chapter is more controversial. I argue thatthe rhetoric of game theory is misleading in that it createsthe impression that game theory is more useful than itactually is, and that a better interpretation would makegame theory much less relevant than is usually claimed inthe applied game theory literature.

    Though the book covers several distinct issues under theheading of economics and language, it by no meanscovers all the issues that might be subsumed under this

    Economics of language

    6

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    17/138

    rubric. For example, I do not discuss the (largely ignored)literature labeled the economics of language which wassurveyed in a special issue of the International Journal of

    the Sociology of Language (see Grin, 1996). Grin (1996)defines the economics of language as a paradigm oftheoretical economics and uses the concepts and tools ofeconomics in the study of relationships featuring linguis-tic variables; it focuses principally, but not exclusively, onthose relationships in which economic variables play apart. This body of research does indeed revolve around

    traditional economic variables and related issues suchas the economic costs and benefits of multi-languagesociety, language-based inequality, and language andnationalism. However, despite the similar headings,those issues are very far from my interests as expressed inthis book.

    0.3 One more personal comment

    While browsing through the literature in preparation forthese lectures, I came across a short article writtenby Jacob Marschack entitled the The Economics ofLanguage (Marschak, 1965). The article begins with a

    discussion between engineers and psychologists regardingthe design of the communications system of a smallfighter plane. Following the discussion Marschak states:The present writer apologizes to those of his felloweconomists who might prefer to define their field morenarrowly, and who would object to identification ofeconomics with the search of optimality in fields extend-

    ing beyond, though including, the production and distri-bution of marketable goods. He then continues: Beingignorant of linguistics, he apologizes even more humblyto those linguists who would scorn the designation of asimple dial-and-buttons systems a language. I dont feelthat any apology is due to economists but I do feel asincere apology is owed to linguists and philosophers of

    language. Although I am quite ignorant in those areas, Ihope that these essays present some interesting ideas forthe study of language.

    Economics and language

    7

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    18/138

    R EF ER EN CES

    Davitz, J. (1976) The Communication of Emotional Meaning,New York: Greenwood Publishing Group

    Grin, F. (1996) Economic Approaches to Language andLanguage Planning: An Introduction, InternationalJournal of the Sociology of Language, 121, 116

    Marschak, J. (1965) The Economics of Language, BehavioralScience, 10, 13540

    Economics of language

    8

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    19/138

    C H A P T E R 1

    CHOOSING THE SEM ANTIC

    PROPERTIES OF LANGUAGE

    1.1 IntroductionThis chapter will present a research agenda whose primeobjective is to explain how features of natural language areconsistent with the optimization of certain reasonabletarget functions. Rather than discuss the research agendain abstract, I will begin with the specific argument andreturn to the general discussion at the end of the chapter.

    This chapter discusses binary relations. A binary rela-tion on a set specifies a connection between elementswithin the set. Such binary relations are common innatural language. For example, personxknows persony,tree x is to the right of tree y, picture x is similar topicturey, chairxand chairyare the same color, and so

    on. I will avoid binary relations such as Professorxworksfor universityy or the Social Security number ofxisy,which specify relationships between elements whichnaturally belong to two distinct sets. I will further restrictthe term binary relation to be irreflexive: No elementrelates to itself. The reason for this is that the termx relates to y when xy is fundamentally different

    from xrelates to y when xy. For example, the state-ment a loves b is different from the statement a loveshimself.

    Certain binary relations, by their nature, must satisfycertain properties. For example, the relation xis a neigh-bor of y must, in any acceptable use of this relation,satisfy the symmetry property (ifxis a neighbor ofy, then

    yis a neighbor ofx). The relation xis to the right of y

    9

    This chapter is based on Rubinstein (1996).

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    20/138

    must be alinear ordering, thus satisfying the properties ofcompleteness (for everyxy, eitherxrelates toyorytox),asymmetry (for everyxandy, ifxrelates toy,ydoes not

    relate tox), and transitivity (for everyx,y, andz, ifxrelatestoyandytoz, thenxrelates toz). In contrast, the nature ofmany other binary relations, such as the relation xloves

    y, does not imply any specific properties that the relationmust satisfy a priori. It may be true that among a particulargroup of people, xlovesy implies ylovesx. However,there is nothing in our understanding of the relation x

    lovesy which necessitates this symmetry.The subject of this chapter is in fact the properties of

    those binary relations which appear in natural language.Formally, a property of the relation R is defined to be a sen-tence in the language of the calculus of predicates whichuses a name for the binary relation R, variable names, con-nectives, and qualifiers, but does not include any individ-ual names from the set of objects . I will refer to thecombination of properties of a term as its structure.

    I am curious as to the structures of binary relations innatural language. I search for explanations as to why, out ofan infinite number of potential properties, we find thatonly a few are common in natural languages. For example,

    it is difficult to find natural properties of binary relationssuch as the following:

    A1: IfxRyandxRz (yz), and bothyRa andzRa, then alsoxRa.

    A2: For everyxthere are three elementsyfor whichxRy.(In contrast, the relation xis the child ofy on the setof human beings does satisfy the property that foreveryxthere are two elementsywhichxrelates to.)

    Alternatively, it is difficult to find examples of naturalstructures of binary relations which are required to betournaments (satisfying completeness and asymmetry)but which are not required to satisfy transitivity. Oneexception is the structure of the relation x is located

    clockwise fromy(on the shortest arc connectingxandy).Is it simply a coincidence that only a few structures existin natural language?

    Economics of language

    10

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    21/138

    The starting point for the following discussion is thatbinary relations fulfill certain functions in everyday life.There are many possible criteria for examining the func-

    tionality of binary relations. In this discussion, I examineonly three. I will argue that certain properties, shared bylinear orderings, perform better according to each of thesecriteria. Of course, other criteria are also likely to providealternative explanations for the frequent use of variouscommon structures such as equivalence and similarityrelations.

    1.2 Indication-friendliness

    Consider the case in which two parties observe a group oftrees and the speaker wishes to refer to a certain tree. Ifthe tree is the only olive tree in the grove, the speakershould simply use the term the olive tree. If there is nomutually recognized name for the tree and the two partieshave a certain binary relation defined on the set of trees intheir mutual vocabulary, the user can use this relation todefine the element. For example, the phrase the thirdtree on the right indicates one tree out of many by usingthe linear ordering xstands to the left ofy when the

    group of trees is well defined and the relation being tothe left of is a linear ordering. Similarly, the phrase theseventh floor indicates a location in a building given thelinear ordering floorxis above floory. There would beno need to use the phrase if it was known to be the presi-dential floor. On the other hand, the relation linexonthe clock is clockwise to liney (with the smallest angle

    possible) does not enable the user to indicate a certainline on a number-less clock; any formula which is satis-fied by three oclock is satisfied by four oclock as well. Infact, the existence of even one designated line such astwelve oclock, would enable the use of the relation tospecify all lines on the clock. The effect of using such adesignated element is equivalent to transforming the

    circle into a line.Thus, binary relations are viewed here as tools for indi-

    cating elements in a set whose objects do not have names.

    Choosing the semantic properties of language

    11

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    22/138

    We look for structures that enable the user to unambigu-ously single out any element out of any subset of . We areled to the following definition:

    Definition: A binary relation R on a set is indication-friendlyif for everyA, and every element aA, thereis a formulafa,A(x) (in the language of the calculus of predi-cates with one binary relation and without individual con-stants) such that a is the only element inA satisfying theformula (when substituting a in place of the free variable

    x).All linear orderings are indication-friendly. If R is a

    linear ordering, the formula P1(x)y(xyxRy) definesthe maximal element in the setA forA. The formulaP2(x)y(xyP1(y)xRy) defines the second-to-the-maximal element, and so on. Note that in natural lan-guage there are short cuts for describing the variouselements. For example, the short cuts for P1(x) and P2(x)are the first and the second.

    In contrast, consider the set {a,b,c,d} and the non-linear binary relation R, called beat, depicted in the fol-lowing diagram (aRb, aRc, dRa, bRd, bRc and cRd):

    Referring to the grand set , the element a is defined byit beats two elements, one of which also beats two ele-ments. The element b is defined by it beats two ele-

    ments, which each beat one element. And so on.However, whereas the relation R allows the user to defineany element in the set , the relation is not effective in

    Economics of language

    12

    Figure 1.1

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    23/138

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    24/138

    four expressions available for referring to subsets ofX: allXs, the Ss, the not Ss, and nothing. We will call thecollection of sets V(S){X,S,S,} the vocabularyspanned

    by S.The basic idea is that language should be flexible enough

    to function under unforeseen circumstances. The speakercan use the terms in V(S) but will eventually need to referto sets not necessarily contained in V(S). The term S willbe evaluated by the vocabularys ease of use with leastloss. It is assumed that when the speaker wishes to refer

    to a set ZX, he will use an element in V(S) which isclosest to the set Z. In order to formally state this idea,we need to define the distance between two sets. Let thedistance betweenA and B, d(A,B) be the cardinality of theasymmetric difference betweenA and B (the set of all ele-ments which are inA and not in B or in B and not inA) i.e. d(A,B)|(AB)(BA)|. One interpretation of thisdistance function fits the case in which the user whowishes to refer to the set B and employs a term AV(S)lists the elements in B which are excluded from A orappended toA and bears a cost measured by the numberof elements which have to be excluded or appended (forexample, the sentence You may eat only bread or any fruit

    with the exception of apples and bananas utilizes theterm fruit and three individual names: bread, apples, andbananas).

    For a set B and a vocabulary V, define (B,V)minAVd(A,B), the distance of the set B from the closest setin the vocabulary V. By assigning equal probabilities a

    priori to all possible sets that the user might wish to refer

    to, the problem for the designer becomes minSzX(Z,V(S))/2|X|. Essentially, the problem boils down to thechoice of thenumber of elements in the optimal set S.

    To further clarify the nature of the problem, we willcarry out a detailed calculation for a four-element set X.Consider the case in which S contains two elements. Theuser can refer to any of the sets X, , S or S, without

    incurring any loss. He can approximate any one- orthree-element set by using the set or the set X, respec-tively, while bearing a cost of 1. If he wishes to refer to

    Economics of language

    14

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    25/138

    one of the four two-element sets which are not S or S, heincurs a loss of 2. Thus, the average loss is [4(0)8(1)4(2)]/161. A similar calculation leads to the conclusion

    that a choice of S as orXleads to an expected loss of 5/4(no loss for andX, a loss of 1 for the eight sets of size 1 and3, and a loss of 2 for the six sets of size 2 i.e. [2(0)8(1)6(2)]/165/4). If S is taken to be a one-element set (or athree-element set), the average loss is only [4(0)6(1)6(1)]/163/4. Thus, splittingXinto two unequal subsetsof sizes 1 and 3 minimizes imprecision.

    Despite the above results, our intuition is correct in thatthe optimal size of S should be half that ofX. When the set

    X is large, the loss associated with choosing an S con-taining half of Xs members is close to being minimal.The following proposition is an exact statement of thisresult (its proof, omitted here, includes a combinatorialcalculation):

    Claim: LetXbe ann-element set. The difference betweenthe solution of the problem minSzX(Z,V(S))/2|X| and theexpected loss from the optimal use of the vocabularyspanned by an (n/2)-member subset ofX, is in the magni-tude of 1/n1/2.

    1.4 Informativeness

    We now return to the world of binary relations on some set. An additional function of binary relations on a set isto transfer or store information concerning a specific rela-tionship existing between the elements of . Consider the

    case in which the grand set includes all authors of articlesin some field of research and the speaker is interested indescribing the relation x quotes y in his article. Thespeaker may describe the relation by listing the pairs ofauthors who satisfy the relation. Alternatively, he may usethose binary relations which are available in his vocabu-lary to describe the xquotes y relation. If he finds his

    vocabulary insufficient to describe the relation, he will usea binary relation which is the best approximation. Forexample, if the relation xis older thany is well defined,

    Choosing the semantic properties of language

    15

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    26/138

    the speaker can use the sentence: An author quotesanother if he is older than himself. As this may not beentirely correct, he can add a qualifying statement such as

    the exceptions are a who did not quote b (though b isolder), and c who did quote d (though d is not older). Suchqualifying statements are the loss incurred from the useof an imprecise relation in order to approximate the whoquotes whom relation.

    Our discussion envisages an imaginary planner who isable to design only one binary relation during the initial

    stage of the world. Of course, real-life language includesnumerous relations and the effectiveness of each dependson the entire fabric of the language. The assumption thatthe designer is planning only one binary relation is madesolely for analytical convenience.

    The design of one binary relation allows the speaker toselect one of four binary relations. For instance, he canstate: Every author is quoted by all others younger thanhim, or Every author is quoted by all others not youngerthan him, and of course he can also state Everyone quoteseveryone and No one quotes anyone, which do notrequire familiarity with any binary relations. Given a rela-tion R, we will refer to these four relations as the vocabulary

    spanned byR and denote it byV(R). (Note that in definingthe vocabulary spanned byR, we ignore other possibilitiesfor defining a binary relation usingR, such as statements ofthe type xSyif there is az such thatxRz andzRy.)

    We assume that the speaker who wishes to refer toa binary relation S will use a relation in V(R) which isthe closest approximation. The loss is measured by the

    number of differences between the relation which thespeaker wishes to describe and the one he finds available inhis vocabulary. The distance between any two binary rela-tions R and R is taken to be the number of pairs (a,b) forwhich it is not true that aRb if and only if aRb. Note thataccording to this measure, any pair for which R and R dis-agree receives the same weight. Regarding the initial state,

    it seems proper to put equal weights on all possibleimprecisions. The designers problem is to minimize theexpected loss resulting from the optimal use of his vocabu-

    Economics of language

    16

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    27/138

    lary. It is assumed that from his point of view, all possiblebinary relations are equally likely to be required by thespeaker. Thus, the designers problem is minRS(S,V(R)),

    where (S,V(R))minTV(R)(S,T) and (S,T) is the distancebetween the relations S and T.

    Stated in this way, the problem becomes a special case ofthe problem discussed in the previous section. The setX, inthe terminology employed in that section, is the set (,)|. A binary relation R is identified with the graphof R i.e. the subset ofXof all (a,b) for which aRb. Recall

    that we concluded earlier that splitting the setXinto twoequal sets is nearly optimal for the designer if he wishes toreduce the expected number of imprecisions.

    We have one more step to go to reach the goal of thissection. In planning a binary relation on , the designermay also consider the possibility that the relation willeventually be used in reference to a subset of . This isanalogous to a binary relation being indication-friendly ifit allows the indication of any object out of any subsetof objects (see section 1.2). Hence, R has to be optimalfor potential use in indicating a subset of (,)| for every subset . If R is complete andasymmetric, for every subset , the induced relation

    R| (defined by aR|b iff a,b and aRb) includes exactlyhalf the pairs in (,)|.In summary, our fictitious planner wishes to design a

    binary relation which spans a vocabulary with the goal ofminimizing the expected inaccuracy of the term the userwill actually use. Viewing a relation as a set of pairs of ele-ments, the problem was linked to the optimization problem

    discussed in the previous section. There we concluded thatsplitting a set into two subsets allows close tooptimal useof the induced vocabulary. Requiring the relation to be com-plete and asymmetric guarantees that for any subset of thegrand set, the restricted relation will be close to optimalas an aid to the user in specifying a relation on the subset.

    Conclusion 2: In order to express binary relations asaccurately as possible on any subset of a set using avocabulary spanned by a single binary relation on the set

    Choosing the semantic properties of language

    17

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    28/138

    , a binary relation on which is complete and asym-metric is close to optimal.

    1.5 Ease of describability

    In this section we discuss the third and last criterion bywhich binary relations are assessed in this chapter.Imagine that a hunter wishes to instruct his son on how tobehave in the forest. When he observes two potentialanimals a and b, should he pursue a or b? The instructions

    have to be applicable to any pair of animals and must beclear. Thus, the set of instructions can be represented by acomplete and asymmetric binary relation R where aRbmeans that when the son simultaneously observes a and bhe should pursue a.

    The son is aware of the sets of animals that are edibleand the structure of the relation R (i.e., the list of proper-ties satisfied). The son is acquainted with the structure ofthe relation either because it is instinctual or because hisfather has informed him of these properties. Therefore, allthat is left for the father to do, when transferring thecontent of R to his son, is to provide him with a list ofexamples i.e., statements of the type animal a should

    be pursued when you see it together with animal b. Theexamples should be rich enough to allow the son to inferthe entire relation from the structure and examples.

    To illustrate, consider the case in which the relation R isa linear ordering and the number of elements in isn. Theminimal number of examples that the father must providein this case isn1 (a1Ra2, a2Ra3 , an1Ran).

    This brings us to the main topic of our discussion. Weassume that providing examples is costly. (The complex-ities of the structure and process of making the inferencesare ignored here.) The following problem comes to mind:What are the structures of the complete and asymetricbinary relations (tournaments) for which the number ofexamples required for their description is minimal?

    Definitions: We say that (f,{aiRbi}iI) defines the binaryrelation R* on when

    Economics of language

    18

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    29/138

    f is a sentence in the language of the calculus ofpredicates with one binary relation named ai, biR and foralli, ai,bi

    R* is the unique binary relation on satisfying the sen-tencefand for alli, it is true that aiR*bi.

    The complexity of R*, denoted (R*), is the minimumnumber of examples that needs to be appended to R* inorder to enable a definition of R* i.e., the minimum sizeof the set Ion all possible definitions of R*.

    As previously stated, our attention is limited to binaryrelations which are tournaments. The mathematicalproblem we wish to solve is:

    minR*is a tournament(R*).

    Note that for a given (finite) set of objects {a1,...,an}, thestructure of any binary relation R* can be expressed

    by the sentence R*(x1, ,xn)x1, ,xn({xiRxj|aiR*aj}).Thus, the optimization problem proposed above is equiva-lent to the following puzzle-type problem: Start with thegraph of a tournament in which the names of the verticeshave been erased. What is the minimum number of exam-ples required to recover the names of the vertices (up toisomorphism)?

    Example 1: Consider the tournament R* on the set{a,b,c} where aR*b, bR*c and cR*a. The sentencewhich states that R is complete, asymmetric, and anti-transitive (x,y,z(xRyandyRzxRz)) is consistent withtwo relations on ; hence, a single observation, aRb, isneeded to complete the definition of R*. Thus, (R*)1.

    Example 2: Let R* be a linear relation. The relation isdefined by a sentencefexpressing completeness, asymme-try and transitivity, and byn1examples{aiR*ai1}i1,,n1,where a1R*a2R*a3, , an1R*an. Obviously, there is no defi-nition of a linear relation with less thann1 observations.Thus, (R*)n1.

    Example 3: Let {a,b,c,d} (n4) and let R* be the rela-tion satisfying aRxfor allxand bRcRdRb. R* is defined by

    Choosing the semantic properties of language

    19

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    30/138

    wxyz[wRxwRywRzxRyyRzzRx] and the threeobservations aRb, aRc and bRc.

    Example 4: Consider the relation R* on the set {a,b,c,d}described in figure 1.2. The structure of the relation R can beformulated by a sentence of the type v1v2v3v4(v1,v2,v3,v4).Twenty-four different binary relations on have this struc-ture. It is simple to verify that (R*)4.

    Example 5 (Fishburn, Kim and Tetali, 1994):

    This relation,R*, defined on the 5-element set{a,b,c,d,e}satisfies (R*)3. It is nicely defined by a sentence express-ing the property that for everyxthere are precisely two ele-ments beaten byx. The three observations aRb, aRc, andeRb, define the relation through the chain of conclusions{dRa, eRa}, {cRe, dRe}, {bRd, cRd} and, finally, bRc.

    Examples 2 and 5 illustrate that forn3 andn5, a linearordering is not the most economical structure. Are thereany other binary relations with n7 which are defined byless thann1 observations? I am not aware of a completeanswer to this question. However, we do know the follow-ing (this proposition was suggested to me by Noga Alon):

    Proposition 1.3: For any there exists n() such that foranynn() and for any complete and asymmetric relationR on a set ofn elements, (R)(1)n.

    Economics of language

    20

    Figure 1.2

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    31/138

    Thus, at least for large sets, linear orderings are almostoptimal with respect to the criterion of minimizing thenumber of observations required for their definition.

    Comment: Notice that in the above discussion, weallowed the relation to be defined by a formula whichdepends on the number of elements in the set . In con-trast, the properties of linear orderings are expressed by aformula which does not depend on the number of elementsin the set . This leads to the following conjecture:

    Conjecture: Let be a sentence in the language of the cal-culus of predicates which includes a single name of abinary relation R. There exists n* such that if ||n*;then, for any tournament R* which is defined on the set by the sentence , (R*)||1.

    In other words, although one can define a relation forsmall sets with less than ||1 examples, it is conjec-tured that a sentence must be accompanied by at least||1 elements in order to define a tournament when thesize of the set is large enough.

    Comment: We conclude this section with an explanation

    of why the term describability and not learnability is used in this section. In our scenario the father choosesthe examples he presents to his son. The number (R*) isthe minimum number of examples the father has topresent in order to convey enough information for the sonto deduce the content of R*. When choosing the examples,the father knows the relation which he would like his sonto learn. On the other hand, had the son wished to acquirethe content of the relation R* by asking a list of ques-tions of the type what should be chased, a or b?, hewould not necessarily have asked first for the (R*)examples which could convey the relation R*. Forexample, he would need 2.5 inquiries on average in

    order to infer a linear ordering defined on a three-elementset.

    Choosing the semantic properties of language

    21

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    32/138

    1.6 Discussion

    This chapter investigated the observation that in a natural

    language, certain structures of binary relations appearmuch more frequently than others; in particular, we dis-cussed properties that are satisfied by linear orderings. Itwas argued that certain functions of binary relations innatural languages are better served by relations satisfyingthese properties. I believe that this is no more than aninteresting fact. A stronger interpretation of the results

    requires establishing a connection between the optimalityconsiderations presented above and the realization of theoptimal solutions in the real world. Such a connectionwould require at least three premises:

    (1) In order to function, natural languages include only asmall number of structured binary relations.

    (2) Binary relations fulfill several functions in natural lan-guages.

    (3) There are forces (evolution or a planner) which make itmore likely that structures which are optimal withregard to the functions of binary relations will beobserved in natural languages.

    The first premise states that language inherently exhibitsfew of the properties of binary relations. Only if thenumber of possible structures is small can a user of the lan-guage deduce a relations structure from a small number ofinstances in which the relation is used. (It is amazing howfew observations of the type a is better than b are suffi-cient to teach a child that this relation is transitive.)

    The second premise is the central one in this discussion.There are numerous potential criteria by which to measurethe functionality of binary relations. Three such criteriawere examined above. It was argued that certain properties(all shared by linear orderings) perform better according toeach of these criteria.

    The third premise, which links the first two, states that

    either there is a linguistic engineer who chooses theproperties of binary relations so that they function effec-tively or that evolutionary forces select structures which

    Economics of language

    22

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    33/138

    are optimal or nearly so with respect to the functions theyfulfill. This idea, which is popular in economics, has alsobeen noted by philosophers. For example, Quine states: If

    peoples innate spacing of qualities is a gene-linked trait,then the spacing that has made for the most successfulinductions will have tended to predominate throughnatural selection (Quine, 1969, p.126).

    The approach adopted in this chapter is related to thefunctionality of the language approach discussed in lin-guistics (see, for example, Piatelli-Palmarini, 1970) and to

    attempts to explain the classification system in naturallanguage (see Rosch and Lloyd, 1978).

    The discussion in this chapter is also related to classicalphilosophical discussions on natural kinds. The notionof a natural kind emerges from the philosophical inquiryinto the factors which confirm an inductive argument(see, for example, Goodman, 1972, Quine, 1969, Watanabe,1969. A key puzzle in this literature is the so called riddleof induction: Let us say that up to this moment, allobserved emeralds were green. Why does this observationimply that all emeralds are green rather than all emeraldsare grue, an alternative category which includes allobjects which were green up to this moment and blue from

    now on?One possible answer to this question is that the induc-tive process relies on notions of similarity. Inductive argu-ments are made only with regard to categories of similarobjects. The category green contains similar elements;grue does not. A green element yesterday and a greenelement tomorrow are similar; in contrast, a grue element

    yesterday is not similar to a grue element tomorrow.However, this only begs the question since one is left withthe problem of determining the natural similarity rela-tions. Here we run into similar difficulties. One possiblesolution is to argue that two objects are similar if mostunary predicates coincide in satisfying the two objects.The difficulty with this criterion is raised by The Ugly

    Duckling Theorem (see Watanabe, 1969, Section 7.6). Ifthe set of predicates is closed under Boolean operations,then the number of predicates which satisfies any possible

    Choosing the semantic properties of language

    23

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    34/138

    object is constant; thus, the existence of elementary unarypredicates cannot be the basis for explaining the existenceof specific similarity relations. This led Watanabe to con-

    clude that if we acknowledge the empirical existence ofclasses of similar objects, it means that we are attachingnonuniform importance to various predicates, and thatthis weighting has an extra-logical origin (Watanabe,1969, p. 376). Thus, there is no escape from assuming thata certain kind of predicate (like green and not likegrue) has a preferred status called natural kind (see

    Quine, 1969).Within the class of properties of binary relations, linear

    orderings, more than other structures, appear to be of anatural kind. This chapter has attempted to providesome rationale as to why this is so.

    R EF ER EN CES

    Fishburn, P., J.H. Kim and P. Tetali (1994) TournamentCertificates, AT&T Bell Labs, mimeo

    Goodman, N. (1972) Problems and Projects, Indianapolis andNew York: Bobbs-Merrill

    Piattelli-Palmarini, M. (ed.) (1970) Language and Learning: TheDebate between Jean Piaget and Noam Chomsky,

    Cambridge, MA: Harvard University PressQuine, W.V. (1969), Ontological Relativity and Other Essays,

    New York: Columbia University PressRosch, E. and B. Lloyd (eds.) (1978) Cognition and

    Categorization, Hillsdale: Lawrence Erlbaum AssociatesRubinstein, A. (1996) Why are Certain Properties of Binary

    Relations Relatively More Common in Natural

    Language?, Econometrica, 64, 343-56Watanabe, S. (1969), Knowing and Guessing, New York: JohnWiley

    Economics of language

    24

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    35/138

    C H A P T E R 2

    EVOLUTION GIVES M EANING TO

    LANGUAGE

    2.1 The lack of meaning of words in gamesLet us imagine a community of fishermen who makedaily fishing trips onto the lake in small two-man boats.Occasionally, one fisherman notices a rock in the waterwhich requires his partner to take evasive action. In such asituation, the observer needs to warn his partner. It wouldappear that the observer has only to shout Be careful inorder to transmit the information about the obstacle. Butwhy is Be careful indeed interpreted by his partner in theway we understand the phrase rather than as Here is afish or as What a beautiful day it is? This is the ques-tion addressed in this chapter.

    The situation is an interactive one in which the meaning

    of a statement depends on what the speaker thinks that thelistener understands, which in turn depends, in a circularway, on what the listener thinks that the speaker has inmind. Thus, the situation can be analyzed in a game-theo-retic framework. The basic situation will be viewed as asymmetric game. A player can either be an observer or alistener. A strategy is a plan of action for reacting to an

    25

    The chapter is based on variations of Maynard Smiths concept of an ESS

    (see Maynard Smith, 1982). The application of these ideas to the evolution

    of communication protocols was explored by Blume, Kim and Sobel (1993)

    and Warneryd (1993), among others. The ideas in this chapter are closely

    connected to the work of Banerjee and Weibull (1993) and Warneryd

    (1998), who recognized the power of using complexity considerations in

    the evolutionary analysis of language formation. For a formal introduc-

    tion of complexity considerations into game situations, see Rubinstein(1986). For studies of complexity considerations within an evolutionary

    framework, see Binmore and Samuelson (1992) and Binmore, Piccione and

    Samuelson (1998).

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    36/138

    obstacle in front of the boat and responding to the partnerscry while rowing the boat. Note (at this stage informally)that the situation has one equilibrium in which the cry

    is made in the case of an obstacle being spotted and isinterpreted correctly. However, the situation has anotherpooling equilibrium in which an observer ignores therock (remains silent) and is not programmed to respondto his partners shout if there is one.

    The multiplicity of equilibria in this situation is one ofthe most frustrating challenges encountered by game

    theoreticians. In fact, it is difficult to describe communica-tion using game-theoretic models owing to a fundamentalproperty of any game-theory model: The familiar solutionconcepts are invariant to the names of the actions.Consider a game and rename the actions: Whatever game-theoretic solution concept was used for the first game istransformed into an equilibrium of the game with theactions renamed!

    One exception is Farrells solution concepts, which arerelated to cheap talk (see, for example, Farrell, 1993).Farrell assumes that words have afixed external meaning.A listener operates under the assumption that unless aspeaker has a good reason to mislead him, he does not do

    so. Farrell argues for the instability of the pooling equilib-rium in which an observer remains silent and a listenerignores the cry Be careful. This equilibrium will bedestroyed by a fisherman who nevertheless shouts Becareful when observing a rock, forcing the listener todecide whether to believe the observer or not. He reasonsas follows: My partner has no reason to persuade me that

    there is an obstacle in my way unless indeed there is anobstacle because:

    (1) If I believe him, the best course for me is to prevent acrash, an outcome that is beneficial for him as well.

    (2) He gains no benefit from persuading me that there is arisk when there is none.

    Farrells argument is based on the underlying assumptionthat members of the community understand that the callBe careful is meant to indicate the existence of risk. The

    Economics of language

    26

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    37/138

    goal of this research agenda is to explain the existence of aterm with this meaning.

    2.2 The basic evolutionary argument

    The explanation discussed here is provided by evolution-ary game theory. The core idea rests on Maynard Smiths(1982) notion of an Evolutionary Stable Strategy (ESS),according to which evolutionary forces select the moresuccessful modes of behavior. Language is taken to be a

    behavioral phenomenon and if it does not serve the needsof the population, evolutionary forces will act to improveits functioning.

    If there were no term in the language which allowed onemember of the community to alert another, a mutationwould occur whereby a small group of mutants would use acertain costless signal called Be careful to indicatehazard and to correctly respond to such a signal. The muta-tion can be thought of as originating in nature or a collectivedecision by a small group of community members. Then, ifa mutant is matched with another in a boat, the informationregarding the hazard is transferred and utilized. Whether amutant does better than a nonmutant when he is matched

    with a nonmutant depends on the way that the cry Becareful is interpreted by a nonmutant. If, for example, non-mutants respond to this unfamiliar cry in panic, mutantswill, on average, do worse than nonmutated members of thepopulation. In that case, the success of a mutant paired withanother mutant would be outweighed by the significant lossincurred when he interacts with a nonmutant (this is the

    more common case when the mutated population is small).Thus, the explanation of the emergence of a proper meaningfor the cry Be careful depends on providing a persuasiveexplanation as to why unfamiliar words do not causenonmutants to respond harmfully.

    Evolutionary game theory has attempted to resolve thisissue. A simple and, in my opinion, very reasonable line of

    reasoning is presented here from Warneryd (1993) andBanerjee and Weibull (1993). These authors assume thatthe respective evolutionary forces not only depend on the

    Evolution gives meaning to language

    27

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    38/138

    standard payoffs but also on strategy complexity costs.If, in equilibrium, no one shouts the phrase Be careful,then no one would waste his mental resources on being

    prepared to respond to such a cry. It is assumed that evolu-tionary forces act to minimize the waste of mentalresources and eliminate the panic response to the message.Consequently, the term Be careful will remain availablefor mutants to use without suffering a loss from nonmu-tants harmful responses.

    2.3 The basic situation

    Let us now formalize the evolutionary argument outlinedin the previous section. We assume a large homogeneouspopulation whose members are occasionally paired andexamine a situation in which one of them is selected ran-domly to perceive information about a risk which mustbe transmitted to his partner.

    Let Wbe a nonempty set. We will refer to Was the set ofwords, although nothing in the model requires the ele-ments of W to be verbal statements. The interactionbetween the members of the population is modeled as asymmetric game. A strategy in this game has the following

    format:(1) When I perceive risk I will use the word w*W or

    remain silent (shhh, where shhh is not a word in W).(2) When I hear another person saying wW, I will take

    one of three possible actions: I will take action to avoid the hazard (A), the

    correct behavior in the case of a hazard. I will panic (P), the worst possible action under any

    circumstances. I will take the default course of action, D, the best

    course of action in the case of no hazard and thesecond best in the case of a hazard.

    Thus, a strategy for a player in this game has two compo-

    nents: a choice of either shhh or w*Wand a functionr:W{D,A,P}. Note that no decision is made when there isno hazard.

    Economics of language

    28

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    39/138

    It is assumed that there is no conflict of interest betweenany two fishermen so that we are dealing with a coordina-tion game. Given the existence of a hazard, the payoffs

    depend only on whether or not the listener takes an actionto avoid the hazard (payoff 1), remains with the defaultaction (payoff 0), or panics (payoff M, where M is alarge positive number).

    Denote by u(s,s) the expected payoff of a player whouses the strategy s while his partner uses the strategy s,assuming that they have equal chances of being the

    observer. This definition is extended to u(s, ) where is aprobability measure on the set of strategies. A Nash equi-librium in this situation is a strategy s* satisfying that forno s, u(s,s*)u(s*,s*). There are two types of equilibriahere: In a separating equilibrium, a player as an observerof a hazard sends a message w* and on hearing w* hisresponse isA. In the pooling equilibria, no message is sentand the listener plans to respond to a message by ignor-ing it or panicking. The pooling equilibria are of coursebad since valuable information is not being utilized.

    This brief presentation raises the question whether thereare any evolutionary forces which can bring about theemergence of a good equilibrium in which information

    is transmitted and used.

    2.4 Evolutionary forces

    The idea that evolutionary forces can explain the emer-gence of words in the natural language appeared in theeconomic literature as early as Marschak (1965). However,

    Marschak suggested only a direction for research. It is theliterature initiated by Maynard Smith and Price (1973),Maynard Smith (1982), and especially the more contempo-rary literature of evolutionary game theory, which formu-lated a coherent argument for the emergence of words. Thebasic definition is:

    Definition 1: A strategy s* is an Evolutionary StableStrategy (ESS) if for any strategy s either u(s,s*)u(s*,s*)or, u(s,s*)u(s*,s*) and u(s,s)u(s*,s).

    Evolution gives meaning to language

    29

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    40/138

    The rationale for this concept is by now familiar to usersof game theory. A mode of behavior s* will not remain astable unique mode of behavior in a community if, follow-

    ing a mutation of an arbitrarily small fraction of the popu-lation to strategy s, the mutants will not, on average, doworse than the members of the community who have notmutated. The above conditions guarantee that whateverthe extent of the mutation, the mutants will do worse thanthe nonmutants (for every 0 small enough,

    u(s,s(1 )s*)u(s*,s(1)s*)).This standard form of ESS does not eliminate the poolingequilibria. In fact, if W includes more than one elementthen no ESS exists. If s* is an ESS, then any mutationwhich changes the response to words which are not usedwill be a resistible mutation. Thus, for ESS to be a usefulconcept, we must modify its definition. At least two pos-sible directions have been proposed in the literature.

    The first method is to utilize a multi-valued solutionconcept. The following is one such alternative definition(for others of this type see, for example, Weibull, 1995):

    Definition 2: A set of strategies S* is an evolutionary

    stable set if there is a probability measure on S* with fullsupport, such that:

    (1) u(s,s)u* for every s,sS*(2) For any sS*, u(s, )u* or u(s, )u* and u(s,s)

    u(s*,s) for any s*S*.

    Thus, there is one payoff u* such that, in an evolutionary

    stable set, all members of the community receive the sameexpected payoff u*. The fact that all of them have the samepayoff maintains the stability of the set. The criteria forthe success of a mutation are derived from the same ratio-nale as that of the ESS. Mutants will do worse than allstrategies which exist in the stable population of strate-gies. It is easy to verify that the set of all strategies which

    instructs the player as an observer to remain silent is anevolutionary stable set. Essentially, the placing of a highprobability on the strategy which responds to any word by

    Economics of language

    30

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    41/138

    panicking will make any mutant worse off than with anyof the S* strategies.

    Note a difficulty with the above definition of evolution-

    ary stable set. Following the appearance of a mutation, notall strategies in S* will do equally well and thus the distri-bution of strategies in S* may be unstable. My impressionis that other modifications of the ESS, which appear in theliterature, suffer from similar difficulties.

    The second approach to modifying the ESS is to weakenit by demanding that a successful mutation must yield

    strictly higher payoff. This modified version (as Binmoreand Samuelson, 1992, call it) is appropriate when themutation is largely intentional and members are interestednot in maximizing payoffs but in having the highest payoffin the group:

    Definition 3: A strategy s* is a modified evolutionarystable strategy if for any strategy s either u(s,s*)u(s*,s*)or u(s,s*)u(s*,s*) and u(s,s)u(s*,s).

    The strategy dictating the use of the word w* to warn ofa hazard and the response to any message with the actionAis a modified evolutionary stable strategy. However, so isthe strategy to maintain silence (choose shhh) and to

    respond to any message with panic. Thus, the modifiedESS does not provide an explanation of the emergence of awarning.

    2.5 Complexity and evolutionary forces

    This final section of the presentation introduces a new

    element into the model: the complexity of the strategies. Itis assumed that evolutionary forces not only affect thebasic payoffs but also the complexity of strategies. In par-ticular, evolutionary forces eliminate strategies whichdemand resources to respond to never-used messages. Thecosts of complexity put pressure on strategies not toinclude a threat to respond to messages with the

    harmful action P. This clears the way for mutations thattransfer and utilize observed information.

    At this point we should formally define what we mean

    Evolution gives meaning to language

    31

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    42/138

    by the simplicity of a strategy. A strategy is evaluated bythe number of different actions, besides shhh and D,which it plans to use under certain circumstances even

    though they will probably not be realized. Denote the com-plexity of a strategy s by comp(s). The simplest strategy,with complexity 0, dictates that the player chooses shhhas an observer and takes the action D as a listener whateverthe message he receives. It is the simplest strategy in thesense that its execution does not require any resources tomonitor the situation. The most complex strategy is of

    complexity 3. Such a strategy specifies a message to be sentin case of a hazard, a nonempty set of messages to whichthe response is the action A and a nonempty set of mes-sages to which the response is P.

    As mentioned previously, the basic assumption of thissection is that evolutionary forces work in favor of strate-gies that yield either higher payoffs or lower complexity. Itis assumed that the costs of complexity are lexicographi-cally secondary to the payoffs of the original game. Thus,there are two possible modifications of the ESS notion,both of which support the equilibria in which informationis transmitted and used.

    Definition 4 is analogous to definition 2 with the addi-

    tion of the complexity considerations:

    Definition 4: A set of strategies S* is an evolutionarystable set (with complexity considerations) if there is aprobability measure , with the whole set S* as support,such that:

    (1) u(s,s)u* for every s,sS* and all strategies in S* areequally complex.

    (2) For any sS* u(s,)u* or u(s,)u* and for all s*S*either u(s,s)u(s*,s) or [u(s,s)u(s*,s) and comp(s*)comp(s)].

    Thus, for a mutated strategy s to be successful, either (1) itdoes better on average than any of the strategies in S* or,

    (2) it is equally successful on average as any strategy in S*but does better than any strategy in S* when the followingtwo criteria are applied lexicographically: (a) payoff while

    Economics of language

    32

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    43/138

    playing against the mutation s and (b) the complexity ofthe strategy.

    It is clear that there is only one evolutionary stable set

    (with complexity considerations) which includes any strat-egy which instructs the user to utter some word in W* whena hazard is seen and to respond withA to any word in W*.Thus, this concept predicts that information will betransmitted and used, though it has the unattractive featurethat any utterance in W* serves as a meaningful signal.

    The final definition is analogous to definition 3:

    Definition 5: A strategy s* is a modified evolutionarystable strategy with complexity considerations if for anystrategy s either

    u(s,s*)u(s*,s*), or

    [u(s,s*)u(s*,s*) and u(s,s)u(s*,s)] or

    [u(s,s*)u(s*,s*), u(s,s)u(s*,s) and comp(s*)comp(s)].

    Thus a modified evolutionary stable strategy (with com-plexity considerations) is a modified ESS by which mutantsurvival is compared using three criteria which are appliedlexicographically: the payoff while playing against s*, thepayoff while playing against s, and the complexity of the

    strategy.If s* is such a modified ESS (with complexity considera-tions) and if s* assigns shhh to the observer, then thecomplexity considerations dictate that the response of s*to any message is D. It follows that a mutation s whichuses the word wto point out a risk and responds to wwith

    A, will be a successful mutation.

    If a modified ESS (with complexity considerations), s*,dictates that w* be uttered when a hazard is observed thenit must be that s* responds to w* withA. According to thecomplexity considerations, there are no circumstancesunder which any word will trigger the action P and thus s*must dictate that an observer utters w* when a hazard isobserved and a listener responds withA to any message in

    some set W* which contains w*. It is easy to verify thatsuch a strategy is indeed a modified evolutionary stablestrategy (with complexity considerations).

    Evolution gives meaning to language

    33

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    44/138

    2.6 A final comment

    This chapter was meant to provide a short overview of the

    basic evolutionary game-theoretic explanation for theemergence of words (terms) in natural language. It is debat-able whether the model is persuasive. I do not have a firmopinion on this issue. However, I would like to raise threepoints of criticism:

    First, the connection between biological powers andwhat we model as evolutionary forces is far from being

    clear. The definitions are not motivated by biological evi-dence or by intuition.

    Second, the scenario analyzed here is not sufficientlygeneral. The emergence of the term Be careful is ana-lyzed independently of the other elements of the language.In our analysis we classify the situation of two membersgoing on a boat trip as a well defined category. We assumethe existence of a well defined meaning for the notionthere is a hazard. We assume that the listener hasexactly three possible responses. The only element to bedetermined in the model is whether the observer shouts awarning when a hazard is observed and whether the shoutis interpreted correctly. A persuasive explanation of the

    emergence of a linguistic concept requires a much moregeneral setting. One may claim that this criticism is appli-cable to any game-theoretic model. Indeed models of gametheory are usually as simplistic and specialized as the onepresented here. However, I believe that models of gametheory are meant to describe and analyze human reasoning and do not directly deal with reality. Since people focus

    on a small number of factors in decision making, gametheory is able to use simplified models to explain behavior.In contrast, evolutionary theories are intended to explainreal-world phenomena and thus need to be constructed in amore general setting.

    Finally, the explanation of such phenomena using evolu-tionary forces requires an explanation not only for the exis-

    tence of human language but also for the lack thereof amonganimals. It is interesting to note that traces of this criticismalready appeared in the writings of Adam Smith (see Levy,

    Economics of language

    34

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    45/138

    1992), for an extensive discussion of this issue. In chapter IIof book I, Smith discusses the fact that while human beingsconduct trade, Nobody ever saw a dog make a fair and

    deliberate exchange of one bone for another with anotherdog. Adam Smith points out that trade requires language.He attributes the existence of flexible language amonghuman beings to their hardware, which accounts for thedifference between humans and animals. While AdamSmith discusses the link between the evolution of trade andthe existence of language, the approach discussed in this

    chapter has attempted to explain the evolution of language.So what then explains the difference between humans andanimals with regard to the development of language?Havent we simply replaced one puzzle with another?

    R EF ER EN CES

    Banerjee, A. and J.W. Weibull (1993) Evolutionary Selectionwith Discriminating Players, Stockholm: The IndustrialInstitute for Economic and Social Research

    Binmore, K., M. Piccione and L. Samuelson (1998)Evolutionary Stability in Alternating-offers BargainingGames, Journal of Economic Theory, 80, 25791

    Binmore, K. and L. Samuelson (1992) Evolutionary Stability in

    Repeated Games Played by Finite Automata, Journal ofEconomic Theory, 57, 278305

    Blume, A., Y.-G., Kim and J. Sobel (1993), EvolutionaryStability in Games of Communication, Games andEconomic Behavior, 5, 54775

    Farrell, J. (1993) Meaning and Credibility in Cheap-talkGames, Games and Economic Behavior, 5, 51431

    Levy, D. (1992) The Economic Ideas of Ordinary People,London: RoutledgeMarschak, J. (1965) The Economics of Language, Behavioral

    Science, 10, 13540Maynard Smith, J. (1982) Evolution and the Theory of Games,

    Cambridge: Cambridge University PressMaynard Smith, J. and G.R. Price (1973) The Logic of Animal

    Conflict, Science, 246, 1518Rubinstein, A. (1986) Finite Automata Play the Repeated

    Prisoners Dilemma, Journal of Economic Theory, 39,8396

    Evolution gives meaning to language

    35

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    46/138

    Warneryd, K. (1993) Cheap Talk, Coordination and EconomicStability, Games and Economic Behavior, 5, 53246

    (1998) Communication, Complexity and Evolutionary

    Stability, International Journal of Game Theory, 27,599609Weibull, J. (1995) Evolutionary Game Theory, Cambridge, MA:

    MIT Press

    Economics of language

    36

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    47/138

    C H A P T E R 3

    STRATEGIC CONSID ERATIONS IN

    PRAGMATICS

    3.1 Grices principles and game theoryThe study of languages (see Levinson, 1983) is traditionallydivided into three domains: syntax, semantics, and prag-matics. Syntax is the study of language as a collection ofsymbols detached from their interpretation. Semanticsstudies the rules by which an interpretation is assigned toa sentence independently of the context in which the sen-tence is uttered. This chapter deals with the domain ofpragmatics.

    Pragmatics examines the influence of context on theinterpretation of an utterance. An utterance is viewed asa signal which conveys information within a context. Thecontext comprises the speaker, the hearer, the place, the

    time, and so forth. How the hearer views the intentions ofthe speaker and how the speaker views the presuppositionsof the hearer are relevant to the understanding of an utter-ance. Thus, in game-theoretic terms, the way in which anutterance is commonly understood may be thought of asan equilibrium outcome of a game between speakers of alanguage.

    To illustrate the type of phenomena explained by prag-matics, consider the following natural language conversa-tions:

    Example 1: A telephone conversation betweenA at homeand his friend B, calling from a telephone booth:

    37

    Sections 3.23.4 are based on Glazer and Rubinstein (1997). For basic read-

    ings on pragmatics see Levinson (1983). Grice (1989) is a collection of

    Grices papers on the subject.

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    48/138

    A: B, I am about to go for a walk. What is the weather likeoutside?

    B: It is not raining heavily now.

    Normally,A concludes fromBs statement that it is rainingbut not heavily. This conclusion does not follow from thesemantic interpretation of the sentence its not rainingheavily, which allows for the possibility that it is rainingbut not raining heavily as well as its not raining at all.Furthermore, there are circumstances in which the utterance

    its not raining heavily will indeed be interpreted as notexcluding the possibility that it is not raining at all. Forexample, imagine thatB is in a hut and has toldA that it isdark outside and that he is buried under his blanket. In such acase,B would only be aware of rain that is pouring downand hitting the roof loudly. In thiscaseA could infer only thatit is not raining heavily from Bs statement since B would not

    know whether it was raining lightly or not raining at all.Example 2:

    A: Where does C live? I need to reach him urgently.B: C is somewhere in New York.

    Unless A suspects that B is trying to prevent him fromreaching C, B will normally conclude that A really doesnot know Cs exact address, although he did not say soexplicitly.

    Example 3:

    A, a married man, to B: I went out with a woman lastnight.

    As statement is normally understood to mean that hedated someone who is not his wife although he did notexplicitly say so.

    Example 4:

    A: What do you see?B: It is not a rose.

    Ordinarily,A will understand that B is looking at a flowerwhich is not a rose, although B has not said that he islooking at a flower.

    Economics of language

    38

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    49/138

    In these examples, the fact that a statement appearedwithin a conversation between two people gives the sen-tence a different meaning than if it were interpreted in iso-

    lation. A theory which attempts to describe the rules forinterpreting conversational utterances was suggested bythe philosopher Paul Grice in the 1960s (see especially hisarticle Logic and Conversation in Grice, 1989) asfollows:

    while it is no doubt true that the formal devices are especially

    amenable to systematic treatment by the logician, it remainsthe case that there are very many inferences and arguments,

    expressed in natural language and not in terms of these devices,

    which are nevertheless recognizably valid. So there must be a

    place for unsimplified logic of the natural counterparts of

    these devices. (pp. 234)

    According to Grice (1989), speakers of a language areguided by the cooperative principle:

    Make your conversational contribution such as is required, at

    the stage at which it occurs, by the accepted purposes or direc-

    tion of the talk exchange in which you are engaged. (p. 26)

    Grice suggests four maxims which in general yield

    results in accordance with the cooperative principle:

    Quantity: Make your contribution as informative as isrequired Dont make your contribution more informa-tive than is required.

    Quality: Try to make your contribution one that istrue 1. Do not say what you believe to be false. 2. Do notsay that for which you lack adequate evidence.

    Relation: Be relevant.

    Manner: Be perspicuous (avoid obscurity, avoid ambigu-

    ity, be brief and be orderly).

    A central concept in Grices theory is that of conversa-tional implicature:

    Strategic considerations in pragmatics

    39

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    50/138

    A man who, by saying thatp has implicated q, may be said to

    have conversationally implicated that q, provided that (1) he is

    presumed to be observing the conversational maxims, or at

    least the Cooperative Principle; (2) the supposition that he isaware that, or thinks that, q is required in order to make his

    saying or making as if to say p, consistent with this presump-

    tion; and (3) the speaker thinks (and would expect the hearer to

    think that the speaker thinks) that it is within the competence

    of the hearer to work out, or grasp intuitively, that the supposi-

    tion mentioned in (2) is required. (1989, pp. 301)

    The inference mechanism which is used in conversa-tional implicature is described by Grice as follows:

    He had said thatp; there is no reason to suppose that he is not

    observing the maxims, or at least the Cooperative Principle; he

    could not be doing this unless he thought that q; he knows (and

    knows that I know that he knows) that I can see that the suppo-

    sition that he thinks that q is required; he has done nothing tostop me from thinking that q; he intends me to think, or is at

    least willing to allow me to think, that q; and so he has impli-

    cated that q. (1989, p. 31)

    Following are some examples of how Grices theory isapplied:

    Example 5: A is a driver who stops in a remote village andsees B who looks clearly a local.

    A: I am out of petrol.B: There is a garage around the corner.

    As line of thought might be as follows:

    There is no reason for me not to believe that B is co-operative.

    Bs utterance must not interfere with the maxim be rel-evant.

    It must be that B means that petrol is sold at thatgarage.

    Let us return to example 2: Unless B wants to preventAfrom reaching C, the maxim of quality seems to be vio-lated, unless B does not know Cs address. In example 3,

    Economics of language

    40

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    51/138

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    52/138

    not raining heavily is the relevant information, whywould he not say so if it were not raining at all?

    Students of game theory will find this discussion to be

    in the spirit of game-theoretic reasoning. Note, in particu-lar, the considerations which Grice imposes on the lis-tener: he (the speaker) could not be doing this unless hethought that q; he knows (and knows that I know that heknows) that I can see that the supposition that he thinksthat q, is required In other words, Grices theory isessentially a description of one agent thinking about how

    another is thinking. This is precisely the definition of stra-tegic reasoning and is the essence of game theory (for anearly paper which gives an account of Gricean communi-cation using the framework of game theory see Parikh,1991).

    As Grice noted, the theory may also be applicable toother forms of interactions between informed and unin-formed agents, in which the informed agent takes anaction (not necessarily an utterance) with the intention ofinfluencing the hearers subsequent actions. The similar-ity to game theory is clear.

    Furthermore, if game theory is to shed light on real-lifephenomena, linguistic phenomena are the most promising

    candidates. Game-theoretical solution concepts are mostsuited to stable real-life situations which are playedoften by large populations of players. Thus, game theoretictools may function most effectively when used to explainlinguistic phenomena.

    3.2 Debates

    To demonstrate the possible applications of game-theoretic methods in explaining Gricean phenomena, theremainder of this chapter will focus on a phenomenonrelated to debates. In this chapter, a debate will refer to asituation in which two parties who disagree regardingsome issue raise arguments in an attempt to persuade a

    third party of their positions. This is, of course, not theonly possible form of a debate. In some cases, the intentionof the debaters is to argue just for the sake of arguing. In

    Economics of language

    42

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    53/138

    other cases, their aim is to influence one another ratherthan a third party. Debates can be thought of as a specialform of conversation in which the parties have different

    interests. Debates differ from other mechanisms of con-flict resolution, such as bargaining and war, since poweris not used (or is used less) in debates to influence theoutcome. Although debates are common in real life, theyhave rarely been investigated within the economic andgame theoretic literature (notable exceptions are Shin,1994; Lipman and Seppi, 1995 and Spector, 2000).

    The aim of this chapter is to provide an explanation ofthe following phenomenon, often observed in debates: Twoarguments may have a different degree of persuasiveness ascounter-arguments even though they would be viewed asequally persuasive when stated in isolation. At the outset,we should point out that the Gricean theory does not applyto debates since Grices logic of conversation is based onthe principle of cooperation, which does not apply to theconflict of interests characteristic of a debate.

    To initiate the discussion, the results of a survey con-ducted among several groups of students at Tel-AvivUniversity will be presented. The following question waspresented to one group of students:

    Question 1: You are participating in a public debate on thelevel of education in the worlds capitals. You are trying toconvince the audience that in most capital cities, the levelof education has recently increased. Your opponent is chal-lenging you with indisputable evidence showing that thelevel of education in Bangkok has deteriorated. Nowit is your turn to respond. You have similar indisputableevidence to show that the level of education in MexicoCity, Manila, Cairo, and Brussels has increased. However,owing to time constraints, you can present evidence foronly one of the four cities. Which city would you choose tomake the strongest counter-argument to the Bangkokresults?

    Another group of subjects was presented with question 1with the modification that Bangkok was replaced byAmsterdam. To verify that the subjects regarded the four

    Strategic considerations in pragmatics

    43

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    54/138

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    55/138

    the debate by quoting Tuesdays report, which recom-mends channel As programs. The third party interruptshim and asks you to reply. Both Wednesdays and

    Thursdays reports are in your favor i.e. they recommendChannel B. However, you have time to present only one ofthese reports as a counter-argument, following which thethird party will make up his mind. Would you choose thereport for Wednesday or for Thursday as a better counter-argument to Tuesday?

    The results were quite clear: About 70 percent of the 58subjects felt that Wednesday would be a better counter-argument than Thursday. Note that Thursday is the lastweekday in Israel and hence, one would expect it to bemore significant than Wednesday.

    The survey results are puzzling. If two argumentscontain information of the same quality, why is one con-sidered to be a stronger counter-argument than the other?The fact that Manila is closer to Bangkok than it is toMexico City seems irrelevant to the substance of thedebate, yet it appears to dramatically affect the choiceof the better counter-argument. Similarly, althoughWednesday is less significant than Thursday with regard to

    TV programming, it is assessed as a better counter-argument against Tuesday.We believe that this phenomenon is connected to prag-

    matics. A debaters response to Bangkok with anycounter-argument other than Manila is interpreted as anadmission that Manila is also an argument in favor of theopponents position. In the context of question 3, if one

    responds to Tuesday with the counter-argument Thursday,rather than Wednesday, it is considered an admission thatWednesdays report is not in his favor. The fact that thesentence On Thursday Channel B is superior is utteredin a debate as a counter-argument to On Tuesday Channel

    A is superior gives the sentence a meaning different thanif the statement were made in isolation.

    In what follows, I have no pretensions to fully explainthe rules by which people compare arguments. The soletarget is to point out that the logic of debate should not

    Strategic considerations in pragmatics

    45

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    56/138

    necessarily include an axiom stating that if an argumentxis superior to argumenty(in the sense of being a successfulcounter-argument against it) then argumentyshould not

    be superior to argumentx.

    3.3 The model

    We view a debate as a mechanism designed to extractinformation from debaters. We assume that the aim of thedesigner is to increase the probability that the right con-clusion will be drawn by the listener subject to the con-

    straints imposed on the debaters and the listener in termsof time and cognitive abilities. It will be shown that fulfill-ing this aim may lead to debating rules which do not treatarguments and counter-arguments symmetrically.

    The setup is very simple: An uninformedlistener has tochoose between two outcomes, O1 and O2. The correctoutcome, from his point of view, is determined by fiveaspects, numbered 1, , 5. An aspecti may be realized aseither 1 or 2. When an aspect i receives the value j itimplies that aspect i possibly supports the outcome Oj(j1,2). A state (j)j1,,5 is a five-tuple of 1s and 2swhich describes the realizations of the five aspects. Thelistener assigns equal weights to all five aspects, and the

    correct outcome in state , C(), is the one which is sup-ported by the majority of the arguments.While the listener is ignorant of the state, the two debat-

    ers, named debater 1 and debater 2, have full informationabout the state. There exists a conflict of interests betweenthe two debaters and the listener: Each debater i wishesthat outcome Oi be chosen, whatever the state, whereas

    the listener wants the correct outcome to be chosen.A debate is defined to be a mechanism in which each

    debater reveals information in order to persuade the lis-tener to choose his preferred outcome. A debate consists oftwo elements: The procedural rule which specifies theorder and types of arguments which are permitted and the

    persuasion rule which specifies the relationship between

    the arguments presented and the listeners conclusion.The language that the debaters are permitted to use

    must be specified. It is assumed that debaters cannot make

    Economics of language

    46

  • 8/12/2019 Ariel Rubinstein Economics and Language Five Essays 2000

    57/138

    any moves other than raising arguments of the type argu-ment j supports me. Thus, a debater cannot raise argu-ments that support the outcome preferred by the other

    debater. We will further assume that the debaters mustprove their claims i.e. debater i cannot claim that thevalue of aspect j isi unless it is indeed so.

    The optimal debate should have an effective constrainton its length. Of course, if it is possible for three argumentsto be raised during the debate (that is, there is enough timefor raising three arguments and the listener has the cogni-

    tive ability to digest them all), then the listener can obtainthe correct outcome with certainty. He can accomplishthis by requesting that one of the debaters present threearguments the debater will win the debate if and only ifhe fulfills this task. Thus, for the length constraint to beeffective, it is assumed that the number of argumentswhich can be raised during a debate is constrained to two.

    Formally, we define a debate to be an extensive gameform in which:

    (1) The set of feasible moves for each debater is a subset of{1, , 5}.

    (2) There can be at most two moves: Either only one of thedebaters is allowed to make at most two arguments (the

    one-speaker debate); or the two debaters move simulta-neously, each one making at most one argument (thesimultaneous debate); or the debate is a two-stageprocess in which one debater is allowed to make at mostone argument in each stage (thesequential debate).

    (3) One of the outcomes, O1 or O2, is attached to each ter-minal history.

    When debater j presents an argument to answer a previousargument of debateri, debater js argument is referred to asa counter-argument.

    A debate and a state determine a game () whichwill be played by the two debaters when the state is . Thetwo-player game form () is obtained from by deleting,

    for each debater i, all aspects which do not support hisposition in . If playeri has to move following a historyhand if at none of the argu