43
ATheory of Disease and Development: Productivity, Exchange, and Social Segregation Javier A. Birchenall University of California at Santa Barbara March 13, 2020 Abstract This paper studies the inuence of disease on the social organization of production and exchange. I consider an assignment problem in which disease spillovers (i.e., conta- gion) induce social segregation. The eects of social segregation on factor productivity and output per capita are characterized across production and trading arrangements with a dierential scope for disease spillovers. Historical accounts describe numerous instances in which past societies have dealt with disease through social segregation in production and exchange. Some of these instances are catalogued here. Keywords: Disease, Economic Development, Total Factor Productivity JEL classication: E23, I15, O40. Communications to: Javier A. Birchenall Department of Economics, 2127 North Hall University of California, Santa Barbara CA 93106 Phone/fax: (805) 893-5275 [email protected] I would like to thank the seminar participants at the University of Chicago, University of California Merced, Yonsei University, American University of Beirut, and UC Santa Barbara Symposium on Inequality for their many comments. I have, over the years, also beneted from comments and remarks by Hoyt Bleakley, Robert Fogel, Pete Klenow, Shelly Lundberg, Raaj Sah, and Alwyn Young.

theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

Page 1: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

A Theory of Disease and Development:Productivity, Exchange, and Social

Segregation

Javier A. Birchenall∗

University of California at Santa Barbara

March 13, 2020

Abstract

This paper studies the influence of disease on the social organization of production

and exchange. I consider an assignment problem in which disease spillovers (i.e., conta-

gion) induce social segregation. The effects of social segregation on factor productivity

and output per capita are characterized across production and trading arrangements

with a differential scope for disease spillovers. Historical accounts describe numerous

instances in which past societies have dealt with disease through social segregation in

production and exchange. Some of these instances are catalogued here.

Keywords: Disease, Economic Development, Total Factor Productivity

JEL classification: E23, I15, O40.

Communications to: Javier A. Birchenall

Department of Economics, 2127 North Hall

University of California, Santa Barbara CA 93106

Phone/fax: (805) 893-5275

[email protected]

∗I would like to thank the seminar participants at the University of Chicago, University of CaliforniaMerced, Yonsei University, American University of Beirut, and UC Santa Barbara Symposium on Inequality

for their many comments. I have, over the years, also benefited from comments and remarks by Hoyt

Bleakley, Robert Fogel, Pete Klenow, Shelly Lundberg, Raaj Sah, and Alwyn Young.

Page 2: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

1 Introduction

This paper proposes a theory of disease and economic development. Its focus is not on

how individuals respond to disease, but on the effect of disease spillovers (i.e., contagion)

and social segregation on aggregate output. From an individual perspective, disease is a

biological process. On physiological grounds, infectious and parasitic diseases reduce the

amount of energy available for work and study; see, e.g., Fogel (1994), Miguel and Kremer

(2004), Bleakley (2007, 2010), and Weil (2007). Disease, however, rarely affects individuals

in isolation. Throughout history, endemic and epidemic diseases have typically acted on

large groups of individuals. The goal of this paper is to study the influence of disease on

the social organization of production and exchange.

I consider a highly stylized assignment economy. There are two types of individuals,

healthy and diseased. All individuals are endowed with the same physical capital. Produc-

tion and exchange are social activities. Individuals meet randomly, and production takes

place in pairwise meetings. Disease spillovers take the form of contagion when individu-

als meet. Individuals must be assigned across a continuum of communities. Communities

are then consolidated into an aggregate economy. I find that when disease spillovers are

sufficiently negative, relative to the diminishing returns to labor, it is (socially) efficient to

segregate healthy and diseased individuals (Proposition 1). I also find that disease acts as

a non-technological influence in total factor productivity. Aggregating across production

units reduces measured total factor productivity relative to the technological possibilities

in the economy (Proposition 2). I characterize the productivity effect of disease in many

alternative production and exchange settings that differ in their degree of centralization and

disease exposure (Propositions 3 to 6). I extend the baseline assignment model in multiple

directions (i.e., dynamic disease spillovers and a multi-class society).

Historians have long recognized the importance of disease environments in shaping social

and economic institutions; see notably McNeill (1976).1 I catalogue how past societies

sought to avoid contagion through social segregation. These historical accounts include the

1Briefly stated, the central argument in McNeill (1976) is that human populations and their disease

environments reach a stable equilibrium which, when disrupted (i.e., because of trade or travel), motivates

social, economic, demographic, and political changes. Crosby (1972) and Zinsser (1935) are equally relevant

discussions on the ‘disruptive’ role of disease.

2

Page 3: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

caste system in India in which direct and indirect physical contact was socially regulated;

legal quarantines and isolation in medieval Europe where trade was temporarily restricted;

peripheral markets in pre-colonial Africa in which trade was ‘silent’ and featured no social

interactions; and the emergence of high altitude cities in pre-Columbian empires in the New

World which faced less hostile environments than coastal areas. These historical perspectives

are consistent with the role of disease sketched by the model as their scope is not about

individual responses. The social organization of production and exchange in the past is

still relevant today. The social influence of disease is often path-dependent and difficult to

change.

Economists have, on the other hand, focused on the biological and demographic effect

of disease.2 The biological response to disease has a proximate effect on aggregate output.

Weil (2007) studied the significance of health human capital in development accounting. His

measurement strategy treats health human capital in efficiency units of labor. Efficiency

units, however, are inconsistent with disease spillovers. Under efficiency units, the spatial

distribution of the population is irrelevant; only mean (or total) health human matters.

Jones (2014) used a general aggregator to study distributional effects in development ac-

counting for educational outcomes; see also Caselli and Ciccone (2019). I build on Weil’s

(2007) measurement strategy. I find that the productivity effect of disease is potentially

large. Not only has mean health human capital increased over time, but health dispar-

ities have narrowed across individuals and countries improving aggregate efficiency. The

development accounting presented here is tentative and conservative. In the absence of a

direct estimate of disease spillovers, I use microeconomic estimates for the substitutability

between skilled and unskilled labor in terms of educational attainment. These estimates do

not properly measure disease spillovers.

Links between disease and long-run economic and political outcomes beyond those just

2Demographic effects are linked to the burden of higher population growth. Disease loads that reduce

population growth are bound to increase temporarily output per capita due to diminishing returns to labor;

see, e.g., Acemoglu and Johnson (2007), Birchenall (2007), and Young (2005). The influence of disease on

education and physical capital has been amply studied; see, e.g., Bleakley (2007, 2010), Bleakley and Lange

(2005), Evans and Miguel (2005), Miguel and Kremer (2004), and Soares (2005). Disease influences capital

markets by influencing, among other aspects, savings for retirement and education; see, e.g., De Nardi et

al. (2009). Bloom et al. (2019) seeks to reconcile the empirical effects of health on economic growth at

disaggregate and aggregate levels.

3

Page 4: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

noted include the colonial origins hypothesis advanced by Acemoglu et al. (2001) and

Alsan’s (2015) detailed case study of the TseTse fly on the political centralization and

demography of sub-Saharan Africa. Work in this area, however, is highly empirical. There

is also a large literature in economics that examines socioeconomic segregation but without

touching on the role of disease; see, e.g., Schelling (1971) for a foundational work on sorting

and segregation. Segregation is often examined in the realm of education and location

theory; see, e.g., Benabou (1996), Kremer and Maskin (1996), Fernández and Rogerson

(2001), and Becker and Murphy (2003).

Fogli and Veldkamp (2018) proposed a network approach to study the effect of different

social networks on aggregate output and technology adoption. Their model is dynamic

and very rich in terms of the social connections between individuals. In their model, the

diffusion of ideas and germs through the population serves to trace the structure of the

social network, and the structure of the social network serves to further diffuse ideas and

germs in the population. The rich dynamics and social structure make the characterization

and implementation of a network setting difficult.3 Overall, Fogli and Veldkamp’s (2018, p.

24) “bottom line is that the way in which networks affect economic growth depends on the

disease environment.” This paper makes the social structure much simpler, so the effects of

the disease environment on production and exchange protocols are very tractable. I have

also taken the level of technology as exogenous. In the present model, disease acts as non-

technological influence in productivity. In here, capital and labor markets tackle disease

spillovers through a “social barrier.” This “social barrier” is endogenous. The absence of

factor price equalization across communities or individuals of different types is thus not

indicative of misallocation as it is in Hsieh et al. (2019), for example.

3Fogli and Veldkamp (2018) characterize the average path length of the network, i.e., the distance between

nodes, and limiting properties of the network, i.e., the existence of a stationary disease-free. The mapping

between social networks across countries is also challenging. Cross-country measures of individualism-

collectivism, individual mobility, and friendship links are drawn from attitude surveys made by a multina-

tional US company (IBM) in host countries, the cross-state mobility of first-generation US immigrants from

different countries, and the number of close friends US residents with ancestors from different countries

report. Their framework involves many moving parts. For example, since high-diffusion networks help

technology adoption but foster disease, Fogli and Veldkamp (2018, p.22) find quantitatively that the effect

of high-diffusion networks on technology and output “can be positive, negative, or zero.”

4

Page 5: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

2 Baseline model

This section studies the emergence of segregation and stratification in a two-class society.

Individuals differ in their endowment of health human capital. Production and exchange

are social activities subject to contagion. The baseline model exposes channels, many not

tacitly recognized in economics, through which disease determines the social organization

of production and exchange. Section 3 contains remarks and extensions. Section 4 offers

some historical perspectives consistent with the theory’s predictions.

Individuals and communities. There are individuals and a continuum of commu-

nities (e.g., ethnicities, religions, linguistic groups, or other social divisions) of measure 1.

Each individual is endowed with the same amount of physical capital, = . Capital is

tradable. Aggregate capital is . Individuals are of two types, healthy and diseased. Their

respective health human capitals are denoted by and . Healthy individuals have higher

labor productivity than diseased individuals:

Assumption 1 Health human capital satisfies ≥ 0.

There is a given number of healthy individuals in the economy, Γ ≡ . The number

of diseased individuals is − Γ = (1 − ) with ∈ (0 1). The disease prevalence rateis 1− . The economic problem is the assignment of individuals across communities. The

number of healthy individuals to be assigned to community is () , where () ∈ [0 1] isthe fraction of healthy individuals in the community. The assignment’s feasibility condition

is Z 1

0

() = . (1)

Within each community, production takes place in pairs of randomly matched individ-

uals. Individuals cannot produce in isolation. The desirability of pairwise meetings is dis-

cussed below (Proposition 3). When two individuals meet, they pool their physical capital.

By construction, the pair’s pooled capital is the (arithmetic) mean of the two individual’s

capital, i.e., = ( + )2. A pair of randomly matched individuals may have different

health human capitals. The health human capital of a matched pair is also pooled, but not

necessarily linearly. The way health human capital is averaged depends on disease spillovers.

5

Page 6: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

I assume that when individuals of type and meet, the pair’s health human capital is

( ) = [( + )2]1, (2)

where the aggregator ( ) is: linearly homogeneous, ( ) = ( ) for all 0;

symmetric, ( ) = ( ); and representative, ( ) = and ( ) = for all .

The pairwise human capital aggregator ( ) is increasing in and yields a variety

of common means as varies. For example, (2) satisfies: −∞( ) = min{ } = ;

−1( ) = 2()( + ), e.g., the harmonic mean of and ; 0( ) =√, e.g., the

geometric mean of and ; 1( ) = (+ )2, e.g., the arithmetic mean of and ; and

∞( ) = max{ } = . As will be evident below (Proposition 1), the parameter is

central to determine the social organization of production.4

Production takes place with a Cobb-Douglas production function. The level of technol-

ogy is given and denoted by 0. When healthy individuals meet they produce ( ) =

1− with ∈ (0 1); when diseased individuals meet they produce ( ) = 1−;

and when a healthy and a diseased individual meet they produce ( ) = [( )]1−

or

( ) = [( + )2](1−). (3)

The probability that a healthy individual meets another healthy individual is propor-

tional to the fraction of healthy individuals in the community, (). Thus, the meeting

probability between two healthy individuals is ()2. Likewise, the meeting probability be-

tween two diseased individuals is (1−())2. The meeting probability between healthy and

diseased individuals is 2()(1−()), since there are two (symmetric) ways such meetings

could occur. (Obviously, ()2 + 2()(1− ()) + (1− ())2 = 1.)

Community ’s expected output per capita, denoted by (()), is

(()) ≡ ©()2( ) + 2()(1− ())( ) + (1− ())2( )ª. (4)

4Assuming that capital is linearly aggregated in a pairwise meeting is without loss of generality. Assuming

that a meeting’s physical capital is ( ) = leaves all results intact.

6

Page 7: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

Disease spillovers and social segregation. Let ∗() denote the optimal value of

(). There are only two ways to assign individuals across communities: healthy and

diseased individuals should be either integrated or segregated. Segregation makes sense

when a healthy individual’s productivity increases more when paired with another healthy

individual than when paired with a diseased one, e.g., when disease exert negative spillovers.

The following assumption captures these spillovers.

Assumption 2 The parameters and satisfy 1− .

The social organization of production is characterized by the next proposition.

Proposition 1 Under Assumptions 1 and 2, it is (socially) optimal to segregate healthy

and diseased individuals. That is, ∗() = 1 in a measure of communities and ∗() = 0

in a measure 1− of communities.

Proof. The Lagrangean of the assignment problem is

L ≡Z 1

0

(())+

∙ −

Z 1

0

()

¸+

Z 1

0

{0()() + 1()[1− ()]}, (5)

where is the Lagrange multiplier on (1), and 0() and 1() are the Lagrange multipliers

on () ≥ 0 and 1− () ≥ 0, respectively. The Kuhn-Tucker conditions are

(∗())()

− + 0()− 1() = 0, (6)

∗() ≥ 0, 1− ∗() ≥ 0, 0()∗() = 0, 1()[1− ∗()] = 0, 0() ≥ 0, and 1() ≥ 0.Since the constraints are linear, the nature of the assignment depends on concavity-

convexity of (()) in (5). If (()) is strictly convex in (), the only solution to (6) is

∗() = {0 1}, where the measure of all communities with ∗() = 1 equals to satisfy (1).Convexity requires 2(∗())()2 = ( ) − 2( ) + ( ) 0. Using (3), this

condition is

[( + ) 2]1 [(1− + 1−)2]1(1−). (7)

By Jensen’s inequality, (7) requires 1− (Assumption 2).

7

Page 8: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

Proposition 1 is the foundation of the rest of this section. It shows that disease spillovers

determine the social organization of production. In particular, the type of sorting of healthy

and diseased individuals into communities depends on the concavity-convexity of output

per capita with respect to (). Due to Assumption 2, (()) is strictly convex in ().

Thus output per capita is maximized by a corner solution in which each community has

only healthy or diseased individuals with no room for mixing. The specific distribution of

these segregated communities along [0 1] is indeterminate. For instance, an assignment with

∗() = 1 for ∈ [0 ] and ∗() = 0 for ∈ ( 1] where all healthy pairs are ‘clustered’is equivalent to a ‘polarized’ assignment with ∗() = 1 for ∈ [0 2] ∪ [1 − 2 1] and

∗() = 0 for ∈ (2 1 − 2) where the (equally-sized) segregated communities are in

the extremes of the [0 1] interval.

Under randommatching, the sorting condition (Assumption 2) takes a simple form. Only

and determine if the assignments leads to segregation or integration. The assignment

is such that even a small difference between 1− and is sufficient to induce segregation.

To illustrate the tension between and , consider two values of . First, under = 1, (2)

becomes the arithmetic mean of and : 1( ) = ( + )2. Assumption 2 (and hence

Proposition 1) does not hold in this case. The function (()) is strictly concave in ()

so it is optimal to integrate healthy and diseased individuals by setting ∗() = 1 for

all ∈ [0 1]. In this case, the disease spillovers are small. To minimize the negative impactof diminishing returns that arise when healthy workers are paired together, the efficient

assignment requires social integration.

Consider next = −∞, which exemplifies Kremer’s (1993) O-ring technology. Thisis a special case of Assumption 2 that does not require particular assumptions about

the production function. Meeting a diseased individual renders healthy individuals un-

productive, e.g., (2) becomes −∞( ) = min{ } = , and (4) becomes (()) =

()2( ) + (1− ()2)( ). Since ( ) ( ), it is optimal to ‘protect’ healthy

individuals by setting ∗() = 1 in the largest possible number of communities.

As these examples show, the tension between 1− and in Assumption 2 is a tension

between two negative forces: diminishing returns to scale in production and (negative)

disease spillovers in pairwise matching.

8

Page 9: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

Output and aggregate efficiency. Social segregation (Proposition 1) matters for

aggregate output and aggregate production efficiency, i.e., measured aggregate total factor

productivity (TFP). Let ≡ + (1− ) denote mean health human capital, and let the

superscript () denote the allocation under segregation. For example, is aggregate output

per capita, i.e., the aggregate of (∗()) across segregated communities,

≡Z 1

0

(∗()). (8)

Aggregate output is ≡ .

Proposition 2 Under social segregation, there is an aggregate Cobb-Douglas production

function = ()1−, where aggregate production efficiency satisfies

"

µ

¶1−+ (1− )

µ

¶1−#, with . (9)

Proof. Using ∗() in (4) yields (∗()) = ( ) + (1 − )( ). This and (8)

yield = [1− + (1− )1−]. Simple rearrangements and (9) yield = 1−,

from which follows. The inequality in (9) follows from Jensen’s inequality:

1− + (1− )1− [+ (1− )]1− ≡ 1− for ∈ (0 1).The aggregate production function is Cobb-Douglas. The aggregate labor input is the

product of the quantity and quality of the labor force, and respectively. Labor quality

captures the proximate effect of disease on aggregate output. The proximate effect

depends on the mean of health human capital but not on how health human capital is

distributed across individuals or production units. Proposition 2 also shows that disease is

a non-technological influence in total factor productivity. The effect of health human capital

on acts as a productivity effect of disease. This effect captures the fact that measured

aggregate TFP is lower than the pairwise level of technology. As (9) shows, aggregating

across pairwise meetings lowers aggregate efficiency, i.e., .

The productivity effect of disease depends on how health human capital is distributed

across individuals and production units. It is independent of the mean of health human

capital. If, for example, and increase by a factor of 0, increases by and

9

Page 10: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

increases by (1−), but remains unchanged. Instead, responds to mean preserving

changes to the distribution of health human capital. Consider, for example, a mean pre-

serving worsening of health ∆ = ∆ + (1 − )∆ = 0 with ∆ 0 ∆. While

remains constant, aggregate output and measured aggregate TFP decrease by

∆ = ∆ = (1− )(1− ){− − −}∆ 0.

As health dispersion vanishes → , the social organization of production becomes ir-

relevant: sorting becomes indeterminate in the sense that all allocations become equally

productive. In this case, and exactly coincide.

The productivity effect of disease arises because the aggregate labor input is represented

by total health human capital . Representing the aggregate labor input using a more

general aggregate that takes into account the distribution of health human capital (not just

its mean) would ‘eliminate’ the role of disease in aggregate efficiency. Changes in health

dispersion would then be assigned to changes in the labor input, as in Jones (2014) who

studied a general human capital aggregate for educational outcomes. Proposition 2, however,

uses a standard representation for the aggregate production function, i.e., aggregate inputs

and are mapped into aggregate output . This representation is consistent with the

actual measurement practice for aggregate TFP. The measurement strategy in Proposition

2 is not consistent with the accounting framework in Weil (2007). He treated health human

capital in efficiency units. I will later on discuss the efficiency units framework.

A central marketplace. To further discuss the productivity effects of disease, abandon

temporarily the pairwise matching assumption. Assume that capital and labor are pooled

in a central marketplace. Let superscript () denote the variables under this alternative

production arrangement. A central marketplace is not the same as an integrated assignment.

Under integration, production still takes place in different communities but () is equally

assigned across communities. In a central marketplace, () is no longer a choice variable.

That is, in a central marketplace, there is no need to sort people across communities. In per

capita terms, the economy’s physical capital is = +(1−) = , the economy’s health

human capital is ( ) = ( + (1− ))1, and the economy’s per capita production

10

Page 11: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

function is = [( )](1−). In a central market place, the values ( ) and

follow by definition.5

In a central marketplace, aggregate output is also Cobb-Douglas, = ()1−,

with

µ

+ (1− )

µ

¶¸(1−). (10)

This Cobb-Douglas production function differs from that in Proposition 2 only due to dif-

ferences in measured productivity. This means that the productivity effect of disease can

be stated in terms relative to the production efficiency of a central marketplace.

Proposition 3 Aggregate production efficiency in a central marketplace subject to disease

spillovers is lower than aggregate production efficiency in segregated pairwise meetings. That

is, .

Proof. From (9) and (10), iff [ + (1− )](1−) 1− + (1− )1−.

This inequality, which is a general version of (7) that holds for any value of ∈ (0 1), canbe written as

[ + (1− )]1 [1− + (1− )1−]1(1−). (11)

By Jensen’s inequality, (11) is satisfied when 1− (Assumption 2).

Since segregation is socially efficient, production in pairwise meetings is a source of ag-

gregate efficiency gains. The reason is that less efficiency is lost in the process of aggregating

across segregated pairwise meetings than in the process of pooling all factors of production

in a central marketplace subject to disease spillovers. For instance, Propositions 2 and 3

imply that

.

On the contrary, when integration is optimal, inequality (11) is reversed and centralizing

production in a single marketplace is a source of efficiency gains. In the extreme case of

→ −∞, → − ≡ ()

(1−), (12)

5The index ( ) is the natural representation of the economy’s health human capital when all factors

of production are pooled. For instance, expression (2) in pairwise meetings is ( ) with = 12.

11

Page 12: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

which is a lower bound for measured aggregate productivity in a central marketplace .

(− is also a lower bound for overall productivity across trading arrangements.)

Disease spillovers and exchange. So far, there is no exchange in physical capital.

Since physical capital is more productive in the hands of healthy individuals, aggregate

output and aggregate efficiency would differ if capital could be exchanged across production

pairs. The efficiency gains from capital trade, however, depend on how exchange in capital

markets is introduced. To study the social organization of exchange, I next consider several

trading arrangements that vary in the degree of centralization and exposure to disease. In

the trade protocols in this sub-section, capital exchange takes place between worker-pairs.

When worker-pairs are segregated, worker-pairs are also individual types.

Consider first a benchmark in which capital can be exchanged in a centralized competi-

tive market with no disease spillovers. Let the superscript () denote allocations in a central

marketplace with no spillovers.

Proposition 4 Suppose that capital is exchanged in a central marketplace with no disease

spillovers. Aggregate output is Cobb-Douglas and production is as efficient as pairwise pro-

duction. That is, = .

Proof. A competitive capital market exchange between segregated pairs of workers

attains the same allocation as a social planner that solves

max()()

1−() + (1− )1−(), s.t., () + (1− )() = .

Capital-labor ratios are equalized, i.e., () = () and simple substitutions yield

= ()1−. Thus, = .

Capital exchange increases aggregate production efficiency because capital endowments

are inefficiently assigned (all individuals have the same capital regardless of their health hu-

man capital). Capital exchange undoes the negative effect of disease and segregation. There

are, however, no disease spillovers in the interactions between worker-pairs in a centralized

market in Proposition 4.

To introduce disease spillovers in capital exchange, consider again pairwise trade arrange-

ments. Assume that matched pairs of segregated workers can exchange capital but that trad-

12

Page 13: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

ing opportunities arrive stochastically as segregated worker-pairs search for trading partners

along the [0 1] interval. The probability that a matched pair of healthy individuals meets

another pair of healthy individuals is , so their meeting probability is 2. The meeting

probability between pairs of diseased individuals is (1 − )2, and the meeting probability

between healthy pairs and disease pairs is 2(1−). (Obviously, 2+(1−)2+2(1−) = 1.)There are no gains from trade when alike worker-pairs meet. Gains from trade occur

when capital is reallocated between diverse pairs of segregated workers. Consider further

two possible outcomes from these meetings.

Assume first that capital is exchanged in a decentralized marketplace with disease

spillovers, as in Proposition 3. Expected output when healthy worker-pairs meet other

healthy worker-pairs 21−. Expected output when diseased worker-pairs meet a dis-

eased worker-pair is (1−)21−. Expected output when healthy worker-pairs meet dis-eased worker-pairs is 2(1−)[(+)2](1−), composed by the meeting probability

2(1 − ) and the output when their labor and capital is pooled: [( + )2](1−).

Let the superscript () denote this trading arrangement. The reason for using () is that

pairwise capital trading under disease spillovers coincides with an integrated assignment of

healthy individuals across locations, i.e., one in which ∗() = for all ∈ [0 1] in thestatic assignment (4).

Aggregate output under () is also Cobb-Douglas = ()1−. Aggregate effi-

ciency is characterized below.

Proposition 5 Suppose capital is exchanged in decentralized marketplaces with disease

spillovers. Aggregate production efficiency is higher than in a central marketplace with

disease spillovers (Proposition 3) but lower than under segregation (Proposition 2). That is,

.

Proof. Under integration, aggregate output per capita becomes 21− + (1 −)21− + 2(1− )[( + )2](1−). Aggregate production efficiency is

"2µ

¶1−+ (1− )2

µ

¶1−+ 2(1− )

½1

2

µ

+1

2

µ

¶¾(1−)#.

13

Page 14: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

To show that , one needs 21− + (1 − )21− + 2(1− )[( + )2](1−)

1− + (1− )1−. Rearrangements show that this inequality implies [( + )2]1

[(1− + 1−)2]1(1−), which holds due to Assumption 2; see (7).

Next, rewrite [ + (1− )](1−)

in as {2 + (1 − )2 + 2(1 − )[( +

)2]}(1−). To show that , notice that the relevant comparison becomes {2+(1 − )2 + 2(1 − )[( + )2]}1 {21− + (1 − )21− + 2(1 − )[( +

)2](1−)}1(1−), which again holds by Assumption 2.Proposition 1 has shown that it is inefficient to integrate healthy and diseased individ-

uals for production purposes. It is possible to interpret Proposition 5 as stating that it also

socially inefficient to integrate healthy and disease individuals for exchange purposes. Ex-

changing capital in a decentralized marketplace essentially undoes the efficiency gains from

socially segregating workers. The integrated assignment coincides with that in Proposition

5. Its proof, as the proof of Proposition 1, relies on simple concavity arguments. An inte-

grated assignment is more efficient than a central marketplace, i.e., . The negative

disease spillovers in a central marketplace apply to all individuals. In the decentralized mar-

ketplaces with disease spillovers of Proposition 5, only a fraction 2(1− ) of worker-pairs

experience the negative disease spillovers. This fraction is at most 12 when = 12. (The

other meetings, which account for 2 = 14 and (1− )2 = 14, would not feature disease

spillovers as they happen between equal worker-pairs.)

For the second capital exchange arrangement, consider an intermediate point between

the previous protocols. Assume, as in Proposition 5, that pairwise trading opportunities

arise stochastically but suppose now that the capital of diverse matched pairs is exchanged in

a decentralized competitive capital market with no disease spillovers, similar to Proposition

4 but at a pairwise level. Let superscript () denote this trading arrangement. The output

of healthy and diseased worker-pairs is 21− + (1 − )21−, respectively. The

expected output of a meeting between different worker-pairs is

2(1− ) max()()

{2−1[1−() + 1−()] : () + () = 2}, (13)

and output can be written as = ()1−, with described next.

14

Page 15: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

Proposition 6 Suppose that capital is exchanged in decentralized marketplaces with no dis-

ease spillovers. Aggregate production efficiency is higher than under segregation (Proposition

2) but lower than in a centralized market with no disease spillovers (Proposition 4). That

is, .

Proof. The efficient allocation of capital satisfies () = [2( + )] and () =

[2(+ )]. The value of the internal maximization problem in (13) is [(+ )2]1−.

Aggregate efficiency satisfies

"2µ

¶1−+ (1− )2

µ

¶1−+ 2(1− )

½1

2

µ+

¶¾(1−)#. (14)

From (9) and (14), iff 1−+(1−)1− 21−+(1−)21−+2(1−)[(+)2]1−. This inequality can be written simply as: [(1− + 1−)2]1(1−) ( + )2,

which holds by Jensen’s inequality. Recall that = . Hence for , one simply

needs 21− + (1 − )21− + 2(1 − )[( + )2]1− 1− in (14). Since 1− can

be written as (2+ (1− )2+ 2(1− )[(+ )2])1−, this inequality holds due to the

concavity of 1−.

The absence of disease spillovers in worker-pair meetings makes capital exchange a source

of efficiency relative to a segregated assignment, i.e., . Efficiency gains arise as cap-

ital is directed to higher productivity worker-pairs. For instance, heathier worker-pairs re-

ceive more capital relative to their endowment, while diseased worker-pairs end up with less

relative capital. As in the case of a central marketplace, capital-labor ratios are equalized,

as in()

()=

()

()=

1.

Since these gains only apply to a fraction 2(1− ) of meetings, aggregate productivity is

still lower than in a central marketplace with no disease spillovers in which all worker-pair

exchanges are unaffected by disease, i.e., .

Summary. The previous propositions say that disease spillovers reduce social interac-

tions and aggregate efficiency. They also say that the efficiency of production and capital

exchange varies with the scope of disease spillovers. Aggregate efficiency (and consequently

15

Page 16: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

'

&

$

%

¶µ

³´¶µ

³´

¶µ

³´¶

µ³´

¶µ

³´¶µ

³´

'

&

$

%

Figure 1: Productivity ordering across trading arrangements and disease spillovers. Dashed

boxes indicate the presence disease spillovers. Solid boxes their absence. Exchange happens

between encircled individual types.

aggregate output) can be ranked according to the extent of the disease spillovers and the

degree of exchange centralization. The previous propositions imply,

. (15)

As this expression makes clear, all production and exchange protocols are less efficient

than technologically possible due to disease spillovers. The extreme case assumes that

labor and capital are exchanged in a central marketplace. (Under −, trade is also cen-

tralized but → −∞.) A centralized marketplace generates the largest scope for negativedisease spillovers and the lowest aggregate production efficiency. The other extreme, ,

also assumes centralized exchange but with no disease spillovers. In the intermediate cases,

aggregate efficiency is lost in the process of production and exchange. In fact, segregation

dominates trading in decentralized markets with disease spillovers since they would lead

to social integration, i.e., . The ranking of aggregate productivities is depicted in

Figure 1 across different trading arrangements and disease spillovers.

3 Some remarks and extensions

This section provides some remarks and extensions regarding the role of disease as a deter-

minant of the social organization of production and exchange.

Segregation and health human capital. The nature the assignment is such that

segregation is optimal for all values of ( ) and ( ) that satisfy Assumptions 1 and

16

Page 17: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

2. The literature on segregation by skill has typically found that worker segregation in

production depends on particular values of the skills, say and ; see, e.g., Kremer and

Maskin (1996) and McCann and Trokhimtchouk (2010, Example 1).

To illustrate this point, assume, without loss of any generality, that = = 1. Consider

Cobb-Douglas production functions such as ( ) = . Under this production function,

worker segregation is socially efficient if and only if differences between workers’ skills are

sufficiently large. The second-order condition (7) is of the form ( )−2( )+( ) =

+−2++. For ≡ ≥ 1, the relevant terms needed to determine if segregationis optimal can be written as +{+ − 2 + 1}. As a function of , under 0,

the difference in the term inside the brackets is zero if skills are equal or = 1. It decreases

monotonically as increases until it reaches a trough at (2( + ))1. Then this term

increases monotonically until it eventually becomes positive. A positive difference in the

term in brackets is necessary for segregation. Health human capitals and should then

be sufficiently different for segregation to occur.

In the worker assignment here, even small differences in and are consistent with

social segregation. I treat health human capital in a pairwise meeting as a single input

of production. A pairwise CES aggregation, however, is more general than Kremer and

Maskin (1996). The pairwise health human capital approach is similar to the general human

capital aggregator in Jones (2014).6 A CES aggregator serves to streamline the analysis of

disease spillovers in pairwise meetings. It also helps to identify the tension behind worker

segregation as a conflict between disease spillovers and decreasing returns to scale.

Health human capital and market prices. The previous assignment of workers

across communities takes a social planner perspective. Competitive markets tackle disease

spillovers through a “social barrier.” For instance, segregation is inconsistent with factor

price equalization across communities or individuals of different types. The reason is that

diseased individuals cannot ‘bribe’ healthy workers into forming a pairwise match. Diseased

workers benefit from pairing with healthy workers, but healthy individuals are better off

6Jones (2014) used a generalized human capital measure in a development accounting framework. The

empirical implementation of a general aggregator depends on the sources of variation underlying the different

inputs; see, e.g., the comment on Jones (2014) by Caselli and Ciccone (2019). One of these sources of

variation is the relative efficiency of skilled workers. In the case of health human capital, relative efficiency

seems to be driven largely by physiological considerations; see, e.g., Fogel (1994).

17

Page 18: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

teaming only with other healthy individuals.

Within a segregated assignment, consider a market in which physical capital is rewarded

its marginal product denoted by () and (), i.e., () = −11−. Wages for

each worker type, () and (), exhaust income, i.e., ( ) = () + () so that

() = (1−)( ) = (1−)1− and () = (1−)1−. Wages across workersdiffer due to differences in health human capital. The log-wage gap between healthy and

diseased workers,

ln [()()] = (1− ) ln[] 0. (16)

Weil (2007) employed data on adult mortality and latent measures of health (i.e., age of

menarche and height) in order to quantify disease’s proximate effect on aggregate output.

He used a strategy based on observed differences in log-wages such as those in (16). The

estimates used by Weil (2007) yield a wage premium. In Weil (2007), however, labor is

homogeneous so wage differences reflect differences in the efficiency units of labor. The

assumption of efficiency units of labor implies that health human capital can be perfectly

substituted across individuals.

Assumption 2 is inconsistent with the notion that health human capital is a homogeneous

input. Healthy and diseased workers are perfect substitutes only when = 1; see (2). In this

case, integration is optimal and there is no productivity effect of disease. The mean health

human capital in a pair is 1( ) = ( + )2, so the baseline price for human capital is

= (1− )[1( )]. Factor payments +1 exhaust income and there is a log-

wage gap between healthy and diseased workers of the same form as (16), i.e., ln [].

Wages can be ordered relative to a segregated assignment, as in () () .

Wage income is more unequal under efficiency units.

“Social barriers” and misallocation. Segregation and the inequality in marginal

products (16) contrasts with analyses of resource misallocation. In concave economies, mis-

allocation is associated with differences in marginal products. Hsieh et al. (2019), for

example, studied “social barriers” associated with racial and gender discrimination, and

argued that these barriers misallocate resources. Hsieh et al. (2019) shows that social

integration, measured by convergence in the occupational and wage distribution, has con-

18

Page 19: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

tributed significantly to aggregate efficiency in the US since 1960. Hayashi and Prescott

(2008) also considered a “social barrier” that limited regional and occupational mobility in

prewar Japan. Misallocation was associated with a traditional stem family system, known

as ie. They also find considerable aggregate efficiency gains in postwar Japan due to the

erosion of this “social barrier.”

Even though the aggregate production function is a standard Cobb-Douglas, the econ-

omy in this paper is not a standard concave economy. Hence differences in marginal products

cannot be taken as evidence of misallocation. The central difference here is that the “so-

cial barrier” is endogenous and not imposed. Thus, marginal products or factor prices are

different, but there is no gain from factor reallocations. In fact, segregation by disease

is socially efficient and policies that promote social integration in production or exchange

reduce aggregate efficiency and income.

Health human capital and development accounting. To illustrate the quantitative

contribution of the productivity effect of disease, relative to the proximate effect, consider a

conventional development accounting framework. Under aggregate Cobb-Douglas functions,

output per capita can be written as = 1(1−) ( )(1−)

. In the neoclassical

growth model, capital-output ratios are determined by the stationary value of capital’s

marginal product, i.e., = . Assume that the marginal product is independent of

health human capital. Also, normalize the value of to 1. The comparison between the

productivity and the proximate effects of disease in output per capita is based on differences

between 1(1−) ×.

I consider a value of = 080. This means that 80 percent of population is healthy and 20

percent diseased. As Fogel (1994, pp. 373-374) notes, toward the end of the 18th century, a

considerable fraction (about 20 percent) of the population was excluded from the labor force

due to the lack of energy for work. The value of = 080 is based on the previous estimate

by Fogel (1994). I take = 040 based on aggregate factor payments to capital and labor. I

assume a value of = 028 as illustration. I am not aware of estimates of the substitutability

between different health human capital types. There are several microeconomic estimates of

the elasticity of substitution between skilled and unskilled labor, as measured by educational

attainment. The consensus is a range between 1 and 2 with a common estimate of 14; see,

19

Page 20: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

e.g., Jones (2014) and Caselli and Ciccone (2019). I therefore assume = 1 − 114. Forreasons discussed below, the value of = 028 is conservative. The lowest possible value of

measured aggregate TFP is −, which assumes → −∞.

Table 1. Proximate and productivity effects of disease in development accounting.

(−)11−

()11−

()11−

()11−

()11−

()11−

1.25 0.00 1.00 0.00 0.57 0.71 0.86 0.96 1.00

1.20 0.20 1.00 0.38 0.90 0.92 0.95 0.98 1.00

1.15 0.40 1.00 0.58 0.95 0.96 0.98 0.99 1.00

3.125 0.00 2.50 0.00 0.57 0.71 0.86 0.96 1.00

3.000 0.50 2.50 0.38 0.90 0.92 0.95 0.98 1.00

2.875 1.00 2.50 0.58 0.95 0.96 0.98 0.99 1.00

2.750 1.50 2.50 0.74 0.98 0.99 0.99 1.00 1.00

2.625 2.00 2.50 0.87 0.99 1.00 1.00 1.00 1.00

2.500 2.50 2.50 1.00 1.00 1.00 1.00 1.00 1.00

Note: The values are based on the following parameters = 1, = 080, = 04, and = 028.

Normalize = 1 and consider an improvement in mean health human capital of 0 =

25. This value is consistent with the detailed measurement exercise by Weil (2007). As

noted by Weil (2007, p. 1287), Fogel (1994) estimates that health human capital in Great

Britain “increased by a factor of 1.95” over the period 1780-1980. Similarly, using cross-

country differences in the age of menarche of 3.7 years, the range in his sample, Weil (2007,

p. 1289) found differences in the “labor input per worker of a factor of 2.73.” Using cross-

country differences in adult mortality between Iceland and Botswana, Weil (2007, p. 1292)

found differences in the “labor input per worker by a factor of 1.59.” These numbers suggest

that the increase in mean health human capital used here is a reasonable benchmark.

Table 1 considers multiple values of and consistent with = 1 and 0 = 25. The

multiple combinations of and are irrelevant under efficiency units since they lead to the

same value of or 0. The table lists the production and trading arrangements in (15)

as well as − in (12) as a reference. The first rows consider = 1 in three forms. First,

by ( ) = (1 0); second, by ( ) = (120 020); and third by ( ) = (115 040). As

noted before, the case with = 0 yields − = 0 as healthy workers have been rendered as

20

Page 21: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

unproductive as diseased individuals. Under centralized trading with disease spillovers (),

the productivity effect of disease represents a fraction 057 of the pairwise productivity

= 1. Positive values of reduce the gap in measured productivities to the point of making

closer to = = 1 when and are closer, i.e., ( ) = (115 040).

The next set of cases in Table 1 considers different ways to reach 0 = 25. The case

(0 0) = (25 0) is exactly the same as ( ) = (1 0) since measured productivities

only depend on relative values. The following five cases progressively increase 0 by∆ = 05

and lower 0 by ∆ = 0125 so as to maintain 0 = 25. As expected, reducing health

dispersion increases measured productivity along any column. Across the different rows,

productivity also increases as disease spillovers are less detrimental across arrangements

(e.g., Figure 1).

Since mean health human capital has increased over time and health disparities have

narrowed across individuals and countries, the change from the extremes ( ) = (1 0)

to (0 0) = ( 0 0) is perhaps the most representative of health changes during modern

times. In that comparison,

= 100× 057→ 250× 100 = ()0,

with output per capita increasing by factor of 437, as in ()0 → 437× (), and with 0 → 250 × and ()

1(1−) → 174 × ()1(1−)

. An increase in output by a factor

of 437 does not fully account for the difference between current poor and rich countries,

or for the historical experience of rich countries. These differences are of a higher order of

magnitude. (The 85th/15th percentile income ratio across countries today is 16.9 and the

75th/25th ratio is 6.3; see, e.g., Jones (2014, Table 1).) Yet, Table 1 shows that disease

influences multiple aspects of production and that the productivity effect of disease has the

potential of being of a significant order of magnitude.

The contribution of the productivity effects of disease in Table 1 is conservative. Table 1

provides a lower bound since disease spillovers use a ‘high’ elasticity of substitution between

health types. The empirical literature on skill types based on education consider values

of elasticities of substitution below 1 as implausible; see Jones (2014). Elasticities above

21

Page 22: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

one imply 0. The difference between positive and negative values of the parameter

is important for health human capital. Positive values of give a greater weight to

healthy individuals in the pairwise aggregator ( ) whereas negative values of give a

greater weight to diseased individuals. (This can be seen easily for ∞( ) = max{ }and −∞( ) = min{ }.) Assuming 0 therefore bounds the scope of negative

disease spillovers. Even within a positive range for , alternative values of give a much

higher significance to the productivity effect of disease. For example, = 1− 112 implies()

1(1−) → 305× ()1(1−)

and = 1− 111 implies ()1(1−) → 931× ()

1(1−).

Inequality and welfare. The baseline model focuses on the role of disease in deter-

mining the social organization of production and exchange. Segregation, however, might

improve welfare if disease spillovers are sufficiently negative and society has a relatively

small aversion to inequality.

Suppose that production is equally consumed within each pair of workers and let ( ) =

(1− )−1[( )]1− denote the ex post utility of a meeting between healthy individuals in

the community. Likewise, define ( ) and ( ) as the ex post utility of ( ) and

( ) meetings respectively. Let (()) denote community ’s utility, i.e., (()) ≡{()2( ) + 2()(1− ())( ) + (1− ())2( )}, and define the economy’s so-cial welfare function as the CES aggregate of expected utilities across communities, i.e.,

() ≡ 1

1−

Z 1

0

(()1−. (17)

The welfare maximization problem now involves maximizing () s.t. (1) and 0 ≤ () ≤ 1.The optimal allocation of workers across communities is more involved than in Section

2 because and introduce concavity at the community level (i.e., in relative to ) and

at the societal level (i.e., in relative to ). Concavity means that output maximization

is not the only concern for the social planner. Inequality matters in two separate ways.

First, ∈ [0 1] captures social concerns for within-community inequality. For example, as increases, society gains from observing healthy and diseased workers paired together in any

community. Second, ∈ [0 1] captures social concerns for between-community inequality.For example, as increases, society gains from observing more homogeneous communities.

22

Page 23: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

(Ignoring these inequality concerns, i.e., = = 0, yields the benchmark formulation.)

The presence of and reduce the incentives of social segregation, so Assumption 2

must be strengthened. The next proposition provides two illustrations of the conditions in

production and the social welfare functions that make segregation desirable from a welfare

perspective.

Proposition 7 Suppose that = 0 and amend Assumption 2 as follows (1−)(1− ).

Alternatively, assume that → −∞ and that 0 ≤ 12. In either case, segregation bydisease maximizes social welfare (17).

Proof. The proof of the first case is standard because the objective function (17) has

the same form as (8). Thus, as in (7) and Proposition 1, Proposition 7 can be proven using

Jensen’s inequality, i.e., by showing that the previous joint restriction on ( ) implies

that community ’s expected utility is convex in (), i.e., ( ) + ( ) 2( ).

Consider next the second case. When →−∞, (()) = ()2( )+(1−()2)( ).

It is straightforward to show that ≤ 12 is sufficient for () to be a strictly convex

function of ().7

Segregation is welfare-improving if disease spillovers are sufficiently negative relative to

the degree of inequality that can be socially tolerated. When = 0, society is indifferent to

the inequality between communities. If the social concerns for within-community inequality

are small, i.e., 1− (1− ), then segregation becomes desirable. This represents the

first case in Proposition 7. The second case incorporates a concern for inequality between

communities. When 0, the social welfare function is a strictly concave function of

a community’s expected utility (()). Concavity in () implies a preference for social

integration. Integration when → −∞ and ≤ 12, however, has such adverse effects ina community’s output that society is better off trading off output and equality.

7The first derivative of () with respect to () is [()2( )+(1−()2)( )]−{2()(( )−( ))}. After some manipulations, the second derivative can be written as

2((())

()2=2(( )− ( ))

©( ) + ()2[( )− ( )](1− 2)ª

[()2( ) + (1− ()2)( )]1+.

The only source of ambiguity lies in (1 − 2). Under 0 ≤ 12, 2((())()2 0 leading to a

convex objective function.

23

Page 24: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

Dynamic spillovers. Assumption 2 formalizes the notion that healthy and diseased

individuals are poor partners for production and exchange. This assumption, and the base-

line model more generally, is not explicit about the progression and transmission of disease.

In fact, one can reformulate the baseline theory to substitute the notion of disease spillovers

for spillovers based on attributes that are also seen as undesirable in social settings, i.e.,

crime and ‘immoral’ activities that trigger social exclusion and marginalization for reasons

unrelated to disease contagion.

I consider disease progression and transmission as in mathematical epidemiology, e.g.,

Anderson and May (1992). Canonical examples of diseases transmitted by physical contact

include respiratory infections such as tuberculosis and pneumonia. Sexually transmitted

diseases also feature direct transmission. Many parasitic diseases, notably malaria, are not

contagious (spread by contact). These diseases, however, exhibit cross infection: a vector

infects the human host and then the human host infects another vector. I consider a simple

extension with both modes of transmission.

Population is constant and has no age structure. Capital is also constant. Let

and 1 − denote the current fraction of healthy but susceptible individuals and the

fraction of infected individuals, respectively. There is a stationary population of size 0

of infected disease vectors (or nonhuman hosts such as mosquitoes). These vectors are

uniformly distributed across all communities. Susceptible (i.e., healthy) individuals become

infected through contact with a disease vector or infected individuals.8

Contagion takes place randomly. The probability that a healthy individual meets a

disease vector is (). The probability that such meeting leads to a disease transmission

is represented by the vector transmission rate 0 (i.e., the biting rate of the vector). As

before, the meeting probability between a healthy and a diseased individual is 2()(1 −()), and the human transmission rate is 0, i.e., the probability that the susceptible

individual becomes infected. Infected individuals recover at a rate , i.e., a fraction of

(1 − ) individuals recover from the infection during the current period. The fraction of

8The population of vectors is not modeled explicitly. There is a large literature that examines how

disease evolves. See Anderson and May (1992) for an introduction to this literature.

24

Page 25: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

susceptible (i.e., healthy) individuals evolves over time as follows:

0 − = (1− )−

Z 1

0

()−

Z 1

0

2()(1− ()), (18)

where the first term in the right-hand-side is the fraction of infected individuals that re-

cover, and the second and third are the vector and human transmissions per capita. In

nonstationary populations, the evolution of disease would add new healthy individuals and

subtract fatalities. Births and deaths are implicitly cancelled in (18).

Let 0 1 denote the discount factor. The social planner allocates healthy workers

across communities to maximize the discounted present value of aggregate output,

() = max0≤()≤1

½Z 1

0

(())+ (0)

¾, s.t., (1) and (18).

The following proposition characterizes the dynamic assignment:

Proposition 8 Segregation is more socially desirable in a dynamic environment than in

the benchmark static model. The desirability of segregation is independent of the vector

transmission rate , but it increases with the human transmission rate . The (stable)

stationary fraction of healthy individuals in the population is ∗ = ( + ).

Proof. As in the static problem, the second-order condition determines the assignment

of individuals across communities. In addition to the static terms in the baseline model (5),

the assignment problem now contains a Lagrange multiplier 0 on the constraint (18). The

second-order condition for () depends on

[( )− 2( ) + ( )] + 02. (19)

The first term in (19) captures the static spillovers of disease (7) and the second the dynamic

spillovers. This second term is positive since 0 0. The added constraint (18) thus

increases the social planner’s incentives for segregation. The transmission rate does not

influence (19) because the vector transmission is uniform across locations. The human

transmission rate , however, induce segregation in (19) even if (()) is linear, i.e., 1− =

25

Page 26: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

. Under segregation, (18) becomes 0− = (1−)−, which converges monotonically

to ∗ from any 0 0.

It is natural for segregation to become more socially desirable in a dynamic environment.

The social planner is now concerned with maximizing aggregate production (Proposition 1)

but also with increasing mean health over time. These two objectives are complementary

in (). A pairwise meeting between healthy and diseased individuals reduces the overall

fraction of healthy individuals in the economy in the next period. This dynamic spillover

of disease implies that segregation would be beneficial even if there are no static disease

spillovers. In this sense, dynamic considerations could weaken Assumption 2.

Proposition 8 also highlights the role of the human transmission rate in making segre-

gation more desirable. The incentive to discourage meetings between healthy and diseased

individuals is larger for easily transmittable diseases than for diseases with low transmission

rates. For instance, if = 0 in (18), disease would evolve exogenously from the perspective

of a social planner. The incentives to segregate populations are such that health evolves

as if = 0 in the optimal assignment. In other words, the optimal assignment leads to an

instantaneous response that forces disease to evolve exogenously.

In the dynamic model, the fraction of healthy individuals is endogenous. The steady

state value ∗ is increasing in the recovery rate and decreasing in the product of the vector

transmission rate and the prevalence of disease vectors . A disease vector is important

to sustain an endemic steady state. In its absence, segregation and a positive recovery rate

would eventually ensure that all individuals are equally healthy, i.e., ∗ = 1. In Proposition

8, differences in disease vectors and their direct transmission rate determine health

differences across populations.

A multi-class society. The baseline model considers two types of health human capi-

tals. Social stratification typically takes place in hierarchies, so it is important to know the

conditions that might lead to segregation in more general settings. This section considers

≥ 2 types of individuals. The (positive) health human capital endowments of these dif-ferent types are represented by a (column) vector h ≡ (1 ). Health human capitalsare ordered so that 1 2 . The given number of workers of type is and

their fraction in the population is = , withX

=1 = 1. Individuals are endowed

26

Page 27: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

with physical capital as in the benchmark model, i.e., = = .

To keep the analysis comparable to the baseline model, production takes place in pairs.

Pairwise production can be summarized by a × matrix F defined such that its ( )

element is given by F() ≡ ( ). That is,

F ≡

⎡⎢⎢⎢⎢⎢⎢⎣(1 1) (1 2) · · · (1 )

(2 1) (2 2) · · · (2 )...

.... . .

...

( 1) ( 2) · · · ( )

⎤⎥⎥⎥⎥⎥⎥⎦The assignment problem is the allocation of workers across communities. Let ()

denote the fraction of individuals of type ∈ {1 : } to be assigned to community ∈ .

The choice vector is n() ≡ (1() ()), which is consistent with the baseline model.Since workers meet randomly, expected output in community is given by a quadratic form

2(n()) ≡ n()0Fn(). A social planner maximizing aggregate output per capita solves

maxn()

Z 1

0

(n()), s.t.,

Z 1

0

() = ,

and 0 ≤ () ≤ 1, for ∈ {1 : }.A generalization of Assumption 2 is sufficient to induce a multi-class society:

Proposition 9 For a given ≥ 2 types of workers, assume that the parameters , and satisfy µ

+

2

¶1

µ1

− 1¶1(1−)Ã1− + 1−

2

!1(1−), (20)

for all ( )∈{1:} pairs. Then it is (socially) optimal to segregate individuals into

classes with () = 1 in communities, and () = 0 otherwise.

Proof. The matrix F is symmetric due to the symmetry of the mean function ( ),

as in the benchmark model. Since the entries F() are real, the matrix F is also Hermitian.

27

Page 28: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

Therefore, if pairwise comparisons satisfy

F() 1

− 1pF()F(),

for all ( ) ∈ {1 : }, then F is positive definite; see Bernstein (2005, Fact 8.7.34). Thestandard inequality of arithmetic and geometric means

pF()F() ≤ (F() + F())2

imply that

F() 1

− 1F() + F()

2(21)

is a sufficient condition for F to be positive definite. Since F() = 1− , F() =

1− , and F() = ([ +

]2)

(1−), some algebra shows that (21) can be written

as (20). Since the feasibility constraints are linear, the second-order conditions associated

with output maximization can be related solely to the properties of F. If F is positive

definite, the optimal allocation is a corner solution.

When = 2, expression (20) holds as long as (1−), which is precisely Assumption2. But Assumption 2 might not be sufficient for the general case, especially when the number

of types is large. There is no straightforward intuition to describe the general condition

(20) other than to reiterate that the spillovers must be sufficiently negative to outweigh the

benefits of integrating different types of individuals. Special cases such as = −∞ do not

help much with intuition.

Proposition 9 leads to generalizations of the baseline model. Under segregation, the

aggregate production function is still Cobb-Douglas = ()1−, and aggregate

efficiency is given by

"X

=1

µ

¶1−#, (22)

with ≡ −1X

=1. Expression (22) extends (9) to the case of classes and it carries

the same interpretation. Likewise, aggregate efficiency in a centralized market place with

disease spillovers can be defined as in the benchmark model by

∙X

=1

µ

¶¸(1−),

28

Page 29: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

where due to Jensen’s inequality. If, once segregated, capital is exchanged in

a central marketplace with no disease spillovers, as if a planner is solving

max

X

=1

1−

, s.t.,X

=1 = ,

one recovers an aggregate production function = ()1−, with = . These

expressions generalize the two-class model of Section 2 and so they carry the same economic

implications.

Proposition 9 retains the pairwise matching assumption but allows for different types

of workers.9 Society is segregated in classes and there is a clear hierarchy as some types

are more productive than others. A multi-class society is not especially interesting from the

perspective of different degrees of contagion because there is no obvious disease hierarchy.

Proposition 9, however, is interesting because of the multiple traits that are often associated

with a stratified society (i.e., occupation, race, religion, and so on). Human capital is often

viewed as a composite of multiple types. If individuals differ in terms of education and

health, multiple human capitals will be given by the product of an individual’s education

and health, as in Weil (2007, Eq. 2).

4 Some historical perspectives

The central hypothesis that arises from the previous propositions is that the social orga-

nization of production and exchange are influenced by disease spillovers (i.e., contagion).

This section briefly discusses some historical accounts consistent with the role of disease

sketched by the model. Some of the channels discussed here are not tacitly recognized in

economics but are often discussed by historians. A detailed study of the instances discussed

here would require, at a minimum, a series of dedicated papers. The goal of this section is

9One can also relax the random matching assumption in the baseline model. Assumption 2 can be

weakened by assuming a (more general) matching function of the form ( ), with and representing

the fraction of individual-types in each community. It is possible to see that 12( ) 0 is sufficient to

induce segregation when (1 − ) = . This case relies on complementarities in pairwise meetings and not

in the production of output. This complementarity seems more difficult to validate empirically relative to

the complementarities in production.

29

Page 30: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

more in line with creating a catalog of past responses to disease that aligns with this paper’s

thesis and might be helpful in organizing future work.

Caste. The canonical example of social segregation is the caste system. Caste divides

society into groups according to principles of separation in terms of direct and indirect

contact; specialization and division of labor; and social hierarchy; see, e.g., Dumont (1970,

p. 21).10

The best-known example of caste is the Indian caste system. The Indian caste system

segregates individuals using the opposing concepts of pollution and purity derived from

Hindu principles; see, e.g., Bayly (1999), Dumont (1970), Hutton (1969), and Singh (2005)

for some general and diverse perspectives of the caste system in India. Very broadly, caste

in India separates individuals into mutually exclusive hierarchical groups: the ‘forward

caste’ that includes Brahmins (traditionally priest and teachers), Kshatriyas (traditionally

warriors and rulers), and Vaisyas (traditionally money lenders and traders); and the Shudras

(traditionally servants) which constitute the ‘other backward castes.’ These groups are

above the ‘pollution line.’ The fifth group is regarded as being outside the caste system (i.e.,

outcasted) and below the ‘pollution line’ as they are untouchable because their proximity

is polluting.11

Pollution and purity are prevalent ideas in historical narratives; see, e.g., Douglas (1966).

Across societies in the past, as Dumont (1970, p. 49) notes, “dangerous contact acts directly

10Many social institutions are based on some of these principles, but strict separation is the defining

feature of the caste system. For example, specialization and division of labor guided the Medieval guild

system in Europe and the occupational code of the Western Roman Empire, i.e., the Theodosian Code which

mandated children to follow their father’s occupation. Many other examples of hereditary functionalism

that do not constitute caste are discussed by Hutton (1969, Chapter IX). This category also includes the

Japanese ie discussed by Prescott and Hayashi (2007). Racial relationships between whites and blacks

feature a social hierarchy but they are not part of a caste system. As discussed by Hutton (1969, P. 136),

“there is strong prejudice on the part of the whites [. . . ], but the question of taboo and pollution by touch

hardly arises. A negro servant to a white man is no strange anomaly, but a Brahman with a Chandal [who

deals with disposal of corpses] cook is unthinkable.”11There are many examples of outcaste groups outside of India; see Hutton (1969, Chapter IX) and the

discussion of unranked systems in Horowitz (2000, Chapter 1). In Burma, under the old rule, outcaste

classes included beggars and vagabonds, lepers and others suffering from disease, as well as the sandala in

charge of grave-digging and the disposal of corpses. (As Hutton (1969, p. 145) notes, sandala “is clearly

the same word as the Hindi chandal.”) Outcastes are also prevalent in sub-Saharan Africa for individuals

that perform occupations that are regarded as unclean (i.e., leather-working). Outcastes in Japan (i.e., the

Burakumin) were in charge of animal slaughter and removing sewage. The Philippines and Sri Lanka also

featured unclean outcastes.

30

Page 31: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

on the person involved, affecting his health, for example.” The view of pollution in the caste

system, however, is more extreme than in traditional societies and tribes. Impurity is often

temporary as in the case of seclusions after births or burials, or during menstruation; see

Dumont (1970, p. 50). The ‘practice of untouchability’ in the Indian caste system is per-

manent to the point of even being hereditary. Untouchability arises from performing ‘dirty’

production tasks associated with “taboo infections” such as animal skinning, leather-work,

scavenging, and removing sewage and corpses; see, e.g., Hutton (1969, p. 89). Moreover,

the separation underlying the ‘practice of untouchability’ restricts direct physical contact

but also other indirect forms of contact including sharing and preparing meals as well as

entering the same physical space (i.e., kitchen or temple). Ambedkar’s (1936) famous Anni-

hilation of Caste goes further to describe the untouchables as those “whose presence, whose

touch, whose very shadow is considered to be polluting by privileged-caste Hindus.” Al-

though caste serves many purposes, a central function of the Indian caste system appears to

be the regulation of direct and indirect physical contact as a disease-avoidance mechanism.

Accounts of the Indian caste system date back to the Vedic period somewhere around

1500-1000 BC.12 McNeill (1976) argues that the emergence of the caste system, and the hier-

archical relationship between castes, originated as a social response to disease spillovers. Ac-

cording to McNeill (1976, pp. 81-84), the caste system developed when “intrusive Aryans”

from central Asia encountered “small, self-contained communities of forest-dwelling peo-

ples.” He views this encounter as an “epidemiological standoff” between a population unfa-

miliar with the disease environment of the Indus valley and ‘forest folk’ who experienced a

variety of tropical diseases and parasitic infections. Such population, familiar with the dis-

ease environment, “had acquired tolerances for formidable local infections.” McNeill (1976,

pp. 66-67) argues that “the taboos on personal contact across caste lines, and the elaborate

rules for bodily purification in case of inadvertent infringement of such taboos, suggests

the importance fear of disease probably had in defining a safe distance between the various

social groups that became the castes of historic Indian society.” The end result was that

12The Indian caste system has adapted to historical circumstances during the Mughal era, British colonial

rule, and independence; see, e.g., Bayly (1999). There are many views on the origin and persistence of the

Indian caste system. For example, Freitas (2006) argues that the caste system was a social mechanism to

share information, enforce contracts, and facilitate trade. Freitas (2006) also discusses alternative hypotheses

from anthropology and sociology.

31

Page 32: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

“the homogenizing process fell short of the ‘digestive’ pattern characteristic of the other

Old World civilizations.” The hierarchies in the caste system also align with an increased

distance between higher castes and outcaste populations with higher exposure to disease.13

Quarantines. Caste regulates social distance based on social norms. As in the lifetime

isolation of lepers, discussed many times in the Old Testament, enforcement is based on

social and not legal sanctions. Quarantines are an early example of segregation by disease

enforced by the authority of the state.

Quarantines are preventive measures. They represent the physical separation of healthy

individuals who have the potential of transmitting infections, i.e., asymptomatic individ-

uals who may have been exposed to disease. (Isolation represents the physical separation

between symptomatic individuals or diseased individuals.) Quarantines began as a strat-

egy by Mediterranean city-state governments to minimize the risk of infection during the

Black Death. The plague’s arrival to Europe in the 14th century is among the best docu-

mented (and deadliest) instances of “what can happen when an unfamiliar infection attacks

a population for the first time”; see, e.g., McNeill (1976, p. 7).

The plague was introduced to Europe by merchants and it spread along trade routes

(Biraben, 1975). Cities and towns in medieval Europe institutionalized quarantine regula-

tions for travelers and shipping. Quarantines prevented individuals presumed to be diseased

(and their cargo) from entering cities and ports. According to McNeill (1976, p. 151), the

idea of quarantines “stemmed from biblical passages prescribing the ostracism of lepers; and

by treating plague sufferers as though they were temporary lepers.” Trade and factor mo-

bility restrictions were supplemented with public health practices (i.e., bills of health that

certified that the last port visited by a traveler was disease-free) and by the ‘persecution’

and isolation of plague victims.14

13The role of disease is also noted by Singh (2005, p. 36): “Most of the communities that were in India

before the arrival of the Aryan were integrated in the Sudra varna or were made outcastes depending on

the professions of these communities. Communities who professed non-polluting jobs were integrated into

Sudra varna. And communities who professed polluting professions were made outcastes. The Brahmans

are very strict about cleanliness. In the past, people believed that diseases can also spread through the

air and not only through physical touch. Perhaps because of this reason the untouchables were not only

disallowed to touch the high caste communities but they also had to stand at a certain distance from the

high castes.”14The plague also served to stigmatize and persecute “minority groups” that were “easily identifiable,

already unpopular, widely scattered and lacking any powerful protector.” Examples include pilgrims and

32

Page 33: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

Social segregation was based on the mistaken hypothesis of the existence of a ‘miasma’

or poison cloud emerging from rotten corpses or rotting matter in the earth. Distancing,

however, did minimize direct contact between healthy and diseased individuals over a suffi-

ciently long period. Forty days eventually became standard in Italian cities, although some

ports considered shorter durations (i.e., Dubrovnik in Croatia first implemented a thirty

day isolation or ‘trentina’ in 1377 AD). Although there was no understanding of disease

transmission, “[i]n many cases such precautions must have checked the spread of plague.

[...] Quarantine rules were therefore well founded” according to McNeill (1976, p. 151).

Quarantines were not universally adopted in medieval Europe, but they became “general in

Christian ports of the Mediterranean.” Quarantines and the cordon sanitaire between the

Habsburg and Ottoman empires are often credited for the disappearance of the plague from

Western Europe; see, e.g., Biraben (1975).

Zoning. Cities feature a number of positive and negative agglomeration externalities

due to the close proximity between individuals as well as between individuals and firms.

In the past, cities had much higher infectious disease loads than rural areas. As noted by

Pinter-Wollman et al. (2018, p. 3) “London, Paris, New York City and Chicago were densely

populated and characterized by residences in proximity to factories, animal yards, slaughter-

houses and crowded tenement houses with little airflow or light.” Rapid urbanization during

the nineteenth and early twentieth century produced greater potential for disease transmis-

sion and facilitated the spread of communicable diseases such as tuberculosis, typhoid fever,

and cholera (Birchenall, 2007).

Even during the age of miasmas, public health officials considered ways to improve

city sanitation using principles based on contagion. As in the response of Mediterranean

cities to the Black Death, early public health approaches relied on a notion of contagion

that “misrepresented the root cause of disease.” Eventually, with the advent of the germ

theory of disease, “filth was recognized as the medium for transmitting disease instead of

the primary source of contagion” and appropriate public health responses were put in place;

see, e.g., Melosi (2008, pp. 41-43). Indeed, “[d]isease was the raison d’être for the advent of

lepers all over Europe, and Jews and Arabs in Germany and Spain; see Ziegler (1969, p. 97). Quite often,

as these examples illustrate, quarantine principles took a class, ethnicity, or race character.

33

Page 34: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

urban planning in Europe and the USA” according to Pinter-Wollman et al. (2018, p. 3).

The regulation of urban spaces collectively known as zoning was one of the central public

health strategies to counter the harmful effects of urbanization. (Zoning is defined as the

spatial segregation of residential, commercial, and industrial land-uses.) Zoning has a clear

public health basis; see, e.g., Schilling and Linton (2005, p. 98). As with other forms of

segregation by disease, zoning limited or eliminated physical contact between individuals

and unhealthy land-uses in production.

‘Silent trade.’ Tropical parasitic and infectious diseases have been heavily concentrated

in sub-Saharan Africa.15 An illustration of the effect of disease on market exchange is the

tendency for trade in pre-colonial Africa to be conducted at the border of territories, rather

than at centralized markets. This form of decentralized exchange, often labeled silent trade,

takes place with no direct physical contact between trading partners and with a total absence

of social interactions.

The earliest account of silent trade protocols comes from Herodotus of Halicarnassus,

around 500 BC. As reported by Price (1980, p. 75), he described Carthaginian traders in

West Africa:16 “The Carthaginians also tell us that they trade with a race of men who live

in a part of Libya beyond the Pillars of Heracles. On reaching this country, they unload

their goods, arrange them tidily along the beach, and then, returning to their boats, raise

a smoke. Seeing the smoke, the natives come down to the beach, place on the ground a

certain quantity of gold in exchange for the goods, and go off again to a distance. The

Carthaginians then come shore and take a look at the gold; and if they think it represents

a fair price for their wares, they collect it and go away; if, on the other hand, it seems too

little, they go back on board and wait, and the natives come and add to the gold until they

15Diamond and Panosian’s (2006, pp. 32-33), in their discussion of the geographic origins of tropical

disease, note that “even the most significant diseases which originated in the New World tropics, Chagas’

disease and leishmaniasis (the latter also arose in the Old World tropics), have much less human impact than

any of the six leading diseases of the Old World tropics (yellow fever, falciparum malaria, vivax malaria,

cholera, dengue fever, and East African sleeping sickness).”16Grierson (1903), Sundström (1974), Kurimoto (1980), and Price (1967, 1980) discuss additional exam-

ples of similar trade protocols in monetary and barter economies. This includes accounts by Ibn Battuta

in Northern Siberia in the 1300s, by Cosmas of Alexandria in present-day Ethiopia in the 500s, and many

other narratives in the Old World. Batra and Birchenall (2019) provide a more detailed survey of existing

narratives and a discussion of the different views on silent trade. Several remarks listed here are taken from

that source.

34

Page 35: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

are satisfied. There is perfect honesty on both sides; the Carthaginians never touch the

gold until it equals in value what they have offered for sale, and the natives never touch the

goods until the gold has been taken away.”

Exchange based on iterative transactions that minimize direct physical contact are likely

the result of severe disease spillovers. (Language barriers and general concerns for safety

are also seen as reasons for the silent trade.) In the context of sub-Saharan Africa, Hartwig

and Patterson (1978, pp. 6-7) note that “some long-distance trade involved relatively little

human movement because specific communities prohibited transit traffic. Instead, these

communities acquired goods on one frontier and carried what they did not want to keep

across their own territory to exchange with yet another neighboring community. Such

practices reduced the possibility that contagious diseases would spread over large areas.”

Moreover, “[b]efore the European arrival in the interior of Central Africa, trade was localized

and organized in such a way that one ethnic group transported its goods to the limits of

its district, and the next group did the same”; see Azevado (1978, p. 127).17 Decentralized

exchange was also common in the trans-Saharan trade networks as “aliens encountered

severe health problems in West Africa, and merchants tended to turn back at the desert’s

edge”; see Curtin et al. (1995, pp. 93-94).

During colonial times, integration and long-distance trade (i.e., the use of rivers) was

particularly costly in Central Africa. As Azevado (1978, p. 119) notes, indigenous epidemic

diseases including sleeping sickness (trypanosomiasis) and malaria “accompanied the break-

down of barriers between previously isolated peoples.” As Hartwig and Patterson (1978, p.

11) note, “[e]fficient movement of people and goods was essential for administrative control

and economic exploitation of the newly acquired territories. Internal trade barriers were

eliminated and banditry suppressed. Road, railways, and harbors were built and greater

use was made of navigable waterways. [...] Clearly, greater intercommunication meant a

17Azevado (1978) studied the Sara of Southern Chad. Hodder (1977) provides additional accounts of

how sub-Saharan African markets worked “before the colonial period.” For example, Hodder (1977, pp.

256-257) notes that “there is in Yorubaland a remarkable lack of correspondence between the location of

traditional periodic markets on the one hand and the location and hierarchy of settlements in the other.”

Hodder (1977, p. 258) writes “a few isolated traditional markets may often be found around the periphery

of the tribal lands where inter-tribal trade could take place: a few markets, for instance, seem for long to

have existed along the Ubangi river where it forms the boundary between the Ngbandi and Banda peoples.

Similar peripheral markets are to be found along the borders of the Ruandi and Urundi groups.”

35

Page 36: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

more rapid spread of disease.” Many additional accounts of the relationship between in-

creased intercommunication and epidemic diseases in early colonial Africa are discussed in

Brown (1978). It has been long been understood, for example, that railways and transport

routes accelerated and facilitated the spread of disease in Africa.

Altitude. The ‘silent trade’ protocol counters contagion with geographic distancing

between trading partners. Establishing settlements at high altitudes is another form of

geographic distancing to counter tropical diseases.

In Latin America, high altitude cities have been continuously used as administrative,

religious, and cultural centers (e.g., as capital cities) since pre-colonial times. Tenochtitlán

(currently Mexico city) and Cuzco, the capital cities of the Aztecs and Incas, and the most

densely populated cities in pre-Columbian America, were established at high altitudes: 2,420

and 3,399 meters above sea level, respectively. Many other pre-Columbian cities were also

located at high altitudes and remained in use during colonization and independence. As

Hardoy (1993, p. 100) pointed out, “[m]any Latin American capitals were founded in what

we could define as areas of comparatively dense indigenous settlements. These include

Guatemala city, San Salvador, Bogotá, Caracas, Quito, Lima, Asunción, and Santiago de

Chile.” The vast majority of these pre-modern settlements were established in the interior of

each territory and at relatively high altitudes; e.g., Guatemala city (1,500 m.), San Salvador

(1,893 m.), Bogotá (2,625 m.), Quito (2,800 m.), La Paz (3,650 m.), and Tegucigalpa (980

m.).18

The logic behind high altitude settlements is simple. At higher altitudes, disease en-

vironments are less hostile since several tropical disease vectors and bacteria are sensitive

to altitude and UV radiation. As Coatsworth (2008, p. 556) notes, “[t]he concentration

of pre-Columbian populations in the highlands occurred because the disease environment

18Only two countries are landlocked in Latin America (Bolivia and Paraguay) but there were “very few

ports at the time of the Spanish conquest; these were mostly located in the Yucatan Peninsula [..] and on

the Caribbean and Pacific coasts of Guatemala”; see Hardoy (1993, p. 100). Ports in the Caribbean, such as

the ports of Habana and Veracruz, became central during colonial times. Historically, African empires also

had administrative centers but of a much smaller scale compared to the empires of the New World. In fact,

“some African monarchies deliberately avoided urbanization.” For example, “[i]n early Kanem and Mali,

capitals were even more mobile and existed wherever the king made his residence at a particular time”; see

Hull (1976, pp. 2-7). Notice, nonetheless, that the majority of the Ethiopian population still resides in the

northern highlands in the interior of the country.

36

Page 37: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

was less deadly and the soils richer in minerals, both of which helped cities grow larger.”

A detailed account of how the expansion of the Inca empire to the Peruvian lowlands was

regulated to counter the incidence of disease is available in Monge (1948). The Incas were

well aware that Indians “do very badly away from the region to which they are native [...].

Those of hot regions die when they go to cold regions, and if they are out in the cold country

they die on going to the hot.” Strict mobility restrictions prevented “upland Indians” from

going to “coastal plains” and vice versa. In their military campaigns at lower altitudes,

Inca soldiers were replaced “every two months since the coastal lands are unhealthy for the

uplanders”; see Monge (1948, pp. 7-8).

During colonial times, European settlers in the Caribbean established ‘hill stations’ for

military personnel as a way to avoid tropical diseases. Ports were particularly exposed to en-

demic malaria and periodic episodes of yellow fever. As noted by Curtin (1989, p. 62) “[t]he

normal military response once the disease reached a Jamaica port, was flight.” Relocating

soldiers to the mountains succeed in the Caribbean; “[t]he death rate for European troops

on Jamaica dropped from 128 per thousand in 1817-1836 to 60.8 per thousand in 1837-46,

a change credited to the move [to a hill station], no doubt correctly”; see Curtin (1989, p.

49). Hill stations were not successful in Algeria and India due to the misunderstood nature

of malaria; see Curtin (1989, pp. 47-50).

Tropical diseases were also central, according to McNeill (1974, p. 76), in explaining

the “slow pace of Chinese settlement in more southerly parts of what is today China.”

Climate and disease environments are sharply different between Northern and Southern

China. McNeill (1974, p. 77-78) claims that malaria and dengue fever “may have constituted

the principal obstacle to early Chinese expansions southward.” Schafer (1985) contains a rich

account based on administrative records and travel guides of the attitudes and interactions

between the “Man,” aborigines of Nam-Viet, and the “Hua,” Chinese intruders during the

T’ang Dynasty in the seventh century. As noted by Schafer (1985, Chapter 7), malaria and

dengue fever played a prominent role as a deterrent in military intervention and settlements

in tropical areas.

37

Page 38: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

5 Conclusion

This paper examined how aggregate production and exchange efficiency respond to disease

spillovers (i.e., contagion). I sketched out an analytical framework whose inferences follow

from simple concavity principles. Its main purpose was to characterize aggregate efficiency

across a number of centralized and decentralized production and exchange protocols with

differential exposure to disease. To validate the theory’s predictions, I catalogued how past

societies, from very different parts of the world, tried to avoid contagion through social

segregation. Having discussed the theoretical and practical implications of segregation by

disease, it is useful to summarize some general principles in the theory and the perspectives

offered by historians.

The previous instances are not individual but social responses to disease. The baseline

model and the historical perspectives highlight that production and exchange are social

activities. Contagion arises because individuals are not self-sufficient in dealing with ‘dirty’

production or exchange tasks. The social responses noted here are not about the demog-

raphy of the economy; they are not about the size of the different types of agents in the

economy or about the total or mean health human capital. Instead, social responses to

disease target the distribution of the different types in the population across space, broadly

defined. Understanding and measuring the aggregate economic effects of social segregation

by disease requires a distributional framework.

The social responses to disease are organized around the notion of distance. In the

past, as in the simple theory, physical distance provided the only protective barrier against

communicable diseases. Even with a very imperfect understanding of disease, distance took

many forms: permanent social quarantines in the caste system, temporary legal quarantine

orders and isolation in medieval Europe, and permanent land-use separation in industrial

cities. Disease and distancing also posed limits to the extent of the market and the lo-

cation of economic activity. In pre-colonial times, tropical diseases limited the geographic

proximity between economic centers of production, i.e., capital cities at high altitudes, and

international trading ports in coastal areas. Disease exposure even limited the proximity

between economic agents themselves, i.e., buyers and sellers in a ‘silent trade’ protocol.

38

Page 39: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

The historical perspectives noted here are selected, but they are still relevant for un-

derstanding long-run economic growth. As the theory makes clear, all production and ex-

change protocols are less efficient than technologically possible. These efficiency costs take

many forms, but they are likely to introduce path dependencies. The social enforcement of

caste-based rules, for instance, remains much alive in India today despite the many legal

sanctions against ‘untouchability’ and other discriminatory practices. High altitude cities in

Latin America similarly served to consolidate pre-colonial empires. The increased distance

from coastal areas and rugged terrain, however, has led to much higher transportation and

integration costs in today’s globalized world. Since colonial times, Latin American exports

have concentrated in commodities with high value per-weight (i.e., gold, silver, tin, oil,

sugar, flowers, and illegal drugs). Difficulties in centralized state-building appear important

in sub-Saharan Africa today where severe, and often fatal, infectious diseases (i.e., Ebola)

are recurrent phenomena. But not all historical legacies are negative. The segregation of

land-use functions is the most prevalent land use planning tool in modern cities and towns.

In the absence of vaccines, quarantines and isolation remain one of the most important

and effective preventive public health measures against communicable disease outbreaks.

Clearly, not all past social responses to disease have become maladaptive.

References

[1] Acemoglu, D., Johnson, S., and Robinson, J.A. (2001) “The Colonial Origins of Com-

parative Development: An Empirical Investigation,” American Economic Review, Vol.

91, 1369-1401.

[2] Acemoglu, D. and Johnson, S. (2007) “Disease and Development: The Effect of Life

Expectancy on Economic Growth,” Journal of Political Economy, Vol 115, 925-985.

[3] Alsan, M. (2015) “The Effect of the TseTse Fly on African Development,” American

Economic Review, Vol. 105, 382-410.

[4] Ambedkar, B.R. [2016] (1936) Annihilation of Caste, Verso.

[5] Anderson, R.M. and May, R.M. (1992) Infectious Diseases of Humans: Dynamics

and Control, Oxford University Press.

39

Page 40: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

[6] Azevado, M.J. (1980) “Epidemic Disease among the Sara of Southern Chad, 1890-

1940,” in Hartwig, G.W. and Patterson, K.D., Disease in African History, Duke

University Press, 119-152.

[7] Batra, D. and Birchenall, J.A. (2019) “Analyzing the Locations, Theories, and Costs

Associated with Silent Trade,” UC Santa Barbara, Mimeo.

[8] Bayly, S. (1999) Caste, Society and Politics in India from the Eighteenth

Century to the Modern Age, Cambridge University Press.

[9] Becker, G.S. and Murphy, K.M. (2003) Social Economics: Market Behavior in a

Social Environment, Harvard University Press.

[10] Benabou, R. (1996) “Equity and Efficiency in Human Capital Investment: The Local

Connection,” Review of Economic Studies, Vol. 63, 237-264.

[11] Bernstein , D.S. (2005) Matrix Mathematics: Theory, Facts, and Formulas,

Princeton University Press.

[12] Biraben, J-N. (1975-1976) Les Hommes et la Peste en France et dans les pays

Europeens et Mediterraneens, Mouton.

[13] Birchenall, J.A. (2007) “Escaping High Mortality,” Journal of Economic Growth, Vol.

12, 351-387.

[14] Bleakley, H. (2007) “Disease and Development: Evidence from Hookworm Eradication

in the American South,” Quarterly Journal of Economics, Vol. 122, 73-117.

[15] Bleakley, H. (2010) “Malaria Eradication in the Americas: A Retrospective Analysis of

Childhood Exposure,” American Economic Journal: Applied Economics, Vol. 2, 1-45.

[16] Bleakley, H. and Lange, F. (2009) “Chronic Disease Burden and the Interaction of

Education, Fertility, and Growth,” Review of Economics and Statistics, Vol. 91, 52-65.

[17] Bloom, D.E., Canning, D., Kotschy, R., Prettner, K., and Schünemann, J.J. (2019)

“Health and Economic Growth: Reconciling the Micro and Macro Evidence,” National

Bureau of Economic Research, Working Paper 26003.

[18] Brown, J.W. (1978) “Increased Inter-communication and Epidemic Disease in Early

Colonial Ashanti,” in Hartwig, G.W. and Patterson, K.D., Disease in African His-

tory, Duke University Press, 180-206.

[19] Caselli, F. and Ciccone, A. (2019) “The Human Capital Stock: A Generalized Ap-

proach: Comment,” American Economic Review, Vol. 109, 1155-1174.

[20] Coatsworth, J.H. (2008) “Inequality, Institutions and Economic Growth in Latin Amer-

ica,” Journal of Latin American Studies, Vol. 40, 545—569.

[21] Crosby, A. (1972) The Columbian Exchange: Biological and Cultural Conse-

quences of 1492, Greenwood Publishing Group.

40

Page 41: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

[22] Curtin, P.D. (1989)Death byMigration: Europe’s Encounter with the Tropical

World in the Nineteenth Century, Cambridge University Press

[23] Curtin, P.D., Feierman, S., Thompson, L., and Vansina, J. (1995) African History:

From Earliest Times to Independence, Longman.

[24] De Nardi, M., French, E., and Jones, J.B. (2009) “Life Expectancy and Old Age Sav-

ings,” American Economic Review, Vol. 99, 110-115.

[25] Diamond, J. and Panosian, C. (2006) “Continental Asymmetry of the Origins of Hu-

man Infectious Diseases,” in Hämäläinen, P. (Ed.)When Disease Makes History:

Epidemics and Great Historical Turning Points, Helsinki University Press, 17-44.

[26] Douglas, M. (1966) Purity and Danger: An Analysis of Concepts of Pollution

and Taboo, Routledge and Keegan Paul.

[27] Dumont, L. (1970) Homo Hierarchicus: The Caste System and Its Implica-

tions, University of Chicago Press.

[28] Evans, D.K., and Miguel, E. (2007) “Orphans and Schooling in Africa: A Longitudinal

Analysis,”Demography, Vol. 44, 35-57.

[29] Fernández, R. and Rogerson, R. (2001) “Sorting and Long-Run Inequality,” Quarterly

Journal of Economics, Vol. 116, 1305-1341.

[30] Fogel, R. (1994) “Economic Growth, Population Theory, and Physiology: The Bear-

ing of Long-Term Processes on the Making of Economic Policy,” American Economic

Review, Vol. 84, 369-395.

[31] Fogli, A. and Veldkamp, L. (2018) “Germs, Social Networks, and Growth,” Federal

Reserve Bank of Minneapolis Staff Report 572.

[32] Freitas, K. (2006), “The Indian Caste System as a Means of Contract Enforcement,”

University of Texas at Austin. Available from: http://www.eco.utexas.edu/~kmf679/

[33] Grierson, P.J. (1980) [1903] The Silent Trade: A Contribution to the Early

History of Human Intercourse, Green and Son.

[34] Hardoy, J.E. (1993) “Ancient Capital Cities and New Capital Cities of Latin America,”

in Taylor, J., Lemgellé, J.G., and Andrew, C. (Eds.) Capital Cities: International

Perspectives, McGill-Queen’s University Press, 99-128.

[35] Hartwig, G.W. and Patterson, K.D. (1980) Disease in African History, Duke Uni-

versity Press.

[36] Hayashi, F. and Prescott, E.C. (2008) “The Depressing Effect of Agricultural Insti-

tutions on the Prewar Japanese Economy,” Journal of Political Economy, Vol. 116,

573-632.

41

Page 42: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

[37] Hodder, B.W. (1977) Markets in West Africa: Studies of Markets and Trade

among the Yoruba and Ibo, Ibadan University Press.

[38] Horowitz, D.L. (2000) Ethnic Groups in Conflict, University of California Press.

[39] Hsieh,C-T., Hurst,E., Jones, C.I., and Klenow, P.J. (2019) “The Allocation of Talent

and U.S. Economic Growth,” Econometrica, Vol. 87, 1439-1474.

[40] Hull, R.W. (1976) African Cities and Towns before the European Conquest,

W. W. Norton & Company.

[41] Hutton, J.H. (1969) Caste in India, Oxford University Press.

[42] Jones, B.F. (2014) “The Human Capital Stock: A Generalized Approach,” American

Economic Review, Vol. 104, 3752-3777.

[43] Kremer, M. (1993) “The O-Ring Theory of Development,” Quarterly Journal of Eco-

nomics, Vol. 108, 551-575.

[44] Kremer, M. and Maskin, E. (1996) “Wage Inequality and Segregation by Skill,” Na-

tional Bureau of Economic Research, Working Paper 5718.Optimal Partition of a Large

Labor Force into Working Pairs.

[45] Kurimoto, S. (1980) “Silent Trade in Japan,” Research in Economic Anthropology, Vol.

3, 97-108.

[46] McCann, R.J. and Trokhimtchouk, M. (2010) “Optimal Partition of a Large Labor

Force into Working Pairs,” Economic Theory, Vol. 42, 375-395.

[47] McNeill, W.H. (1976) Plagues and People, Anchor Books.

[48] Melosi, M.V. (2008) The Sanitary City: Environmental Services in Urban

America from Colonial Times to the Present, University of Pittsburgh Press.

[49] Miguel, E. and Kremer, M. (2004) “Worms: Identifying Impacts on Education and

Health in the Presence of Treatment Externalities,” Econometrica, Vol. 72, 159-217.

[50] Monge, M.C. (1948) Acclimatization in the Andes: Historical Confirmation of

“Climatic Aggression” in the Development of Andean Man, Johns Hopkins

Press.

[51] Pinter-Wollman, N., Jelic, A., and Wells, N.M. (2018) “The Impact of the Built En-

vironment on Health Behaviours and Disease Transmission in Social Systems,” Philo-

sophical Transansactions of the Royal Society B, Vol. 373, 2-14.

[52] Price, J.A. (1967) “Conditions in the Development of Silent Trade,” Kroeber Anthro-

pological Papers, Vol. 36, 67-79.

[53] Price, J.A. (1980) “On Silent Trade,”Research in Economic Anthropology, Vol. 3, 75-96.

42

Page 43: theory of disease april 2020 - UCSB's Department of Economicsecon.ucsb.edu/~jabirche/Papers/development_theory.pdf · counting for educational outcomes; see also Caselli and Ciccone

[54] Schafer, E.H. (1985) The Vermilion Bird: T’ang Images of the South, University

of California Press.

[55] Schelling, T. C. (1971) “Dynamic Models of Segregation,” Journal of Mathematical

Sociology, Vol. 1, 143-186.

[56] Schilling, J. and Linton, L.S. (2005) “The Public Health Roots of Zoning: In Search of

Active Living’s Legal Genealogy,” American Journal of Preventive Medicine, Vol. 28,

96-104.

[57] Singh, E. (2005) Caste System In India: A Historical Perspective, Kalpaz Pub-

lications

[58] Soares, R.R. (2005) “Mortality Reductions, Educational Attainment, and Fertility

Choice,” American Economic Review, Vol. 95, 580-601.

[59] Sundström, L. (1974) The Exchange Economy of Pre-Colonial Tropical Africa,

St. Martin’s Press.

[60] Weil, D. (2007) “Accounting for the Effect of Health on Economic Growth,” Quarterly

Journal of Economics, Vol. 122, 1265-1306.

[61] Young, A. (2005) “The Gift of the Dying: The Tragedy of AIDS and the Welfare of

Future African Generations,” Quarterly Journal of Economics, Vol. 120, 423-466.

[62] Ziegler, P. (1969) The Black Death, Sutton Publishers.

[63] Zinsser, H. (1935) Rats, Lice and History, Little, Brown and Company.

43