Upload
phamthu
View
225
Download
3
Embed Size (px)
Citation preview
Development Effects of Electrification:
Evidence from the Geologic Placement of Hydropower Plants in Brazil
Preliminary Draft, not for general circulation: This version 7/30/08
Molly Lipscomb University of Colorado, Boulder
A. Mushfiq Mobarak
Yale School of Management
Tania Barham University of Colorado, Boulder
Abstract
Expansions of electricity grids reflect both cost considerations (where is it cheapest to generate electricity?) and demand-side concerns (where are firms and people located, and where is demand for power likely to grow most?). Demand evolves simultaneously with power generation, and complicates efforts to estimate the effects of electrification. This paper attempts to isolate the portion of the variation in grid expansions in Brazil that is attributable to “exogenous” engineering cost considerations to estimate the development effects of electrification between 1950 and 2000. Brazil relies almost exclusively on hydropower, and hydro-power generation requires intercepting water at high velocity. A portion of the spatial variation in the expansion of the electricity grid in Brazil during this period is therefore driven by river gradients suitable for hydro-power generation.
We predict hydropower plant placement based on geologic characteristics (river gradient and water flow) of locations throughout Brazil and then develop a cost-minimizing “engineering model” to predict the expansions of transmission lines from each of those predicted hypothetical stations every decade. The model generates maps of hypothetical electricity grids for Brazil in each decade which show what the grid would have looked like had infrastructure investments been made based solely on geologic cost considerations, ignoring all demand-side concerns. We use these modeled hypothetical maps to instrument for the actual variation in electrification across Brazil over time.
We begin by showing that the very strong cross-sectional correlation between electrification and population density is largely driven by the correlation between population density and water availability. In fixed effect regressions, the effect of grid expansions on subsequent changes in population density is much smaller, and fixed effects instrumental variables estimates show that even this smaller effect is likely a result of electricity grid expansions following population projections, rather than electrification inducing an increase in density. Electricity is estimated to increase GDP per capita under the fixed effects IV. Since the population density results suggest that other ‘general equilibrium’ effects such as in-migration to electrified areas or selective in-migration by skilled workers and firms are not taking place, we interpret the positive GDP result as an indication of a true causal effect of electricity on some aspect of productivity. *We thank the University of Colorado NICHD Population Center, Corporación Andina de Fomento, Center for Advancement in Research and Teaching in the Social Sciences at the University of Colorado and the Macmillan Center at Yale University for the financial support that made this data collection possible. We also thank Daniel Ortega and seminar participants at Corporación Andina de Fomento, UC Energy Institute and Yale School of Management for comments, and Vanessa Empinotti and Steven Li for excellent research assistance.
2
I. Introduction
A fundamental role of governments in developing countries is to provide public services
such as health, education and infrastructure for its citizens. To this end, developing countries
devote about a third of their budget to health, education, and infrastructure programs (World
Bank, 2004). Assisting developing countries in providing services that enhance human
development (such as education, health, water, and electricity) is a primary responsibility of
multi-lateral development agencies, such as the World Bank which spent over 20 billion dollars
in development projects in 2004 alone (World Bank, 2005a).
It is important for social scientists to inform policy-makers about the returns to each type
of public investment so that money is spent effectively to reduce poverty and stimulate economic
growth. Our knowledge about the returns to many health and education initiatives are well
developed, since it has been possible to design small-scale randomized experiments in order to
measure the effectiveness of isolated social interventions, for example, increasing teachers’
salaries or paying parents to take children for preventive health checkups. Designing
randomized experiments for large-scale infrastructure projects (such as building electricity grids)
is beyond researchers’ capabilities, which has limited our understanding of the true impacts of
such projects.
The need for research on the returns to infrastructure is particularly pressing because
there is now renewed support from the development community for large infrastructure projects
as a means to poverty reduction (World Bank, 2003; Ali and Pernia, 2003). This renewed focus
on infrastructure is partly a response to large unmet needs. There are 3 billion people who lack
access to modern energy, 1.1 billion people who lack access to clean water supply, 2.4 billion
3
who live without adequate sanitation, 20 percent of the rural population who live more than 2 km
from an all-weather road, and a third of the world’s population which has never used a telephone
(World Bank, 2005b).
This paper exploits quasi-random variation in hydro-power generation and transmission
in Brazil to determine the causal impact of electricity provision on income and population
density. River gradient (controlling for the slope of the land in nearby areas) is an exogenous
source of variation in the potential for hydro-power generation, and the cost-minimizing
expansion paths of transmission lines from the hydropower generation plants create panel
variation in this exogenous component of electricity generation. We will use these exogenous
geographic components of electricity provision to examine the effects of electricity on income
and population density using county-level data for Brazil from 1960 to 2000.
We focus on both population density and GDP per capita as outcome variables to better
understand the precise mechanisms through which electricity affects socio-economic outcomes –
is it that electrification induces in-migration of highly productive people and firms, or does it
enhance the productivity of existing firms and workers. Cross-sectional instrumental variables
estimates show a very large population density response to electrification, although this may be
due to a positive cross-sectional correlation between density and rivers (which is required for
hydropower, but also attracts people for other reasons). In location fixed effects estimates,
electrification induces a 10-15% increase in density, but this may be driven by electric grid
expansion plans following population projections. Finally, in a fixed effects instrumental
variables estimate, the effect of electricity on population density is statistically indistinguishable
from zero. This suggests that the large positive effect of electrification on subsequent GDP per
capita increases that we observe in our fixed effects IV estimates is perhaps not entirely be
driven by the in-migration of more productive workers and firms.
4
Section II reviews the relevant literature on the effects of infrastructure on development,
section III explains our estimation strategy, section IV presents an overview of the data which we
are using and we are continuing to collect, section V presents estimates on the role of local GDP
and population in the expansion of the electricity networks, and section VI concludes and
presents a framework for the continuing process we will follow in order to strengthen the
instrument and improve the accuracy of our coefficient estimates.
II. Background and Literature
Brazil is a particularly relevant setting for our project because the country relies almost
exclusively on hydro-power to meet its electricity needs. Eighty-seven percent of the electricity
in Brazil is generated from hydro-electric plants as opposed to 23 percent across the developing
world (World Bank, 2005d). This dependence on hydro-electricity suggests that electricity
provision in Brazil is more closely tied to quasi-random geographic variation associated with
river gradients and flow accumulation than electricity provision elsewhere. Twenty-seven
percent of rural Brazilians still lack access to electricity (World Bank, 2005c). Furthermore,
over the course of the study, 1950-2000, electricity networks in Brazil have exploded in size.
The transmission network in Brazil has grown at an average rate of 8.9 percent per year,
increasing in size from 2,359 kilometers in 1950 to 174,806 kilometers in 2000 (Siese, 1991,
2002). In addition, the Brazilian Congress is currently debating the construction of what would
be the world’s second largest hydro-electric dam in the Amazon, and its current president is
committed to increasing hydro-electric energy in the country (New York Times, 2005).
President Luiz Inácio Lula da Silva was recently quoted in the New York Times as saying:
“There was a dereliction in not building hydroelectric projects in the previous government. With
5
the projects that are underway, we can permanently guarantee [supplies of energy] for 5, 6 or
even 10 years down the line.”
Despite the growth in multi-lateral lending for infrastructure in the early 2000s (World
Bank, 2003; Ali and Pernia, 2003), there has been very limited academic research on the impacts
of electricity. In the little evidence on electricity that does exist (Balisacan and Pernia, 2002; Fan
et al., 2002; Balisacan et al., 2002; Taylor, 2005; Escobal et al., 2005), it is difficult to make
causal inferences about its impacts, since studies fail to account for the fact the electricity is
often expanded first in areas with the greatest potential for economic growth. The problem is
particularly acute when researchers simply contrast areas with electricity against areas without
using cross-sectional data (Escobal et al., 2005), since the two types of areas are likely to be
different in important but unobservable ways, and people and firms who expect to benefit most
from electricity access are likely to migrate in response to provision. In studies that use panel
data, it is possible to compare changes in outcomes over time in ‘treatment’ (with electricity) and
‘control’ (without) areas, but the treatment areas have not been randomly selected.1
The lack of clear empirical evidence on the consequences of electricity is in sharp
contrast to the knowledge social scientists have developed on the returns to investments in health
and education. This imbalance is mostly attributable to the nature of statistical inquiry. Since
the provision of government services typically responds to existing conditions or expectations of
benefits, to clearly measure the returns to any public project, we need to observe provision in an
“experimental” setting, where the allocation is randomized either by design or by nature. While
it is feasible for social scientists to design small-scale randomized interventions in health or
1 Fan et al. (2002) use Chinese provincial-level data from 1970-97 and show that for every 10,000 yuan spent on electricity development, 2.3 persons are brought out of poverty. Balisacan and Pernia (2002) use Filipino provincial level data from 1985-1997 to argue that the rich tend to benefit from access to electricity. Balisacan et al. (2002) shows that in Indonesia in 1990, a 10% improvement in access to a composite technology measure (presence of public phones, TV postal office, and electricity in the village) raises the income of the poor by roughly 2 percent.
6
education,2 large-scale infrastructure projects such as roads or electricity networks are rarely, if
ever, randomly placed.
In a simple observational study, we would over-estimate the returns to building electric
grids if electricity is more likely to be provided in areas where the government expects it to lead
to larger benefits. The U.N. commissioned Canambra Engineering feasibility study reports
(1968) drew up expansion plans for the electricity grid in the southern and south-eastern states of
Brazil based on forecasts of load factors (the key demand side factor) that were primarily linked
to local GDP and projected GDP growth. Expansion of networks is prioritized where forecast
loads are largest. Even in a panel data study that controls for location fixed effects, expansion
plans based on forecasts may result in an upward bias in the estimated effect of electricity
provision on various growth measures (e.g. economic or population growth), which are directly
related to the demand side factors for electricity used for grid expansion plans. Conversely, we
may under-estimate returns in an observational study if the government’s objective is to provide
electricity to areas deemed in greater need or if it emphasizes the expansion of coverage to new
regions or states that were previously left uncovered. This paper attempts to develop a
methodology for estimating the returns to electricity that accounts for the non-random placement
of power grids.
Geographic variation has been used as a source of identification in a few empirical
studies of the effects of public investment in new infrastructure projects. Michaels (2007) uses
whether a county is connected to the interstate highway network to estimate the effects of
lowered trade barriers on the demand for skilled labor. Duflo and Pande (2005) use slope as an
instrument for the placement of dams in Indian districts. Since slope does not vary over time,
they create an interaction of slope and state-wide dam construction in each time period to create 2 See Kremer (2003) for a review of some randomized experiments in education. Other examples include: Miguel and Kremer (2003); Banerjee et al., 2004; Gertler and Boyce, 2001; Shultz, 2001; Kremer et al., 2002; Thomas et al., 2003.
7
panel variation in their instrument. Dinkelman (2008) is the most closely related paper to ours,
and uses land gradient as an instrument to show that there are significant cooking technology and
female labor participation impacts of communities’ connection to the electricity grid after 1996
in Kwa-Zulu Natal province of South Africa. In her design, slope increases the cost of providing
electricity, making a connection less likely. In our study we use river gradient as a positive
predictor of hydropower dam placement, and are careful to always include land slope as a control
variable since - as Dinkelman (2008) notes - land slope may be correlated with agricultural
outcomes, the cost of providing other public services, and other characteristics of the population.
In Kwa-zulu Natal the correlation between slope and other socio-economic variables Dinkelman
(2008) gathers data on is low, but this may not be more generally true across Brazil. Also, since
land or river gradient would only have cross-sectional variation, we rely on a simplified cost-
minimizing engineering model of the grid expansion to examine the cross-sectional time series
effects of electricity grid expansion across Brazil over the time period 1960-2000.
III. Estimation Strategy
Even though electricity is not randomly allocated, households’ access to electricity in a
country that relies heavily on hydro-power may have an exogenous geographic component to it
because it would depend on proximity to rivers with a gradient suitable for hydro-electricity
generation. Our research design exploits the quasi-random placement of hydro-electric plants in
Brazil based on variation in geography that affects the possibility of hydro-power generation.
Generating hydro-electricity requires high-velocity water flow, which is contingent on
intercepting water at a steep gradient. The ability to construct reservoirs and dams and the
availability of sufficient water flow are crucial to guaranteeing consistent electricity supply from
a given power plant. Hydroelectric plants must be constructed where river depth and flow is
8
sufficiently large to maintain the plant throughout the year (Canambra, 1968). Since geography
(particularly river gradient and flow) affects the suitability of hydro-electric plant construction,
this creates some natural variation in the source points for electricity across Brazil. To the extent
that the physical slope of a river is uncorrelated with socio-economic characteristics of the local
population, this creates a nice natural experiment where observationally similar households are
either more or less likely to be endowed with electricity access depending on the their distance to
an appropriately sloped river. One possible drawback of this methodology is that geography
(and in particular the slope of the land where a household resides) could indeed be correlated
with socio-economic outcomes. Slope can affect land use, agricultural outcomes, and
occupational choices.3 However, river gradient (which affects hydro-power generation and
suitability of dam construction) is typically different from the slope of the land, so we can
separately control for geographic characteristics that are related to socio-economic outcomes,
including the slope of the land, while simultaneously using river gradient as an instrument for
hydropower plant construction.
Measures of river gradient and water flow only vary in the cross-section and are constant
over time at a given location. To generate panel variation in the predicted expansion of electricity
grids, we develop a simplified engineering model of the least-cost placement of transmission
lines and sub-stations (which distribute the electricity generated by our predicted generation
plants). Electricity transmission networks transport electricity from the generation plant to the
regional hub which provides electricity to local distribution networks. Construction of new
transmission lines is expensive and requires extensive planning in order to ensure that the new
lines do not cause system shortages. Our simplified engineering model is decidedly a-behavioral
3 Dinkelman (2008) uses slope as an instrument for electricity provision in South Africa, arguing that it’s more costly to extend coverage to sloped areas. Apart from the relationship of land gradient to land use and agriculture, gradient might also make it costlier to supply other public services,
9
in that transmission lines are chosen to simply minimize costs without any demand side
considerations. The actual placement of transmission lines in the real world is based on both cost
and demand-side behavioral and human considerations, but since we consider pull factors from
demand to be endogenous, the idea here is to extract only that portion of the variability in
electricity availability that is attributable to variation in river gradient, water flow, and distance
to the locations that possess those geographic characteristics suitable for generating hydropower.
IV. Data
Brazil has experienced almost exponential growth in its electricity network since 1950
(figure 1), largely fueled by hydropower. The transmission network expanded from 2,359
kilometers in 1950 to 167,443 kilometers in 2000 (SIESE, 2000). We have assembled a database
of the locations of all power plants and all electricity transmission substations across Brazil from
the 1960s until 2000, using the feasibility studies and inventories that major electricity
companies in Brazil undertake prior to planning expansion of their networks. The power plant
and transmission line data come in two forms – (1) tables with inventories of all transmission
lines that typically specify the county of origin, the destination county, length and voltage (for
transmission lines) and location (for hydropower generation plants, or (2) large paper maps of
generation plants and lines by region of Brazil. We digitize and combine all this information
into five different GIS maps of the Brazilian electricity network for the 1960s, 70s, 80s, 90s and
2000.4 Power plants were placed on the digital map according to their reported latitude and
longitude, while transmission substations were assumed to be located at the centroid of their
4 The 1960s network is based on the comprehensive inventory taken by Canambra (1967) and Canambra (1969) for 1965 and 1967. The 1970s network is pieced together from various maps and tables from the different regions of Brazil from a variety of sources (e.g. see Figure 5). The 1980s network is based on another comprehensive inventory by SIESE (1987). The 1990s is again pieced together from various sources (e.g. Furnas 1993), and the 2000 network is based on SIESE (2000).
10
county of record. Please see Figures 2-5 for examples of the maps and tables which were used to
construct the data set. 775 major electricity plants have been constructed in Brazil since 1910,
and 546 of which are hydroelectric plants.
A. Unit of Observation:
Constructing a digital map database of the evolution of the electricity network allows us
to overlay GIS maps of elevation and rivers and merge information on slope and water flow to
the power plant and transmission data by location. We define “location” (i.e. our spatial unit of
observation) to be a grid of 32,500 evenly spaced points for all of Brazil set 16km apart from
each other. Our task then is to create measures of electricity availability, land slope, river
gradient, water availability and water flow for each of these grid points. Choosing grid points as
our unit of observation rather than states or counties helps diffuse potential bias arising from
political economy factors that might determine the shapes and sizes of political jurisdictions.
B. Constructing measures of electricity availability:
We have data on generation plants and transmission lines, but not on distribution
networks, which is the final component of the electricity grid that connects users to the sub-
stations. Transmission lines transfer electricity from the generation plants to the regions which
are being supplied, while distribution networks transport electricity from the major local
transmission substation to household, industrial and agricultural consumers of electricity. It was
not possible to map these distribution networks going back to the 1960s because electricity
distribution in Brazil is decentralized across 64 privatized electricity companies, and there is no
central clearinghouse for data on their operations. We do however need to account for the
distribution network since assuming that only areas with substations have electricity would be
unnatural. Based on our conversations with electricity sector professionals in Brazil, we assume
that distribution networks stretch 100 km across, so all grid points within a 50 kilometer radius
11
of the centroid of a county containing a transmission substation have access to electricity. Figure
6 illustrates how these ideas are implemented in South Brazil for the 1960s. The dark blue points
are counties which have transmission substations and the light blue circles surrounding them are
assumed to be the distribution networks associated with those substations. All grid points (black
dots) which fall within light blue areas are assumed to have access to electricity. Note that in our
data electricity availability is only a direct function of the location of substations (i.e. the
endpoints of transmission lines), and not the placement of hydropower plants. Building long
transmission lines is costly, so proximity to power plants does indirectly affect a location’s
likelihood of having access to power.
Figures 7-11 map the evolution of the electricity network in Brazil from the 1960s
through 2000. As one might expect, the early development of the electricity network was
focused in relatively affluent and industrial south and from 1970s onwards the grid was
expanded to the populous (but poorer) south-east and north-east. The network has expanded
westward every decade since the 1970s, and by 2000, the coastal areas of the south-east and
north-east had almost universal coverage. The Amazon and Pantanal areas have remained
largely uncovered.
C. Geographic data:
Figure 12 demonstrates our construction of the river gradient instrument. We draw circles
of radius 10km around the evenly spaced grid points throughout Brazil and measure average land
slope in those circles to use as a control variable. We then overlay a map of water bodies, create
2 km buffers on either side of each river, and compute the gradient along the river using
elevation maps. We use this river gradient along with measures of water flow to help predict
plant location (while simultaneously controlling for land slope in the surrounding areas). The
specific indicators of water availability used are average and maximum flow accumulation
12
(which measures the amount of water flowing into each point on the river), calculated based on
GIS maps from U.S. Geological Survey’s Hydro1k program.
D. Constructing the instrument:
While the exogenous geographic factors such as river gradient and flow accumulation
help predict hydropower plant location, we need to construct an instrument for “predicted
electricity availability in each decade”, which accounts for the evolution of transmission lines
and substations and varies both over time and in the cross section. We construct a very simple
engineering model of electricity grid expansion in which decisions are made solely based on
geography-induced cost considerations in order to generate a prediction for whether each of the
32,500 evenly spaced grid points has electricity access in each of the five time periods of data
between 1960 and 2000.
Our objective is to generate matched predictions for the five specific time periods of data
on the actual electricity grid available to us – 1960s based on inventories conducted in 1965 and
1967, 1970s based on maps from 1973 of the network, 1980s based on a comprehensive
inventory conducted in 1987, 1990s and 2000 from the National Agency for Electricity’s data on
recent construction of major transmission lines. From the perspective of the engineering model,
the specific dates for which we have data (i.e. the years that inventories were conducted) is
essentially arbitrary, which implies that the scale of expansion between two periods – i.e. the
number of new power plants and transmission lines built since the last inventory - would remain
indeterminate. We therefore match the scale of expansion between two periods to match the
scale of investment in hydropower plants observed in the actual data. In other words, we allocate
a budget of 240 power plants to the model in the 1960s because that is the number of
hydropower plants in existence in Brazil by 1967 - our inventory date for that decade. By similar
reasoning, the budget for 1970s was 53 additional power plants, 36 additional plants for 1980s,
13
25 additional plants for 1990s, and 24 additional plants for 2000. The model takes these budgets
as given, and chooses the optimal location of hydropower plants and transmission lines based
purely on exogenous geological factors.
The first step in the model is to choose the locations of the budgeted hydropower plants.
The model uses as inputs (i.e. “instruments”) whether the location (a circle of radius 10km
around each grid point) has a river, the average and maximum gradient of the river, maximum
water flow accumulation anywhere within that circle, and an indicator for whether that location
falls in the Amazon. We let the data guide us on the relative importance of each of these factors
in determining plant location. Specifically, we run a probit regression of actual hydropower plant
locations on these instruments to generate predicted probabilities for hydropower plant
placement as a function of geologic conditions (see table 1). As expected, hydropower plants are
most likely to be located in areas with steep river gradient, with greater water flow, and away
from the Amazon. Armed with this prediction, our model places the first 240 power plants in the
highest probability locations based on geologic considerations in the 1960s. Figure 13 maps
these 240 predicted plants as red dots and the entire set of hydropower plants used to generate
the probit predictions as yellow dots against a background of elevation (darker colors are closer
to sea level) and rivers. Our model predicts a large number of power plants along the south-east
to north-east corridor (Sao Francisco river basin) where elevation suddenly moves to high from
the low-lying areas of the coast, implying a steep increase in slope. One drawback is that the
model predicts excessively dense clustering of hydropower plants due to the high spatial
correlation in geologic factors conducive to hydropower plant construction. In a future version of
this model, we plan to introduce a slightly more sophisticated decision-making rule where plants
cannot be constructed too close together along the same river.
14
The next step in the model is to predict the locations of substations (i.e. directions of
transmission lines) which deliver the electricity generated at each plant predicted from the
previous step. We make the simplifying assumptions that all power plants have the same
generation capacity, and - guided by the data - that each plant has exactly two transmission
substations which are connected through a single line (the average number of transmission
substations per power plant as of 2000 was 2.5 (SINDAT, 2008)). The electricity network is
fully durable, and new substations and power plants cannot be placed in locations which have
already received electricity in prior decades.
The model arrives at the optimal lowest cost electricity network in each decade by
computing costs for all possible arrangements of transmission lines. There are a finite but
arbitrarily large number of possible permutations of transmission lines, and the numeric method
we use to arrive at the lowest-cost grid in equilibrium is detailed in the Appendix. The model
assumes that cost increases with distance and is very high when building substations in the
Amazon (due to high material transport costs). In a future version, we plan to introduce other
natural geographic barriers (e.g. crossing a mountain) as cost-enhancing. Once the equilibrium
set of transmission lines are determined, we assume that all grid points within a 50 kilometer
radius of any substation will receive access to electricity, accounting for the distribution network
for that substation. In other words, we purposefully remain agnostic about the direction in which
the distribution networks are expanded because that choice is governed by demand-side factors
which are ‘endogenous’ to measures of economic performance. The chosen 50km radius is also
data-driven (based on the average size of actual distribution networks in Brazil), and mirrors our
treatment of distribution networks in the actual electricity data. In subsequent decades, new
power plants are placed in the highest probability circles among those which have not yet
15
received electricity, and alternative grid points for transmission substations are proposed from
among the points which remain without electricity from the previous decades.
Figures 14-18 plot the areas predicted to receive electricity by this model by decade.
There is a reasonably good cross-sectional spatial correlation with the actual electricity network
for Brazil (figures 7-11), and there are encouraging signs of correlation in terms of direction of
expansion. Geologic considerations induce the model to cover the south and south-east region
first, and then move towards the northeast and slowly west-ward, which is a pattern we observe
in the actual data. However, overall the model chooses to cover a much greater portion of Brazil
with electricity in the first period (1960s) than we see in the actual data. As a result, there is
much lower panel variation in coverage in the time-series dimension than we see in the actual
data, as highlighted in figure 19. As we will see, this adversely affects our predictive power in
the panel regressions. This problem can be traced back to our simplifying assumption that all
hydropower plants have the same power generation capacity which cannot be expanded. In
allocating a generation plant budget to the model for each decade, we assumed that the scale of
expansion of the electricity grid has to match actual number of power plants built between time
periods. This ignores the fact that many power plants were initially built small and had their
capacity expanded in later decades, with an associated increase in the number of substations
allocated to the plant. The constant capacity (and constant number of 2 substations per plant)
assumed therefore forces the model to over-predict electricity availability in earlier years and
under-predict it in later years. We are currently putting together data on the evolution of the
generation capacity of each plant and the cost of expanding capacity relative to the cost of
building new plants, so that we incorporate a decision on whether to expand capacity on existing
plants versus build new plants into the engineering model.
E. Dependent Variables
16
This study examines the effect of electricity on changes in population density, GDP per
capita and industrial GDP per capita from the 1960s until today. County level population counts
are based on the decennial demographic censuses, while county GDP number are estimated by
the Instituto de Pesquisa Economica Aplicada (IPEA) based on various sources of data including
the census of manufacturing and of agriculture. Brazilian counties have split numerous times
over the period of our analysis. Since we do not have digital county maps from 1960, we are
unable to assign the grid points (our spatial unit of observation) to the exact county they fall into
in every decade. However, we do know the area of each county for every decade. In order to
approximate the boundaries of each county over time, we digitally identify the location of all
counties existing in a given decade from the centroid of the county as of 2001 and draw circles
around the centroid of those counties such that the area of the circle equals the area of the county
in the decade of interest. We assume all grid points which fall in the circles belong to the county
in that year. In cases where the grid point falls into more than one county circle, the average
value of the dependent variable is used. Grid points which fall outside of all circles are assigned
to the nearest county circle in that decade. Figure 20 plots the 1960 county proxies, which is the
year with the fewest number of counties. Although our unit of observation is grid points, the
underlying source of data is at the county level, so we cluster all errors by 1960 counties, which
is the most conservative assumption on clustering (for later decades we actually have more
counties and therefore more data).
Table 2 reports the summary statistics for these dependent variables. There is much
greater cross-sectional variation in population density than there is panel variation holding
locations fixed. Population density is also the variable that we feel is measured most accurately
over time. Given the way county-level GDP is estimated by IPEA, the cross-sectional
differences in these series are of higher quality than changes in the panel within locations. In
17
other words, after the addition of location fixed effects, it is likely that the signal to noise ratios
in the GDP series decrease dramatically. So for GDP, one has to be more careful making
inferences with fixed effects estimates.
V. Results
There are a number of distinct mechanisms through which electricity may affect
economic outcomes. The most direct mechanism, and the one most closely related the research
questions motivated in the introduction, is the productivity-enhancing effects of, say, lighting a
school room or refrigerating perishable vaccines, or introducing electricity-powered capital into a
firm’s production process. A less direct but equally plausible mechanism is that electrification
induces the movement of people and firms. And if more highly skilled workers and firms
respond differentially to the availability of electricity, the resulting in-migration to electrified
areas can also lead to a positive correlation between electricity and socio-economic outcomes.
For example, if electricity and worker skill are complementary inputs in the production process
and if workers are mobile, then we may observe a large positive effect of electricity on firm
productivity or area GDP per capita even if electricity is not itself highly productive as an input.
In that case, electrification only leads to a spatial reallocation of resources within a country rather
than an overall productivity boost at the macro level. And in that case our estimates will not
speak directly to the comparison between the returns to spending an extra dollar of funds on
educational investments or electricity investments.5
5 Other ‘general equilibrium’ connections between electricity investments and socio-economic outcomes can also prevent us from isolating direct causal effect of electricity on productivity. For example, imagine that with limited budgets, simple cost-benefit calculus induces governments to allocate a greater share of the budget to electricity in areas where electricity is low-cost to generate (e.g. sloped areas with water, as in our instruments), whereas in high-cost areas a greater share of the budget is allocated to other public services such as health, sanitation or education. In that case, even in the instrumental variables regression we would be comparing electrified areas with low levels of other public services to non-electrified areas with high levels of other public services, which, in this case, would lead
18
The multiple potential mechanisms at play suggest that it would be useful to examine
electricity’s effect on a variety of outcome variables that can help isolate specific mechanisms.
For example, we could examine the effect of electricity on migration patterns, especially as it
relates to movements of firms and worker types. This may help identify whether a productivity-
enhancing effect of electricity is due to the in-migration of productive firms to electrified areas
or to increases in productivity at existing firms. Unfortunately, we do not yet have access to
such specific variables over our long panel stretching back to the 1960s. So we begin by
exploring the effects of electricity on two summary outcomes for which it is possible to get long
panels of data – population density and GDP per capita. These two variables are related to the
two main hypothesized mechanisms at play – the movement of people and direct production
function effects. Population density changes likely reflect migration since the magnitudes of the
short-term effects of electricity on mortality and fertility – which are the two other important
sources of change in population – are unlikely to be large enough to affect population density
numbers greatly. GDP per capita changes may reflect both differential migration of skilled
workers and productive firms, or improvements in the productivity of existing firms and
workers. We will thus begin with an examination of population density to see whether the
migration mechanism appears empirically important, which will in turn help us better interpret
the sources of any changes in GDP and industrial GDP per capita.
Our regression analysis of the effect of electricity on population density and GDP use
different types of variation in the data in turn. We begin with a purely cross-sectional approach
and take advantage of variation across all regions and states of Brazil (while always clustering
standard errors by the large 1960 counties). As one would anticipate based on a cursory glance
to an under-estimate of the returns to electricity if those other public services are productive. The solution to this type of a problem is conceptually straightforward – we would need to add control variables for such other productive public services in our regressions.
19
of the maps of the actual electricity network in figures 7-11 and the maps of the predicted
(modeled) electricity network in figures 14-18, our instrument’s fit to the endogenous electricity
variable in this case is excellent. Areas predicted by our geologic instruments and model to have
electricity have a 43 percentage point greater chance of actually having electricity available.
However, this statistically significant correlation is somewhat driven by the Amazon dummy in
our predicted hydro-power plant locations, and the large effect of the instrument reflects the
large differences in electrification between the Amazon and non-Amazon regions of Brazil.
These regions are different in other economically-relevant dimensions as well, so we next show
estimates that include region fixed effects, so that the empirical inference is not based on
differences across the disparate regions. The estimated effect in the first-stage regression
immediately jumps down: Areas predicted by our geologic instruments are 8 percentage points
more likely to have electricity than other areas within the same region. The third column in table
3 shows that state dummies have a very similar effect on the first-stage estimates as region
dummies (predicted locations are also 8 percentage points more likely to actually have electricity
as other locations within the same state). In column 5 we add random effects by grid point in
addition to region dummies, and in the last column fixed effects by grid point. The instrument
remains positive and strongly predictive of the actual electricity distribution. Land slope is also
strongly correlated with hydropower plant placement, which both (a) works to reduce the power
of our instrument (due to the correlation between land slope and river gradient, and that land
slope is a separate control variable in the second stage), and (b) makes it important to separately
control for slope in the second stage if there are other possible links between slope and socio-
economic or demographic outcomes.
The first three columns of table 4 report OLS estimates of electricity availability (lagged
one period) on population density. All specifications always separately control for land slope
20
and standard errors are always (conservatively) clustered by the large county areas in 1960. The
second and third columns add region and state dummies respectively, which expectedly lowers
the magnitude of the effect of electricity on population density. The estimated effects are quite
large – electricity availability in an area is associated with its population density being twice as
large the following decade (at the mean), even relative to other areas within that same state. The
average population density in the non-Amazon regions of Brazil is 23 people per square
kilometer during the entire sample 1960-2000, and this average rises to 33 people in the non-
Amazon and non-Pantanal regions. The instrumental variables estimates of the effect of
electricity on population density (in the last 3 columns) are always larger than the corresponding
OLS estimates, which implies that areas suitable for hydropower plant construction (and
transmission line expansions from those plants) have an even an even larger correlation with
dense population. Moreover, the IV estimates get even larger when region and state dummies
are added, indicating strong population clustering near hydropower suitable sites within each
state.
If we were to ascribe causality to these IV estimates, then the results would be consistent
with electricity being targeted more to areas that were sparsely populated. Such targeting would
appear to be most pronounced within states. In other words, under-populated parts within states
were targeted more than denser parts, as opposed to some states being targeted more than others.
However, since these are cross-sectional estimates, then there is another plausible interpretation
of this data. In the cross-section, our instrumental variables estimates are essentially comparing
areas that have rivers with a steep gradient and voluminous water flow against all other areas that
do not. This control group is actually comprised of three different types of areas: those with flat
rivers, those with narrow, shallow rivers, and those with no river at all. Given the presence of no
or low water areas in the control group, our IV estimates may be conflating the effects of
21
electricity with the effects of having water available. And water may draw dense population
concentrations for reasons completely independent of electricity generation. Note that this
inference problem would be completely mitigated in panel regressions where we control for
location fixed effects, since water availability at flow at each location is essentially a fixed
factor. Thus it is important to present fixed effects instrumental variables estimates in this
context in order to isolate the causal effect of electricity availability.
In summary, we find a strong positive cross-sectional (across Brazil, within-state and
within-region) correlation between electrification and dense population using OLS, and an even
stronger correlation between hydropower generation (and associated grid expansion) and
subsequent population density using IV. It is quite possible that the former is driven by reverse
causality (grid expansions follow population projections, which makes those projections self-
fulfilling), and the latter is driven by people’s attraction to both electricity and water. Whatever
the nature or direction of the causality, there is certainly a relationship between electrification
and in-migration, as evidenced by the very large population density gains. Such large gains in
density are unlikely to be driven by changing fertility and mortality within a decade.
Table 5 examines the effect of electricity on GDP per capita, again in the cross-section.
The OLS estimates indicate a strong positive effect, but that effect is understandably smaller
once region dummies or state dummies are added. Electrifying a location is associated with a
15% larger GDP per capita there the following decade relative to other areas within the same
region. The effect becomes smaller and loses statistical significance under the state dummies
specification, but with the caveat that the methods to estimate county-level GDP by IPEA using
underlying sources that were designed to be reported only at the state level probably carries a lot
of measurement error when we focus on its within-state variation. The instrumental variables
estimates of electricity on GDP per capita are less precisely estimated and quite variable
22
depending on nature of the data variation being used, ranging from a strong positive effect across
Brazil to a negative point estimate (but an effect that is statistically indistinguishable from zero)
within states. Estimates of electricity’s effect on just the industrial component of GDP are much
more stable. The OLS estimates are positive and statistically significant and indicate that within
a region, electricity increases industrial GDP by 45% at the mean year-2000 value. The IV
estimates are of comparable magnitude, but the standard errors are much larger. The IV point
estimate suggests that electricity increases industrial GDP by 20% at the mean within regions,
but this is imprecisely estimated and we cannot rule out that the true effect is zero. As with the
population density specifications, the effect on GDP in the cross-sectional IV estimates may
reflect the effect of both water and hydropower.
We estimate random effects and fixed effects models in tables 7 and 8, taking full
advantage of the panel dimension of our dataset. In table 7 we add random effects by grid point
in addition to the region dummies in the IV model, which assumes that location specific
characteristics are uncorrelated with our predicted electricity instrument. Table 8 relaxes this
questionable assumption and reports a results from a “within” estimator where we effectively
regress the decadal change in the outcome variable (GDP or population density) on the lagged
change in our model’s prediction for whether that particular location has electricity. The random
effects IV model shows that electrification increases GDP per capita by about 20% within
region. The estimate for industrial GDP has a comparable magnitude but the effect is not
statistically significant. Electrification has a large positive effect on subsequent GDP per capita
changes in the fixed effects IV regressions.
The most notable difference between the panel and pooled cross-sectional models is that
the effect of electricity on population density is estimated to be much smaller in the panel. Once
we add location (i.e. grid point) fixed effects without instrumenting in column 3 of table 8, we
23
find that electrification is associated with a 10-15% increase in population density the following
decade. This effect is an order of magnitude smaller than all cross-sectional OLS and IV
estimates. Thus the cross-sectional IV estimates do indeed appear to be biased upward due to a
positive correlation between rivers and density. Moreover, even the remaining 10-15% positive
effect we uncover in this fixed effects regression may be driven by the electricity grid being
expanded to areas where planners expect future population growth (and acting on that
expectation in turn fulfills that growth promise). To account for this, we report within-region
random effects IV and grid point fixed effects IV estimates in the last columns of tables 7 and 8,
and find that the effect of electrification on population density is statistically indistinguishable
from zero in both specifications. Any upward bias from grid expansion plans based on
population projections appears to be small.
If the fixed effects IV results are successful in isolating the causal effect of electricity,
then the GDP and population density results indicate that there is a large GDP per capita increase
following electrification which cannot be fully explained by large-scale movements of people.
Thus electricity likely has a direct positive production function effect in stimulating greater
economic productivity.
We are concerned about the poor fit of the predictive engineering model in the panel
dimension, particularly in the earlier two decades when we vastly over-predict electricity
availability and under-predict its growth. Table 10 therefore re-visits the fixed effects
instrumental variables models on a restricted sample of 1980s-2000, since our model’s
predictions of electricity growth during this period matches reality (although we still
systematically under-predict electricity level during this period). The instrumental variables
estimates with grid-point fixed effects produce incredibly large estimates of electricity on GDP.
Both GDP per capita and industrial GDP are predicted to more than double following
24
electrification. Electricity induces a population density increase of about 50-60% at the mean,
but the standard error associated with this estimate is again very large.
VI. Conclusion
Understanding the role of access to electricity in economic development is crucial to
planning long term investments in infrastructure improvements in developing countries. Taking
advantage of the close link between geology and hydropower plant construction in Brazil, we
produce quasi-experimental estimates of the long-term effects of electrification on local income
and population density over four decades. Our methodology is able to predict quite well the
cross-sectional variation in electricity availability across regions of Brazil based only on geologic
factors and cost-minimizing engineering considerations. Our model still requires refinements in
order to improve its prediction of the panel variation in electrification, particularly in the inter-
temporal dimension.
While this geography-based instrumental variables methodology can help isolate the
causal effect of electrification on economic development, there may be multiple mechanisms that
mediate this relationship that are interesting to distinguish. Electricity may induce the in-
migration of people and firms, or the in-migration of a certain type of (e.g. disproportionately
higher quality) worker or firm, and it may simply improve the productivity of existing workers or
firms. From a national-level policy perspective, the last effect is a pure net gain while the
migration-related effects are in part a reallocation of already productive resources. Our focus on
two different types of dependent variables – GDP and population density – helps to distinguish
between the two types of mechanisms to a certain extent, since population density changes are
closely related to migration, while GDP changes reflect an overall effect that aggregates both
mechanisms.
25
Using cross-sectional variation in the data we find very strong effects of electricity on
population density, in the sense that areas with geologic characteristics conducive to building
hydropower plants are predicted to be more than twice as dense on average than areas that are
not. The panel (fixed and random effects) estimates suggest that that electrification during a
decade induces about a 10-15% increase in population density at those locations in the following
decade. In the fixed effects IV estimates (where we simultaneously add the location fixed effects
and use our instrumental variables strategy to address the non-random placement of the
electricity grid), the effect of electricity on population density is statistically indistinguishable
from zero.
Although the GDP estimates are less stable, electricity appears to have a strong positive effect
on subsequent GDP per capita even in the fixed effects instrumental variables specification. The
accuracy of our first-stage predictions of the panel variation in location of electricity provision
still needs to improve. New data on capacity of each generation plant should help in this regard.
Further research remains to be done on the mechanisms through which the provision of
electricity affects socio-economic outcomes. We plan to use a broader set of outcome variables
from household and firm surveys to more precisely understand these mechanisms.
26
References Ali, I. and E. Pernia (2003). “Infrastructure and Poverty Reduction: What is the Connection?”
ERD Policy Brief Series No. 13, Economics and Research Department, Asian Development Bank, Manila, Philippines.
Arellano, Manuel and Stephen Bond (1991). “Some Tests of Specification for Panel Data: Monte Carlo Evidence and an Application to Employment Equations.” Review of Economic Studies. V.58 No.2. p.277-297.
Balisacan, A., Banerjee, A., Cole , S., E. Pernia. (2002). “Probing Beneath Cross-National Averages: Poverty, Inequality, and Growth in the Philippines.” ERD Working Paper Series No. 7, Economics and Research Department, Asian Development Bank, Manila.
Balisacan, A., E. Pernia, A. Asra. (2002). “Revisiting Growth and Poverty Reduction in Indonesia: What Do Sub-National Data Show?” ERD Working Paper Series No. 25, Economics and Research Department, Asian Development Bank, Manila.
Banerjee, A., S. Cole, E. Duflo, and L. Linden (2003). “Remedying Education: Evidence from Two Randomized Experiments in India,” mimeo, MIT, November 2004.
Baldacci, E. N. Guin-Siu, L. de Mello (2003). “More on the Effectiveness of Public Spending on Health and Education: A Covariance Structure Model,” Journal of International Development, 15(6), 709-725.
Bloom, D., D. Canning, and J. Sevilla (2001). Health, Human Capital, and Economic Growth Randomized Evaluations of Educational Programs in Developing Countries: Some Lessons. Commission on Macroeconomics and Health Working Paper #WG1:4.
Briceno-Garmendia, C. Stache, A. Shafik. (2004). “Infrastructure Services in Developing Countries: Access, Quality, Cost and Policy Reform.” World Bank Policy Research Working Paper No 3468, Washington D.C.
Cachapuz, Paulo Brandi de Barros (2002). O Planejamento da expansão do setor de energia elétrica : a atuação da Eletrobrás e do Grupo Coordenador do Planejamento dos Sistemas Elétricos. Eletrobras, Rio de Janeiro.
Canambra engineering consultants, Power Study of South Brazil. 1969. Dinkelman, Taryn (2008). “The Effects of Rural Electrification on Employment: New Evidence
from South Africa.” Duflo, E. and R. Pande (2005). “Dams”, mimeo MIT, July. Duflo, E. (2001). “Schooling and Labor Market Consequences of School Construction in
Indonesia: Evidence from an Unusual Policy Experiment,” American Economic Review, 91(4), 795-813.
Oster, E. (2005). Sexually Transmitted Infections, Sexual Behavior and the HIV/AIDS Epidemic, mimeo, Harvard.
Escobal, G. and M. Torero (2005). “Measuring the Impact of Asset Complementarities: The Case of Rural Peru,” Cuadernos de Economia, 42(May), 137-164.
Fan, S., L. Zhang, and X. Chang (2002). Growth, Inequality and Poverty in Rural China: The Role of Public Investments, Research Report 125, International Food Policy Research Institute, Washington D.C.
Gertler P. and S. Boyce (2001). “An Experiment in Incentive-Based Welfare: The Impact of Progresa on Health in Mexico,” mimeo, University of California at Berkeley.
27
Glewwe, P., M. Kremer, and S. Moulin (1998). “Textbooks and Test Scores: Evidence from a Randomized Evaluation in Kenya,” Washington, DC: World Bank Development Research Group.
Gramlich, E. (1994). “Infrastructure Investment: A Review Essay,” Journal of Economic Literature. 32(30), 1176-1196.
International Monetary Fund (IMF) (2004). International Financial Statistics, Washington D.C. Kremer, M. (2003). “Randomized Evaluations of Educational Programs in Developing
Countries: Some Lessons,” American Economic Review 93(2) , May 2003, pp. 102-106. Kremer, Michael, Josh Angrist, Eric Bettinger, Erik Bloom, and Elizabeth King (2002).
“Vouchers for Private Schooling in Colombia: Evidence from a Randomized Natural Experiment,” American Economic Review, 92(5), 1535-1558.
Linden, L., A. Banerjee, and E. Duflo (2003). “Computer-Assisted Learning: Evidence from a Randomized Experiment,” Poverty Action Lab Paper #5.
Londono, J. (1996). “Poverty, Inequality, and Human Capital Development in Latin America 1950-2025,” The World Bank, Washington D.C.
Mehrotra, S. and E. Delamonica (2002). “Public Spending for Children: An Empirical Note,” Journal of International Development, 14 (8), 1105-1116.
Michaels, Guy. (2007). “The Effect of Trade on the Demand for Skill—Evidence from the Interstate Highway System.” Forthcoming, Review of Economics and Statistics.
Miguel, E. and M. Kremer (2004). “Worms: Identifying Impacts on Education and Health in the Presence of Treatment Externalities,” Econometrica, 72(1), 159-217.
Mingat, A. and J. Tan (1996). “The Full Social Returns to Education: Estimates Based on Countries’ Economic Growth,” Human Capital Working Paper 16131, World Bank, Washington D.C.
New York Times (2005). “Brazil Weighs Costs and Benefits of Alliance With China,” September 20.
Schultz, T. Paul (2004). “School Subsidies for the Poor: Evaluating the Mexican Progresa Poverty Program,” Journal of Development Economics, 74(1), 199-250.
Summers, L. (1994). Investing in all the People: Educating Women in Developing Countries, The World Bank, Washington D.C.
Taylor, M. (2005). “Electrifying Rural Guatemala: Central Policy and Rural Reality,” Environment and Planning C: Government and Policy, 23, 173-189.
Thomas, D. et al. (2003) “Iron Deficiency and the Well-Being of Older Adults: Early Results from a Randomized Nutrition Intervention,” mimeo, UCLA.
The World Bank (2005a). Ten Things You Never Knew About the World Bank, http://www.worldbank.org/tenthings/, accessed November 14, 2005.
The World Bank (2005b) Infrastructure, http://www.worldbank.org/infrastructure/, accessed November 11, 2005.
The World Bank (2005c). Brazil: Background Study for a National Rural Electrification Strategy: Aiming for Universal Access, The World Bank, Washington, D.C.
The World Bank (2005d). World Development Indicators, Washington, D.C. The World Bank (2005e). Energy Working Notes: May 2005, Energy and Mining Sector Board,
The World Bank, Washington D.C. The World Bank (2004). World Development Report 2004: Making Services Work for Poor
People, World Bank, Washington D.C. The World Bank (2003). Infrastructure Action Plan. The World Bank, Washington D.C., July The World Bank (1994) World Development Report 1994: Infrastructure. The World Bank, D.C.
28
The World Bank (1993) World Development Report 1993: Investing in Health. The World Bank, D.C.
29
Data Appendix:
There is no comprehensive source for electricity data over our period of analysis across
Brazil. Most of the network was privatized in the 1990s, and the overseeing agencies, Operador
Nacional do Sistema Eletrico (ONS) and Agencia Nacional de Energia Eletrica (ANEEL) were
formed in the early 1990s and have little institutional memory for the period prior to their
charters. In order to assemble our data set, we traveled to Brazil and met with professionals in
the field at major electricity companies and government agencies in Brasilia, Sao Paulo, Rio de
Janeiro, Curitiba, Salvador, and Foz do Iguaçu (Itaipu). The meetings with local professionals
were informative not only in terms of data with which they provided, but also for the broader
understanding of the development of the electricity network in Brazil.
We collected data from the Ministério de Minas e Energia and ANEEL in Brasilia,
Eletrobras, ONS, the Memoria de Eletricidade, and Furnas in Rio de Janeiro, Compania
Hidroeletrica de Sao Francisco (CHESF) in Salvador, Copel in Curitiba, and Itaipu Binacional in
Foz do Iguaçu. Data on the location and year of creation of plants was assembled from a
database of important power plants from Sistema de Informações Georreferenciadas do Setor
Elétrico (SIGEL) and a historical study of hydroelectricity in Brazil by the Memoria de
Eletricidade.
Data on the state of the network in each decade was assembled from a combination of
sources. Data from the 1960s was procured primarily through feasibility studies conducted by
the Canambra Engineering Consultants who did a comprehensive survey of electricity in Brazil
in the 1960s, focusing on the South and South-Central Brazil. Inventories of the network as of
the publication dates in 1965 and 1967 were included as part of the surveys, and maps were also
included to show the placement of the network. CHESF also provided limited information about
the state of the network in North Eastern Brazil from 1960 through the present.
The 1970s data was put together primarily through maps which were drawn by Furnas
and Eletrobras in 1973. Data from the 1980s is from a comprehensive inventory done by SIESE
in 1987. The survey includes detailed data on both transmission lines and generation plants.
Data from the 1990s is from a listing by SIGEL which is a survey of the current electricity
network done by ANEEL. Data from 2000 is from both SIGEL and SINDAT, the database of
the current electricity network done by ONS.
We had the data from the inventories in each period entered into Excel spreadsheets by
firms in India and Bangladesh. Data on line voltage was used to ensure comparability of
30
inventories conducted by different sources—only lines of at least 69 kilovolts were included as
transmission lines—13 kilo volt lines were considered part of the distribution networks. Data
from the tables were transferred into GIS maps which we then compared against maps which
were drawn of the electricity network in each decade in order to insure the accuracy of our final
decade-by-decade networks.
31
Appendix: Modeling Electricity Networks in Brazil: An Engineering Cost Approach
The model begins by selecting the locations of generation plants based on probabilities estimated in a probit equation based on geographic factors. An indicator for whether or not a hydropower plant occurs within a grid circle is regressed on an Amazon indicator which accounts for the lower probability of building power plants in the Amazon due to high materials costs, the average and maximum slope of the river within the grid circle, the log of the maximum flow accumulation (the number of raster grids which flow into each raster), and a water indicator which takes the value of one if the circle has a river or stream in it, zero otherwise.
The initial placement of transmission substations is randomly selected from the remaining grid points across Brazil. Two substations are allocated to each power plant, and a single line is assumed to pass through the three points.
Slope=.15
Slope=.35
Slope=.61
Slope=.48
Slope=.20 Slope=.54
Slope=.29
The cost of the transmission lines is calculated based on distance and an Amazon indicator. (High materials costs are assumed for transmission stations within the Amazon as transport costs are high). Costs are calculated by dividing each transmission line into 100 equally spaced points.
32
Points occurring within a perimeter of a grid point are assigned the average slope which has been calculated for the region surrounding the closest grid point.
The program randomly chooses alternative points and calculates the cost of the transmission lines through those points. If the line cost for the alternative points is lower, the alternative points are retained.
The process is repeated until a lowest cost allocation is achieved for each decade (30,000 iterations).
Distribution networks surrounding each transmission substation and power plant are generated. All points within a 50 km radius are allocated electricity.
33
The process is repeated for subsequent decades. In each decade, power plants are allocated to the highest probability points which have not yet received electricity. Initial locations for transmission substations are randomly chosen from the points across Brazil which have not yet received substations or power plants.
Alternative points are proposed for the new substations. Substations and plants from previous decades are not altered.
The program is again iterated until the lowest cost equilibrium is reached. Distribution networks within a 50 kilometer radius of the chosen points are again assigned. Grid points assigned to receive electricity in a given decade are given a value of 1, while those which are not projected to receive electricity in that decade are given a value of 0. The vector of projected indicator values for each grid point is then used as our instrument for electricity provision in the instrumental variable regressions.
34
Figures:
Figure 1: Increase in kilometers of transmission lines from 1950 through 2000 Source: SIESE
Figure 2: Inventory from Canambra report 1969 Figure 3: Inventory from SIESE report 1987 Source: Canambra (1969) Source: SIESE (1987)
35
Figure 4: South Brazil Transmission as of 1967 Figure 5: South Brazil Transmission as of 1979 Source: Canambra (1967) Source: PAREE (1979)
Figure 6: South Brazil Electricity Grid as of 1960
36
Figure 7: 1960s Transmission with Distribution Figure 8: 1970s Transmission with Distribution
Figure 9: 1980s Transmission with Distribution Figure 10: 1990s Transmission with Distribution
37
Figure 11: 2000 Transmission with Distribution
38
Figure 12: Construction of the river gradient instrument Figure 13: Predicted Locations of Hydropower plants, actual plants, rivers and
elevation
39
Figure 14: 1960’s modeled power allocation Figure 15: 1970’s modeled power allocation
Figure 16: 1980’s modeled power allocation Figure 17: 1990’s modeled power allocation
40
Figure 18: 2000 modeled power allocation Figure 19: Low level of panel variation in model
Figure 20: 1960 county proxies
41
Table 1: Probit Forecast of Placement of Generation Plants (step 1 of the engineering model that creates the instrument)
Dependent variable: Indicator for location has a hydropower plant
Log of Maximum Flow Accumulation
0.029** (0.014)
Average Slope in the river 0.044 (0.030)
Maximum Slope in the river 0.062*** (0.012)
Amazon Indicator ‐0.753*** (0.066)
Indicator for location has a river in it
‐0.030 (0.063)
Observations 33342 R‐squared . Standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1
42
Table 2: Summary Statistics
GDP per capita (thousands of 2000 R$ per person) Obs Mean Std. Dev. Min Max
1970 35257 1.383992 1.192956 0.045881 47.90996 1980 32528 3.416521 7.051991 0.060664 455.9149 1990 32528 2.90737 3.221777 0.088616 91.90031 2000 32528 3.671918 4.05901 0.461927 184.9774
Industrial GDP pc (thousands of 2000 R$ per person) Obs Mean Std. Dev. Min Max
1970 32527 0.206988 0.442533 0 27.54974 1980 32528 1.055195 2.576341 0.000513 211.3002 1990 32528 0.603734 2.001518 7.78E‐06 89.60116 2000 32528 0.8449 2.735928 0.004097 112.1848
Population Density (people per square kilometer) Obs Mean Std. Dev. Min Max
1970 32527 11.0904 99.17917 0.015955 8893.941 1980 32528 14.28085 139.1132 0.042236 11729.97 1990 32528 17.15497 152.0696 0.089627 12199.77 2000 32528 20.3986 181.7498 0.131608 12915.98
43
Table 3: First Stage for Instrumental Variables Regressions Dependent Variable is Indicator for whether a grid point actually has electricity in the data
Pooled Cross‐sectional Random Effects Fixed Effects
Modeled Electricity Indicator
0.432*** 0.082*** 0.080*** 0.355*** 0.118*** 0.198*** (0.018) (0.014) (0.013) (0.024) (0.016) (0.025)
Land Slope 0.021*** 0.004*** 0.002* 0.027*** 0.003* (0.002) (0.001) (0.001) (0.003) (0.002)
Region Fixed Effects? N Y N N Y . State Fixed Effects? N N Y N N .
Fixed Effects? N N N N N Y Observations 130111 130023 130023 130108 130020 130820 Number of Locations for Fixed Effects 32528 32506 32705 R‐squared 0.257 0.475 0.543 . . 0.164 Robust standard errors clustered by 1960 counties in parentheses *** p<0.01, ** p<0.05, * p<0.1 Notes: Observations are by 16km spaced grid points. Standard Errors are clustered by county.
44
Table 4: Effect of Electricity on Population Density OLS Instrumental Variables
Lagged Electricity 47.023*** 36.511*** 20.956*** 67.663*** 72.703*** 98.326***(4.061) (3.710) (2.340) (8.164) (28.211) (37.689)
Land Slope 1.657 0.707 ‐0.401 0.548 0.472 ‐0.700 (1.092) (1.074) (1.019) (1.088) (1.089) (1.049)
Region Fixed Effects? N Y N N Y N State Fixed Effects? N N Y N N Y Observations 130111 130023 130023 130111 130023 130023 R‐squared 0.020 0.025 0.048 0.017 0.019 0.025 Robust standard errors clustered by 1960 counties in parentheses *** p<0.01, ** p<0.05, * p<0.1 Notes: Observations are by 16km spaced grid points. Errors are clustered by county.
45
Table 5: Effect of Electricity on GDP per Capita
OLS Instrumental Variables
Lagged Electricity 0.667*** 0.451*** 0.165 2.084*** 0.455 ‐0.601 (0.170) (0.122) (0.105) (0.443) (1.450) (1.482)
Land Slope 0.043** 0.003 ‐0.028** ‐0.033 0.003 ‐0.025*
(0.017) (0.019) (0.014) (0.024) (0.021) (0.015) Region Fixed Effects? N Y N N Y N State Fixed Effects? N N Y N N Y Observations 130111 130023 130023 130111 130023 130023 R‐squared 0.043 0.095 0.118 0.027 0.095 0.116 *** p<0.01, ** p<0.05, * p<0.1 Robust standard errors clustered by 1960 counties in parentheses Notes: Observations are by 16km spaced grid points. Errors are clustered by county.
46
Table 6: Effect of Electricity on Industrial GDP per Capita
OLS Instrumental Variables
Lagged Electricity 0.318** 0.383*** 0.253*** 0.463 0.158 0.287 (0.138) (0.051) (0.052) (0.302) (0.544) (0.567)
Land Slope 0.016 0.016 0.000 0.008 0.018 ‐0.000 (0.011) (0.014) (0.011) (0.015) (0.015) (0.011)
Region Fixed Effects? N Y N N Y N State Fixed Effects? N N Y N N Y Observations 130111 130023 130023 130111 130023 130023 R‐squared 0.025 0.041 0.071 0.025 0.040 0.071 *** p<0.01, ** p<0.05, * p<0.1 Robust standard errors clustered by 1960 counties in parentheses
Notes: Observations are by 16km spaced grid points. Errors are clustered by county.
47
48
Table 7: Random Effects Instrumental Variable Regressions with Bootstrapped Errors
GDP per Capita Industrial GDP per
Capita Population Density
Lagged Electricity Provision 2.096*** 0.790** 0.458*** 0.229 30.088*** 3.865 (0.124) (0.313) (0.054) (0.209) (4.428) (6.287)
Land Slope ‐0.034** 0.001 0.008 0.017** 2.567** 0.920 (0.014) (0.012) (0.007) (0.007) (1.085) (1.101)
Region Fixed Effects? N Y N Y N Y Observations 130111 130023 130111 130023 130111 130023 Number of fid 32528 32506 32528 32506 32528 32506 R‐squared . . . . . . *** p<0.01, ** p<0.05, * p<0.1 Standard errors in parentheses Notes: Observations are by 16 km spaced grid points by decade. Errors are bootstrapped with 50 repetitions. Lagged predicted electricity has been used as the instrument.
49
Table 8: Fixed Effects Regressions OLS Panel Regressions Instrumental Variable Regressions
GDP per Capita
Industrial GDP pc
Population Density
GDP per Capita
Industrial GDP pc
Population Density
Lagged Electricity ‐0.056 0.051 4.856*** 2.453*** 0.382 ‐3.572 (0.129) (0.074) (1.396) (0.917) (0.473) (11.348)
Fixed Effects? Y Y Y Y Y Y Observations 130108 130108 130108 130820 130820 130820 Number of fid 32527 32527 32527 32705 32705 32705 R‐squared 0.063 0.040 0.008 0.044 0.038 0.006 Robust standard errors clustered by 1960 counties in parentheses *** p<0.01, ** p<0.05, * p<0.1
Notes: Observations are by 16 km spaced grid points across Brazil by decade. Errors are clustered by county as of 1960. Where a grid point is overlapped by more than one county circle, the assignment choice of clusters between adjacent counties is random.
50
51
Table 9: Fixed Effects Regressions for 1980s‐2000
GDP pc Industrial GDP pc
Population Density
Lagged Electricity Access Indicator
8.312*** 4.276*** 10.487 (2.622) (1.653) (26.090)
Observations 98115 98115 98115 R‐squared ‐0.090 ‐0.106 0.005 Number of fid 32705 32705 32705 *** p<0.01, ** p<0.05, * p<0.1 Robust standard errors clustered by 1960 counties in parentheses