14
PLEASE SCROLL DOWN FOR ARTICLE This article was downloaded by: [Filippelli, Gabriel] On: 13 September 2010 Access details: Access Details: [subscription number 926849130] Publisher Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered office: Mortimer House, 37- 41 Mortimer Street, London W1T 3JH, UK International Journal of Remote Sensing Publication details, including instructions for authors and subscription information: http://www.informaworld.com/smpp/title~content=t713722504 Estimation of heavy-metal contamination in soil using reflectance spectroscopy and partial least-squares regression Caaminee M. Pandit a ; Gabriel M. Filippelli a ; Lin Li a a Department of Earth Sciences, Indiana University Purdue University Indianapolis (IUPUI), Indianapolis, IN, USA Online publication date: 13 September 2010 To cite this Article Pandit, Caaminee M. , Filippelli, Gabriel M. and Li, Lin(2010) 'Estimation of heavy-metal contamination in soil using reflectance spectroscopy and partial least-squares regression', International Journal of Remote Sensing, 31: 15, 4111 — 4123 To link to this Article: DOI: 10.1080/01431160903229200 URL: http://dx.doi.org/10.1080/01431160903229200 Full terms and conditions of use: http://www.informaworld.com/terms-and-conditions-of-access.pdf This article may be used for research, teaching and private study purposes. Any substantial or systematic reproduction, re-distribution, re-selling, loan or sub-licensing, systematic supply or distribution in any form to anyone is expressly forbidden. The publisher does not give any warranty express or implied or make any representation that the contents will be complete or accurate or up to date. The accuracy of any instructions, formulae and drug doses should be independently verified with primary sources. The publisher shall not be liable for any loss, actions, claims, proceedings, demand or costs or damages whatsoever or howsoever caused arising directly or indirectly in connection with or arising out of the use of this material.

Estimation of heavy-metal contamination in soil using reflectance spectroscopy and partial least-squares regression

  • Upload
    iupui

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

PLEASE SCROLL DOWN FOR ARTICLE

This article was downloaded by: [Filippelli, Gabriel]On: 13 September 2010Access details: Access Details: [subscription number 926849130]Publisher Taylor & FrancisInforma Ltd Registered in England and Wales Registered Number: 1072954 Registered office: Mortimer House, 37-41 Mortimer Street, London W1T 3JH, UK

International Journal of Remote SensingPublication details, including instructions for authors and subscription information:http://www.informaworld.com/smpp/title~content=t713722504

Estimation of heavy-metal contamination in soil using reflectancespectroscopy and partial least-squares regressionCaaminee M. Pandita; Gabriel M. Filippellia; Lin Lia

a Department of Earth Sciences, Indiana University Purdue University Indianapolis (IUPUI),Indianapolis, IN, USA

Online publication date: 13 September 2010

To cite this Article Pandit, Caaminee M. , Filippelli, Gabriel M. and Li, Lin(2010) 'Estimation of heavy-metalcontamination in soil using reflectance spectroscopy and partial least-squares regression', International Journal ofRemote Sensing, 31: 15, 4111 — 4123To link to this Article: DOI: 10.1080/01431160903229200URL: http://dx.doi.org/10.1080/01431160903229200

Full terms and conditions of use: http://www.informaworld.com/terms-and-conditions-of-access.pdf

This article may be used for research, teaching and private study purposes. Any substantial orsystematic reproduction, re-distribution, re-selling, loan or sub-licensing, systematic supply ordistribution in any form to anyone is expressly forbidden.

The publisher does not give any warranty express or implied or make any representation that the contentswill be complete or accurate or up to date. The accuracy of any instructions, formulae and drug dosesshould be independently verified with primary sources. The publisher shall not be liable for any loss,actions, claims, proceedings, demand or costs or damages whatsoever or howsoever caused arising directlyor indirectly in connection with or arising out of the use of this material.

Estimation of heavy-metal contamination in soil using reflectancespectroscopy and partial least-squares regression

CAAMINEE M. PANDIT, GABRIEL M. FILIPPELLI and LIN LI*

Department of Earth Sciences, Indiana University Purdue University Indianapolis

(IUPUI), 723 W. Michigan St., Indianapolis, IN 46202, USA

(Received 30 January 2008; in final form 7 November 2008)

Lead (Pb) poisoning from anthropogenic sources continues to threaten the health

of urban children. Mapping Pb distribution on a large scale is imperative to

identify hotspots and reduce Pb poisoning. To assess the feasibility of using

reflectance spectroscopy to map soil Pb and other heavy metal abundance, the

relationship between surface soil metal concentrations and hyperspectral reflec-

tance measurements was examined via partial least-squares regression (PLSR)

modelling. Soil samples were taken from four study sites. Metal concentrations

were determined by inductively coupled plasma-atomic-emission spectrometry

(ICP-AES) analysis, and reflectance was measured with an ASD (Analytical

Spectral Devices) field spectrometer covering the spectral region of 350–2500 nm.

Pb displayed an exponential decrease as a function of distance from the roadway,

demonstrating the depositional patterns from leaded gas combustion which remain

on the landscape 20 years after the phase-out of leaded gasoline. Calibration

samples were used to derive the PLSR algorithm, and validation samples assessed

the model’s predictive ability. The correlation coefficients between the lab-

determined abundance and the abundance predicted from PLSR calibration for

all metals except copper were at or above 0.970, with the correlation coefficient for

Pb the highest of all metals (0.992). Manganese, zinc and Pb had significant

coefficients of determination (0.808, 0.760 and 0.746, respectively) for the valida-

tion samples. These results suggest that Pb and other heavy metal concentrations

can be retrieved from spectral reflectance at high accuracy. Reflectance spectro-

scopy thus has potential to map the spatial distribution of Pb abundance with the

aim of improving children’s health in an urban environment.

1. Introduction

Lead (Pb) poisoning from anthropogenic sources remains a common, but prevent-

able, disease. Though the incidence of Pb poisoning in children (defined as a blood Pb

level . 10 mg dl-1) has decreased by 80% (to ,2.2%) since the late 1970s, childhood Pb

poisoning remains a public health menace in urban areas (Brody et al. 1994). Children

of low-income and minority families located in the inner-city endure high rates of

Pb poisoning; over 15% of children in many cities have blood levels of or exceedingthe maximum ‘safe’ level of 10 mg dl-1 (Rabito et al. 2003). Pb – a neurotoxin – can

permanently impair the brain’s functioning, inhibiting a child’s mental and beha-

vioural development. High doses can be fatal (Mielke 1999). Recent studies show that

*Corresponding author. Email: [email protected]

International Journal of Remote SensingISSN 0143-1161 print/ISSN 1366-5901 online # 2010 Taylor & Francis

http://www.tandf.co.uk/journalsDOI: 10.1080/01431160903229200

International Journal of Remote Sensing

Vol. 31, No. 15, 10 August 2010, 4111–4123

Downloaded By: [Filippelli, Gabriel] At: 15:21 13 September 2010

many urban children have high blood Pb levels due to exposure of soil-borne Pb

(figure 1). Small children are more susceptible to increased levels of Pb intake due to

frequent hand-to-mouth behaviour (Mielke 1999) after playing in an environment

with hazards such as elevated levels of Pb in house dust or yard soils (Bellinger 2004,

Choi et al. 2005).

Pb is found in inner-city soils primarily due to decomposed Pb-based paints andthe deposition of Pb from leaded gasoline and from decomposing Pb-based paint on

homes built before 1950. Gasoline-sourced Pb from combustion products was initially

deposited within ,50 m of a roadway if no obstructions were present (Filippelli et al.

2005). Leaded gasoline was used until 1986; an estimated 4 to 5 million tons of Pb has

been released into the atmosphere and has subsequently accumulated in the environ-

ment (Laidlaw et al. 2005). The amount of Pb deposited in soil by leaded gasoline is

proportional to the traffic flow of nearby roadways. Due to its insoluble nature, when

5 km

39o56′N86o21′W 39o56′N85o55′W

39

>200 ppm lead125–200 ppm lead75–125 ppm lead

o37′N85o55′W 39o37′N86o21′W

Figure 1. Concentration of diffuse soil lead in surface soils of Indianapolis (in colouredregions) showing a characteristic pattern of urban enrichment trending towards backgroundvalues in suburban and agricultural regions. The overprint of high diffuse soil lead presentedhere corresponds roughly with the distribution of elevated blood lead levels in children.Reprinted with permission from Filippelli et al. (2005).

4112 C. M. Pandit et al.

Downloaded By: [Filippelli, Gabriel] At: 15:21 13 September 2010

the particulate Pb is deposited on to a grassy area near the road, it is retained in

surface soil particulates through the surface complexation of Pb and Pb oxides with

soil organic matter (Filippelli et al. 2005). Smelters also contribute to high Pb

concentrations in urban areas.

Conventional wet-chemistry techniques used to determine Pb concentrations neces-sitate tedious processes, including preparation, loss on ignition, strong acid digestion

and inductively coupled plasma-atomic-emission spectrometry (ICP-AES). Soil

sampling and preparation entails collecting surface soils at least 5 cm deep and sieving

and drying the samples. After soil sampling and preparation, ashing the samples,

strong acid digestion and ICP-AES, about a week has passed. Not only are these

processes expensive (cost of instruments and manpower), but they are inefficient and

ineffective as well. The Pb concentration determined from sampling one spot is

not representative of the entire area; for instance, observe the Pb concentrations ofa roadway transect. Substantially higher Pb concentrations are found in a sample

taken 5 m away from a heavily trafficked road as opposed to 10 m. Developing a much

quicker and efficacious method for determining Pb concentrations is paramount.

Reflectance spectroscopy is promising in its ability to determine heavy metal con-

centrations through airborne or spaceborne remote sensors. These remote sensors are

operated from airplanes and satellites to acquire data on how various earth surfaces

emit and reflect electromagnetic energy (Lillesand et al. 2004). Hyperspectral remote

sensing is employed to measure more detailed spectral characteristics of objects innature than traditional multispectral sensors do, thus finding more applications in

monitoring and assessing environmental conditions. With the aid of field spectroscopy

for in-flight calibration (Milton et al. 1995), hyperspectral remote sensing has been

used to identify minerals and vegetation from airborne or spaceborne sensor systems

(Peters et al. 2001, 2003). Such capabilities are indicative of reflectance spectroscopy’s

ability to map heavy metal concentrations for health-related reasons.

Based on this background research, we hypothesize that the samples with the

highest concentrations (those closest to the roadway, according to the Pb roadwaytrend) would have the lowest reflectance, and that reflectance would increase propor-

tionally as the Pb concentrations decreased. Previous research on large-scale mining

contamination suggests that reflectance is inversely proportional to metal concentra-

tions (Peters et al. 2001, 2003). Urban Pb poisoning in children continues to remain a

threat, not only in Indianapolis but worldwide; through an exploration of reflectance

spectroscopy and partial least-squares regression (PLSR) modelling, this research lays

the foundation for utilizing airplane or satellite-based remote sensors to quickly map

Pb distributions over large regions.

2. Materials and methods

2.1 Study sites

The soil samples analysed in this research were taken along a transect in MilitaryPark, Indianapolis, IN, USA. This sampling array was chosen to characterize soils

that had varying levels of Pb as defined by an Indianapolis-wide soil Pb study in 2005

(Filippelli et al. 2005). Military Park is located in the urban area of downtown

Indianapolis, adjacent to the heavily travelled New York Street. The park has

remained completely undeveloped and relatively undisturbed over the past century,

making it an ideal and stable site to assess urban atmospheric deposition of contami-

nants: Military Park’s only exposure to Pb is through roadways (Filippelli et al. 2005).

Estimation of heavy-metal contamination in soil 4113

Downloaded By: [Filippelli, Gabriel] At: 15:21 13 September 2010

2.2 Field sampling

The soil samples used in this study were surface soils (top 5 cm). Sample location was

documented as both distance from the roadway and GPS coordinates. Sampling

began ,1 m from the roadway and concluded ,50 m.

2.3 Sample processing

The soil samples were dried at 105�C for 2 days and then sieved to 150 mm. Dried and

sieved samples were ashed in a muffle furnace at 550�C and digested with trace metal

grade 2 N HCl.

2.4 Sample geochemistry

Samples were analysed for Pb – as well as other heavy metals commonly found in urban

soils: zinc (Zn), copper (Cu), cadmium (Cd) and manganese (Mn) – concentrations viaICP-AES. Replicates and a standard were used. The percentage organic matter and the

abundance of the heavy metals were calculated for each soil sample to determine which

soil constituent had the greatest effect on the reflectance measurements.

2.5 Reflectance spectroscopy

The remaining dried samples were sieved to 500 mm and taken outside on a dry, sunny

and cloudless day. Spectral data were collected using an ASD field spectrometer

UV/VNIR (Analytical Spectral Devices Inc., Boulder, CO, USA) spectrometer with

an 8� lens. Optimization and white reference of the spectrometer preceded data

collection. The instrument acquired a continuous spectrum ranging from 350 to2500 nm simultaneously. Because reflectance spectra were measured outdoors, the

spectra are very noisy at around 1400, 1900 nm and the region beyond 2350 nm due to

atmospheric water. These noisy spectral bands were discarded to avoid the adverse

effect of the noise on the development of a spectral-compositional model.

2.6 Partial least-squares regression modelling

Partial least-squares regression (PLSR) is a standard multivariate statistical technique

that was developed by Herman Wold in 1966 (Wold 1966). The underlying assumption

behind a PLSR model is that linear combinations of observed explanatory variables,

which are called components, factors, or latent variables, drive a system or process.PLSR is used to find eigenvectors of the explanatory variables that will produce score

values. These score values must both summarize the variance of the explanatory

variables well and highly correlate with the response variables. A simplified PLSR

model consists of two outer relations and an inner relation. The two outer relations are

a result of the eigenstructure decomposition of the X and Y matrices, respectively. The

inner relation links the score for the X and Y matrices via a regression.

X is an n � m matrix for the explanatory variables measured for n samples at m

variables, and Y is an n � p matrix for the response variables measured for n samplesat p variables. Decomposing X into a matrix T (the score matrix) and a matrix PT (the

loadings matrix) plus an error matrix E derives the outer relation for the X matrix:

X ¼ TPT þ E: (1)

In this way, Y is decomposed into the score matrix U and the loading matrix QT plus

the error matrix F to create the outer relation for the Y matrix:

4114 C. M. Pandit et al.

Downloaded By: [Filippelli, Gabriel] At: 15:21 13 September 2010

Y ¼ UQT þ F: (2)

T and P are sized n� k and m� k, respectively; U and Q are sized n� l and p� l; k and

l are the number of principal components for reconstructing X and Y; PT and QT

represent the transpose of P and Q. Both E and F are residual matrices. E, the n � m

residual matrix for the matrix X, results from using k principal components to

summarize variance. F is the n � p residual matrix resulting from the subtraction ofl principal components from the Y matrix.

The purpose of PLSR is to minimize the norm of F while keeping the correlation

between X and Y by the inner relation. B is an n� n matrix for the regression coefficient:

U ¼ BT: (3)

By deriving the orthogonal score matrix (T), PLSR remains unaffected by harmful

colinearity in explanatory variables. The decomposition or reduction in the dimen-

sionality of the explanatory variables allows the regression model to be composed of a

small number of samples. PLSR has been applied to visible, near-infrared reflectanceand mid-infrared reflectance to predict mineral abundance and biological activity in

soil (Chang et al. 2005, Li 2006).

A MATLAB program was implemented to conduct PLSR modelling on the

samples. The spectral bands served as the explanatory variables, and six soil consti-

tuents (the 5 heavy metals and organic matter) were the response variables for PLSR

modelling. X is an n � s matrix, and Y is a 6 � n matrix, with n as the number

of samples and s as the number of spectral bands. Bands that are sensitive only to

water content in the atmosphere were automatically determined by the program andleft out during the regression, as they have no bearing on metal abundance

predictions.

To create a PLSR model with high predictive power, an optimal number of PLSR

components were selected for use in the model. Cross calibration of the PLSR model

using an increasing number of components was utilized to determine the ideal number

of components. One sample at a time was left out for the cross calibration of the

response variables. For the n sets of reflectance data, a PLSR model was built using

n – 1 spectra along with the corresponding abundance. The abundance of each metaland organic matter for the sample that was left out was predicted. This process was

repeated n times until each sample had been left out once. The abundance predictions

for the n soil samples were compared with the actual concentrations of the samples.

The resulting prediction error sum of squares (PRESS) was calculated:

PRESS ¼Xn

i¼1

yðiÞ � yi

� �2

; (4)

where y(i) is the estimate of a constituent value yi for the left-out sample i.

The root mean square error of cross-validation (RMSECV) was derived from the

PRESS. RMSECV indicates the accuracy of the model’s predictive power for new

samples. k is the number of components used in the PLSR model, and so PRESSk

is the prediction error sum of squares when k PLS components are used in cross-

validation:

RMSECV ¼ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiðPRESSkÞ

n

r: (5)

Estimation of heavy-metal contamination in soil 4115

Downloaded By: [Filippelli, Gabriel] At: 15:21 13 September 2010

The RMSECV was calculated as each new component was added to the PLSR model.

The model with the optimal number of PLSR components was the model with the lowest

RMSECV. RMSECV differs from the root mean square of calculation (RMSEC), as the

RMSEC indicates the accuracy of the model’s predictive power for the samples used to

create the model. The RMSEC also indicates the minimum error tolerated in predictionfor new samples based on the optimal number of PLSR components for modelling.

RMSEC ¼

ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiPni¼1

yi � yið Þ2

n

vuuut: (6)

3. Results and discussion

3.1 Geochemical analysis

The geochemical characteristics determined from the Military Park transect were

generally predictable, with higher values of vehicular-sourced metals near the road-

way, and a decrease with distance (table 1; figure 2). The curve in figure 3 is typical of a

Military Park Geochemical Transect

Distance from road edge (m)

0 10 20 30 40 50 60

Con

cent

ratio

n (p

pm)

0

200

400

600

800

1000

1200

1400

Pb (ppm)Zn (ppm)Cu (ppm)Cd (ppm)Mn (ppm)Organic matter (%)

Figure 2. Metal concentrations as a function of distance from roadway. Mn displays anincrease away from the roadway, possibly caused by leaching of Mn from soils due to roadwaysalt application and subsequent ion exchange. Zn and Pb exhibit a decrease as a function ofdistance from roadway, reflecting the vehicle sources for Zn (tire rubber) and Pb (leadedgasoline). Note the general decrease for Pb, indicating that the roadway source signature isstill preserved in surface soils, even though leaded gasoline was banned over 30 years ago.

4116 C. M. Pandit et al.

Downloaded By: [Filippelli, Gabriel] At: 15:21 13 September 2010

Pb roadway transect, illustrating the rapid deposition of Pb in exhaust particulates

from the combustion of leaded gasoline and the persistence of this Pb in surface soils(Filippelli et al. 2005). The only possible Pb source in Military Park was leaded

gasoline, which was phased out over 20 years ago. This trend demonstrates the effect

of long-term Pb loading from leaded gasoline and the immobility of this Pb in soil.

Soil Pb exhibited a weak relationship with percentage organic matter, yielding a

correlation coefficient (R) of 0.473. This low correlation evinces that although organic

matter is a factor in Pb retention, it does not control the deposition of Pb because little

organic matter is needed to retain Pb. Consequently, an increase in organic matter

would not retain any more Pb. For the other sites, Pb exhibited higher relationshipswith organic-matter content. While organic-matter content has little influence on the

deposition of Pb concentrations, it does affect other metals. For example, Cd and Zn

seem to exhibit some signs of mobility, with organic-matter content variations playing

a larger role in controlling their depositional concentrations than for Pb.

Pb, Zn and Cu have a similar roadway source – with Pb from leaded gasoline, Zn

from tyre dust and Cu from break pads – which may account for the high correlation

between Pb and Zn for these samples (R¼ 0.911). However, Mn and Cd do not share

this similar roadway source. As a function of distance, Cd and Cu vary little inconcentration. Magnesium exhibits no discernible pattern in respect to the roadway

trend, organic matter, the other metals or even alone.

3.2 Reflectance spectroscopy

A promising relationship between the Military Park Pb concentrations and reflectance

spectra (figure 3) was determined for the samples, with the exception of sample one. It

was hypothesized that the samples with the highest concentrations would have thelowest reflectance, and that the reflectance would increase proportionally as the Pb

concentrations decreased. Indeed, the spectral signatures of the soil samples changed

significantly as a function of Pb concentration. The most distinguishing change among

the samples was an overall decrease in albedo as the Pb concentrations increased.

Sample one did not fit this expected pattern, possibly due to its very close location to

the road and therefore its composition of shiny metallic particulates – as opposed to

elemental metals bound to particulates – which would affect the hyperspectral readings.

Without sample one, the linear relationship between Pb and reflectance is significant,

Table 1. Distance from the roadway, organic-matter content and metal concentrations foreach sample.

Sample Distance (m)Organic-matter

content (%) Pb (ppm) Zn (ppm) Cu (ppm) Cd (ppm) Mn (ppm)

1 1.7 15.87 412 339 41 7.0 7092 6 20.61 358 364 57 9.5 10193 10 20.97 279 309 38 9.5 10844 15 15.11 245 243 27 7.7 12195 20 16.97 204 225 20 6.8 9926 25 18.00 255 286 31 8.3 11937 40 12.00 187 197 19 6.5 10388 50 12.07 222 214 36 6.8 1286

Estimation of heavy-metal contamination in soil 4117

Downloaded By: [Filippelli, Gabriel] At: 15:21 13 September 2010

with a coefficient of determination (R2) of 0.691 and 0.683 at 800 and 1300 nm,respectively. Virtually all points are within a 95% level of confidence.

The dip in the scatter plots in figure 4 indicates that there may be an underlying effect

influencing the reflection of the samples, such as the nonlinear effect caused by spectral

overlapping. There is a possibility that the correlation may not only reveal the relation-

ship between reflectance and Pb, but also include effects of organic matter or other soil

constituents on the reflectance. Even if this is the case, Pb abundance may still be

determined from reflectance. For instance, Zn adopts the same roadway curve as Pb

due to their similar roadway source. For that reason, the same reflectance pattern isexpected for Pb as for Zn. High levels of Zn in an area (deposited from tyre dust) are

indicative of high Pb levels (from leaded gasoline) in that area. The high correlation

between organic matter and reflectance can be accounted for as well. While organic-

matter content has little influence over Pb retention, it does have an effect on the

depositional concentrations of Zn. Based on organic-matter content, one can determine

Zn concentrations and therefore determine Pb concentrations through a comprehensive

use of chemometrics. However, the overall trend indicates that metal – specifically Pb –

concentrations can be determined from remote sensing data for human healthpurposes.

3.3 Partial least-squares regression

Due to the aforementioned interference effects of organic matter, the selection of the

most sensitive spectral bands using corrgram plots did not result in highly accurate

prediction. Organic matter masks spectral signatures, making it difficult to identify

metal quantities at these wavelengths. In contrast, PLSR performs the regression

using full-spectrum data, and improves the estimates of Pb and other heavy metals

Military Park

Wavelength (nm)500 1000 1500 2000 2500

Ref

lect

ance

0.00

0.05

0.10

0.15

0.20

0.25Sample 1Sample 2Sample 3Sample 4Sample 5Sample 6Sample 7Sample 8

Figure 3. Reflectance of soil samples, from a transect of near-road to far-road samples(sample 1 to sample 8). The reflectance was measured outdoor and the noise at 1400 and1900 nm is due to atmospheric water.

4118 C. M. Pandit et al.

Downloaded By: [Filippelli, Gabriel] At: 15:21 13 September 2010

from hyperspectral data. Table 2 shows the correlation coefficients of the five heavy

metals and organic matter as derived from PLSR modelling for the eight Military

Park samples compared with those generated from a corrgram. This modelling

resulted in improved predictions for all metals and organic matter. The model withthe highest predictive ability was that of Pb, with R2 ¼ 0.951 (figure 5).

Because the results from the Military Park PLSR were very promising, 24 samples

were taken from the three additional study sites with the goals of improving PLSR

prediction and calibrating and validating a PLSR algorithm. With this expanded pool

0.06 0.08 0.10 0.12 0.14 0.16

Pb

(ppm

)

100

150

200

250

300

350

400

0.08 0.10 0.12 0.14 0.16 0.18 0.20 0.22

Pb

(ppm

)

100

150

200

250

300

350

400

Reflectance at 800 nm

Reflectance at 1300 nm

(a)

(b)

R2 = 0.691

R2 = 0.683

Figure 4. Linear model for Military Park Pb concentrations as a function of reflectance attwo representative wavelengths 800 nm (a) and 1300nm (b) from figure 3.

Estimation of heavy-metal contamination in soil 4119

Downloaded By: [Filippelli, Gabriel] At: 15:21 13 September 2010

of samples, a calibration PLSR model was created and tested. The samples were split

into two groups, one as the training set for deriving the PLSR algorithm and the other

as the validation set for assessing the model’s predictive ability. Twenty-four samples

were used to create the algorithm, and eight randomly selected samples were validated

using this algorithm. Again, the correlation coefficients as derived from PLSR (for the

calibration samples) were much higher than the correlations initially generated usinga corrgram plot (table 3), and even higher than those determined from the PLSR of

the Military Park samples. However, the correlation coefficient for Cu is skewed due

to the leverage point in the upper-right of the scatter plot. Without the leverage point,

R falls from 0.882 to 0.741.

The predictive accuracy of the calibration model is expressed by the R2 for the

validation samples (table 4 and figure 6). Mn, Zn and Pb had significant coefficients of

determination (0.808, 0.760 and 0.746, respectively) for the validation samples, indi-

cating that the partial least-squares calibration model has the predictive power toestimate metal abundance with accuracy.

This pilot project constitutes the first attempt to estimate Pb concentrations

from spectral reflectance with the aim of improving children’s health in an urban

Table 2. Correlation coefficients of metal concentrations, organic-matter content andreflectance (Ref.) at 800 and 1300 nm. Correlations calculated by plotting metal abundanceand organic-matter content against reflectance measurements. Bolded values are significant at

the 95th percentile (n ¼ 8 for geochemistry and organic matter).

Pb Cu Zn Cd MnOrganic-matter

contentRef. 800

nmRef. 1300

nm

PbCu 0.786Zn 0.911 0.849Cd 0.409 0.685 0.724Mn -0.654 -0.176 -0.484 0.119Org. 0.473 0.557 0.772 0.883 -0.184Ref. 800 nm -0.831 -0.661 -0.828 -0.776 0.307 -0.840Ref. 1300 nm -0.827 -0.666 -0.820 -0.767 0.377 -0.836 0.998

100

200

300

400

500

100 200 300 400 500

Samples

PLSR Model

1:1 correlation

R2 = 0.951

Pb PLSR: Military Park

Laboratory-determined abundance (ppm)

Pre

dict

ed a

bund

ance

(pp

m)

Figure 5. Laboratory-determined abundance of Pb plotted against the abundance of Pbpredicted by PLSR.

4120 C. M. Pandit et al.

Downloaded By: [Filippelli, Gabriel] At: 15:21 13 September 2010

environment. The results demonstrate the feasibility of employing hyperspectral

analysis to determine Pb concentrations. However, this pilot mandates a continuance

of research. The factors of atmospheric conditions, soil moisture, organic matter,

particle size and soil texture need to be addressed before hyperspectral remote sensing

will be able to map Pb concentrations of a city.

4. Conclusions

As suggested in Peters et al. (2003), large concentrations of heavy metals can be

predicted using reflectance spectroscopy. However, it is much more difficult to

Table 3. Absolute value of the correlation coefficients generated from plotting metal concen-trations against reflectance measurements at 800 and 1300 nm, listed above the bold correlationcoefficients derived from PLSR. The advantage of PLSR can be seen by the improvedcorrelations for all constituents. *Cu correlations for the Transacts are those without the

leverage points.

PLSR: Military Park

Organic-matter content Pb Zn Cu* Cd Mn

Ref. 800 nm 0.840 0.831 0.828 0.661 0.776 0.307Ref. 1300 nm 0.836 0.827 0.820 0.666 0.767 0.377PLSR 0.886 0.975 0.972 0.832 0.843 0.968

Table 4. Coefficients of determination generated from the calibration model are listed abovethe coefficient of determinations derived from the model, which estimated soil constituentabundance from reflectance. Cu coefficients of determination are those without the leverage

points.

PLSR: four transects

Organic-matter content Pb Zn Cu Cd Mn

Calibration 0.982 0.985 0.95 0.778 0.982 0.941Validation 0.562 0.746 0.76 0.814 0.426 0.808

0

100

200

300

400

500

0 100 200 300 400 500

Calibration

Validation

1:1 Correlation

Calibration:

Validation:R2 = 0.985

R2 = 0.746

Pb PLSR calibration/validation

Laboratory-determined abundance (ppm)

Pre

dict

ed a

bund

ance

(pp

m)

Figure 6. Comparison of laboratory determined abundance of Pb with those generated fromPLSR calibration and validation.

Estimation of heavy-metal contamination in soil 4121

Downloaded By: [Filippelli, Gabriel] At: 15:21 13 September 2010

determine small concentrations of heavy metals in soil as opposed to large concen-

trations in mines. For the sake of children’s health, it is imperative to be able to

identify these relatively small Pb concentrations in soil because slight increases in Pb

concentration differentiate safe from poisonous Pb levels. In this study, we found

highly significant correlations between reflectance of soil and the soil constituents ofPb, Zn, Mn, Cd and organic matter. The PLSR models for Pb, Zn and Mn were also

accurate in estimating the metal abundance for new (validation) samples. This

research demonstrates the promise of using reflectance spectroscopy and PLSR

modelling to determine Pb and other heavy metal contamination in urban areas.

Further research on the correlation between reflectance spectroscopy and heavy

metals will benefit many, especially in the metropolises of the world. In advancing

remote sensing to the point where it can map the Pb concentrations of a city, those

potentially at risk from Pb poisoning – especially children – can be made aware ofhealth hazards in time to take preventative measures.

References

BELLINGER, D.C., 2004, Lead. Pediatrics, 113, pp. 1016–1020.

BRODY, D.J., PIRKLE, J.L., KRAMER, R.A., FLEGAL, K.M., MATTE, T.D., GUNTER, E.W. and

PASCHAL, D.C., 1994, Blood lead levels in the US population: Phase 1 of the third

national health and nutrition examination survey (NHANES III, 1988 to 1991). Journal

of the American Medical Association, 272, pp. 277–283.

CHANG, C., LAIRD, D.A. and HURBURGH JR, C.R., 2005, Influence of soil moisture on near

infrared reflectance spectroscopic measurement of soil properties. Soil Science, 170,

pp. 244–255.

CHOI, J.H., DAMM, C.J., O’DONOVAN, N.J., SAWYER, R.F., KOSHLAND, C.P. and LUCAS, D.,

2005, Detection of lead in soil with excimer laser fragmentation fluorescence spectro-

scopy. Applied Spectroscopy, 59, pp. 258–261.

FILIPPELLI, G.M., LAIDLAW, M.A.S., LATIMER, J.C. and RAFTIS, R, 2005, Urban lead poisoning

and medical geology: An unfinished story. GSA Today, 15, pp. 4–11.

LAIDLAW, M.A.S., MIELKE, H.W., FILIPPELLI, G.M., JOHNSON, D.L. and GONZALES, C.R., 2005,

Seasonality and children’s blood lead levels: Developing a predictive model using climatic

variables and blood lead data from Indianapolis, Indiana, Syracuse, New York, and New

Orleans, Louisiana (USA). Environmental Health Perspectives, 113, pp. 793–800.

LI, L., 2006, Partial least squares modeling to quantify lunar soil composition with hyperspec-

tral reflectance measurements. Journal of Geophysical Research, 111, pp. E04002.

LILLESAND, T.M., KIEFER, R.W. and CHIPMAN, J.W., 2004, Remote Sensing and Image

Interpretation, pp. 763 (Hoboken, NJ: Wiley).

MIELKE, H., 1999, Lead in the inner cities. American Scientist Online. Available online at:

http://americanscientist.org/template/AssetDetail/assetid/15602 (accessed 1 January

2008).

MILTON, E.J., ROLLIN, E.M. and EMERY, D.R., 1995, Advances in field spectroscopy. In

Advances in Environmental Remote Sensing, F.M. Danson and S.E. Plummer (Eds),

pp. 9–32 (New York: Wiley).

PETERS, D.C., HAUFF, P.L., PEPPIN, W.A., PROSH, E.C., BORSTAD, G.A., HENDERSON III,

F.B., DILLENBECK, E.D., CLOSS, L.G., LEE, K., THIROS, J.L., COULTER, D.W., ROBBINS,

D.A. and WANGERUD, K.W., 2001, Waste identification and characterization for the

international smelter area using CASI/SFSI hyperspectral remote sensing data. In

Proceedings of the Eighth International Conference on Tailings & Mine Waste ’01,

16–19 January 2001, Fort Collins, CO (Rotterdam: Balkema).

PETERS, D.C., HAUFF, P.L., SARES, M.A., HENDERSON III, F.B., BIRD, D., PROSH, E.C. and

PEPPIN, W.A., 2003, Preliminary results of a NASA-funded hyperspectral remote

4122 C. M. Pandit et al.

Downloaded By: [Filippelli, Gabriel] At: 15:21 13 September 2010

sensing project on natural and anthropogenic sources of impacted drainage. In 2003

SME Annual Meeting and Exhibit, 24–26 February, Cincinnati, OH.

RABITO, A., SHORTER, C. and WHITE, L.E., 2003, Lead levels among children who live in public

housing. Epidemiology, 14, pp. 263–268.

WOLD, H., 1966, Nonlinear estimation by iterative least squares procedure. In Research Papers

in Statistics, F. David (Ed.), pp. 411–444 (New York: Wiley & Sons).

Estimation of heavy-metal contamination in soil 4123

Downloaded By: [Filippelli, Gabriel] At: 15:21 13 September 2010