21
Pampa - Appendix Collection 4 Version 1 General coordinator Heinrich Hasenack Team Eduardo Vélez-Martin Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann

Pampa - Appendix · 2019. 8. 29. · Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann . 1 OVERVIEW OF CLASSIFICATION METHOD The production of the

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Pampa - Appendix · 2019. 8. 29. · Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann . 1 OVERVIEW OF CLASSIFICATION METHOD The production of the

Pampa - Appendix

Collection 4

Version 1

General coordinator

Heinrich Hasenack

Team

Eduardo Vélez-Martin

Eliseu José Weber

Juliano Schirmbeck

Bruna Zanatta Moraes

Gabriel Selbach Hofmann

Page 2: Pampa - Appendix · 2019. 8. 29. · Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann . 1 OVERVIEW OF CLASSIFICATION METHOD The production of the

1 OVERVIEW OF CLASSIFICATION METHOD

The production of the Collection 4, with land cover and land use annual maps for

the period of 1985-2018, followed a sequence of steps in the Pampa biome, similar to those

used in the previous Collection 3.1 (Figure 1). However, some improvements were added

up, particularly in the geographical units for classification and in the post classification

filters.

Figure 1 Classification steps to produce Collection 4 in the Pampa biome.

2 GEOGRAPHICAL UNITS OF CLASSIFICATION

In Collection 3.1, the sheets of the World International Chart to the Millionth

(1:250.000), hereafter called ‘charts’, were the spatial units adopted for data processing. A

total of 23 charts were used to cover the biome. Each chart sets the geographical limits to

build up the temporal and spatial Landsat mosaics, to collect training samples and to

proceed with digital classification procedures. The final map of the Pampa biome was

generated merging these 23 units.

In Collection 4 the charts approach was applied only to manage the Landsat mosaics.

For the years 1985-2017 the same mosaics were used with the addition of new ones for

2018. All the following steps were based on newer geographical units, corresponding to

seven homogeneous regions. These regions are an adaptation of the former nine ecological

systems proposed by Hasenack et al. (2010) using vegetation, relief and soils data (Figure

2).

This new approach was an attempt to reduce confusion of samples and classes, to

get a better balance of samples, and to avoid abrupt transitions when merging the

geographical units of classification.

Page 3: Pampa - Appendix · 2019. 8. 29. · Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann . 1 OVERVIEW OF CLASSIFICATION METHOD The production of the

Figure 2 Geographical units of classification in the Pampa biome: the 23 sheets of the World International Chart to the Millionth (1:250.000) used in Collection 3.1 (white lines) and the seven regions used in Collection 4 (yellow lines).

3 LANDSAT IMAGE MOSAICS

3.1 Definition of the temporal period

The image selection period for the Pampa biome was defined aiming to minimize

confusion between natural and cultivated vegetation due to phenological changes, while

trying to maximize the coverage by useful Landsat images after cloud removing/masking.

Unlike most of other the Brazilian biomes, the climate of the Pampa biome does not have a

defined dry season, with the seasonal variation of temperature being the main factor

determining the physiological behavior of vegetation throughout the year.

Forest, although classified mostly as Seasonal Deciduous Forest and to a lesser

extent as Semi-Deciduous, expresses minor deciduousness. Only a small fraction of species

in the tree communities loses leaves during winter, thus Pampa forests are expected to

show less variation in spectral response over the year than other types of vegetation cover.

On the other hand, herbaceous vegetation typologies in terrestrial environments

tend to present a characteristic seasonal pattern. Figure 3 presents a schematic diagram of

the seasonal behavior of grasslands and the most significant summer crops in the Pampa

biome, markedly rice and soybean. During autumn, the photosynthetic production of

herbaceous vegetation begins to decline, reaching its lowest point in winter, when a

Page 4: Pampa - Appendix · 2019. 8. 29. · Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann . 1 OVERVIEW OF CLASSIFICATION METHOD The production of the

significant portion of the leaf biomass reaches a senescent stage. From late winter and early

spring on, annual species germinate and perennial species begin to regrowth, shooting new

leaves and increasing progressively the photosynthetically active biomass, which will reach

its peak in the summer.

Figure 3 Scheme of phenological patterns of native grasslands, and soybean/rice crops in the Pampa biome. The y-axis corresponds to photosynthetic biomass production with merely illustrative values.

Rice and soybean are summer crops. During late winter and early spring, soil

preparation takes place, which can result, giving the different agronomic management

techniques, in exposed soil (conventional planting), dried vegetation (no-tillage) due to

herbicide application, or flooded in the case of rice planting. In such areas it is also usual to

sow winter pastures, for providing soil cover and supplemental forage for the livestock.

Consequently, during winter there are patches with photosynthetically active herbaceous

vegetation in the landscape, contrasting with the senescent native grasslands. Summer

planted pastures have less expression than winter pastures, and their peak of

photosynthetic activity coincides with that of the grassland vegetation during January and

February (Figure 4).

The period of the year allowable to distinguish between native vegetation, crops,

pastures with exotic species and forestry through remote sensing in the Pampa biome was

defined taken the known phenological patterns. It is expected that more contrast will be

found between these land cover typologies from September to November, when summer

crop areas are under preparation for planting, cultivated pastures are off their

photosynthetic peak, and native grasslands are in the beginning of regrowth and

development of new leaves (Figure 5).

Page 5: Pampa - Appendix · 2019. 8. 29. · Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann . 1 OVERVIEW OF CLASSIFICATION METHOD The production of the

Figure 4 Scheme of phenological patterns of winter and summer pasture in the Pampa biome. The y-axis corresponds to photosynthetic biomass production with merely illustrative values.

Figure 5 Schematic phenological pattern of grasslands, summer crops and winter pastures in the Pampa biome, indicating the temporal window ensuring greater contrast for the purpose of satellite imagery classification.

Page 6: Pampa - Appendix · 2019. 8. 29. · Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann . 1 OVERVIEW OF CLASSIFICATION METHOD The production of the

3.2 Image selection

For the selection of Landsat scenes to build the mosaics of each chart for each year,

within the selected standard period September-November, a threshold of 90% of cloud

cover was applied (i.e., any available scene with up to 90% of cloud cover was accepted).

This limit was established based on a visual analysis, after many trials observing the results

of the cloud removing/masking algorithm. When needed, due to excessive cloud cover

and/or lack of data, the standard period was extended to encompass a larger number of

scenes in order to allow the generation of a mosaic without holes. Whenever possible, this

was made by including months in the beginning of the period, in the winter season.

In most cases, at least one additional month had to be included in order to provide

enough images for the mosaic (Table 1), with an overall mean of 3.9 months. In some

specific cases it was needed to extend significantly the temporal period, while in others it

was shortened. Extension was more frequent for years 1990 and 2007, with an average of

six months, while shortening was noticeable for year 2012, when the acceptable period of

four charts had to be reduced to only two months (April and May) due to the low quality of

the available Landsat 7 scenes.

According to the year and the quality of images available, a specific Landsat

collection was selected:

● 1985 to 2000 and 2003 to 2011: Landsat 5,

● 2001, 2002 and 2012: Landsat 7,

● 2013 to 2018: Landsat 8.

3.3 Final quality

Considering the 23 charts of the Pampa biome and the 34 years of Collection 4, a

total of 782 mosaics were produced, all with satisfactory quality. Eventually, small portions

of some individual mosaics remained with no data pixels, but always smaller than 5% of the

chart.

Page 7: Pampa - Appendix · 2019. 8. 29. · Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann . 1 OVERVIEW OF CLASSIFICATION METHOD The production of the

Table 1 Temporal range (number of months) for selection of Landsat scenes used for building the mosaics of each chart of the Pampa biome for each year in the period 1985-2018.

1

9

8

5

1

9

8

6

1

9

8

7

1

9

8

8

1

9

8

9

1

9

9

0

1

9

9

1

1

9

9

2

1

9

9

3

1

9

9

4

1

9

9

5

1

9

9

6

1

9

9

7

1

9

9

8

1

9

9

9

2

0

0

0

2

0

0

1

2

0

0

2

2

0

0

3

2

0

0

4

2

0

0

5

2

0

0

6

2

0

0

7

2

0

0

8

2

0

0

9

2

0

1

0

2

0

1

1

2

0

1

2

2

0

1

3

2

0

1

4

2

0

1

5

2

0

1

6

2

0

1

7

2

0

1

8

M

e

a

n

SH-21-V-D 4 4 3 3 3 6 3 3 3 3 7 3 3 3 4 3 3 3 3 3 6 3 8 3 3 3 3 3 3 3 6 3 3 3 4

SH-21-X-A 3 3 3 3 3 6 3 3 5 4 3 3 4 4 3 3 4 4 3 3 4 3 8 3 3 10 3 3 3 3 6 3 3 3 4

SH-21-X-B 3 4 8 3 6 5 3 3 6 4 7 4 4 4 3 5 3 5 3 3 3 4 8 3 3 4 3 2 3 3 3 3 3 3 4

SH-21-X-C 4 4 3 3 3 7 3 3 5 3 8 4 5 3 4 3 4 3 3 3 4 3 8 3 3 3 3 3 3 3 6 3 3 3 4

SH-21-X-D 5 5 3 3 3 4 3 3 4 5 8 4 5 3 3 5 4 5 3 3 3 3 7 3 3 8 3 2 3 3 3 3 3 3 4

SH-21-Y-B 4 4 3 3 3 6 3 3 3 3 7 3 3 4 3 3 3 3 3 3 5 3 8 3 3 3 3 3 3 3 5 3 4 3 4

SH-21-Z-A 4 3 7 3 3 5 3 3 7 3 5 3 3 4 3 3 4 3 3 4 3 3 8 3 3 3 3 3 3 3 3 3 4 3 4

SH-21-Z-B 5 3 3 4 3 4 3 3 4 5 8 3 3 3 3 3 4 3 3 3 3 3 4 3 3 3 3 3 3 3 3 3 3 3 3

SH-21-Z-C 7 3 3 3 3 6 7 4 5 3 4 3 3 4 4 3 3 3 3 3 3 3 8 3 3 3 3 3 3 3 3 3 5 3 4

SH-21-Z-D 5 4 3 4 3 4 3 3 3 7 4 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3

SH-22-V-A 3 5 8 3 4 4 3 3 3 3 4 5 3 3 3 3 4 5 3 3 3 3 4 3 3 3 3 2 3 3 3 3 4 3 4

SH-22-V-C 5 4 8 4 3 4 3 5 3 5 3 6 5 4 3 6 3 7 3 3 3 3 5 3 3 3 3 2 3 3 3 6 3 3 4

SH-22-V-D 3 3 8 3 3 4 3 4 3 4 3 7 8 4 3 3 4 3 4 6 8 3 5 3 5 3 3 3 3 4 4 5 3 3 4

SH-22-X-C 3 3 3 3 9 5 3 4 3 4 3 6 4 5 3 3 5 5 4 3 6 3 4 3 3 3 8 3 3 3 3 3 3 3 4

SH-22-Y-A 5 3 7 5 3 6 3 8 3 3 3 7 4 3 3 5 4 3 4 3 3 3 5 3 3 3 3 3 3 3 3 9 3 3 4

SH-22-Y-B 8 4 6 4 3 5 3 12 3 7 3 6 5 4 3 5 3 3 4 3 5 3 6 3 3 5 3 3 3 3 6 5 3 3 4

SH-22-Y-C 7 3 6 4 3 6 3 3 3 3 3 6 3 3 3 3 4 3 3 3 3 3 3 3 3 4 3 3 3 3 3 6 4 3 4

SH-22-Y-D 8 4 5 3 4 8 4 3 3 7 3 5 5 6 3 3 4 3 3 3 6 6 6 3 3 8 3 3 3 3 6 4 3 3 4

SH-22-Z-A 3 3 3 3 6 3 3 9 3 5 4 8 4 5 3 3 4 7 4 3 4 4 6 3 3 3 3 3 3 3 5 3 4 3 4

SH-22-Z-C 5 3 3 3 4 5 3 3 3 4 3 6 4 3 3 3 4 3 3 3 3 3 3 3 3 3 3 3 3 3 4 3 3 3 3

SI-22-V-A 6 5 6 3 3 8 8 3 3 3 3 7 5 3 3 3 5 3 3 3 3 3 6 3 3 3 3 3 3 3 3 5 3 3 4

SI-22-V-B 3 7 3 3 4 11 5 3 3 7 3 5 5 5 3 3 3 3 3 3 5 6 4 3 3 3 3 3 3 3 7 5 3 3 4

SI-22-V-C 5 5 3 3 6 7 5 4 6 4 3 5 8 3 3 3 3 3 3 3 3 3 5 3 3 8 3 3 3 3 3 5 3 3 4

Mean 5 4 5 3 4 6 4 4 4 4 4 5 4 4 3 3 4 4 3 3 4 3 6 3 3 4 3 3 3 3 4 4 3 3

Page 8: Pampa - Appendix · 2019. 8. 29. · Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann . 1 OVERVIEW OF CLASSIFICATION METHOD The production of the

4 CLASSIFICATION

4.1 Classification scheme

The digital classification of the Landsat mosaics for the Pampa biome aimed to

individualize a subset of eight land use and land cover (LULC) classes (Table 2) from the

complete MapBiomas Collection 4 legend, which were integrated with the cross-cutting

themes in a further step.

Table 2 Land cover and land use categories considered for digital classification of Landsat mosaics for the Pampa biome in the MapBiomas Collection 4.

Legend class of Collection 4 Numeric ID Color

1.1.1. Forest Formation 3

2.1. Wetland 11

2.2. Grassland Formation 12

2.4. Rocky Outcrop 29

3.3 Mosaic of Agriculture and Pasture 21

4.4. Other non vegetated area 25

5. Water 33

6. Non Observed 27

4.2 Feature space

The feature space for digital classification of the categories of interest for the Pampa

biome comprised a subset of 27 variables (Table 3), taken from the complete feature space

of MapBiomas Collection 4. These variables include the original Landsat reflectance bands,

as well as vegetation indexes, spectral mixture modeling-derived variables, terrain

morphometry (slope), and a spatial texture measure. The definition of the subset was made

based on the expected usefulness of each variable to discriminate the targets of concern,

taking into account local knowledge about their spectral, spatial and temporal dynamics.

Page 9: Pampa - Appendix · 2019. 8. 29. · Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann . 1 OVERVIEW OF CLASSIFICATION METHOD The production of the

Table 3 Feature space subset considered in the classification of the Pampa biome Landsat image mosaics in the MapBiomas Collection 4 (1985-2018). ID Variable Description Statistics Temporal range Script acronym Group

1 Blue Landsat band median mosaic months 'median_blue' Landsat band 2 Green Landsat band median mosaic months 'median_green' Landsat band

3 Near Infrared (NIR)

Landsat band median mosaic months 'median_nir' Landsat band

4 Red Landsat band median mosaic months 'median_red' Landsat band

5 Shortwave Infrared (SWIR) 1

Landsat band median mosaic months 'median_swir1' Landsat band

6 Shortwave Infrared (SWIR) 2

Landsat band median mosaic months 'median_swir2' Landsat band

7 Evi 2 Enhanced Vegetation Index 2 median mosaic months 'median_evi2' Vegetation Index 8 Ndvi Normalized Difference Vegetation Index median mosaic months 'median_ndvi' Vegetation Index

9 Ndvi Dry Normalized Difference Vegetation Index median year -below first quartile

'median_ndvi_dry' Vegetation Index

10 Ndvi Wet Normalized Difference Vegetation Index median year - above third quartile

'median_ndvi_wet' Vegetation Index

11 Savi Soil-adjusted Vegetation Index median mosaic months 'median_savi' Vegetation Index 12 Sefi Savanna Ecosystem Fraction Index median mosaic months 'median_sefi' Vegetation Index 13 Sefi Savanna Ecosystem Fraction Index standard deviation complete year 'stdDev_sefi' Vegetation Index 14 Ndwi Normalized Difference Water Index median mosaic months 'median_ndwi' Water Index

15 Ndwi Wet Normalized Difference Water Index median year - above third quartile

'median_ndwi_wet' Water Index

16 Gv green vegetation fraction median mosaic months 'median_gv' Spectral Mixture Modeling

17 Gvs GV / (100 - shade) median mosaic months 'median_gvs' Spectral Mixture Modeling

18 Ndfi Normalized Difference Fraction Index median mosaic months 'median_ndfi' Spectral Mixture Modeling

19 Ndfi Normalized Difference Fraction Index standard deviation complete year 'stdDev_ndfi' Spectral Mixture Modeling

20 Ndfi Wet Normalized Difference Fraction Index median year - above third quartile

'median_ndfi_wet' Spectral Mixture Modeling

21 Npv npv fraction median mosaic months 'median_npv' Spectral Mixture Modeling

Page 10: Pampa - Appendix · 2019. 8. 29. · Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann . 1 OVERVIEW OF CLASSIFICATION METHOD The production of the

22 Npv npv fraction standard deviation complete year 'stdDev_npv' Spectral Mixture Modeling

23 Shade shade fraction median mosaic months 'median_shade' Spectral Mixture Modeling

24 Soil soil fraction median mosaic months 'median_soil' Spectral Mixture Modeling

25 Soil soil fraction standard deviation complete year 'stdDev_soil' Spectral Mixture Modeling

26 Green spatial texture

Spatial texture median mosaic months 'textG' Spatial Texture

27 Slope Slope - permanent 'slope' Geomorphometric

Page 11: Pampa - Appendix · 2019. 8. 29. · Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann . 1 OVERVIEW OF CLASSIFICATION METHOD The production of the

4.3 Classification algorithm, training samples and parameters

Digital classification was performed region by region, year by year, using a Random

Forest algorithm (Breiman, 2001) available in Google Earth Engine, running 60 iterations

(random forest trees). Training samples for each region were defined following a strategy

of using pixels for which the LULC remained the same along the 33 years of Collection 3.1,

hereafter called “stable samples”. An additional set of manually drawn polygons were used

as complementary stable samples.

4.3.1 Stable samples from Collection 3.1

The extraction of stable samples from the previous Collection 3.1 followed several

steps aiming to ensure their confidence for use as training areas. First, a threshold was

established for each class, specifying a minimum number of years in which a pixel should

remained with that class to be eligible as a stable sample. Then, a layer of pixels with a

stable classification along the 33 years of Collection 3.1 was generated by applying such

thresholds and used to extract random training points. A subset of 2,000 points were

randomly selected for each class, in each region, to classify all the 34 years.

4.3.2 Complementary samples

The need for complementary samples for each region and for all classes was

evaluated through the analysis of preliminary results from the classification of 2017, used

as a reference. The analysis was done combining three indicators: visual inspection, global

accuracy and confusion matrix, and mapped area in each class. Whenever better results

have been achieved and were stable, then the addition of extra polygons ceased.

The complementary samples were get drawing polygons directly in the Google Earth

Engine Code Editor. The same concept of stable samples was applied, checking the false-

color composites of the Landsat mosaics for all the 34 years during the polygon drawing. An

additional sample of 500 to 800 points were extracted from these polygons to complement

the training points used in classification.

4.3.3 Final classification

Final classification was performed for all regions and years using the merged set of

3.1 stable samples and complementary samples. All years used the same set of samples and

were trained with the correspondent mosaic of the year that was classified.

5 POST-CLASSIFICATION

The results of the final classification were improved through a sequence of filters, to

correct missing data, “salt-and-pepper” classification errors and other misclassification

problems.

Page 12: Pampa - Appendix · 2019. 8. 29. · Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann . 1 OVERVIEW OF CLASSIFICATION METHOD The production of the

5.1 Gap fill filter

This filter implemented a hole-filling routine using information of previous years to

replace pixels classified as Non Observed.

5.2 Spatial filter

The spatial filter uses a mask to change only those patches with pixels connected to

five or less pixels of the same class. These pixels were replaced by the most frequent value

of their correspondent eight neighbors.

5.3 Temporal filter

The temporal filter uses the information from the previous year and the year later

to identify and correct a pixel misclassification, considered as cases of invalid transitions.

The rules differ for the first year (1985), the last year (2018) and intermediate years.

The process starts looking at the three first years. Whenever a pixel in the year 1985 does

not belong to any native vegetation classes (3, 11, 12, 29) but is repeatedly followed by

them in 1986 and 1987, then it is corrected by the corresponding natural class. This

procedure avoids cases of false positives of regeneration in the first year. The next step

focuses on the three last years. Whenever a pixel in the year 2018 is not classified as 21

(Mosaic of Agriculture and Pasture class) but is preceded by two years equal to 21, then it

is corrected to the class 21, avoiding false positives of regeneration. The last step uses a 3-

year moving window to correct intermediate years. Whenever a pixel with the middle year

preceded and succeeded by the same other class, it is replaced with this correspondent

class. This procedure fixes very abrupt transitions that are unlikely to happen. The filter is

applied step by step respecting the following sequence of classes: [33, 29, 22, 21, 11, 3, 12].

5.4 Frequency filter

Frequency filters were applied to use the temporal information available for each

pixel to correct a very specific subset of misclassification problems.

The general logic of the frequency filter is to search for each pixel a specific

combination of classes throughout the 34 years and to replace all the original data by a

specific class. The output of this filter always sets the same target class to all 34 years,

demanding use with parsimony. Four different variations of the frequency filters were

employed.

The first frequency filter was applied to those pixels considered as cases of “stable

native vegetation”, including only shifts among classes 3, 11, 12, 22, 29 and 33. For instance,

if a stable native vegetation pixel is at least 90% of years classified as Forest and the

remaining 10% years classified as Grassland, all the Grassland shifted to Forest. In such

situations, it is more likely that the years with Grassland were misclassified and not real

cases of transitions between Forest and Grasslands. Abrupt shifts among native vegetation

classes are less expected to occur at this temporal range.

Page 13: Pampa - Appendix · 2019. 8. 29. · Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann . 1 OVERVIEW OF CLASSIFICATION METHOD The production of the

The result of this frequency filter is a classification with a more stable classification

among native vegetation classes. Another important result is the removal of noises in the

first and last year in classification.

The second frequency filter was used to fix the confusion among rice paddies, water,

and wetlands that occurs at regions 5, 6, and 7. First, the rule selects all pixels with shifts

among classes 11, 21, and 33 and sets to class 21 whenever it has a frequency greater than

33% along the 34 years. Rice paddies out of the cropping season show temporary waters

and development of aquatic vegetation confusing agriculture with water or wetlands during

the classification.

The third frequency filter corrected the confusion among wetlands, forest, and

water which is common at regions 3, 5, 6, and 7. First, the rule selects all pixels with shifts

among classes 3, 11, and 33 and sets to class 33 or 11, whenever they have a frequency

greater than 33% along the 34 years. The radiometric response of dense aquatic vegetation

is very close to that of forests inducing the classifier to include false positives of this class in

places without the minimal physiological conditions to support trees.

The fourth frequency filter corrected the confusion among non-vegetated areas,

agriculture, forest, and grassland, which appeared at regions 1, 2, 3, and 4, especially in the

first third of the temporal range. First, the rule selects all pixels with shifts among classes 3,

12, 21, and 22, with a mandatory presence of class 22 greater than one year. Pixels with this

behavior were shifted to the class with the highest frequency along the 34 years. This

procedure solved the confusion of places with grasslands on shallow soils or with

agricultural soils temporarily exposed and incorrectly classified as permanently non-

vegetated areas in some years.

5.5 Incident filter

To correct the classification of pixels considered with an excessive amount of

changes along the 34 years an incident filter was applied. The procedure assumes that there

are some temporal variations in the classification that are not real, but artifacts arising from

two main sources.

The first one comes from small tilts in the acquisition of the original radiometric

information by the satellite sensors associated with issues of georeferencing precision.

These situations affect especially those pixels located at the border of two classes when

compared over successive years. The same pixel may be classified to each one of the

neighbor classes according to the intensity of these unwanted effects. The rule to solve the

problem includes three steps: to select all pixels that shifted above a threshold of eight

times among classes over the 34 years (incident pixels), to select only small patches of pixels

with this behavior (less than 6 incident connected pixels), and to replace the original

variable classification for all years by a unique class equivalent to the most frequent class

observed at each pixel (mode).

The second one derives from lands with an intense use by family farming. These

places show dynamic transformations within and between years. The greenness of

Page 14: Pampa - Appendix · 2019. 8. 29. · Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann . 1 OVERVIEW OF CLASSIFICATION METHOD The production of the

rotational annual crops is a frequent source of confusion with evergreen forest, increasing

forest commission errors. The rule applied to fix the problem also includes three steps: to

select all pixels that shifted above a threshold of ten times among classes over the 34 years,

to select only small areas (patches greater than 6 and lower than 66 incident connected

pixels), to search for those incident pixels where the most frequent class observed is forest

(class 3) or mosaic of pasture and agriculture (class 21) and replace the original classification

for all years by class 21.

6 INTEGRATION WITH CROSS-CUTTING THEMES

The maps resulting from the post-classification filters for each of the 34 years (1985-

2018) were then integrated with the cross-cutting themes, through hierarchical rules of

prevalence (Table 4). The output of this step is a final LULC set of maps for the Pampa biome

for the 34 years.

In the Pampa biome, the Grassland Formation class overlaps most of the Pasture

class. The remaining pixels classified as Pasture (15) were remapped to class 19 (Annual

and Perennial Crop) whenever overlapping class 21 (Mosaic of Agriculture or Pasture) and

to class 25 (Other non-vegetated Area) whenever overlapping class 25. These rules are

specific for the Pampa biome where planted pastures are in most of the cases a land cover

practice out of the harvest season.

Table 4 Prevalence rules for combining the output of classification after filtering with the crosscutting themes in the Pampa biome, Collection 4.

Order Class Name Font

1 23 4.1. Beach and Dune Crosscutting Theme

2 33 5.1. River, Lake and Ocean Biome

3 9 1.2. Forest Plantation Crosscutting Theme

4 30 4.4. Mining Crosscutting Theme

5 24 4.2. Urban Infrastructure Crosscutting Theme

6 19 3.2.1. Annual and Perennial Crop Crosscutting Theme

7 3 1.1.1. Forest Formation Biome

8 29 4.5. Rocky Outcrop Biome

9 11 2.1. Wetland Biome

10 12 2.2. Grassland Formation Biome

11 15 3.1. Pasture Crosscutting Theme

12 25 4.3. Other non-vegetated Area Biome

13 21 3.3 Mosaic of Agriculture or Pasture Biome

Page 15: Pampa - Appendix · 2019. 8. 29. · Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann . 1 OVERVIEW OF CLASSIFICATION METHOD The production of the

7 VALIDATION STRATEGIES

7.1 Mapping Agreement Analysis

Overall, there have been few previous initiatives on LULC mapping the Pampa biome

with spatial and thematic detail compatible with the MapBiomas Project. There are

available basically three maps that depict the years 2002 (Hasenack et al., 2015), 2009

(Weber et al., 2016) and 2015 (Hofmann et al., 2018) for the state of Rio Grande do Sul,

which includes the Pampa biome.

All of the mentioned maps were produced through visual interpretation of Landsat

imagery, therefore being vector polygon maps, aiming to ensure level of spatial detail

compatible with cartographic scale 1:250,000. Their thematic richness, on the other hand,

comprises a number of categories of natural vegetation cover and anthropic uses that is

larger than those of Collection 4 of the MapBiomas project. Thus, for validation purposes,

the three available vector maps were first rasterized with a spatial resolution of 30 meters,

like the MapBiomas’ maps, then reclassified according to the MapBiomas legend (Figures

6, 7, and 8), and used as references for an analysis of agreement with the digital

classification maps of the same years. Different from accuracy assessment based on

sampling points, here the whole surface of each MapBiomas map was compared with the

reference map of the corresponding year.

The overall agreement between the MapBiomas LULC maps and the reclassified

reference maps was of 70.23% for the year 2002, 70.13% for 2009 and 71.04% for 2015.

Allocation disagreement was the major component of disagreement (Pontius and Millones,

2011), reaching 23.4% in 2002, 23.82% in 2009 and 23.06% in 2015. This reflects in part the

different nature of both sets of maps: the reference maps are inherently more generalized

due to the manual polygon drawing, while the MapBiomas’ digital classification maps are

pixel-based, so that a single polygon delineating a unique class in the former can encompass

several pixels of different classes in the latter.

Average value for the three years was of 70.23% for overall agreement, 23.4% for

allocation disagreement and 6.28% of quantity disagreement. Summing overall agreement

and allocation disagreement gives an indicator of areal agreement, which was above 93%

in all the three years, with an average of 93.89%.

Page 16: Pampa - Appendix · 2019. 8. 29. · Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann . 1 OVERVIEW OF CLASSIFICATION METHOD The production of the

Figure 6 Reference map for the Pampa biome, year 2002 (Hasenack et al., 2015), reclassified to the legend of Collection 4 of the MapBiomas project.

Page 17: Pampa - Appendix · 2019. 8. 29. · Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann . 1 OVERVIEW OF CLASSIFICATION METHOD The production of the

Figure 7 Reference map for the Pampa biome, year 2009 (Weber et al., 2016), reclassified to the legend of Collection 4 of the MapBiomas project.

Page 18: Pampa - Appendix · 2019. 8. 29. · Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann . 1 OVERVIEW OF CLASSIFICATION METHOD The production of the

Figure 8 Reference map for the Pampa biome, year 2015 (Hofmann et al., 2018), reclassified to the legend of Collection 4 of the MapBiomas project.

Page 19: Pampa - Appendix · 2019. 8. 29. · Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann . 1 OVERVIEW OF CLASSIFICATION METHOD The production of the

7.2 Mapping Accuracy Analysis

A new set of 2,361 independent validation points provided by Lapig (Laboratório de

Processamento de Imagens e Geoprocessamento - UFG) was used to perform accuracy

analysis (Figure 9).

Figure 9 Lapig independent random points used for accuracy analysis in the Pampa biome.

To evaluate the improvements of different classification filters of Collection 4 we

performed a comparative analysis of accuracy, including the previous Collection 3.1. Those

classes presented only in the integration were previously remapped to the corresponding

classes in the biome classification to allow direct comparison of accuracy results between

classifications with and without the integration with cross-cutting themes.

Accuracy results indicate positive outcomes for the different filters and better

results for Collection 4 when compared with Collection 3.1 (Figure 10).

Page 20: Pampa - Appendix · 2019. 8. 29. · Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann . 1 OVERVIEW OF CLASSIFICATION METHOD The production of the

Figure 10 Comparative analysis of accuracy in the Pampa biome, including different classification filters in the Collection 4 and Collection 3.1 comparison.

Page 21: Pampa - Appendix · 2019. 8. 29. · Eliseu José Weber Juliano Schirmbeck Bruna Zanatta Moraes Gabriel Selbach Hofmann . 1 OVERVIEW OF CLASSIFICATION METHOD The production of the

8 REFERENCES

Breiman, L. Random forests. Machine learning, v. 45, n. 1, p. 5-32, 2001.

Hasenack, H.; Cordeiro, J.L.P; Weber, E.J. (Org.). 2015. Uso e cobertura vegetal do Estado

do Rio Grande do Sul – situação em 2002. Porto Alegre: UFRGS IB Centro de Ecologia. 1a

ed.

Hasenack, H.; Weber, E.; Boldrini, I. I. Trevisan, R. 2010. Mapa de sistemas ecológicos das

Savanas Uruguaias em escala 1:500.000 ou superior. Porto Alegre, Centro de Ecologia.

Relatório técnico Projeto UFRGS/TNC. 18 p.

Hofmann, G.S.; Weber, E.J; Hasenack, H. (Org.). Uso e cobertura vegetal do Estado do Rio

Grande do Sul – situação em 2015. Porto Alegre: UFRGS IB Centro de Ecologia, 2018. 1a ed.

Pontius, R.G., Millones, M., 2011. Death to Kappa: birth of quantity disagreement and

allocation disagreement for accuracy assessment. Int. J. Remote Sens. v. 32, p. 4407–4429.

http://dx.doi.org/10.1080/01431161.2011.552923.

Weber, E.J.; Hofmann, G.S.; Oliveira, C.V.; Hasenack, H. (Org.). Uso e cobertura vegetal do

Estado do Rio Grande do Sul – situação em 2009. Porto Alegre: UFRGS IB Centro de

Ecologia, 2016. 1a ed. ISBN 978-85-63843-20-3.