Transcript
Page 1: ISEC 2014 (International Statistical Ecology Conference)

Grid spacing and quality of spatially predicted speciesabundances

A case-study for zero-inflated spatial data

Olga Lyashevska* Dick Brus** Jaap van der Meer*

*Royal Netherlands Institute for Sea ResearchDepartment of Marine Ecology

**Alterra, Wageningen University and Research Centre

[email protected]

July, 2 2014

Lyashevska et al, 2014 [email protected] July, 2 2014 1 / 16

Page 2: ISEC 2014 (International Statistical Ecology Conference)

Problem

Sampling is expensive, therefore it is important to statisticallyevaluate sampling designs prior to implementation ofmonitoring network;

This has been done before . . . (Bijleveld et al., 2012; Brus andde Gruijter, 2013), but. . .

spatial empirical ecological data are typically zero-inflated

and accounting for spatial dependence of such data is notstraightforward.

Lyashevska et al, 2014 [email protected] July, 2 2014 2 / 16

Page 3: ISEC 2014 (International Statistical Ecology Conference)

Problem

Sampling is expensive, therefore it is important to statisticallyevaluate sampling designs prior to implementation of monitoringnetwork;

This has been done before . . . (Bijleveld et al., 2012; Brus andde Gruijter, 2013), but. . .

spatial empirical ecological data are typically zero-inflated

and accounting for spatial dependence of such data is notstraightforward.

Lyashevska et al, 2014 [email protected] July, 2 2014 2 / 16

Page 4: ISEC 2014 (International Statistical Ecology Conference)

Problem

Sampling is expensive, therefore it is important to statisticallyevaluate sampling designs prior to implementation of monitoringnetwork;

This has been done before . . . (Bijleveld et al., 2012; Brus andde Gruijter, 2013), but. . .

spatial empirical ecological data are typically zero-inflated

and accounting for spatial dependence of such data is notstraightforward.

Lyashevska et al, 2014 [email protected] July, 2 2014 2 / 16

Page 5: ISEC 2014 (International Statistical Ecology Conference)

Problem

Sampling is expensive, therefore it is important to statisticallyevaluate sampling designs prior to implementation of monitoringnetwork;

This has been done before . . . (Bijleveld et al., 2012; Brus andde Gruijter, 2013), but. . .

spatial empirical ecological data are typically zero-inflated

and accounting for spatial dependence of such data is notstraightforward.

Lyashevska et al, 2014 [email protected] July, 2 2014 2 / 16

Page 6: ISEC 2014 (International Statistical Ecology Conference)

Aim

1. To work out a methodology for statistical evaluation ofsampling designs for zero-inflated spatially correlated countdata;

2. To test proposed methodology in a real-world case study.

Lyashevska et al, 2014 [email protected] July, 2 2014 3 / 16

Page 7: ISEC 2014 (International Statistical Ecology Conference)

Aim

1. To work out a methodology for statistical evaluation of samplingdesigns for zero-inflated spatially correlated count data;

2. To test proposed methodology in a real-world case study.

Lyashevska et al, 2014 [email protected] July, 2 2014 3 / 16

Page 8: ISEC 2014 (International Statistical Ecology Conference)

Methodology

Postulate a statistical model of the spatial distribution of thevariable;

Use prior data to calibrate such model;

Simulate a large number of pseudo-realities;

Sample each pseudo-reality repeatedly with candidate samplingdesigns;

Predict variable of interest at validation points;

Compute performance statistics;

Select the best candidate design out of evaluated candidates

Lyashevska et al, 2014 [email protected] July, 2 2014 4 / 16

Page 9: ISEC 2014 (International Statistical Ecology Conference)

Methodology

Postulate a statistical model of the spatial distribution of the variable;

Use prior data to calibrate such model;

Simulate a large number of pseudo-realities;

Sample each pseudo-reality repeatedly with candidate samplingdesigns;

Predict variable of interest at validation points;

Compute performance statistics;

Select the best candidate design out of evaluated candidates

Lyashevska et al, 2014 [email protected] July, 2 2014 4 / 16

Page 10: ISEC 2014 (International Statistical Ecology Conference)

Methodology

Postulate a statistical model of the spatial distribution of the variable;

Use prior data to calibrate such model;

Simulate a large number of pseudo-realities;

Sample each pseudo-reality repeatedly with candidate samplingdesigns;

Predict variable of interest at validation points;

Compute performance statistics;

Select the best candidate design out of evaluated candidates

Lyashevska et al, 2014 [email protected] July, 2 2014 4 / 16

Page 11: ISEC 2014 (International Statistical Ecology Conference)

Methodology

Postulate a statistical model of the spatial distribution of the variable;

Use prior data to calibrate such model;

Simulate a large number of pseudo-realities;

Sample each pseudo-reality repeatedly with candidate samplingdesigns;

Predict variable of interest at validation points;

Compute performance statistics;

Select the best candidate design out of evaluated candidates

Lyashevska et al, 2014 [email protected] July, 2 2014 4 / 16

Page 12: ISEC 2014 (International Statistical Ecology Conference)

Methodology

Postulate a statistical model of the spatial distribution of the variable;

Use prior data to calibrate such model;

Simulate a large number of pseudo-realities;

Sample each pseudo-reality repeatedly with candidate samplingdesigns;

Predict variable of interest at validation points;

Compute performance statistics;

Select the best candidate design out of evaluated candidates

Lyashevska et al, 2014 [email protected] July, 2 2014 4 / 16

Page 13: ISEC 2014 (International Statistical Ecology Conference)

Methodology

Postulate a statistical model of the spatial distribution of the variable;

Use prior data to calibrate such model;

Simulate a large number of pseudo-realities;

Sample each pseudo-reality repeatedly with candidate samplingdesigns;

Predict variable of interest at validation points;

Compute performance statistics;

Select the best candidate design out of evaluated candidates

Lyashevska et al, 2014 [email protected] July, 2 2014 4 / 16

Page 14: ISEC 2014 (International Statistical Ecology Conference)

Methodology

Postulate a statistical model of the spatial distribution of the variable;

Use prior data to calibrate such model;

Simulate a large number of pseudo-realities;

Sample each pseudo-reality repeatedly with candidate samplingdesigns;

Predict variable of interest at validation points;

Compute performance statistics;

Select the best candidate design out of evaluated candidates

Lyashevska et al, 2014 [email protected] July, 2 2014 4 / 16

Page 15: ISEC 2014 (International Statistical Ecology Conference)

Case Study

Dutch Wadden Sea;

Area: 2483 km2;

Abundance of Baltic tellin(M. balthica);

Centrifuge tube (17.3 – 17.7cm) to a depth of 25 cm

June–October 2010

Lyashevska et al, 2014 [email protected] July, 2 2014 5 / 16

Page 16: ISEC 2014 (International Statistical Ecology Conference)

Field data - Species Abundance

0

1000

2000

3000

0 25 50 75Species abundance

Cou

nts

90% observations are zeros

max 100 individuals

µ = 1.39 individuals

var = 24 individuals

Lyashevska et al, 2014 [email protected] July, 2 2014 6 / 16

Page 17: ISEC 2014 (International Statistical Ecology Conference)

Field data - Species Occurrence

●●●●●●

●●●

●●●●●●●●

●●●●●●●●●●●

●●

● ●●●●●●●●●

●●●●●●●●●

●●●●●●

●●●

●●●●●

●●●●●●●●●●●●

●●●●●●● ●●●●

●●●●●●●●

●●●

●●●

●●●●●●

●●●●●●●

●●●●●●●●●

●●●●

●●●●●●●●●●

●●●●

●●●● ●●●●●●●●

●●●●●●●●●

●●

●●●●

●●●●●●●●●●● ●

●●●

●●●

●●●●

●●●●●

●●

●●

●●●●●●●●●●● ●

●●●●●●●●●

●●●

●●●●●●

●● ●

●●●●●●●●

●●●●●●●

●●●●●●●●●●●

●●●●

●●●

●●●●●

●●●●

●●●

●●●●●●●●●●●●●●●

●●●●

●●●●●●

●●●●●●●●●●

●●●●●

●●●

●●●●●●●●

●● ●●●●●

●●●●

●●●●●●● ●●●●●●●●●●

●●●●●●● ●●●●●

●●

●●●●●●

●●●●

●●●●●●●

●●●● ●●●●

●●●●●

●●●●●●●

●●●●●●●

●●●●

●●●●

●●●●●

●●●●●●●

●●●

●●●●●

●●●●●●●●●●

●●

●●●●●●●●

●●

●●●

●●●●●●●●●

●●●●●●

●●●● ●

●●

●●●●●●●●●● ●●

●●●

●●●

●●●

●●●●●●●

●●●●●●●●

●●●●●

●●●●●●●●

●●●●●●●●

●●●●●●●●

●●●

●●●●●

●●●●

●●●

●●●●●●●●●●

●●●●●●●●●●

●●●

●●

●●

●●●●●●●● ●

●●●●●●●

●●●

● ●●●●●●

●●●●●●●

●●●●

●●●●●●

●●●●

●●●

●●●●

●●●●

●●●●●● ●

●●●●●●

●●

●●●●●●●

●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●●●

●●●●●

●●●●●

●●●●●●●●●

●●●●

●●●●●

●●●●●●●●●

●●●●

●●●●●

●●

● ●●●●●

●●

● ●●●●●●●●●●●●

●●●●●●●●●

●●●●●●

● ●●●●●●●●●●●

●●●●●●●●●●●

●●●●●●●

●●●●●●●●●

●●●●●

●●●●●●●●●●●

●●●●●

●●●●●●●●

●●●

●●●

●●●

●●●●●● ●●●●●●

●●

●●

●●●●●

●●●●●●

●●●●●

●●●●●● ●●

●●●●●

●●●●●

●●●

●●

●●

●●●●

●●●●●●

●●●

●●

●●●

●●●

●●●●●●●

●●

●●●

●●●●

●●●●

●●●

●●●●●

●●●●●●

●●●●

●●●●●●● ●●●● ●

●●●●

●●●●●●●●●●●

●●●●

●●●●

●●●● ●●

●●●●

●●●

●●●●●●●●●●●

●●●●

●●● ●●●●● ●●

●●●●

●●●●●●●●●

●●●●●●

●●●●●●

●●●●●●●●●●●●●

●●

●●●●●●●●●●●●●

●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●

●●●●●

●●●●

●●●●●●●●●

● ●● ●●●

●●●●●●●

●●●●●●

●●●●

●●●●

●●●●●●●●

●●●

●●●●●●●●●●●●

●●

●●●●●●●●●●

●●●●

●●●●

●●●

●●●●

●●

●●●●●●

●●●●●●

●●●●●

●●●●●●●●●

●●●●● ●

●●●●●

●● ●●●

●●●●●●●

●●

●●●●

●●●●●●●●●

●●●●

●●●●● ●●●

●●●●●

●●●●●●●●●● ●●●●●●●●●●

●●●●●●●●●●●

●●● ●

●●

●●●●●●●●●●●●●●●

●●●● ●

●●

●●●●●●●●●●●

●●

●●●●●

●●●

●●●

●●●●●●●●●

●●● ●●●●●●●

●●●

●●●●●●●●●

●●●●●

●●●●●●●●●●●●● ●

●●●● ●

●●● ●

●●●

●● ●

● ●●

●●●●●

●●●

●●

●●●

●● ●●●●●

●●●

●●●

●●●

●●●●

●●●

●●

●●●●●●

●●●●

●●●●

●● ●

●●●●●

●●●

●●●●

●●●

● ●● ●

●●●

●●●●●●●

●●●●

●●●

●●●●●

● ●●●●

●● ●●●●●

● ●●●●

●●

● ●●●●●●●●

●●●●●●●● ●

●●●●

●●●●●●●

●●●●●

●●●●

●●●●●

●●●

●●

●●●●● ●

●●●●●●●

●●●

●●●●●●●

●●●●●

●●

●● ●

●●

●●●

●●●

●●●

●●●●

●●●●●●●●●●●

● ●●●●●●

●●●

●●●

●●●●●●●●●

●●●●●●●

●●

●●●●●

●●●●

●●● ●●

●●● ●●●●●●●●●

●●●●

●●●●● ●

● ●●●●●

●●●●

●●●

●●●●●●●●●●●

●●●●●

●●●●●●

●●●

●●●●● ●●●●

●●●●●

●● ●●

●●● ●

●●●●●●●●●●●●●●●

●●

●●

●●

●●●●●●●●

●●●●

●●●●

●●●●●

●●●●

●● ●●

●●●●●●●

●●●●●●●

●●● ●●●●●

●●●●

●●

●●●●●

●●●●●●●

●●●

●●●●●●

●●●●●●●●

●●●●●●●●●●● ●●

●●●●●●●

● ●●●●

●●●●●●●●●

●●●●●●●

●●●●

●●●

●●●●●●●

●●●●●●●●●●●●●●

●●●●●

●●●●●●● ●●

●●●●

●●●●●●●●●●●●

●●●●●●●●●●●

● ●●●●●●●

●●

●●●●

●●●●●●● ●●●

●●●●●

●●●●●●

●●●●●●●

●●●●

●●●●

●●●●●●

●●●●●

●●

●●●●●●

●●●●●● ●

●●●● ●

●●

●●●

●●●●● ●

●●●●●

●●●●●

●●●

●●●●●●

●●●●●●

●●●●●●● ●●

●●●●●●●●

● ●

●●●●●●●● ●●●

●●●●●●●

●●● ●●

●●●●●●

●●●●●●

●●●●●●● ●●

●●●●●●●●●● ●●●

●●●●●●●●

●●●●●●● ●●●●●●●

●●● ●●

●●●

●●●●

●●●●

●●●●●●● ●

●●● ●

●●●

●●●●●

●●●●●●●

●●●●●●●●●●●

●●●●●●●●●

●●●●●●●

●●●

●●●●

●●●● ●

●●●●●● ● ●●●●●

●● ●●●●●

●●●

●●●●●●●●●●●●● ●●●●●●

●●● ●●

●●●

●●●●●●●●●●●●●●●●●●●●●●

●●●● ●●●●●●

●●●●

●●●●

●●●●●●●

●●●●

●●

●●●●●●●●●●●●●

●●●●

●●●

●●

●● ●

●●

●●●●

●●●●●●●

●●

● ●●●

●●●

●●●

●●

●●●●●●●●●●●

●●●●●●●●

●●

●●●●●

●●●●●

●●●●●●● ●

●●●

● ●●●●

●●●●

●●●●●●●●

●●●●●●●● ●

●●●●●

●●●●●●●●●●

●●●●●● ●●●

●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●● ●●●●●●●●●●●●●●

●●

●●●●

●●●

●●●●

●●●●

●●●●●●●●●●

●●●●●●

●●●●●●●●

●●●●●●●

●●●●●●●

●●●●●●●●●●

●●●●●●

●●●●

●●

●●●

●●●●●●●●● ●●

●●●●●●●●●

●●●●●

●●●●●●●

●●●●●●●●

●●●●●●●●●●●

●●●

●●●●●●●●●

●●●●

●●●●●●●●

●●

●●●●●●

●●●●●●●●

●●●●

●●●●● ● ●

●●●●●

●●●●●●●●

●●●●●

●●●

●●●●

●●●●

●●●

●●

●●●●

●●●●●●

●●●

●●●

●●●● ●

●●●●

●●

● ●●●

●●●●

●●

●● ●

●●●●●●

●●●●●

●●●

●●

●●

●●●●●●●

●●●●●●

●●●●●

●●●●

●● ●

●●●●●●●●

●●●●●

●●●

●●●●●●●

●●●●●

●●

●●●

●●●

●●●●●●●●

●●●● ●●

●●●●●●● ●●

●●●●●●●●●

●●● ●

●●●●

●●●●●●

●●●●

●●●●●●

●●●● ●

● ●

●●●

●●●●●

●●●

●●●●●●●●● ●●●●

●●●●

●●●●

●●●●●●●●

●●

●●●●●●●

●●●●● ●●

●●

●●●●●●●●●●●●●●●●

●●●●●●

●●●●●●

●●●●●

●●●●

●●●

●●●●●●●●

●●

●●●

● ●●●●●

●●●●

●●●● ●●●●●●●●

●●●●●●●

●●● ●

●●●●●

●●●

●●●● ●

●●●

●●●●●

●●●

●●

●●●●●●●●●

●●●●●

●●●●

●●

●●● ●

●●● ●

●●

●●●●●●●●●●●●●●

●●●●●●●●●●●●●●

●●● ●●● ●●●●

●●●●●●●●●

●●●

●●●●●●●●●●●●●●

● ●●●●●● ●●●●●●

●●●●●●

●●●●●●●●●●●●

●●●●●● ●

●●●●

●●

●●●●●●●●●

●●●● ●●●●●●●●●●●● ●●●

●●●●

●●● ●

●●●●●●

●●●●●●●●

●●●●●●●●

●●●

●●

●●●

●●●●●●●●

●●●●●●

●● ●

●●●●●

●●●

●●

●●●●●●●●

●●●●●

●●●●●●

●●

●●●

●●●●●●●●

●●●●●●●●●●●●●

●●●●●

●●●●●●●●●●●●●●●

●●●

●●●●●●●●●●●

●●●●●●

●●●●●●●●●●●

●●●●●●●●

●●●●●

●●●

● ●●

● ●

●●●●●

●●●●●●

●●● ●

●●● ●

●● ●

● ●●

●●

●●●●●●●●●●●●

●●●●●●●●●

●●●●●●● ●●

●●●●●●

●●

●●●●●●●

●●●●●●●●

●●● ●

●●

●●●●●

●●●●●●●●●● ●

●●●●●●

●●●●●●

●●●

●●● ●

●●●●

●●●●

●●●●●●

●●●●●

●●●

●●●

●●● ●

●●●●●●●

●●●

●●●●●●

●●●●●●

●●●

●●● ●

●●●●

●●●●●●●●●●●

●●●●●●●

●●●●

●●●

●●●

●●●● ●

●●●

●●●●

●●●

●●●●●●●

●●

●●

●●●●●●●●●● ●●●●●●●●●●●●● ●●●

●●●●

●●●●

●●●

●●●●●●●

●●

●●●●●●●

●●

●●●●

●●

●●●●●

●●●

●●●

●●

●●●●

●●

●●●●

●●

●●●●●

●●●

● ●●●

●●●●●●●●

●●●●

●●

●●●

●●

●●●●●

●●●●

●●●

●●

●●

●●

●●●●●●

●●●●

●●●

3320

3340

3360

3380

4000 4050 4100Easting (km)

Nor

thin

g (

km)

4100 samples

500 m grid + 10% random points

Lyashevska et al, 2014 [email protected] July, 2 2014 7 / 16

Page 18: ISEC 2014 (International Statistical Ecology Conference)

Modelling of the spatial distribution

1. Calibrate zero-inflated Poisson mixture model (assuming independentdata);

2. Use fitted model to classify each zero either as a Bernoulli or aPoisson zero;

3. Model the Bernoulli and Poisson variables separately (accounting forspatial dependence).

Lyashevska et al, 2014 [email protected] July, 2 2014 8 / 16

Page 19: ISEC 2014 (International Statistical Ecology Conference)

Modelling of the spatial distribution

1. Zero inflated Poisson mixture model (Lambert, 1992);

P(y |x) =exp(−µ)µy

y !(1)

logit(ψ) = log(ψ

1− ψ) = xTβ (2)

P(Y = y)

{ψ + (1− ψ)exp(−µ) y=0

(1− ψ) exp(−µ)µy

y ! for y = 1, 2, 3, . . .(3)

Lyashevska et al, 2014 [email protected] July, 2 2014 9 / 16

Page 20: ISEC 2014 (International Statistical Ecology Conference)

Modelling of the spatial distribution

2. Bernoulli/Poisson zeros;

Compute the ratio of the probability of a Bernoulli zero to the totalprobability of a zero;

ψ

ψ + (1− ψ)exp(−µ)(1)

Randomly allocate each zero to a Bernoulli zero or a Poisson zero.

Lyashevska et al, 2014 [email protected] July, 2 2014 9 / 16

Page 21: ISEC 2014 (International Statistical Ecology Conference)

Modelling of the spatial distribution

3. Bernoulli and Poisson variables are modelled separately by GLGM(Diggle et al., 1998; Christensen, 2004)

GLGM is GLM for dependent data (spatial random effect);Transformed model parameters, logit(ψ) and log(µ) are modelled withGaussian Random Field.

S1 = logit(ψ) = x1β1 + ε1 (1)

S2 = log(µ) = x2β2 + ε2 (2)

The model parameters are obtained through Marcov Chain MonteCarlo (MCML);MCML is computationally prohibitive for large data sets.

Lyashevska et al, 2014 [email protected] July, 2 2014 9 / 16

Page 22: ISEC 2014 (International Statistical Ecology Conference)

Simulation of the pseudo-realities

Simulate signals S (linear combination of covariates andGaussian noise) with GLGM models for Bernoulli and Poissonvariables at sampling locations (original grid);

Use sequential Gaussian simulation to simulate signals at very finegrid (100 m x 100 m) supplemented with validation points;

Combine pairwise the simulated fields of Bernoulli indicators andPoisson counts to pseudo-realities of zero-inflated Poisson counts;

Lyashevska et al, 2014 [email protected] July, 2 2014 10 / 16

Page 23: ISEC 2014 (International Statistical Ecology Conference)

Simulation of the pseudo-realities

Simulate signals S (linear combination of covariates and Gaussiannoise) with GLGM models for Bernoulli and Poisson variables atsampling locations (original grid);

Use sequential Gaussian simulation to simulate signals at veryfine grid (100 m x 100 m) supplemented with validation points;

Combine pairwise the simulated fields of Bernoulli indicators andPoisson counts to pseudo-realities of zero-inflated Poisson counts;

Lyashevska et al, 2014 [email protected] July, 2 2014 10 / 16

Page 24: ISEC 2014 (International Statistical Ecology Conference)

Simulation of the pseudo-realities

Simulate signals S (linear combination of covariates and Gaussiannoise) with GLGM models for Bernoulli and Poisson variables atsampling locations (original grid);

Use sequential Gaussian simulation to simulate signals at very finegrid (100 m x 100 m) supplemented with validation points;

Combine pairwise the simulated fields of Bernoulli indicatorsand Poisson counts to pseudo-realities of zero-inflated Poissoncounts;

Lyashevska et al, 2014 [email protected] July, 2 2014 10 / 16

Page 25: ISEC 2014 (International Statistical Ecology Conference)

Simulated data vs Original

Figure : Simulated data, species occurrence

Lyashevska et al, 2014 [email protected] July, 2 2014 11 / 16

Page 26: ISEC 2014 (International Statistical Ecology Conference)

Simulated data vs Original

●●●●●●

●●●

●●●●●●●●

●●●●●●●●●●●

●●

● ●●●●●●●●●

●●●●●●●●●

●●●●●●

●●●

●●●●●

●●●●●●●●●●●●

●●●●●●● ●●●●

●●●●●●●●

●●●

●●●

●●●●●●

●●●●●●●

●●●●●●●●●

●●●●

●●●●●●●●●●

●●●●

●●●● ●●●●●●●●

●●●●●●●●●

●●

●●●●

●●●●●●●●●●● ●

●●●

●●●

●●●●

●●●●●

●●

●●

●●●●●●●●●●● ●

●●●●●●●●●

●●●

●●●●●●

●● ●

●●●●●●●●

●●●●●●●

●●●●●●●●●●●

●●●●

●●●

●●●●●

●●●●

●●●

●●●●●●●●●●●●●●●

●●●●

●●●●●●

●●●●●●●●●●

●●●●●

●●●

●●●●●●●●

●● ●●●●●

●●●●

●●●●●●● ●●●●●●●●●●

●●●●●●● ●●●●●

●●

●●●●●●

●●●●

●●●●●●●

●●●● ●●●●

●●●●●

●●●●●●●

●●●●●●●

●●●●

●●●●

●●●●●

●●●●●●●

●●●

●●●●●

●●●●●●●●●●

●●

●●●●●●●●

●●

●●●

●●●●●●●●●

●●●●●●

●●●● ●

●●

●●●●●●●●●● ●●

●●●

●●●

●●●

●●●●●●●

●●●●●●●●

●●●●●

●●●●●●●●

●●●●●●●●

●●●●●●●●

●●●

●●●●●

●●●●

●●●

●●●●●●●●●●

●●●●●●●●●●

●●●

●●

●●

●●●●●●●● ●

●●●●●●●

●●●

● ●●●●●●

●●●●●●●

●●●●

●●●●●●

●●●●

●●●

●●●●

●●●●

●●●●●● ●

●●●●●●

●●

●●●●●●●

●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●●●

●●●●●

●●●●●

●●●●●●●●●

●●●●

●●●●●

●●●●●●●●●

●●●●

●●●●●

●●

● ●●●●●

●●

● ●●●●●●●●●●●●

●●●●●●●●●

●●●●●●

● ●●●●●●●●●●●

●●●●●●●●●●●

●●●●●●●

●●●●●●●●●

●●●●●

●●●●●●●●●●●

●●●●●

●●●●●●●●

●●●

●●●

●●●

●●●●●● ●●●●●●

●●

●●

●●●●●

●●●●●●

●●●●●

●●●●●● ●●

●●●●●

●●●●●

●●●

●●

●●

●●●●

●●●●●●

●●●

●●

●●●

●●●

●●●●●●●

●●

●●●

●●●●

●●●●

●●●

●●●●●

●●●●●●

●●●●

●●●●●●● ●●●● ●

●●●●

●●●●●●●●●●●

●●●●

●●●●

●●●● ●●

●●●●

●●●

●●●●●●●●●●●

●●●●

●●● ●●●●● ●●

●●●●

●●●●●●●●●

●●●●●●

●●●●●●

●●●●●●●●●●●●●

●●

●●●●●●●●●●●●●

●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●

●●●●●

●●●●

●●●●●●●●●

● ●● ●●●

●●●●●●●

●●●●●●

●●●●

●●●●

●●●●●●●●

●●●

●●●●●●●●●●●●

●●

●●●●●●●●●●

●●●●

●●●●

●●●

●●●●

●●

●●●●●●

●●●●●●

●●●●●

●●●●●●●●●

●●●●● ●

●●●●●

●● ●●●

●●●●●●●

●●

●●●●

●●●●●●●●●

●●●●

●●●●● ●●●

●●●●●

●●●●●●●●●● ●●●●●●●●●●

●●●●●●●●●●●

●●● ●

●●

●●●●●●●●●●●●●●●

●●●● ●

●●

●●●●●●●●●●●

●●

●●●●●

●●●

●●●

●●●●●●●●●

●●● ●●●●●●●

●●●

●●●●●●●●●

●●●●●

●●●●●●●●●●●●● ●

●●●● ●

●●● ●

●●●

●● ●

● ●●

●●●●●

●●●

●●

●●●

●● ●●●●●

●●●

●●●

●●●

●●●●

●●●

●●

●●●●●●

●●●●

●●●●

●● ●

●●●●●

●●●

●●●●

●●●

● ●● ●

●●●

●●●●●●●

●●●●

●●●

●●●●●

● ●●●●

●● ●●●●●

● ●●●●

●●

● ●●●●●●●●

●●●●●●●● ●

●●●●

●●●●●●●

●●●●●

●●●●

●●●●●

●●●

●●

●●●●● ●

●●●●●●●

●●●

●●●●●●●

●●●●●

●●

●● ●

●●

●●●

●●●

●●●

●●●●

●●●●●●●●●●●

● ●●●●●●

●●●

●●●

●●●●●●●●●

●●●●●●●

●●

●●●●●

●●●●

●●● ●●

●●● ●●●●●●●●●

●●●●

●●●●● ●

● ●●●●●

●●●●

●●●

●●●●●●●●●●●

●●●●●

●●●●●●

●●●

●●●●● ●●●●

●●●●●

●● ●●

●●● ●

●●●●●●●●●●●●●●●

●●

●●

●●

●●●●●●●●

●●●●

●●●●

●●●●●

●●●●

●● ●●

●●●●●●●

●●●●●●●

●●● ●●●●●

●●●●

●●

●●●●●

●●●●●●●

●●●

●●●●●●

●●●●●●●●

●●●●●●●●●●● ●●

●●●●●●●

● ●●●●

●●●●●●●●●

●●●●●●●

●●●●

●●●

●●●●●●●

●●●●●●●●●●●●●●

●●●●●

●●●●●●● ●●

●●●●

●●●●●●●●●●●●

●●●●●●●●●●●

● ●●●●●●●

●●

●●●●

●●●●●●● ●●●

●●●●●

●●●●●●

●●●●●●●

●●●●

●●●●

●●●●●●

●●●●●

●●

●●●●●●

●●●●●● ●

●●●● ●

●●

●●●

●●●●● ●

●●●●●

●●●●●

●●●

●●●●●●

●●●●●●

●●●●●●● ●●

●●●●●●●●

● ●

●●●●●●●● ●●●

●●●●●●●

●●● ●●

●●●●●●

●●●●●●

●●●●●●● ●●

●●●●●●●●●● ●●●

●●●●●●●●

●●●●●●● ●●●●●●●

●●● ●●

●●●

●●●●

●●●●

●●●●●●● ●

●●● ●

●●●

●●●●●

●●●●●●●

●●●●●●●●●●●

●●●●●●●●●

●●●●●●●

●●●

●●●●

●●●● ●

●●●●●● ● ●●●●●

●● ●●●●●

●●●

●●●●●●●●●●●●● ●●●●●●

●●● ●●

●●●

●●●●●●●●●●●●●●●●●●●●●●

●●●● ●●●●●●

●●●●

●●●●

●●●●●●●

●●●●

●●

●●●●●●●●●●●●●

●●●●

●●●

●●

●● ●

●●

●●●●

●●●●●●●

●●

● ●●●

●●●

●●●

●●

●●●●●●●●●●●

●●●●●●●●

●●

●●●●●

●●●●●

●●●●●●● ●

●●●

● ●●●●

●●●●

●●●●●●●●

●●●●●●●● ●

●●●●●

●●●●●●●●●●

●●●●●● ●●●

●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●● ●●●●●●●●●●●●●●

●●

●●●●

●●●

●●●●

●●●●

●●●●●●●●●●

●●●●●●

●●●●●●●●

●●●●●●●

●●●●●●●

●●●●●●●●●●

●●●●●●

●●●●

●●

●●●

●●●●●●●●● ●●

●●●●●●●●●

●●●●●

●●●●●●●

●●●●●●●●

●●●●●●●●●●●

●●●

●●●●●●●●●

●●●●

●●●●●●●●

●●

●●●●●●

●●●●●●●●

●●●●

●●●●● ● ●

●●●●●

●●●●●●●●

●●●●●

●●●

●●●●

●●●●

●●●

●●

●●●●

●●●●●●

●●●

●●●

●●●● ●

●●●●

●●

● ●●●

●●●●

●●

●● ●

●●●●●●

●●●●●

●●●

●●

●●

●●●●●●●

●●●●●●

●●●●●

●●●●

●● ●

●●●●●●●●

●●●●●

●●●

●●●●●●●

●●●●●

●●

●●●

●●●

●●●●●●●●

●●●● ●●

●●●●●●● ●●

●●●●●●●●●

●●● ●

●●●●

●●●●●●

●●●●

●●●●●●

●●●● ●

● ●

●●●

●●●●●

●●●

●●●●●●●●● ●●●●

●●●●

●●●●

●●●●●●●●

●●

●●●●●●●

●●●●● ●●

●●

●●●●●●●●●●●●●●●●

●●●●●●

●●●●●●

●●●●●

●●●●

●●●

●●●●●●●●

●●

●●●

● ●●●●●

●●●●

●●●● ●●●●●●●●

●●●●●●●

●●● ●

●●●●●

●●●

●●●● ●

●●●

●●●●●

●●●

●●

●●●●●●●●●

●●●●●

●●●●

●●

●●● ●

●●● ●

●●

●●●●●●●●●●●●●●

●●●●●●●●●●●●●●

●●● ●●● ●●●●

●●●●●●●●●

●●●

●●●●●●●●●●●●●●

● ●●●●●● ●●●●●●

●●●●●●

●●●●●●●●●●●●

●●●●●● ●

●●●●

●●

●●●●●●●●●

●●●● ●●●●●●●●●●●● ●●●

●●●●

●●● ●

●●●●●●

●●●●●●●●

●●●●●●●●

●●●

●●

●●●

●●●●●●●●

●●●●●●

●● ●

●●●●●

●●●

●●

●●●●●●●●

●●●●●

●●●●●●

●●

●●●

●●●●●●●●

●●●●●●●●●●●●●

●●●●●

●●●●●●●●●●●●●●●

●●●

●●●●●●●●●●●

●●●●●●

●●●●●●●●●●●

●●●●●●●●

●●●●●

●●●

● ●●

● ●

●●●●●

●●●●●●

●●● ●

●●● ●

●● ●

● ●●

●●

●●●●●●●●●●●●

●●●●●●●●●

●●●●●●● ●●

●●●●●●

●●

●●●●●●●

●●●●●●●●

●●● ●

●●

●●●●●

●●●●●●●●●● ●

●●●●●●

●●●●●●

●●●

●●● ●

●●●●

●●●●

●●●●●●

●●●●●

●●●

●●●

●●● ●

●●●●●●●

●●●

●●●●●●

●●●●●●

●●●

●●● ●

●●●●

●●●●●●●●●●●

●●●●●●●

●●●●

●●●

●●●

●●●● ●

●●●

●●●●

●●●

●●●●●●●

●●

●●

●●●●●●●●●● ●●●●●●●●●●●●● ●●●

●●●●

●●●●

●●●

●●●●●●●

●●

●●●●●●●

●●

●●●●

●●

●●●●●

●●●

●●●

●●

●●●●

●●

●●●●

●●

●●●●●

●●●

● ●●●

●●●●●●●●

●●●●

●●

●●●

●●

●●●●●

●●●●

●●●

●●

●●

●●

●●●●●●

●●●●

●●●

3320

3340

3360

3380

4000 4050 4100Easting (km)

Nor

thin

g (

km)

Figure : Original data, species occurrence

Lyashevska et al, 2014 [email protected] July, 2 2014 11 / 16

Page 27: ISEC 2014 (International Statistical Ecology Conference)

Grid spacing and Performance

Sample each pseudo-reality of zero-inflated Poisson datarepeatedly by grid-sampling with a given spacing;

Repeat it for all considered grid-spacings;

Predict values with IDW interpolation at validation points;

Calculate the performance statistics: the Mean Squared Error

MSE =1

N

N∑i=1

{Y (a0)− Y (a0)

}2(3)

MMSE =1

(R ∗ S)

R∑i=1

S∑j=1

MSEji (4)

N is a number of validation points, R - simulations andS - samples.

Lyashevska et al, 2014 [email protected] July, 2 2014 12 / 16

Page 28: ISEC 2014 (International Statistical Ecology Conference)

Grid spacing and Performance

Sample each pseudo-reality of zero-inflated Poisson data repeatedlyby grid-sampling with a given spacing;

Repeat it for all considered grid-spacings;

Predict values with IDW interpolation at validation points;

Calculate the performance statistics: the Mean Squared Error

MSE =1

N

N∑i=1

{Y (a0)− Y (a0)

}2(3)

MMSE =1

(R ∗ S)

R∑i=1

S∑j=1

MSEji (4)

N is a number of validation points, R - simulations andS - samples.

Lyashevska et al, 2014 [email protected] July, 2 2014 12 / 16

Page 29: ISEC 2014 (International Statistical Ecology Conference)

Grid spacing and Performance

Sample each pseudo-reality of zero-inflated Poisson data repeatedlyby grid-sampling with a given spacing;

Repeat it for all considered grid-spacings;

Predict values with IDW interpolation at validation points;

Calculate the performance statistics: the Mean Squared Error

MSE =1

N

N∑i=1

{Y (a0)− Y (a0)

}2(3)

MMSE =1

(R ∗ S)

R∑i=1

S∑j=1

MSEji (4)

N is a number of validation points, R - simulations andS - samples.

Lyashevska et al, 2014 [email protected] July, 2 2014 12 / 16

Page 30: ISEC 2014 (International Statistical Ecology Conference)

Grid spacing and Performance

Sample each pseudo-reality of zero-inflated Poisson data repeatedlyby grid-sampling with a given spacing;

Repeat it for all considered grid-spacings;

Predict values with IDW interpolation at validation points;

Calculate the performance statistics: the Mean Squared Error

MSE =1

N

N∑i=1

{Y (a0)− Y (a0)

}2(3)

MMSE =1

(R ∗ S)

R∑i=1

S∑j=1

MSEji (4)

N is a number of validation points, R - simulations andS - samples.

Lyashevska et al, 2014 [email protected] July, 2 2014 12 / 16

Page 31: ISEC 2014 (International Statistical Ecology Conference)

MMSE and Variance of MMSE

68

72

76

80

1000 2000 3000Spacing (m)

MM

SE

●●

0

2000

4000

6000

1000 2000 3000Spacing (m)

varia

nce

MM

SE

Lyashevska et al, 2014 [email protected] July, 2 2014 13 / 16

Page 32: ISEC 2014 (International Statistical Ecology Conference)

Conclusions

Sampling design for zero-inflated spatial count data isevaluated;

A strong monotonous increase of the MMSE is observed;

MSEji varies strongly between simulations and samples, especially forlarge grid spacings;

So numerous simulations and samples are needed for estimatingMMSE;

Spatial modelling of zero-inflated spatial data is laborious andcomputer-intensive.Is there an easier way: INLA?

Lyashevska et al, 2014 [email protected] July, 2 2014 14 / 16

Page 33: ISEC 2014 (International Statistical Ecology Conference)

Conclusions

Sampling design for zero-inflated spatial count data is evaluated;

A strong monotonous increase of the MMSE is observed;

MSEji varies strongly between simulations and samples, especially forlarge grid spacings;

So numerous simulations and samples are needed for estimatingMMSE;

Spatial modelling of zero-inflated spatial data is laborious andcomputer-intensive.Is there an easier way: INLA?

Lyashevska et al, 2014 [email protected] July, 2 2014 14 / 16

Page 34: ISEC 2014 (International Statistical Ecology Conference)

Conclusions

Sampling design for zero-inflated spatial count data is evaluated;

A strong monotonous increase of the MMSE is observed;

MSEji varies strongly between simulations and samples,especially for large grid spacings;

So numerous simulations and samples are needed for estimatingMMSE;

Spatial modelling of zero-inflated spatial data is laborious andcomputer-intensive.Is there an easier way: INLA?

Lyashevska et al, 2014 [email protected] July, 2 2014 14 / 16

Page 35: ISEC 2014 (International Statistical Ecology Conference)

Conclusions

Sampling design for zero-inflated spatial count data is evaluated;

A strong monotonous increase of the MMSE is observed;

MSEji varies strongly between simulations and samples, especially forlarge grid spacings;

So numerous simulations and samples are needed for estimatingMMSE;

Spatial modelling of zero-inflated spatial data is laborious andcomputer-intensive.Is there an easier way: INLA?

Lyashevska et al, 2014 [email protected] July, 2 2014 14 / 16

Page 36: ISEC 2014 (International Statistical Ecology Conference)

Conclusions

Sampling design for zero-inflated spatial count data is evaluated;

A strong monotonous increase of the MMSE is observed;

MSEji varies strongly between simulations and samples, especially forlarge grid spacings;

So numerous simulations and samples are needed for estimatingMMSE;

Spatial modelling of zero-inflated spatial data is laborious andcomputer-intensive.Is there an easier way: INLA?

Lyashevska et al, 2014 [email protected] July, 2 2014 14 / 16

Page 37: ISEC 2014 (International Statistical Ecology Conference)

Thanks!

Acknowledgements:This work was done in the framework of the WaLTER (Wadden Sea Long-TermEcosystem Research) project (WP5)

www.walterproject.nl

Lyashevska et al, 2014 [email protected] July, 2 2014 15 / 16

Page 38: ISEC 2014 (International Statistical Ecology Conference)

References I

Bijleveld, A. I., van Gils, J. A., van der Meer, J., Dekinga, A., Kraan, C., van derVeer, H. W., and Piersma, T. (2012). Designing a benthic monitoringprogramme with multiple conflicting objectives. Methods in Ecology andEvolution, 3(3):526–536.

Brus, D. and de Gruijter, J. (2013). Effects of spatial pattern persistence on theperformance of sampling designs for regional trend monitoring analyzed bysimulation of spacetime fields. Computers & Geosciences, 61(0):175 – 183.

Christensen, O. F. (2004). Monte carlo maximum likelihood in model-basedgeostatistics. Journal of Computational and Graphical Statistics, 13(3):pp.702–718.

Diggle, P. J., Tawn, J. A., and Moyeed, R. A. (1998). Model-based geostatistics.Journal of the Royal Statistical Society. Series C (Applied Statistics), 47(3):pp.299–350.

Lambert, D. (1992). Zero-inflated poisson regression, with an application todefects in manufacturing. Technometrics, 34(1):pp. 1–14.

Lyashevska et al, 2014 [email protected] July, 2 2014 16 / 16