32
A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management Charis Burridge Cherry Bud Workshop, Mar ’08, Keio University, Japan

A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

A new project to foster Japanese-Australian

discovery through Data Science for

environmental and ecological management

Charis Burridge

Cherry Bud Workshop, Mar ’08, Keio University, Japan

Page 2: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Outline of talk

Two parts:

Australian-Japan Foundation project

(AJF): Keio-CSIRO collaboration

including intros to Keio and CSIRO

Recent reef fish monitoring project:

involving CMIS statisticians and CMAR

marine ecologists

Page 3: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Keio-CSIRO partnership: to now

• CSIRO staff at previous Cherry Bud workshops:

Richard Jarrett, Harri Kiiveri, John Donnelly

• 2 PhD students (Hideyasu Shimadzu & Yuki Sugaya)

recently undertook 6-week internships with CMIS:

Hideyasu worked with Matt Browne, Yuki with Ian

Saunders

• Hideyasu & Matt developed an AJF project proposal

• Hideyasu recruited by GeoScience Australia for

collaborative research with CSIRO on national marine

biodiversity prediction

• CSIRO staff at current Cherry Bud workshop:

Ian Saunders, Richard Jarrett and myself

Page 4: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Keio-CSIRO partnership: from here

• Apr 2008-Mar 2009: AJF funds 4-week visits by 3 people

to/from Australia to promote joint research in marine

biodiversity and other applications of joint interest to

CSIRO and Keio

• One-year program will include mini-workshops in

Australia and final workshop in Keio next March

• With a combination of CSIRO internships and

(hopefully) further AJF travel grants, we plan to develop

a longer-term program to engage both current and

future Keio Uni postgraduate students in applying

statistical methods to important, real-world problems

Page 5: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Keio University: a snapshot

• Oldest university in Japan (150 years)

• Motto: ‘the pen is mightier than the sword’

• Human resource data

~2,600 staff, ~32,000 students (~3,700 doctoral)

• Graduate School of Science & Technology is home to

mathematicians and statisticians

• In 2003 Prof Ritei Shibata and others were awarded a 5-

year grant as a designated 21st Century Centre of

Excellence

• The Data Science component has been led by Ritei, part

of this being the Data and Description environment he

will describe later

Page 6: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Introducing CSIRO

• Commonwealth Scientific and Industrial Research Organisation

• Formed in 1926 (some longevity, but younger than Keio!)

• Australia's largest single employer of scientists (>6500; diverse)

• Operating from 57 sites in Australia and overseas

• A co-founding member of the Global Research Alliance

Page 7: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Research facilities and locations

• CSIRO manages three National Research Facilities:

• Australian Animal Health Laboratory at Geelong

• Australia Telescope at Parkes, Coonabarabran and Narrabri

• the oceanographic research vessel Southern Surveyor.

• Also have >30 other research facilities such as:

• Riverside Life Sciences Centre at Brisbane

• CSIRO Discovery Centre at Canberra

• Australian Resources Research Centre at Perth

• Corporate headquarters in the Australian Capital Territory (Canberra).

• CSIRO has a laboratory in France and staff located in:

• Ireland

• The Netherlands

• United States of America.

• .

Page 8: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Structure of CSIRO: Divisions & Flagships

Page 9: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

CMIS Division: ~120 staff (many statisticians)

[Input Programs - R; Output Themes - L]

Page 10: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

CMIS Environmental Informatics Program

delivers into multiple Outcome Domains

Advanced

Materials

Energy

Food Production

and Supply

Oceans

Understanding

the Universe

Biosecurity

Water

Preventative

Health

Security

Sustainable

Communities

Biodiversity

Agricultural

Sustainability

Climate

ManufacturingICT

Mineral

Resources

Broad applicability

Page 11: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

CMIS Environmental Informatics Program:

Core Capabilities

• Scaleable technologies with development through multiple

investments

i. mathematical frameworks for highly structured stochastic

systems

ii. space-time models for environmental monitoring

iii. sample and survey design

iv. eco-scape risk assessment frameworks

v. mathematics for multi-scale space-time data integration

vi. high dimensional data and statistical pattern recognition

(discrimination, classification, un-mixing)

Page 12: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

What are we aiming to achieve?

Through harnessing appropriate capabilities we deploy mathematical and statistical

technologies that:

1. increase value of remote sensing information via quantified parameter accuracy

& variability estimates

2. incorporate the data streams provided by emerging monitoring and assessment

technologies to improve the prediction and quantification of uncertainty at

different spatial and temporal scales

3. integrate different models & data sources and reliably represent uncertainty in

system wide predictions

4. enable the reliable estimation of condition, trends and extremes in both space

and time

5. optimally allocate scarce monitoring resources across the environment to satisfy

multiple objectives

6. integrate empirical, process based and expert based information into coherent

eco-scape risk assessment frameworks

Page 13: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Science Plan for Environmental Informatics in CMIS

Applicatio

n

areas

Capabilities

Using the mathematical

sciences to understand

and predict processes

dominated by scale

and complexity Research

drivers

• National Research

priorities

• International

Research priorities

• Identified needs of

collaborators/clients

• CMIS Purpose

Eco-scapes

• Continental Scale Vegetation

• Marine Systems

• Catchment to Coastal Zone

• Urban

• space-time statistical modelling

• spatio-temporal sample and survey design

• Hierarchical Structured Stochastic Systems

• mathematics for multi-scale space-time data integration

• statistical pattern recognition

• eco-scape risk assessment+

• Domain knowledge

• Relationships

People

& Processes

Page 14: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Recent successes of Aquatics Stream

1. Estimating abundance and distribution of minke

whales for the International Whaling Commission and

Australian Antarctic Division

2. Research to support science-based management of

the Northern Prawn Fishery

3. Novel methods for mapping and predicting seabed

biodiversity in GBR and SE Australia

4. Effective reporting on environmental impact study for

Lihir Gold Mine ($$$$$ to PNG but extensive potential

ecological footprint)

Page 15: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Background to Lihir environmental study

• Study area

• Lihir Island group, about 900 km NE of Port Moresby, capital of Papua

New Guinea – now mainly monitoring Niolam (main island) and Mali.

Page 16: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Survey sites for inshore fish monitoring

• Phase I (1999-2002)

• Three surveys at >100 sites

on 4 islands, grouped into 17

‘Locations’

• Phase II (2005-2007)• Three surveys at subset of

Phase I sites, visited on each

survey – longitudinal design to

optimise ability to detect change

• Locations 2-5, 16 and 17 no

longer surveyed

• Approx 75 sites per visit, and 52

sites have been sampled in all

six surveys

Page 17: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Underwater Visual Census (UVC) sampling

• Method

• Diver swims 100m transect, identifying, assessing size and counting

conspicuous fish within a 10m belt.

• Starts at 15m depth and swims obliquely until reaching reef crest.

• Also records substrate & habitat information, eg state of coral.

Page 18: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Spatial distribution of two species (raw counts)

Page 19: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Species vary greatly in presence & abundance

0 50 100 150 200 250 300

05

00

10

00

15

00

20

00

Frequency of sighting

To

tal co

un

t

Page 20: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Data selection before model-fitting

• Selected transects visited at least 4 times

• 71, 56, 55, 72, 71 and 71 sites per survey

• Trading off ability to model longitudinal trends at transect level

with enough sites to model spatial pattern in density

• Selected species with at least 220 specimens seen, in at

least 55 transects and in every survey

• 24 species for individual modelling (out of 117)

• Future monitoring likely to focus on small number of commonly-

seen species

• Definition of ‘mine’ location (Luise harbour) expanded

from 3 sites to 6

• to improve confidence interval for mean density

• without introducing too much bias in the (mostly) very low mean

density in Luise Harbour

Page 21: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Characteristics modelled

• Abundance of individual fish species

• Total abundance of larger groups:

all species; top 24 species; remaining species;

6 feeding/trophic guilds:

• benthic invertebrate feeders (8 spp.); herbivorous croppers (5

spp.); herbivorous scrapers (7 spp.); omnivores (2 spp.);

Planktivorous feeders (Naso brevirostris);

Pelagic piscivores (Aphareus furca)

• Also, percent live coral cover

Page 22: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Basic components of model

• Statistical blocking factors

• 6 different divers, 2-3 per survey

• Visibility (m) categorised into low (4-8 m) & high (> 8m)

• Spatial terms

• linear/quadratic coastline covariate (‘Dist’ & ‘Dist_sq’)

• Different intercept in Luise Harbour (‘Mine’)

• Temporal term

• linear time covariate (‘Time’)

• Spatio-temporal terms

• ‘Time x Dist’ and ‘Time x Dist_sq’

• Quasi-Poisson for count (per transect), with log-link

• Random effect(s) for transect (up to 3 parameters:

intercept variance, time-slope variance and correlation)

Page 23: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Count data (adjusted for diver & visibility) compared

with transect-level/smoothed distance/time model

Top 24

Distance along coast (km)

Lo

g (

ad

juste

d r

aw

co

un

t +

1)

0

1

2

3

4

1999

-20 -10 0 10 20

2000

2002

0

1

2

3

4

2005

0

1

2

3

4

-20 -10 0 10 20

2006 2007

DiverDBDD

MHRP

SFTS

Others

Distance along coast (km)

Lo

g (

ad

juste

d r

aw

co

un

t +

1)

0

1

2

3

4

5

1999

-20 -10 0 10 20

2000

2002

0

1

2

3

4

5

2005

0

1

2

3

4

5

-20 -10 0 10 20

2006 2007

DiverDBDD

MHRP

SFTS

Page 24: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Smoothed space-time model for total of top 24

species and total of other 93 species

Top 24

Predicted Count per ha

Coast Distance(km)

Su

rve

y D

ate

19/11/99

1/12/00

7/2/02

18/2/05

17/2/06

9/3/07

-10 0 10 20

200

250

300

350

400

450

500

Predicted log(count / ha)

Coast Distance(km)

Su

rve

y D

ate

19/11/99

1/12/00

7/2/02

18/2/05

17/2/06

9/3/07

-10 0 10 20

5.4

5.6

5.8

6.0

6.2

Standard Error log(count / ha)

Coast Distance(km)

Su

rve

y D

ate

19/11/99

1/12/00

7/2/02

18/2/05

17/2/06

9/3/07

-10 0 10 20

2.36

2.38

2.40

2.42

2.44

2.46

2.48

Others

Predicted Count per ha

Coast Distance(km)

Su

rve

y D

ate

19/11/99

1/12/00

7/2/02

18/2/05

17/2/06

9/3/07

-10 0 10 20

100

150

200

250

300

Predicted log(count / ha)

Coast Distance(km)

Su

rve

y D

ate

19/11/99

1/12/00

7/2/02

18/2/05

17/2/06

9/3/07

-10 0 10 20

4.4

4.6

4.8

5.0

5.2

5.4

5.6

5.8

Standard Error log(count / ha)

Coast Distance(km)

Su

rve

y D

ate

19/11/99

1/12/00

7/2/02

18/2/05

17/2/06

9/3/07

-10 0 10 20

2.45

2.50

2.55

2.60

2.65

Page 25: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Smoothed trends for herbivorous scraper

abundance & percent live coral cover

Live coral cover

Predicted Count per ha

Coast Distance(km)

Su

rve

y D

ate

19/11/99

1/12/00

7/2/02

18/2/05

17/2/06

9/3/07

-10 0 10 20

100

150

200

250

300

350

400

Predicted log(count / ha)

Coast Distance(km)

Su

rve

y D

ate

19/11/99

1/12/00

7/2/02

18/2/05

17/2/06

9/3/07

-10 0 10 20

4.6

4.8

5.0

5.2

5.4

5.6

5.8

6.0

Standard Error log(count / ha)

Coast Distance(km)

Su

rve

y D

ate

19/11/99

1/12/00

7/2/02

18/2/05

17/2/06

9/3/07

-10 0 10 20

2.36

2.38

2.40

2.42

2.44

2.46

2.48

2.50

HS

Predicted Count per ha

Coast Distance(km)

Su

rve

y D

ate

19/11/99

1/12/00

7/2/02

18/2/05

17/2/06

9/3/07

-10 0 10 20

20

40

60

80

100

120

140

160

180

Predicted log(count / ha)

Coast Distance(km)

Su

rve

y D

ate

19/11/99

1/12/00

7/2/02

18/2/05

17/2/06

9/3/07

-10 0 10 20

3.0

3.5

4.0

4.5

5.0

Standard Error log(count / ha)

Coast Distance(km)

Su

rve

y D

ate

19/11/99

1/12/00

7/2/02

18/2/05

17/2/06

9/3/07

-10 0 10 20

2.38

2.40

2.42

2.44

2.46

2.48

2.50

2.52

2.54

Page 26: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Spatial assessment of mine impact

• Generally, fish were less abundant in Luise Harbour

than nearby areas, e.g.

• 7 out of 9 fish species with p<0.1 for ‘Mine’ effect – ranging

from 45% less for Naso lituratus to 94% less abundant for

Naso unicornis.

• Every group or guild with p<0.1 for ‘Mine’ effect – ranging from

35% less for All fish, Top 24 and herbivorous croppers to 70%

less for herbivorous scrapers

• Live coral cover was much lower in Luise Harbour

• p < 0.0001 for ‘Mine’ effect – 45% of cover relative to that

observed elsewhere

Page 27: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Abundance in 2007 compared with that in 1999

survey, using log-ratio of smoothed predictions

-20 -10 0 10 20

-0.5

0.0

0.5

Distance along coast (km)

Log(r

atio

of fit

ted d

ensity

in la

st v. fir

st surv

ey)

All fish

-20 -10 0 10 20

-0.4

-0.2

0.0

0.2

0.4

0.6

0.8

1.0

Distance along coast (km)

Log(r

atio

of fit

ted d

ensity

in la

st v. fir

st surv

ey)

Top 24

-20 -10 0 10 20

-1.5

-1.0

-0.5

0.0

0.5

Distance along coast (km)

Log(r

atio

of fit

ted d

ensity

in la

st v. fir

st surv

ey)

Others

-20 -10 0 10 20

-0.5

0.0

0.5

1.0

Distance along coast (km)

Log(r

atio

of fit

ted d

ensity

in la

st v. fir

st surv

ey)

BI

-20 -10 0 10 20

-1.0

-0.5

0.0

0.5

1.0

1.5

Distance along coast (km)

Log(r

atio

of fit

ted d

ensity

in la

st v. fir

st surv

ey)

HC

-20 -10 0 10 20

-0.5

0.0

0.5

1.0

Distance along coast (km)

Log(r

atio

of fit

ted d

ensity

in la

st v. fir

st surv

ey)

HS

-20 -10 0 10 20

-1.0

-0.5

0.0

0.5

1.0

1.5

2.0

Distance along coast (km)

Log(r

atio

of fit

ted d

ensity

in la

st v. fir

st surv

ey)

O

-20 -10 0 10 20

02

46

Distance along coast (km)

Log(r

atio

of fit

ted d

ensity

in la

st v. fir

st surv

ey)

PF

-20 -10 0 10 20

-3-2

-10

Distance along coast (km)

Log(r

atio

of fit

ted d

ensity

in la

st v. fir

st surv

ey)

PP

-20 -10 0 10 20

-1.0

-0.5

0.00.5

Distance along coast (km)

Log(r

atio o

f fitte

d den

sity i

n las

t v. fi

rst su

rvey)

Live coral

Page 28: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Temporal assessment of mine impact

• For many groups and individual species, abundance

increased over time, though few groups had a

statistically significant increase along the entire

coastline – often, a mixture of increase / no change /

decrease in different places.

• Herbivorous scrapers increased almost universally, as

did the Top 24 as a group.

• Two lutjanids (L. gibbus and L. monostigma) showed a

significant decrease along almost all of the coastline.

• Live coral cover decreased markedly along the northern

section of the coast.

Page 29: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Statistical significance of temporal and spatio-temporal effects,

and average change between first and last surveys

Species or group Time Time x

Dist

Time x

Dist_sq

Average

change*

Top 24 fish

species

0.0477 > 0.2 > 0.5 +

Rarer 93 fish

species

> 0.5 > 0.2 > 0.2 (−)

Herbivorous

scrapers (7 spp.)

< 0.0001 > 0.2 > 0.1 +

Lutjanus

gibbus

< 0.0001 > 0.1 > 0.5 −

Lutjanus

monostigma

0.0445 0.0715 > 0.2 −

Live coral cover < 0.0001 0.0013 0.0559 −

* () indicates 95% C.L. for log-ratio overlap 0 for most of coastline.

Page 30: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Minimum detectable change (90% power) between

two surveys

Species or group Decrease Increase

All fish −17% +20%

Top 24 species −18% +21%

Herbivorous scrapers −22% +25%

Benthic invertebrate

feeders

−27% +33%

Omnivores −33% +40%

Individual species −37% to

− 72%

+42% to

+140%

Page 31: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Conclusions

• Although fish community appears impoverished in

Luise Harbour, impact appears not to be encroaching

on other parts of the main island.

• Inter-diver differences are uncomfortably large; so

recommended that future surveys incorporate a

training/calibration day and a number of sites to be

assessed by all divers.

• Grouping species into guilds improves power to detect

change; useful if all species in the group respond

similarly.

Page 32: A new project to foster Japanese-Australian discovery ... · A new project to foster Japanese-Australian discovery through Data Science for environmental and ecological management

Thank you

CMIS/EI

Charis Burridge

Research statistician

Phone: +61 7 3826 7186

Email: [email protected]

Web: www.csiro.au/group

Contact Us

Phone: 1300 363 400 or +61 3 9545 2176

Email: [email protected] Web: www.csiro.au