69
Data-driven Planning for TMDL Effectiveness Monitoring & Statistical Test Selection for Analysis of Monitoring Data Sponsored by EPA Region 10 Presenters: Laura Blake, Corey Godfrey, and Andy Somor The Cadmus Group, Inc. 1

Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

  • Upload
    others

  • View
    6

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Data-driven Planning for TMDL Effectiveness Monitoring & Statistical Test Selection for Analysis of Monitoring Data Sponsored by EPA Region 10

Presenters: Laura Blake, Corey Godfrey, and Andy Somor The Cadmus Group, Inc.

1

Page 2: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Overview of Presentation

Recommended steps for developing a TMDL effectiveness monitoring plan.

Data-driven planning for TMDL effectiveness monitoring.

Data exploration methods for selecting statistical tests for analyzing TMDL effectiveness monitoring data.

Demonstration of Excel-based TMDL effectiveness monitoring planning tool.

2

Page 3: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Data-driven Planning for TMDL Effectiveness Monitoring & Statistical Test Selection for Analysis of Monitoring Data

TMDL EFFECTIVENESS MONITORING PLANNING

3

Page 4: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

“Effectiveness Monitoring”

The process of measuring improvements in the water quality of a water body.

Not to be confused with BMP effectiveness, which measures the success or effectiveness of the BMPs themselves.

The primary goal of TMDL effectiveness monitoring is to identify water quality improvements (or lack thereof) that result from TMDL implementation.

4

Page 5: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Need for Effectiveness Monitoring Provides a quantitative measure of progress

towards attainment with WQS. Allows for demonstration of incremental

improvements in water quality. Supports adaptive management approach to

implementation and restoration. Provides data for use in making SP-12 and

WQ-10 determinations. Provides documentation of water quality

improvements, which can be used to communicate results and justify the need forfunding to support water programs.

5

Page 6: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Recommended Steps for TMDL Effectiveness Monitoring Planning 1. Review existing data and information.

2. Select monitoring sites, parameters, and study design.

3. Estimate sample size.

4. Develop TMDL effectiveness monitoring plan.

6

Page 7: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Review Existing Data and Information Begin with a thorough review of all

available information that may direct the process.

Existing data and information will provide an understanding of: Historical and current water quality

conditions. TMDL implementation activities.

Involve stakeholders early.

7

Page 8: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Select Monitoring Sites, Parameters, and Study Design

Design effectiveness monitoring projects at the watershed scale. Specific project scale should be decided

upon using information on the number and extent of impaired or threatened waters, project resources, and project partners.

Watershed scale effectiveness monitoring: Pour point method

Distributed sampling method 8

Page 9: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Site Selection Approach: Pour Point Method

Monitoring Site

9

Page 10: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Site Selection Approach: Distributed Sampling Method

Monitoring Site

10

Page 11: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Site Selection

Locate monitoring sites where TMDL implementation isexpected to have discernible water quality effects.

Examples include sites on impaired/degraded waterbodies that are downstream of: WWTFs with new or revised WLAs. Discontinued illicit discharges. NPS that are managed through BMPs. Stream channel restoration projects. Improved onsite wastewater management or expansion

of sanitary sewer service. Other TMDL-specific pollution control measures.

Evaluate any potential existing monitoring network orsites.

11

Page 12: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Parameters

At a minimum, monitor the pollutants for which the TMDL was developed.

If resources allow, monitor for stressor and/orresponse variables, which provide additionalinformation about the condition of a water body.

If resources allow, monitor parameters thatmay be covariates to the primary pollutants ofinterest. Stream flow is a common covariate for

pollutants in streams and rivers.

12

Page 13: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Study Design

Outlines how water quality improvements will be demonstrated.

Critical to ensuring the collection of the specificdata needed to answer the study questions/goals.

Selection is dependent on many factors, including: Types of TMDL implementation actions. Implementation schedule. Availability and quality of previously collected data Resources. Existence of suitable reference sites.

13

Page 14: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Study Design (cont.)

Potential study designs for TMDL effectiveness monitoring include: Before & After

Upstream/downstream

Paired watersheds

Trend monitoring

14

Page 15: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Study Design: Before/After Study

Before After

Time 15

Page 16: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Study Design: Upstream/Downstream Study

16

Page 17: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Treatment Control

Study Design: Paired Watersheds Study

Before After 17

Time

Page 18: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Study Design: Trend Monitoring

Time 18

Page 19: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Estimate Sample Size

Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant changes.

Objective and informed sample size decisions can be made using a statistical method know as power analysis.

Power analysis uses information from pilot data to determine the optimal number of samples needed to identify statistically significant changes or trends.

19

Page 20: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Develop TMDL Effectiveness Monitoring Plan

Steps for planning a TMDL effectiveness monitoring project: 1. Review existing data and information.

2. Select monitoring sites, parameters, and study design.

3. Estimate sample size.

4. Develop TMDL effectiveness monitoring plan.

20

Page 21: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Develop TMDL Effectiveness Monitoring Plan (cont.) The planning document should include: Relevant background information. Project goals and objectives. Where and when monitoring will occur. List of parameters to be monitored. Preliminary discussion of intended data

analysis methods, including selected level ofsignificance.

The TMDL effectiveness monitoring plan can be incorporated into a QAPP.

21

Page 22: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Data-driven Planning for TMDL Effectiveness Monitoring & Statistical Test Selection for Analysis of Monitoring Data

DATA-DRIVEN PLANNING

22

Page 23: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Common Example of post-TMDL Monitoring Ambient monitoring reveals that the Fox Run

Watershed is impaired for total phosphorus. TMDL developed for total phosphorus.

TP TN TSS E. Coli

Post-TMDL monitoring completed through the routine monthly ambient monitoring of:

DO, conductivity, pH, Temp, Turbidity

23

Page 24: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Explore Pilot Data

Pilot data are previously collected data Ambient monitoring program TMDL data Special studies

Helps to informs sample size estimation (power analysis)

Informs selection of parameters to monitor for (correlation analysis)

24

Page 25: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Power Analysis

β ≈ MDC x α x √n/σ

Where: β=power MDC=minimum detectable change α=significance n=sample size σ=standard deviation (variability)

25

Page 26: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

0

20

40

60

80

100

120

140

160

180 M

inim

um D

etec

tabl

e C

hang

e E.

co

li (M

PN/1

00 m

L) p

er Y

ear

Estimate Sample Size

5 25 45 65 85 105 125 145 165 185 Sample Size (N)

26

Page 27: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Tota

l Pho

spho

rus

(mg/

L)

Streamflow (cfs)

1 10 100 1,000 10,000

27

Correlation Analyses

1.00

0.10

0.01

Page 28: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Select Parameters

Chloride Conductivity TN TP TSS Turbidity

Chloride 1 0.88 0.62 0.31 0.26 0.31

Conductivity 0.88 1 0.74 0.18 0.3 0.32

TN 0.62 0.74 1 0.28 0.34 0.36

TP 0.31 0.18 0.28 1 0.52 0.56

TSS 0.26 0.3 0.34 0.52 1 0.88

Turbidity 0.31 0.32 0.36 0.56 0.88 1

28

Page 29: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Data-driven Planning for TMDL Effectiveness Monitoring & Statistical Test Selection for Analysis of Monitoring Data

SELECTING STATISTICAL TESTS FOR ANALYZING EFFECTIVENESS MONITORING DATA

29

Page 30: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Common Types of Statistical Tests

Parametric: Assumes the data are normally distributed. Examples include: t-Test

Multiple linear regression

Nonparametric: Makes no assumption about data distribution. Examples include: Signed rank test

Mann-Kendall test for trend

30

Page 31: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Mean

Freq

uenc

y

Value Mean Mean

Freq

uenc

y

31 Fr

eque

ncy

Value Value

Data Distributions

Page 32: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Total Phosphorous (mg/L) 32

Fr

eque

ncy

80

60

40

20

0

Histograms

Page 33: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

0

10

20

30

Freq

uenc

y

Histograms (cont.)

33 pH

Page 34: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Freq

uenc

y

Total Phosphorous (mg/L) 34

Data Transformations

80

60

40

20

0

Page 35: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

log (Total Phosphorous) 35

0

10

20

30

40

Freq

uenc

y Data Transformations (cont.)

Page 36: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

36

FC

(# p

er 1

00 m

L)

8,000

6,000

4,000

2,000

0

Time Series Plots

Page 37: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

log

(Fec

al C

olifo

rm (#

/100

mL)

) 4.0

3.0

2.0

1.0

0.0

Minimum

Median

25%

Maximum

75%

Boxplots

Spring/Summer Fall/Winter 37

Page 38: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

t)L)

g/m(

us

phor

hos

PlaotT

Log

(

Autocorrelation & Lag Plots

-0.5

-1.0

-1.5

-2.0

-2.5

-3.0

R² = 0.4172

38 Log (Total Phosphorus (mg/L)) t-1

Page 39: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Censored Data

Date Fecal Coliform (#/100 mL) 1/13/2012 345 2/1/2012 280 2/29/2012 >2,400 3/17/2012 279 4/2/2012 240

Substituting >2,400 with the value of 2,400 gives a calculated mean of 709

If that value were really 10,000, the mean would be 2,229

39

Page 40: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Selecting an Appropriate Test

Objectives Study Design Characteristics of our data: Normal distribution Outliers Censored data points Seasonality Autocorrelation Missing data

40

Page 41: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Selecting an Appropriate Test (cont.)

Study Objective: Compare two independent data groups

Relevant Study Designs: Before/after;

Upstream/downstream

Parametric test: Two sample t-test

Nonparametric test: Rank sum test

41

Page 42: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Selecting an Appropriate Test (cont.)

Study Objective: Compare data groups with matched pairs

Relevant Study Designs Paired watersheds;

Upstream/downstream

Parametric test: Paired t-test

Nonparametric test: Signed rank test

42

Page 43: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Selecting an Appropriate Test (cont.)

Study Objective: Compare two data groups while adjusting for covariates

Relevant Study Designs: Paired watersheds

Parametric test: Analysis of covariance (ANCOVA)

43

Page 44: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Selecting an Appropriate Test (cont.)

Study Objective: Evaluate the relationship between one data group and time (without seasonality)

Relevant Study Designs: Trend monitoring

Parametric test: Linear regression

Nonparametric test: Mann-Kendall test

44

Page 45: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Selecting an Appropriate Test (cont.)

Study Objective: Evaluate the relationship between one data group and time (with seasonality)

Relevant Study Designs: Trend Monitoring

Parametric test: Linear regression with seasonal term

Nonparametric test: Seasonal Kendall test

45

Page 46: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Selecting an Appropriate Test (cont.)

Study Objective: Evaluate the relationship between one data group, time, and other variables (without seasonality)

Relevant Study Designs: Trend Monitoring

Parametric test: Multiple linear regression

Nonparametric test: Mann-Kendall test with LOWESS (“locally weighted scatterplot smoothing”)

46

Page 47: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Selecting an Appropriate Test (cont.)

Study Objective: – Evaluate the relationship between one data group, time, and other variables (with seasonality)

Relevant Study Designs: Trend monitoring

Parametric test: Multiple linear regression with seasonal term

Nonparametric test: Seasonal Kendall test with LOWESS

47

Page 48: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Data-driven Planning for TMDL Effectiveness Monitoring & Statistical Test Selection for Analysis of Monitoring Data

DEMO OF TMDL EFFECTIVENESS MONITORING PLANNING TOOL

48

Page 49: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Data Exploration Software

Spreadsheet software (Excel) Simple user-interface

Limited analysis options

Statistical packages Open source (R) or proprietary (SAS)

Steep learning curve

Web-based tools

49

Page 50: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

A Customized Tool

TMDL Effectiveness Monitoring Planning Tool(EMTool.xlsm) Excel-based Enhances basic Excel features with VBA code

Objective: Facilitate data-driven planning. Create exploratory plots Estimate sample size using power analysis Estimate monitoring costs

50

Page 51: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Data Exploration Checklist

Inputs: Pollutants of concern Pollution control types & implementation schedule Monitoring locations Study design Water quality targets

Outputs: Covariates Sample frequency, timing, and size • Project costs

Statistical methods

51

Page 52: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Example Dataset: Bacteria in Bear Creek, OR

TMDLs for Bear Creek& tributaries (1992 & 2007):

Ammonia BOD Total phosphorus Sediment Bacteria Temperature

2008-2009 summer E. coli concentrations at the watershed outlet.

52

Page 53: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Excel Tool - Pilot Data Worksheet

Enter P arameter Names & Units

53

Page 54: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Enter Sample Dates, Seasons & Values

54

Page 55: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Select a Parameter

Excel Tool - Data Exploration Worksheet

Review Time Series Plot & Summary Statistics

55

Page 56: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Review Histogram

Excel Tool - Data Exploration Worksheet

56

Page 57: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Review Seasonal Boxplot

Excel Tool - Data Exploration Worksheet

57

Page 58: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Review Lag Plot

Excel Tool - Data Exploration Worksheet

58

Page 59: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Explore Covariates

Excel Tool - Data Exploration Worksheet

59

Page 60: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Explore D ata Tra nsformation Options

Excel Tool - Data Exploration Worksheet

60

Page 61: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Excel T ool - Sample Size Worksheet

How many samples are needed to detect a statistically significant water quality change?

Depends on: Statistical test applied

Confidence level

Statistical power

Data variability

Minimum detectable change

61

Page 62: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Select a Parameter & Data Transformation Option

Excel Tool - Sample Size Worksheet

Key Assumption: A Parametric Statistical Test Will be Applied

62

Page 63: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Excel Tool - Sample Size Worksheet

Sample size depends on: 1) Statistical test applied

Select Study Design

63

Page 64: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

64

Sample size depends on:

Excel Tool - Sample Size Worksheet

2) Statistical power 3) Confidence level

Enter Statistical Power & Confidence Level

4) Data variability

Review Standard Deviation

Page 65: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Sample size depends on:

Excel Tool - Sample Size Worksheet

5) Minimum detectable change

65 Enter Minimum Detectable C hange

Page 66: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

E. C

oli

E. C

oli

100 MPN/100 mL

0 10Year 0 Year 10

Excel Tool - Sample Size Worksheet

Enter Minimum Detectable Change 66

Page 67: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Excel Tool - Sample Size Worksheet

Estimate Sample Size

67

Page 68: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Excel Tool – Cost Estimation Worksheet

Use information gained from data exploration to estimate project costs: Lab/equipment Labor/personnel Travel QAPP development Data analysis

Adjust monitoring decisions to maximize available resources

68

Page 69: Data-driven Planning for TMDL Effectiveness Monitoring ......Water quality data are often collected without considering the number of samples needed to demonstrate statistically significant

Question & Answer

Laura Blake, The Cadmus Group, Inc. 617-673-7148, [email protected]

Corey Godfrey, The Cadmus Group, Inc. 617-673-7343, [email protected]

Andy Somor, The Cadmus Group, Inc. 503-467-7194, [email protected]

David Croxton, EPA Region 10 (206) 553-6694, [email protected]

Jayne Carlin, EPA Region 10 (206) 553-8512, [email protected]

69