Ch 2 - Frequency Analysis and Forecasting

Embed Size (px)

Citation preview

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    1/31

    Frequency Analysis and Forecasting

    Water resource systems must be planned for future eventsfor which no exact time of occurrence can be forecasted.

    Hence, the hydrologist must give a statement of the

    probability of the stream flow (or other hydrologic factors)

    will equal or exceed (or be less than) a specified value.

    important to the economic and social evaluation of a project.

    For planning and design of water resources

    development projects, the important parameters are

    river discharges and related questions on the

    frequency and duration of normal flows (e.g. forhydropower production or water availability) and

    extreme flows (floods and droughts).

    Introduction

    Flood frequency

    Frequency distribution

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    2/31

    Frequency Analysis and Forecasting

    Hydrologic processes such as floods are exceedingly complexnatural events.

    estimation of the flood peak - a very complex problem leading tomany different approaches.

    empirical formulae

    unit hydrograph methods

    statistical method of frequency analysis.

    In statistical methoduses historical records of peak flows to produceguidance about the expected behavior of future flooding.

    Two primary applications of flood frequency analyses are:

    To predict the possible flood magnitude over a certain time period

    To estimate the frequency with which floods of a certain magnitude mayoccur

    There are two data series of floods (peak flows):

    the annual series, and

    thepartial duration series.

    Introduction

    Flood frequency

    Frequency distribution

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    3/31

    Frequency Analysis and Forecasting

    Annual flood series - constitutes the data seriesthat the values of the single maximum

    daily/monthly/annually discharge in each year of

    record so that the number of data values equals the

    record length in years. Partial duration flood series - constitutes the data

    series with those values that exceed some arbitrary

    level. All the peaks above a selected level of discharge

    (a threshold) are included in the series and hence theseries is often called the Peaks over Threshold (POT)

    series.

    d/c of these data series? (data length and independence assumption)

    Introduction

    Flood frequency

    Frequency distribution

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    4/31

    Frequency Analysis and Forecasting

    Annual flood series Analysis. the data are arranged in decreasing order of

    magnitude and the probability P of each event being

    equalled to or exceeded is calculated by the plotting-

    position formula (previous chapter)

    The recurrence interval, T (return periodor frequency)

    is calculated as

    Introduction

    Flood frequency

    Frequency distribution

    P

    T1

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    5/31

    Frequency Analysis and Forecasting

    Annual flood series. The relationship between T and the probability of

    occurrence of various events is determined from

    binomial distribution.

    Introduction

    Flood frequency

    Frequency distribution

    rnrrnr

    r

    n

    nr qPrrn

    nqPCP

    !)!(

    !,

    the probability of occurrence

    of the event r times in n

    successive years

    Where q = 1 - P.

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    6/31

    Frequency Analysis and Forecasting

    Annual flood series.

    Introduction

    Flood frequency

    Frequency distribution

    Order No. m Flood magnitude Q (m3/s) T in years = 51/m

    1 160 51.00

    2 135 25.503 128 17.00

    4 116 12.75

    . . .

    . . .

    . . .

    49 65 1.04

    50 63 1.02

    A plot of Q Vs T yields the probability distribution. For interpolation/extrapolation of small return periods, a simple best-

    fitting curve through plotted points can be used as the probability

    distribution.

    However, when larger extrapolations of T are involved, theoretical

    probability distributions (e.g. Gumbel extreme-value, Log-Pearson Type III,

    and log normal distributions) have to be used.

    example - a list of flood magnitudes of a river arranged in descending

    order as shown below.

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    7/31

    Frequency Analysis and Forecasting

    Annual flood series. In frequency analysis of floods the usual problem is to predict

    extreme flood events. Towards this, specific extreme-value

    distributions are assumed and the required statistical parameters

    calculated from the available data. Using these, the flood

    magnitude for a specific return period is estimated.

    Chow (1951) has shown that most frequency-distribution

    functions applicable in hydrologic studies can be expressed by the

    following equation known as the general equation of hydrologic

    frequency analysis:

    Introduction

    Flood frequency

    Frequency distribution

    x x KT

    Where xT = value of the variate X of a random hydrologic series with

    a return period T, = mean of the variate, = standard deviation of

    the variate, K = frequency factor which depends upon the return

    period, T and the assumed frequency distribution.

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    8/31

    Frequency Analysis and Forecasting

    Annual flood series. the commonly used frequency distribution functions

    for the prediction of extreme flood values are Gumbels extreme value distribution

    log-Pearson Type III distribution

    log-Pearson Type II distribution log-normal distribution

    normal distribution

    In order to select and recommend a certain frequencydistribution function (distribution model), the samplestatistics data distribution should be tested for goodness of

    fit as a satisfactory basis for selection. (test of goodness, inprevious chapter)

    Introduction

    Flood frequency

    Frequency distribution

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    9/31

    Frequency Analysis and Forecasting

    Gumbels distribution. the most widely used probability-distribution

    functions for extreme values in hydrologic and

    meteorological studies for prediction of flood peaks,

    maximum rainfalls, maximum wind speed, etc. For annual series of peak flows, the probability of

    occurrence of an event equal to or larger than a value

    x0 is

    Introduction

    Flood frequency

    Frequency distribution

    P X x ee y

    0 1

    y

    X X

    X

    128259

    0577.

    .

    In which y is a dimensionless variable given by

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    10/31

    Frequency Analysis and Forecasting

    Gumbels distribution. Rearranging the dimensionless variables, y, equation in the form

    of Chows general hydrologic frequency equation; the value of XT

    with a return period T is:

    In practice it is the value of X for a given P that is

    required and the Gumbels equation is transposed as

    y = -1n(-1n(q)) = -1n(-1n(1-p))

    Noting that the return period T = 1/P and designating;yT = the value of y, commonly called the reduced

    variate, for a given T

    Introduction

    Flood frequency

    Frequency distribution

    yT

    TT

    ln ln

    1

    x x K

    xT

    Ky

    T0577

    12825

    .

    .Where

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    11/31

    Frequency Analysis and Forecasting

    Gumbel's Equation for Practical Use. The above Gumbel's equations are applicable to an infinite

    sample size (i.e. N ), however, annual data series of extreme

    hydrologic events have finite lengths of record. So that, Gumbels

    equation is modified to account for finite N as given below for

    practical use.

    Introduction

    Flood frequency

    Frequency distribution

    x x KnT

    1

    Where n-1 = standard deviation of the sample

    x xN

    2

    1

    K = frequency factor expressed as K

    y y

    S

    T n

    n

    yT = reduced variate, a function of T

    yn

    = reduced mean, a function of sample size N and is given inGumbel distribution (Table 2.1 )

    Sn = reduced standard deviation, a function of sample size N (Table 2.2)

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    12/31

    Frequency Analysis and Forecasting

    Gumbel's Equation for Practical Use.

    Introduction

    Flood frequency

    Frequency distribution

    Table 2.1. Reduced mean in Gumbel's extreme value distribution, N = sample size

    N 0 1 2 3 4 5 6 7 8 9

    10 0.4952 0.4996 0.5035 0.5070 0.5100 0.5128 0.5157 0.5181 0.5202 0.5220

    20 0.5236 0.5252 0.5268 0.5283 0.5296 0.5309 0.5320 0.5332 0.5343 0.5353

    30 0.5362 0.5371 0.5380 0.5388 0.5396 0.5402 0.5410 0.5418 0.5424 0.5430

    40 0.5436 0.5442 0.5448 0.5453 0.5458 0.5463 0.5468 0.5473 0.5477 0.5481

    50 0.5485 0.5489 0.5493 0.5497 0.5501 0.5504 0.5508 0.5511 0.5515 0.5518

    60 0.5521 0.5524 0.5527 0.5530 0.5533 0.5535 0.5538 0.5540 0.5543 0.5545

    70 0.5548 0.5550 0.5552 0.5555 0.5557 0.5559 0.5561 0.5563 0.5565 0.5567

    80 0.5569 0.5570 0.5572 0.5574 0.5576 0.5578 0.5580 0.5581 0.5583 0.5585

    90 0.5586 0.5587 0.5589 0.5591 0.5592 0.5593 0.5595 0.5596 0.5598 0.5599

    100 0.5600

    ny

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    13/31

    Frequency Analysis and Forecasting

    Gumbel's Equation for Practical Use.

    Introduction

    Flood frequency

    Frequency distribution

    Table 2.2. Reduced standard deviation Sn in Gumbel's extreme value distribution, N = sample size

    N 0 1 2 3 4 5 6 7 8 9

    10 0.9496 0.9676 0.9833 0.9971 1.0095 1.0206 1.0316 1.0411 1.0493 1.0565

    20 1.0628 1.0696 1.0754 1.0811 1.0864 1.0915 1.0961 1.1004 1.1047 1.1086

    30 1.1124 1.1159 1.1193 1.1226 1.1255 1.1285 1.1313 1.1339 1.1363 1.1388

    40 1.1413 1.1436 1.1458 1.1480 1.1499 1.1519 1.1538 1.1557 1.1574 1.1590

    50 1.1607 1.1623 1.1638 1.1658 1.1667 1.1681 1.1696 1.1708 1.1721 1.1734

    60 1.1747 1.1759 1.1770 1.1782 1.1793 1.1803 1.1814 1.1824 1.1834 1.1844

    70 1.1854 1.1863 1.1873 1.1881 1.1890 1.1898 1.1906 1.1915 1.1923 1.1930

    80 1.1938 1.1945 1.1953 1.1959 1.1967 1.1973 1.1980 1.1987 1.1994 1.2001

    90 1.2007 1.2013 1.2020 1.2026 1.2032 1.2038 1.2044 1.2049 1.2055 1.2060

    100 1.2065

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    14/31

    Frequency Analysis and Forecasting

    Application procedure Of Gumbel distribution to estimate theflood magnitude for d/t return period. Assemble the discharge data of record length N. Find mean andn-1 for

    the given data.

    Using Tables 2.1 and 2.2 determine the reduced mean and standarddeviation for sample N.

    Find the reduced variate, yTfor a given T

    Find K by

    Determine the required xTby

    Introduction

    Flood frequency

    Frequency distribution

    yT

    TT

    ln ln

    1

    Ky y

    S

    T n

    n

    x x KnT

    1

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    15/31

    Frequency Analysis and Forecasting

    How to verify the annual flood series data follows theassumes Gumbel distribution?

    The value of xT for some return periods T

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    16/31

    Frequency Analysis and Forecasting

    Example : Annual maximum recorded floods in a certain river, for

    the period 1951 to 1977 is given below. Verify whether the

    Gumbel extreme-value distribution fit the recorded values.

    Estimate the flood discharge with return period of (i) 100 years

    and (ii) 150 years by graphical extrapolation.

    Introduction

    Flood frequency

    Frequency distribution

    Year Max. flood (m3/s) Year Max. flood (m3/s) Year Max. flood (m3/s)

    1951 2947 1960 4798 1969 6599

    1952 3521 1961 4290 1970 3700

    1953 2399 1962 4652 1971 4175

    1954 4124 1963 5050 1972 2988

    1955 3496 1964 6900 1973 27091956 2947 1965 4366 1974 3873

    1957 5060 1966 3380 1975 4593

    1958 4903 1967 7826 1976 6761

    1959 3757 1968 3320 1977 1971

    x

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    17/31

    Frequency Analysis and Forecasting

    SOLUTION: The flood discharge values are arranged in descending

    order and the plotting position return period TP for each

    discharge is obtained as

    Introduction

    Flood frequency

    Frequency distribution

    TN

    m mP

    1 28

    Ordernumber

    m

    Flooddischargex (m3/s)

    TP(years)

    Ordernumber

    m

    Flooddischargex (m3/s)

    TP(years)

    1 7826 28.00 15 3873 1.872 6900 14.00 16 3757 1.753 6761 9.33 17 3700 1.654 6599 7.00 18 3521 1.565 5060 5.60 19 3496 1.476 5050 4.67 20 3380 1.407 4903 4.00 21 3320 1.338 4798 3.50 22 2988 1.279 4652 3.11 23 2947 -

    10 4593 2.80 24 2947 1.1711 4366 2.55 25 2709 1.1212 4290 2.33 26 2399 1.0813 4175 2.15 27 1971 1.0414 4124 2.00

    N = 27 years, = 4263 m3/s, n-1 = 1432.6 m3/s

    x

    x

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    18/31

    Frequency Analysis and Forecasting

    SOLUTION:

    From Tables 2.1 and 2.2, for N = 27, yn = 0.5332 (reduced

    mean)and Sn = 1.1004 (reduced standard deviation) is calculated.

    YT and K for any choosen T is calculated by the formulas we

    descussed

    discharge xT for any chosen return interval is calculated by usingGumbel's formulae, and the result for some T values are shown in

    table below.

    Introduction

    Flood frequency

    Frequency distribution

    x

    T(years)

    XT[obtained by Eq.(5.30)](m3/s)

    5.0 552210.0 6499

    20.0 7436100.0 9558150.0 10088

    When these values are plotted on Gumbel probability paper, it is seen that these

    points lie on a straight line according to the property of the Gumbel's extreme

    probability paper.

    x

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    19/31

    Frequency Analysis and Forecasting

    Confidence Limits of probability distribution estimates:

    Since the value of the variate for a given return period, xT

    determined by Gumbel's method can have errors due to the

    limited sample data used, an estimate of the confidence limits of

    the estimate is desirable.

    For a confidence probability c, the confidence interval of the

    variate xT is bound by value x1 and x2 given by

    x1/2 = xTf (c)Se

    Where f(c) = function of the confidence probability c determined by using the

    table of normal variate

    Introduction

    Flood frequency

    Frequency distribution

    x

    c in per cent 50 68 80 90 95 99

    f(c) 0.674 1.00 1.282 1.645 1.96 2.58

    Se = probable error =b

    N

    n 1

    b K K 1 1 3 11 2. .K = frequency factor givenn-1 = standard deviation of the sampleN = sample size

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    20/31

    Frequency Analysis and Forecasting

    Confidence Limits of probability distribution estimates:

    Example: Data covering a period of 92 years for a certain riveryielded the mean and standard deviation of the annual flood

    series as 6437 and 2951 m3/s respectively. Using Gumbel's

    method estimate the flood discharge with a return period of 500

    years. What are the (a) 95% and (b) 80% confidence limits forthis estimate.

    Solution

    Introduction

    Flood frequency

    Frequency distribution

    From Table 2.1 and 2.1 for N = 92 years,

    yn

    = 0.5589, and Sn = 1.2020.

    Then y50 = -[ln((ln(500/499))] = 6.21361 , K500 = (6.21361 -

    0.5589)/1.2020 = 4.7044 x500 = 6437 + 4.7044*2951 = 20320m3/s.

    For the 95% confidence probability f(c) = 1.96 and x1/2 = 20320(1.96*1726), which results in x1 = 23703m

    3/s and x2 = 16937m3/s.

    Thus the estimated discharge of 20320m3/s has a 95% probability oflying between 23700 and 16940m3/s.

    http://localhost/var/www/apps/conversion/releases/20121120192253/tmp/scratch_2/Confidence%20Limits.dochttp://localhost/var/www/apps/conversion/releases/20121120192253/tmp/scratch_2/Confidence%20Limits.doc
  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    21/31

    Frequency Analysis and Forecasting

    STANDARD FREQUENCY DISTRIBUTIONS

    Several cumulative frequency distributions are commonly used

    in the analysis of hydrologic data and, as a result, they have

    been studied extensively and are now standardized.

    The frequency distributions that have been found most useful in

    hydrologic data analysis are

    normal distribution,

    log-normal distribution,

    Gumbel extreme value distribution, and

    log-Pearson Type III distribution.

    Introduction

    Flood frequency

    Frequency distribution

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    22/31

    Frequency Analysis and Forecasting

    Normal Distribution

    It is a classical mathematical

    distribution commonly used in

    the analysis of natural

    phenomena.

    It has a symmetrical,

    unbounded, bell-shaped curve

    with the maximum value at the

    central point and extending

    from - to + .

    Introduction

    Flood frequency

    Frequency distribution

    Normal probability distribution

    Standard normal distribution

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    23/31

    Frequency Analysis and Forecasting

    Normal Distribution characteristics

    the maximum value occurs at the

    mean.

    Half of the flows will be below the

    mean and half are above.

    68.3% of the events fall between 1standard deviation (S), 95% of the

    events fall within 2S, and 99.7% fall

    within 3S.

    the coefficient of skew is zero

    Introduction

    Flood frequency

    Frequency distribution

    Normal probability distribution

    Standard normal distribution

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    24/31

    Frequency Analysis and Forecasting

    Normal Distribution Frequency curve:

    Following the characteristics of normal probability distribution,

    frequency curve of normally distributed random variable X can be

    developed by the ff procedure.

    Compute the mean X and standard deviation S of the annual

    flood series.

    Plot two points on the probability paper: (a) X + S at an

    exceedence probability of 0.159 (15.9%) and (b) X - S at an

    exceedence probability of 0.841 (84.1%).

    Draw a straight line through these two points; the accuracy of the

    graphing can be checked by ensuring that the line passes throughthe point defined by X at an exceedence probability of 0.50

    (50%).

    Introduction

    Flood frequency

    Frequency distribution

    The straight line represents the assumed normal population. It can be used eitherto make probability estimates for given values of X or to estimate values of X for

    given exceedence probabilities.

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    25/31

    Frequency Analysis and Forecasting

    Normal Distribution Application:

    Disadvantage: it is unbounded in the negative direction whereas most

    hydrologic variables are bounded and can never be less than zero.

    For this reason and the fact that many hydrologic variables exhibit a

    pronounced skew, It usually has limited applications.

    pdf for normal distribution

    Introduction

    Flood frequency

    Frequency distribution

    Hydrologic variables such as annual precipitation, annual average streamflow, or

    annual average pollutant loadings follow normal distribution

    2

    2

    1

    2

    1)(

    x

    X exf

    is the mean and is the standarddeviation

    http://upload.wikimedia.org/wikipedia/commons/1/1b/Normal_distribution_pdf.png
  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    26/31

    Frequency Analysis and Forecasting

    Standard Normal Distribution:

    A standard normal distribution is a

    normal distribution with mean () = 0

    and standard deviation () = 1

    Normal distribution is transformed to

    standard normal distribution by using

    the following formula:

    Introduction

    Flood frequency

    Frequency distribution

    z is called the standard normal variable

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    27/31

    Frequency Analysis and Forecasting

    Flow estimation by Normal Distribution.

    Comparing the reduced variable (frequency factor) in the

    normal distribution, and the standard normal variable in the

    standard Normal distribution, we will have:

    Introduction

    Flood frequency

    Frequency distribution

    2

    2

    1

    2

    1)(

    x

    X exf

    TT

    Tz

    s

    xxK

    So the frequency factor for the Normal Distribution is the standard normalvariate, and the Estimated Value for some return period T is expressed as:

    szxsKxx TTT

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    28/31

    Frequency Analysis and Forecasting

    Normal distribution estimation procedure:

    To make a probability estimate p for a given magnitude, use thefollowing procedure:

    compute the value of the standard normal deviate by

    with the value of z and obtain the exceedence probabilityfrom standard normal probability table or use excel function

    NORMSDIST(z ).

    To make estimates of the magnitude for a given exceedence

    probability, use the following procedure:

    with the exceedence probability obtain the correspondingvalue of z from standard normal probability table or use excel

    function NORMSINV(probability).

    compute the magnitude X by:

    Introduction

    Flood frequency

    Frequency distribution

    szxx TT

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    29/31

    Frequency Analysis and Forecasting

    Log normal distribution:

    It has the same characteristics as the normal distribution except

    that the dependent variable, X, is replaced with its logarithm.

    Introduction

    Flood frequency

    Frequency distribution

    If the pdf of X is skewed, its not

    normally distributed

    If the pdf of Y = log (X) is normallydistributed, then X is said to be

    lognormally distributed.

    xlogyandxy

    xxf

    y

    y

    ,0

    2

    )(exp

    2

    1)(

    2

    2

    Hydraulic conductivity, distribution of raindrop sizes in storm follow lognormal

    distribution.

    The characteristics of the log-normal distribution are that it is bounded on the left byzero and it has a pronounced positive skew. These are both characteristics of many ofthe frequency distributions that result from an analysis of hydrologic data.

    http://upload.wikimedia.org/wikipedia/commons/4/46/Lognormal_distribution_PDF.png
  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    30/31

    Frequency Analysis and Forecasting

    Log normal distribution probability plot:

    Similar to normal distribution, If the logarithms of the peak

    flows are normally distributed, the data will plot as a straight

    line according to the equation:

    Introduction

    Flood frequency

    Frequency distribution

    procedure for developing the graph of the log-normal distribution:1. Transform the values of the flood series X by taking logarithms: Y =

    log X.2. Compute the log mean (Y) and log standard deviation (Sy) using

    the logarithms.3. Using Y and Sy, compute 10Y + Sy and 10Y - Sy. Using logarithmic

    frequency paper, plot these two values at exceedence probabilities of0.159 (15.9%) and 0.841 (84.1%), respectively.

    4. Draw a straight line through the two points.

    The data points can now be plotted on the logarithmic probability paper using thesame procedure as outlined for the normal distribution. Specifically, the flood

    magnitudes are plotted against the probabilities from a plotting position formula

  • 7/30/2019 Ch 2 - Frequency Analysis and Forecasting

    31/31

    Frequency Analysis and Forecasting

    Log normal distribution - Estimation:

    Graphical estimates of either flood magnitudes or probabilities

    can be taken directly from the line representing the assumed log-

    normal distribution. Values can also be computed using:

    Introduction

    Flood frequency

    Frequency distribution

    to obtain a probability for the logarithm of a givenmagnitude (Y = log X)

    to obtain a magnitude for a given probability.

    Anti-log transformation to obtain the true magnitude for

    a given probability.

    Two useful approximate relation of the logarithms b/n the original variables: