Upload
rasidh72
View
216
Download
0
Embed Size (px)
DESCRIPTION
statistika, distribucije
Citation preview
1. Fundamentals
1.1 Probability Theory
2.1 Theoretical Distributions
2.1.1 Discrete Distributions
The Binomial Distribution
The Poisson Distribution
2.1.2 Continuous Distributions
The Normal Distribution
Gamma Distribution
Gumbel Distribution
The x2 Distribution
The t Distribution
The F distribution
2.1.3 Multivariate Distributions
1.1 Theoretical Distributions
Bernoulli Distribution
A Bernoulli experiment is one in which there are just two outcomes of interests event A occurs or does not occur.
The indicator function of the event A is called a Bernoulli random variable:
1 ,0for ,)(or 1)()0( ,)()1(
occurs if 0,occurs if ,1
1 =======
=
xqpxfpqAPfpAPf
AA
X
xx
c
c
Examples: - tossing a coin (heads or tails)
- testing a product (good or defective)
- precipitation (rain or non rain)
The mean and the variance are
pqppEXEXXVarpqpEX
====+=
222 )()(01
1.1 Theoretical Distributions
Binomial Distribution B(n,p)
Consider the independent and identically distributed random variables X1,,Xn, which are the results of n Bernoulli trials. The number of successes among n trials, which is the sum of the 0s and 1s resulting from the individual trails
is described by a Binomial distribution and has the probability
nn XXS ++= L1
successesk exactly are herein which t patternsdistict ofnumber the:
n trials among successesk ofpattern particular a ofy probabilit the:
) trialsin successes ()(
===
kn
qp
qpkn
nkPkSP
knk
knkn
The mean and variance are
npqXVarXVarSVarnpEXEXES
nn
n
=++==++=
)()()( 121
LL
10
,!)!(
!
k) choose(n t coefficien Binomial
=
=
n
kknn
kn
1.1 Theoretical Distributions
Approximate Binomial Probabilities
According to the Central Limit Theorem, the distribution of the sum Sn and hence the binomial distribution, is asymptotically normal for large n. More precisely, for fixed p
)()1,0(lim zFznpqnpSP Nn
n=
=
Example:
Consider a binomial distribution with n=8 and p=1/2. The binomial distribution is
and the standard normal distribution is
which approximates the binomial distribution well
knk
xk kxXP
= 21218
)(
=24)( xFxXP
1.1 Theoretical Distributions
Example: Binomial Distribution and the Freezing of Cayuga Lake
Given the years, during which the Cayuga Lake in central New York State was observed to have frozen in the last 200 years,
1796 1816 1856 1875 1884 1904 1912 1934 1961 1979
what is the probability for the lake freezing at least once during some decade in the future?
Model: the number of years of lake freezing in 10 years is a binomial distributed random variable
The model is appropriate, since one can assume
- Lake freezing is a Bernoulli experiment
- Whether it freezes in a given winter is independent of whether it froze in recent years
- The probability that the lake will freeze in a given winter is constant
Estimating the model parameter
05.0200/10 ==pPrediction given by the model
32.0)95.0)(05.0(!9!1!10)05.01(05.0
110
)1( 91101 ==
== nSP
1.1 Theoretical Distributions
Poisson Distribution P(t)The random variable
X=number of events in an interval of width t
is described by the density
!)(e
!)(e ) in time events ()(
-mt-
xm
xttxPxf
xx
===
=number of events per unit time,
t=m= number of events in t
A Poisson variable has the mean and variancetXVartEX == )( ,
1.1 Theoretical Distributions
Relation to Binomial Distribution
-The Poisson distribution arises when concerning rare events (e.g. wind speed larger than Vc)
-Under the assumption of a constant , the base interval t is divided into n equal length sub-intervals with n being large enough so that the likelihood of two exceedances in any one sub-interval is negligible
- The occurrence of an exceedance in any one sub-interval can be approximated as a Bernoulli trial with probability t/n of success- Assume that events in adjacent sub-intervals are independent. The number of exceedance X in interval t is binomially distributed with B(n,t/n)- )()/,( , tPntnBn
1.1 Theoretical Distributions
Example: Poisson Distribution and Annual Tornado Counts
Given the annual tornado counts in New York State between 1959-1988, what are the probabilities for zero tornado per year or for greater than nine tornados per year?
Model: the annual number of tornados is a Poisson distributed random variable
Estimating the model parameter tThe rate of tornado occurrence:
6.430/138 ==m
Prediction of the model:
+==>
===
>
L!10 6.4)()9(01.0
!06.4)1()0(
106.4
9
06.4
0
exfxP
ePf
x
Histogram of number of tornados reported annually in New York State for 1959-1988 (dashed), and fitted Poisson distribution with m=4.6 tornados per year (solid)
1.1 Theoretical Distributions
The Normal Distribution N (,2)
=
=
x
N
N
dttxF
xxf
2
2
2
2
2)(exp
21)(
2)(exp
21)(
- described by two parameters and - symmetric
- for a N(0,1) random variable, probability for values larger than 1.96 (2.58) relative to the mean is smaller than 5% (1%). Nearly all values lie within [-3,3].- the c.d.f cannot be given explicitly
The Standard Normal Distribution N (0,1)
Any normal distribution can be transformed to N(0,1) using Z=(X-)/
= 221exp
21)( xxf
1.1 Theoretical Distributions
Normal Distribution
)(xfN )(xFN
1.1 Theoretical Distributions
1.1 Theoretical Distributions
Example: The normal distribution and the mean January temperature
Suppose that the mean January temperature at Ithaca (New York State) is a random variable with a Gaussian distribution with
=22.2F =4.4 FWhat is the probability that an arbitrarily selected January will have mean temperature as cold as or colder than 21.32 F (23.08F)?
- transforming the temperature into a standard normal variable
z=(21.32F-22.2F)/4.4F=-0.2
z=(23.08F-22.2F)/4.4F=+0.2
-looking up the table
F(z=-0.2)=0.421
F(z=+0.2)=0.579
Since Gaussian distribution is symmetric, one has
)(1)(1)(1)()( zFzZPzZPzZPzF ===
1.1 Theoretical Distributions
Example: The normal distribution and the mean January temperature
The average January temperature over the US are available for the period 1951-1980 and the year 1989. How can one assess the probabilities of the 1989 temperatures?
Model: the mean January temperatures at each stations are normally distributed random variables
Estimating the model parameters:
data available :
)(1
1
2
1
2
1
i
N
ii
N
ii
x
xN
xN
=
=
=
=
1.1 Theoretical Distributions
Probabilities of 1989 January mean temperature obtained from the model
- The probabilities are described in terms of percentiles of local Gaussian distributions
- Florida and most of the midwest were substantially warmer than usual, while only a small portion of the western United States was cooler than usual
1.1 Theoretical Distributions
Gamma Distribution: G (,)
0,, ,)(
)/exp()/()(1
>=
xxxxf
- defined for positive real numbers
- characterized by two parameters:
the shape parameter and the scale parameter - skewed
- mean and variance:
- approaches normal distribution for large - the c.d.f. is not explicit
- any gamma distribution can be transformed
to the standard Gamma distribution using
the transformation
22 , ==
x=
1(1) ),()1( ==+
0(0) :1on)distributi al(exponenti
/1)0(:10 as )(:1
=>
==