33
Estimation of a new parameter discriminating between Weibull tail-distributions and heavy-tailed distributions by Jonathan EL METHNI in collaboration with Laurent GARDES & St´ ephane GIRARD & Armelle GUILLOU Tunis Thursday 26 th May 2011.

Estimation of a new parameter discriminating between

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Estimation of a new parameter discriminating between

Estimation of a new parameter discriminating betweenWeibull tail-distributions and heavy-tailed distributions

by

Jonathan EL METHNI

in collaboration with

Laurent GARDES & Stephane GIRARD & Armelle GUILLOU

Tunis Thursday 26th May 2011.

Page 2: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

1 Introduction to the extreme value theoryMotivations3 domains of attractionFrechet / Gumbel

2 Model

3 EstimatorsDefinitionAsymptotic properties

4 Illustration on simulations

5 Concluding remarks

2 / 22

Page 3: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Motivations

Let X1, . . . ,Xn be a sample of independent and identically distributed randomvariables driven from X with cumulative distribution function F , and letX1,n ≤ · · · ≤ Xn,n denote the order statistics associated to this sample.

We want to estimate the extreme quantile xpn of order pn associated tothe random variable X ∈ R defined by

xpn = F←

(pn) = inf{x , F (x) ≤ pn},

with pn → 0 when n→∞. The function F←

is the generalized inverse ofthe non-increasing function F = 1− F .

Difficulty : If npn → 0 then P(xpn > Xn,n)→ 1.

3 / 22

Page 4: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Principals results on extreme value theory

Fisher-Tippett-Gnedenko theorem

Under some conditions of regularity on the cumulative distribution function F ,there exists a real parameter γ and two sequences (an)n≥1 > 0 and (bn)n≥1 ∈ Rsuch that for all x ∈ R,

limn→∞

P(Xn,n − bn

an≤ x

)= Hγ(x),

with

Hγ(x) =

{exp

(−(1 + γx)

−1/γ+

)if γ 6= 0,

exp(−e−x

)if γ = 0,

where y+ = max(0, y).

4 / 22

Page 5: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

3 domains of attraction

Hγ is called the cumulative distribution function of the extreme valuedistribution.

If F verifies the Fisher-Tippett-Gnedenko theorem, we say that F belongsto the domain of attraction of Hγ .

γ is called the extreme value index.

Frechet (γ > 0) Gumbel (γ = 0) Weibull (γ < 0)Pareto Normal UniformStudent Exponential BetaBurr Log-normalFrechet Gamma

Weibull

5 / 22

Page 6: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Frechet maximum domain of attraction : heavy-tailed distributions

All cumulative functions which belong to the Frechet maximum domain ofattraction denoted by D(Fr echet) can be rewritten as

F (x) = x−1/γ`(x),

where γ > 0 and `(x) is a slowly varying function i.e. `(λx)/`(x)→ 1 asx →∞ for all λ ≥ 1. F (x) is said to be regularly varying at infinity withindex −1/γ. This property is denoted by F ∈ R−1/γ .

Gumbel maximum domain of attraction : light-tailed distributions

There is no simple representation for distributions which belong to D(Gumbel).We focus on an interesting sub-family called Weibull tail-distributions

F (x) = exp (−xα`(x)) ,

where α is called the Weibull tail-coefficient and `(x) is a slowly varyingfunction.

6 / 22

Page 7: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Frechet maximum domain of attraction : heavy-tailed distributions

All cumulative functions which belong to the Frechet maximum domain ofattraction denoted by D(Fr echet) can be rewritten as

F (x) = x−1/γ`(x),

where γ > 0 and `(x) is a slowly varying function i.e. `(λx)/`(x)→ 1 asx →∞ for all λ ≥ 1. F (x) is said to be regularly varying at infinity withindex −1/γ. This property is denoted by F ∈ R−1/γ .

Gumbel maximum domain of attraction : light-tailed distributions

There is no simple representation for distributions which belong to D(Gumbel).We focus on an interesting sub-family called Weibull tail-distributions

F (x) = exp (−xα`(x)) ,

where α is called the Weibull tail-coefficient and `(x) is a slowly varyingfunction.

6 / 22

Page 8: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Model established by L. Gardes, S. Girard & A. Guillou

First order condition (A1(τ, θ))

Let us consider the family of survival distribution functions defined as

(A1(τ, θ)) F (x) = exp(−K←τ (logH(x))) for x ≥ x∗ with x∗ > 0 and

Kτ (y) =∫ y

1uτ−1du where τ ∈ [0, 1],

H an increasing function such that H← ∈ Rθ where θ > 0.

Proposition

F verifies (A1(0, θ)) if and only if F is a Weibull-tail distribution functionwith Weibull tail-coefficient θ.

If F verifies (A1(τ, θ)), τ ∈ [0, 1) and if H is twice differentiable then Fbelongs to the Gumbel maximum domain of attraction.

F verifies (A1(1, θ)) if and only if F is in the Frechet maximum domain ofattraction with tail-index θ.

7 / 22

Page 9: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Model established by L. Gardes, S. Girard & A. Guillou

First order condition (A1(τ, θ))

Let us consider the family of survival distribution functions defined as

(A1(τ, θ)) F (x) = exp(−K←τ (logH(x))) for x ≥ x∗ with x∗ > 0 and

Kτ (y) =∫ y

1uτ−1du where τ ∈ [0, 1],

H an increasing function such that H← ∈ Rθ where θ > 0.

Proposition

F verifies (A1(0, θ)) if and only if F is a Weibull-tail distribution functionwith Weibull tail-coefficient θ.

If F verifies (A1(τ, θ)), τ ∈ [0, 1) and if H is twice differentiable then Fbelongs to the Gumbel maximum domain of attraction.

F verifies (A1(1, θ)) if and only if F is in the Frechet maximum domain ofattraction with tail-index θ.

7 / 22

Page 10: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Estimator of θ

Definition

Denoting by (kn) an intermediate sequence of integers, the following estimatorof θ is considered :

θn,τ (kn) =Hn(kn)

µτ (log(n/kn)),

with, for all t > 0,

µτ (t) =

∫ ∞0

(Kτ (x + t)− Kτ (t)) e−xdx .

Definition

Let us consider (kn) an intermediate sequence of integers such thatkn ∈ {1 . . . n} the Hill estimator is given by :

Hn(kn) =1

kn − 1

kn−1∑i=1

log(Xn−i+1,n)− log(Xn−kn+1,n).

8 / 22

Page 11: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Model established by L. Gardes, S. Girard & A. Guillou

Definition

An estimator of the extreme quantile xpn can be deduced by :

xpn,θn,τ (kn) = Xn−kn+1,n exp(θn,τ (kn) (Kτ (log(1/pn))− Kτ (log(n/kn)))

).

Second order condition (A2(ρ))

To establish the asymptotic normality of the estimators, a second-ordercondition on ` is required :

(A2(ρ)) There exist ρ < 0, a function b satisfying b(x)→ 0 and |b|asymptotically decreasing such that uniformly locally on λ > 0

log

(`(λx)

`(x)

)∼ b(x)Kρ(λ), when x →∞.

It can be shown that necessarily |b| ∈ Rρ.

9 / 22

Page 12: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Model established by L. Gardes, S. Girard & A. Guillou

Definition

An estimator of the extreme quantile xpn can be deduced by :

xpn,θn,τ (kn) = Xn−kn+1,n exp(θn,τ (kn) (Kτ (log(1/pn))− Kτ (log(n/kn)))

).

Second order condition (A2(ρ))

To establish the asymptotic normality of the estimators, a second-ordercondition on ` is required :

(A2(ρ)) There exist ρ < 0, a function b satisfying b(x)→ 0 and |b|asymptotically decreasing such that uniformly locally on λ > 0

log

(`(λx)

`(x)

)∼ b(x)Kρ(λ), when x →∞.

It can be shown that necessarily |b| ∈ Rρ.

9 / 22

Page 13: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

An intuitive justification

Objectives

1 Estimate τ independently from θ,

2 Replace τ by τn in θn,τ (kn),

3 Replace τ by τn and θn,τ (kn) by θn,τn (kn) in xpn,θn,τ (kn).

Note that for (kn) and (k ′n) two intermediate sequences of integers such that

θn,τ (kn)P−→ θ and θn,τ (k ′n)

P−→ θ and k ′n > kn we have

θn,τ (kn)

θn,τ (k ′n)=

Hn(kn)

Hn(k ′n)

µτ (log(n/k ′n))

µτ (log(n/kn))P−→ 1.

Then,Hn(kn)

Hn(k ′n)P∼ µτ (log(n/kn))

µτ (log(n/k ′n))= ψ(τ ; log(n/kn), log(n/k ′n)),

where

ψ(x ; t, t′) =µx(t)

µx(t′)is a bijection from R to

(−∞, exp(t − t′)

).

10 / 22

Page 14: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

An intuitive justification

Objectives

1 Estimate τ independently from θ,

2 Replace τ by τn in θn,τ (kn),

3 Replace τ by τn and θn,τ (kn) by θn,τn (kn) in xpn,θn,τ (kn).

Note that for (kn) and (k ′n) two intermediate sequences of integers such that

θn,τ (kn)P−→ θ and θn,τ (k ′n)

P−→ θ and k ′n > kn we have

θn,τ (kn)

θn,τ (k ′n)=

Hn(kn)

Hn(k ′n)

µτ (log(n/k ′n))

µτ (log(n/kn))P−→ 1.

Then,Hn(kn)

Hn(k ′n)P∼ µτ (log(n/kn))

µτ (log(n/k ′n))= ψ(τ ; log(n/kn), log(n/k ′n)),

where

ψ(x ; t, t′) =µx(t)

µx(t′)is a bijection from R to

(−∞, exp(t − t′)

).

10 / 22

Page 15: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

An intuitive justification

Objectives

1 Estimate τ independently from θ,

2 Replace τ by τn in θn,τ (kn),

3 Replace τ by τn and θn,τ (kn) by θn,τn (kn) in xpn,θn,τ (kn).

Note that for (kn) and (k ′n) two intermediate sequences of integers such that

θn,τ (kn)P−→ θ and θn,τ (k ′n)

P−→ θ and k ′n > kn we have

θn,τ (kn)

θn,τ (k ′n)=

Hn(kn)

Hn(k ′n)

µτ (log(n/k ′n))

µτ (log(n/kn))P−→ 1.

Then,Hn(kn)

Hn(k ′n)P∼ µτ (log(n/kn))

µτ (log(n/k ′n))= ψ(τ ; log(n/kn), log(n/k ′n)),

where

ψ(x ; t, t′) =µx(t)

µx(t′)is a bijection from R to

(−∞, exp(t − t′)

).

10 / 22

Page 16: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

An intuitive justification

Objectives

1 Estimate τ independently from θ,

2 Replace τ by τn in θn,τ (kn),

3 Replace τ by τn and θn,τ (kn) by θn,τn (kn) in xpn,θn,τ (kn).

Note that for (kn) and (k ′n) two intermediate sequences of integers such that

θn,τ (kn)P−→ θ and θn,τ (k ′n)

P−→ θ and k ′n > kn we have

θn,τ (kn)

θn,τ (k ′n)=

Hn(kn)

Hn(k ′n)

µτ (log(n/k ′n))

µτ (log(n/kn))P−→ 1.

Then,Hn(kn)

Hn(k ′n)P∼ µτ (log(n/kn))

µτ (log(n/k ′n))= ψ(τ ; log(n/kn), log(n/k ′n)),

where

ψ(x ; t, t′) =µx(t)

µx(t′)is a bijection from R to

(−∞, exp(t − t′)

).

10 / 22

Page 17: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Estimator of τ

Definition

Denoting by (kn) and (k ′n) two intermediate sequences of integers such that

k ′n > kn, the following estimator of τ is considered :

τn =

ψ−1(

Hn(kn)Hn(k′

n) ; log(n/kn), log(n/k ′n))

if Hn(kn)Hn(k′

n) <k′n

kn,

u if Hn(kn)Hn(k′

n) ≥k′n

kn.

where u is the realization of a standard uniform distribution.

11 / 22

Page 18: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Asymptotic properties

Asymptotic normality of τn

Suppose that (A1(τ, θ)) and (A2(ρ)) hold. Let (kn) and (k ′n) be twointermediate sequences of integers such that

(H1) kn →∞, k ′n/n→ 0, kn/k′n → 0,

√k ′nb(expKτ (log n/k ′n))→ 0,

(H2) log(n/k ′n)(log2(n/kn)− log2(n/k ′n)

)→∞,

√kn(log2(n/kn)− log2(n/k ′n)

)→∞.

we have :√kn(log2(n/kn)− log2(n/k ′n)

)(τn − τ)

d→ N (0, 1).

where log2 = log(log).

12 / 22

Page 19: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Asymptotic properties

Replacing τ by τn we obtain

θn,τn (kn) =Hn(kn)

µτn (log(n/kn)).

Asymptotic normality of θn,τn (kn)

Suppose that (A1(τ, θ)) and (A2(ρ)) hold. Let (kn) and (k ′n) be twointermediate sequences of integers such that (H1), (H2) hold with

(H3)(log2(n/kn)− log2(n/k ′n)

)/ log2(n/kn)→ 0,

(H4)√kn(log2(n/kn)− log2(n/k ′n)

)/ log2(n/kn)→∞.

we have :√kn (log2(n/kn)− log2(n/k ′n))

log2(n/kn)

(θn,τn (kn)− θ

)d→ N (0, θ2).

13 / 22

Page 20: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Asymptotic properties

Replacing τ by τn we obtain

θn,τn (kn) =Hn(kn)

µτn (log(n/kn)).

Asymptotic normality of θn,τn (kn)

Suppose that (A1(τ, θ)) and (A2(ρ)) hold. Let (kn) and (k ′n) be twointermediate sequences of integers such that (H1), (H2) hold with

(H3)(log2(n/kn)− log2(n/k ′n)

)/ log2(n/kn)→ 0,

(H4)√kn(log2(n/kn)− log2(n/k ′n)

)/ log2(n/kn)→∞.

we have :√kn (log2(n/kn)− log2(n/k ′n))

log2(n/kn)

(θn,τn (kn)− θ

)d→ N (0, θ2).

13 / 22

Page 21: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Asymptotic properties

Replacing τ by τn and θn,τ (kn) by θn,τn (kn) we obtain

xpn,θn,τn (kn) = Xn−kn+1,n exp(θn,τn (kn) (Kτn (log(1/pn))− Kτn (log(n/kn)))

).

Asymptotic normality of xpn,θn,τn (kn)

Suppose that (A1(τ, θ)) and (A2(ρ)) hold. Let (kn) and (k ′n) be twointermediate sequences of integers such that (H1), (H2), (H3), (H4) hold with

(log(n/kn))1−τ (Kτ (log(1/pn))− Kτ (log(n/kn)))→∞,

(log2(1/pn))/√kn(log2(n/kn)− log2(n/k ′n))→ 0,

log2(n/kn)(Kτ (log(1/pn))− Kτ (log(n/kn)))∫ log(1/pn)

log(n/kn)log(u)uτ−1du

→ 0,

we have :

√kn (log2(n/kn)− log2(n/k ′n))∫ log(1/pn)

log(n/kn)log(u)uτ−1du

(xpn,θn,τn (kn)

xpn− 1

)d−→ N (0, θ2).

14 / 22

Page 22: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Asymptotic properties

Replacing τ by τn and θn,τ (kn) by θn,τn (kn) we obtain

xpn,θn,τn (kn) = Xn−kn+1,n exp(θn,τn (kn) (Kτn (log(1/pn))− Kτn (log(n/kn)))

).

Asymptotic normality of xpn,θn,τn (kn)

Suppose that (A1(τ, θ)) and (A2(ρ)) hold. Let (kn) and (k ′n) be twointermediate sequences of integers such that (H1), (H2), (H3), (H4) hold with

(log(n/kn))1−τ (Kτ (log(1/pn))− Kτ (log(n/kn)))→∞,

(log2(1/pn))/√kn(log2(n/kn)− log2(n/k ′n))→ 0,

log2(n/kn)(Kτ (log(1/pn))− Kτ (log(n/kn)))∫ log(1/pn)

log(n/kn)log(u)uτ−1du

→ 0,

we have :

√kn (log2(n/kn)− log2(n/k ′n))∫ log(1/pn)

log(n/kn)log(u)uτ−1du

(xpn,θn,τn (kn)

xpn− 1

)d−→ N (0, θ2).

14 / 22

Page 23: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Simulations

We generate N = 100 samples (Xn,i )i=1,...,N of size n = 300.

On each sample (Xn,i ), the estimator xpn,θn,τn (kn) is computed for

kn = 2, . . . , 299 and k ′n = kn, . . . , 300.

In what follows we show simulation results for quantiles corresponding topn = 1/2n = 1.6 ∗ 10−3.

The associated deciles of the empirical Mean-Squared Error MSE areplotted.

Comparison with an estimator of A. L. M. Dekkers, J.H.J. Einmahl & L.de Haan.

15 / 22

Page 24: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Gamma distribution for xpn,θn,τn (kn)

/ D(Gumbel)

Empirical deciles of the Mean-Square Error16 / 22

Page 25: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Gamma distribution for an estimator of A. L. M. Dekkers et al.

Empirical deciles ot the Mean-Square Error17 / 22

Page 26: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Pareto distribution for xpn,θn,τn (kn)

/ D(Fr echet)

Empirical deciles of the Mean-Square Error18 / 22

Page 27: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Pareto distribution for an estimator of A. L. M. Dekkers et al.

Empirical deciles of the Mean-Square Error19 / 22

Page 28: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Concluding remarks

Conclusions and Further Work

The choice of the parameters kn and k ′n in practice.

Adapt our results to the case τ > 1 and investigate the possible link withsuper-heavy tails.

Extend this work to random variable Y = ϕ(X ) where X has a parentdistribution satisfying (A1(τ, θ)).

For instance, choosing ϕ(x) = x∗ − 1/x would allow to considerdistributions in the Weibull maximum domain of attraction (with finiteendpoint x∗).

20 / 22

Page 29: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Concluding remarks

Conclusions and Further Work

The choice of the parameters kn and k ′n in practice.

Adapt our results to the case τ > 1 and investigate the possible link withsuper-heavy tails.

Extend this work to random variable Y = ϕ(X ) where X has a parentdistribution satisfying (A1(τ, θ)).

For instance, choosing ϕ(x) = x∗ − 1/x would allow to considerdistributions in the Weibull maximum domain of attraction (with finiteendpoint x∗).

20 / 22

Page 30: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Concluding remarks

Conclusions and Further Work

The choice of the parameters kn and k ′n in practice.

Adapt our results to the case τ > 1 and investigate the possible link withsuper-heavy tails.

Extend this work to random variable Y = ϕ(X ) where X has a parentdistribution satisfying (A1(τ, θ)).

For instance, choosing ϕ(x) = x∗ − 1/x would allow to considerdistributions in the Weibull maximum domain of attraction (with finiteendpoint x∗).

20 / 22

Page 31: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Concluding remarks

Conclusions and Further Work

The choice of the parameters kn and k ′n in practice.

Adapt our results to the case τ > 1 and investigate the possible link withsuper-heavy tails.

Extend this work to random variable Y = ϕ(X ) where X has a parentdistribution satisfying (A1(τ, θ)).

For instance, choosing ϕ(x) = x∗ − 1/x would allow to considerdistributions in the Weibull maximum domain of attraction (with finiteendpoint x∗).

20 / 22

Page 32: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Main references

Dekkers, A. L. M., Einmahl, J. H. J., De Haan, L., (1989). A MomentEstimator for the Index of an Extreme-Value Distribution, The Annals ofStatistics, 17, 1833–1855.

Gardes, L., Girard, S., Guillou, A., (2011). Weibull tail-distributionsrevisited : a new look at some tail estimators, Journal of StatisticalPlanning and Inference, 141, 429–444.

Gnedenko, B.V., (1943). Sur la distribution limite du terme maximumd’une serie aleatoire, The Annals of Mathematics, 44, 423–453.

Hill, B.M., (1975). A simple general approach to inference about the tailof a distribution, The Annals of Statistics, 3, 1163–1174.

21 / 22

Page 33: Estimation of a new parameter discriminating between

Outline Introduction to the extreme value theory Model Estimators Illustration on simulations Concluding remarks

Thank you for your attention

22 / 22