A New Panel Data Treatment for Heterogeneity in Time · PDF fileA New Panel Data Treatment for Heterogeneity in Time ... Analysis of technical eﬃciencies in the ... since for many

A New Panel Data Treatment forHeterogeneity in Time Trends

Alois KneipUniversitat Bonn

joint work with R. Sickles and W. Song

Stochastic frontier models

Goal: Assessment of a firm’s level of technical efficiency using panel data

I Data: {Yit ,Xit}I n individual firms, i = 1, . . . , nI T equidistant time periods, t = 1, . . . ,TI Yit - (logarithm of) production output of firm i in period tI Xit - vector of production inputs of firm i in period t

Standard panel model:

Yit = β0 +

p∑j=1

βjXitj + ui + ϵit

ui - individual effect characterizing firm i

I ui > uj ⇔ firm i more efficient than firm j

I Usual normalization: ui ≤ 0 (eg. half-normal)

I exp(ui ) ∈ [0, 1] - score characterizing technical efficiency

Example (n = 8,T = 8)

11

1

1

1 1

1

1

2

2

2

2

2

2

2

2

3 3

3

3

3

3

33

4

4

44

4 44

4

5

5 5

5

5

5

5

5

66

66

6

66

6

7

7

77

7

7

7 7

88

8 8

8

8

8

8

ln X

ln Y

Standard Analysis

Yit = β0 +

p∑j=1

βjXitj + ui + ϵit

I ϵit - error term, E (ϵit) = 0, Var((ϵit) = σ2

I ui - individual effect; uncorrelated with ϵit

I Frontier analysis: ui ≤ 0 (e.g. half-normal)I Standard panel or longitudinal data analysis: E (ui ) = 0,

Var(ui ) = σ2u (e.g. ui ∼ N(0, σ2

u))

I Analysis based on standard methods:I Least Squares (fixed effect approach)I Generalized Least SquaresI Maximum LikelihoodI Choice of an adequate procedure in dependence of assumptions

on the stochastic structure of ui , ϵi (ui correlated with Xi?)

Problem: The assumption of time-invariant technical efficienciesis too restrictive in many applications.

Example: Analysis of technical efficiencies in the U.S. Bankingindustry

I Panel data: Monthly observation for n = 971 banks from1984 - 1995 (T = 120)

I Well-known changes in business practices due to financialderegulation in the 1980’s⇒ possibly time-varying technical efficiencies

0 10 20 30

t

0.0

0.2

0.4

0.6

0.8

1.0

u(t)

Ind. 1

Ind. 2Ind. 3

Ind. 4

Ind. 5

Time Varying Individual Effects

Yit = β0(t) +

p∑j=1

βjXitj + vi (t) + ϵit

I β0(t) - general mean process.Identifiability:

∑i vi (t) = 0.

I vi (t) - individual effects

I Efficiencies: θit = exp

(vi (t)−maxj ,s vj(s)

)Qualitative assumption:vi (t) represent smooth time trends

Semiparametric ModelFor some fixed L ∈ {0, 1, 2, . . . }

Yit = β0(t) +

p∑j=1

βjXitj +L∑

r=1

θirgr (t)︸︷︷︸vi (t)

+ϵit

Standardization (no restriction!):

a)∑

i θ2i1 ≥

∑i θ

2i2 ≥ . . .

b)∑

i θirθis = 0 for r = s

c) 1T

∑Tt=1 gr (t)

2 = 1 and∑T

t=1 gr (t)gs(t) = 0

I L, w , θir as well as the functions gr are unknown and have tobe estimated from the data

I Related literature: Econometric factor models [e.g. Forni and Lippi

(1997), Forni and Reichlin (1998), Stock and Watson (2002), Forni

et al. (2000), Barnanke and Bovin (2000), Bai and Ng (2002), Bai

(2003, 2005,2007)]

ExamplesExample with L = 3 (Cornwell, Schmidt and Sickles (1990)):

Yit = β0 +

p∑j=1

βjXitj + θi0 + θi1t + θi2t2︸︷︷︸

vi (t)

+ϵit

Example with L = 1 (Battese and Coelli (1992)):

Yit = β0 +

p∑j=1

βjXitj + θi exp(η(t − T ))︸︷︷︸vi (t)

+ϵit

Example with L = 1: Random walk

Yit = β0 +

p∑j=1

βjXitj + θig(t)︸︷︷︸vi (t)

+ϵit , g(t + 1) = g(t) + δt ,

for i.i.d. innovations δ1, δ2, . . .with E(δt) = 0, var(δt) = σ2δ .

Identifiability

Model: Yit = β0(t) +∑p

j=1 βjXitj +∑L

r=1 θirgr (t) + ϵit

Identifiability of the model requires that all variables Xitj possess aconsiderable variation over t = 1, . . . ,T . Effects of time-invariantsocioeconomic or demographic variables are captured by thecoefficients θir

Example: Standard panel model with p time-changing and qtime-invariant variables, Xitj ≡ Xij , j = p + 1, . . . , p + q.⇒ The models holds with L = 1 and g1(t) ≡ 1,

Yit = β0 +

p∑j=1

βjXitj +

p+q∑j=p+1

βjXij + ui︸︷︷︸θi1

+ϵit

I Regression of θi1 on Xi ,p+1, . . . ,Xi ,p+q ⇒ βp+1, . . . , βp+q

If the vi were known, the components gr could by determined from the

eigenvectors of the empirical covariance matrix Σn of v1, . . . , vn:

Σn =1

n

∑i

(vi (1), . . . , vi (T )) · ((vi (1), . . . , vi (T ))′

⇒ Eigenvalues λ1 ≥ λ2 ≥ · · · ≥ λT and correspondingorthonormal eigenvectors γ1, γ2, . . . , γT .

λr =T

n

∑i

θ2ir for all r = 1, 2, . . . ,

gr (t) =√T · γrt for all r = 1, . . . , L, t = 1, . . . ,T

Necessarily,

n∑i=1

T∑t=1

(vi (t)−L∑

r=1

θirgr (t))2 ≤

n∑i=1

minαi1,...,αiL

(T∑t=1

(vi (t)−L∑

r=1

αirbr (t))2

)

for any possible choice of br (t).

EstimationRecall: Yit = β0(t) +

∑pj=1 βjXitj + vi (t)︸︷︷︸∑L

r=1 θirgr (t)

+ϵit

Step 1. Semi-parametric estimation of β and vi (partial leastsquares):

Determine β1, . . . , βp and functional approximations νi (t) byminimizing

∑i

1

T

∑t

(Yit − Yt −p∑

j=1

βj(Xitj − Xtj)− νi (t))2

+ κ∑i

1

T

∫ T

1

(ν(m)i (s))2ds

Then estimate vi (t) by vi (t) := νi (t), t = 1, . . . ,T , i = 1, . . . , n.

I κ - smoothing parameter

Smoother Matrices

Let z1, . . . , zT denote a natural spline basis of order 2m, and let Zand A be the T × T matrices with elements zj(t) and∫ T1 z

(m)j (s)z

(m)j (t),

Step 1 in matrix notation. Let

Zκ = Z (Z ′Z + κA)−1Z ′

⇒ β = (∑i

(Xi − X )′(I −Zκ)(Xi − X ))−1

·∑i

(Xi − X )′(I −Zκ)(Yi − Y )

vi =

vi (1)...

vi (T )

= Zκ(Yi − Y − (Xi − X )β)

Step 2: Estimate the empirical covariance matrix by

Σn =1

n

∑i

vi v′i

and calculate its

I eigenvalues λ1 ≥ λ2 ≥ . . . λT

I eigenvectors γ1, γ2, . . . , γT

Step 2: Setgr (t) =

√T · γrt

For i = 1, . . . , n determine θ1i , . . . , θLi by minimizing

∑t

(Yit − Yt −p∑

j=1

βj(Xitj − Xtj)−L∑

r=1

θri gr (t))2

Resulting approximation:

Yit − Yt =

p∑j=1

βj(Xitj − Xtj) +L∑

r=1

θir gr (t)

Technical efficiency of firm i in period t:

vi (t) =L∑

r=1

θir gr (t)

TEit = exp(vi (t)−maxjs

vj(s))

Questions: Theoretical results, Choice of L?

Asymptotic Theory

I We assume an i.i.d. sample of individual units and analyze theasymptotic behavior as n,T → ∞, κ ≡ κ(n,T ).

I The model is assumed to possess a fixed dimension L for alln,T .

I The time paths of the explanatory variables Xitj , t = 1, . . . ,Thas to be ”less smooth” than the time paths of the vi (t)Example: {Xitj}t=1,...,T ARMA(p, q)-processes

I vi and Xi may be correlated, but rates of convergence improveif they are ulc-uncorrelated:We will say that vi and Xi are “uncorrelated up to linearcomponents” (ulc-uncorrelated) if there exist linear functionszv ,i ,T (t) and zx ,i ,j ,T (t), possibly depending on i = 1, . . . , n,j = 1, . . . , p, or T , so that E(v∗i v

∗l |X ∗) = E(v∗i v

∗l ) holds for

all i , l ∈ {1, . . . , n}, where v∗i (t) = vi (t)− zv ,i ,T (t),X ∗itj = Xitj − zx ,i ,j ,T (t).

Setup 1: Smoothness

I Asymptotic Setup: For each individualvi (t) =

∑Lr=1 ϑrigr (

tT ) holds for all t = 1, . . . ,T , where

g1, . . . , gL are an underlying smooth (at least twicecontinuously differentiable) functions defined on [0, 1].

Consider cubic smoothing splines (m = 2)

If additionally κ ∼ (nT )−4/5 · T 4 if n = o(T 4) and T = o(n4),then

I ∥β − β∥ = OP(1√nT

)

I T−1/2∥gr − gr∥ = OP((nT )−2/5)

I∑

i ∥vi−∑L

r=1 θri gr∥∑i ∥vi∥

= OP(T−1/2)

I |θri − θri | = OP(1√T), θri is estimated as efficiently as in a

parametric model with known functions gr .

Setup 2: Stochastic Trends

Basic intuition (cubic spline smoothing):

I Time series asymptotics: T → ∞ corresponds to adding moreand more equidistant time periods.

I The squared bias of the spline approximation to vi (t) can be

bounded by κ · 1T

∫ T1 |v ′′i (s)|2ds, where vi (s) is a cubic spline

interpolant of vi (t), t = 1, . . . ,T .

I 1T

∫ T1 |v ′′i (s)|2ds ≤ 4 1

T

∑T−1t=2 (v(t − 1)− 2v(t) + v(t + 1))2

I Smoothing may work, since for many stochastic trendprocesses 1

T

∑t(vi (t + 1)− 2vi (t) + vi (t − 1))2 is of smaller

order of magnitude than 1T

∑t vi (t)

2.

I Random walk:E( 1

T

∑t vi (t)

2) = O(T )E( 1

T

∑t(vi (t + 1)− 2vi (t) + vi (t − 1))2 = O(1)

Example: Independent random walksLet

vi (t) :=L∑

r=1

ϑrigr (t), with gr (t + 1) = gr (t) + δr ,t ,

δr ,1, δr ,2, . . . i.i.d, E (δr ,t) = 0, var(δr ,t) = σ2δ,t .

If κ ∼ (nT )−1/2, then

I

∥β − β∥ =

OP(

1√nT

) if Xi and vi are ulc-uncorrelated

OP(1√T) else

I T−1/2∥gr − gr∥ = OP(T−3/2 + (nT )−1/2)

I∑

i ∥vi−∑L

r=1 θri gr∥∑i ∥vi∥

= OP(T−3/2 + (nT )−1/2)

I |θri − θri | = OP(1√T)

General theoretical results

Note:

I Smoothness asymptotics: vi (t) = νi (tT )

⇒ 1T

∑t vi (t)

2 →∫ 10 νi (t)

2dt as T → ∞I Stochastic trends: usually 1

T

∑t vi (t)

2 → ∞ as T → ∞

Theoretical results depend on c(T ), b(T ), and d(T ) whichquantify the rate of increase of 1

T

∑t vi (t)

2, of the correspondingsecond order differences, and of

∑X 2itj :

I E( 1T

∑Tt=1 vi (t)

2) =: c(T )

I E(

1T

∑T−1t=2 (vi (t − 1)− 2vi (t) + vi (t + 1))2

)=: b(T )

I E( 1T

∑Tt=1 X

2it,j) =: d(T )

Theorem

Let κb(T ) → 0 as well as κ1/4/T → 0 as n,T → ∞. Then, undersome additional technical conditions

(a) ∥β − Eϵ(β)∥ = OP(√

bβ(n,T , κ)), where

bβ(n,T , κ) :=

max{1,κ}b(T )

Tn if Xi and vi are ulc-uncorrelated,

max{1,κ}b(T )T ) else,

and V−1/2n,T (β − Eϵ(β)) ∼ N(0, I ), where

Vn,T = OP

(1

nT

).

Therefore,

∥β−β∥ = |β−Eϵ(β)−(β−Eϵ(β))∥ = OP(√

bβ(n,T , κ) + 1/(nT )).

(b) For all r = 1, . . . , L

T−1/2∥gr−gr∥ = OP

(√κb(T ) + d(T )bβ(n,T , κ)

c(T )+

1

nc(T )max{1, κ1/4}

),

where κ = min{κ, κ2}.

(c) For all r = 1, . . . , L

|θri−θri | = OP

(√T−1 + κb(T ) + d(T )bβ(n,T , κ) + (nmax{1, κ1/4})−1

).

Furthermore, ifκb(T ) + d(T )bβ(n,T , κ) + (nmax{1, κ1/4})−1 = o(T−1),then √

T (θ1i − θ1i , . . . , θLi − θLi )′ →d N(0, σ2I ).

(d) For all i = 1, . . . , n∑i ∥vi −

∑Lr=1 θri gr∥∑

i ∥vi∥

= OP

(√T−1 + κb(T ) + d(T )bβ(n,T , κ) + (nmax{1, κ1/4})−1

c(T )

).

(e) If additionally Tnmax{1,κ1/4} → 0 as well as

Td(T )bβ(n,T , κ) + d(T )n + 1

Tc(T ) = o(

Tnmax{1,κ1/4}

), then

n∑T

r=L+1 λr − (n − 1)σ2 · tr(ZκPLZκ)

σ2

√2n · tr((ZκPLZκ)2)

→d N(0, 1),

n · tr(PLΣn,T )− (n − 1)σ2 · tr(ZκPLZκ)

σ2√2n · tr((ZκPLZκ)2)

→d N(0, 1),

where PL = I − 1T

∑Lr=1 gr g

′r , and PL is the projection matrix

projecting into the n − L dimensional linear space orthogonal to

span{Zκg1, . . . ,ZκgL}.

Choice of L

Estimation of σ2:

σ2 :=1

(n − 1) · tr(I −Zκ)2

∑i

∥(I −Zκ)(Yi − Y − (Xi − X )β)∥2

Procedure: First select an α > 0 (e.g., α = 1%). For l = 1, 2, . . .determine

∆(l) :=n∑T

r=l+1 λr − (n − 1)σ2 · tr(ZκPlZκ)

σ2

√2n · tr((ZκPlZκ)2)

.

Choose L as the smallest l = 1, 2, . . . such that

∆(l) ≤ z1−α,

where z1−α is the 1− α quantile of a standard normal distribution.

Choice of κ

A smoothing parameter κ may be chosen by cross-validation:

CV (κ) :=1

nT

∑i

∑t

(Yit − Yt − (Xit − Xt)β−i −L∑

r=1

θr ,−i gr ,−i (t))2

I β−i , gr ,−i - estimates of β and gr obtained from the data(Ykj ,Xkj), k = 1, . . . , i − 1, i + 1, . . . , n, j = 1, . . . ,T

I θr ,−i - corresponding estimates of θri to be obtained when

using β−i , gr ,−i instead of β, gr

Simulation resultsGeneral setup:

Yit =

p∑j=1

βjXitj + vi (t) + ϵit

I Simulated samples of size n = 30, 100, 300 with T = 12, 30 ina model with p = 2 regressors

I β1 = β2 = 0.5

I ϵit i.i.d. N(0, 1)

I Regressors are generated according to a VAR model

Xit = RXi ,t−1 + ηit

with

R =

(0.4 0.050.05 0.4

)

Monte Carlo Simulation Results for DGP1:

vi (t) = θi0 + θi1t

T+ θi2

( t

T

)2MSE of Coefficients

N T Within GLS CSSW KSS

30 12 0.07258 0.06381 0.00867 0.0087430 0.02832 0.02355 0.00240 0.00258

100 12 0.01862 0.01643 0.00266 0.0027330 0.00678 0.00649 0.00073 0.00075

300 12 0.00610 0.00609 0.00086 0.0008730 0.00210 0.00208 0.00023 0.00023


vi (t) = θi0 + θi1t

T+ θi2

( t

T

)2MSE of Effects

N T Within GLS CSSW KSS L

30 12 0.1770 0.1746 0.0091 0.0091 2.407030 0.1666 0.1663 0.0036 0.0043 2.8050

100 12 0.1285 0.1280 0.0072 0.0073 2.968830 0.1240 0.1240 0.0029 0.0030 3.0100

300 12 0.1025 0.1025 0.0059 0.0060 3.004030 0.1001 0.1001 0.0024 0.0025 3.0060


vi (t) = ϑi rt , rt = random walk

MSE of Coefficients


30 12 0.02414 0.02085 0.01370 0.0047730 0.00699 0.00675 0.00662 0.00188

100 12 0.00974 0.00842 0.00488 0.0013930 0.00201 0.00195 0.00193 0.00052

300 12 0.00341 0.00430 0.00169 0.0004730 0.00071 0.00073 0.00063 0.00028


vi (t) = ϑi rt , rt = random walk

MSE of Effects


30 12 0.1655 0.1630 0.0601 0.0170 1.005030 0.0976 0.0975 0.0692 0.0100 1.0000

100 12 0.1544 0.1547 0.0491 0.0117 1.000030 0.0890 0.0890 0.0624 0.0072 1.0000

300 12 0.1480 0.1484 0.4500 0.0104 1.000030 0.0860 0.0861 0.0597 0.0065 1.0000


vi (t) ≡ ϑi standard panel model

MSE of Coefficients


30 12 0.00544 0.00484 0.00841 0.0061530 0.00188 0.00181 0.00221 0.00200

100 12 0.00176 0.00122 0.00262 0.0018330 0.00061 0.00051 0.00073 0.00062

300 12 0.00056 0.00080 0.00086 0.0005830 0.00020 0.00026 0.00024 0.00020


vi (t) ≡ ϑi standard panel model

MSE of Effects


30 12 0.1213 0.1126 0.3387 0.1519 1.032030 0.0472 0.0462 0.1288 0.0638 1.0100

100 12 0.0929 0.0876 0.2706 0.1032 1.043030 0.0363 0.0354 0.1062 0.0414 1.0230

300 12 0.0795 0.0811 0.2366 0.0838 1.028030 0.0319 0.0323 0.0947 0.0339 1.0200

Monte Carlo Simulation Results for DGP5 (L = 6):

vi (t) = 3(ϑi0 + ϑi1t

T+ θi2

( t

T

)2) + ϑi3rt + ϑi4g1t + ϑi5g2t

with rt+1 = rt + δt , g1t = sin(πt/4), g2t = cos(πt/4).

MSE of Coefficients


30 12 13.3517 12.4807 5.3708 0.249530 4.3252 4.2655 1.5867 0.0141

100 12 3.863 3.3662 1.6696 0.034230 1.3015 1.2465 0.5392 0.0021

300 12 1.3521 1.1517 0.6465 0.006530 0.4562 0.4314 0.185 0.0009

Monte Carlo Simulation Results for DGP5 (L = 6):

vi (t) = 3(ϑi0 + ϑi1t

T+ θi2

( t

T

)2) + ϑi3rt + ϑi4g1t + ϑi5g2t

with rt+1 = rt + δt , g1t = sin(πt/4), g2t = cos(πt/4).

MSE of Effects


30 12 0.1792 0.1781 0.0468 0.0013 4.957

30 0.1654 0.1653 0.0558 0.0004 5.000

100 12 0.1545 0.1540 0.0427 0.0006 5.000

30 0.1568 0.1567 0.0600 0.0002 5.987

300 12 0.1823 0.1821 0.0578 0.0003 5.996

30 0.1904 0.1903 0.0746 0.0003 6.000

ApplicationAnalysis of technical efficiencies in the U.S. Banking industry

I Panel data: Monthly observation for n = 971 banks from1984 - 1995 (T = 120)

I Well-known changes in business practices due to financialderegulation in the 1980’s⇒ possibly time-varying technical efficiencies

I Estimated dimension of the model L = 7

1 2 3 4 5 6 7 8 9 10 11 120.55

0.56

0.57

0.58

0.59

0.6

0.61

0.62

0.63

0.64

0.65

Time

Effic

iency

CSSWKSSBC

Documents

A New Panel Data Treatment for Heterogeneity in Time · PDF fileA New Panel Data Treatment for Heterogeneity in Time ... Analysis of technical eﬃciencies in the ... since for many