Economic Foundations for Entertainment, Media, and Technology · 2018. 2. 2. · 44/76 A Stochastic...

William Greene

New York University

True Random Effects in Stochastic

Frontier Models

http://people.stern.nyu.edu/wgreene/appc2014.pdf

Agenda

Skew normality – Adelchi Azzalini

Stochastic frontier model

Panel Data: Time varying and time invariant inefficiency models

Panel Data: True random effects models

Maximum Simulated Likelihood Estimation

Applications of true random effects

Persistent and transient inefficiency in Swiss railroads

A panel data sample selection corrected stochastic frontier model

Spatial effects in a stochastic frontier model

Skew Normality

The Stochastic Frontier Model

~ 0, ,

| |, ~ 0, ,

= v | |

Convenient parameterization (notation)

| | = [0,1] | [0,1] |

i i i i

i i i u

i i i i i

i v i u i v i u

u U U N

V U N N

2log log

log ( , , , ) = ( )

2 = log

Log Likelihood

Skew Normal

Density

Birnbaum (1950) Wrote About Skew Normality

Effect of

Linear

Truncation on

a Multinormal

Population

Weinstein (1964) Found f()

Query 2: The Sum of

Values from a

Normal and a

Truncated Normal

Distribution

See, also, Nelson (Technometrics, 1964), Roberts (JASA, 1966)

Resembles f()

O’Hagan and Leonard (1976) Found

Something Like f()

Bayes Estimation

Subject to Uncertainty

About Parameter

Constraints

ALS (1977) Discovered How

to Make Great Use of f()

See, also, Forsund and Hjalmarsson (1974), Battese and Corra (1976)

Poirier,… Timmer, … several others.

The standard skew normal distribution

f( ) = 2 ( ) ( )

Azzalini (1985) Figured Out f()

And Noticed the Connection to ALS

http://azzalini.stat.unipd.it/SN/

http://azzalini.stat.unipd.it/SN/abstracts.html#sn99

How to generate pseudo random draws on

1. Draw , from independent N[0,1]

2. = + | |u u

A Useful FAQ About the Skew Normal

2 2 22 2

For a particular desired and

Use and = 1 1

(0,1) | (0,1) |

v uN N

Random Number Generator

How Many Applications of SF Are There?

2 ( ) ( )z z

W. D. Walls (2006) On Skewness in the Movies

Cites Azzalini.

“The skew-normal

distribution

developed by Sahu et

al. (2003)…”

Does not

know Azzalini.

SNARCH Model for Financial Crises (2013)

Mixed Logit Model

exp( )Prob( )

exp( )

Random Parameters

Asymmetric (Skewed) Parameter Distribution

| |~ (0, , )

ik ik ik

Choice j

w v U SN

A Skew Normal Mixed Logit Model (2010)

Greene (2010, knows Azzalini and ALS),

Bhat (2011, knows not Azzalini … or ALS)

Foundation: An Entire Field

Stochastic Frontier Model

Occasional Modeling Strategy

Culture: Skewed Distribution of Movie Revenues

Finance: Crisis and Contagion

Choice Modeling: The Mixed Logit Model

How can these people find each other?

Where else do applications appear?

Skew Normal Applications

Stochastic Frontier

The Cross Section Departure Point: 1977

Aigner et al. (ALS) Stochastic Frontier Model

~ [0, ]

| | and ~ [0, ]

Jondrow et al. (JLMS) Inefficiency Estimator

( )ˆ [ | ]

i i i i

i i i u

ii i i i

ui i i

u U U N

2 2, ,

iv u i

The Panel Data Models Appear: 1981

Pitt and Lee Random Effects Approach: 1981

~ [0, ], | | and ~ [0, ]

Counterpart to Jondrow et al. (1982)

( / )ˆ [ | ,..., ]

1 ( / )

it it it i

it v i i i u

it it i

ii i i iT i

v N u U U N

Reinterpreting the Within Estimator: 1984

Schmidt and Sickles Fixed Effects Approach: 1984

~ [0, ],

Counterpart to Jondrow et al. (198

it i it it

it v i

semiparametically specified

fixed mean, constant variance

ˆ ˆˆ max ( )

(The cost of the semiparametric specification is the

location of the inefficiency distribution. The authors

also revisit Pitt and Lee to demonstrate.)

i i i iuTime

Misgivings About Time Fixed Inefficiency: 1990-

Cornwell Schmidt and Sickles (1990)

Kumbhakar (1990)

[1 exp( )] | |

Battese and Coelli (1992, 1995)

exp[ ( )] | |, exp[ ( , , )] | |

Cuesta (2000)

exp[ ( )]

it i i i

it i it it i

u bt ct U

u t T U u g t T U

| |, exp[ ( , , )] | |i it i it iU u g t T Uz

Are the systematically time varying models

more like time fixed or freely time varying?

A Pooled Model

Battese and Coelli (1992) exp[ ( )] | |

Pitt and Lee (1981) | |

Where is Battese and Coelli?

Closer to

it it it it

it it it i

u t T U

the pooled model or to Pitt and Lee?

Greene (2004): Much closer to the Pitt and Lee model

In these models with time varying inefficiency,

( , ) | |

~ [0, ] and ~ [0, ],

where does unobserved time invariant

heterogeneity end up?

In the inefficiency! Even with t

it it it i it i

it v it u

y v g t U

v N U N

he extensions.

Skepticism About Time Varying Inefficiency

Models: Greene (2004)

True Random Effects

True Random and Fixed Effects: 2004

True Random and Fixed Effects Approach: 2004

~ [0, ], | | and ~ [0, ]

Unobserved time invariant heterogeneity,

not unobserved time invariant inefficiency

it i it it it

it v it it it u

v N u U U N

ndrow et al. (JLMS) Inefficiency Estimator

( )[ | ]

itit it it

u itit it it v u i

varying

Estimation of TFE and TRE Models: 2004

True Fixed Effects: MLE

~ [0, ], | | and ~ [0, ]

it i it it it

it v it it it u

v N u U U N

Just add firm dummy variables to the SF model (!)

True Random Effects: Maximum Simulated Likelihood (RPM)

~ [0, ], | | and ~ [0, ], ~ [0, ]

it i it it it

it v it it it u i w

y w v u

v N u U U N w N

Random parameters stochastic frontier model

Log likelihood function for stochastic frontier model

2log log

log ( , , , ) = ( )

for stochastic frontier model

with a time invariant random constant term. (TRE model)

1log ( , , , , ) = log

it w ir it

N R TS

w i r t

LR y w

Simulated log likelihood fun t

draws from N[0,1].

The Most Famous Frontier Study Ever

The Famous WHO Model

logCOMP= +1logPerCapitaHealthExpenditure +

2logYearsEduc +

3Log2YearsEduc +

= v - u

Schmidt/Sickles FEM

191 Countries.

140 of them observed 1993-1997.

The Notorious WHO Results

No, it

doesn’t.

August

12, 2012

38/76 Huffington Post, April 17, 2014

we are #37

Greene, W., Distinguishing Between

Heterogeneity and Inefficiency:

Stochastic Frontier Analysis of the

World Health Organization’s Panel

Data on National Health Care

Systems, Health Economics, 13, 2004,

pp. 959-980.

21, log , log , log

log , log ,

Exp Ed Ed

PopDen PerCapitaGDP

GovtEff VoxPopuli OECD GINI

Three Extensions of the

True Random Effects Model

Generalized True Random Effects Stochastic Frontier Model

Transient random components

Time varying normal - half normal SF

Persistent random com

xit i i it it it

y A B v u

ponents

Time fixed normal - half normal SFi iA B

Generalized True Random Effects Model

A Stochastic Frontier Model with Short-Run and

Long-Run Inefficiency:

Colombi, R., Kumbhakar, S., Martini, G., Vittadini,

G., University of Bergamo, WP, 2011, JPA 2014,

forthcoming.

Tsionas, G. and Kumbhakar, S.

Firm Heterogeneity, Persistent and Transient Technical Inefficiency:

A Generalized True Random Effects Model

Journal of Applied Econometrics. Published online, November, 2012.

Extremely involved Bayesian MCMC procedure. Efficiency components estimated by

data augmentation.

Generalized True Random Effects Stochastic Frontier Model

( | |)

Time varying, transient random components

~ [0, ], | | and ~ [0, ],

it w i i it it it

it v it it it u

y w e v u

v N u U U N

invariant random components

~ [0,1], ~ [0,1]

The random constant term in this model has a closed skew

normal distribution, instead of the usual normal distribution.

i iw N e N

Estimating Efficiency in the CSN Model

Moment Generating Function for the Multivariate CSN Distribution

( , )E[exp( ) | ] exp

(..., ) Multivariate normal cdf. Parts defined in Colombi et al.

Computed using

T ii i i

Rr tt u y t Rr t t

GHK simulator.

0 1 0, = , , ...,

Estimating the GTRE Model

Colombi et al. Classical Maximum Likelihood Estimator

log ( , )log

log ( ( , )) log 2

(...) T-variate normal pdf.

(..., )) ( 1) Multivariate normal int

N T i i T

iq i i T

y X 1 AVA

R y X 1

egral.

Very time consuming and complicated.

“From the sampling theory perspective, the application

of the model is computationally prohibitive when T is

large. This is because the likelihood function depends

on a (T+1)-dimensional integral of the normal

distribution.” [Tsionas and Kumbhakar (2012, p. 6)]

Kumbhakar, Lien, Hardaker

Technical Efficiency in Competing Panel Data Models: A Study of

Norwegian Grain Farming, JPA, Published online, September, 2012.

Three steps based on GLS:

(1) RE/FGLS to estimate (,)

(2) Decompose time varying residuals using MoM and SF.

(3) Decompose estimates of time invariant residuals.

Maximum Simulated Full Information log likelihood function for the

"generalized true random effects stochastic frontier model"

( | |)2,

1logL , = log

it w ir ir

( ( | |) )

draws from N[0,1]

|U | absolute values of draws from N[0,1]

it w ir ir it

WHO Results: 2014

21, log , log , log

log , log ,

it i i it it

Exp Ed Ed

PopDen PerCapitaGDP

GovtEff VoxPopuli OECD GINI

A B v u

Empirical application

Cost Efficiency of Swiss Railway

Companies

Model Specification

TC = f ( Y1, Y2, PL , PC , PE , N, NS, dt )

C : Total costs

Y1 : Passenger-km

Y2 : Ton-km

PL : Price of labor (wage per FTE)

PC : Price of capital (capital costs / total number of seats)

PE : Price of electricity

N : Network length

NS: Number of stations

Dt: time dummies

50 railway companies

Period 1985 to 1997

unbalanced panel with number of periods (Ti) varying from 1 to 13 and

with 45 companies with 12 or 13 years, resulting in 605 observations

Data source: Swiss federal transport office

Data set available at http://people.stern.nyu.edu/wgreene/

Data set used in: Farsi, Filippini, Greene (2005), Efficiency and

measurement in network industries: application to the Swiss railway

companies, Journal of Regulatory Economics

Cost Efficiency Estimates

Correlations

MSL Estimation

Why is the MSL method so computationally

efficient compared to classical FIML and

Bayesian MCMC for this model?

Conditioned on the permanent effects, the group

observations are independent.

The joint conditional distribution is simple and easy to

compute, in closed form.

The full likelihood is obtained by integrating over only

one dimension. (This was discovered by Butler and

Moffitt in 1982.)

Neither of the other methods takes advantage of this

result. Both integrate over T+1 dimensions.

Equivalent Log Likelihood – Identical Outcome

One Dimensional Integration over δi

T+1 Dimensional Integration over Rei.

1log ( | , , , , , )

i ir w hi rG

Simulated [over (w,h)] Log Likelihood

Very Fast – with T=13, one minute or so

Also Simulated Log Likelihood

GHK simulator is used to approximate the T+1 variate normal

integrals.

Very Slow – Huge amount of unnecessary computation.

247 Farms, 6 years.

100 Halton draws.

Computation time:

35 seconds including

computing efficiencies.

Computation of the GTRE Model is Actually Fast and Easy

Simulation Variance

Does the simulation chatter degrade the

econometric efficiency of the MSL estimator?

Hajivassiliou, V., “Some practical issues in maximum simulated

likelihood,” Simulation-based Inference in Econometrics: Methods

and Applications, Mariano, R., Weeks, M. and Schuerman, T.,

Cambridge University Press, 2008

Speculated that Asy.Var[estimator] = V + (1/R)C

The contribution of the chatter would be of second or third order.

R is typically in the hundreds or thousands.

No other evidence on this subject.

An Experiment

Pooled Spanish Dairy Farms Data

Stochastic frontier using FIML.

Random constant term linear regression with

constant term equal to - |w|, w~ N[0,1]

This is equivalent to the stochastic frontier

model.

Maximum simulated likelihood

500 random draws for the simulation for the base case.

Uses Mersenne Twister for the RNG

50 repetitions of estimation based on 500 random

draws to suggest variation due to simulation chatter.

ˆ 0.10371

ˆ 0.15573

Chatter

.00543

.00590

.00042

.00119

Simulation Noise in Standard Errors of Coefficients

Quasi-Monte Carlo Integration Based on

Halton Sequences

Coverage of the unit interval is the objective,

not randomness of the set of draws.

Halton sequences --- Markov chain

p = a prime number,

r= the sequence of integers, decomposed as

H(r|p)

0, ,...1 r = r (e.g., 10,11,12,...)

For example, using base p=5, the integer r=37 has b0 = 2, b1 = 2, and b3 = 1; (37=1x52 + 2x51 + 2x50). Then H(37|5) = 25-1 + 25-2 + 15-3 = 0.488.

Is It Really Simulation?

Halton or Sobol sequences are not

random

Far more stable than random draws, by a

factor of about 10.

There is no simulation chatter

View the same as numerical quadrature

There may be some approximation error.

How would we know?

Halton sequences --- Markov chain

p = a prime number,

r= the sequence of integers, decomposed as

H(r|p)

Coverage of the unit interval is the objective,

not randomness of the set of draws.

0, ,...

1 r = r (e.g., 10,11,12,...)

Halton Sequences

LogL( , , , , , )

LogL ( , , , , , )

it it i

i t it it i

it it ir

Halton[prime( ), burn in]

it it ir

ir w ir h ir

Haltonized Log Likelihood

Summary

The skew normal distribution

Two useful models for panel data (and one

potentially useful model pending development)

Extension of TRE model that allows both transient and

persistent random variation and inefficiency

Sample selection corrected stochastic frontier

Spatial autocorrelation stochastic frontier model

Methods: Maximum simulated likelihood as an

alternative to received brute force methods

Simpler

Faster

Accurate

Simulation “chatter” is a red herring – use Halton sequences

Sample Selection

TECHNICAL EFFICIENCY ANALYSIS CORRECTING FOR

BIASES FROM OBSERVED AND UNOBSERVED

VARIABLES: AN APPLICATION TO A NATURAL RESOURCE

MANAGEMENT PROJECT Empirical Economics: Volume 43, Issue 1 (2012), Pages 55-72

Boris Bravo-Ureta

University of Connecticut

Daniel Solis

University of Miami

William Greene

New York University

The MARENA Program in Honduras

Several programs have been implemented to address resource degradation while also seeking to improve productivity, managerial performance and reduce poverty (and in some cases make up for lack of public support).

One such effort is the Programa Multifase de Manejo de Recursos Naturales en Cuencas Prioritarias or MARENA in Honduras focusing on small scale hillside farmers.

Expected Impact Evaluation

Methods

A matched group of beneficiaries and control

farmers is determined using Propensity Score

Matching techniques to mitigate biases that

would stem from selection on observed

variables.

In addition, we deal with possible self-selection

on unobservables arising from unobserved

variables using a selectivity correction model for

stochastic frontiers introduced by Greene (2010).

A Sample Selected SF Model

di = 1[′zi + hi > 0], hi ~ N[0,12]

yi = + ′xi + i, i ~ N[0,2]

(yi,xi) observed only when di = 1.

i = vi - ui

ui = u|Ui| where Ui ~ N[0,12]

vi = vVi where Vi ~ N[0,12].

(hi,vi) ~ N2[(0,1), (1, v, v2)]

Simulated logL for the Standard SF Model

2 212exp[ ( |) / ]

( | ,| |)2

i i u i vi i i

y |Uf y U

exp[ ( |) / ]( | ) (| |) | |

i i u i vi i i i

y |Uf y p U d U

2122exp[ | | ]

(| |) , |U | 0. (Half normal)2

1 exp[ ( |) / ]( | )

R i i u ir vi r

y |Uf y

1 exp[ ( |) / ]log ( , , , ) = log

N R i i u ir vS u v i r

This is simply a linear regression with a random constant term, αi = α - σu |Ui |

Likelihood For a Sample Selected SF Model

| ( , , ,| |)

exp ( | |) / )

2 (1 ) ( )

( | |) /

| ( , , ) | ( , , ,| |) (| |) | |

i i i i i

i i u i v

i i u i i

i i i i i i i i i i iU

f y d U

d dy U

f y d f y d U f U d U

x z x z

Simulated Log Likelihood for a Selectivity

Corrected Stochastic Frontier Model

exp ( | |) / )

1 ( | |) /log ( , , , , , ) log 1

(1 ) ( )

i i u ir v

N Ri i u ir iS u v i r

The simulation is over the inefficiency term.

JLMS Estimator of ui

ˆˆ ˆ ˆexp ( | |) / )

ˆ 2ˆ

ˆˆ ˆ ˆ ˆ( | |) /

1 1ˆ ˆˆ ˆˆ = ( | |) ,

ˆˆ Estimator of [ | ]

i i u ir v

i i u ir v i

i u ir ir i irr r

ii i i

A U f B fR R

Au E u

ˆˆ ˆ ˆ| | where , 1

Riru ir ir irR r

fU g g

Closed Form for the Selection Model

The selection model can be estimated without

simulation

“The stochastic frontier model with correction

for sample selection revisited.” Lai, Hung-pin.

Forthcoming, JPA

Based on closed skew normal distribution

Similar to Maddala’s 1982 result for the linear

selection model. See slide 42.

Not more computationally efficient.

Statistical properties identical.

Suggested possibility that simulation chatter is an element of

inefficiency in the maximum simulated likelihood estimator.

Spanish Dairy Farms: Selection based on being farm #1-125. 6 periods

The theory works.

Closed Form vs. Simulation

Variables Used

in the Analysis

Production

Participation

Findings from the First Wave

A Panel Data Model

Selection takes place only at the baseline.

There is no attrition.

1[ > 0] Sample Selector

, 0,1,... Stochastic Frontier

Selection effect is exerted on ; Corr( , , )

( , ) ( ) ( | )

it i it it it

it i i it i

y w v u t

P y d P d P y d

0 1 0 00

0 1 0 0 0

onditioned on the selection ( ) observations are independent.

( , ,..., | ) ( | )

I.e., the selection is acting like a permanent random effect.

( , ,..., , ) ( ) (

i i iT i it it

i i iT i i it

P y y y d P y d

P y y y d P d P y 0| )t id

Simulated Log Likelihood

log ( , , , , )

exp ( | |) / )

( | |) /

S C u v

it it u itr v

it it u itr v i

R y U a

Benefit group is more efficient in both years

The gap is wider in the second year

Both means increase from year 0 to year 1

Both variances decline from year 0 to year 1

Main Empirical Conclusions from Waves 0 and 1

Spatial Autocorrelation

Spatial Stochastic Frontier Models: Accounting for Unobserved

Local Determinants of Inefficiency: A.M.Schmidt, A.R.B.Morris,

S.M.Helfand, T.C.O.Fonseca, Journal of Productivity Analysis, 31,

2009, pp. 101-112

Simply redefines the random effect to be a ‘region effect.’ Just a

reinterpretation of the ‘group.’ No spatial decay with distance.

True REM does not “perform” as well as several other

specifications. (“Performance” has nothing to do with the frontier

model.)

True Random Spatial Effects

Economic Foundations for Entertainment, Media, and Technology · 2018. 2. 2. · 44/76 A Stochastic...

Documents

Colombi (1)

COLOMBI VIAGGIATORI - Versele

Finanzas de la gran colombi anew

Termologia - Cap. 18 - Professor Bruce Colombi

Technical efﬁciency in competing panel data models: a ...pages.stern.nyu.edu/.../Reference-Papers/Kumbhakar-JPA-PanelSurv… · Technical efﬁciency in competing panel data models:

Flupa UX Day 2012 : Gamifier une application - Teresa Colombi (LudoTIC)

Colombi di città: Tecnica necroscopica

artesanías de colombi?s. a. · artesanías de colombi?s. a.., ... El hombre primitivo comenzó sirviéndose de formas elementales a modo de ... Ull zapato tiene el tamaño del pie

Viaje y Neurosis - Beatriz Colombi

A practitioners guide to stochastic frontier analysis using stata-kumbhakar

Ft. M a Lucy Hernández Medellín - Colombi

Viaje Intelectual-Beatriz Colombi

Viaje Intelectual _ Beatriz Colombi

El Pueblo and La Rosca: A Political Dialogue in Colombi

Updated April 2020 SUBAL C. KUMBHAKAR CURRICULUM …bingweb.binghamton.edu/~kkar/cv2_03.pdf1 Updated April 2020 SUBAL C. KUMBHAKAR CURRICULUM VITAE Address: Department of Economics

Colombi · Colombi: Compro E ciente Prórroga del Acuerdo lrarco de Prec¡os para el sum¡nist o de Tiquetes Aéreos por parte de las Entidades Compradoras, CCE-283-i-AfúP-2015 celebrada

SeGF 2013 | Umsetzung der Informatikstrategie: Verwaltungsinterne Basis für innovative eGov-Vorhaben (Alexander Colombi)

accessoires - Colombi Sports

COLOMBI DI CITTA': QUALI MALATTIE?

Colombi - Viaje intelectual - Migraciones y desplazamientos en América Latina (1880-1915) - Selección