45
TMLE ANALYSIS NYC PREDICTIVE ANALYTICS ORI STITELMAN ([email protected]) BRIAN DALESSANDRO CLAUDIA PERLICH FOSTER PROVOST AUGUST 11, 2011 THE CAUSAL EFFECT OF DISPLAY ADVERTISING ON CONVERSION

TMLE ANALYSIS

  • Upload
    gur

  • View
    52

  • Download
    0

Embed Size (px)

DESCRIPTION

TMLE ANALYSIS. The Causal effect of display advertising on conversion. NYC Predictive Analytics Ori Stitelman ( [email protected] ) Brian Dalessandro Claudia Perlich foster provost August 11, 2011. QUESTION OF INTEREST. WHAT IS THE EFFECT OF DISPLAY ADVERTISING - PowerPoint PPT Presentation

Citation preview

Page 1: TMLE ANALYSIS

TMLE ANALYSIS

NYC PREDICTIVE ANALYTICSORI STITELMAN ([email protected])BRIAN DALESSANDROCLAUDIA PERLICHFOSTER PROVOSTAUGUST 11, 2011

THE CAUSAL EFFECT OF DISPLAY ADVERTISING ON CONVERSION

Page 2: TMLE ANALYSIS

2

QUESTIONOF INTEREST

WHAT IS THE EFFECT OFDISPLAY ADVERTISINGON CUSTOMER CONVERSION? ?

Page 3: TMLE ANALYSIS

3

WHAT ISTMLE?A SEMI-PARAMETRIC method

for estimating CAUSAL PARAMETERS

that DIRECTLY answer a

business question of interest?

Page 4: TMLE ANALYSIS

4

SEMI-PARAMETRICRealistic assumptions

CAUSAL PARAMETERSEffect/Impact (NOT Coefficients)

DIRECTLYActionable

WHAT ISTMLE?A SEMI-PARAMETRIC methodfor estimating CAUSAL PARAMETERSthat DIRECTLY answer abusiness question of interest?

Page 5: TMLE ANALYSIS

5

OUTLINE1. BACKGROUND: DISPLAY ADVERTISING &

2. PEEK AT RESULTS

3. A/B TESTING

4. ALTERNATIVE APPROACHES (NICE)

5. RESULTS

6. CONCLUSION

Page 6: TMLE ANALYSIS

6

THEBROWSERPROCESS

2. Use observed data to build list of prospects

3. Subsequently observe same browser surfing the web the next day

4. Browser visits a site where a display ad spot exists and bid requests are made

5. Auction is held for display spot

6. If auction is won display the ad

7. Observe browsers actions after displaying the ad

1. Observe people taking actions and visiting content

Page 7: TMLE ANALYSIS

7

RESULTSPREVIEW

RETARGETING

M6D

PROSPEC...

RETARGETING

M6D

PROSPEC...

RETARGETING

M6D

PROSPEC...

0%

2%

4%

6%

8%

10%

12%

14%

RELATIVE LIFT:EXPOSED VS. UNEXPOSED USERS

DID NOT SEE AD SAW ADCO

NVER

SIO

N RA

TE

1.05X

2.62X

1.11X

1.31X

0.92X2.26X

TELECOM COMPANY A

TELECOM COMPANY B

TELECOM COMPANY C

Page 8: TMLE ANALYSIS

8

P

GENERAL APPROACH

?

Ψ(P)

1. State Question

3. Define Parameter

Ψ(Pn)4. Estimate Parameter

2. Define Causal Assumption/Likelihood

Page 9: TMLE ANALYSIS

9

WHAT IS THE EFFECT OFDISPLAY ADVERTISING ON CUSTOMERCONVERSION?

?1. STATE QUESTION

DISPLAY ADVERTISINGShowing/Not showing a browser a display ad.

CUSTOMER CONVERSIONVisiting the advertisers website in the next 5 days.

Page 10: TMLE ANALYSIS

10

P2. DEFINE CAUSAL ASSUMPTIONS/LIKELIHOOD

O = (W,A,Y) ~ P0

W – Baseline VariablesA – Binary Treatment (Ad)Y – Binary Outcome (Site Visit)

P0(O) = P(W)P(A|W)P(Y|A,W)QW

QY

gA

Page 11: TMLE ANALYSIS

11

DATA STRUCTURE: OURVIEWERS

CHARACTERISTICS(W)

TREATMENT(A)

CONVERSION(Y)

Color Sex HeadShape

Ad No Ad

No Yes

Page 12: TMLE ANALYSIS

12

Ψ(P)3. DEFINE PARAMETER

P0(O) = P(W)P(A|W)P(Y|A,W)

P0,a(O) = P(W)P(Y|A=a,W)

P0,a(O) = P(W)P(Ya|W)=

LIKELIHOOD

DISTRIBUTION UNDER INTERVENTION

Page 13: TMLE ANALYSIS

13

Ψ(P)3. DEFINE PARAMETER

ΨAI(P)=E[YA=ad] – E[YA=no ad]

ΨRI(P)=E[YA=ad]/E[YA=no ad]

1. ADDITIVE IMPACT

2. RELATIVE IMPACT

P0,a(O) = P(W)P(Ya|W)

Page 14: TMLE ANALYSIS

14

E[YA=ad] E[YA=no ad]

φn,ad/φn,no ad

1. PARAMETERS ARE COMBINATIONS OF TREATMENT SPECIFIC CONVERSION RATES

2. SO WE CAN COMBINE ESTIMATES OF THESE THESE RATES

Ψ(Pn)4. ESTIMATE PARAMETER

φn,ad-φn,no ad

Page 15: TMLE ANALYSIS

15

Ψ(Pn)4. ESTIMATE PARAMETER

1. Optimal Experiment

2. A/B Testing

3. MLE Based Substitution Estimator (MLE)

4. Inverse Probability Estimators (IPTW)

5. Double Robust Estimating Equations (DR-IPW)

6. Targeted Maximum Likelihood Estimation (TMLE)

NICE

Page 16: TMLE ANALYSIS

16

OPTIMALEXPERIMENT:

3.6 per 1,000

1.2 per 1,000

Compare conversion rates of seeing an ad to conversion rate without ad for SAME individuals.

Page 17: TMLE ANALYSIS

17

OPTIMALEXPERIMENT:

BROWSER

SHOW AD?

OBSERVEOUTCOME

OR

OR

Page 18: TMLE ANALYSIS

18

REALITY: OR

OR

OR

OR

Page 19: TMLE ANALYSIS

19

OPTIMAL EXPERIMENT:Compare conversion rates of seeing an ad to conversion rate without ad for SAME individuals.

3.6 per 1,000

1.2 per 1,000

Page 20: TMLE ANALYSIS

20

COMMON APPROACH:A/B TESTINGSince we can not both treat and not treat SAME individuals. Randomization is used to create “EQUIVALENT” groups to treat and not treat.

3.4 per 1,000

1.6 per 1,000

Page 21: TMLE ANALYSIS

21

SIGNIFICANT COSTSASSOCIATED WITH DOING A/B TESTING WELL

1. Cost of displaying PSAs to the control(untreated group).

2. Overhead cost of implementing A/B test and ensuring that it is done CORRECTLY (Kohavi et al.)

3. Wait time necessary to evaluate the results.

Page 22: TMLE ANALYSIS

22

NON INVASIVECAUSAL ESTIMATION (NICE)Estimate The Effects In The Natural Environment (Observed Data)

Page 23: TMLE ANALYSIS

23

“WHAT IF”CAUSAL ANALYSIS ADJUSTING FOR CONFOUNDING

Need to adjust for the fact that the group that saw the advertisement and the group that didn’t may be very different.

Page 24: TMLE ANALYSIS

24

TWO MACHINES:QY,n AND gA,n

gA

QY

Pn(A| )

Pn(Y| )Pn(Y| )

gA,n

QY,n

Page 25: TMLE ANALYSIS

ESTIMATING QY AND gA

1. Many tools exist for estimating binary conditional distributions.

1. Logistic regression, SVM, GAM, Regression Trees, etc.2. Data adaptive estimation methods that use cross

validation.3. SuperLearner (R package, Eric Polley)

2. Causal analyses will benefit from the advances in parallelized routines for data adaptive estimation.

Page 26: TMLE ANALYSIS

26

INVERSE PROBABILITY WEIGHTED ESTIMATORS (IPTW)

•Adjust For Confounding Through gn.

•Weight individuals that are unlikely to be shown an advertisement more than individuals that are likely to be shown an advertisement

STRATEGY

ESTIMATOR

Pn(A| )=1/2

=

1/10

Pn(A| )

Page 27: TMLE ANALYSIS

QY

QY

MLE BASED SUBSTITUTION ESTIMATOR (MLE)

•Adjust For Confounding Through Q.•Predict how each observed browser will behave had they been shown an ad and had they not been shown an ad.

STRATEGY

ESTIMATOR

QY,n

QY,n27

Page 28: TMLE ANALYSIS

28

DOUBLE ROBUST ESTIMATORSWhat if QY,n or gA,n are broken? The MLE based estimator and IPTW rely on consistent estimates of Q and g respectively.

gA

QY

P(A| )

P(Y| )P(Y| )

gA,n

QY,n

Page 29: TMLE ANALYSIS

29

OR

STRATEGY

ESTIMATOR

AUGMENTED – IPTW (A-IPTW)

•Adjust For Confounding Through Q and g.

•Augments IPTW estimator with information from Q.

•Alternatively adjusts MLE with information from g.

Page 30: TMLE ANALYSIS

Ψ(QY)

TARGETED MAXIMUM LIKELIHOOD ESTIMATOR (TMLE)

•Adjust For Confounding Through Q and g.

•Predict how each observed browser will behave had they been shown ad and had they not been shown ad.

•The new machine Q* is calibrated with concern for the parameter of interest.

•R package “tmle” at CRAN

STRATEGY

ESTIMATORQY

Ψ(QY)QY

Q*Y,n

Q*Y,n

30

Page 31: TMLE ANALYSIS

CREATING Q*YN

•QYn is updated to Q*Yn using a clever covariate that is a function of g.

•Update is done through the use of a parametric submodel.

•Update is a univariate regression with the initial QYn as an offset.

QY

gA

Ψ(QY)QY

Q*Y,n

31

Page 32: TMLE ANALYSIS

32

SAMPLING/ANALYSIS1. Select Prospects that we got a bid request for on day

0.

2. Observe if treated on day 1. For those treated A=1 and those not treated A=0. Collect W.

3. Create outcome window that is the next five days following treatment and observe if event occurs.

4. Estimate parameters using the methods previously described.

Page 33: TMLE ANALYSIS

33

RESULTS

Page 34: TMLE ANALYSIS

34

IPTW AND A-IPTW BLOW UP UNDER POSITIVITY VIOLATIONS

Page 35: TMLE ANALYSIS

35

METHOD VALIDATION: A/B TEST VS TMLE

Page 36: TMLE ANALYSIS

36

METHOD VALIDATION: NEGATIVE TEST

Page 37: TMLE ANALYSIS

37

GROSSCONVERSIONRATESARE ONLY PARTOF THE STORY

RETARGETING

M6D

PROSPECTING

RETARGETING

M6D

PROSPECTING

RETARGETING

M6D

PROSPECTING

0.0%

0.1%

0.2%

0.3%

0.4%

0.5%

0.6%

0.7%

0.8%

0.9%

1.0%

0.5%

0.9%

0.1%

0.8%0.7%

ADDITIVE IMPACT: EXPOSED VS. UNEXPOSED USERS

ADDI

TIVE

IMPA

CT IN

CO

NVER

SIO

N RA

TE

-0.2%

TELECOM COMPANY A

TELECOM COMPANY B

TELECOM COMPANY C

Page 38: TMLE ANALYSIS

38

EFFECTIVENESSVARIES BYMARKETER& CAMPAIGN

RETARGETING PROSPECTING RETARGETING PROSPECTING0%

2%

4%

6%

8%

10%

12%

14%

16%

RELATIVE IMPACTDID NOT SEE AD SAW AD

CONV

ERSI

ON

RATE

B2B COMPANY

A

B2B COMPANY

B

1.08X

1.08X4.23X

3.77X

Page 39: TMLE ANALYSIS

39

RETARGETING

M6D P

ROSPECTING

RETARGETING

M6D P

ROSPECTING

RETARGETING

M6D P

ROSPECTING

RETARGETING

M6D P

ROSPECTING

0.0%

1.0%

2.0%

3.0%

4.0%

5.0%

6.0%

7.0%

8.0%

RELATIVE IMPACTDID NOT SEE AD SAW AD

CONV

ERSI

ON

RATE

1.71X1.34X

2.57X

1.17X

3.00X

2.33X

1.84X

1.11X

CAR RENTAL RESTAURANTHOTELAIRLINE

Effective creative and

targeting drive lift across

marketers and verticals

Page 40: TMLE ANALYSIS

TEC

HN

OLO

GY

A

TRA

VE

L A

B2B

A

TRA

VE

L B

TELE

CO

M A

TRA

VE

L C

TELE

CO

M B

TRA

VE

L D

RE

TAIL

A

RE

TAIL

B

TRA

VE

L E

ED

UC

ATI

ON

A

TRA

VE

L F

RE

STA

UR

AN

TA

AU

TO A

RE

TAIL

C

RE

STA

UR

AN

T B

TRA

VE

L G

ED

UC

ATI

ON

B

TEC

HN

OLO

GY

B

RE

TAIL

D

TELE

CO

M C

RE

TAIL

E

AU

TO B

ED

UC

ATI

ON

C

FIN

AN

CE

A

TRA

VE

L H

B2B

B

RE

TAIL

F

TRA

VE

L I

-100%

0%

100%

200%

300%

400%

500%

600%

700%

800%

-100%

0%

100%

200%

300%

400%

500%

600%

700%

800%

M6D PROSPECTING LIFT RETARGETING LIFT

M6D

PRO

SPEC

TING

LIF

T

RETA

RGET

ING

LIF

T

For 25 out of 30 marketers,

relative lift was

higher for prospecting

candidates than for retargeting

candidates .

SUMMARY RESULTSAVERAGE RELATIVE LIFT OF 90% FOR M6D PROSPECTING

Page 41: TMLE ANALYSIS

41

CONCLUSIONS1. Estimating causal effects allows one to directly estimate the

impact of the display advertisement.

2. Causal effects can be estimated in the observed data.

3. Display advertising works in varied ways. Causal analysis allows us to estimate effects on a case by case basis.

4. All methods for estimating effects are not equal.

5. These methods may be used for assessing other types of causal effects.

6. What is next?

Page 42: TMLE ANALYSIS

42

VISIT OUR BLOGmedia6degrees.com/blog

Page 43: TMLE ANALYSIS

43

OTHER RESOURCES FOR TMLE

Enter “Slide Show” view and click image to link to site.

Page 44: TMLE ANALYSIS

44

ACKNOWLEDGEMENTS

Edward CaprioloBrian DalessandroRod HookBrian MayRyan OttobreClaudia PerlichTom Phillips

Mark van der LaanSusan GruberEric Polley

Foster Provost

Page 45: TMLE ANALYSIS

45

REFERENCES1. O. Stitelman, B. Dalessandro, C. Perlich, and F. Provost. Estimating The Effect Of Online Display

Advertising On Browser Conversion. In Proceedings of KDD, Annual International Workshop on Data Mining and Audience Intelligence for Online Advertising, ADKDD ’11.

2. M. van der Laan and S. Rose. Targeted Learning: Causal Inference for Observational and Experimental Data. New York, NY: Springer Publishing Company, 2011. http://www.targetedlearningbook.com/

3. ‘tmle’ R Package http://cran.r-project.org/web/packages/tmle/index.html

4. R. Kohavi and R. Longbotham. Unexpected results in online controlled experiments. ACM SIGKDD Explorations Newsletter, 12(2):31–35, 2010.

5. R. Lewis and D. Reiley. Does retail advertising work: Measuring the effects of advertising on sales via a controlled experiment on yahoo. Technical report, Working paper, 2010.

6. D. Chan, R. Ge, O. Gershony, T. Hesterberg, and D. Lambert. Evaluating online ad campaigns in a pipeline: causal models at scale. In Proceedings of KDD, KDD ’10, pages 7–16, New York, NY, USA, 2010. ACM.