RecSys 2019

Addressing Delayed Feedback for Continuous Training with Neural Networks in CTR prediction

S. I. Ktena, A. Tejani, L. Theis, P. Kumar Myana, D. Dilipkumar, F. Huszár, S. Yoo, W. Shi

Background

Why continuous training?

New campaign IDs + non-stationary features

Challenge: Delayed feedback

Fact: Users may click ads after 1 second, 1 minute, or 1 hour.

Why is it a challenge?

Should we wait? → Delays model training
Should we not wait? → How do we decide the label?

[Figure: model quality vs. training delay trade-off]

Solution: accept "fake negatives"

Event               Label   Weight
(user1, ad1, t1)    imp     1
(user2, ad1, t2)    imp     1
(user1, ad1, t3)    click   1
(events ordered by time; (user1, ad1, t1) and (user1, ad1, t3) have the same features)

Every impression is logged immediately as a negative; if the user later clicks, the same example is logged again as a positive.

Assume X clicks out of Y impressions. The stream then contains Y negatives plus X positives, so its positive rate is X/(X+Y) rather than the true CTR X/Y.

This works well when CTR is low, where X/Y ≈ X/(X+Y).
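A quick numeric sanity check of that approximation (the numbers below are made up for illustration, not from the paper):

```python
# Illustrative numbers only: with a low CTR, the positive rate of the
# fake-negative stream, X / (X + Y), is almost identical to the true CTR X / Y.
clicks, impressions = 5, 1_000                   # X = 5 clicks out of Y = 1,000 impressions
true_ctr = clicks / impressions                  # X / Y       = 0.00500
stream_rate = clicks / (clicks + impressions)    # X / (X + Y) ≈ 0.00498
print(f"true CTR = {true_ctr:.5f}, fake-negative stream rate = {stream_rate:.5f}")
```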

Background

Delayed feedback models

● The probability of a click is not constant through time [Chapelle 2014]
● A second model, similar to survival-analysis models, captures the delay between impression and click
● The delay is assumed to follow an exponential distribution (or a non-parametric alternative)
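A compact way to write the Chapelle-style model those bullets describe; the notation below is chosen here for illustration and is a sketch, not the slides' exact formulation:

```latex
% Sketch of the delayed-feedback model [Chapelle 2014].
% C \in \{0,1\}: whether the impression is eventually clicked; D: click delay; x: features.
\begin{align}
  P(C = 1 \mid x)        &= p(x)                             && \text{(CTR model, e.g. logistic regression)} \\
  P(D = d \mid C = 1, x) &= \lambda(x)\, e^{-\lambda(x)\, d} && \text{(exponential delay model)}
\end{align}
```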


Our approach

Importance sampling

● p is the actual data distribution
● b is the biased data distribution

Importance weights: the ratio of p to b, used to reweight the training loss (sketched below).
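The standard importance-sampling identity this rests on (a textbook result, restated here rather than copied from the slide): the loss is an expectation under the true distribution p, but samples come from the biased distribution b, so each sample is reweighted by p/b.

```latex
% Importance sampling: expectations under p rewritten as reweighted expectations under b.
\begin{equation}
  \mathbb{E}_{(x,y)\sim p}\bigl[\ell(x,y)\bigr]
    = \mathbb{E}_{(x,y)\sim b}\!\left[\frac{p(x,y)}{b(x,y)}\,\ell(x,y)\right],
  \qquad
  w(x,y) = \frac{p(x,y)}{b(x,y)} \quad \text{(importance weights)}
\end{equation}
```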

Our approach

● Continuous training scheme → potentially wait an infinite time for positive engagement
● Two models
  ○ Logistic regression
  ○ Wide-and-deep model
● Four loss functions
  ○ Delayed feedback loss [Chapelle, 2014]
  ○ Positive-unlabeled loss [du Plessis et al., 2015] (generic form sketched below)
  ○ Fake negative weighted
  ○ Fake negative calibration

The fake negative weighted and fake negative calibration losses both rely on importance sampling.
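For reference, the generic unbiased positive-unlabeled risk from du Plessis et al. that the PU loss builds on; this is the textbook form, not necessarily the exact instantiation used in the paper:

```latex
% Unbiased PU risk [du Plessis et al., 2015] (generic form, shown for reference only).
% \pi: positive class prior, \ell: loss, f: classifier, p_{+}/p_{u}: positive / unlabeled distributions.
\begin{equation}
  R_{\mathrm{PU}}(f) =
      \pi\, \mathbb{E}_{x \sim p_{+}}\!\bigl[\ell(f(x), +1)\bigr]
    + \mathbb{E}_{x \sim p_{u}}\!\bigl[\ell(f(x), -1)\bigr]
    - \pi\, \mathbb{E}_{x \sim p_{+}}\!\bigl[\ell(f(x), -1)\bigr]
\end{equation}
```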

Loss functions

Delayed feedback loss

Assumes an exponential distribution for the time delay.
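A minimal sketch of the corresponding per-example negative log-likelihood, following Chapelle's formulation; the function name and signature are mine, not code from the paper:

```python
import numpy as np

def delayed_feedback_nll(p, lam, clicked, delay, elapsed):
    """Per-example negative log-likelihood, sketched from [Chapelle 2014].

    p        -- predicted click probability P(C=1|x)
    lam      -- predicted rate of the exponential delay distribution
    clicked  -- True if a click has been observed so far
    delay    -- observed delay between impression and click (when clicked)
    elapsed  -- time elapsed since the impression (when not clicked)
    """
    if clicked:
        # Click observed after `delay`:  -[log p(x) + log lam(x) - lam(x) * delay]
        return -(np.log(p) + np.log(lam) - lam * delay)
    # No click yet: either a true negative, or a click that simply hasn't arrived.
    return -np.log((1.0 - p) + p * np.exp(-lam * elapsed))
```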

Fake negative weighted & calibration

Fake negative calibration applies no weights to the training samples; it only calibrates the output of the network, using the formulation sketched below.
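A sketch of both ideas, assuming every impression is first logged as a negative and clicked impressions are logged again as positives, so that b(y=1|x) = p/(1+p); helper names are mine and the exact weighting in the paper may differ in detail:

```python
import numpy as np

def fn_calibrated_ctr(b_pred):
    """Fake negative calibration (sketch): a model trained directly on the
    fake-negative stream estimates b(y=1|x) = p / (1 + p); inverting this
    gives the corrected click probability p = b / (1 - b)."""
    return b_pred / (1.0 - b_pred)

def fn_weighted_loss(f_pred, label):
    """Fake negative weighted loss (sketch): log loss on the biased stream,
    importance-weighted by p/b.  Under the same assumption the weights are
    (1 + p) for positives and (1 - p)(1 + p) for negatives, where p is the
    model's own prediction treated as a constant (no gradient through it)."""
    p = float(f_pred)  # stop-gradient stand-in for this plain-numpy sketch
    if label == 1:
        return -(1.0 + p) * np.log(f_pred)
    return -(1.0 - p) * (1.0 + p) * np.log(1.0 - f_pred)
```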

Experiments

Offline experiments

Criteo data
  ○ Small & public dataset
  ○ Training: 15.5M examples / Testing: 3.5M examples

RCE: normalised version of cross-entropy (higher values are better)
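A sketch of how a relative cross-entropy metric like this is commonly computed; the baseline is assumed here to be a model that always predicts the empirical average CTR, which the slide does not spell out:

```python
import numpy as np

def rce(y_true, y_pred, eps=1e-12):
    """Relative cross entropy (sketch): percentage improvement of the model's
    average log loss over a baseline that always predicts the average CTR.
    Higher is better; 0 means no better than the constant baseline."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.clip(np.asarray(y_pred, dtype=float), eps, 1.0 - eps)
    ce_model = -np.mean(y_true * np.log(y_pred) + (1 - y_true) * np.log(1 - y_pred))
    ctr = np.clip(y_true.mean(), eps, 1.0 - eps)
    ce_base = -np.mean(y_true * np.log(ctr) + (1 - y_true) * np.log(1 - ctr))
    return 100.0 * (1.0 - ce_model / ce_base)
```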

Offline experiments

Twitter data
  ○ Large & proprietary due to user information
  ○ Training: 668M ads with fake negatives / Testing: 7M ads

Online experiment (A/B test)

Pooled RCE: RCE on the combined traffic generated by the models
RPMq: revenue per thousand requests

Conclusions

● Solve the problem of delayed feedback in continuous training by relying on importance weights
● FN weighted and FN calibration losses proposed and applied for the first time
● Offline evaluation on a large proprietary dataset and an online A/B test


Future directions

● Address catastrophic forgetting and overfitting

● Exploration / exploitation strategies

Questions?

https://careers.twitter.com

@s0f1ra
