Parameter and Structure Learning Dhruv Batra, 10-708 Recitation 10/02/2008

Parameter and Structure Learning

Dhruv Batra,10-708 Recitation

10/02/2008

Overview• Parameter Learning

– Classical view, estimation task– Estimators, properties of estimators– MLE, why MLE?– MLE in BNs, decomposability

• Structure Learning– Structure score, decomposable scores – TAN, Chow-Liu– HW2 implementation steps

Note• Plagiarism alert

– Some slides taken from others– Credits/references at the end

Parameter Learning• Classical statistics view / Point Estimation

– Parameters unknown but not random– Point estimation = “find the right parameter”– Estimate parameters (or functions of parameters) of the

model from data

• Estimators– Any statistic– Function of data alone

• Say you have a dataset– Need to estimate mean– Is 5, an estimator?– What would you do?

Properties of estimator

• Since estimator gives rise an estimate that depends on sample points (x1,x2,,,xn) estimate is a function of sample points.

• Sample points are random variable therefore estimate is random variable and has probability distribution.

• We want that estimator to have several desirable properties like• Consistency• Unbiasedness• Minimum variance

• In general it is not possible for an estimator to have all these properties.

So why MLE?• MLE has some nice properties

– MLEs are often simple and easy to compute.– MLEs have asymptotic optimality properties (consistency

and efficiency).– MLEs are invariant under reparameterization.– and more..

Let’s try

Back to BNs• MLE in BN

– Data– Model DAG G– Parameters CPTs– Learn parameters from data

structure parameters

CPTs –

P(Xi| PaXi)

10-708 – Carlos Guestrin 2006-2008

Learning the CPTs

DataFor each discrete variable Xi

Example• Learning MLE parameters

Learning the CPTs

DataFor each discrete variable Xi

Maximum likelihood estimation (MLE) of BN parameters – example

• Given structure, log likelihood of data:

Flu Allergy

Decomposability• Likelihood Decomposition

• Local likelihood function

What’s the difference?

Global parameter

independence!

Taking derivatives of MLE of BN parameters – General case

Structure Learning• Constraint Based

– Check independences, learn PDAG– HW1

• Score Based– Give a score for all possible structures– Maximize score

Score Based• What’s a good score function?

• How about our old friend, log likelihood?

• So here’s our score function:

Score Based• [Defn]: Decomposable scores

• Why do we care about decomposable scores?

• Log likelihood based score decomposes!

Need regularization

Score Based• Chow-Liu

Score Based• Chow-Liu modification for TAN (HW2)

Slide and other credits• Zoubin Ghahramani, guest lectures in 10-702• Andrew Moore tutorial

– http://www.autonlab.org/tutorials/mle.html

• http://cnx.org/content/m11446/latest/• Lecture slides by Carlos Guestrin

Parameter and Structure Learning Dhruv Batra, 10-708 Recitation 10/02/2008

Documents

L24 less labels - Georgia Institute of TechnologyRelated Work: Meta-Learning (C) Dhruv Batra & Zsolt Kira 16 • Training a recurrent neural network to optimize – outputs update,

Abstract arXiv:1902.01385v1 [cs.LG] 4 Feb 2019 · 2019-02-05 · Devendra Singh Chaplot1,2, Lisa Lee 1, Ruslan Salakhutdinov , Devi Parikh 2,3, Dhruv Batra 1Carnegie Mellon University,

Practical No-box Adversarial Attacks against DNNs ... · Dhruv Batra. Grad-cam: Visual explanations from deep networks via gradient-based localization. In ICCV, 2017. [7] Karen Simonyan

L21 advanced topics2014/05/09 · ECE 6504: Advanced Topics in Machine Learning Probabilistic Graphical Models and Large-Scale Learning Dhruv Batra Virginia Tech Topics – Summary

Active Appearance Models Dhruv Batra ECE CMU. Active Appearance Models 1.T.F.Cootes, G.J. Edwards and C.J.Taylor. "Active Appearance Models", in Proc

ECE 6504: Deep Learning for Perception Dhruv Batra Virginia Tech Topics: –Recurrent Neural Networks (RNNs) –BackProp Through Time (BPTT) –Vanishing / Exploding

Beyond Trees: MRF Inference via Outer-Planar Decomposition [(To appear) CVPR ‘10] Dhruv Batra Carnegie Mellon University Joint work: Andrew Gallagher (Eastman

CVPR 2013 Diversity Tutorial Closing Remarks: What can we do with multiple diverse solutions? Dhruv Batra Virginia Tech

ECE 6504: Deep Learning for Perception Dhruv Batra Virginia Tech Topics: –Toeplitz Matrix –1x1 Convolution –AKA How to run a ConvNet on arbitrary sized

Dynamic Tree Block Coordinate Ascent Daniel Tarlow 1, Dhruv Batra 2 Pushmeet Kohli 3, Vladimir Kolmogorov 4 1: University of Toronto3: Microsoft Research

Visual Dialog Abhishek Das, Satwik Kottur, Khushi Gupta ... · Abhishek Das, Satwik Kottur, Khushi Gupta, Avi Singh, Deshraj Yadav, José M.F. Moura, Devi Parikh, Dhruv Batra Presented

ECE 5984: Introduction to Machine Learnings15ece5984/slides/L8_regression.pptx.pdf · ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech ... exam scores • Learn

CS 4803 / 7643: Deep Learning · 2020. 9. 15. · CS 4803 / 7643: Deep Learning Dhruv Batra Georgia Tech Topics: – Convolutional Neural Networks – What is a convolution? – FC

CS 4803 / 7643: Deep Learning...(C) Dhruv Batra 2. Recap from last time (C) Dhruv Batra 3. Convolutional Neural Networks (without the brain stuff) Slide Credit: Fei-Fei Li, Justin

ECE 6504: Deep Learning for Perception Dhruv Batra Virginia Tech Topics: –(Finish) Backprop –Convolutional Neural Nets

VBM683 Machine Learning - Hacettepepinar/courses/VBM683/lectures/nearest... · VBM683 Machine Learning Pinar Duygulu Slides are adapted from Dhruv Batra, ... be used as an efficient

An Efficient Message-Passing Algorithm for the M-Best MAP Problem Dhruv Batra (Currently) Research Assistant Professor TTI-Chicago (Spring 2013) Assistant

Object-Proposal Evaluation Protocol is ‘Gameable’ · 2015-11-24 · Object-Proposal Evaluation Protocol is ‘Gameable’ Neelima Chavali Harsh Agrawal Aroma Mahendru Dhruv Batra

VBM683 Machine Learning - Hacettepe Üniversitesipinar/courses/VBM683/lectures/introduction.pdf · A Brief History of AI (C) Dhruv Batra • ^We propose that a 2 month, 10 man study

Sort Story: Sorting Jumbled Images and Captions into · PDF fileSort Story: Sorting Jumbled Images and Captions into Stories Harsh Agrawal;1Arjun Chandrasekaran y Dhruv Batra 3;1Devi