
Semi-Supervised Learning

Barnabas Poczos

Slides Courtesy: Jerry Zhu, Aarti Singh

Supervised Learning

[Diagram: labeled training data, drawn from a feature space X and a label space Y, are fed to the learning algorithm, which outputs a prediction rule.]

Goal: learn a good prediction rule f : X → Y.

The optimal predictor (the Bayes rule) depends on the unknown distribution P_XY, so instead we learn a good prediction rule from the training data.

Labeled and Unlabeled data

[Diagram: unlabeled raw data vs. data labeled by a human expert, special equipment, or an experiment.]

Unlabeled data are cheap and abundant; labeled data are expensive and scarce. Example label sets: “Crystal” / “Needle” / “Empty” for scientific images, “0” / “1” / “2” / … for handwritten digits, “Sports” / “News” / “Science” for documents.

Free-of-cost labels?

Luis von Ahn: Games with a purpose (reCAPTCHA)

A word that is challenging for OCR (Optical Character Recognition) is shown to users; by typing it, you provide a free label!

Semi-Supervised learning

Supervised learning (SL): only the labeled data are fed to the learning algorithm.

Semi-Supervised learning (SSL): both the labeled and the unlabeled data are fed to the learning algorithm.

Goal: learn a better prediction rule than is possible from the labeled data alone.

Semi-Supervised learning in Humans


Can unlabeled data help?

Assume each class is a coherent group (e.g. Gaussian)

Then unlabeled data can help identify the boundary more accurately.

[Figure: positive and negative labeled points plus unlabeled points, with the supervised decision boundary and the semi-supervised decision boundary.]

Can unlabeled data help?

[Figure: unlabeled handwritten digit images, to be assigned the labels “0”, “1”, “2”, …]

“Similar” data points have “similar” labels.

This embedding can be done by manifold learning algorithms

Some SSL Algorithms

▪ Self-Training

▪ Generative methods, mixture models

▪ Graph-based methods

▪ Co-Training

▪ Semi-supervised SVM

▪ Many others


Notation

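The notation commonly used for SSL, and assumed in the sketches below: labeled data (x_1, y_1), …, (x_l, y_l) drawn from the joint distribution P_XY, unlabeled data x_{l+1}, …, x_{l+u} drawn from the marginal P_X, typically with u ≫ l.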

Self-training


Self-training Example

Propagating 1-NN

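Self-training follows a generic loop: train a classifier on the labeled data, predict labels for the unlabeled data, move the most confident predictions into the labeled set, and repeat. A minimal numpy sketch of the “propagating 1-NN” variant used in the example, where at each step the unlabeled point closest to any labeled point adopts its neighbor's label (function and variable names are illustrative, not from the slides):

```python
import numpy as np

def propagating_1nn(X_l, y_l, X_u):
    """Self-training with a 1-NN base learner: repeatedly label the
    unlabeled point closest to any currently labeled point."""
    X_l, y_l, X_u = X_l.copy(), y_l.copy(), X_u.copy()
    while len(X_u) > 0:
        # pairwise distances between unlabeled and labeled points
        d = np.linalg.norm(X_u[:, None, :] - X_l[None, :, :], axis=-1)
        i, j = np.unravel_index(np.argmin(d), d.shape)
        # adopt the label of the nearest labeled neighbor (pseudo-label)
        X_l = np.vstack([X_l, X_u[i]])
        y_l = np.append(y_l, y_l[j])
        X_u = np.delete(X_u, i, axis=0)
    return X_l, y_l

# toy usage: two labeled points, a few unlabeled ones
X_l = np.array([[0.0, 0.0], [5.0, 5.0]])
y_l = np.array([0, 1])
X_u = np.array([[0.5, 0.2], [4.5, 5.1], [2.4, 2.5]])
print(propagating_1nn(X_l, y_l, X_u)[1])   # propagated labels
```

Because mistakes made early are never revisited, self-training can reinforce its own errors; the propagating 1-NN example illustrates how a single outlier can drag an entire region to the wrong label.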

Mixture Models for Labeled Data


Mixture Models for Labeled Data

Decision rule: predict class 1 for a point x if P(Y = 1 | X = x) > 1/2, and class 2 otherwise.

Estimate the mixture parameters from the labeled data; the resulting rule gives a decision for any test point, not only points in the labeled dataset.
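Written out, the decision rule follows from Bayes' rule applied to a two-component Gaussian mixture (one standard form, not necessarily the exact expression on the slide):

$$
P(Y = 1 \mid X = x) \;=\; \frac{\pi_1\, \mathcal{N}(x;\, \mu_1, \Sigma_1)}{\pi_1\, \mathcal{N}(x;\, \mu_1, \Sigma_1) + \pi_2\, \mathcal{N}(x;\, \mu_2, \Sigma_2)} \;>\; \tfrac{1}{2},
$$

where the class priors $\pi_k$ and the Gaussian parameters $(\mu_k, \Sigma_k)$ are estimated from the labeled data (class proportions, class-wise sample means and covariances).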

Mixture Models for Labeled Data


Mixture Models for SSL Data


Mixture Models


Mixture Models: SL vs SSL

Mixture Models


Gaussian Mixture Models


EM for Gaussian Mixture Models

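As a rough sketch of how EM is typically adapted to partly labeled data (assumptions: one spherical Gaussian per class with a known common variance, integer class labels 0..K-1, illustrative names, not code from the slides): labeled points keep their known class as a fixed, hard responsibility, while unlabeled points receive soft responsibilities in each E-step.

```python
import numpy as np

def ssl_gmm_em(X_l, y_l, X_u, n_classes, n_iter=50, var=1.0):
    """EM for a Gaussian mixture with one spherical component per class.
    Labeled points have fixed one-hot responsibilities; unlabeled points
    get soft responsibilities in the E-step."""
    X = np.vstack([X_l, X_u])
    n_l = len(X_l)
    # initialize responsibilities: one-hot for labeled, uniform for unlabeled
    R = np.full((len(X), n_classes), 1.0 / n_classes)
    R[:n_l] = 0.0
    R[np.arange(n_l), y_l] = 1.0
    for _ in range(n_iter):
        # M-step: class priors and means from the current responsibilities
        Nk = R.sum(axis=0)
        pi = Nk / Nk.sum()
        mu = (R.T @ X) / Nk[:, None]
        # E-step (unlabeled rows only): posterior over classes
        d2 = ((X[n_l:, None, :] - mu[None, :, :]) ** 2).sum(-1)
        log_p = np.log(pi)[None, :] - d2 / (2 * var)
        log_p -= log_p.max(axis=1, keepdims=True)   # numerical stability
        p = np.exp(log_p)
        R[n_l:] = p / p.sum(axis=1, keepdims=True)
    return pi, mu, R
```

If the model assumption is wrong (the classes are not well described by one Gaussian each), the unlabeled data can actively hurt, which is what the following "Assumption for GMMs" slides illustrate.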

Assumption for GMMs


Assumption for GMMs


Assumption for GMMs


Related: Cluster and Label

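"Cluster and label" is the related heuristic: cluster the labeled and unlabeled data together, then give every point in a cluster the majority label of the labeled points that fall into it. A minimal sketch assuming scikit-learn's KMeans and integer class labels (the function name and details are illustrative):

```python
import numpy as np
from sklearn.cluster import KMeans

def cluster_and_label(X_l, y_l, X_u, n_clusters):
    """Cluster labeled + unlabeled data together, then label every point in
    a cluster with the majority label of the labeled points it contains.
    Returns -1 for points in clusters with no labeled evidence."""
    X = np.vstack([X_l, X_u])
    assign = KMeans(n_clusters=n_clusters, n_init=10).fit_predict(X)
    y_u = np.full(len(X_u), -1, dtype=int)
    for c in range(n_clusters):
        labeled_in_c = y_l[assign[:len(X_l)] == c]
        if len(labeled_in_c) == 0:
            continue  # no labeled point fell into this cluster
        y_u[assign[len(X_l):] == c] = np.bincount(labeled_in_c).argmax()
    return y_u
```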

Graph Based Methods

Assumption: Similar unlabeled data have similar labels.


Graph Regularization

Similarity Graphs: model local neighborhood relations between data points.

Assumption: nodes connected by heavy edges tend to have similar labels.

Graph Regularization

If data points i and j are similar (i.e. the weight w_ij is large), then their labels should be similar: f_i ≈ f_j.

The objective combines a loss on the labeled data (e.g. mean square or 0-1 loss) with a graph-based smoothness prior over the labeled and unlabeled data.
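A common concrete form of this objective (one standard choice, not necessarily the exact formula on the slide) is min_f Σ_{i ∈ labeled} (f_i − y_i)² + λ Σ_{i,j} w_ij (f_i − f_j)². When the labeled values are clamped exactly, the minimizer is the harmonic function of Zhu, Ghahramani & Lafferty (2003), which the sketch below computes, assuming a precomputed symmetric weight matrix and 0/1 labels (names are illustrative):

```python
import numpy as np

def harmonic_label_propagation(W, y_l, labeled_idx):
    """Harmonic-function solution to graph-based SSL:
    minimize sum_ij w_ij (f_i - f_j)^2 subject to f = y on labeled nodes.
    W: (n, n) symmetric similarity (weight) matrix.
    y_l: 0/1 labels of the labeled nodes, in the order of labeled_idx."""
    n = W.shape[0]
    unlabeled_idx = np.setdiff1d(np.arange(n), labeled_idx)
    D = np.diag(W.sum(axis=1))
    L = D - W                                   # graph Laplacian
    L_uu = L[np.ix_(unlabeled_idx, unlabeled_idx)]
    W_ul = W[np.ix_(unlabeled_idx, labeled_idx)]
    # f_u = L_uu^{-1} W_ul y_l  (solve the linear system instead of inverting)
    f_u = np.linalg.solve(L_uu, W_ul @ y_l)
    f = np.zeros(n)
    f[labeled_idx] = y_l
    f[unlabeled_idx] = f_u
    return f    # threshold at 0.5 for a hard 0/1 labeling
```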

Co-training

Co-training (Blum & Mitchell, 1998) (Mitchell, 1999) assumes that (i) features can be split into two sets; (ii) each sub-feature set is sufficient to train a good classifier.

• Initially two separate classifiers are trained with the labeled data, on the two sub-feature sets respectively.

• Each classifier then classifies the unlabeled data and ‘teaches’ the other classifier with the few unlabeled examples (together with their predicted labels) about which it is most confident.

• Each classifier is retrained with the additional training examples given by the other classifier, and the process repeats.


Co-training Algorithm

Co-training Algorithm (Blum & Mitchell, 1998)
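A minimal sketch of the loop described above, assuming scikit-learn-style classifiers, two feature views given as column-index arrays, and a single shared pool of pseudo-labeled examples (a simplification of Blum & Mitchell's original algorithm; all names are illustrative):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def co_training(X_l, y_l, X_u, view1, view2, rounds=10, k=2):
    """Co-training in the spirit of Blum & Mitchell (1998): two classifiers,
    one per feature view, repeatedly teach each other by moving their most
    confident predictions from the unlabeled pool into the labeled set.
    view1 / view2 are arrays of column indices defining the two views."""
    X_l, y_l, X_u = X_l.copy(), y_l.copy(), X_u.copy()
    for _ in range(rounds):
        clf1 = LogisticRegression(max_iter=1000).fit(X_l[:, view1], y_l)
        clf2 = LogisticRegression(max_iter=1000).fit(X_l[:, view2], y_l)
        if len(X_u) == 0:
            break
        moved = set()
        for clf, view in ((clf1, view1), (clf2, view2)):
            proba = clf.predict_proba(X_u[:, view])
            conf = proba.max(axis=1)
            for i in np.argsort(-conf)[:k]:     # k most confident examples
                if i not in moved:
                    moved.add(i)
                    X_l = np.vstack([X_l, X_u[i]])
                    y_l = np.append(y_l, clf.classes_[proba[i].argmax()])
        X_u = np.delete(X_u, list(moved), axis=0)
    return clf1, clf2
```

The method helps when the two views are (roughly) conditionally independent given the label and each view alone is sufficient for classification, as stated in the assumptions above.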

Semi-Supervised SVMs

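In one standard formulation (not necessarily the exact one on these slides), the semi-supervised SVM augments the usual hinge-loss objective with a penalty that pushes the decision boundary away from the unlabeled points:

$$
\min_{w,\,b}\; \tfrac{1}{2}\lVert w\rVert^2
\;+\; C_1 \sum_{i \in \text{labeled}} \max\!\bigl(0,\; 1 - y_i\,(w^\top x_i + b)\bigr)
\;+\; C_2 \sum_{j \in \text{unlabeled}} \max\!\bigl(0,\; 1 - \lvert w^\top x_j + b\rvert\bigr).
$$

The unlabeled term (the “hat loss”) makes the objective non-convex, so S3VMs are usually trained with heuristics such as label switching, continuation, or convex relaxations.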

Semi-Supervised Learning

▪ Generative methods

▪ Graph-based methods

▪ Co-Training

▪ Semi-Supervised SVMs

▪ Many other methods

SSL algorithms can use unlabeled data to improve prediction accuracy, provided the data satisfy appropriate assumptions.
