30
Tangent space intrinsic manifold regularization for data representation Shiliang Sun Department of Computer Science and Technology East China Normal University, Shanghai, China [email protected] S. Sun (ECNU) PRML Research Group July 8, 2013 1 / 22

Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

  • Upload
    others

  • View
    6

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Tangent space intrinsic manifoldregularization for data representation

Shiliang Sun

Department of Computer Science and TechnologyEast China Normal University, Shanghai, China

[email protected]

S. Sun (ECNU) PRML Research Group July 8, 2013 1 / 22

Page 2: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Regularization

The principle of regularization has its root inmathematics to solve ill-posed problems, and iswidely used in pattern recognition and machinelearning.

Many well-known algorithms, e.g., SVMs, ridgeregression and lasso, can be interpreted asinstantiations of the idea of regularization.

S. Sun (ECNU) PRML Research Group July 8, 2013 2 / 22

Page 3: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Regularization

The principle of regularization has its root inmathematics to solve ill-posed problems, and iswidely used in pattern recognition and machinelearning.

Many well-known algorithms, e.g., SVMs, ridgeregression and lasso, can be interpreted asinstantiations of the idea of regularization.

The regularization method presented in this paper isintrinsic to data manifold which prefers linearfunctions on the manifold. We further apply it todata representation or dimensionality reduction.

S. Sun (ECNU) PRML Research Group July 8, 2013 2 / 22

Page 4: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Manifold Learning

Data lying in a high-dimensional space are assumedto be intrinsically of low dimensionality.That is, data can be well characterized by far fewerparameters or degrees of freedom than the actualambient representation.

This setting is usually referred to as manifoldlearning, and the distribution of data is regarded tolive on or near a low-dimensional manifold.

S. Sun (ECNU) PRML Research Group July 8, 2013 3 / 22

Page 5: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Manifold Learning

Data lying in a high-dimensional space are assumedto be intrinsically of low dimensionality.That is, data can be well characterized by far fewerparameters or degrees of freedom than the actualambient representation.

This setting is usually referred to as manifoldlearning, and the distribution of data is regarded tolive on or near a low-dimensional manifold.

The validity of manifold learning, especially forhigh-dimensional image and text data, has alreadybeen testified by recent developments.

S. Sun (ECNU) PRML Research Group July 8, 2013 3 / 22

Page 6: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Data Representation

The problem of representing data in a low-dimensionalspace for the sake of data visualization and organizationis essentially a dimensionality reduction problem.

Data Representation or Dimensionality Reduction

Given a data set {xi}ki=1 with xi ∈ R

d , the task is todeliver a data set {fi}k

i=1 where fi ∈ Rm corresponds to

xi and m << d .

S. Sun (ECNU) PRML Research Group July 8, 2013 4 / 22

Page 7: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Data Representation

The problem of representing data in a low-dimensionalspace for the sake of data visualization and organizationis essentially a dimensionality reduction problem.

Data Representation or Dimensionality Reduction

Given a data set {xi}ki=1 with xi ∈ R

d , the task is todeliver a data set {fi}k

i=1 where fi ∈ Rm corresponds to

xi and m << d .

Representative methods: PCA, MDS, isomap, LLE,maximum variance unfolding (MVU), Laplacianeigenmap.

S. Sun (ECNU) PRML Research Group July 8, 2013 4 / 22

Page 8: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Outline

1 The Proposed Regularization

2 Generalization and Reformulation

3 Application to Data Representation and Results

S. Sun (ECNU) PRML Research Group July 8, 2013 5 / 22

Page 9: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Tangent Space Intrinsic Manifold Regularization (TSIMR)

We are interested in estimating a function f (x) definedon M ⊂ R

d , where M is a smooth manifold on Rd . We

assume that f (x) can be well approximated by a linearfunction with respect to the manifold M.Let m be the dimensionality of M.

S. Sun (ECNU) PRML Research Group July 8, 2013 6 / 22

Page 10: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Tangent Space Intrinsic Manifold Regularization (TSIMR)

We are interested in estimating a function f (x) definedon M ⊂ R

d , where M is a smooth manifold on Rd . We

assume that f (x) can be well approximated by a linearfunction with respect to the manifold M.Let m be the dimensionality of M.

At each point z ∈ M, f (x) can be represented as alinear function f (x) ≈ bz + w

⊤z uz(x) locally around z,

where uz(x) = Tz(x − z) is an m-dimensional vectorrepresenting x in the tangent space around z, and Tz isan m × d matrix that projects x around z to arepresentation in the tangent space of M at z.

S. Sun (ECNU) PRML Research Group July 8, 2013 6 / 22

Page 11: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Tangent Space Intrinsic Manifold Regularization (TSIMR)

f (x) ≈ bz + w⊤zuz(x), uz(x) = Tz(x − z) (1)

Note that the basis for Tz is computed using localPCA for its simplicity and wide applicability.

The weight vector wz ∈ Rm is an m-dimensional

vector, and it is also the manifold-gradient of f (x)at z with respect to the uz(·) representation on themanifold, which we write as ∇T f (x)|x=z = wz.

S. Sun (ECNU) PRML Research Group July 8, 2013 7 / 22

Page 12: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

TSIMR: Derivation

To see how our approach works, we assume for simplicitythat Tz is an orthogonal matrix for all z: TzT

⊤z

= I(m×m).This means that if x ∈ M is close to z ∈ M, thenx − z ≈ T

⊤z Tz(x − z).

Now consider x that is close to both z and z′. We can

express f (x) both in the tangent space representation atz and z

′, which gives bz + w⊤zuz(x) ≈ bz′ + w

⊤z′uz′(x).

S. Sun (ECNU) PRML Research Group July 8, 2013 8 / 22

Page 13: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

TSIMR: Derivation

To see how our approach works, we assume for simplicitythat Tz is an orthogonal matrix for all z: TzT

⊤z

= I(m×m).This means that if x ∈ M is close to z ∈ M, thenx − z ≈ T

⊤z Tz(x − z).

Now consider x that is close to both z and z′. We can

express f (x) both in the tangent space representation atz and z

′, which gives bz + w⊤zuz(x) ≈ bz′ + w

⊤z′uz′(x).

This means that

bz + w⊤zTz(x − z) ≈ bz′ + w

⊤z′Tz′(x − z

′). (2)

S. Sun (ECNU) PRML Research Group July 8, 2013 8 / 22

Page 14: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

TSIMR: Derivation

bz ≈ bz′ + w⊤z′Tz′(z − z

′) , (3)

wz ≈ TzT⊤z′wz′ . (4)

S. Sun (ECNU) PRML Research Group July 8, 2013 9 / 22

Page 15: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

TSIMR

Therefore, if we expand at points {z1, . . . , zk} , Z , anddenote neighbors of zj as N (zj), the regularizer will be

R({bz,wz}z∈Z ) =k

i=1

j∈N (zi)

[(

bzi− bzj

w⊤zjTzj

(zi − zj))2

+ γ‖wzi− Tzi

T⊤zjwzj

‖22

]

. (5)

S. Sun (ECNU) PRML Research Group July 8, 2013 10 / 22

Page 16: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

TSIMR

Therefore, if we expand at points {z1, . . . , zk} , Z , anddenote neighbors of zj as N (zj), the regularizer will be

R({bz,wz}z∈Z ) =k

i=1

j∈N (zi)

[(

bzi− bzj

w⊤zjTzj

(zi − zj))2

+ γ‖wzi− Tzi

T⊤zjwzj

‖22

]

. (5)

With z(x) = arg minz∈Z ‖x − z‖2, the function f (x) isapproximated as f (x) = bz(x) + w

⊤z(x)Tz(x)

(

x − z(x))

,

which is a very natural formulation for out-of-exampleextensions.

S. Sun (ECNU) PRML Research Group July 8, 2013 10 / 22

Page 17: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Outline

1 The Proposed Regularization

2 Generalization and Reformulation

3 Application to Data Representation and Results

S. Sun (ECNU) PRML Research Group July 8, 2013 11 / 22

Page 18: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Generalization and Reformulation

Idea: Weighting the regularizer with point adjacency.

S. Sun (ECNU) PRML Research Group July 8, 2013 12 / 22

Page 19: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Generalization and Reformulation

Idea: Weighting the regularizer with point adjacency.

R({bz,wz}z∈Z ) =

k∑

i=1

k∑

j=1

Wij

[(

bzi− bzj

w⊤zjTzj

(zi − zj))2

+ γ‖wzi− Tzi

T⊤zjwzj

‖22

]

. (6)

S. Sun (ECNU) PRML Research Group July 8, 2013 12 / 22

Page 20: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Generalization and Reformulation

Reformulated as a standard quadratic form:

R({bz,wz}z∈Z ) =

bz1...

bzk

wz1...

wzk

(

S1 S2

S⊤2 S3

)

bz1...

bzk

wz1...

wzk

.

(7)

Denote

(

S1 S2

S⊤2 S3

)

by S .

S. Sun (ECNU) PRML Research Group July 8, 2013 13 / 22

Page 21: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Outline

1 The Proposed Regularization

2 Generalization and Reformulation

3 Application to Data Representation and Results

S. Sun (ECNU) PRML Research Group July 8, 2013 14 / 22

Page 22: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Application to Data Representation

Define w = (w⊤z1,w⊤

z2, . . . ,w⊤

zk)⊤.

Suppose vector f = (bz1, bz2

, . . . , bzk)⊤ is an embedding

or representation of points z1, z2, . . . , zk in a line.

A reasonable criterion for finding a good embeddingunder the principle of the tangent space intrinsicmanifold regularization is to minimize the objective(

f

w

)⊤

S

(

f

w

)

under appropriate constraints.

S. Sun (ECNU) PRML Research Group July 8, 2013 15 / 22

Page 23: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Application to Data Representation

To remove an arbitrary scaling factor in both f and w,

we take into account the constraint

(

f

w

)⊤(

f

w

)

= 1.

Therefore, the optimization problem becomes

minf,w

(

f

w

)⊤

S

(

f

w

)

, s.t.

(

f

w

)⊤(

f

w

)

= 1. (8)

S. Sun (ECNU) PRML Research Group July 8, 2013 16 / 22

Page 24: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Application to Data Representation

To remove an arbitrary scaling factor in both f and w,

we take into account the constraint

(

f

w

)⊤(

f

w

)

= 1.

Therefore, the optimization problem becomes

minf,w

(

f

w

)⊤

S

(

f

w

)

, s.t.

(

f

w

)⊤(

f

w

)

= 1. (8)

The solution is given by the eigenvector corresponding tothe minimal eigenvalue of the eigen-decomposition

S

(

f

w

)

= λ

(

f

w

)

.

Easily extendable to find multidimensional embeddings.S. Sun (ECNU) PRML Research Group July 8, 2013 16 / 22

Page 25: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Results (1)

Embedding results of the TSIMR on the Swiss roll.

S. Sun (ECNU) PRML Research Group July 8, 2013 17 / 22

Page 26: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Results (2)

Embedding results on the Swiss roll with a hole.

S. Sun (ECNU) PRML Research Group July 8, 2013 18 / 22

Page 27: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Results (3)

Embedding results on the Swiss roll with a LONG hole.

S. Sun (ECNU) PRML Research Group July 8, 2013 19 / 22

Page 28: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Results (4.1)

Embedding results on the face images with varying poseand expression.

S. Sun (ECNU) PRML Research Group July 8, 2013 20 / 22

Page 29: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

Results (4.2)

Embedding results on the face images with varying poseand expression.

S. Sun (ECNU) PRML Research Group July 8, 2013 21 / 22

Page 30: Tangent space intrinsic manifold regularization for data representation … · 2018. 8. 10. · Tangent space intrinsic manifold regularization for data representation Shiliang Sun

The End

S. Sun (ECNU) PRML Research Group July 8, 2013 22 / 22