24
Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Siemens Corporate Research Generalized Sparse Signal Mixing Model and Application to Noisy Blind Source Separation Justinian Rosca Christian Borss # Radu Balan Siemens Corporate Research, Princeton, USA # Presently: University of Brawnschweig

Siemens Corporate Research Rosca et al. Generalized Sparse Mixing Model BSS ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Embed Size (px)

DESCRIPTION

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Motivation Solve ICA problem in realistic scenarios In the presence of noise. Is this really feasible? When A is “fat” (degenerate) Successful DUET/Time-frequency masking - approach and implementation Can we do better if we relax the DUET assumption about number of sources “active” at any time- frequency point? [Rickard et al. 2000,2001]

Citation preview

Page 1: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

Generalized Sparse Signal Mixing Model and Application to Noisy Blind Source Separation

Justinian Rosca

Christian Borss #

Radu Balan

Siemens Corporate Research, Princeton, USA

# Presently: University of Brawnschweig

Page 2: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

ICA/BSS Scenario

Signal Processor

x1 x2 xD

S1, S2 ,…,SL

s1

s2

sL-1

sL

n

L sourcesD microphones

ji n s i

jij ax

NSAX

DL ,"fat" isA

Page 3: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

Motivation

• Solve ICA problem in realistic scenarios• In the presence of noise. Is this really feasible?• When A is “fat” (degenerate)

• Successful DUET/Time-frequency masking - approach and implementation

• Can we do better if we relax the DUET assumption about number of sources “active” at any time-frequency point? [Rickard et al. 2000,2001]

Page 4: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

Sparseness in TF

• DUET assumption: the maximum number of sources active at any time - frequency point in a mixture of signals is one

ji 0),(),( :ityorthogonaldisjoint -W tStS ji

Page 5: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

Example Voice Signal and TF Representation

Siemens Corporate ResearchJ.Rosca et al. – Scalable BSS under Noise – DAGA, Aachen 2003

Page 6: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

Sparseness in TF

• Sources hop from one set of frequencies to another over time, with no collisions (at most one source active at any time-freq. point)

s1

s2

s3

1

2

3

Page 7: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

Generalized Sparseness in TF

• Sources hop from one set of frequencies to another over time, with collisions (at most N sources active at any time-freq. point)

s1

s2

s3

N=2, L=3

Page 8: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

The Independence AssumptionAssume the TF coefficient S(k,) is modeled as a product of a Bernoulli (0/1) r.v., V, and a continuous r.v. G:

The p.d.f. of S becomes:

For L independent signals the joint source pdf becomes:

),...,,(Rest)()()1()()1(),...,,( 212

1 ,1

1

121 L

L

l

L

ljjjl

LL

ll

LL SSSqSSpqqSqSSSp

),(),(),( kGkVkS

)()1()()( SqSqpSpS

•W-Disjoint Orthogonality (DUET): q very small→ retain first two terms; at most one source is active at any time-freq. point

•Generalized W-Disj.Orth.: q very small→ retain first N+1 terms; at most N sources are active at any time-freq. point

Page 9: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

Signal Model (1)• Assumptions:

– L sources, D sensors– Far-field– Direct-path– Noises iid, Gaussian (0,σ2)

lklk

L

lkllk

L

ll

kDktntstx

tntstx

1 ;2 ),()()(

)()()(

1

11

1

bS

1 2 … D

aS

Page 10: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

jkS

NmkRkS

j

mjm

,0),(

1 ),,(),(

Signal Model (2)

– Mixing model:

– Source sparseness in TF

– Let those be:

NmtS,...L},{},...,j{jN(t,ω

mj

N

1 ,0),( such that 21 indices most at ), 1

L

ldl

did kNkSekX l

1

)1( ),(),(),(

Page 11: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

Example

Let N=2, two sources active at any time-freq. point

L

k

tRtRtt

kiftRkiftR

kkiftS

Ltt

...),(),(),(),(

:are problem theof Parameters

),(),(

,0),(

},,...,1{),(),,(

21

21

21

22

11

21

2121

active is source iff ),(

},,...,1{)},{(:

llk

Lk

m

m

Page 12: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

BSS Problem

• Given measurements {x(t)}1<=t<=T , D sensors

• Determine estimate of parameters :

• Note: L>D, degenerate BSS problem

L

N kRkRRk

,...,)),(),...,,((

),(

1

1

Mixing parameters

Mapping and Source signals

Page 13: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

Approach: Two Steps

1. Estimate mixing parameters, e.g. using the stronger constraint of W-disjoint orthogonality

2. Estimate the source signals under the generalized W-disjoint orthogonality assumption

Page 14: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

Solution Sketch (Ad-Hoc)• Employ principle of coherence (e.g. N=2)

– Given a pair of sources Sa and Sb active at some time-freq. point, then what we know what we should measure at all microphones pairs!

– Sa and Sb are the true ones if they result in minimum variance across all microphone pairs, i.e. coherent measurements

– Note: For N=2 and L=4 there are 6 pair of sources to be tested! (1,2),(1,3),(1,4),(2,3),(2,4),(3,4)

),(

),(

j)(i, b),(a, edhypothesizfor known

2221

1211

),(),(),(),(

),( pairs mic allfor and ),(

jib

jia

j

i

SS

baRbaRbaRbaR

XX

jitfixed

bS

i j

aS

Page 15: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

Solution Sketch (ML-1)

• Maximize likelihood function L(,R)=p(X| ,R)

),(),(

where

}),(),(1exp{1),(

1

212

1

0 ),(2

L

ll

idd

dd

D

d k

kSekY

kYkXRL

lj

l

Page 16: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

Solution Sketch (ML-2)

• max L(,R), after taking log

10,

)(ˆ

),(),(min

,1

*1*

),(

1

0

21,

DdeMwhere

XMMMR

kYkX

ljidld

k

D

dddR

Njjj

jNjj

jNjj

iDiDiD

iii

iii

eeeeeeeee

M

)1()1()1(

222

111

...

...

...1...11

21

21

21

Page 17: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

Solution Sketch (ML-3)

• After substituting R:

projection orthogonal,

max

})({max

)(min

2*

2

*1**

),(

2*1*

MMMM

M

P

k

PPPP

XP

XMMMMX

XMMMMX

M

Page 18: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

Interpretation of Solution• Criterion:

– projection of X onto the span of columns of M

• Solution (“coherent” measurements)– N-dim subspace of CD closest to X among

all L-choose-N subspaces spanned by different combinations of N columns of the matrix M

• Existence iff N≤D-1

CD

1M

2M

NLM

Page 19: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

Experimental Results (1)

• Algorithm applied to realistic synthetic mixtures

• From anechoic, low echoic, echoic to strongly echoic

• 16kHz data, 256 sample window, 50% overlap, coherent noise, SIR (-5dB,10dB), 30 gradient steps/iteration (Step 2), 5 iterations

• Evaluation: SIRGain, SegmentalSNR, Distortion

Page 20: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

Example Sources L=4, Mics D=2, N=2

Mixing

Sources

Estimates

Page 21: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

Discussion: L=4 sources, D=2 mics is a case too simple?

-10123456789

10

1 2

# Sources Simultaneously Active

SIR

Gai

n ML-anechoic

AH-anechoicML-echoic

AH-echoic

• In some simulations, the N=2 assumption helps • Conjecture: approach is useful when N is a small

fraction of L

Page 22: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

Conclusion

• Contribution: ML approach to noisy BSS problem under generalized sparseness assumptions, addressing degenerate case D<L

• Estimation problem can be addressed using sparse decomposition techniques: progress is needed

Page 23: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

Thank you!

Real speech separation demo for those interested after session!

Page 24: Siemens Corporate Research Rosca et al.  Generalized Sparse Mixing Model  BSS  ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application

Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004

Siemens Corporate Research

Outline

• Generalized sparseness assumption• Signal model and assumptions• BSS problem definition• Solution sketch: Ad-hoc and ML estimators• Geometrical interpretation of solution• Experimental results• Conclusion