Pipelining and number theory for multiuser detection Sridhar Rajagopal and Joseph R. Cavallaro Rice...

Preview:

Citation preview

Pipelining and number theory for multiuser detection

Sridhar Rajagopal and Joseph R. Cavallaro

Rice University

This work is supported by Nokia, TI, TATP and NSF

Motivation

• Several multiuser detection schemes• Hardware implementation infeasible• Optimize algorithm + hardware

• Design a reduced complexity multiuser detection algorithm and its implementation for 3GPP W-CDMA

Approaches

• Algorithm– parallel interference cancellation– reduced complexity, parallel structure

• Pipelining – bit-streaming, avoid block detection

• Number theory– Most Significant Digit First (MSDF) computation– sign detection

Contributions

• decrease detection latency and storage requirements by window length (12X)

• On-line arithmetic based on MSDF computation– further latency reduction by 1.9X– increase in throughput by 3X– possible savings in area

Outline

• Parallel interference cancellation

• Pipelining

• On-line arithmetic

• Conclusions

• Current research directions

Multiuser detection

ri-2 ri-1 ri ri+1

Interference from previous bits of other

users

Interference fromfuture bits ofother users

Desired user

User 1

User j

ri

bibi+1

time

Multiuser detection

• Optimal - MLSE • Decorrelating• MMSE• Serial/Parallel interference

cancellation

• Top 3 require inversion of matrices• Block based detection

Parallel interference cancellation

)y(signd

d]SAARe[yy1l

lH

AAAAAA

AAAAAAAAAAAA

H1

H00

1H

H0 1

HH00H

H0

H

01

1101

100

00

0

00

d

d

d

d

D,K

D,1

1,K

1,1

Block based detection

Block detection

0 B its 1 -1 0 1 1 0 B its 1 -1 0 1 1 0 B its 1 -1 0 1 1

B its 1 -1 0 de te c te din a blo c k

M a tc he d F ilte r P IC (S ta g e 1 ) P IC (S ta g e 3 )

O ve rhe a d B its D e te c tio n W indo w (D = 1 2 )

Outline

• Parallel interference cancellation

• Pipelining

• On-line arithmetic

• Conclusions

• Current research directions

Parallel interference cancellation

)y(signd

d]SAARe[yy1l

lH

AAAAAA

AAAAAAAAAAAA

H1

H00

1H

H0 1

HH00H

H0

H

01

1101

100

00

0

00

d

d

d

d

D,K

D,1

1,K

1,1

1ii1i RdCdLdyy

Block Toeplitz structure - suitable for pipelining

Pipelined detection

5 6 7 8 9 1 0 1 1 1 21 2 3 4

5 6 7 8 9 1 0 1 1 1 21 2 3 4

5 6 7 8 9 1 0 1 1 1 21 2 3 4

5 6 7 8 9 1 0 1 1 1 21 2 3 4

M atch edF ilter

P IC (S tag e 1 )

P IC (S tag e 2 )

P IC (S tag e 3 )

T im e ( i )

^

^

^

^

d i

d i+ 2

d i-2

d i-4

M atc he d F i l te r

Adde r

L d i -1L T d i + 1 C d i

Si g nD e te c t i o n

Stag e 1

r i + 3 A0 , A1

C LR = L Td i + 2y i + 2

+

- --

^

^ ^ ^

Stag e 2

Ld iy i R = L T^

^ ^

D e l ay

D e l ay y i

R e c e i ve dSi g nal

C hanne lE s t i m ate s

Stag e 3

Cd i -2y i -2 R = L T^

y i -4^d i -4

D e te c te d bi ts

C

L

Being designed as a class project in Elec 422/423VLSI class

Outline

• Parallel interference cancellation

• Pipelining

• On-line arithmetic

• Conclusions

• Current research directions

Redundant number systems

• Conventional systems ( 0.34578, r=10)– radix r has r possible digits

• Redundant (0.34578,0.35578,…. r=10)– >r possible digits.

• Limit carry propagation• Totally parallel addition/subtraction ONLY.

On-line arithmetic

• Uses a redundant number system• Pipelined bit-serial arithmetic• Most Significant Digit First computation• Successive computations as soon as

inputs available ( = 1-4, typically)• Can do operations such as addition,

multiplication, division, square-root etc.

On-line detection and decoding

C h an n e lE s tim a tio n

D e tec tio n D eco d in g

A n ten n a

D eco d edIn fo rm a tio n b its

H ard d ec is io n do r so ft d ec is io n y

R F U nit A /D D em ux

Entire chain can be done on-line

Work with hard decisions (sign of MSD) simple way to use softer decisions (2 or more digits)

On-line arithmetic for detection

d p = s ig n (A H r)

O n-l i ne Si ng l e U s e r D e te c to rC o nve nt i o nal S i ng l e U s e r D e te c to r

A Hp ,1 A H

p ,2 A Hp ,N -1

+

+

+

+

+

A H r

A Hp ,N

* * * *

r0 r1 rN - 1 rN

A Hp ,1 A H

p ,2 A Hp , N -1

+

+

+

+

+

A Hp ,N

* * * *

r0 r1 rN - 1 rN

d p = s ig n (A H r)

L ate nc y = lo g2(d)* t co n v* ( lo g2(N ) + 1 ) L ate nc y = tO L* ( lo g2(N )+ 1 ) + t s t o p

Single user detector using Conventional arithmetic

L ate nc y = Thr o ug hput = lo g2(d)* t co n v* ( lo g2(N ) + 1 )

d i - 1 d i d i +1

N * N /2 + 1 log 2 (d)* tc o n v

Conventional arithmetic - matched filter

Single user detector using On-line arithmetic

N m u ltip lic a tio n s in p ar a lle l ( N * )

N /2 ad d it io n s in p ar a lle l ( N /2 + )

N /4 ad d it io n s in p ar a lle l

N /8 ad d it io n s in p ar a lle l

F in a l ad d it io n b ef o r e s ig n ( 1 )

L ate nc y = tO L* ( lo g2(N )+ 1 ) + t s t o p

Thr o ug hput = m * tO L

T r ee ad d it io n

S to p ! S ig n d ig it d e tec ted

0 < = t s t o p < m * tO L is the t im e fo r the f i r s t no n-ze ro digi tm = d/ lo g2( r ) is the no . o f digi ts

d i - 1 d id i +1

m * tO L

tO L * ( lo g 2 ( N ) + 1 )

t s t o p

tO L

On-line matched filter

M atc he d F il te rtC M F = ( lo g2(N )+ 2 )* lo g2(d) * t co n v

P IC - S tage 1tC P IC = ( lo g2(K )+ 3 )* lo g2(d) * t co n v

P IC - S tage 2tC P IC t im e

P IC - S tage S = 3tC P IC t im e

Bit Parallel Conventional arithmetic

L ate nc y = (2 * S-1 )* tC P IC + 2 * tC M F

Thr o ug hput = tC P IC

d i

d i

d i +1

d i +1

d i

d i

d i +1

d i +1

y i , y i +1 ,

tCMF

tCPIC

Conventional multiuser detection

Digit Serial On-line arithmetic

0 < = t s t o p < m * tO L

L ate nc y = tM F + m * S* tO L+ S* tP IC

Thr o ug hput = m * tO L

d i d i +1y i y i +1

d i

d i

d i

d i +1

d i +1

d i +1

M atc he d F i l te rtM F = ( lo g2(N )+ 2 )* tO L + t s t o p

P IC - S tage 1tP IC = ( lo g2(K )+ 3 )* tO L + t s t o p

P IC - S tage 2tP IC t im e

P IC - S tage S = 3tP IC t im e

m*tOL

tMF

tPIC

ts top

On-line multiuser detection

Comparisons

N = K =32, d = 8, S = 3, r = 4, tol = 2, tconv = 1, tstop = 2

Outline

• Parallel interference cancellation

• Pipelining

• On-line arithmetic

• Conclusions

• Current research directions

Conclusions

• Techniques such as pipelining and on-line arithmetic can be used to implement multi-user detection for W-CDMA.

• Lower latency• Higher throughput• Smaller area• Simple hardware - adders and multipliers

Current research directions

• Reconfigurable computing -RENE

• Chameleon - hardware

• mNIC card

Recommended