Power Reduction in FIR

Embed Size (px)

Citation preview

  • 8/12/2019 Power Reduction in FIR

    1/16

    Hindawi Publishing CorporationVLSI DesignVolume 2012, Article ID 870546,15pagesdoi:10.1155/2012/870546

    Research ArticlePowerConsumptionModels for Decimation FIR Filters inMultistandard Receivers

    KhaledGrati,1 Nadia Khouja,1 Bertrand LeGal,2 and Adel Ghazel1

    1 Cirtacom Laboratory, Ecole Superieure des Communications de Tunis, 2083 Ariana, Tunisia2 IMS Laboratory, Universite de Bordeaux I, Bordeaux, 33405 Talence, France

    Correspondence should be addressed to Nadia Khouja,[email protected]

    Received 3 December 2011; Revised 9 March 2012; Accepted 15 March 2012

    Academic Editor: Frank Kienle

    Copyright 2012 Khaled Grati et al. This is an open access article distributed under the Creative Commons Attribution License,which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

    Decimation filters are widely used in communication-embedded systems. In fact, decimation filters are useful for implementingchannel filtering or selection with low-computation complexity requirements. Many multistandard receiver designs that arerequired in ubiquitous embedded systems are based on a cascade of decimation filter processing. Filter number and implementa-tion architectures have a significant impact on systemperformances, such as computation complexity, area, throughput, andpowerconsumption. In this work, we present filter power consumption estimation models for FIR filters. Power consumption modelswere obtained from a large number of FIRfilter syntheses using a direct form. Several curves that estimate power consumption wereextracted from these synthesis results. Then, we have evaluated the impact of polyphase decomposition on power consumption ofFIR filter and compared it with the direct form results. Some tips regarding power consumption were deduced for the polyphase

    implementation form. The aim of this work is to help a system designer to select an efficient implementation for FIR in terms ofpower consumption without having to implement and synthesize the different possible solutions. The proposed method is appliedfor STMicroelectronics libraries 90 nm and 65 nm low power then validated with a use case of multistandard receiver designing.

    1. Introduction

    Currently, wireless technologies are widespread because oftheir flexibility of use. However, many different standards areused, and a new challenge for the communication-embeddedsystem designer is to implement multiple communicationstandard devices in order to provide easy access to in-

    formation everywhere with low hardware complexity. Ingeneral, such devices include communication chains withdecimation and selector filters. Typically, multiple stan-dards communication chains are composed of an RF frontend, an over-sampler analogue to digital converter, anda cascade of decimation filters (Figure 1) [1]. The powerconsumption is an important constraint during embeddedsystem design because the design of decimation filtershas a substantial impact on the power consumption inmultistandard receivers. This work focuses on FIR filterpower consumption estimation in direct form (Figure 2).The polyphase form of FIR filters is widely recommendedfor reducing power consumption in comparison with all

    possible implementation forms of these filters. To the bestof our knowledge, there has not been a clear study basedon experimental results showing how much power FIR filtersconsume. In fact, the work by Dumonteix et al. [2] is widelymentioned, and it deals only with the power consumption,area and critical path of a particular implementation of acomb filter. It was shown in this work that appropriate filter

    decomposition in association with polyphase decompositioncould lead to an important significant consumption of theCIC filter.

    The main objective of this work is to provide models thatevaluate FIR decimator filter power consumption in directform. These models consider the main filter parameters,which are the filter order, the input wordlength, and the co-efficient wordlength. This model was given for STM90 nmand STM60 nm low power. The operation conditions arespecified in paragraph 2.

    The second objective of this paper is to study the impactof the polyphase form on the power consumption of FIRfilters. Some tips regarding the best decomposition to

  • 8/12/2019 Power Reduction in FIR

    2/16

    2 VLSI Design

    Table1: Dynamic and static power contribution using a 90 nm library of STMicroelectronics.

    Filter order Dynamic power (uW) Dynamic power (%) Static power (uW) Static power (%)

    13 300,9546 48,84 315,2013 51,16

    22 481,3064 48,26 515,8453 51,74

    31 632 46,71 721 53,29

    40 722 43,54 936 56,4649 840,64 42,63 1131,1 57,37

    58 973,3182 41,87 1351,3 58,13

    64 1054,6 41,62 1478,9 58,38

    RF filterAnalog to

    digitalconverter

    FIRdecimation

    filter

    Figure 1: General communication chain in a multistandardreceiver.

    perform to save power are extracted from the synthesisresults in this work.

    The ultimate aim of this study is to help filter designersdecide the best way to decompose the filter processing intostages to guarantee optimal power consumption.

    This paper is organized as follows. Section 2 presentsthe specifications of the symmetric FIR filter that is used toestimate the power consumption of a decimation FIR filter.In Section 3, we present the power consumption modelsthat are obtained for the direct implementation form. InSection 4, the models are validated by using synthesis results.InSection 5, we present the implementation results for thepolyphase form usage. A use case based on established

    models is presented in Section 6. Finally, conclusions aregiven inSection 7.

    2. CharacterizationofDecimatorFIRFilterParameters

    When designing FIR channel selection and a decimationfilter, the filter designer faces a trade-off between channelselection efficiency and filter complexity. Indeed, the designmust guarantee channel selection with the minimum com-plexity in terms of occupied area and power consumption.Depending on the communication standard, filter designerschoose the optimal filter order and coefficient length to

    guarantee the required signal-to-noise ratio. The input datawordlength is typically deduced from the analogue todigital converter input signal dynamic range. Hence, for thehardware performance evaluation, the designer should con-sider the three following parameters in FIR filter powerconsumption: input wordlength, filter order, and coefficientwordlength.

    To evaluate the impact of these parameters on the powerconsumption of an FIR filter, we performed a large numberof FIR filter architecture syntheses. We considered filtersorders from 7 to 64 (with a step of three). We chose toimplement both direct and polyphase forms. To reduce thehardware implementation complexity of the filter, we used

    a Wallace adder tree [3, 4] to perform the addition of allmultiplication results (Figure 2).

    For the polyphase decomposition, each subfilter wasimplemented as a FIR filter in the direct form. For simplicity,we chose to run the syntheses with the input wordlengthequal to 4 bits, 8 bits, 16 bits, and 32 bits. In the next section,we will show that, for intermediate values, it is possible to

    interpolate the power consumption of the filter.In the same way, we choose three possible values of filtercoefficients: 4 bits, 8 bits, and 16 bits. In fact, both the filterorder and coefficient size depend on the filters mask. Weestimate that 16 bits offer enough accuracy for quantizationprocess.

    The performance estimation of FIR filters was done onSTM90 nm process technology. In this 90nm process library,the static power has almost the same proportions as thedynamic power consumption (seeTable 1). For this reasonwe proposed two separate models for the dynamic and staticpower consumption. The performance estimation of FIRfilters was also done on ASIC 65 nm process technology using

    a STMicroelectronics low-power library. Using this library,the power consumption of the FIR filters is reduced to thedynamic contribution because the static power is very low(seeTable 2). Using the STM 65 nm low-power technologypermits to deduce the dynamic power consumption modelthat can be verified later with the STM90 nm library process.

    Design Vision of Synopsys was used to extract the perfor-mances on ASIC technology. From the experimental results,we were able to build a power consumption model of FIRfilters depending on the three main filter parameters: inputwordlength, coefficient wordlength, and filter order. Becausethe dynamic power consumption depends on frequency, themodel for dynamic power consumption obtained is also

    frequency dependent.

    3. PowerConsumption EstimationModels

    In this section, we introduce the power consumption esti-mation models for STM65 nm and STM90 nm process tech-nologies. The STM65 nm library is low-power and operatesat 0.9 V and used in nominal case with junction temperatureof 25C. The STM90 nm library operates at 1.26 V and isused in the best case with a junction temperature of 40C.

    3.1. Dynamic Power Consumption Model. For the dynamicpower consumption model we used STM 65 nm low-power

  • 8/12/2019 Power Reduction in FIR

    3/16

    VLSI Design 3

    Reg Reg Reg Reg Reg

    RegRegRegRegReg

    clk

    Wallace tree adder

    h0 h1 h2 hN

    xn

    yn

    (a) Direct form

    Reg

    Reg

    Reg

    Wallaceaddertree

    xn Fs/M

    Fs/M

    Fs/M

    Fs/M

    Fs

    Fs

    Fs

    Fs

    H0

    H1

    H2

    HM1

    yn

    (b) Polyphase form

    Figure2: Implementation architecture of the direct form and polyphase form of FIR filter.

    Table2: Dynamic and static power contribution using a 65-nm low power library.

    Filter order Dynamic Power (uW) Dynamic power (%) Static Power (uW) Static power (%)

    13 169,6857 98,9 1,9 1,1

    22 245,1208 98,8 3,1 1,2

    31 356,7201 98,9 4,2 1,1

    40 424,5542 98,7 5,7 1,3

    49 497,6389 98,6 7,2 1,4

    58 578,6831 98,7 8 1,3

    64 628,4149 98,6 9,1 1,4

    technology. In this technology, the dynamic power consump-tion is assumed to be the total power because the static poweris very low (seeTable 2). The Dynamic power consumptionis assumed to be proportional to frequency. To verify thisassumption, we performed several experiments to evaluatethe impact of the frequency constraint, provided by thelogical synthesis tool, on the occupied area and the powerconsumption of the generated design for a fixed filter order.Figure 3 confirmsthat the resources that are used to build thearchitecture for a given filter are the same regardless of the

    specified frequency constraint. Hence, the logical synthesistool does not introduce any area or power optimizationregardless of the working frequency. As a consequence, thedynamic power consumption is considered linear given theconstrained frequency (Figure 3).

    Following this observation, we concentrated all synthesesefforts at a fixed frequency equal to 80 MHz, which issufficient for the requirements of both GSM and UMTSstandards [5,6].

    Figure 4gives the evolution of the power consumptionversus the filter order for a direct form FIR filter for differentvalues of coefficients and different inputs wordlength. Thisfigure shows that power consumption is quite linear to the

    order of the filter for fixed input (4, 8, and 16 bits) andcoefficient wordlength (4, 8, and 16 bits).

    Figure 5 illustrates the relationship between the powerconsumption and input wordlength for four chosen ordersand for a fixed coefficient value. For all other orders andfor the different coefficient wordlengths, the evolution of thepower consumption has the same trends. According to thesecurves, the power consumption evolution versus the inputwordlength is not linear.

    However,Figure 6shows that the natural logarithm of

    the power consumption is almost linear as compared withthe logarithm of the input wordlength. Hence, (1) givesthe expression of the natural logarithm of the dynamicpower consumption versus the natural logarithm of the inputwordlength. This calculation leads to relation (2), which givesthe power consumption general expression.

    LN(P) = LN()+ LN(I), (1)

    P= (I), (2)

    whereIis the input wordlength, is the slope of the curvesin Figure 6, and LN() represents the origin value of the

  • 8/12/2019 Power Reduction in FIR

    4/16

    4 VLSI Design

    0

    5000

    10000

    15000

    20000

    25000

    30000

    20 40 80 100 133 200 266 400

    Frequency (MHz)

    O13

    O40

    O22

    O58

    O31

    O64

    Areaoccupation(m2)

    (a) Area evolution

    0 50 100 150 200 250 300 350 4000

    500

    1000

    1500

    2000

    2500

    3000

    Frequency (MHz)

    Powerconsumption(W)

    O13

    O40O22O58O31

    O64

    (b) Power consumption

    Figure3: Area occupation and power consumption of FIR filters in the direct form depending on frequency.

    0100200300400500600700800900

    1000110012001300140015001600

    7 10 13 16 19 22 25 28 31 34 37 40 43 46 49 52 55 58 61 64Filter order

    Powerconsumption(W)

    Coef16 I4Coef16 I8

    Coef16 I16Coef8 I4Coef8 I8

    Coef8 I16

    Coef4 I4Coef4 I8

    Coef4 I16

    Figure 4: Power consumption of FIR filters in the direct formdepending on input wordlength and coefficient wordlength.

    same curves. Evaluating the and expressions dependingon the filter parameters and on the basis of the different

    experiments and curves will lead to the establishment ofthe dynamic power consumption model. The followingsection demonstrates how each parameter of expression (2)is obtained.

    We evaluate first whether parameter is independentof the filter versus expression of the parameter . Hence,according to Figure 6, all given curves LN(P) versus LN(I) arealmost parallels for a fixed coefficient value. This propertywas verified for all other orders and for all coefficientwordlength considered in this work. Consequently, the slope() of all these curves is the same regardless of the filter order.Thus, this slope () does not depend on the filter order andis only dependent on the coefficient wordlength.

    0 5 10 15 20 25 30 350

    200

    400

    600

    800

    1000

    1200

    1400

    Input wordlength (bits)

    Order 13Order 16

    Order 19Order 22

    Powerconsumption(W)

    Figure 5: Power consumption of FIR filters in the direct formdepending on input wordlength forthe 65 nm technology fora fixedcoef size of 4 bits.

    To evaluate the expression of as a function of coef(filter coefficient wordlength), we plot the curves given

    depending on coef for fixed filter orders inFigure 7. Thesecurves are obtained from data illustrated inFigure 6.

    Figure 7confirms the filter order (N). Moreover, thecurves show no linear evolution of versus coefficientwordlength (coef). The expression of as a function of coefis given in (3).

    =g(coef) = a exp(b coef)+c, (3)

    where a, b, and c are technology-dependent terms. Theexponential term is explained by the curves in Figure 7. Infact, the curves converge around (a+c) when the coefficientwordlength is close to 0 and then decreases very fast when thecoefficient wordlength increases to converge to the cvalue.

  • 8/12/2019 Power Reduction in FIR

    5/16

    VLSI Design 5

    1 1.5 2 2.5 3 3.54.5

    5

    5.5

    6

    6.5

    7

    Order 13

    Order 16

    Order 19

    Order 22

    L

    N(P)

    LN(I)

    Figure 6: Natural logarithm of the power consumption of FIRfilters in the direct form depending on the natural logarithm of theinput wordlength for the 65 nm technology for a fixed coef size of

    4 bits.

    4 6 8 10 12 14 160.7

    0.75

    0.8

    0.85

    0.9

    0.95

    Coefficient wordlength (bits)

    Order 13Order 16

    Order 19

    Figure 7: Evolution of depending on coefficient wordlength fora fixed input size of 4 bits and fixed filter orders for the 65 nmtechnology.

    The line of equation y = c is a horizontal asymptote to thecurves inFigure 7, which explain the additive c term in (3).To set numerical value ofa,b,cfor this given technology weused Matlab curve fitting toolbox [7] and we found that,in this technology, parameters a, b, and care 0.6, 0.3, and0.7, respectively. Of course, these values are useful onlyfor the given technology; however, if we are using anothertechnology the method presented above should be repeatedat least one time as we have performed with the STM90 nmlater.

    To evaluate parameter of (2), we used the same ap-proach used for expression extraction. Hence, first, thedependency ofversus filter order (N)was evaluated. Then,

    13 14 15 16 17 18 19 20 21 22

    30

    40

    50

    60

    70

    80

    90

    100

    110

    Filter order

    Coef = 4

    Coef = 8

    Coef = 16

    20

    Figure8: Evolution ofdepending on filter order fixed coefficient

    wordlength for the 65 nm technology.

    Figure 9: Evolution of slope depending on the coefficientwordlength for fixed filter order for the 65 nm technology.

    the dependency versus coefficient wordlength (coef) wasfound.

    Curves inFigure 8illustrate the dependency of param-eter on the filter orders and for three fixed coefficient

    wordlengths. To obtain these curves, we extracted theorigin values fromFigure 7. These origin values representthe normal logarithm of the parameter for each fixedcoefficient size. Then, we plotted the exponential of thesevalues. The curves inFigure 8show that is almost linearversus filter order for any coefficient size. Hence, we nextevaluated the dependency of the slope of depending oncoefficient wordlength, as shown inFigure 9. According tothis figure, the slope of is also linear versus coef. Asa consequence, the relation (4) is deduced to model theevolution ofversusNand coef.

    = N coef +, (4)

  • 8/12/2019 Power Reduction in FIR

    6/16

    6 VLSI Design

    0 5 10 15 20 25 30 350

    500

    1000

    1500

    2000

    2500

    3000

    3500

    4000

    4500

    5000

    Input wordlength (bits)

    Order 13

    Order 16

    Order 19

    Order 22

    Powerconsu

    mption(W)

    Figure10: Relationship between the dynamic power consumptionand input wordlength for the 90nm technology for a fixed

    coeffi

    cient wordlength.

    where and are constants and depend on the technologyconsidered. Experiments show that in the 65 nm technology,the values of,are 0.2 and 1, respectively.

    Hence, using (3) and (4), equation (5) gives the dynamicpower consumption evolution versus filter parameters andnormalized frequency for a direct form (Figure 2).

    PN,coef, I,f

    =

    N

    coef +

    I(aexp(bcoef)+c)

    f

    f0,

    (5)

    where f0 is the frequency used during the syntheses processand is equal to 80 MHz. When the filter order is zero, thepower consumption should be zero as well. This conditionis guaranteed because power is directly proportional to filterorder (N).

    On the other hand, when the coefficient length (coef) iszero, the FIR filter is composed ofNregisters andN/2 adders.In this case, the power consumption becomes a constantmultiplying the filter order (N).

    3.1.1. Verification of the Dynamic Power Consumption forSTM90-nm. To verify whether the model of (4) is compliant

    with the amount of dynamic power consumption in theSTM90 nm technology, we repeated all of the synthesesusing the new library. The same parameter values wereused for the experiments. The same conclusions regardingpower consumption versus input wordlength were noticed.As shown in Figure 10, we also verified that the naturallogarithm of the power is proportional to the naturallogarithm of the input wordlength regardless of the filterorder and coefficient size. Hence, (1) and (2) are still truefor the 90 nm technology.

    According toFigure 11, the independency of the param-eter in (2) regarding filter orderNis still verified. Figure12 shows the relationship between and the coefficient

    1 1.5 2 2.5 3 3.54.5

    5

    5.5

    6

    6.5

    7

    Order 13

    Order 16

    Order 19

    Order 22

    LN(

    P)

    LN(I)

    Figure 11: Relationship between the logarithm of dynamic power

    consumption and the logarithm of the input wordlength for the90 nm technology for a fixed coefficient wordlength.

    4 6 8 10 12 14 160.7

    0.72

    0.74

    0.76

    0.78

    0.8

    0.82

    0.84

    0.86

    0.88

    Coef (bits)

    Figure12: Evolution ofdepending on the coefficient wordlengthfor fixed filter orders.

    wordlength. The same trends observed inFigure 7are ob-served inFigure 13. Hence, the expression ofgiven in (3) is

    verified in the 90 nm technology.Figures 13 and 14 give the evolution of parameter

    (given in (3)) regarding filter order and the slope ofversuscoefficient wordlength, respectively. According to the twofigures, (4), which gives expression ofregarding filter orderand coefficient wordlength, is verified.

    As a consequence, the model of (5) is applicable for theevaluation of the dynamic power consumption in the 90 nmtechnology. With the same manner, to set numerical valueofa, b, c for this given technology we used Matlab curvefitting toolbox [7] and we found that for the 90 nm process,parameters , , a, b, and cwere equal to 0.5, 4.5, 0.5, 0.2,and 0.6, respectively.

  • 8/12/2019 Power Reduction in FIR

    7/16

    VLSI Design 7

    10 20 30 40 50 60 700

    100

    200

    300

    400

    500

    600

    700

    Filter order

    Coef= 4

    Coef= 16

    Figure 13: Evolution of depending on filter orders for a fixedcoefficient wordlength.

    Figure14: Evolution of the slope ofdepending on the coefficientwordlength.

    3.2. Static Power Consumption Model. TheTable 1illustratesthe dynamic and static power contribution of the power

    consumption for the 90-nm process technology. It is clearfrom the table that static power cannot be neglected in thistechnology.

    To establish a static power consumption model, we evalu-ated the evolution of the static power regarding filter param-eters. In particular, we noticed a nonlinear relationshipbetween the static power and input wordlength (see Figure15). The second observation concerns the linearity of thenatural logarithm of the static power versus the naturallogarithm of the input wordlength (Figure 16). Hence, forthe dynamic power consumption contribution, (1) and (2)could be used as general equation forms for the static powerconsumption of symmetric FIR filters.

    0 5 10 15 20 25 30 350

    1000

    2000

    3000

    4000

    5000

    6000

    Input wordlength (bits)

    Order 13

    Order 22

    Order 31

    Order 40

    Order 49

    Staticpowercon

    sumption(W)

    Figure 15: Relationship between the static power consumption andinput wordlength for the 90 nm technology for a fixed coefficientwordlength.

    1 1.5 2 2.5 3 3.56

    6.5

    7

    7.5

    8

    8.5

    9

    Order 13

    Order 22

    Order 31

    Order 40

    Order 49

    Log (I)

    Log(P)

    Figure 16: Relationship between the logarithm of the static powerconsumption and the logarithm of the input wordlength for the 90-nm technology for a fixed coefficient wordlength.

    To establish expressions of the and terms (in (2)), thedynamic power consumption modeling was followed. Hence,starting from the fact thatis filter order independent (sincethe curves are parallel inFigure 16), we plot the evolution ofversus coefficient wordlength inFigure 17.

    After analyzing the curves inFigure 17, (6) fits the bestcurve evolution. Indeed, the exponential term is explainedby the rapid decrease when small coefficient values areconsidered. The linear term is added because of the very slowincrease when the coefficient wordlength increases.

    = exp(d coef)+e coef +g, (6)

  • 8/12/2019 Power Reduction in FIR

    8/16

    8 VLSI Design

    4 6 8 10 12 14 160.68

    0.7

    0.72

    0.74

    0.76

    0.78

    0.8

    Coefficient wordlength

    Figure17: Evolution of depending on the coefficient wordlengthfor fixed filter orders.

    10 15 20 25 30 35 40

    50

    100

    150

    200

    250

    300

    350

    400

    450

    500

    550

    Filter order

    Coef= 4Coef= 8

    Coef= 16

    Figure 18: Evolution of depending on filter orders for a fixedcoefficient wordlength.

    where d, e, and g are technology dependent parameters.Using Matlab, we shown that, when using a STMicroelec-

    tronics library, the parameters are equal to 0.11, 0.03, and0.04.

    Parameter was then evaluated depending first on thefilter order and then on the coefficient wordlength. Figure18 shows the relationship between and N. Hence, wefound an almost linear relationship of this parameter versusfilter order. In the second step, the slope of was analyzedaccording to coefficient wordlength (seeFigure 19). Matlabverified that (7) fits the evolution of the slope ofillustratedinFigure 18.

    slope =

    a exp(b coef)+c, (7)

    4 6 8 10 12 14 16

    7

    8

    9

    10

    11

    12

    13

    Coef wordlength

    3

    4

    5

    6

    0 2

    s

    lope

    Figure19: Evolution of theslope depending on filter orders for afixed coefficient wordlength.

    where a, b, c, and are parameters depending on thetechnology. According to Matlab, when using the 90 nmSTMicroelectronics library, the parameters are equal to 0.29,0.4, 0.077, and 1, respectively. Finally,is expressed in (8):

    = +

    a exp(b coef)+cN. (8)

    As a consequence, the general expression of the static powerconsumption for a direct form FIR filter (Figure 2) in the90 nm technology could be written as shown in (9).

    Pstat(N, coef, I) = N

    a exp(b coef)+c+

    I(exp(dcoef)+ecoef+g),(9)

    where technology dependent parametersa, b, c, d, e, and gare equal to 0.29, 0.4, 0.077, 0.11, 0.03, and 0.04, respectively.Parameters and depend also on the technology and areequal to 1.2 and 1.2, respectively, for the current model.

    4. Validation of thePowerConsumptionModels

    This part presents the validation of the models obtained.Hence, we present different figures comparing the synthesesvalues of power consumption and the results deduced

    from established models. The aim of the comparison is todemonstrate first that the power consumption trends arerespected by the models. In fact, the power estimation valuefor a given design is not really important and the objective isto prove that decisions concerning the choice of the suitableparameters, which reduces the power consumption, are notmodified using the models. For this purpose, a parametercalled the deviation is calculated. This parameter givesthe difference for two fixed filter parameters (which are thefilter order and coefficient wordlength or input wordlength)between the power consumption value when varying thethird filter parameter (which is the coefficient wordlengthor input wordlength). Indeed, having a similar deviation

  • 8/12/2019 Power Reduction in FIR

    9/16

  • 8/12/2019 Power Reduction in FIR

    10/16

    10 VLSI Design

    0

    500

    1000

    1500

    2000

    2500

    3000

    3500

    4000

    4500

    13 22 31 40 49 58 64

    Filter order

    d c8e4d c8e16

    model (d c8e8)

    d c8e8

    model (d c8e4)

    model (d c8e16)

    Dynamicpower(W)

    (a) coef= 8

    0

    500

    1000

    1500

    2000

    2500

    3000

    3500

    4000

    4500

    13 22 31 40 49 58 64

    Filter order

    d c16e4

    d c16e8d c16e16

    model (d c16e4)

    model (d c16e8)

    Dynamicp

    ower(W)

    (b) coef= 16

    Figure22: Comparison of model dynamic power results to experimental values for the 90 nm technology for a fixed coefficient wordlength.

    0

    500

    1000

    1500

    2000

    2500

    3000

    13 22 31 40 49 58 64

    Filter order

    Deviation(W)

    d1

    d2

    d3

    m d1 m d2

    m d3

    Figure 23: Deviation parameter for model and estimation toolpower results for coef= 8.

    almost equal to the deviation measured for the experimentaldynamic power estimation (Figure 23).

    4.3. Validation of the Static Power Consumption Model for the90 nm Process Technology. For the dynamic contribution, wevalidated the static power model as given in (9). Figure 24shows the comparison between the results extracted fromthe model and the experimental values for static powerconsumption and for fixed input wordlength.

    In the same way, we confirm that the proposed modelfor static power consumption gives static power values veryclose to the values given by the estimation tool. The modelalso does not modify trends of power and does not modifydecisions depending on the filter parameters because thedeviations are almost equal (seeFigure 25).

    5. PowerConsumption EstimationTips forPolyphase ImplementationForm of FIR Filters

    5.1. Results of Polyphase Implementation Form in the 65 nmTechnology. Polyphase implementations of FIR filters wereperformed with different input wordlengths, coefficient sizes,and filter orders, which lead to the same observations

    illustrated inFigure 26. This figure compares the direct formimplementation of different FIR filter orders versus theirpolyphase decomposition.

    The comparison is made for an input size equal to 4 bitsand includes decimation values of 2, 4, 8, and 16. AccordingtoFigure 26, it is clear that polyphase decomposition reducesthe power consumption for any decimation factor.

    In fact, the following observations are clear.

    (i) Decomposition into two stages allows power reduc-tion rates that increase with filter order. The reduc-tion rates are between 20% and 40% in comparisonwith the direct form implementation.

    (ii) Decomposition into four stages allows reductionrates between 45% and 60% compared with the directform implementation.

    (iii) Decomposition into 8, 16, and 32 stages allows reduc-tion rates between 60% and 70% for 8 stages basedarchitecture, 60% and 75% for the 16 stages, and 55%and 75% for the 32 stages.

    As a consequence, polyphase decomposition is always benefi-cial in terms of power consumption reduction. However, thereduction average depends on the decimation factor that isused for a given filter order. Equation (10) gives the relationbetween the decimation factor and filter order up to 128 to

  • 8/12/2019 Power Reduction in FIR

    11/16

    VLSI Design 11

    Staticpow

    er(W)

    s c4e4 s c8e4

    s c16e4 Model (s c8e4)

    Model (s c4e4) Model (s c16e4)

    0

    500

    1000

    1500

    2000

    2500

    3000

    13 22 31 40 49 58 64

    Filter order

    (a) I= 4

    s c4E8 s c8e8

    s c16e8 Model (s c4e8)

    Model (s c8e8) Model (s c16e8)

    Staticpow

    er(W)

    0

    500

    1000

    1500

    2000

    2500

    3000

    3500

    4000

    4500

    13 22 31 40 49 58 64

    Filter order

    (b) I= 8

    Figure24: Comparison of the model static power results with experimental values for the 90 nm technology for a fixed Input wordlength.

    0

    100

    200

    300

    400

    500

    600

    700

    800

    900

    1000

    13 22 31 40 49 58 64

    Deviation(W)

    Filter order

    d1 d2 d3

    m d1 m d2 m d3

    Figure 25: Deviation parameter for model and estimation toolpower results for an input size of 8 bits.

    offer the best power consumption reduction. It is estimatedthat order 128 is sufficiently large for channel selection filters.

    Mopt =

    8 si 8

  • 8/12/2019 Power Reduction in FIR

    12/16

    12 VLSI Design

    0

    200

    400

    600

    800

    1000

    1200

    13 22 34 43 49 59 61

    Filter order

    Dynamicpowerconsumption(W)

    d m2

    d m4

    d m8

    d m16 Direct form

    Figure 27: Dynamic power evolution of poly phase form imple-mentations of a symmetric FIR filter for the 90 nm technology.

    0

    500

    1000

    1500

    2000

    2500

    13 22 34 43 49 59 61

    Filter order

    Staticpowerconsumption(W)

    s m2

    s m8

    Direct form

    s m4

    s m16

    Figure28: Static power evolution of poly phase form implementa-tions of a symmetric FIR filter for the 90 nm technology.

    InFigure 29, the total power consumption results for theFIR filter considered are plotted and compared to the powerconsumption results of the direct form. According to thisfigure and considering the total power consumption criteria,the polyphase implementation could lead in some cases to apower consumption increase in comparison with the direct

    form power consumption results.However, decimation by a factor of 2 and 4 always re-duces the total power consumption in comparison with thedirect form, with advantageous reduction for a decimationfactor of 4.

    6. Comparison of PowerConsumption ofFilteringArchitectures forGSMandUMTSStandardsUsingEstablishedModels.

    In this section, we examine the power consumption estima-tion of different decimation filter solutions for multistandardreceiver supporting UMTS and GSM standards. The aim

    0

    500

    1000

    1500

    2000

    2500

    3000

    13 22 34 43 49 59 61

    Filter order

    Totalpowerc

    onsumption(W)

    t m2 t m4t m8 t m16

    Pt direct

    Figure 29: Total power evolution of poly phase form implementa-tions of a symmetric FIR filter for the 90 nm technology.

    of this section is to demonstrate how the models couldhelp filter designer make the correct choice regarding filterarchitecture to reduce the power consumption.

    For this purpose, three different architectures suitable formultistandard receivers are analyzed (Figure 30).

    The parameters of the different filters building each ar-chitecture are obtained using the specification methodologydescribed in [8]. The multistandard receiver architecture andparameters considered in this work are calculated in [9].Because filtering performances are out of the scope of thispaper, only the description of the filter parameters is given.

    6.1. Filtering Solution Description for UMTS and GSMStandards. This paragraph gives the details of the differentarchitectures for both standards. Tables3 and 4 summarizethe parameters of the different stages in the case of UMTSand GSM standards, respectively.

    6.1.1. 2-Stage Architecture (Arch0). For both UMTS andGSM standards, the first filter is composed of a cascade of 5-stage CIC filters. It performs decimation by a factor of 12 or4 for GSM and UMTS standards, respectively. The recursivearchitecture of the CIC filter allows the programmabilityof the filter depending on the selected standard. The input

    wordlength of the filter is equal to 6 bits at its output; thewordlength is 16 bits for the UMTS standard and is equal to26 bits in the case of the GSM standard. The second filter ofthe 2-stage architecture is a symmetric FIR filter of order 45for UMTS, where 12 bits are necessary for the quantificationof the filter coefficients for this standard. In the case of GSM,the required filter order is 83, and 12 bits are sufficient for thequantization of coefficients.

    6.1.2. 3-Stage Architecture (Arch1). In this architecture, thelast filter of the 2-stage architecture is split into a half-bandfilter and an FIR filter performing, each one a decimationby a factor of 2. The half-band filter has an order equal

  • 8/12/2019 Power Reduction in FIR

    13/16

    VLSI Design 13

    CIC filter M1 FIR filter M2

    (a)

    CIC filter M3Halfband

    filter 2 FIR filter 2

    (b)

    CIC filter 2 CIC filter 2 Halfband

    filter 2 FIR filter 2

    (c)

    Figure30: Decimation filter architectures for GSM and UMTS standards.

    Table3: Specification of the filters in proposed architectures for the GSM standard.

    Architecture Specification CIC filter HB1filter HB2 filter FIR filter

    2 stages

    Order 5 N/A N/A 83

    Input wordlength 6 N/A N/A 17

    Coefficients wordlength N/A N/A N/A 12

    Decimation factor 12 N/A N/A 4

    3 stages

    Order 5 20 N/A 45

    Input word-length 6 17 N/A 17

    Coefficients wordlength N/A 11 N/A 11

    Decimation factor 12 2 N/A 2

    4 stages

    Order 6 8 10 34

    Input wordlength 6 17 17 17

    Coefficients wordlength N/A 10 11 10

    Decimation factor 6 2 2 2

    Table4: Specification of the filters in proposed architectures for the UMTS standard.

    Architecture Specification CIC filter CIC filter HB filter FIR filter

    2 stages

    Order 5 N/A N/A 57

    Input wordlength 6 N/A N/A 15

    Coefficients wordlength N/A N/A N/A 12

    Decimati on factor 4 N/A N/A 4

    3 stages

    Order 5 N/A 14 20

    Input wordlength 6 N/A 15 15

    Coefficients wordlength N/A N/A 10 12

    Decimation factor 4 N/A 2 24 stages

    Order 5 4 10 20

    Input wordlength 6 11 15 15

    Coefficients wordlength N/A N/A 9 12

    Decimation factor 2 2 2 2

    to 14 for UMTS and 20 for GSM. The filters coefficientslength is fixed to 10 bits and 11 bits for UMTS and GSMstandards, respectively. The last filter has an order of 20 forUMTS and an order equal to 45 for GSM. For both standards,coefficients are quantized with 12 bits.

    6.1.3. 4-Stage Architecture (Arch2). In the case of the GSMstandard, the 4-stage architecture is based on a CIC filter oforder 6 in its recursive form, followed by a cascade of 2 half-band filters, each one performing a decimation by a factorof 2. The last stage is composed of a symmetric FIR filter,

  • 8/12/2019 Power Reduction in FIR

    14/16

    14 VLSI Design

    Table5: Power consumption evaluation according to models in the 65 nm low-power technology.

    2-stage Archpower (W)

    3-stage Archpower (W)

    4-stage Archpower (W)

    GSM standard

    Filter 1 (cic filter) 226 226 264,3

    Filter 2(halfband)

    N/A 42,5 32,6

    Filter 3(halfband)

    N/A N/A 20,4

    Filter 4 (FIR) 208,6 47,8 34,7

    Total GSM 434,6 336,3 352

    UMTS standard

    Filter 1 (cic filter) N/A N/A N/A

    Filter 2 (cic filter) N/A N/A N/A

    Filter 3(halfband)

    N/A 54 52

    Filter 4 (FIR) 303 56 56

    Total UMTS 303 110 108

    Table6: Power consumption evaluation according to models for the 90 nm technology.

    2-stageArchpower (W)

    3-stageArchpower (W)

    4-stageArchpower (W)

    GSM standard

    Filter 1 (cic filter) 967,8 967,8 2796,8

    Filter 2(halfband)

    N/A 119,5 + 1818,8 93,7 + 720

    Filter 3(halfband)

    N/A N/A 59,7 + 909,4

    Filter 4 (FIR) 575,3 + 8660 134,4 + 4092,4 99,5 + 3060,75

    Total GSM 10203,1 7132 7739,8

    UMTS standard

    Filter 1 (cic filter) N/A N/A N/A

    Filter 2 (cic filter) N/A N/A N/A

    Filter 3(halfband)

    N/A 145 + 827 142 + 818

    Filter 4 (FIR) 823 + 4000 147 + 1600 147 + 1600

    Total UMTS 4823 2719 2707

    completing the decimation by 2 and the channel selection.The two half-band filters have orders equal to 6 and 10, andtheir coefficients are quantized on 10 and 11 bits, respectively.The last FIR filter has an order equal to 34, and we found that10 bits are required for the quantization of coefficients.

    For the UMTS standard, the CIC filter used in the 3-stage architecture is split into a cascade of 2 CIC filters. The

    number of required stages is 5 for the first CIC filter and 4for the second. Each filter performs decimation by a factor of2. The output size of the first CIC filter is equal to 11 bits.The third filter is a half-band filter of order 10 and has acoefficient size equal to 9 bits. Finally, a symmetric FIR filterof order 20 with coefficients quantized on 12 bits completesthe selection. As explained before, the input wordlength isconsidered equal to 15 bits, unless for the first or secondstage, for which the input wordlength is equal to 6 bits or11 bits, respectively.

    6.2. Comparison to Implementation Results. On the basis ofthe filter parameters and the established models given in

    (5) and (10), we estimated the power consumption of eachsub-block in the filtering architectures for both the 65 nmand 90 nm technologies. Table 5 gives power consumptionestimation results for GSM and UMTS standards for the65 nm technology. The results for both standards for the90 nm technology are given inTable 6.

    For the particular case of CIC filters, the power consump-

    tion comparison is performed following the work in [2].Indeed, the authors in [2] studied the power consumptionof fixed order CIC filters depending on their implementationarchitecture. In the case of the GSM standard and because thepower consumption of CIC filters of different orders is notincluded in the work in [2], the power consumption resultsconcern the FIR implementation form of the CIC filters.

    According to Tables3and4, it is clear that, for both tech-nologies (65 nm and 90 nm), the architecture that presentsthe optimal power consumption in the case of the GSMstandard is the 3-stage architecture.

    For the UMTS standard, the power consumption valuesobtained from models are very similar in the case of 3-stage

  • 8/12/2019 Power Reduction in FIR

    15/16

    VLSI Design 15

    Table7: Power consumption evaluation according to the prime-power tool for the 65-nm technology for UMTS and GSMstandards.

    2 stages 3 stages 4 stages

    GSM standard 280,6 231 260

    UMTS standard 171 157 149

    Table8: Power consumption evaluation according to the prime-power tool for the 90nm technology for UMTS and GSMstandards.

    2 stages 3 stages 4 stages

    90nm GSM 751 640 731

    UMTS 5353 4231 4441

    65nm GSM 280,6 231 260

    UMTS 171 157 149

    and 4-stage architectures. If the estimation of the power is

    considered, which is done in [2], for the comparison of thepower consumption of the CIC filters, the power values in-crease, but the trends are not modified.

    It is, however, important to notice that the power due tothe clock tree and stage connection is not considered in theevaluation. Thus, the power consumption for 4 stages shouldbe considered because it is composed of more stages andshould increase compared to the 3-stage architectures, in par-ticular for the 90 nm technology. Hence, it can be concludedthat the 3-stage architecture is also more advantageous interms of power consumption for the UMTS standard.

    To validate these results experimentally, VHDL code of allarchitectures considered was built. The results of the power

    estimated after logic syntheses are given in Tables7and8forthe 65 nm and 90 nm technologies, respectively. These tablesconfirm the conclusions obtained from established models.

    7. Conclusion

    This paper presented power consumption evaluation modelsof direct form FIR filters (Figure 2) used for decimation.Both models for dynamic and static power contributionsare proposed in this work in STM65 nm low-power andSTM90 nm ASIC technology. The proposed models are highlevel models, which estimate the dynamic and static powerconsumption of the FIR filter depending on three filter

    parameters, which are the input wordlength, coefficientwordlength, and filter order. The aim of these models isto help the system designer compare, at the system level,different filter architectures in terms of power consumptionwithout having to implement the different filters and per-form syntheses. However, this method should be verified forany new library different form the used ones in this work.In the second step, the effect of polyphase decompositionfor FIR decimator filters was evaluated in two differenttechnologies. We found that polyphase decomposition allowsgood dynamic power reduction regarding the direct formimplementation. We observed that the dynamic powerreduction could reach 75% in some cases. To help the

    system designer choose the best decimation factor in termsof power consumption, relation (3) was given between thedecimation factor and the filter order. However, the staticpower consumption increases when performing polyphasedecomposition. When the static contribution is important(as with the 90 nm technology library considered), the total

    power consumption can increase in some cases in compari-son with the direct form power consumption.Finally, a case study concerning the UMTS and GSM

    standards was presented. We performed a comparison be-tween three filter architectures. The power estimation basedon proposed models helped choose the suitable architecturefor power consumption optimization, and the result wasconfirmed by filter implementations.

    References

    [1] Y. S. Poberezhskiy and G. Y. Poberezhskiy, Flexible analog frontends of reconfigurable radios based on sampling and recon-struction with internal filtering, Eurasip Journal on WirelessCommunications and Networking, vol. 2005, no. 3, pp. 364381,2005.

    [2] Y. Dumonteix, H. Aboushady, H. Mehrez, and M. Louerat,Low power comb decimation filter using polyphase decompo-sition for mono-bit analog-to-digital converters, IEEE Trans-actions on Circuits and Systems II: Analog and Digital SignalProcessing, vol. 48, no. 10, pp. 898903, 2001.

    [3] M. O. Lakshmanan and M. Ali, High performance parallelmultiplier using Wallace-Booth algorithm, in Proceedings of theIEEE International Conference on Semiconductor Electronics, pp.433436, 2002.

    [4] H. Lee, A power-aware scalable pipelined booth multiplier, inProceedings of the IEEE International Systems-On-Chip Confer-ence, pp. 123126, September 2004.

    [5] Universal Mobile Telecommunication System (UMTS), UERadio Transmission and Reception (FDD), 3GPP TS.101 ver-sion 5.2.0 Release 5. ETSI, 2002.

    [6] Radio Transmission and Reception GSM 05.05. ETSI, 1996.[7] Matlab curve fitting toolbox data sheet, http://www.mathworks

    .com/products/curvefitting/.[8] R. Barrak, A. Ghazel, and F. Ghannouchi, Design of sampling-

    based downconversion stage for multistandard RF subsamplingreceiver, in Proceedings of the 13th IEEE International Confer-ence on Electronics, Circuits and Systems (ICECS 06) , pp. 577580, December 2006.

    [9] A. Ghazel, L. Naviner, and K. Grati, On design and imple-mentation of a decimation filter for multistandard wirelesstransceivers, IEEE Transactions on Wireless Communications,

    vol. 1, no. 4, pp. 558562, 2002.

  • 8/12/2019 Power Reduction in FIR

    16/16

    Submit your manuscripts at

    http://www.hindawi.com