[IEEE 2007 International Conference on Microelectronics - ICM - Cairo, Egypt (2007.12.29-2007.12.31)] 2007 Internatonal Conference on Microelectronics - Power aware channel width tapering

Abstract—In this paper a new transistor channel width

tapering scheme called Hill Tapering for FET chains is proposed with specific emphasis on power dissipation and layout area of the tapered chains. This tapering scheme results in the lowest power dissipation and physical area compared to any of the existing tapering schemes like linear, exponential or optimal tapering. It also offers high speed operation. The proposed scheme is general and can be used in all domino logic circuit designs. SPICE simulation results have shown that up to 81% power dissipation reduction could be achieved by using the proposed tapering scheme.

Index Terms—Elmore delay, power dissipation, transistor sizing, transistor tapering

I. INTRODUCTION OW POWER design techniques require special attention to avoid significant increment of the circuit’s area or

sacrifice in the speed performance of the systems. Many CMOS logic structures like domino logic are composed of FET chains. These serially connected MOSFETs are a major source of power dissipation and delay; hence optimal sizing of transistors is important in reducing the power dissipation and delay. Transistor tapering is a special class of transistor sizing technique which is frequently used in high-performance integrated circuit design in order to improve the speed of operation along with reduction in power dissipation of a constituting circuit block. In 1982 Shoji [1] first proposed the transistor tapering technique showing improvement in circuit performance of large FET chains. In this paper a new channel width tapering scheme is proposed with emphasis on power dissipation and physical area. Simulations confirm the superiority of this scheme and shows it outperforms linear and exponential tapering schemes for both small and large

Manuscript received on June 15, 2007. The authors are with the Department of Electrical Engineering, Indian

Institute of Technology, Kanpur-208016, India (email: [email protected], [email protected]).

FET chains. Linear and exponential tapering approaches fail when FET chains become large resulting in larger power dissipation and physical area. Analytical formula for generating the proposed hill tapered profile is also given, which could be integrated into automatic layout generator. (a) (b) Fig. 1. FET chain shown in (a) and untapered layout in (b). A brief background of previous research on tapering FET chains is provided in section II followed by the proposed Hill tapered scheme in section III. In section IV power dissipation and propagation delay characteristics are shown as a part of SPICE simulation results.

II. PREVIOUS RESEARCH ON FET CHAIN TAPERING Shoji [1] in 1982 first proposed that with the proper choice of tapering factor, using either a linear tapering or an exponential tapering of the channel width of a transistor [2], [3], and with the largest transistor closest to ground and the smallest closest to the load, it was possible to produce a circuit which would result in quick discharge of the load capacitor resulting in faster transient response compared to the untapered chain. Under linear tapering scheme, the widths of adjacent

Power Aware Channel Width Tapering of Serially Connected MOSFETs

Sudhanshu Choudhary Department of Electrical

Engineering Indian Institute of Technology

Kanpur, India 208016 Email: [email protected]

Shafi Qureshi Department of Electrical

Engineering Indian Institute of Technology

Kanpur, India 208016 Email: [email protected]

L GND

CL

CN

C2

C1

FET 1 FET 2 FET 3 | | | | FET N

To Load

d

w

c0

978-1-4244-1847-3/07/$25.00 ©2007 IEEE IEEE ICM - December 2007

transistors are decreased by a constant ∆w. Therefore ∆w is the only adjustable variable. Besides linear tapering, exponential tapering is another widely used tapering scheme where the width ratio of any two adjacent FETs is a constant. Fig. 2. FET chain with exponential tapering. Example FET chain with exponential tapering is shown in Fig. 2, transistor width is decreased by a fixed factor α with 0< α ≤1.

III. PROPOSED HILL TAPERING SCHEME In this scheme, at first all the transistors are scaled with a constant factor depending upon scaling factor (α) and the number of transistors (N) in the chain (more research is to be done on this constant scaling factor) then a combination of exponential and linear scaling is applied such that the load and ground are connected to the smallest and second smallest transistor respectively, the largest transistor is connected in the middle of the chain. This gives a hill like tapered structure of the FET chain shown in Fig. 3. Note that initially the width of largest transistor was ‘w’. Comparing Fig. 2 and Fig. 3(b) it is clear that the proposed scheme takes less physical area hence lower capacitance which means low power dissipation. Now an analytical delay model and transistor width function is required that could be integrated with automatic layout generator.

A. RC Model of serially connected FETs Using Elmore’s delay model for the FET chain shown in

Fig. 1, the discharge time of serially connected FET chain could be written by

∑−

=

=1

0

N

iD iii Rct

as ∑ ∑−

=

−

=

+=

1

0

1

0

N

i

N

ijD Lji Ccrt (1)

(a)

(b)

Fig. 3. Proposed Hill Tapering is shown in (a) along with an equivalent hill tapered model of Fig. 2 in (b). N is the number of transistors ri is the effective resistance of the ith FET, cj is the parasitic capacitance at node j and CL is the load capacitance. The discharge time can be written as the

To Load

GND

w

αw

α2w

α3w

α4w

α2wo

To Load

GND

αwo

wo

αw0

αw0

wo

αwo

α2wo

α3wo

α2wo

αwo

α2wo

To Load

GND

Con

stan

t

Expo

nent

ial

Con

stan

t

C02

1 α+( )

C02

α2 α3+( )

C02

α α2+( )

C02

α α2+( )

C02

α α2+( )

C02

1 α+( )

M3

M2

M1

M0

M1

M2

M3

y

Expo

nent

ial

Expo

nent

ial

Expo

nent

ial

(N even) w0=wα(N-1) / 2 and (N odd) w0=wαN / 2

(N even) w0=wα(N-1) / 2 and (N odd) w0=wαN / 2

IEEE ICM - December 2007

sum of products of each of the parasitic and load capacitors and their respective resistances to the ground

∑∑ ∑−

=

−

= =+

=

1

0

1

0 0

N

jjL

N

i

i

jjiD rCrct (2)

Further assuming that the effective resistance and parasitic capacitances are inversely proportional and proportional to the width of FETs, respectively. Therefore the parasitic capacitance between the ith and the (i+1)th FET is

2100

0++ ii ww

c where c is the unit capacitance and w0i is the

width of the ith FET. Under this assumption (2) can be rewritten as

∑∑ ∑−

=

−

= =

++

+=

1

0 0

1

0 0 0

1000 2

N

jL

N

i

i

j

ii

jjD w

rCwrww

ct (3)

where r is the unit effective resistance; if α is the tapering factor then the capacitance between the ith and (i+1)th node could be written as

( )10

01000 22

++ +=+ iiii wcwwc αα (4)

‘w0’ is now the width of the largest transistor shown in Fig. 3 and Fig.4 , increasing the channel width of the M0 transistor at position y will always bring down the discharge time. Hence the best size of this transistor is 1. Now for minimum delay ∂tD /∂w0i =0 for i=0, 1, 2...N-1

or 0112

0

1

12

0

0

0 00

=−−=∂∂ ∑∑

−

+== iL

N

ij i

ji

j ji

D

wC

w

www

t (5)

when N is infinitely large, the summation in (3) can be approximated as

( ) ( )∫∫∫ +=

1

0 00 0

1

000 )( dx

xwrCd

wrdxxwct L

x

D ττ

(6)

where w(x) is the normalized transistor width function; the approximation in (6) is valid as the continuous limit in which the FET chain is modeled as an RC sheet with uniformly distributed resistance and capacitance. Our problem is to find the optimal width function of an RC sheet with a capacitive load of CL as shown in Fig. 4. Let us define the resistance to the ground as

( ) ττ

dw

rxRx

∫=0 0

)( (7)

and transistor width function can be expressed in terms of R(x) as

)(')()(0 xR

r

xRdxd

rxw == (8)

w0 Fig. 4. RC sheet with fixed resistor there are three regions in Fig. 4 hence three width functions are required where region three is a constant and region one and two are exponentially decreasing width functions. Using (7) and (8), (6) can be transformed to

∫

+=

1

00 )('

)(')( dxxRC

xRxRrct LD

(9)

B. Finding the optimal shape of the transistor width function Two cases are there (i) when the transistor count is even

and (ii) when the transistor count is odd, we divide the RC sheet in two halves Case1: when even number of transistors are there then point y in Fig. 4 or the transistor number at position y in Fig. 3(a) is calculated as y=1+N/2. Case2: when even number of transistors are there then point y in Fig. 4 or the transistor number at position y in Fig. 3(a) is calculated as y=1+[(N-1)/2]. Width of RC sheet as a function of x for region I can be calculated (from bottom y as an exponentially decreasing function of x) as

x

IIIIII e

RRRrxw α

α−

++=

)()(

00

(10)

where the tapering factor α is given as

0

lnRR=α (11)

R is the total resistance that includes the resistance of the RC sheet and R0, RII and RIII are the resistances of region II and III respectively. Now functional form of [(RII + RIII) (x)] can be obtained as [(RII + RIII) (x)] = A.eα.x where A and α are two constants that can be determined by the boundary conditions

Con

stan

t Ex

pone

ntia

l Ex

pone

ntia

l

y

a

I II III

R0 GND

Load

R-(RII + RIII)

RII

RIII (constant)


given by [(RII + RIII) (0)] = R0 and [(RII + RIII) (1)] = R-RI A=R0, α=ln(R /R0) (12) if an RC sheet without a resistor at the bottom is considered, letting R0=0 we get A=0 from (12). Therefore, [(RII + RIII) (x)] =0, which means there is no physical solution, that is exponential shape is no longer the optimal shape. Hence in Fig. 4 from position a width is taken constant, where point a is calculated as a=y-1. x

IIIII e

RRrxw α

α−

+=

)()(

00

from x=0 to a (13)

w0III (x) is a constant and is equal to w0II (a), that is the two transistors close to the ground (at the bottom) are of same constant width.

IV. POWER DISSIPATION AND PROPAGATION DELAY CHARACTERISTICS OF TAPERING

SPICE simulation was carried out in a TSMC 0.18 µm (Level 49) technology at 1.8v, results for a six input domino AND gate in [2] are shown in Fig. 5 & 6. The maximum channel width was set to 20µm, we change the size of the output inverter to change the CL value of FET chain. Three different sizes of load inverters are used: 1x, 2x, 4x sized inverters, where a unit inverter has a pFET width of 2 µm and nFET width of 1 µm. The spacing between two adjacent transistors in the chain is set to 4λ according to MOSIS deep submicron rule. c0 and CL were calculated by (3),(5) in [2] as

GloadupDPpullDNnL

jswjo

CCCCCdCdwCc

+++=

+=

− int

00 .2.. (14)

Cj0 is the zero-bias bulk junction bottom capacitance, Cjsw is the zero-bias bulk junction sidewall capacitance and d is the length of the source/drain diffusion island between adjacent channels.

Fig. 5. Power dissipation as increasing functions of α and load inverter.

Fig. 6. Simulated waveforms showing discharge time and power dissipation for

different values of α, CL/c0 ratio is varied by changing the size of the inverter. From the simulated waveforms shown in Fig. 5 and Fig. 6, the superiority of the proposed tapering scheme in terms of power dissipation and delay could be easily verified.

REFERENCES [1] M. Shoji, “FET scaling in domino CMOS gates,” IEEE J. Solid-state Circuits, vol. SC-20, pp. 1067-1071, Oct. 1985. [2] B. S. Cherkauer and E. G. Friedman, “Channel Width Tapering of

Serially Connected MOSFET's with Emphasis on Power Dissipation”, IEEE Trans. Very Large Scale of Integration (VLSI) Systems, vol. 2, no. 1, pp. 100-114, March 1994.

[3] Li Ding and Pinaki Mazumdar, “On Optimal Tapering of FET Chains in High-Speed CMOS Circuits”, IEEE Trans.Circuits and Systems, vol. 48, no12, pp. 1099-1109, Dec. 2001.


Documents

[IEEE 2007 International Conference on Microelectronics - ICM - Cairo, Egypt (2007.12.29-2007.12.31)] 2007 Internatonal Conference on Microelectronics - Power aware channel width tapering