Timing Closure Using Latches

Embed Size (px)

Citation preview

  • 8/12/2019 Timing Closure Using Latches

    1/3

    SEPTEMBER 22, 2011 | EDN 47[www.edn.com]

    Digital blocks contain combinational andsequential circuits. Sequential circuits are thestorage cells with outputs that reflect the pastsequence of their input values, whereas the out-put of the combinational circuits depends onlyon the current input. Latches and flip-flops are

    common storage elements for these blocks.A latch is a level-sensitive storage cell that is transparent

    to signals passing from the D input to the Q output and thatholds the values of D on Q when the enable signal is false.Depending on the polarity of the enable input, latches haveeither a positive level or a negative level. A flip-flop, on theother hand, is an edge-triggered device that changes stateon the rising or the falling edge of an enable signal, suchas a clock. In a rising-edge-triggered flip-flop, the flip-flopsamples its input state only at the rising edge of the clock. Itthen maintains this sampled value until the next rising edgeof the clock. Designers typically prefer flip-flops over latchesbecause of this edge-triggered property, which simplifies the

    behavior of the timing and eases design interpretation.Latch-based designs, however, have smaller dice and aremore successful in high-speed designs in which the clockfrequency is in the gigahertz. In flip-flop-based high-speeddesigns, maintaining clock skew is a problem, but latches easethis problem. Hence, the use of flip-flops can limit the designsperformance when the slowest path limits the frequency ofthe design. When you consider process variation, latch-baseddesign is dramatically more tolerant of variations than is flip-flop-based design, resulting in better yield, allowing moreaggressive clocking than the equivalent design with flip-flops,or providing both of these benefits.

    USING LATCHES TO BORROW TIME

    Latches biggest advantage is that they allow a sufficientlylong combinational path, which determines the maximumfrequency of the design, to borrow some time from a shorterpath in subsequent latch-to-latch stages to meet its timinggoal. A level-sensitive latch is transparent during an activeclock pulse. The time-borrowing technique can also relaxthe normal edge-to-edge timing requirements of synchronousdesigns (Figure 1).

    A sample circuit has two timing paths (Figure 2). Path 1goes from a positive-triggered register (1) to a negative-levellatch (2). Path 2 goes from the latch to a positive-edge-triggered register (3). In the figure, borrowing compensates

    for the delay through the logic cloud (A). The logic in Path1 incurs a delay, and, depending on the length of that delay,two possible scenarios of timing analysis can emerge. Thesescenarios decide how much time the design can borrow(figures 3and 4).

    InFigure 3, data arrives from Logic A at Latch 2 before the

    falling edge of the clock at the latch. In this case, the behaviorof the latch is similar to that of a flip-flop, and the analysis issimple. You need not borrow any time to achieve your timinggoal. InFigure 4, the negative clock edge enables the latchbefore the arrival of the signal from Logic A at the input ofthe latch, so the latch enters transparent mode and for a timetransmits an undefined state from Logic A through to RegisterB. It is important that the new state from Logic A reaches andpasses through Logic B in time to meet the setup requirementsof Register 2. So, if Logic B has a short propagation delay, youcan, in effect, let Logic A have some of the time you reserved

    Latches and timing closure:

    a mixed bag

    BY ASHISH GOEL AND ATEET MISHRA FREESCALE

    Figure 1 The time-borrowing technique can relax the normal

    edge-to-edge timing requirements of synchronous designs.

    Figure 2 Path 1 goes from a positive-triggered register (1) to a

    negative-level latch (2). Path 2 goes from the latch to a positive-

    edge-triggered register (3).

    LATCHES HAVE THE EDGE OVER FLIP-FLOPS IN HIGH-FREQUENCY DESIGN.

    HERE ARE SOME HINTS ON APPLYING THEM TO YOUR NEXT DESIGN.

    1

    3PATH 1

    PATH 2A

    2

    B

    REGISTER 1LAUNCH EDGE

    ARRIVAL OFLOGIC A

    EXPECTED CAPTUREAT REGISTER 3

    TIME BORROWED

    LATCH-TRANSPARENTWINDOW

  • 8/12/2019 Timing Closure Using Latches

    2/3

    [www.edn.com]48 EDN | SEPTEMBER 22, 2011

    for Logic B, and the circuit will still work. Logic A borrowsthis extra time to complete its propagation delay. When Path2 is timed, the timing analysis considers the end of the bor-rowed time as the starting point for analyzing Logic Bs delay.

    Static-timing analysis generates timing reports accordingto the examples shown in figures 3and 4. However, the tim-ing when the latch is enabled is the same as if the latch weresimply a transparent delay element (Figure 5).

    TIME-BORROWING IN OCV

    In an ideal scenario, the time at the starting point shouldequal the time the latch borrows. Due to shrinking pro-cess technology, however, OCV (on-chip variation), signal-integrity, and other factors come into play. To increase theaccuracy of the analysis, you can also use CPPR (common-path-pessimism-removal) techniques. These factors compli-cate the relationship between time-borrowing and time forthe starting point. As a result, the timing analysis of latchesbecomes more challenging.

    Returning toFigure 4, youll note an interesting rela-tionship between time-borrowing and the starting-pointtime. The variables include clock uncertainties, clock-path

    pessimism due to OCV, and clock derating. During the tim-ing of Path C, T

    SP=T

    BU+CPPR, where T

    SPis the time for

    the starting point and TBis the time Path 1 borrows when

    constraining Logic A.Applied uncertainty in Path 1 is the uncertainty for the

    clock path of the latch, which is not part of pessimism whenthe latch is transparent. You thus remove that pessimismabout the latch clocks uncertainty from the start time.Similarly, you recover pessimism due to CPPR in the timefor the starting point because the same early or late path typeof latch-launch clock path is in Path 2. If you want to applyclock derating in the design during the timing of Path 2, youshould consider using early rather than late derating to makethe path the same as the capture clock of Path 2.

    EDA tools usually exhibit pessimistic behavior when tim-ing Path 2 because they dont consider CPPR, but they shouldnot apply that pessimism. Path 1s clock-path pessimism endsduring calculation of the start-point time, and again, youshould not retain this pessimism. The latch is transparent,so it acts as a combinational cell. In this case, you shouldconsider using CPPR between the starting point of Path 1and the ending point of Path 2. These tools yield extremelypessimistic results because they fail to consider that the useof pessimism is acceptable.

    You can also consider using the smallest value betweenthe CPPR of Path 1 and that of Path 2. This approach is notthe most accurate, but it provides another level of pessimismremoval. Comparing the common clock path of the registerand the latch in timing Path 1 versus the common clockpath of the latch and the endpointthe second registerintiming Path 2 can give an idea of the minimum possibility ofthe clock path between the register and the final endpoint.

    Once you ensure that the latch will be transparent duringpath timing, the least preferred, most accurate, and best way

    to judge the timing of latches is to make the latch transparentby using a case analysis on the enable pin of the latch. Afterthis step, the EDA tool can time the two segments as onecomplete path. This method is the least preferred because thelatch may not always be transparent when timing Path 1 inthe best-case condition: when time borrowing is unnecessary.

    REGISTER 1LAUNCH EDGE

    ARRIVAL OFLOGIC A

    EXPECTED CAPTUREAT REGISTER 3

    LOGIC APROPAGATION

    LOGIC BPROPAGATION

    TIME BORROWED=0

    REGISTER 1LAUNCH EDGE

    ARRIVAL OFLOGIC A

    EXPECTED CAPTUREAT REGISTER 3

    LOGIC APROPAGATION

    LOGIC BPROPAGATION

    TIME BORROWED=TT

    Figure 3 When Logic A is fast enough, no borrowing is necessary.

    Figure 4 Logic A borrows time from Logic B.

    1

    3PATH 1

    PATH 2A

    2

    B

    D TO Q ARCWILL BE ENABLED

    Figure 5 When the latch is enabled, it essentially becomes a

    passive delay.

    PATH 1 PATH 2BLOCK

    Figure 6A block interfaces to external latches.

    PATH 1 PATH 2BLOCK

    CLK

    Figure 7A block using latches interfaces to external registers.

  • 8/12/2019 Timing Closure Using Latches

    3/3

    SEPTEMBER 22, 2011 | EDN 49[www.edn.com]

    The tool also misses all the paths thatdo not require time borrowing and holdsa time check at the latchs endpoint.

    PARTITIONING CHALLENGES

    Some challenges occur in hierarchicaldesign when the blocks have latch-basedinterfaces. The timing tools require helpto understand when it is possible toborrow time across a block boundary.The first challenge is to enable time-borrowing for the ports that you havebudgeted for timing. When timing ablock, you can model the ports that areentering or exiting latches at the toplevel of the SOC (system on chip) byusing their proper I/O delays and thelevel-sensitive option in the EDA tool.Consider the case for Path 2 (Figure6). Without the level-sensitive option,this path could be critical at the blocklevel. By defining the output delay atthe output port with the level-sensitiveoption, the timing tool can borrow timefrom the input stage of the next block,and this ability relaxes the timing onthe output port.

    Next, consider a case in which thelatches are inside rather than outsidethe block (Figure 7). Path 1 has nospecial requirements for closing theblock, but you must define all types ofclock latencyrise, fall, minimum, andmaximum timesfor the CLK pin. This

    approach helps you correctly calculatethe time of the starting point employ-ing OCV and CPPR. In this way, youllget no surprises when you merge theblock at the top level. Another chal-lenge arises when you use the timingmodels for top-level execution. You canenable time-borrowing through bound-ary latches by using gray-box ETMs(extracted timing models), which pre-serve the boundary latch and generateETM libraries.

    In summary, latches are beneficial

    for high-speed-SOC designs, but theiruse adds challenges in static-timinganalysis, especially with hierarchicaldesign. The limitations of EDA toolsincrease the complexities of latch-baseddesign. You can employ latches in SOCsonly after careful analysis. You can thenapply some of these techniques, whichcan reduce the complexities of design-ing with latches.EDN

    ACKNOWLEDGMENT

    This article originally appeared on EDNs

    sister site, EDA Designline (http://bit.ly/p3aN3C).

    AUTHORS BIOGRAPHIES

    Ashish Goel is a lead design engineer atFreescale in India. He has 11 years of in-dustry experience in static-timing analy-sis, RTL (register-transfer-level) design,physical des ign, and formal technolo-gies. Previously, he worked at STMicro-

    electronics, Agilent Technologies, and In-fineon Technologies. Goel holds multiplepatents in FPGA architecture.

    Ateet Mishra is a senior design engineer atFreescale in India, where he has workedfor six years. He has experience in static-timing analysis, physical design, and syn-thesis. He has successfully taped out mul-tiple SOCs.