8
Cortex A9 Tri-core Implementation Specifications of the Processor: (Key Features) Functions (abilities) of the Design: Different modules in the design and its functions (in Brief) DMIPS of Processor Effective Speed of the Processor L1 and L2 access speed (cache coherent interconnect) Size of L1 and L2 memories. Working of tag memories (module) Physical Design Predictions: Design Size and Physical Design Observations: Gate Count: /india/proj_es2Arm/explore/pd/ca9/Run1_Pra7july_dwtsCcopt/ checknetlist.rpt Instance Count: Vt flavors in Netlist: Netlist Timing Status: /india/proj_es2Arm/explore/pd/ca9/ Run4_Pra9july_ShieldCcopt/LOG/../nk_rpt/prePlace/ cortex_a9_mp_pl310_prePlace.summary Number of macros: Macro Area: Number of Pins: Size of the Cores: Max Logic Depth

Rort

Embed Size (px)

DESCRIPTION

doodle

Citation preview

Page 1: Rort

Cortex A9 Tri-core Implementation

Specifications of the Processor: (Key Features)

Functions (abilities) of the Design: Different modules in the design and its functions (in Brief) DMIPS of Processor Effective Speed of the Processor L1 and L2 access speed (cache coherent interconnect) Size of L1 and L2 memories. Working of tag memories (module)

Physical Design Predictions:

Design Size and Physical Design Observations:

Gate Count: /india/proj_es2Arm/explore/pd/ca9/Run1_Pra7july_dwtsCcopt/checknetlist.rpt

Instance Count: Vt flavors in Netlist: Netlist Timing Status:

/india/proj_es2Arm/explore/pd/ca9/Run4_Pra9july_ShieldCcopt/LOG/../nk_rpt/prePlace/cortex_a9_mp_pl310_prePlace.summary

Number of macros: Macro Area: Number of Pins: Size of the Cores: Max Logic Depth Average Logic Depth Critical Paths in Synthesis and the Logic depth Clock gating percentage

Physical Design Flow:

Page 2: Rort

1. FloorPlan: Macro size and Placement Channel width Cpu core placement Pin Placement Reason for Rectilinear shape of both core and the wrapper (advantages)

2. Power Planning: Stripe Width Stripe spacing Layers used for stripes

3. Placement and Prects Opt Obtain module interaction Details Placement of the Modules with manual guidance. Set the required margins, max cap, and max trans values. Optimize incrementally using giga opt. Trial route the design and check Timing MMMC flow setup and usage Advantages by EDI Special settings of switches Timing Status and cells added Density and congestion status

4. CTS Clock Architecture (total no. clocks and architectural clock gates) Criticality in clock tree CCopt usage for cts Advantage of CCopt Timing status, Number of cells added NDR used, max trans, max cap Levels of clock tree. Placement and routing congestion status

5. Post CTS Post Cts optimization using giga opt engine and use NRGR.

Page 3: Rort

Control the max length of the nets Timing status, margins added and cell count increase. Status of routing congestion.

6. Route and Post Route Resources for routing. Criticality Nano router and giga opt optimizer usage. Timing Status DRC count (EDI vs Sign off) PDV status. Signal EM status. MMMC timing status

7. Timing Sign off (ECO – MMMC) ETS and EDI correlation DMMMC flow in ETS to fix all mode all corner timing Top Level Integration ECO round.

Init Stage:

MMMC flow: Timing Corners

Cworst: 125

Cbest: -40

Timing Modes:

Dft mbist

Dft_scan_shift

Page 4: Rort

Dft scan capture

Max Route Layer 8

Min route layer 1

<FF> ==============================================

<FF> Derating Factors

<FF> ==============================================

<FF> INFO: max:

<FF> INFO: data_cell_early = 0.975

<FF> INFO: clock_cell_early = 0.920

<FF> INFO: data_cell_late = 1.025

<FF> INFO: clock_cell_late = 1.025

<FF> INFO: min:

<FF> INFO: data_cell_early = 0.975

<FF> INFO: clock_cell_early = 0.975

<FF> INFO: data_cell_late = 1.025

<FF> INFO: clock_cell_late = 1.080

<FF> Extraction Options

<FF> -----------------------------------------------------

<FF> Relative C Threshold = 0.03

<FF> Total C Threshold = 0

<FF> Coupling C Threshold = 3

<FF> -----------------------------------------------------

Page 5: Rort

<FF> Signal Integrity Options

<FF> -----------------------------------------------------

<FF> Process = 40

<FF> Delta Delay Threshold = -1

<FF> Celtic Settings = set_virtual_attacker -mode current -gtol 0 -switch_prob 1 -skiptw; set_parm gtol 0.025 ; set_align_mode -mode peak ; set_thresh -delta_absolute 0

setTrialRouteMode -useNanoRoute true

setPlaceMode -maxDensity 0.95

setNanoRouteMode -drouteAutoStop false -routeWithSiDriven true -routeWithTimingDriven true

setNanoRouteMode -routeStrictlyHonorNonDefaultRule true

setOptMode -maxLength 400

------ Design Summary:

Total Standard Cell Number (cells) : 387277

Total Block Cell Number (cells) : 67

Total I/O Pad Cell Number (cells) : 0

Total Standard Cell Area ( um^2) : 1316979.30

Total Block Cell Area ( um^2) : 9138890.42

Total I/O Pad Cell Area ( um^2) : 0.00

------ Design Statistics:

Page 6: Rort

Number of Instances : 387344

Number of Nets : 1173300

Average number of Pins per Net : 1.14

Maximum number of Pins in Net : 59410

------ I/O Port summary

Number of Primary I/O Ports : 1760

Number of Input Ports : 954

Number of Output Ports : 806

Number of Bidirectional Ports : 0

Number of Power/Ground Ports : 2

Number of Floating Ports *: 65

Number of Ports Connected to Multiple Pads *: 0

Number of Ports Connected to Core Instances : 1695

specifySelectiveBlkgGate –cell

set_max_transition 0.2 [current_design]

set_max_fanout 32 [current_design]

Uncertainity 150ps

createSoftGuide

setPlaceMode -softGuideStrength high

setNanoRouteMode -drouteMinLengthForWireSpreading 250 -droutePostRouteSpreadWire true

Page 7: Rort

setPlaceMode -clkGateAware true

setPlaceMode -groupFlopToMacro true

setPlaceMode -groupFlopToMacroLevel 1

setPlaceMode -groupFlopToMacroStrength 500

################sh########################

setPlaceMode -groupFlopToGate true

setPlaceMode -groupFlopToGateHalfPerim 200

Power: Clock tree power reduction up to 30%, and total power reduction up to 10% Performance: Improvements up to 100 MHz for a GHZ design Area: Clock tree area reduction up to 30%