16
Introducing Intel® Tremont Microarchitecture Stephen Robinson | Senior Principal Engineer | Intel® Linley Fall Processor Conference 2019 – October 24, 2019

Introducing Intel® Tremont Microarchitecture · 2019-10-23 · Out of order front end and 10 wide execution port back-end Significant IPC improvement vs. prior Intel low power x86

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Introducing Intel® Tremont Microarchitecture · 2019-10-23 · Out of order front end and 10 wide execution port back-end Significant IPC improvement vs. prior Intel low power x86

Introducing Intel® Tremont MicroarchitectureStephen Robinson | Senior Principal Engineer | Intel®

Linley Fall Processor Conference 2019 – October 24, 2019

Page 2: Introducing Intel® Tremont Microarchitecture · 2019-10-23 · Out of order front end and 10 wide execution port back-end Significant IPC improvement vs. prior Intel low power x86

Tremont: Top-level Design Targets

Single thread performanceNetworking Performance/mW Performance/mm2

New instructions

Battery life Performance/mW

2

Page 3: Introducing Intel® Tremont Microarchitecture · 2019-10-23 · Out of order front end and 10 wide execution port back-end Significant IPC improvement vs. prior Intel low power x86

Design Target: Single Thread Performance

3

Intel® Core™ class branch prediction6 wide out of order instruction decode4 wide allocation10 execution portsDual load/store pipelinesQuad-core moduleL2 cache up to 4.5MB Size is product dependent

Page 4: Introducing Intel® Tremont Microarchitecture · 2019-10-23 · Out of order front end and 10 wide execution port back-end Significant IPC improvement vs. prior Intel low power x86

Front End

44

Page 5: Introducing Intel® Tremont Microarchitecture · 2019-10-23 · Out of order front end and 10 wide execution port back-end Significant IPC improvement vs. prior Intel low power x86

Core™ class branch prediction Long history 32 byte based L1 predictor (no penalty) Large L2 predictor

Out of order fetch 32KB instruction cache 32 bytes / cycle Up to 8 outstanding misses

Front End: Fetch, Predict

5

Page 6: Introducing Intel® Tremont Microarchitecture · 2019-10-23 · Out of order front end and 10 wide execution port back-end Significant IPC improvement vs. prior Intel low power x86

Front End: Decode

6-wide x86 instruction decode Dual 3-wide clusters Out of order Wide decode without the area of

a uop cache Optional single cluster mode based on

product targets

6

Page 7: Introducing Intel® Tremont Microarchitecture · 2019-10-23 · Out of order front end and 10 wide execution port back-end Significant IPC improvement vs. prior Intel low power x86

Integer Execution

Large entry out of order window (>200)

Parallel reservation stations (6)

Wide execution (7) 3 ALU 2 AGU 1 jump 1 store data

7

Page 8: Introducing Intel® Tremont Microarchitecture · 2019-10-23 · Out of order front end and 10 wide execution port back-end Significant IPC improvement vs. prior Intel low power x86

Vector Execution

Crypto acceleration Dual 128b AES units, 4 cycle Single instruction SHA256, 4 cycle Galois Field new instructions

Parallel reservation stations (2)Execution ports (3) SIMD/AES/FMUL SIMD/AES/FADD Store data

8

Page 9: Introducing Intel® Tremont Microarchitecture · 2019-10-23 · Out of order front end and 10 wide execution port back-end Significant IPC improvement vs. prior Intel low power x86

Memory Execution

Dual load/store pipeline32KB data cache Three cycle load to use

1024 entry second level TLB Shared between code and data

9

Page 10: Introducing Intel® Tremont Microarchitecture · 2019-10-23 · Out of order front end and 10 wide execution port back-end Significant IPC improvement vs. prior Intel low power x86

Memory Subsystem

L2 shared among 1-4 cores Configurable from 1.5MB to 4.5MB

Last level cache support Inclusive Non-inclusive

Intel® Resource Directory Technology L2 QoS including code/

data prioritization LLC QoS Memory bandwidth enforcement

10

Page 11: Introducing Intel® Tremont Microarchitecture · 2019-10-23 · Out of order front end and 10 wide execution port back-end Significant IPC improvement vs. prior Intel low power x86

New Instructions and Technologies

Accelerator interfacing Instructions Efficient and scalable work-dispatch and

synchronization to accelerators

Intel® Speed Shift Improved responsiveness with faster

hardware controlled frequency changes

11

CPU rooted secure boot Intel® Trusted Execution Technology Intel® Boot Guard

Intel® Total Memory Encryption Improve confidentiality protection in

memory from physical attacks

Page 12: Introducing Intel® Tremont Microarchitecture · 2019-10-23 · Out of order front end and 10 wide execution port back-end Significant IPC improvement vs. prior Intel low power x86

ISO-Frequency Single Thread Performance Improvement(Relative to Goldmont Plus)

12Based on performance projections as of October 2019 and subject to change, with SOCs at ISO frequenciesFor more information about performance and benchmark results go to www.intel.com/benchmarks.

Single Thread Performance – Components of SPECint* Rate Base 2006/2017 & SPECfp* Rate Base 2006/2017

Page 13: Introducing Intel® Tremont Microarchitecture · 2019-10-23 · Out of order front end and 10 wide execution port back-end Significant IPC improvement vs. prior Intel low power x86

Performance - Hybrid

13

Based on pre-silicon projections as presented at Hotchips presentation, August 2019 and subject to changeFor more information about performance and benchmark results go to www.intel.com/benchmarks.

Page 14: Introducing Intel® Tremont Microarchitecture · 2019-10-23 · Out of order front end and 10 wide execution port back-end Significant IPC improvement vs. prior Intel low power x86

Tremont: Intel’s next generation low power x86 microarchitecture

Advancements on ISA, microarchitecture, security, and power management

Out of order front end and 10 wideexecution port back-end

Significant IPC improvement vs. prior Intel low power x86 architectures

Targeting a wide variety of products across client, data center, 5G networking and Internet of Things

Summary

14

Page 15: Introducing Intel® Tremont Microarchitecture · 2019-10-23 · Out of order front end and 10 wide execution port back-end Significant IPC improvement vs. prior Intel low power x86
Page 16: Introducing Intel® Tremont Microarchitecture · 2019-10-23 · Out of order front end and 10 wide execution port back-end Significant IPC improvement vs. prior Intel low power x86

Notices and Disclaimers• Performance results for ISO Frequency Single thread performance are performance projections based on preliminary development board measurements as

of October 2019 and subject to change. Performance results for hybrid performance are based on pre-silicon projections as of August 2019 and are subject to change, may not reflect all publicly available security updates. No product can be absolutely secure

• All information provided here is subject to change without notice. Contact your Intel representative to obtain the latest Intel product specifications, roadmaps, and related information.

• Intel and the Intel logo are trademarks of Intel Corporation or its subsidiaries

• © Intel Corporation 2019