CHAPTER 2: INSTRUCTION SET PRINCIPLES · 2012. 5. 29. · Addressing modes Addressing Modes Example...

Prepared by

Mdm Rohaya binti Abu Hassan

CHAPTER 2: INSTRUCTION SET

PRINCIPLES

Chapter 2: Instruction Set Principles • Instruction Set Architecture

• Classification of ISA/Types of machine

• Primary advantages and disadvantages of each class of machine

• Classification of General Purpose Register Machines

• Addressing modes

• Aligning Addresses

• Interpreting Memory Addresses

• Addressing Modes for Desktops and Servers

• Addressing Mode Usage (VAX)

• DLX Instruction Set

Instruction Sets • What is an instruction set?

• Set of all instructions understood by the CPU

• Each instruction directly executed in hardware

Instruction Set Architecture (ISA) • Definition:

The instruction set architecture is informally the set of programmer visible registers and address spaces and the set of instructions that can operate on them.

Instruction Set Architecture (ISA) • Instruction set architecture of a machine fills the semantic gap

between the user and the machine.

• The ISA specifies the size of main memory, number of registers, and number of bits per instruction.

• It also specifies exactly which instructions the machine is capable of performing and how each of the instruction bits is interpreted.

Instruction Set Architecture (ISA) • It is all of the programmer-visible components and operations of

the computer

• The ISA provides all the information needed for someone to write a program in machine language

• translate from a high-level language to machine language

Instruction Set Architecture (ISA) • ISA serves as the starting point for the design of a new machine or

modification of an existing one.

Instruction Set Architecture (ISA)

instruction set

software

hardware

Instruction Set Architecture (ISA)

Classification of ISA • The type of internal storage in the CPU is the most basic

differentiation.

• The major choices are

• a stack

the operands are implicitly on top of the stack

• an accumulator

one operand is implicitly the accumulator

• a set of registers

all operands are explicit either registers or memory

locations

Classification of ISA • Stack Architecture:

Operands are implicit. They are on the top of the stack. For example, a binary operation pops the top two elements of the stack, applies the operation and pushes the result back on the stack.

Classification of ISA • Accumulator Architecture:

One of the operands is implicitly the accumulator. Usually one of the source operand and the destination operand is assumed to be the accumulator.

Classification of ISA • General Purpose Register Architecture:

Operands are explicit: either memory operands or register operands. Any instruction can access memory.

Load/Store Architecture:

Only load/store instructions can access memory. All other instructions use registers. Also referred to as register- register architecture.

Types of Machines

Code Sequence for C=A+B

Stack Accumulator Register-memory Register-register

Push A Load A Load R1, A Load R1, A Push B Add B Add R1, B Load R2, B Add Store C Store C Add R3, R1, R2 Pop C Store C, R3

Classification of ISA • While most early machines used stack or accumulator-style

architectures, all machines designed in the past ten years use a general purpose architecture.

• Stack architecture : Early machines

• Accumulator architecture : Early machines

• General purpose register (GPR) architecture : machines after 1980.

Classification of ISA Reasons for emergence of general-purpose register (GPR) machines17

1. Registers are faster than memory

2. Registers are easily used by a compiler and used more effectively.

1. Example: (A*B)-(C*D)-(E*F) for stack machine? for GPR machine?

3. Registers can be used to hold variables: Reduce memory traffic, improve code density, speed up program.

Classification of General Purpose Register Machines

• There are two major instruction set characteristics that divide

GPR architectures.

1. Concerns whether an ALU instruction has two or three

operands

2. how many of the operands may be memory addressed in ALU instruction

Types of GPR Machines

Number of memory address

Max # of operands allowed

Examples

0 3 SPARC, MIPS, PA, PowerPC

1 2 Intel 80X86, Motorola 68000

2 2 VAX

3 3 VAX

Classification of General Purpose Register Machines

1. They concern whether an ALU instruction has two or

three operands

• Example:

• ADD R3, R1, R2 R3 <-R1 + R2

• or

• ADD R1, R2 R1 <- R1 + R2

2 operands, R1 and R2

3 operands, R1.R2 and R3

Classification of General Purpose Register Machines 2. how many of the operands may be memory addressed in ALU

instruction

• Register- Register (Load/Store)

• ADD R3, R1, R2 (R3 <- R1 + R2)

• Register - Memory

• ADD R1, A (R1 <- R1 + A)

• Memory - Memory

• ADD C, A, B (C <- A + B)

Primary advantages and disadvantages of each class of machine

Machine Type

Advantages Disadvantages

Stack Simple model of expression evaluation. Good code density.

A stack can't be randomly accessed. It makes it difficult to generate efficient code.

Accumulator Minimizes internal state of machine. Short instructions

Since accumulator is only temporary storage, memory traffic is highest.

Register Most general model for code generation

All operands must be named, leading to longer instructions.

Addressing modes • ISA design must define how memory addresses are interpreted and

specified in the instructions.

• Addressing modes are the ways how architectures specify the address of an object they want to access.

• In GPR machines, an addressing mode can specify a constant, a register or a location in memory.

Addressing modes Addressing Modes Example Instruction Meaning When used

Register Add R4,R3 R4 <- R4 + R3 When a value is in a register

Immediate Add R4, #3 R4 <- R4 + 3 For constants

Displacement Add R4, 100(R1) R4 <- R4 + M[100+R1] Accessing local variables

Register deffered Add R4,(R1) R4 <- R4 + M[R1] Accessing using a pointer or a computed address

Indexed Add R3, (R1 + R2) R3 <- R3 + M[R1+R2]

Useful in array addressing: R1 - base of array R2 - index amount

Direct Add R1, (1001) R1 <- R1 + M[1001] Useful in accessing static data

Addressing modes Addressing Modes

Example Instruction

Meaning When used

Memory deferred Add R1, @(R3) R1 <- R1 + M[M[R3]] If R3 is the address of a pointer p, then mode yields *p

Auto- increment

Add R1, (R2)+ R1 <- R1 +M[R2] R2 <- R2 + d

Useful for stepping through arrays in a loop. R2 - start of array d - size of an element

Auto- decrement

Add R1,-(R2) R2 <-R2-d R1 <- R1 + M[R2]

Same as auto increment. Both can also be used to implement a stack as push and pop

Scaled Add R1, 100(R2)[R3]

R1<-R1+M[100+R2+R3*d]

Used to index arrays. May be applied to any base addressing mode in some machines.

Addressing modes - Notation <- - assignment M - the name for memory: M[R1] refers to contents of memory location whose address is given by the contents of R1

Interpreting Memory Addresses • How is a memory address interpreted?

• Byte addressed: Provide access for bytes (8 bits), half words (16 bits), words (32 bits) , and double words (64 bits)

• Conventions for ordering the bytes within a word:

• Little Endian: put byte whose address xxxx00 at LSB position.

• followed by DEC and Intel

Word address Data

0 3 2 1 0

4 7 6 5 4

• Big Endian: Put byte whose address xxxx00 at MSB position.

• followed by IBM, Motorola and others

Word address Data

0 0 1 2 3

4 4 5 6 7

Example • To store a word in byte-addressable memory (i.e. where each

element of memory is one byte), you have to break up the 32 bit quantity into 4 bytes.

• Thus, if the word was 0x01ab23cd, it's broken up into 0x01, 0xab, 0x23, 0xcd.

Interpreting Memory Addresses •When operating within one machine, the byte

order is often unnoticeable - only programs that access the same locations as both words and bytes can notice the difference.

•However, byte order is a problem when exchanging data among machines with different ordering.

Aligning Addresses • In some machines, accesses to objects larger than a byte must be

aligned . An access to an object of size s bytes at byte address A is aligned if A mod s = 0.

Object Addressed Aligned at Byte Offset

Misaligned at Byte Offset

byte 0,1,2,3,4,5,6,7 never

halfword 0,2,4,6 1,3,5,7

word 0,4 1,2,3,5,6,7

doubleword 0 1,2,3,4,5,6,7

Aligning Addresses

Quantity Address divisible by (Binary) address ends in

Byte 1 anything

Halfword (16 bits) 2 0

Word (32 bits) 4 00

Doubleword (64 bits) 8 000

Aligning Addresses

Aligning Addresses • Misalignment causes hardware complications, since the memory is

typically aligned on a word boundary.

• A misaligned memory access will, therefore, take multiple aligned memory references.

• Misalignment typically results in an alignment fault that must be handled by the OS

Addressing Mode • How architectures specify the address of an object they will access?

• In a GPR, an addressing mode can specify

• a constant,

• a register,

• a location in memory (used to compute effective address).

• Immediate or literals are usually considered as memory addressing mode.

• Addressing modes that depend on the program counter is called PC-relative addressing.

• Addressing modes can significantly reduce instruction counts, but may add to the complexity of building a machine and increase the average CPI.

Addressing Modes for Desktops and Servers

Register ADD R4, R3

Immediate ADD R4, #3

Displacement ADD R4, 100(R1)

Register Indirect ADD R4, (R1)

Indexed ADD R3, (R1+R2)

Direct (Absolute) ADD R1, (1001)

Memory Indirect ADD R1, @(R3)

Autoincrement ADD R1, (R2)+

Autodecrement ADD R1, -(R2)

Scaled ADD R1, 100(R2)[R3]

Addressing Mode Usage (VAX)

Operations in the Instruction Set • Data transfer instructions.

• Arithmetic and logic instructions.

• Instructions for control flow: conditional and

• unconditional branches, jumps, procedure calls

• and procedure returns.

• System calls.

• Floating point instructions.

• Decimal instructions.

• String instructions.

• Graphics instructions

DLX •The DLX(pronounced "Deluxe") is a RISC processor

architecture designed by John L. Hennessy and David A. Patterson, the principal designers of the MIPS and the Berkeley RISC designs (respectively), the two benchmark examples of RISC design.

•The DLX is essentially a cleaned up and simplified MIPS, with a simple 32-bit load/store architecture. Intended primarily for teaching purposes, the DLX design is widely used in university-level computer architecture courses.

DLX Instruction Set • The architecture of DLX was chosen based on observations about

most frequently used primitives in programs. DLX provides a good architectural model for study, not only because of the recent popularity of this type of machine, but also because it is easy to understand.

• Like most recent load/store machines, DLX emphasizes

• A simple load/store instruction set

• Design for pipelining efficiency

• An easily decoded instruction set

• Efficiency as a compiler target

DLX’s Operation 1. Load/Store

Any of the GPRs or FPRs may be loaded and stored except that loading R0 has no effect.

2. ALU Operations All ALU instructions are register-register instructions. The operations are : - add, subtract , AND , OR , XOR ,shifts Compare instructions compare two registers (=,!=,<,>,=<,=>). If the condition is true, these instructions place a 1 in the destination register, otherwise they place a 0.

DLX’s Operation • Branches/Jumps

All branches are conditional.The branch condition is specified by the instruction, which may test the register source for zero or nonzero.

• Floating-Point Operations - add - subtract - multiply - divide

DLX’s Operation • There are four classes of instructions:

1. Load/Store

2. ALU Operations

3. Branches/Jumps

4. Floating-Point Operations

DLX Instruction Set opcode Instruction meaning

Data transfers

Move data between registers and memory, or between the integer and FP or special register; only memory address mode is 16-bit displacement + contents of a GPR

LB, LBU, SB Load byte, load byte unsigned, store byte

LH, LHU, SH Load halfword, load halfword unsigned, store halfword

LW, SW Load word, store word (to/from integer registers)

LF, LD, SF, SD

Load SP float, load DP float, store SP float, store DP float (SP - single precision, DP - double precision)

MOVI2S, MOVS2I

Move from/to GPR to/from a special register

MOVF, MOVD

Copy one floating-point register or a DP pair to another register or pair

MOVFP2I, MOVI2FP

Move 32 bits from/to FP tegister to/from integer registers

DLX Instruction Set opcode Instruction meaning

Arithmetic / Logical Operations on integer or logical data in GPRs; signed arithmetics trap on overflow

ADD, ADDI, ADDU, ADDUI Add, add immediate (all immediates are 16-bits); signed and unsigned

SUB, SUBI, SUBU, SUBUI Subtract, subtract immediate; signed and unsigned

MULT, MULTU, DIV, DIVU Multiply and divide, signed and unsigned; operands must be floating-point registers; all operations take and yield 32-bit values

AND, ANDI And, and immediate

OR, ORI, XOP, XOPI Or, or immediate, exclusive or, exclusive or immediate

LHI Load high immediate - loads upper half of register with immediate

SLL, SRL, SRA, SLLI, SRLI, SRAI

Shifts: both immediate(S__I) and variable form(S__); shifts are shift left logical, right logical, right arithmetic

S__, S__I Set conditional: "__"may be LT, GT, LE, GE, EQ, NE

DLX Instruction Set

opcode Instruction meaning

Control Conditional branches and jumps; PC-relative or through register

BEQZ, BNEZ Branch GPR equal/not equal to zero; 16-bit offset from PC

BFPT, BFPF Test comparison bit in the FP status register and branch; 16-bit offset from PC

J, JR Jumps: 26-bit offset from PC(J) or target in register(JR)

JAL, JALR Jump and link: save PC+4 to R31, target is PC-relative(JAL) ot a register(JALR)

TRAP Transfer to operating system at a vectored address

RFE Return to user code from an exception; restore user code

DLX Instruction Set

Floating point Floating-point operations on DP and SP formats

ADDD, ADDF Add DP, SP numbers

SUBD, SUBF Subtract DP, SP numbers

MULTD, MULTF Multiply DP, SP floating point

DIVD, DIVF Divide DP, SP floating point

CVTF2D, CVTF2I, CVTD2F, CVTD2I, CVTI2F, CVTI2D

Convert instructions: CVTx2y converts from type x to type y, where x and y are one of I(Integer), D(Double precision), or F(Single precision). Both operands are in the FP registers.

__D, __F DP and SP compares: "__" may be LT, GT, LE, GE, EQ, NE; set comparison bit in FP status register.

THANK YOU

CHAPTER 2: INSTRUCTION SET PRINCIPLES · 2012. 5. 29. · Addressing modes Addressing Modes Example...

Documents

Oilon 3 esite FI 030706 LAL1.25 LAL1.25 LAL1.25 LAL1.25 Öljyletkun liitäntäyhde - imu R3/ 4”R3/ 4”R3/ 4”R3/ 4 ” - paluu R1/ 2”R1/ 2”R1/ 2”R1/ 2” Öljypumppu TA2

ZONIFICACIÓN ORDENANZA 3944/2018€¦ · dique frias zanjon frias cricyt canal del oeste referencias 3687/07 ord. zu1 zu2 r1 r3 r3 r1 r1 r1 r6 r2 r1 r6 r2 r2 r6 r6 c1 c1 c1 r2 cen1

LC3 Intro/Review - Georgetown Universitypeople.cs.georgetown.edu/.../Lec-1b-LC3intro.pdf · 2012-08-21 · .orig x3000 ld r1, six ld r2, number and r3,r3,#0 again add r3,r3,r2 add

1 Tomasulo’s Algorithm - ETH Z · format: \opcode destination (source1, source2." ADD R1 R2, R3 MUL R4 R1, R7 MUL R5 R4, R0 MUL R6 R2, R1 ADD R1 R3, R6 (b) Now assume that the machine

Futsal Session Plans. Futsal Session Plan – Attacking Organization R1 receives ball from GK R1 plays pass to R3 R4 tries to receive pass down line R3

R1 PTAP Santa Apolonia R3 R5 27 - yanacocha.com · expediente técnico, suministro e instalación de un ... R1 R3 Clori˜cación río Hornomayo río Porconcillo río Grande río Purhuay

RMG Study Group...Dec 01, 2016 · Adamczyk et al. Theor Chem Acc 128, 2011, 91-113 X is H or Si X H:Si R1 R2 X H Si R1 R2 ⇌ Si. R1 R3 R2.R4 Si R1 R3 R2 R4 ⇌ H Si R1 R3 R2.R4

Example instruction Instruction Name Meaning (RTL Language) ADD R1, R2, R3 AddRegs[R1]

MOVI Voice Control Shield for Arduino Ⓡboards User’s Manual...Uno R1 and R2, MEGA2560 R1 and R2, Leonardo R1 and R2 Uno R3, Mega2560 R3, Leonardo R3 Freeduino Olimexino-328 Diavolino

Vectores r1 y r3

Midterm 2 Review - Colorado State Universitycs270/.Fall18/slides/LectureMT2Revi… · Add R3, R3, #1. 5-5 Load and Store instructions Example: LD R1, Label1 R1 is loaded from memory

CS 450 Module R3. Today's Agenda R1 and R2 review Module R3 Introduction Schedule R1 Grading Next Week Module R4 Introduction

termalismo 2017.qxp Maquetación 1 28/11/16 14:45 Página 1 · r1 r1 r1 r1 r1 r1 r1 r2 r2 r3 r1 r1 r3 r3 r3 r4 r4 r4 r2 r3 r2 r1 r1 r3 r3 r3 r3 r4 r4 r4 r1 r1 r1 r1 r2 r2 r2 r2 r2

R3 814 R3 008 R1 526 R3 278 R2 577 R1 310 R11 436 R9 024 R4 … · 2019-11-15 · 8 antenatal consultations with your gynaecologist, GP or midwife Two 2D ultrasound scans including

19160 CiCe Discern Bias r1.qxd:11476 CiCe guidelines r3

Reboss R1 R3 R4 shoes

Instruction Level Parallelism€¦ · ADD R2, R2, R3 ADD R12, R13, R3 LW R3, 0(R1) ADDI R5, R3, 1 ADD R2, R2, R3 LW R13, 0(R11) ADD R12, R13, R3 stall stall Original Program Scheduled

R1 / R3 LIFECYCLE EXERCISE BIKES BASE USER MANUAL - Costco · R1 / R3 LIFECYCLE® EXERCISE BIKES BASE USER MANUAL. CORPORATE HEADQUARTERS 5100 North River Road Schiller Park, Illinois

Wireless Audio - 360 R5/R3/R1 - Supersonido...Wireless Audio - 360 R5/R3/R1 WAM5500/WAM3500/WAM1500 Manual del usuario imagine las posibilidades Gracias por adquirir un producto Samsung

2020 MEDIA KIT - Farmer's Weekly...r8 000 r5 000 r1 500 r4 000 r7 000 r61 900 discount r3 460 r3 440 r4 000 r3 000 r0 r3 000 r3 500 r20 400 cost r13 340 r16 160 r4 000 r2 000 r1 500