Silberschatz and Galvin Chapter 8

1

CPSC 410--Richard Furuta 2/24/99 1

Silberschatz and GalvinChapter 8

Memory Management


Memory Management

¥ Goal: permit different processes to sharememory--effectively keep several inmemory at the same time

¥ Eventual meta-goal: users develop programsin what appears to be their own infinitely-large address space (i.e., starting at address0 and extending without limit)

2


Memory Management

¥ Initially we assume that the entire program must be inphysical memory before it can be executed.

Ð How can we avoid including unnecessary routines?Programs consist of modules written by several peoplewho may not [be able to] communicate their intentionsto one another.

¥ Reality:

Ð primary memory has faster access time but limited size;secondary memory is slower but much cheaper.

Ð program and data must be in primary memory to bereferenced by CPU directly


Multistep Processing of UserProgram

SourceProgram

compiler orassembler

Objectmodule

Objectmodule

linkageeditor

Objectmodule

. . .

. . .

Loadmodule

Loadmodule

systemlibraries

loader

memoryimage

Compile time ----------------Load time---------------

3



dynamiclibraries

memoryimage

Output

Run Time (execution time)



¥ Binding: associate location with object inprogram.

¥ For example, changing addresses in a userÕsprogram from logical addresses to real ones

¥ More abstractly, mapping one address space toanother

¥ Many things can be bound in programminglanguages; we are concentrating on memoryaddresses here.

4


Binding

¥ Typically

Ð compiler binds symbolic names (e.g., variable names)to relocatable addresses (i.e., relative to the start of themodule)

Ð linkage editor may further modify relocatable addresses(e.g., relative to a larger unit than a single module)

Ð loader binds relocatable addresses to absolute addresses

¥ Actually, address binding can be done at any point in adesign


When should binding occur?

¥ binding at compile timeÐ generates absolute code. Must know at compile time where the

process (or object) will reside in memory. Example: *0 in C.Limits complexity of system.

¥ binding at load timeÐ converts compilerÕs relocatable addresses into absolute addresses

at load time. The most common case. The program cannot bemoved during execution.

¥ binding at run timeÐ process can be moved during its execution from one memory

segment to another. Requires hardware assistance (discussedlater). Run-time overhead results from movement of process.

5


When should loading occur?

¥ Recall that loading moves objects into memory

¥ Load before execution

Ð load all routines before runtime starts

Ð straightforward scheme

¥ Load during execution--Dynamic loading

Ð loads routines on first use

Ð note that unused routines (ones that are not invoked) arenot loaded

Ð Implement as follows: on call to routine, check if theroutine is in memory. If not, load it.


When should linking occur?

¥ Recall that linking resolves references among objects.

¥ Standard implementation: link before execution (hence allreferences to library routines have been resolved beforeexecution begins). Called static linking.

¥ Link during execution: dynamic linking

Ð memory resident library routines

Ð every process uses the same copy of the library routines

Ð hence linking is deferred to execution time, but loadingis not necessarily deferred

6


Dynamic Linking

¥ Implementation of dynamic linking

Ð library routines are not present in executable image.Instead stubs are present.

Ð stub: small piece of code that indicates how to locatethe appropriate memory-resident library routine (or howto load it if it is not already memory-resident)

Ð first time that a routine is invoked, stub locates (andpossibly loads) routine and then replaces itself with theaddress of the memory-resident library routine


Dynamic Linking

¥ Also known as shared libraries

¥ Savings of overall memory (one copy of library routine) and of diskspace (library routines are not in executable images).

¥ Expense: first use is more expensive

¥ Problem: incompatible versions

Ð Can retain version number to distinguish incompatible versions oflibrary. Alternative is to require upward compatibility in libraryroutines.

Ð If there are different versions, then you can have multiple versionsof routine in memory at same time, counteracting a bit of thememory savings.

¥ Example: SUN OSÕs shared libraries

7


Overlays

¥ So far, the entire program and data of processmust be in physical memory during execution.

¥ Ad hoc mechanism for permitting process to belarger than the amount of memory allocated to it:overlays

¥ In effect keeps only those instructions and data inmemory that are in current use

¥ Needed instructions and data replace those nolonger in use


OverlaysExample

Common routines

Overlay driver

Main Routine A Main Routine BOverlay Area

Common data

8


Overlays

¥ Overlays do not require special hardwaresupport--can be managed by programmer

¥ Programmer must structure programappropriately, which may be a difficulty

¥ Very common solution in early days ofcomputers. Now, probably dynamicloading and binding are more flexible

¥ Example: Fortran common


Logical versus PhysicalAddress Space

¥ logical address: generated by the CPU (logical addressspace)

¥ physical address: loaded into the memory address registerof the memory (physical address space)

¥ compile-time and load-time address binding: logical andphysical addresses are the same

¥ execution-time address binding: logical and physicaladdresses may differ

Ð in this case, logical address referred to as virtualaddress

9


Mapping from Virtual to PhysicalAddresses

¥ Run-time mapping from virtual to physical addresshandled by the Memory Management Unit (MMU), ahardware device

¥ Simple MMU scheme

Ð relocation register containing start position of processin memory

Ð value in relocation register is added to every addressgenerated by a user process when it sent to memory


14000

CPU +logical

address

346

physicaladdress

14346

memory

Dynamic (binding) relocation usinga relocation register

relocationregister

mmu

10


Logical Address Space versusPhysical Address Space

¥ User programs only see the logical addressspace, in range 0 to max

¥ Physical memory operates in the physicaladdress space, addresses in the range R+0 toR+max

¥ This distinction between logical andphysical address spaces is a key one formemory management schemes.


Swapping

¥ What: temporarily move inactive process tobacking store (e.g., fast disk). At some later time,return it to main memory for continued execution.

¥ Why: permit other processes to use memoryresources (hence each process can be bigger)

¥ Who: decision of what process to swap made bymedium-term scheduler

11


Schematic view of Swapping


Swapping

¥ Some possibilities of when to swap

Ð if you have 3 processes, start to swap one out when itsquantum expires while two is executing. Goal is tohave third process in place when twoÕs quantum expires(i.e., overlap computation with disk i/o)

Ð context switch time is very high if you canÕt achievethis

¥ Another option: roll out lower priority process in favor ofhigher priority process. Roll in the lower priority processwhen the higher priority one finishes

12


Swapping

¥ If you have static address binding (i.e., compile or loadtime binding) have to swap process back into samememory space. Why?

¥ If you have execution-time address binding, then you canswap the process back into a different memory space.

¥ Disk is slow and the transfer time needed is proportional tothe size of the process, so it is useful if processes canspecify the parts of allocated memory that are unused toavoid having to transfer.


Swapping

¥ Process cannot be swapped until completely idle. Exampleof a problem: overlapped DMA input/output. (Thisrequires that you have buffer space allocated in memorywhen the i/o request comes back)

¥ Note that in general swapping in this form (i.e., with thislarge sized granularity) is not very common now.

13


Contiguous Allocation

¥ Divide memory into partitions. Initially consider twopartitions--one for the resident operating system and onefor a user process.

¥ Where should the operating system go--low memory orhigh memory?

¥ Frequently put the operating system in low memorybecause this is where the interrupt vector is located. Alsothis permits the user partition to be expanded withoutrunning into the operating system (a factor when we havemore than one partition or if we run the same binaries ondifferent system configurations).


Memory Partitions

ResidentOperating

System

User Processes(program and data)

0 Low memory

High memory

14


Single Partition Allocation

¥ Initial location of the userÕs process in memory is not 0

¥ The relocation register (base register) points to the firstlocation in the userÕs partition. UserÕs logical addressesare adjusted by the hardware to produce the physicaladdress. (Address binding delayed until execution time.)

¥ Relocation register value is static during programexecution, hence all of the OS must be present (it might beused). Otherwise have to relocate user code/data Òon theflyÓ! In other words we cannot have transient OS code.


Single Partition Allocation

¥ How about memory references passed from theuser process to the OS (for example, blocks ofmemory passed as an argument to a I/O routine)?

¥ The address must be translated from userÕs logicaladdress space to the physical address space. Otherarguments donÕt get translated (e.g., counts).

¥ Hence OS software has to handle thesetranslations.

15


Limit Register

¥ How do we protect the OS from accidentalor intentional interference from userprocesses?

¥ Add a limit register to the address mappingscheme


Limit Register

CPU < + memorylogicaladdresses

yes

no

limitregister

relocationregister

physicaladdresses

trap: addressing error

16


Multiple-Partition Allocation

¥ Goal: allocate memory to multiple processes (which permits rapidswitches, for example)

¥ Simple scheme: fixed-size partition

Ð memory divided into several partitions of fixed size

Ð each partition holds one process

Ð partition becomes free when process terminates; another processpicked from the ready queue gets the free partition

Ð number of partitions bounds the degree of multiprogramming

Ð originally used in the IBM OS/360 operating system (MFT)

Ð No longer used


Multiple-Partition AllocationDynamic Partition

¥ Memory is partitioned dynamicallyÐ Hole: block of available memory

Ð Holes of various size are scattered throughout memory

¥ Process still must occupy contiguous memory

¥ OS keeps a table listing which parts of memory areavailableÐ Allocated partitions

Ð Free partitions (hole)

¥ When a process arrives, the OS searches for a part ofmemory that is large enough to hold the process. Allocatesonly the amount of needed memory.

17



0

2000

operatingsystem p1

500K100

600

operatingsystem

p1

p2 800K

1400

operatingsystem

p1

p2



0

2000

100

600

1400

operatingsystem

p1

p2

p3 400K

1800

operatingsystem

p1

p2

p3

p4 600K canÕtp2 donep4 gets alloc.

operatingsystem

p1

p3

1200p4

18



0

2000

100

600

1400

1800

operatingsystem

p1

p3

1200p4

200K free

200K free

p5 requests 300 K but canÕt obtain it since there is no large enough contiguous block free. Note that there is 400 K free in the system though...



¥ This is an example of externalfragmentation--sufficient amount of freememory to satisfy request but not in acontiguous block.

¥ We used a first fit algorithm this time todecide where to allocate space--what aresome strategies for finding a free hole tofill?

19



¥ first fit algorithm: allocate the first hole that is big enough.Searching can start either at beginning of set of holes orwhere the previous first-fit search ended. We quit whenwe find a free hole that is large enough.

¥ best fit: allocate the smallest hole that is big enough. Mustsearch entire list to find it if you donÕt keep free listordered by size.

¥ worst fit: allocate the largest hole. Again may need tosearch entire free list if not ordered. Produces the largestleftover hole, which may be less likely to create externalfragmentation.



¥ Simulation shows that first-fit and best-fit arebetter than worst-fit for time and storage use.

¥ First-fit is faster than best-fit

¥ First-fit and best-fit are similar in storage use.

¥ 50% rule--up to 1/3 of memory is lost to externalfragmentation in first-fit (N allocated, 1/2 N lost)

20



¥ General comments:

Ð memory protection is necessary to prevent state interactions. Thisis effected by the limit register.

Ð base registers are required to point to the current partition

¥ In general, blocks are allocated in some quantum (e.g., power or 2).No point in leaving space free if you canÕt address it or if it is too smallto be of any use at all. Also there is an expense in keeping track offree space (free list; traversing list; etc.).

¥ This results in lost space--allocated but not required by process

¥ Internal fragmentation: difference between required memory andallocated memory.

¥ Internal fragmentation also results from estimation error andmanagement overhead.


External Fragmentation

¥ External fragmentation can be controlled with compaction.

Ð requires dynamic address binding (have to move piecesaround)

Ð can be quite expensive in time

Ð some schemes try to control expense by only doingcertain kinds of coalescing--e.g., on power of 2boundary. (Topic of a data structures class.)

Ð OS approach can also be to roll out/roll in all processes,returning processes to new addresses--no additionalcode required!

21


Paging

¥ Permit a processÕ memory to be non-contiguous

¥ Allocate physical memory in fixed-size, relatively smallpieces, called frames

Ð allocate frames to process as needed

Ð avoids external fragmentation

Ð causes increased internal fragmentation but attempts tominimize it through the small-sized frames. (Question:what is the characterization of the amount of internalfragmentation?)


Paging

¥ Implementation:Ð Divide (user) logical memory into pages. The

page is the same size as the frame.

Ð Dynamically map between pages and frames

Ð Hardware assistance is required to do thismapping

22


PagingHardware Assistance

CPU p d f d physicalmemory

logicaladdress

physicaladdress

0

...

p

...

f

page table


PagingImplementation

¥ Frames (and hence pages) are of fixed small size.Generally power of 2 large (why?).

¥ logical address of form (p d)

Ð p, page number

Ð d, offset within that page

¥ physical address of form (f d)

Ð f, a frame number

Ð d, offset within that frame

¥ Hardware maps from p to f; d copied across directly

¥ Q: how are f and d combined to get physical address?

23


Paging Example


Frame Table

¥ How does the Operating System know whatframes are in use in physical memory?

¥ Frame tableÐ One entry per physical page frame

Ð Indicates whether frame is free or allocated

Ð If allocated, indicates to which page of whichprocess or processes

24


PagingFragmentation

¥ No external fragmentation since all frames/pagesare the same size. Any page can be mapped to anyframe.

¥ Internal fragmentation especially on last page ofprocess. Average of 50% on last page.

¥ Smaller frames create less fragmentation butincrease number of frames and size of page table.


Paging

¥ We still require that entire process fit into memory

¥ Each process has its own page table

¥ This means that process cannot address outside of its ownaddress space

¥ Sharing frames (reentrant code) is possible

Ð reentrant code--pure code, i.e., non self-modifying codethat never changes during execution.

Ð code frames can be shared among all processes

Ð data separated out with one copy per process

Ð processÕ page tables for code can be pointed to sharedframe

25


Paging

¥ Hardware support required (too slow to search)

Ð simplest: put page table into dedicated high-speedregisters

¥ must keep page table reasonably small because ofexpense of registers (e.g., 256 entries)

¥ but this severely limits the potential size of theprocess and increases expense of context switch!

Ð Alternative: keep page table in memory with a page-table base register (PTBR) pointing to it

¥ reduces context-switch time

¥ doubles number of memory accesses


Paging

¥ Hardware support--continued

Ð Associative registers, also called translation look-asidebuffers (TLBs)

¥ Associative registers: key and value

¥ Keys are compared simultaneously and acorresponding value is returned

¥ Fast but expensive. Size limited to between 8 and2048 for example.

¥ Use some strategy to decide which page table entriesto put into associative memory.

26


Associative Registers

¥ Associative registers--parallel search

¥ Address translation (AÕ, AÕÕ)Ð If AÕ is in associative register, get frame #

Ð Otherwise get frame # from page table inmemory


Paging

¥ Hardware support (continued)

Ð TLBs (continued)

¥ on memory reference first check TLB. If match(hit) then use the value found

¥ If no hit, then need to go to memory.

¥ hit ratio: representation of how effective the processis.

¥ WhatÕs a strategy for loading the TLB? Perhapscache entries on first access. Replace most recentlyused entry or use some form of rotation.

27


Effective Access Time

¥ Associative lookup = e time units

¥ Assume that a memory cycle is 1 time unit¥ Hit ratio = a

Ð Percentage of times that a page number is foundin the associative registers; ratio related tonumber of associative registers

¥ Effective Access Time (EAT)EAT = (1 + e) a + (2 + e)(1 - a) = 2 + e - a


Memory protection

¥ Protection bits associated with frames; keptin page tableÐ Read, write, read-only

Ð Illegal operations result in hardware trap

¥ Valid/invalid bitÐ Illegal addresses, outside of processÕ address

space

Ð Alternately: page-table length register (PTLR)

28


Paging

¥ Note that the OS also needs to keep track of whichframes are being used--too expensive to searchpage tables to find a free frame

¥ Note that in these cases the logical address spaceis less than or equal to the physical address spacein size. Later we will see the opposite case

¥ UserÕs view is one contiguous space. Physicalview is userÕs program scattered throughoutphysical memory.


Multilevel Paging

¥ Support a very large address space (say 232 to 264)by using a two-level paging scheme.

¥ Here the page table itself is also paged.

¥ Otherwise the page table will become very large!

¥ Additional levels can also be introduced ifnecessary

¥ With caching, can decrease the requirement foradditional memory references

29


Two-level Page table scheme


Two-level paging example

32-bit machine with 4K page size 20 bit page number and 12 bit page offset

30


Inverted Page Table

¥ One entry for each real page of memory

¥ Entry consists of the virtual address of the pagestored in that real memory location, withinformation about the process that owns that page

¥ Decreases memory needed to store each pagetable, but increases time needed to search the tablewhen a page reference occurs

¥ Use hash table to limit the search to one (or atmost a few) page table entries


Inverted page table architecture

31


Shared pages

¥ Shared codeÐ One copy of read-only (reentrant) code shared among

processes (i.e., text editors, compilers, windowsystems)

Ð Shared code must appear in same location in the logicaladdress space of all processes

¥ Private code and dataÐ Each process keeps a separate copy of the code and data

Ð The pages for the private code and data can appearanywhere in the logical address space


Shared pages example

32


Segmentation

User's View of Memorymain

program

subroutines

stack

data


Segmentation

33


Segmentation

¥ Separating logical memory into different portions(segments) for different purposes

¥ Permits userÕs partitioning of space to match the logicalview

¥ Each segment has a name (e.g., a unique number) and alength.

¥ Logical address (from userÕs point of view) is a segmentnumber and an offset

¥ Physical address found by looking into a segment table


Segmentationsegment table

¥ Segment table containsÐ limit

¥ logical addresses must fall within the range 0..limit

¥ error if it doesnÕt (out of range)

¥ provides memory access protection

Ð base¥ added to offset to find physical address

34


Segmentation example


Segmentation Hardware

CPU s d

...

...

s

<limit,base>

d<

limit?

base+d

physicalmemoryyes

notrap: addressing error

35


Segmentation

¥ Permits implementation of protectionsappropriate to segmentÕs roleÐ read-only for code

Ð read/write for data

¥ But once again, since segments are variablesize, we have the problem of externalfragmentation.


Paged Segmentation

¥ Paging + Segmentation

¥ Aim: take the advantages of both paging and segmentationsystems

¥ Implementation: treat each segment independently. Eachsegment has a page table.

¥ Logical address now consists of three parts

Ð segment number

Ð page number

Ð offset

36


Paged SegmentationImplementation

¥ Segment table has base address for pagetable

¥ Look up <page number, offset> in pagetable

¥ Get physical address in return

¥ See examples in text for implementations inMULTICS and the Intel 386


Considerations in comparingmemory-management strategies

¥ Hardware support

¥ Performance

¥ Fragmentation

¥ Relocation (when can we relocate)

¥ Swapping (and the cost)

¥ Sharing

¥ Protection

Documents

Silberschatz and Galvin Chapter 8