Understanding Parallel Computers

  • Published on
    30-Dec-2015

  • View
    18

  • Download
    0

Embed Size (px)

DESCRIPTION

Understanding Parallel Computers. Parallel Processing EE 613. Balancing Machine Specifics With Portability. How much do we need to know about the machine architecture Performance goal Game developers, embedded systems, and hardware vendors Coding specific to machine Lifetime goal - PowerPoint PPT Presentation

Transcript

  • Understanding Parallel ComputersParallel Processing EE 613

  • Balancing Machine Specifics With PortabilityHow much do we need to know about the machine architecturePerformance goalGame developers, embedded systems, and hardware vendorsCoding specific to machineLifetime goalPortabilityGeneralized code

  • A Look At Six Parallel ComputersChip MultiprocessorsIntel Core DuoAMD Dual Core OpteronHeterogeneous Chip DesignsGPU, FPGA, CellVectorClustersNode processors, RAM, disk, memory not sharedTypical - eight nodes, control processor, switchBlade server includes com ports and cooling fansSupercomputers BlueGene/L (440 PowerPCs)

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence SnyderNow we can put multiple cores on a single chip.

  • CSE524 Parallel Algorithms Lawrence SnyderMESI Modified, Exclusive, Shared, Invalid

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence SnyderMOESI Modified, Owned, Exclusive, Shared, Invalid

  • CSE524 Parallel Algorithms Lawrence SnyderBoth designs implement a coherent shared memory.

  • CSE524 Parallel Algorithms Lawrence Snyder

  • Symmetric Multiprocessor (SMP)Each processor makes memory requests over the common memory busAll cache controllers snoop the memory bus and adjust the tags on their cached values to ensure coherent cache usage.

  • CSE524 Parallel Algorithms Lawrence SnyderExample po & p1 have copies of Block X, but p2 writes to Block X.

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

  • CSE524 Parallel Algorithms Lawrence Snyder

Recommended

View more >