Conference title1 A New Methodology for Studying Realistic Processors in Computer Science Degrees...
Preview:
Citation preview
- Slide 1
- Conference title1 A New Methodology for Studying Realistic
Processors in Computer Science Degrees Crispn Gmez, Mara E. Gmez y
Julio Sahuquillo DISCA. Technical University of Valencia DSI.
University of Castilla-La Mancha
- Slide 2
- 2 Outline Motivation Simulator Proposed Methodology Case Study
Conclusions
- Slide 3
- 3 Motivation Astonishingly quick evolution of processor
architecture: Teaching should cover from the basics to the most
realistic up-to-date concepts In-Order Execution Superscalar
Out-Of-Order Execution ManycoreMulticore POWER
- Slide 4
- 4 Motivation Current designs imply a big complexity
Out-Of-order complex cores Multi-level memory hierarchy On-chip
Interconnection network
- Slide 5
- 5 Outline Motivation Simulator Proposed Methodology Case Study
Conclusions
- Slide 6
- 6 Simulator Multi2Sim: multicore and multithreaded X86 binary
compatibility Application-only Free simulator: Open source project
http://www.multi2sim.org/http://www.multi2sim.org/ Widely used on
research Academia Industry
- Slide 7
- 7 Simulator Cores CPU: 6-staged pipelined processors,
out-of-order execution Execution stage maybe customized to be
multicycle. Speculative execution Three mutithreading paradigms are
supported: Coarse grain, fine grain, simultaneous multithreading
All microarchitectural parameters are customizable Type of branch
predictor Issue width Etc. GPUs
- Slide 8
- 8 Simulator Memory Hierarchy Complete memory hierarchy
Coherency: MOESI Flexible hierarchy: # of memory levels and memory
structures in each level Each memory structure is fully
customizable #Sets #Ways Block size
- Slide 9
- 9 Simulator Interconnection Network Interconnection network:
Any topology can be implemented Forwarding tables routing (any
routing algorithm can be used) Each network element is fully
customizable Buffer size at switches Link bandwidth
- Slide 10
- 10 Outline Motivation Simulator Proposed Methodology Case Study
Conclusions
- Slide 11
- 11 Proposed Methodology Tries to motivate the students into
processor architecture Realistic examples Increasing difficulty
levels Shared use in several courses Develop basic skills for final
projects, MS thesis or Ph.D thesis Based on a progressive
interaction with Multi2Sim 4 learning phases with increasing
difficulty due to the simulators complexity
- Slide 12
- 12 Proposed Methodology 1 st phase: Simulation parameters
modifications ( at labs) Configure the system components Launch
simulations Analyze the effects of the parameters on the system
performance
- Slide 13
- 13 Proposed Methodology 2 nd phase: Modify small pieces of code
Very small and bounded fragments of source code Completely guided
by the instructors Modification of a provided baseline Examples:
Branch predictor, prefetch mechanisms, Final work of the
course
- Slide 14
- 14 Proposed Methodology 3 rd phase: Implementation complete
functionalities Consolidated simulator skills Development of
functionalities from scratch Examples: Memory controller,
Stream-buffers based prefetcher, Final project or MS thesis Some
works have been published in top level conferences 4 th phase:
Complete autonomy The students are in a privileged position to
start a Ph.D.
- Slide 15
- 15 Outline Motivation Simulator Proposed Methodology Case Study
Conclusions
- Slide 16
- 16 Case study The methodology has been implanted at the UPV in
two courses Advanced Processor Architectures Computer Science
Degree and Master Degree Networks on-chip Master Degree We have
defined several learning stages with the simulator Baseline system
modeling Execution of standard benchmark suites Prefetching
mechanisms implementation
- Slide 17
- 17 Case study Baseline system modeling
- Slide 18
- 18 Case study Baseline system modeling Detailed explanation of
the configuration for Memory Cores Interconnection network Sample
configuration files are used
- Slide 19
- 19 Case study Benchmark Execution Parallel (Splash 2)
Multiprogrammed mixes (Spec) Performance study (IPC, Execution
Time, Network latency) varying L2 block size
- Slide 20
- 20 Case study Prefetching mechanisms implementation Base simple
prefetching mechanism provided OBL (One Block Look-ahead) on L2
miss Modification to this mechanism N-block sequential N-block with
regular stride
- Slide 21
- 21 Case study Results This year 2 final projects have been
performed in memory controller and prefetching Results from these
projects are expected to be sent to first level international
conferences These projects are expected to be evolved into MS
thesis Results projection is based on the experiences from previous
year, in which results from the projects were accepted in PACT and
IPDPS conferences
- Slide 22
- 22 Outline Motivation Simulator Proposed Methodology Case Study
Conclusions
- Slide 23
- 23 Conclusions We have reduced the gap between theoretical
contents on Computer Architecture topics and real processors By
using a well-established CMP-simulator in the international
research community Methodology based on an increasing degree of
difficulty First steps are very guided by instructors Students are
encouraged to go ahead to more complex implementations Methodology
+ simulator = good platform for future works as the range of design
choices is very wide
- Slide 24
- Conference title24 Thanks you for your attention
Crispin.Gomez@uclm.es