GPU Programming with CUDA – CUDA 5 and 6
Paul Richmond
GPUComputing@Sheffield
http://gpucomputing.sites.sheffield.ac.uk/
Overview

• Dynamic Parallelism (CUDA 5+)
• GPU Object Linking (CUDA 5+)
• Unified Memory (CUDA 6+)
• Other Developer Tools
Dynamic Parallelism

• Before CUDA 5, kernels could only be launched from the host
• Limited ability to write recursive algorithms

• Dynamic Parallelism allows kernels to be launched from the device
• Improved load balancing
• Deep recursion on the GPU
[Diagram: the CPU launches Kernel A on the GPU; with Dynamic Parallelism, kernels B, C and D are then launched directly from the device without returning to the CPU]
An Example

//Host Code
...
A<<<...>>>(data);
B<<<...>>>(data);
C<<<...>>>(data);

//Kernel Code
__global__ void vectorAdd(float *data)
{
    do_stuff(data);
    X<<<...>>>(data);
    X<<<...>>>(data);
    X<<<...>>>(data);
    do_more_stuff(data);
}
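Dynamic Parallelism also changes how code is built: it needs relocatable device code and the device runtime library at link time. A minimal build sketch, assuming the example above is saved as dynpar.cu (a hypothetical filename):

```shell
# Dynamic Parallelism requires compute capability 3.5+ and relocatable device code
nvcc -arch=sm_35 -rdc=true dynpar.cu -lcudadevrt -o dynpar
```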
GPU Object Linking

• CUDA 4 required all device code for a kernel to be in a single source file
• No linking of compiled device code

• CUDA 5.0+ allows separately compiled object files to be linked
• Kernels and host code can be built independently
[Diagram: main.cpp together with a.cu, b.cu and c.cu; each .cu file compiles to its own object file (a.o, b.o, c.o), and the objects are linked with the host code into program.exe]
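The build flow in the diagram can be sketched with nvcc's separate-compilation support (file names taken from the slide; -dc compiles to an object with relocatable device code):

```shell
# Compile each source to an object file containing relocatable device code
nvcc -arch=sm_35 -dc a.cu b.cu c.cu
# Link the device objects and the host code into one executable
nvcc -arch=sm_35 a.o b.o c.o main.cpp -o program
```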
GPU Object Linking

• Objects can also be built into static libraries
• Shared by different sources
• Much better code reuse
• Reduces compilation time
• Closed-source device libraries
[Diagram: a.cu and b.cu compile to a.o and b.o, which are combined into the static library ab.culib; ab.culib is then linked with main.cpp into program.exe and with main2.cpp into program2.exe; further sources (foo.cu, bar.cu, ...) can be added in the same way]
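A sketch of the library flow above, assuming the slide's file names; nvcc's -lib option packs the device objects into a static library that both host programs link against:

```shell
# Build the device objects and archive them into a static library
nvcc -arch=sm_35 -dc a.cu b.cu
nvcc -arch=sm_35 -lib a.o b.o -o ab.a
# Link the same library into two different programs
nvcc -arch=sm_35 main.cpp ab.a -o program
nvcc -arch=sm_35 main2.cpp ab.a -o program2
```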
Unified Memory

• The developer's view is that the GPU and CPU have separate memories
• Memory must be explicitly copied
• Deep copies are required for complex data structures

• Unified Memory changes that view
• A single pointer to data, accessible anywhere
• Simpler code porting
[Diagram: without Unified Memory, the CPU uses system memory and the GPU uses its own memory; with Unified Memory, the CPU and GPU share a single unified memory space]
Unified Memory Example

// CPU-only version
void sortfile(FILE *fp, int N) {
    char *data;
    data = (char *)malloc(N);

    fread(data, 1, N, fp);
    qsort(data, N, 1, compare);

    use_data(data);
    free(data);
}

// Unified Memory version
void sortfile(FILE *fp, int N) {
    char *data;
    cudaMallocManaged(&data, N);

    fread(data, 1, N, fp);
    qsort(data, N, 1, compare);
    cudaDeviceSynchronize();

    use_data(data);
    cudaFree(data);
}
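A minimal self-contained sketch of the single-pointer model (the kernel name and sizes are illustrative, not from the slides): the same pointer returned by cudaMallocManaged is written by the CPU, modified by the GPU, then read again by the CPU after a synchronise, with no explicit cudaMemcpy anywhere.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel: double each element in place
__global__ void doubleElements(int *data, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= 2;
}

int main()
{
    const int n = 256;
    int *data;
    cudaMallocManaged(&data, n * sizeof(int)); // one pointer, valid on CPU and GPU

    for (int i = 0; i < n; ++i) data[i] = i;   // initialise directly on the CPU

    doubleElements<<<1, n>>>(data, n);         // pass the same pointer to the GPU
    cudaDeviceSynchronize();                   // wait before the CPU touches it again

    printf("data[10] = %d\n", data[10]);
    cudaFree(data);
    return 0;
}
```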
Other Developer Tools

• XT and Drop-in libraries
• cuFFT and cuBLAS optimised for multiple GPUs (on the same node)

• GPUDirect
• Direct transfer between GPUs (cutting out the host)
• Support for direct transfer via InfiniBand (over a network)

• Developer Tools
• Remote development using Nsight Eclipse
• Enhanced Visual Profiler