16

NVIDIA Visual Profiler & CUDA-MEMCHECK

NVIDIA Visual Profiler - uni-graz.at · NVIDIA Visual Profiler & CUDA-MEMCHECK . Visual Profiler – Overview •Included in CUDA Toolkit •Visualize and optimize performance of

Download PDF Report

Upload
others
View
49
Download
0

Embed Size (px)

Citation preview

Page 1: NVIDIA Visual Profiler - uni-graz.at · NVIDIA Visual Profiler & CUDA-MEMCHECK . Visual Profiler – Overview •Included in CUDA Toolkit •Visualize and optimize performance of

NVIDIA Visual Profiler &

CUDA-MEMCHECK

Page 2: NVIDIA Visual Profiler - uni-graz.at · NVIDIA Visual Profiler & CUDA-MEMCHECK . Visual Profiler – Overview •Included in CUDA Toolkit •Visualize and optimize performance of

Visual Profiler – Overview

• Included in CUDA Toolkit

• Visualize and optimize performance of a CUDA application

• Shows timeline on CPU and GPU

• nvvp (GUI)

• nvprof (Terminal)

• Two types: – Executable session

– Imported session (importing data generated by nvprof)

• Generate pdf report

Page 3: NVIDIA Visual Profiler - uni-graz.at · NVIDIA Visual Profiler & CUDA-MEMCHECK . Visual Profiler – Overview •Included in CUDA Toolkit •Visualize and optimize performance of

Getting started

Page 4: NVIDIA Visual Profiler - uni-graz.at · NVIDIA Visual Profiler & CUDA-MEMCHECK . Visual Profiler – Overview •Included in CUDA Toolkit •Visualize and optimize performance of

Timeline View

• CPU activity

• GPU activity

• Shows start & end of

– Threads

– Kernels

– Memcpy

– …

• Zoom, filter, reorder, …

Page 5: NVIDIA Visual Profiler - uni-graz.at · NVIDIA Visual Profiler & CUDA-MEMCHECK . Visual Profiler – Overview •Included in CUDA Toolkit •Visualize and optimize performance of

Page 6: NVIDIA Visual Profiler - uni-graz.at · NVIDIA Visual Profiler & CUDA-MEMCHECK . Visual Profiler – Overview •Included in CUDA Toolkit •Visualize and optimize performance of

Page 7: NVIDIA Visual Profiler - uni-graz.at · NVIDIA Visual Profiler & CUDA-MEMCHECK . Visual Profiler – Overview •Included in CUDA Toolkit •Visualize and optimize performance of

Analysis View

• Guided or unguided – For unguided compile with SET(LOCAL_CUDA_NVCC_FLAGS ${LOCAL_CUDA_NVCC_FLAGS] –lineinfo)

• CUDA Application Analysis – Application‘s overall GPU utilization

– Kernel performance (orders kernels according to optimization importance based on execution time and achieved occupancy)

• Performance-Critical Kernels – Detailed analysis of a selected kernel

Page 8: NVIDIA Visual Profiler - uni-graz.at · NVIDIA Visual Profiler & CUDA-MEMCHECK . Visual Profiler – Overview •Included in CUDA Toolkit •Visualize and optimize performance of

Page 9: NVIDIA Visual Profiler - uni-graz.at · NVIDIA Visual Profiler & CUDA-MEMCHECK . Visual Profiler – Overview •Included in CUDA Toolkit •Visualize and optimize performance of

Page 10: NVIDIA Visual Profiler - uni-graz.at · NVIDIA Visual Profiler & CUDA-MEMCHECK . Visual Profiler – Overview •Included in CUDA Toolkit •Visualize and optimize performance of

• Compute, Bandwith, or Latency Bound

• Instruction and memory latency

– Examine occupancy

How many warps the kernel has active on the GPU, relative to the maximum number of warps supported by GPU

– Examine stall reasons

Could give insight why latency is still an issue for the kernel

Page 11: NVIDIA Visual Profiler - uni-graz.at · NVIDIA Visual Profiler & CUDA-MEMCHECK . Visual Profiler – Overview •Included in CUDA Toolkit •Visualize and optimize performance of

Page 12: NVIDIA Visual Profiler - uni-graz.at · NVIDIA Visual Profiler & CUDA-MEMCHECK . Visual Profiler – Overview •Included in CUDA Toolkit •Visualize and optimize performance of

Page 13: NVIDIA Visual Profiler - uni-graz.at · NVIDIA Visual Profiler & CUDA-MEMCHECK . Visual Profiler – Overview •Included in CUDA Toolkit •Visualize and optimize performance of

• Compute resources

GPU compute resources could limit the performance of a kernel, if they are insufficient or poorly utilized

Page 14: NVIDIA Visual Profiler - uni-graz.at · NVIDIA Visual Profiler & CUDA-MEMCHECK . Visual Profiler – Overview •Included in CUDA Toolkit •Visualize and optimize performance of

CUDA-MEMCHECK

• detects memory access errors

• Run time error detection

• Included in CUDA Toolkit

• Getting started:

– cuda-memcheck executable -options

best case:

Page 15: NVIDIA Visual Profiler - uni-graz.at · NVIDIA Visual Profiler & CUDA-MEMCHECK . Visual Profiler – Overview •Included in CUDA Toolkit •Visualize and optimize performance of

Supported error detection

• Memory access error Errors due to out of bound or misaligned access to memory by global,

local, shared or global atomic access

• Hardware exception Errors reported by hardware error reporting mechanism

• Malloc/Free errors Errors due to incorrect use of malloc or free

• CUDA API errors Failure of CUDA API call

• cudaMalloc memory leaks Allocations of device memory which have not been freed

• Device heap memory leaks Allocations of device memory in device code which have not been freed

Page 16: NVIDIA Visual Profiler - uni-graz.at · NVIDIA Visual Profiler & CUDA-MEMCHECK . Visual Profiler – Overview •Included in CUDA Toolkit •Visualize and optimize performance of

Example

__global__ : for device global memory __shared__ : for per block shared memory __local__ : for per thread local memory Information about type of access (read / write) Size of access in bytes Source file and line number Thread indices and block indices Memory address being accessed and type of access error

COMPUTE VISUAL PROFILER - Nvidiadeveloper.download.nvidia.com/compute/DevZone/docs/html/... · 2011. 5. 25. · COMPUTE VISUAL PROFILER FILES AND SETTINGS Profiling is automatically

COMPUTE VISUAL PROFILER - Nvidiadeveloper.download.nvidia.com/compute/DevZone/docs/html/... · 2011. 5. 25. · COMPUTE VISUAL PROFILER FILES AND SETTINGS Profiling is automatically

Documents

CUDA Without Cuda (CUDA Libraries) - Nvidiadeveloper.download.nvidia.com/CUDA/training/ntrotoCUDALibraries.pdf · CUDA Without Cuda (CUDA Libraries) GPU Computing Webinar 7/16/2011

CUDA Without Cuda (CUDA Libraries) - Nvidiadeveloper.download.nvidia.com/CUDA/training/ntrotoCUDALibraries.pdf · CUDA Without Cuda (CUDA Libraries) GPU Computing Webinar 7/16/2011

Documents

CUDA Flux: A Lightweight Instruction Profiler for CUDA ......Currently Available Tools for Profiling Hardware performance-counter based: nvprof • CUDA API trace • Light to heavy

CUDA Flux: A Lightweight Instruction Profiler for CUDA ......Currently Available Tools for Profiling Hardware performance-counter based: nvprof • CUDA API trace • Light to heavy

Documents

Future Directions for CUDA...Device API 1000+ new NVPP functions cuBLAS cuFFT Thrust cuRand cuSparse LLVM New Visual Profiler GPU-Aware MPI C++ new/delete Virtual functions Templates

Future Directions for CUDA...Device API 1000+ new NVPP functions cuBLAS cuFFT Thrust cuRand cuSparse LLVM New Visual Profiler GPU-Aware MPI C++ new/delete Virtual functions Templates

Documents

CUDA & OpenCV - Cybernetics · Presentation : OpenCV 2. 2 or 2.3 Set WITH_CUDA flag in Cmake Requirement : CUDA toolkit 4.0(OpenCV 2.3) CUDA toolkit 3.2 (OpenCV 2.2) G++ or Visual

CUDA & OpenCV - Cybernetics · Presentation : OpenCV 2. 2 or 2.3 Set WITH_CUDA flag in Cmake Requirement : CUDA toolkit 4.0(OpenCV 2.3) CUDA toolkit 3.2 (OpenCV 2.2) G++ or Visual

Documents

Debugging Experience with CUDA-GDB and CUDA ......2 CUDA Debugging Solutions CUDA-GDB (Linux & Mac) CUDA-MEMCHECK (Linux, Mac, & Windows) NVIDIA® Nsight Eclipse Edition (NEW!)Visual

Debugging Experience with CUDA-GDB and CUDA ......2 CUDA Debugging Solutions CUDA-GDB (Linux & Mac) CUDA-MEMCHECK (Linux, Mac, & Windows) NVIDIA® Nsight Eclipse Edition (NEW!)Visual

Documents

Profiler User's Guide - Nvidia...The profiling tools contain below changes as part of the CUDA Toolkit 9.1 release. ‣ The Visual Profiler shows the breakdown of the time spent on

Profiler User's Guide - Nvidia...The profiling tools contain below changes as part of the CUDA Toolkit 9.1 release. ‣ The Visual Profiler shows the breakdown of the time spent on

Documents

Visual Profiler - iserd.org.il · Visual Profiler? State of the art Deep learning ... that enables it to detect, interpret and classify objects of interest in a variety of visual

Visual Profiler - iserd.org.il · Visual Profiler? State of the art Deep learning ... that enables it to detect, interpret and classify objects of interest in a variety of visual

Documents

CUDA Como fazer?. CUDA O CUDA? O Visual C++. Integração com o Visual C++. Compilando (OpenGL). Exemplos de código

CUDA Como fazer?. CUDA O CUDA? O Visual C++. Integração com o Visual C++. Compilando (OpenGL). Exemplos de código

Documents

CUDA-GDB (NVIDIA CUDA Debugger)

CUDA-GDB (NVIDIA CUDA Debugger)

Documents

CUDA: NEW FEATURES AND BEYOND - NVIDIA · Upcoming limited decoupling of display driver and CUDA release for ease of deployment ... Nsight Visual Studio/Eclipse Edition –editor,

CUDA: NEW FEATURES AND BEYOND - NVIDIA · Upcoming limited decoupling of display driver and CUDA release for ease of deployment ... Nsight Visual Studio/Eclipse Edition –editor,

Documents

Profiler User's Guide - Colby College · 2014-04-01 · Profiler User's Guide DU-05982-001_v5.5 | 2 To limit profiling to a region of your application, CUDA provides functions to

Profiler User's Guide - Colby College · 2014-04-01 · Profiler User's Guide DU-05982-001_v5.5 | 2 To limit profiling to a region of your application, CUDA provides functions to

Documents

EEEntropic Profiler U Entropic Profiler Untropic Profiler Us ssser …sels.tecnico.ulisboa.pt/ep/UserManual.pdf · 2017. 2. 26. · The “Entropic Profiler” tool is available through

EEEntropic Profiler U Entropic Profiler Untropic Profiler Us ssser …sels.tecnico.ulisboa.pt/ep/UserManual.pdf · 2017. 2. 26. · The “Entropic Profiler” tool is available through

Documents

Webinar: The Visual Query Profiler and MongoDB Compass

Webinar: The Visual Query Profiler and MongoDB Compass

Technology

Jared Law CUDA: Super-Computing Made Easy. Jared Law NVidia CUDA: Why CUDA? What is CUDA? Where/how is CUDA being used? What does CUDA mean to programmers?

Jared Law CUDA: Super-Computing Made Easy. Jared Law NVidia CUDA: Why CUDA? What is CUDA? Where/how is CUDA being used? What does CUDA mean to programmers?

Documents

SENTECH LEVEL PROFILER... The Sentech Level Profiler allows operators to "see through the walls" of separators and vessels Floaters, visual gauges, DP and guided wave radar level transmitters

SENTECH LEVEL PROFILER... The Sentech Level Profiler allows operators to "see through the walls" of separators and vessels Floaters, visual gauges, DP and guided wave radar level transmitters

Documents

Windowsで始めるCUDA入門 - on-demand.gputechconf.comon-demand.gputechconf.com/gtc/2013/jp/sessions/8001.pdf · 1. Nsight Visual Studio Edition Visual StudioでのCUDA開発 —ビルド・デバッグ・プロファイル

Windowsで始めるCUDA入門 - on-demand.gputechconf.comon-demand.gputechconf.com/gtc/2013/jp/sessions/8001.pdf · 1. Nsight Visual Studio Edition Visual StudioでのCUDA開発 —ビルド・デバッグ・プロファイル

Documents

NVIDIA CUDA Installation Guide for Microsoft Windows...Visual Studio Community 2015 YES NO MSVC Version 1800 Visual Studio 2013 12.0 YES YES MSVC Version 1700 Visual Studio 2012 11.0

NVIDIA CUDA Installation Guide for Microsoft Windows...Visual Studio Community 2015 YES NO MSVC Version 1800 Visual Studio 2013 12.0 YES YES MSVC Version 1700 Visual Studio 2012 11.0

Documents

Profiler User's Guide - Tsudakarel.tsuda.ac.jp/lec/cuda/doc_v9_0/pdf/CUDA_Profiler_Users_Guide.pdfProfiler User's Guide DU-05982-001_v9.0 | iv PROFILING OVERVIEW This document describes

Profiler User's Guide - Tsudakarel.tsuda.ac.jp/lec/cuda/doc_v9_0/pdf/CUDA_Profiler_Users_Guide.pdfProfiler User's Guide DU-05982-001_v9.0 | iv PROFILING OVERVIEW This document describes

Documents

Programming with CUDA · Programming with CUDA ... CUDA C programming guide – CUDA Programming 4 …

Programming with CUDA · Programming with CUDA ... CUDA C programming guide – CUDA Programming 4 …

Documents

Profiler User's Guide · 2019. 4. 29. · Preparing An Application For Profiling Profiler User's Guide Version 2019 | 2 To limit profiling to a region of your application, CUDA provides

Profiler User's Guide · 2019. 4. 29. · Preparing An Application For Profiling Profiler User's Guide Version 2019 | 2 To limit profiling to a region of your application, CUDA provides

Documents

COMPUTE VISUAL PROFILER - Nvidiadeveloper.download.nvidia.com/compute/cuda/3_2...A group of sessions is called a project. Compute Visual Profiler saves the following files: Compute

COMPUTE VISUAL PROFILER - Nvidiadeveloper.download.nvidia.com/compute/cuda/3_2...A group of sessions is called a project. Compute Visual Profiler saves the following files: Compute

Documents

Profiler - Shaping The Learnershapingthelearner.com/images/...Assessment-Profiler... · DO-IT PROFILER Issue 1 Do-IT Profiler Sept 2016 TYPE TAGLINE HERE IN THIS ISSUE Our school

Profiler - Shaping The Learnershapingthelearner.com/images/...Assessment-Profiler... · DO-IT PROFILER Issue 1 Do-IT Profiler Sept 2016 TYPE TAGLINE HERE IN THIS ISSUE Our school

Documents

LEAPS IN VISUAL COMPUTING - NVIDIA Newsroom · 2008 150,000 CUDA Downloads 4,000 Academic Papers 60 Universities Teaching 77 Supercomputing Teraflops 6,000 Tesla GPUs 27 CUDA Apps

LEAPS IN VISUAL COMPUTING - NVIDIA Newsroom · 2008 150,000 CUDA Downloads 4,000 Academic Papers 60 Universities Teaching 77 Supercomputing Teraflops 6,000 Tesla GPUs 27 CUDA Apps

Documents

Parallel&Programming& Paradigms&liacs.leidenuniv.nl/~rietveldkfd/courses/parco2015/Lecture_11.pdf · Profiling CUDA-GDB debugger NVIDIA Visual Profiler &&&&Development &&&&Environment

Parallel&Programming& Paradigms&liacs.leidenuniv.nl/~rietveldkfd/courses/parco2015/Lecture_11.pdf · Profiling CUDA-GDB debugger NVIDIA Visual Profiler &&&&Development &&&&Environment

Documents

CUDA Lecture 8 CUDA Memories

CUDA Lecture 8 CUDA Memories

Documents

CUDA-Accelerated Visual SLAM For UAVs€¦ · CUDA-Accelerated Visual SLAM For UAVs by Donald Bourque A Thesis Submitted to the Faculty of the WORCESTER POLYTECHNIC INSTITUTE in partial

CUDA-Accelerated Visual SLAM For UAVs€¦ · CUDA-Accelerated Visual SLAM For UAVs by Donald Bourque A Thesis Submitted to the Faculty of the WORCESTER POLYTECHNIC INSTITUTE in partial

Documents

Introduction to CUDA Programming Profiler, Assembly, and Floating-Point Andreas Moshovos Winter 2009 Some material from: Wen-Mei Hwu and David Kirk NVIDIA

Introduction to CUDA Programming Profiler, Assembly, and Floating-Point Andreas Moshovos Winter 2009 Some material from: Wen-Mei Hwu and David Kirk NVIDIA

Documents

CUDA Profiler Users Guide

CUDA Profiler Users Guide

Documents

Agilent Mass Profiler GC/MS · Agilent Mass Profiler GC/MS Agilent Mass Profiler Mass Profiler Key word Mass Profiler Authors . 2 1. GC/MS MassHunter Molecular Feature Extraction

Agilent Mass Profiler GC/MS · Agilent Mass Profiler GC/MS Agilent Mass Profiler Mass Profiler Key word Mass Profiler Authors . 2 1. GC/MS MassHunter Molecular Feature Extraction

Documents