NextGen TotalView for HPC User Guide : PART III Using the CUDA Debugger : Chapter 14 CUDA Debugging Tutorial : Enabling CUDA Memory Checker Feature
Enabling CUDA Memory Checker Feature
You can detect global memory addressing violations and misaligned global memory accesses by enabling the CUDA Memory Checker feature.
To enable the feature, use one of the following:
Pass the -cuda_memcheck option to the totalview command, for example:
totalview -cuda_memcheck
Set the TV::cuda_memcheck CLI state variable to true. For example:
dset TV::cuda_memcheck true
Note that global memory violations and misaligned global memory accesses will be detected only while the CUDA thread is running. Detection will not happen when single-stepping the CUDA thread.