TotalView User Guide : Part V: Debugging : Using the CUDA Debugger : TotalView CUDA Debugging Tutorial : GPU Error Reporting

GPU Error Reporting
By default, TotalView reports GPU exception errors as "signals". Continuing the application after these errors can lead to application termination or unpredictable results.
Table 2 lists reported errors, according to these platforms and settings:
*
Exception codes Lane Illegal Address and Lane Misaligned Address are detected using all supported SDK versions when CUDA memcheck is enabled, on supported Tesla and Fermi hardware.
*
All other CUDA errors are detected only for GPUs with sm_20 or higher (for example Fermi) running SDK 3.1 or higher. It is not necessary to enable CUDA memcheck to detect these errors.
 
“Device Unknown Exception”
An application-caused global GPU error that does not match any of the listed error codes below.
Precise (Requires memcheck on)
“Device Hardware Stack Overflow”
The application has triggered a global hardware stack overflow, usually caused by large amounts of divergence in the presence of function calls.
“Warp Illegal Instruction”
A thread within a warp has accessed an address that is outside the valid range of local or shared memory regions.
A thread within a warp has accessed an incorrectly aligned address in the local or shared memory segments.
“Warp Invalid Address Space”
A thread within a warp has executed an instruction that attempts to access a memory space not permitted for that instruction.
“Warp Hardware Stack Overflow”
A thread has accessed an illegal (out of bounds) global address. For increased precision, enable memcheck.
Precise (Requires memcheck on)

Rogue Wave Software, Inc.
Voice: (303) 473-9118
rwonlinedocs@roguewave.com