Create an execution profile report for generated CUDA code
gpucoder.profile(
generates an execution profiling report of the CUDA code generated for the design file
func_name
,codegen_inputs
)func_name
. The codegen_inputs
argument specifies the
inputs to the design file. You must install the Embedded Coder® product to generate the profiling report.
Note
The profiling workflow depends on the nvprof
tool from
NVIDIA®. In CUDA® toolkit v10.1, NVIDIA restricts access to performance counters to admin users. To enable GPU
performance counters for all user accounts, see the instructions in https://developer.nvidia.com/nvidia-development-tools-solutions-ERR_NVGPUCTRPERM-permission-issue-performance-counters.
gpucoder.profile(___,
generates an execution profiling report with one or more profiling options specified as a
name-value pair argument.Name,Value
)