Example input = alexnet_cudnn.log
Example output = alexnet_cudnn.csv
CUBLAS_LOGINFO_DBG=1 CUBLAS_LOGDEST_DBG=cublas.log prog
CUDNN_LOGINFO_DBG=1 CUDNN_LOGDEST_DBG=cudnn.log prog
$ CUBLAS_LOGINFO_DBG=1 CUBLAS_LOGDEST_DBG=cublas.log CUDNN_LOGINFO_DBG=1 CUDNN_LOGDEST_DBG=cudnn.log prog
https://github.com/intel/mkl-dnn/blob/master/doc/performance_considerations/verbose.md
https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#env-vars
Driver-Based Profiler (these variables have no impact on the Visual Profiler or the command line profiler nvprof)
CUDA_DEVICE
Integer (default is 0) Specifies the index of the device to profile.COMPUTE_PROFILE 0
or 1 (default is 0) Disables profiling (when set to 0) or enables profiling (when set to 1).COMPUTE_PROFILE_CONFIG
Path Specifies the configuration file to set profiling options and select performance counters.COMPUTE_PROFILE_LOG
Path Specifies the file used to save the profiling output. In case of multiple contexts, use '%d' in the COMPUTE_PROFILE_LOG to generate separate output files for each context - with '%d' substituted by the context number.COMPUTE_PROFILE_CSV
0 or 1 (default is 0) When set to 1, the output will be in comma-separated format.