I've downloaded the newest Nsight Compute profiling tool and I want to use it to benchmark Tensorflow applications. The code I'm using is here. It runs perfectly fine when I execute it and when I benchmark it with nvprof ./mnist.py
it had no problem at all. However, when I try to run it with command sudo ./nv-nsight-cu-cli [path to the file]
I get the following error:
ImportError: libcublas.so.9.0: cannot open shared object file: No such file or directory
I suspect that nv-nsight-cu-cli
somehow didn't recognized the environment variable at all. Is there any fix around?
It appears that GP100 is not supported by the tool at this moment. The answer is found here:
Nsight Compute only supports Pascal (other than GP100) and later GPUs.