tensorflowgpunvidianvprofnsight-compute

nv-nsight-cu-cli caused Tensorflow to fail


I've downloaded the newest Nsight Compute profiling tool and I want to use it to benchmark Tensorflow applications. The code I'm using is here. It runs perfectly fine when I execute it and when I benchmark it with nvprof ./mnist.py it had no problem at all. However, when I try to run it with command sudo ./nv-nsight-cu-cli [path to the file] I get the following error:

ImportError: libcublas.so.9.0: cannot open shared object file: No such file or directory

I suspect that nv-nsight-cu-cli somehow didn't recognized the environment variable at all. Is there any fix around?


Solution

  • It appears that GP100 is not supported by the tool at this moment. The answer is found here:

    Nsight Compute only supports Pascal (other than GP100) and later GPUs.