cupy

How do I pass in the `--gpu-architecture=compute_89` into a NVRTC kernel with CuPy?


cp.RawModule(code=kernel, backend='nvrtc', options=('--gpu-architecture=compute_89',))

When I try to do it like this, I get an error that the option has already been passed in. Do I have to build the library from source with the correct architecture in mind instead?


Solution

  • CuPy automatically adds --gpu-architecture= option for the currently-active device, so you don't have to add it by yourself. Note that compute_89 support requires CUDA 11.8 or later.