CUDA ptxas fatal : Unresolved extern function '_Z22mwGetGlobalThreadIndexv'
20 Ansichten (letzte 30 Tage)
Ältere Kommentare anzeigen
Hi
I used gpucoder to generate the cuda file corresponding to original function. After that i built a .PTX file from my cuda file and now i want to create CUDAkernel using parallel.gpu.CUDAKernel to speedup my function. By using k = parallel.gpu.CUDAKernel('myfunction.ptx','float *, const double *','_Z15myfunction_kernel1dP15emxArray_real_T'), I am getting an error below:
Error using parallel.gpu.CUDAKernel
An error occurred during PTX compilation of <image>.
The information log was:
The error log was:
ptxas fatal : Unresolved extern function '_Z22mwGetGlobalThreadIndexv'
The CUDA error code was: CUDA_ERROR_INVALID_PTX.
How to get rid of this error? Any help will be much appreciated. Regards
0 Kommentare
Antworten (1)
Linda Koletsou Soulti
am 1 Jul. 2021
Hello Riffat,
it seems that the kernel calls into a helper function that it was not compiled with the kernel.
The following example may be helpful:
Cheers,
Linda
3 Kommentare
Linda Koletsou Soulti
am 6 Jul. 2021
Hello Riffat,
in the folder of the generated code there should be a file called MWCudaDimUtility.cu. Your device code calls into a function called mwGetGlobalThreadIndex inside this file. There are a couple of ways to resolve the issue caused by this dependency. I believe the easiest would be to copy the contents of the MWCudaDimUtility.cu file inside the file of your code, before the definition of your function. Then build this file to ptx.
Cheers,
Linda
Siehe auch
Kategorien
Mehr zu GPU Computing finden Sie in Help Center und File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!