How to speed up left divide on GPU?
Hello,
I have two 4-D gpuArrays of the same size, f (NxMxLxK) and f1 (NxMxLxK), and I need to left-divide them column by column. I currently do this with the code below, which has become the bottleneck in my algorithm and accounts for about 95% of the runtime:
beta2 = arrayfun(@(n) f(:,n) \ f1(:,n), 1:numel(f)/size(f,1));
The result beta2 is a vector. Is there a way to speed up this code? I assume the slowdown comes from the fact that arrayfun runs a for loop internally, which moves data between the CPU and the GPU on every iteration.
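One possible way to avoid the per-column loop is sketched below. It relies on the fact that for a single nonzero column a, the least-squares solution a\b is the scalar (a'*b)/(a'*a), so all columns can be handled with element-wise products and a sum along the first dimension. The sizes and random data are hypothetical stand-ins for the arrays in the question, and the sketch assumes that no column of f is entirely zero (the formula would return NaN there, whereas mldivide issues a rank-deficiency warning).

```matlab
% Hypothetical small test data; on a GPU machine, wrap these in gpuArray(...).
N = 5; M = 4; L = 3; K = 2;
f  = rand(N, M, L, K);
f1 = rand(N, M, L, K);

% Original approach: one least-squares solve per column.
betaLoop = arrayfun(@(n) f(:,n) \ f1(:,n), 1:numel(f)/size(f,1));

% Vectorized equivalent: for a nonzero column a, a\b = (a'*b)/(a'*a).
% conj() keeps this valid for complex data and does nothing for real data.
num = sum(conj(f) .* f1, 1);           % size 1 x M x L x K
den = sum(conj(f) .* f,  1);           % size 1 x M x L x K
betaVec = reshape(num ./ den, 1, []);  % 1 x (M*L*K), same ordering as the loop

% Largest deviation between the two results (expected to be at rounding level).
maxDiff = max(abs(betaLoop - betaVec));
```

Because every operation here is element-wise or a reduction, the whole computation should stay on the GPU when f and f1 are gpuArrays; whether it is actually faster on a given problem size is best checked with gputimeit.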