movsum slower than conv2 on GPU
Hello,
I have this code that compares how movsum and conv2 perform on the GPU.
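A = rand(1e4, 1e4, 'single', 'gpuArray');
n = 10;
f = ones(1, n, 'single', 'gpuArray');   % row filter of ones
gputimeit(@() conv2(f, A))              % full convolution along dimension 2
gputimeit(@() movsum(A, n))             % moving sum along dimension 1 (the default)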
ans =
0.0051
ans =
0.0102
It turns out that conv2 takes about 5 ms, whereas movsum takes about 10 ms.
Shouldn't movsum have a simpler algorithm than conv2?
Amid.
Answers (1)
Joss Knight
on 15 Oct 2019
One might theorize that movsum uses the very same kernels as conv2, but first has to construct the filter of ones, which takes extra time. I'm not saying that's what happens, but it's a theory. By using more generic kernels, MATLAB keeps its footprint small, which matters for download times and for the MATLAB Compiler Runtime.
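The theory can at least be checked at the level of results, since a moving sum gives the same values as a convolution with a filter of ones. The following is a minimal sketch, not a statement about how movsum is implemented internally. It assumes Parallel Computing Toolbox and a supported GPU. The variable names, the small test array B, and the odd window length n = 11 are illustrative choices. With an odd window both functions center it the same way, which is not the case for the even n = 10 in the question. The sketch also aligns the two calls. In the question, conv2(f,A) filters along rows and returns the full, larger output, while movsum(A,n) works down columns and returns an output the size of A.

```matlab
% Sketch under stated assumptions: movsum vs. conv2 with a filter of ones,
% both applied along dimension 1 and both returning an output the size of A.
n = 11;                                   % odd window (illustrative choice)
f = ones(n, 1, 'single', 'gpuArray');     % column filter: sums down each column

% Equivalence check on a small array to keep GPU memory use modest
B  = rand(1000, 1000, 'single', 'gpuArray');
S1 = movsum(B, n);                        % window shrinks at the edges (default)
S2 = conv2(B, f, 'same');                 % zero-padded, central part only
D  = abs(S1 - S2);
maxDiff = gather(max(D(:)));              % expected to be at single-precision rounding level

% Timing with matching dimension and output size
A = rand(1e4, 1e4, 'single', 'gpuArray');
tMovsum = gputimeit(@() movsum(A, n));
tConv2  = gputimeit(@() conv2(A, f, 'same'));
fprintf('max diff %.3g, movsum %.4f s, conv2 %.4f s\n', maxDiff, tMovsum, tConv2);
```

If the two timings still differ with matched settings, that would be consistent with some extra setup cost in movsum, but it would not prove the theory.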