Why does how I initialize my large matrices make such a big difference?

Question

Michael Epstein am 13 Mär. 2024

0
Verknüpfen

Direkter Link zu dieser Frage

https://de.mathworks.com/matlabcentral/answers/2094156-why-does-how-i-initialize-my-large-matrices-make-such-a-big-difference

Kommentiert: Michael Epstein am 20 Mär. 2024

Can someone explain the results I'm seeing with the code below? The speed of my code depends significantly on how I initialize some large matrices. I have a pair of large 3D matrices (e.g. 3000 x 3000 x 10) inside of a function that gets called many times. In my actual application it's > 1000 times, but in this toy problem it's only 10x iterations.

Wrapper function:

clc
% Set number of loops to call myfunc.m
N_loop = 10;
% Set dimensions of data
n1 = 3000; n2 = 3000; n3 = 10; % Full
% Fast loop
tic
for ii = 1:N_loop
    myfunc_fast(n1,n2,n3);
end
fprintf('Fast version t = %1.6f sec\n',toc)
Fast version t = 0.015981 sec
% Slow loop
tic
for ii = 1:N_loop
    myfunc_slow(n1,n2,n3);
end
fprintf('Slow version t = %1.6f sec\n',toc)
Slow version t = 4.440633 sec

So there are two versions of this function, a "fast" version where I'm initializing the B matrix using the zeros(n1,n2,n3) call.

 
function myfunc_fast(n1,n2,n3)
number_elements = n1*n2*n3; % Number of elements
A = zeros(n1,n2,n3); % Initialize A
% B = A; % THIS SLOWS DOWN THE CODE
B = zeros(n1,n2,n3); % THIS IS OK!
ind = randi([1,number_elements]);  % Generate a random index
A(ind) = B(ind) + 1; % Do a simple read/write
end

And a "slow" version where I initialize A, and then set B = A. I figured "hey this should be slightly faster since I'm eliminating a call to the zeros() function", but this ends up being waaaay slower.

function myfunc_slow(n1,n2,n3)
number_elements = n1*n2*n3; % Number of elements
A = zeros(n1,n2,n3); % Initialize A
B = A; % THIS SLOWS DOWN THE CODE
% B = zeros(n1,n2,n3); % THIS IS OK!
ind = randi([1,number_elements]); % Generate a random index
A(ind) = B(ind) + 1; % Do a simple read/write
end

The output is:

Fast version t = 0.001108 sec

Slow version t = 2.867316 sec

I'm guessing what's happening is that when I set B = A, internally matlab is "smart enough" to not actually create a new variable and just share the memory space, but then when I modify A later inside myfunc_slow.m, it has to go back and allocate the memory that was once shared between A and B, which ends up taking longer.

Can anyone explain what's going on here and offer any best practices to pass along?

I'm using R2022b on a Windows laptop

Thanks!

0 Kommentare
-2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

Melden Sie sich an, um zu kommentieren.

Melden Sie sich an, um diese Frage zu beantworten.

Answer 1

Yash am 17 Mär. 2024

1
Verknüpfen

Direkter Link zu dieser Antwort

https://de.mathworks.com/matlabcentral/answers/2094156-why-does-how-i-initialize-my-large-matrices-make-such-a-big-difference#answer_1426646

Hi Michael,

You have correctly identified that when you assign an array to a second variable MATLAB does not allocate new memory right away. Instead, it creates a copy of the array reference. However, if you modify any elements of the memory block using either "A" or "B", MATLAB allocates new memory, copies the data into it, and then modifies the created copy. This technique is known as "Copy-On-Write". You can read more about copying arrays and its memory footprint here: https://www.mathworks.com/help/matlab/matlab_prog/memory-allocation.html

"myfunc_fast" has faster execution time as compared to "myfunc_slow" because MATLAB's memory management system is optimized for operations like allocating arrays of zeros. Also "myfunc_fast" does not have additional overheads like "Copy-On-Write" and doesn't need to check integrity of shared data.

Refer here for more info and best practices on performance and memory: https://www.mathworks.com/help/matlab/performance-and-memory.html

1 Kommentar
-1 ältere Kommentare anzeigen-1 ältere Kommentare ausblenden

Michael Epstein am 20 Mär. 2024

Great, thank you!

Melden Sie sich an, um zu kommentieren.

Why does how I initialize my large matrices make such a big difference?

0 Kommentare
-2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

Antworten (1)

1 Kommentar
-1 ältere Kommentare anzeigen-1 ältere Kommentare ausblenden

Siehe auch

Kategorien

Tags

Community Treasure Hunt

Why does how I initialize my large matrices make such a big difference?

0 Kommentare -2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

Antworten (1)

1 Kommentar -1 ältere Kommentare anzeigen-1 ältere Kommentare ausblenden

Siehe auch

Kategorien

Tags

Community Treasure Hunt

0 Kommentare
-2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

1 Kommentar
-1 ältere Kommentare anzeigen-1 ältere Kommentare ausblenden