Main Content

How MATLAB Allocates Memory

This topic provides information on how MATLAB® allocates memory when working with variables. This information, like any information on how MATLAB treats data internally, is subject to change in future releases.

Memory Allocation for Arrays

When you assign a numeric or character array to a variable, MATLAB allocates a contiguous block of memory and stores the array data in that block. MATLAB also stores information about the array data, such as its class and dimensions, in a small, separate block of memory called a header. For most arrays, the memory required to store the header is insignificant. However, there could be some advantage to storing large data sets in a small number of large arrays as opposed to a large number of small arrays. This is because fewer arrays require fewer array headers.

If you add new elements to an existing array, MATLAB expands the array in memory in a way that keeps its storage contiguous. This usually requires finding a new block of memory large enough to hold the expanded array. MATLAB then copies the contents of the array from its original location to this new block in memory, adds the new elements to the array in this block, and frees up the original array location in memory.

If you remove elements from an existing array, MATLAB keeps the memory storage contiguous by removing the deleted elements, and then compacting its storage in the original memory location.

Copying Arrays

When you assign an array to a second variable (for instance, when you execute B = A), MATLAB does not allocate new memory right away. Instead, it creates a copy of the array reference. As long as you do not modify the contents of the memory block being referenced by A and B, there is no need to store more than one copy of data. However, if you modify any elements of the memory block using either A or B, MATLAB allocates new memory, copies the data into it, and then modifies the created copy.

On Windows® systems, the memory function enables you to inspect the memory details. To see how copying arrays affects memory usage on your Windows system, create the function memUsed in a file in your current folder. The function calls memory to return the amount of memory used by your MATLAB process in megabytes.

function y = memUsed
usr = memory;
y = usr.MemUsedMATLAB/1e6;

Call memUsed to display the current memory usage.

format shortG
memUsed
ans = 
       3966.1

Create a 2000-by-2000 numeric array and observe the change in memory usage. The array uses about 32 MB of memory.

A = magic(2000);
memUsed
ans = 
       3998.1

Make a copy of A in B. Because there is no need to have two copies of the array data, MATLAB only makes a copy of the array reference. This requires no significant additional memory.

B = A;
memUsed
ans = 
       3998.1

Now modify B by removing half of its rows. Because A and B no longer point to the same data, MATLAB must allocate a separate memory block to B. As a result, the amount of memory used by the MATLAB process increases by the size of B, which is about 16 MB (one half of the 32 MB required for A).

B(1001:2000,:) = [];
memUsed
ans = 
       4014.1

Function Arguments

MATLAB handles arguments passed in function calls in the same way that it handles arrays being copied. When you pass a variable to a function, you actually pass a reference to the data that the variable represents. As long as the data is not modified by the called function, the variable in the calling function or script and the variable in the called function point to the same location in memory. If the called function modifies the value of the input data, then MATLAB makes a copy of the original variable in a new location in memory, updates that copy with the modified value, and points the input argument in the called function to this new location.

For example, consider the function myfun, which modifies the value of the array passed to it. MATLAB makes a copy of A in a new location in memory, sets the variable X as a reference to this copy, and then sets one row of X to zero. The array referenced by A remains unchanged.

A = magic(5);
myfun(A)

function myfun(X)
X(4,:) = 0;
disp(X)
end

If the calling function or script needs the modified value of the array it passed to myfun, you need to return the updated array as an output of the called function.

Data Types and Memory

Memory requirements differ for MATLAB data types. You might be able to reduce the amount of memory used by your code by learning how MATLAB treats various data types.

Numeric Arrays

MATLAB allocates 1, 2, 4, or 8 bytes to 8-bit, 16-bit, 32-bit, and 64-bit signed and unsigned integers, respectively. It represents floating-point numbers in either double-precision (double) or single-precision (single) format. Because MATLAB stores numbers of type single using 4 bytes, they require less memory than numbers of type double, which use 8 bytes. However, because they are stored with fewer bits, numbers of type single are represented to less precision than numbers of type double. In MATLAB, double is the default numeric data type and provides sufficient precision for most computational tasks. For more information, see Floating-Point Numbers.

Structure and Cell Arrays

While numeric arrays must be stored in a contiguous block of memory, structures and cell arrays can be stored in noncontiguous blocks. For structures and cell arrays, MATLAB creates a header not only for the array, but also for each field of the structure or each cell of the cell array. Therefore, the amount of memory required to store a structure or cell array depends not only on how much data it holds, but also on how it is constructed.

For example, consider a scalar structure S1 with fields R, G, and B, where each field contains a 100-by-50 array. S1 requires one header to describe the overall structure, one header for each unique field name, and one header for each field. This makes a total of seven headers for the entire structure.

S1.R = zeros(100,50);
S1.G = zeros(100,50);
S1.B = zeros(100,50);

On the other hand, consider a 100-by-50 structure array S2 in which each element has scalar fields R, G, and B. In this case, S2 needs one header to describe the overall structure, one header for each unique field name, and one header for each field of the 5,000 elements, making a total of 15,004 array headers for the entire structure array.

for i = 1:100
    for j=1:50
        S2(i,j).R = 0;
        S2(i,j).G = 0;
        S2(i,j).B = 0;
    end
end

Use the whos function to compare the amount of memory allocated to S1 and S2 on a 64-bit system. Even though S1 and S2 hold the same data, S1 uses significantly less memory.

whos S1 S2
  Name        Size              Bytes  Class     Attributes

  S1          1x1              120504  struct              
  S2        100x50            1680192  struct              

Complex Arrays

MATLAB uses an interleaved storage representation of complex numbers, where the real and imaginary parts are stored together in a contiguous block of memory. If you make a copy of a complex array, and then modify only the real or imaginary part of the array, MATLAB creates an array containing both real and imaginary parts. For more information about the representation of complex numbers in memory, see MATLAB Support for Interleaved Complex API in MEX Functions.

Sparse Matrices

It is a good practice to store matrices with few nonzero elements using sparse storage. When a full matrix has a small number of nonzero elements, converting the matrix to sparse storage typically improves memory usage and code execution time. You can convert a full matrix to sparse storage using the sparse function.

For example, let matrix A be a 1,000-by-1,000 full storage identity matrix. Create B as a sparse copy of A. In sparse storage, the same data uses a significantly smaller amount of memory.

A = eye(1000);
B = sparse(A);
whos A B
  Name         Size                Bytes  Class     Attributes

  A         1000x1000            8000000  double              
  B         1000x1000              24008  double    sparse    

Working with Large Data Sets

When you work with large data sets, repeatedly resizing arrays might cause your program to run out of memory. If you expand an array beyond the available contiguous memory of its original location, MATLAB must make a copy of the array and move the copy into a memory block with sufficient space. During this process, there are two copies of the original array in memory. This temporarily doubles the amount of memory required for the array and increases the risk of your program running out of memory. You can improve the memory usage and code execution time by preallocating the maximum amount of space required for the array. For more information, see Preallocation.

See Also

|

Related Topics