specifying a stride length in ncread

Question

Chad Greene am 4 Okt. 2016

2
Verknüpfen

Direkter Link zu dieser Frage

https://de.mathworks.com/matlabcentral/answers/305610-specifying-a-stride-length-in-ncread

Beantwortet: Aylin am 14 Okt. 2016

I have a big 1.5 GB .nc file. Data loading is the slowest part of my processing, but I'm lucky that it will be sufficient to load only every Nth data point. Loading the whole file takes about 0.14 seconds:

tic 
z = ncread('myfile.nc','z'); 
toc 
Elapsed time is 0.142669 seconds.

which is about the same amount of time it takes when I specify that which indices to load:

tic 
z = ncread('myfile.nc','z',[1 1],[Inf Inf],[1 1]); 
toc
Elapsed time is 0.156108 seconds.

And so it should be faster if I specify a "stride" of more than 1. But it actually takes much more time to load every 2nd datapoint:

tic 
z = ncread('myfile.nc','z',[1 1],[Inf Inf],[2 2]); 
toc
Elapsed time is 4.992349 seconds.

Increasing the stride length beyond 2 seems to bring data loading time back down, but I have to use a stride length of 8 or more to get any benefit at all. What gives? Any ideas for fixes?

4 Kommentare
2 ältere Kommentare anzeigen2 ältere Kommentare ausblenden

KSSV am 5 Okt. 2016

Have you tried the same with netcdf.getVar?

Chad Greene am 5 Okt. 2016

In MATLAB Online öffnen

Oh, interesting idea. The issue persists!

tic 
ncid = netcdf.open('myfile.nc'); 
z = netcdf.getVar(ncid,2,[1 1],[12444 12444],[1 1]); 
toc
Elapsed time is 0.231038 seconds.
tic 
ncid = netcdf.open('myfile.nc'); 
z = netcdf.getVar(ncid,2,[1 1],[12444/2 12444/2],[2 2]); 
toc
Elapsed time is 4.881778 seconds.

Melden Sie sich an, um zu kommentieren.

Melden Sie sich an, um diese Frage zu beantworten.

Answer 1

Aylin am 14 Okt. 2016

1
Verknüpfen

Direkter Link zu dieser Antwort

https://de.mathworks.com/matlabcentral/answers/305610-specifying-a-stride-length-in-ncread#answer_239033

In MATLAB Online öffnen

It looks like this issue is actually occurring in the underlying NetCDF C library that MATLAB uses. Here is a discussion on the NetCDF mailing list about this issue from 2013:

http://www.unidata.ucar.edu/mailing_lists/archives/netcdfgroup/2013/msg00312.html

As an example, I downloaded the ‘ test_echam_spectral.nc ’ NetCDF file from

http://www.unidata.ucar.edu/software/netcdf/examples/files.html

Then, I entered the following commands into the MATLAB command prompt:

>> tic; z = ncread('test_echam_spectral.nc', 'xl'); toc; % Elapsed time is 0.035419 seconds
>> tic; z = ncread('test_echam_spectral.nc', 'xl', [1 1 1 1], [Inf Inf Inf Inf], [2 1 1 1]); toc; % Elapsed time is 0.424505 seconds

Clearly, the strided read is about an order of magnitude slower than the contiguous read. I was able to remedy the issue by reading the whole array, and then filtering the array using MATLAB’s inbuilt array manipulation syntax:

>> tic; z = ncread('test_echam_spectral.nc', 'xl'); z = z(1:2:192, :, :, :); toc; % Elapsed time is 0.041134 seconds

Note that this still takes a little more time than reading the whole array contiguously. However, this is much faster than using strided read in the ‘ ncread ’ function.

0 Kommentare
-2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

Melden Sie sich an, um zu kommentieren.

specifying a stride length in ncread

4 Kommentare
2 ältere Kommentare anzeigen2 ältere Kommentare ausblenden

Akzeptierte Antwort

0 Kommentare
-2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

Weitere Antworten (0)

Siehe auch

Kategorien

Tags

Community Treasure Hunt

specifying a stride length in ncread

4 Kommentare 2 ältere Kommentare anzeigen2 ältere Kommentare ausblenden

Akzeptierte Antwort

0 Kommentare -2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

Weitere Antworten (0)

Siehe auch

Kategorien

Tags

Community Treasure Hunt

4 Kommentare
2 ältere Kommentare anzeigen2 ältere Kommentare ausblenden

0 Kommentare
-2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden