problem performing multiple, independent tasks simultaneously in parfor

2 Ansichten (letzte 30 Tage)
Here's a parfor code snippet that's intended to search for 2 different file types and load a file simultaneously, but fails due to different output types:
% run 3 independent things simultaneously to save time
parfor n 1:3
switch n
case 1
output{n} = dir('pathname\**\*.txt'); % this takes a long time
case 2
output{n} = dir('pathname\**\*.csv'); % this takes a long time
case 3
output{n} = load('pathname\matlab.mat'); % this also takes a long time
end
end
Here's how I'd prefer to write this function, which doesn't crash and runs MUCH faster than each function separately, but variables a, b, and c are not saved.
% run 3 independent things simultaneously to save time
parfor n 1:3
switch n
case 1
a = dir('pathname\**\*.txt'); % this takes a long time
case 2
b = dir('pathname\**\*.csv'); % this takes a long time
case 3
c = load('pathname\matlab.mat'); % this also takes a long time
end
end
I'm not I/O limited, these processes run simultaneously much faster. Run time with the second snippet is the same as the single longest operation. How can I run either method of code snippet with the outputs saved?
  3 Kommentare
Bruno Luong
Bruno Luong am 26 Jul. 2022
Bearbeitet: Bruno Luong am 26 Jul. 2022
Redargless it does't work (I don't know why just by reading your code)
Your parfor doesn't do any real computation, just reading, so it is mostly limited by your perepherical HW (reading speed and transfert data through the bus). Not sur parfor will not in contrary to what you would expect slow down even more, since the caching is worse.
David L.
David L. am 26 Jul. 2022
Edited above for clarity of my problem. Yes, the parfor loop runs much faster. Each function takes ~5 minutes, sequentially it's 15 minutes. In the parfor, it takes 5 minutes total.

Melden Sie sich an, um zu kommentieren.

Akzeptierte Antwort

Walter Roberson
Walter Roberson am 27 Jul. 2022
That code that assigns to output{n} will not fail due to different output types. All three of the outputs (for the code you posted) would be struct(), and besides you are outputting a cell array.
parfor n = 1:3
switch n
case 1
output{n} = dir('*.txt'); % this takes a long time
case 2
output{n} = dir('*.csv'); % this takes a long time
case 3
output{n} = struct('this', 5, 'that', 7); % this also takes a long time
end
end
Starting parallel pool (parpool) using the 'local' profile ... Connected to the parallel pool (number of workers: 2).
output
output = 1×3 cell array
{0×1 struct} {0×1 struct} {1×1 struct}
Have you considered using parfeval?
F(1) = parfeval(@() dir('*.txt'), 1);
F(2) = parfeval(@() dir('*.csv'), 1);
F(3) = parfeval(@() struct('this', 5, 'that', 7), 1);
wait(F)
a = fetchOutputs(F(1))
a = 0×1 empty struct array with fields: name folder date bytes isdir datenum
b = fetchOutputs(F(2))
b = 0×1 empty struct array with fields: name folder date bytes isdir datenum
c = fetchOutputs(F(3))
c = struct with fields:
this: 5 that: 7
  1 Kommentar
David L.
David L. am 28 Jul. 2022
Thanks very much Walter! This solution worked great! I also tried a batch method that worked too.

Melden Sie sich an, um zu kommentieren.

Weitere Antworten (0)

Kategorien

Mehr zu Parallel for-Loops (parfor) finden Sie in Help Center und File Exchange

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by