parallel code execution on MATLAB cluster

As I run a code on a cluster using spmd, sometimes a worker gets disconnected and the execution stops. In another instance, the job became 'queued' after running for multiple hours and then eventually the execution stopped. What could be potential reasons for these?

1 Kommentar

Are you using Linux? Could you cofirm the maximum process is sufficient?
ulimit -a

Melden Sie sich an, um zu kommentieren.

Antworten (0)

Kategorien

Mehr zu MATLAB Parallel Server finden Sie in Hilfe-Center und File Exchange

Gefragt:

am 10 Jan. 2018

Kommentiert:

am 11 Jan. 2018

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by