Parallel sever: Request GPU via SLURM
    19 Ansichten (letzte 30 Tage)
  
       Ältere Kommentare anzeigen
    
    Christopher McCausland
      
 am 15 Dez. 2022
  
    
    
    
    
    Kommentiert: Christopher McCausland
      
 am 19 Dez. 2022
            Hello,
I have access to an HPC and have been running CPU based code on  it for some time now with help from the mathworks team. Parrellel server dir() for files local to the server - MATLAB Answers - MATLAB Central (mathworks.com)
I now wish to use GPU acceleration. When I try to use a gpuArray I get an error that the GPU devices are not found. This is because they were not requested in my batch command to SLURM which makes sense. 
Is there a method to inject the additional SLURM gpu request argument e.g., --gres=gpu:2 into the .batch request (I am thinking something like the additionalSubmitArgs below) or is there a normal workflow for doing so without changing the submitCommunicatingJob function? It seems like a normal thing to request however I cannot find a straightforward workflow for it. 
function job = submit_brainwave_job()
cluster = parcluster();
idir = '/inputDir'
idir = '/outDir'
job = cluster.batch(@batch_build_0_0_1,1,{idir,odir}, ...
    "AutoAddClientPath",false, "CaptureDiary",true, ...
    "CurrentFolder",".", "Pool",3);
%additionalSubmitArgs = sprintf('--ntasks=%d --gres=gpu:2', props.NumberOfTasks);
end
 Kind regards,
Christopher
0 Kommentare
Akzeptierte Antwort
  Raymond Norris
    
 am 15 Dez. 2022
        cluster = parcluster();
cluster.AdditionalProperties.AdditionalSubmitArgs = '--gres=gpu:2';
You might also need to specify a gpu partition, as such
cluster.AdditionalProperties.AdditionalSubmitArgs = '--gres=gpu:2 -p <name-of-gpu-partition>';
Remember to unset AdditionalSubmitArgs if you don't need a GPU for you next job.
3 Kommentare
  Raymond Norris
    
 am 16 Dez. 2022
				Yes, that's right.  In fact, for individual users familiar with their cluster, this is a viable option.  When our team works with Cluster Admins, to make a more robust solution, we expand out AdditionalProperties, as such
cluster = parcluster();
cluster.AdditionalProperties.CpusPerGpu = 2;
cluster.AdditionalProperties.GpusPerNode = 2;
cluster.AdditionalProperties.Partition = 'gpu';
then on the backend, we implement getCommonSubmitArgs to convert this to the full Slurm syntax.  This way users don't have to know the Slurm syntax themselves.
Weitere Antworten (0)
Siehe auch
Kategorien
				Mehr zu Third-Party Cluster Configuration finden Sie in Help Center und File Exchange
			
	Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!

