Problem running GPU Reinforcement Learning

Patrick Doran on 14 Nov 2019
Commented: Anh Tran on 21 Feb 2020
I'm running a custom MATLAB reinforcement learning script, with which I have been able to train a DDPG agent on my CPU-only laptop.
Now I'm trying to run the same script on a computer with a GPU, but I'm getting dimension errors.
I added these parameters to the training options:
'UseParallel',parallel);
trainOpts.ParallelizationOptions.Mode = "async";
trainOpts.ParallelizationOptions.DataToSendFromWorkers = "experiences";
trainOpts.ParallelizationOptions.StepsUntilDataIsSent = 30;
trainOpts.ParallelizationOptions.WorkerRandomSeeds = -1;
Then I see these errors:
Error using rl.agent.AbstractPolicy/getInitialAction (line 133)
Invalid observation type or size.
Error in rl.env.MATLABEnvironment/simLoop (line 235)
action = getInitialAction(policy,observation);
Error in rl.env.MATLABEnvironment/simWithPolicy (line 113)
[expcell{simCount},epinfo,siminfos{simCount}] = simLoop(env,policy,opts,simCount,usePCT);
Error in rl.train.parforTrain (line 62)
parfor i = 1:activeSims
Error in rl.train.TrainingManager/train (line 264)
rl.train.parforTrain(this);
Error in rl.train.TrainingManager/run (line 155)
train(this);
Error in rl.agent.AbstractAgent/train (line 54)
TrainingStatistics = run(trainMgr);
Error in train_SSS_agent (line 25)
trainingStats = train(agent,env,trainingOptions);
Error in SSS_learning (line 9)
train_SSS_agent(agent,env);
Caused by:
Error using rl.agent.AbstractPolicy/getAction (line 119)
Invalid observation type or size.
Error using rl.util.rlAbstractRepresentation/evaluate (line 242)
The dimensions of input data are not compatible with those of observation and action info respectively.
I don't have much (or really any) knowledge of GPU computing, so there might be some fundamental setup steps I don't know about.
  2 Comments
Walter Roberson on 14 Nov 2019
Edited: Walter Roberson on 14 Nov 2019
UseParallel does not use a GPU: it uses additional processes, with all the problems that causes. For example, global variables are not transferred.
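To illustrate the distinction, here is a minimal sketch, assuming the R2019b-era Reinforcement Learning Toolbox API. In that release the device is chosen per representation through rlRepresentationOptions, while 'UseParallel' in rlTrainingOptions only controls worker processes. The learning rates and episode count are placeholder values, not taken from the original script.

```matlab
% Sketch only: assumes Reinforcement Learning Toolbox (R2019b-era API).
% Learning rates and MaxEpisodes are illustrative placeholders.

% Optional check; gpuDeviceCount requires Parallel Computing Toolbox.
fprintf('GPUs visible to MATLAB: %d\n', gpuDeviceCount);

% GPU use is requested on the actor/critic representation options.
criticOpts = rlRepresentationOptions('LearnRate',1e-3,'UseDevice',"gpu");
actorOpts  = rlRepresentationOptions('LearnRate',1e-4,'UseDevice',"gpu");

% Parallel workers are a separate setting; left off here so that the
% GPU path can be tested on its own first.
trainOpts = rlTrainingOptions('MaxEpisodes',500,'UseParallel',false);
```

These option objects would then be passed when the actor and critic representations are created, and trainOpts would be passed to train. Turning 'UseParallel' back on afterwards makes it easier to tell which of the two settings triggers the error.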
Anh Tran on 21 Feb 2020
From your code snippet, I am not sure whether your training is set up to use the GPU. The issue might come from something else, not from GPU training in parallel. Feel free to upload more code so we can better pinpoint the issue.
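Since the error message is "Invalid observation type or size", one thing that can be checked without the rest of the code is whether the observation returned by the environment matches its observation specification. The following is a rough sketch under stated assumptions: a predefined environment stands in for the custom one so that the snippet runs on its own, and env should be replaced by the actual environment object.

```matlab
% Sketch only: a predefined environment is a stand-in (assumption) for
% the custom environment used in the question.
env = rlPredefinedEnv('DoubleIntegrator-Continuous');

obsInfo = getObservationInfo(env);   % observation specification
obs     = reset(env);                % initial observation from the env

% Compare the size the agent expects with the size the env produces.
disp(obsInfo.Dimension);
disp(size(obs));
sizesMatch = isequal(size(obs), obsInfo.Dimension);
fprintf('Observation size matches spec: %d\n', sizesMatch);

% Built-in consistency check of reset/step against the specifications.
validateEnvironment(env);
```

If the sizes match in the main MATLAB session but the error still appears with 'UseParallel' enabled, one possible cause (not confirmed here) is that the environment depends on state that is not copied to the workers, such as the global variables mentioned in the comment above.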


Answers (0)