How do I define a continuous reward function for an RL environment?
I am trying to follow the double integrator example to define a continuous reward function. When I used the custom template and defined the reward using the QR cost function, I got an error stating that the reward should be a scalar value. Where can I find the reward property and change it so that it accepts vector values?
3 Comments
Emmanouil Tzorakoleftherakis
on 12 Oct 2020
Not sure why you want the reward to be a vector. Typically, rewards are treated like cost functions: they output a scalar value. If you have more than one state, you can turn them into a scalar using, e.g., an l2 norm or some other distance metric.
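To make the suggestion above concrete, here is a minimal, language-agnostic sketch written in plain Python. It collapses a state vector and an action into a single scalar reward, once as the negative of a quadratic (QR) cost and once as a negative l2 norm. The weights Q and R, the state x, the action u, and the helper names are all assumed for illustration; they are not taken from the double integrator example or from any toolbox API.

```python
import math

# Illustrative sketch only: Q, R, x and u below are assumed values,
# not the ones used in the double integrator example.

def quadratic_form(v, M):
    """Return the scalar v' * M * v for a vector v and a square matrix M (nested lists)."""
    n = len(v)
    return sum(v[i] * M[i][j] * v[j] for i in range(n) for j in range(n))

def qr_reward(x, u, Q, R):
    """Scalar reward: the negative of the quadratic (QR) cost x'Qx + u'Ru."""
    return -(quadratic_form(x, Q) + quadratic_form(u, R))

def l2_reward(x):
    """Alternative scalar reward: the negative Euclidean (l2) norm of the state."""
    return -math.sqrt(sum(xi * xi for xi in x))

Q = [[10.0, 0.0], [0.0, 1.0]]  # assumed state weights
R = [[0.01]]                   # assumed action weight
x = [3.0, 4.0]                 # assumed state, e.g. [position, velocity]
u = [2.0]                      # assumed action, e.g. applied force

print(qr_reward(x, u, Q, R))   # a single number, not a vector
print(l2_reward(x))            # also a single number
```

In MATLAB the same quantity would be a one-liner along the lines of -(x'*Q*x + u'*R*u), which is already a scalar when x and u are column vectors. If the expression in your step function returns a vector instead, it is worth checking whether an element-wise product or a missing transpose is involved.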
Prashanth Chivkula
on 12 Oct 2020
Emmanouil Tzorakoleftherakis
on 12 Oct 2020
That's right
Accepted Answer