Sayak Mukherjee
Python, MATLAB
Spoken Languages:
Bengali, English, Hindi
Statistics
8 Questions
0 Answers
RANK
31,990
of 288,900
REPUTATION
1
CONTRIBUTIONS
8 Questions
0 Answers
ANSWER ACCEPTANCE
12.5%
VOTES RECEIVED
1
RANK
of 19,494
REPUTATION
N/A
AVERAGE RATING
0.00
CONTRIBUTIONS
0 Files
DOWNLOADS
0
ALL TIME DOWNLOADS
0
RANK
of 143,099
CONTRIBUTIONS
0 Problems
0 Solutions
SCORE
0
NUMBER OF BADGES
0
CONTRIBUTIONS
0 Posts
CONTRIBUTIONS
0 Public Channels
AVERAGE RATING
CONTRIBUTIONS
0 Highlights
AVERAGE NUMBER OF LIKES
Content Feed
Question
Mirror symmetry in actions in reinforcement learning
I am training an RL control problem to perform neck kinematics. I want the action space to have mirror symmetry as explained in ...
more than a year ago | 0 answers | 0
Question
Control the exploration in soft actor-critic
What is the best way to control exploration in a SAC agent? For the TD3 agent I used to control exploration by adjusting the v...
about 2 years ago | 1 answer | 1
Question
Reinforcement learning agent not being saved during training
I am trying to train my model using a TD3 agent. During the training process I am trying to save the agent above a certain episode...
more than 2 years ago | 1 answer | 0
Question
Don't need to save 'savedAgentResultStruct' with RL agent
When I am saving agents during RL iterations using the 'EpisodeReward' criterion, MATLAB is also saving 'savedAgentResultStruct' alon...
about 3 years ago | 0 answers | 0
Question
Change revolute joint parameter in env.ResetFcn during reinforcement learning
What is the best way to randomize the initial revolute joint angle during each episode of reinforcement learning? Right now I am...
more than 3 years ago | 0 answers | 0
Question
What is the best activation function to get action between 0 and 1 in DDPG network?
I am using a DDPG network to run a control algorithm which has inputs (actions of the RL agent, 23 in total) varying between 0 and 1. ...
more than 3 years ago | 1 answer | 0
Question
Expected reward blows up while training (DDPG agent, reinforcement learning)
I am training a DDPG network and after training for around 5000 iterations, the model does not seem to converge while the e...
more than 3 years ago | 1 answer | 0
Question
Use saved reinforcement learning DDPG agent
I have saved a DDPG agent using the option rlTrainingOptions.SaveAgentValue = 3000. During the simulations, a number of agents are ...
more than 3 years ago | 1 answer | 0