Community Profile


Takeshi Takahashi


Last seen: 23 Tage ago Active since 2021


  • Knowledgeable Level 2
  • First Answer

View badges

Content Feed

View by

Why does Soft actor critic have Entropy terms instead of Log probability?
RL toolbox also uses the log of the probability density to approximate the differential entropy.

3 Monate ago | 0

| accepted

ExperienceBuffer has 0 Length when i load a saved agent and continue training in reinforcement training
Length 0 means there isn't any experience in this buffer. I think it didn't save the experience buffer due to this bug. Please s...

5 Monate ago | 0

| accepted

How does RL algorithm work with RNNs?
Hi, rlDDPGAgent with RNN first randomly samples B sequences (trajectories) from the experience buffer, where B is MiniBatchSize...

7 Monate ago | 0

| accepted