This post has been de-listed
It is no longer included in search results and normal feeds (front page, hot posts, subreddit posts, etc). It remains visible only via the author's post history.
I am training an Actor - Critic model but it is not effectively learning the task. I realised, Critic Loss is not decreasing while training and decided to get an output of True rewards and critic outputs to compare critic networks performance. As you can see in the plot, it is not learning anything at all. I tried training with Vanilla LSTM and also another model with custom LSTM block with residual connection and feed forward network but both of them is doing same.
I am using shared layers for both Actor and Critic heads and single optimizer to train. What can be problem here?
Subreddit
Post Details
- Posted
- 2 months ago
- Reddit URL
- View post on reddit.com
- External URL
- reddit.com/r/reinforceme...