Question on SAC implementation

In SAC.py Line 120
https://github.com/p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch/blob/b338c87bebb672e39304e47e0eed55aeb462b243/agents/actor_critic_agents/SAC.py#L120
However, the output of `produce_action_and_action_info(state)` is
https://github.com/p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch/blob/b338c87bebb672e39304e47e0eed55aeb462b243/agents/actor_critic_agents/SAC.py#L135
So, even though SAC algorithm can work in practice, is it a mistake?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question on SAC implementation #79

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Question on SAC implementation #79

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions