About the training time of the Prim agent

Hi,
Thanks for sharing the source code of this wonderful project.
I have a follow-up question to issue #1.
I have recently been training both the Prim and Mesh agents on the airplane dataset using depth image input.
However, training takes a very long time, and I am not sure whether it is actually effective.
So far, for the Prim agent, each epoch takes about 0.85 hours, and I am currently running it for 140 epochs (more than 4 days, as you mentioned in #1).
However, from what I can observe, the reward curves are not converging.
I also compared my currently trained model with the pretrained Prim agent weights by comparing their mean rewards using test.py.
The mean reward of my currently trained model is still pretty bad.
Do you happen to have any ideas about potential mistakes I may have made here?
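
For reference, here is how the training time quoted above works out. This is only a quick sketch of the arithmetic: the two constants are the figures from this post, and the function name is made up for illustration, not something from the repository.

```python
# Figures quoted in this post; the script does not measure anything itself.
HOURS_PER_EPOCH = 0.85
EPOCHS = 140


def total_training_time(hours_per_epoch, epochs):
    """Return the total training time as (hours, days)."""
    total_hours = hours_per_epoch * epochs
    return total_hours, total_hours / 24.0


total_hours, total_days = total_training_time(HOURS_PER_EPOCH, EPOCHS)
print(f"{total_hours:.1f} hours = {total_days:.2f} days")
```

So the full 140-epoch run is just under 5 days, consistent with the "more than 4 days" estimate.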

Thank you.
