Hi,
thx for sharing the source code of this wonderful project.
I have a question following issue #1 .
I am recently training both Prim and Mesh agents on airplane dataset using the depth image input.
However, it took very long time and I am not sure whether the training is really effective.
So far, for Prim agent, each epoch took 0.85 hours, and I am currently running until 140 epochs (more than 4 days as you mentioned in #1 )
However, the reward curves are not converging based on my observation.
And I compared the currently trained model with the pretrained Prim agent weights (compare the mean rewards using test.py).
The mean reward of the currently trained model is still pretty bad.
I am wondering do you happen to have any ideas on potential mistakes I made here?
thank you
Hi,
thx for sharing the source code of this wonderful project.
I have a question following issue #1 .
I am recently training both Prim and Mesh agents on airplane dataset using the depth image input.
However, it took very long time and I am not sure whether the training is really effective.
So far, for Prim agent, each epoch took 0.85 hours, and I am currently running until 140 epochs (more than 4 days as you mentioned in #1 )
However, the reward curves are not converging based on my observation.
And I compared the currently trained model with the pretrained Prim agent weights (compare the mean rewards using test.py).
The mean reward of the currently trained model is still pretty bad.
I am wondering do you happen to have any ideas on potential mistakes I made here?
thank you