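The current eval trajectory is generated with random actions. For unstable systems such as a drone, this may not be representative of realistic test data, since random inputs can quickly push the system away from its normal operating regime.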
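A minimal sketch of the concern, assuming a toy scalar system x[t+1] = a*x[t] + b*u[t] with a > 1 as a stand-in for an unstable plant. This is not the drone model or the actual eval code; the names `rollout`, `random_policy` and `noisy_feedback_policy`, the gain, and all constants are hypothetical. It compares a rollout under purely random actions with one under a stabilizing feedback plus the same kind of noise:

```python
import random


def rollout(policy, a=1.2, b=1.0, steps=200, seed=0):
    """Roll out x[t+1] = a*x[t] + b*u[t] from x[0] = 0 and return all visited states.

    a > 1 makes the open-loop system unstable (toy assumption, not a drone model).
    """
    rng = random.Random(seed)
    x = 0.0
    states = [x]
    for _ in range(steps):
        u = policy(x, rng)
        x = a * x + b * u
        states.append(x)
    return states


def random_policy(x, rng):
    # Purely random actions, ignoring the state (as in the current eval trajectory).
    return rng.uniform(-1.0, 1.0)


def noisy_feedback_policy(x, rng, gain=0.7):
    # Hypothetical alternative: stabilizing feedback plus bounded exploration noise.
    # With a=1.2, b=1.0, gain=0.7 the closed loop is x[t+1] = 0.5*x[t] + noise.
    return -gain * x + rng.uniform(-1.0, 1.0)


random_max = max(abs(x) for x in rollout(random_policy))
feedback_max = max(abs(x) for x in rollout(noisy_feedback_policy))

print(f"max |x| with random actions:    {random_max:.3e}")
print(f"max |x| with feedback + noise:  {feedback_max:.3e}")
```

In this toy setting the random-action rollout grows exponentially, while the feedback rollout stays within |x| <= 2, because the closed-loop factor is 0.5 and the noise is bounded by 1. Whether a similar controller-plus-noise scheme is suitable for the drone eval is a design decision for the author; the sketch only illustrates why the two kinds of trajectory can cover very different regions of state space.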