Skip to content
This repository was archived by the owner on Jul 13, 2022. It is now read-only.
This repository was archived by the owner on Jul 13, 2022. It is now read-only.

Issue with DataSplitUtility repartition(0) #21

@ottobricks

Description

@ottobricks

When following this tutorial, I encounter the following error during feature selection thrown by DataSplitUtility:
java.lang.IllegalArgumentException: requirement failed: Number of partitions (0) must be positive.

The thing I do differently from the tutorial is setting the trainTestSplitMethod to "chronological" as in:

Map(
  ...
  "tunerTrainSplitMethod" -> "chronological",
  "tunerTrainSplitChronologicalColumn" -> "id",
  "tunerTrainSplitChronologicalRandomPercentage" -> 0.25,
  ...
)

Any ideas on how to fix the issue?

I am using:

  • Spark 3.2.0
  • Hadoop 3.3.1
  • Scala 2.12.15
  • automl-toolkit 0.8.1

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions