Skip to content

Question about airlines dataset #2

@bigwater

Description

@bigwater

Hi,

I am trying to test the airlines application in your repository. However, I got an error in load_data.py.

load_data.py gets the dataset from deephyper.benchmark.datasets.airlines, this part works okay. Some columns in the dataset are strings, for example, the airlines/airport names.

Ater loading the dataset from deephyper.benchmark.datasets.airlines, the error appeared in prepro_input.fit_transform(X_train), which reported ValueError: could not convert string to float: 'OO'. (The detailed error is listed at the bottom. )

Do you have any suggestions about it? Or where can I get the correct dataset of it?

Thank you so much...

!!! USING TEST DATA !!!
Uncaught exception <class 'ValueError'>: could not convert string to float: 'OO'Traceback (most recent call last):
  File "load_data.py", line 91, in <module>
    load_data(use_test=True)
  File "load_data.py", line 48, in load_data
    return load_data_cache(use_test=use_test)
  File "/lus/theta-fs0/projects/VeloC/hyliu/work_deephyper/deephyper/deephyper/benchmark/datasets/util.py", line 30, in wrapper
    (X_train, y_train), (X_valid, y_valid) = data_loader(*args, **kwargs)
  File "load_data.py", line 37, in load_data_cache
    X_train = prepro_input.fit_transform(X_train)
  File "/home/hyliu/work/softwares/conda/envs/testdh/lib/python3.7/site-packages/sklearn/pipeline.py", line 378, in fit_transform
    Xt = self._fit(X, y, **fit_params_steps)
  File "/home/hyliu/work/softwares/conda/envs/testdh/lib/python3.7/site-packages/sklearn/pipeline.py", line 307, in _fit
    **fit_params_steps[name])
  File "/home/hyliu/work/softwares/conda/envs/testdh/lib/python3.7/site-packages/joblib/memory.py", line 352, in __call__
    return self.func(*args, **kwargs)
  File "/home/hyliu/work/softwares/conda/envs/testdh/lib/python3.7/site-packages/sklearn/pipeline.py", line 754, in _fit_transform_one
    res = transformer.fit_transform(X, y, **fit_params)
  File "/home/hyliu/work/softwares/conda/envs/testdh/lib/python3.7/site-packages/sklearn/base.py", line 699, in fit_transform
    return self.fit(X, **fit_params).transform(X)
  File "/home/hyliu/work/softwares/conda/envs/testdh/lib/python3.7/site-packages/sklearn/preprocessing/_data.py", line 363, in fit
    return self.partial_fit(X, y)
  File "/home/hyliu/work/softwares/conda/envs/testdh/lib/python3.7/site-packages/sklearn/preprocessing/_data.py", line 398, in partial_fit
    force_all_finite="allow-nan")
  File "/home/hyliu/work/softwares/conda/envs/testdh/lib/python3.7/site-packages/sklearn/base.py", line 421, in _validate_data
    X = check_array(X, **check_params)
  File "/home/hyliu/work/softwares/conda/envs/testdh/lib/python3.7/site-packages/sklearn/utils/validation.py", line 63, in inner_f
    return f(*args, **kwargs)
  File "/home/hyliu/work/softwares/conda/envs/testdh/lib/python3.7/site-packages/sklearn/utils/validation.py", line 616, in check_array
    array = np.asarray(array, order=order, dtype=dtype)
  File "/home/hyliu/work/softwares/conda/envs/testdh/lib/python3.7/site-packages/numpy/core/_asarray.py", line 85, in asarray
    return array(a, dtype, copy=False, order=order)
ValueError: could not convert string to float: 'OO'

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions