Skip to content

Data augmentation via Nearest Neighbour algorithms #246

@Catarina-Alves

Description

@Catarina-Alves

It could be nice to include a class that encapsulates data augmentation via Nearest Neighbour-inspired algorithms such as SMOTE (Synthetic Minority Over-sampling Technique), ADASYN etc. @tallamjr developed some code for this, and it is saved in utils/imblearn_augment.py.

I propose to implement this data augmentation methodology in snaugment. This involves testing and developing unit tests. Note that, in previous analysis, we found that SMOTE augmentation leads to information leaks in the classification step. Thus this must be checked when implementing this augmentation.

File: snaugment.py, utils/imblearn_augment.py

Metadata

Metadata

Assignees

No one assigned

    Labels

    featureTo add a new feature, new standalone files. (High level)

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions