
skordinal


What is skordinal?

skordinal is an experimental Python framework that builds on scikit-learn to automate machine learning experiments through simple JSON configuration files. Although initially designed for ordinal classification, it also supports standard classification algorithms, as long as they are scikit-learn compatible, making it easy to run reproducible experiments across multiple datasets and classification methods.

Installation

Requirements

skordinal requires Python 3.9 or higher and is tested on Python 3.9, 3.10, 3.11, and 3.12.

All dependencies are managed through pyproject.toml and include:

  • numpy (>=1.21)
  • pandas (>=1.0.1)
  • scikit-learn (>=1.3.0)
  • scipy (>=1.7)

Setup

  1. Clone the repository:

    git clone https://github.com/ayrna/skordinal
    cd skordinal
  2. Install the framework:

    pip install .

    For development purposes, use editable installation:

    pip install -e .

    Optional dependencies for development:

    pip install -e .[dev]

Note: an editable install is required for running the tests due to automatic dependency resolution.

Testing Installation

Test your installation with the provided example:

python config.py with skordinal/configurations/full_functionality_test.json

Quick Start

skordinal includes sample datasets with pre-partitioned train/test splits following a 30-holdout experimental design (30 independent random train/test splits per dataset).
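The 30-holdout design can be sketched as follows. This is an illustration only, not skordinal's actual partitioning code; the split ratio and per-split seeding are assumptions:

```python
import random

def holdout_splits(n_samples, n_splits=30, test_fraction=0.25):
    """Generate independent random train/test index splits (hypothetical sketch)."""
    splits = []
    for seed in range(n_splits):
        rng = random.Random(seed)  # one fixed seed per split, for reproducibility
        indices = list(range(n_samples))
        rng.shuffle(indices)
        cut = int(n_samples * (1 - test_fraction))
        splits.append((indices[:cut], indices[cut:]))
    return splits

splits = holdout_splits(100)  # 30 (train, test) index pairs
```

Each split is an independent shuffle, so results can be averaged over the 30 repetitions.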

Basic experiment configuration:

{
    "general_conf": {
        "basedir": "skordinal/datasets/data",
        "datasets": ["balance-scale", "contact-lenses", "tae"],
        "hyperparam_cv_nfolds": 3,
        "output_folder": "results/",
        "metrics": ["ccr", "mae", "amae"],
        "cv_metric": "mae"
    },
    "configurations": {
        "SVM": {
            "classifier": "SVC",
            "parameters": {
                "C": [0.001, 0.1, 1, 10, 100],
                "gamma": [0.1, 1, 10]
            }
        },
        "SVMOP": {
            "classifier": "OrdinalDecomposition",
            "parameters": {
                "dtype": "ordered_partitions",
                "decision_method": "frank_hall",
                "base_classifier": "SVC",
                "parameters": {
                    "C": [0.01, 0.1, 1, 10],
                    "gamma": [0.01, 0.1, 1, 10],
                    "probability": ["True"]
                }
            }
        }
    }
}

Run the experiment:

python config.py with my_experiment.json

Results are saved in the results/ folder, with performance metrics for each dataset-classifier combination. The framework automatically performs cross-validation, hyperparameter tuning, and evaluation on the test sets.
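Conceptually, each configuration is handled much like a scikit-learn GridSearchCV loop. The sketch below is an illustration of that pipeline, not skordinal's actual code; the dataset, scorer, and split are placeholders standing in for the framework's own data loading and "mae" metric:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.svm import SVC

# Placeholder dataset; skordinal would load the partitions from basedir instead
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Grid mirrors the "SVM" configuration from the JSON example above
search = GridSearchCV(
    SVC(),
    param_grid={"C": [0.001, 0.1, 1, 10, 100], "gamma": [0.1, 1, 10]},
    cv=3,                               # hyperparam_cv_nfolds
    scoring="neg_mean_absolute_error",  # cv_metric: "mae" (negated: higher is better)
)
search.fit(X_train, y_train)            # cross-validated hyperparameter tuning
test_score = search.score(X_test, y_test)  # evaluation on the held-out test set
```

The best hyperparameters found on the training folds are then refit on the full training partition before scoring on the test set, which is GridSearchCV's default behavior.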

Configuration Files

Experiments are defined using JSON configuration files with two main sections: general_conf for experiment settings and configurations for classifier definitions.

general_conf

Controls global experiment parameters.

Required parameters:

  • basedir: folder containing all dataset subfolders. Only one base folder can be used at a time, and it can be given as an absolute path or as a path relative to the framework folder.
  • datasets: names of the datasets to experiment with. A subfolder with the same name must exist inside basedir for each of them.

Optional parameters:

  • hyperparam_cv_nfolds: number of folds used during cross-validation.
  • jobs: number of jobs used by GridSearchCV during cross-validation.
  • input_preprocessing: data preprocessing applied to the inputs ("std" for standardization, "norm" for normalization, "" for none).
  • output_folder: name of the folder where all experiment results will be stored.
  • metrics: names of the performance metrics used to measure the train and test performance of each classifier.
  • cv_metric: error measure used by GridSearchCV to select the best set of hyperparameters.
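Putting the required and common optional keys together, a minimal general_conf section might look like this (all values are illustrative):

```json
{
    "general_conf": {
        "basedir": "skordinal/datasets/data",
        "datasets": ["tae"],
        "hyperparam_cv_nfolds": 3,
        "jobs": 4,
        "input_preprocessing": "std",
        "output_folder": "results/",
        "metrics": ["ccr", "mae"],
        "cv_metric": "mae"
    }
}
```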

configurations

Defines classifiers and their hyperparameters for GridSearchCV. Each configuration has a name and consists of:

  • classifier: scikit-learn or built-in skordinal classifier
  • parameters: hyperparameters for grid search (nested for ensemble methods)
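For a non-ensemble classifier, each key in parameters is a constructor argument and each list holds its candidate values, so the number of fitted candidates is the product of the list lengths. A small stdlib sketch of that grid expansion (this mirrors how scikit-learn's ParameterGrid behaves conceptually; it is not skordinal's internal code):

```python
from itertools import product

def expand_grid(parameters):
    """Expand a parameters dict of candidate lists into all combinations."""
    keys = sorted(parameters)
    return [dict(zip(keys, values))
            for values in product(*(parameters[k] for k in keys))]

grid = expand_grid({"C": [0.001, 0.1, 1, 10, 100], "gamma": [0.1, 1, 10]})
len(grid)  # 5 * 3 = 15 candidate hyperparameter settings
```

For ensemble methods such as OrdinalDecomposition, the nested parameters dict is expanded the same way for the base_classifier.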

Running Experiments

Basic Usage

python config.py with experiment_file.json

Example Output

Results are stored in the specified output folder with detailed performance metrics and hyperparameter information for each dataset and configuration combination.

License

This project is licensed under the BSD 3-Clause License.



About

A toolkit for ordinal regression and classification methods
