ASAL (Active Score for Active Learning)

Active Learning Initialization for Supervised Visual Defect Detection
The 2023 Fall Conference of Korean Institute of Industrial Engineers
Associated with Industrial Artificial Intelligence Lab

📍 Project Objective

This study is a method of reflecting the distribution of actual industrial data in which normal and abnormal data are disproportionately mixed, and proposes an initialization methodology for solving the cold-start problem of active learning.

📍 Project Duration

2023.01 ~ 2023.11

📍 Team Member


Jeewon Kim	Sohee Bae

1. Overview

Proposal of an Active Learning methodology using an Unlabeled Set reflecting the distribution of real industrial data with imbalanced normal and abnormal data.
Utilization of Anomaly Detection's Anomaly Score to address the "Cold-Start" challenge in Active Learning.
Construction of a balanced initial dataset using the Anomaly Score, allowing the model to grasp various data patterns in the early learning stages and reducing labeling costs.

Overall Structure
- Sampling Module for Initial Stage using Anomaly Score
- Active Learning

2. Environment

Python version is 3.9.
Installing all the requirements may take some time. After installation, you can run the codes.
Please notice that we used 'PyTorch' and device type as 'GPU'.
We utilized 2 GPUs in our implementation. If the number of GPUs differs, please adjust the code accordingly based on the specific situation.
requirements.txt file is required to set up the virtual environment for running the program. This file contains a list of all the libraries needed to run your program and their versions.

In Anaconda Environment,
```
$ conda create -n [your virtual environment name] python=3.9

$ conda activate [your virtual environment name]

$ pip install -r requirements.txt
```
- Create your own virtual environment.
- Activate your Anaconda virtual environment where you want to install the package. If your virtual environment is named 'testasal', you can type conda activate testasal.
- Use the command pip install -r requirements.txt to install libraries.

3. Dataset

The MVTec anomaly detection dataset (MVTec AD)
You need to create a folder, ./DATA
Merging the Train and Test sets of the existing data, the combined dataset was then divided into Train and Test sets in a ratio of 0.8:0.2.

├── DATA
│   ├── class1
│   │    └── train
│   │    └── test
│   ├── class2
│   │    └── train
│   │    └── test
│   │
│   │    ...
│   │
│   │
│   ├── class15
│   │    └── train
│   │    └── test

4. Prerequisites

For each category, the anomaly scores of the data are sorted in descending order and then divided into five batches, which are subsequently saved as text files.
In our experiment, we employed the DRAEM (Discriminatively Trained Reconstruction Embedding for Surface Anomaly Detection) method to derive anomaly scores.
In loss_batch folder, you can check the anomaly scores we obtained.
batch.txt : The anomaly scores for the entire training dataset, sorted in descending order
batchN.txt : Nth batch among the five batches divided using batch.txt

├── loss_batch
│   ├── anomaly_scores_draem
│   │    └── class1
│   │    │    └── batch.txt
│   │    │    └── batch0.txt
│   │    │    └── batch1.txt
│   │    │    └── batch2.txt
│   │    │    └── batch3.txt
│   │    │    └── batch4.txt
│   │    │
│   │    │    ...
│   │    │
│   │    │
│   │    └── class15
│   │    │    └── batch.txt
│   │    │    └── batch0.txt
│   │    │    └── batch1.txt
│   │    │    └── batch2.txt
│   │    │    └── batch3.txt
│   │    │    └── batch4.txt

Each text file contains, in sequential order, the paths of the corresponding data. An excerpt from an example text file is provided below:

./DATA/cable/train/anomaly/anomaly_cable_missing_cable_011.png
./DATA/cable/train/anomaly/anomaly_cable_missing_cable_006.png
./DATA/cable/train/anomaly/anomaly_cable_missing_cable_004.png
./DATA/cable/train/anomaly/anomaly_cable_missing_cable_010.png
./DATA/cable/train/anomaly/anomaly_cable_missing_cable_009.png
./DATA/cable/train/anomaly/anomaly_cable_missing_cable_008.png
./DATA/cable/train/anomaly/anomaly_cable_missing_cable_000.png
...

5. Implementation

You need to run main.py.

python main.py --loss_type anomaly_scores_draem

--loss_type : This should have the same name as the first subfolder within 'loss_batch'
checkpoint, main_best_acc, and main_best_auroc folders will be created automatically during model training, and these folders can be used to monitor the performance of the model.

6. Result

ASAL(ours) : This is the proposed method in our study that utilizes Anomaly Score for balanced sampling in imbalanced datasets
Random: Random sampling of 10 samples from the entire Unlabeled Data
DRAEM_mixed: Anomaly detection performance when trained on all data, including a mixture of normal and abnormal data
DRAEM_normal: Anomaly detection performance when trained only on all normal data

7. Reference

[1] Yi, J. S. K., Seo, M., Park, J., & Choi, D. G. (2022, October). Pt4al: Using self-supervised pretext tasks for active learning. In European Conference on Computer Vision (pp. 596-612). Cham: Springer Nature Switzerland.
[2] Zavrtanik, V., Kristan, M., & Skočaj, D. (2021). Draem-a discriminatively trained reconstruction embedding for surface anomaly detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 8330-8339).
Code Implementation is based on [1]

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
loss_batch/anomaly_scores_draem		loss_batch/anomaly_scores_draem
models		models
README.md		README.md
loader.py		loader.py
main.py		main.py
requirements.txt		requirements.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ASAL (Active Score for Active Learning)

📍 Project Objective

📍 Project Duration

📍 Team Member

1. Overview

Overall Structure

2. Environment

In Anaconda Environment,

3. Dataset

4. Prerequisites

5. Implementation

6. Result

7. Reference

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ASAL (Active Score for Active Learning)

📍 Project Objective

📍 Project Duration

📍 Team Member

1. Overview

Overall Structure

2. Environment

In Anaconda Environment,

3. Dataset

4. Prerequisites

5. Implementation

6. Result

7. Reference

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages