🔬 Wafer Map Defect Classification using CNN

An automated defect classification system for semiconductor wafer maps using Convolutional Neural Networks (CNN) to identify and categorize spatial defect patterns.

📋 Table of Contents

Overview
Business Value
Dataset
Installation
Project Workflow
Results
Model Architecture
Usage

🎯 Overview

This project develops an Automated Defect Classification system using Convolutional Neural Networks (CNN) to automatically categorize spatial defect patterns on semiconductor wafer maps. The system aims to:

Replace manual inspection processes
Reduce human error in defect classification
Accelerate identification of process anomalies
Enable faster Root Cause Analysis (RCA)

💼 Business Value

In semiconductor manufacturing, wafer map patterns provide critical insights into fabrication process health. This automated system delivers:

Yield Ramp-Up 🚀

Rapid identification of defect clusters (e.g., Scratch, Edge-Ring) enables process engineers to perform Root Cause Analysis faster.

Cost Reduction 💰

Automating classification reduces the "man-to-machine" ratio and minimizes misclassification risks due to operator fatigue.

Process Monitoring 🔍

Detecting systematic patterns vs. random defects helps pinpoint specific faulty process steps, such as:

Etching uniformity issues
CMP (Chemical Mechanical Planarization) handling errors
Equipment-specific failures

📊 Dataset

Source: WM811K Wafer Map Dataset

Dataset Characteristics:

Scale: 811,000+ wafer maps
Format: 2D images with pixel values representing die status
- 0: Background
- 1: Good Die
- 2: Defective Die
Classes: 9 categories
- 8 defect patterns: Center, Donut, Edge-Loc, Edge-Ring, Loc, Random, Scratch, Near-full
- 1 normal class: None
Challenge: Natural class imbalance (common in manufacturing data)

Please be aware that the original dataset (WM-811K) contains a typo in the column header: trianTestLabel is used instead of trainTestLabel. To maintain compatibility with the raw data, this project uses the original spelling (trianTestLabel) throughout the codebase.

Defect Pattern Distribution

The dataset exhibits significant class imbalance, with Edge-Ring and Edge-Loc being the most common defect types.

Defect Size Distribution

🛠️ Installation

Prerequisites

Python 3.11+
pip package manager

Setup

Clone the repository

git clone https://github.com/NUSSETO/Semiconductor_Project.git
cd Semiconductor_Project

Create virtual environment (recommended)

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies

pip install -r requirements.txt

Download dataset

Download the WM811K dataset from Kaggle
Place LSWMD.pkl in the project root directory

🔄 Project Workflow

1. Exploratory Data Analysis (EDA)

Analyze class distribution
Visualize spatial defect patterns
Understand data characteristics

2. Data Preprocessing

Resize wafer maps to standardized dimensions (64×64 and 96x96 for comparative analysis)
Apply denoising techniques
Handle class imbalance

3. Model Architecture

Design and train CNN tailored for spatial pattern recognition:

Convolutional layers for feature extraction
MaxPooling for dimensionality reduction
Dense layers for classification
Dropout for regularization

4. Training & Optimization

Monitor training/validation metrics
Optimize hyperparameters
Implement data augmentation

5. Error Analysis

Investigate model performance through confusion matrices:

Initial Model Performance:

After First Optimization:

Final Model Performance:

📈 Results

Model Performance Metrics

Metric	Value
Overall Accuracy	~78%
Macro Recall	~79%
Training Time	~28 minutes
Model Size	~19 MB

Training History

🏗️ Model Architecture

Sequential Model:
├── Input (96 x 96 x 3)
├── Conv2D (32 filters, 5×5)
├── MaxPooling2D (2×2)
├── Conv2D (64 filters, 3×3)
├── MaxPooling2D (2×2)
├── Conv2D (128 filters, 3×3)
├── MaxPooling2D (2×2)
├── Conv2D (256 filters, 3×3)
├── MaxPooling2D (2×2)
├── Flatten
├── Dense (128 units)
├── Dropout (0.5)
└── Dense (8 units, softmax)

Key Features:

Input shape: 96×96×3
Activation: ReLU for hidden layers, Softmax for output
Optimizer: Adam
Loss function: Categorical Crossentropy

Note on Input Size: While the exploratory phase and initial training (as seen in main.ipynb) experimented with 64x64 resolution for efficiency, the final deployed model architecture has been optimized for 96x96 resolution to capture finer details of defect patterns.

🚀 Usage

Running the Notebook

Start Jupyter Notebook

jupyter notebook

Open main.ipynb
Run all cells or execute step-by-step:

Quick Start Example

# Load the trained model
from tensorflow.keras.models import load_model

# Option 1: Load the modern Keras format (Recommended)
model = load_model('model/wafer_defect_model.keras')

# Option 2: Load the legacy H5 format
# model = load_model('model/wafer_defect_model.h5')

# Predict on new wafer map
prediction = model.predict(preprocessed_wafer_map)
defect_class = np.argmax(prediction)

📁 Project Structure

Semiconductor_Project/
├── main.ipynb              # Main analysis notebook
├── main.html               # For easy access
├── LICENSE                 # MIT License
├── requirements.txt        # Python dependencies
├── .gitignore              # Git ignore rules
├── README.md               # This file
├── img/                    # Visualization images
│   ├── Defect_distribution.png
│   ├── Defect_size.png
│   ├── Resize_example.png
│   ├── Training_history.png
│   ├── Original_CM.png
│   ├── Second_CM.png
│   └── Final_CM.png
├── model/ 
│   ├── wafer_defect_model.h5
│   └── wafer_defect_model.keras
└── data/
    └── LSWMD.pkl           # Dataset (not tracked in git)

🙏 Acknowledgments

Dataset: WM811K Wafer Map Dataset
Inspired by semiconductor manufacturing quality control practices
Built with TensorFlow/Keras/Antigravity/Gemini

License

This project is open-source and available under the MIT License.

Author: Jason Huang
Focus: Semiconductor Manufacturing Quality Control, Machine Learning, Data Analysis

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
data		data
img		img
model		model
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.html		main.html
main.ipynb		main.ipynb
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

🔬 Wafer Map Defect Classification using CNN

📋 Table of Contents

🎯 Overview

💼 Business Value

Yield Ramp-Up 🚀

Cost Reduction 💰

Process Monitoring 🔍

📊 Dataset

Dataset Characteristics:

Defect Pattern Distribution

Defect Size Distribution

🛠️ Installation

Prerequisites

Setup

🔄 Project Workflow

1. Exploratory Data Analysis (EDA)

2. Data Preprocessing

3. Model Architecture

4. Training & Optimization

5. Error Analysis

📈 Results

Model Performance Metrics

Training History

🏗️ Model Architecture

🚀 Usage

Running the Notebook

Quick Start Example

📁 Project Structure

🙏 Acknowledgments

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages