This is the final project for CIS 3203: Introduction to Artificial Intelligence. We created a neural network for pattern recognition of Hiragana characters, trained with the backpropagation algorithm. We implemented it both with NumPy and without NumPy (the pure-Python version uses a much smaller set of sample input data).
Kaito Tsutsui
Sahil Jartare
Each row of the input matrix below represents a single Hiragana character. Since each character is encoded as 64 zeros and ones and we give 46 characters as input, the matrix is 46x64.
$$
x =
\begin{bmatrix}
x_{11} & x_{12} & \cdots & x_{1\ 64}\\
x_{21} & x_{22} & \cdots & x_{2\ 64}\\
\vdots & \vdots & \ddots & \vdots\\
x_{46\ 1} & x_{46\ 2} & \cdots & x_{46\ 64}
\end{bmatrix}
$$
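As a concrete illustration, one row of $x$ can be built by flattening a binary bitmap into 64 values. This is only a sketch: the bitmap below is invented for illustration (it is not one of our actual characters), and it assumes the 64 values come from an 8x8 layout.

```python
import numpy as np

# Hypothetical 8x8 bitmap standing in for one Hiragana character
# (the values are invented for illustration).
bitmap = np.array([
    [0, 0, 1, 1, 1, 1, 0, 0],
    [0, 1, 0, 0, 0, 0, 1, 0],
    [1, 0, 0, 0, 0, 0, 0, 1],
    [1, 0, 0, 0, 0, 0, 0, 1],
    [1, 0, 0, 1, 1, 0, 0, 1],
    [1, 0, 0, 0, 0, 0, 0, 1],
    [0, 1, 0, 0, 0, 0, 1, 0],
    [0, 0, 1, 1, 1, 1, 0, 0],
])

row = bitmap.flatten()  # shape (64,) -- one row of the 46x64 matrix x
```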
The expected output is a 46x46 identity matrix: input $i$ should activate only output node $i$.
$$
y =
\begin{bmatrix}
1 & 0 & \cdots & 0\\
0 & 1 & \cdots & 0\\
\vdots & \vdots & \ddots & \vdots\\
0 & 0 & \cdots & 1
\end{bmatrix}
$$
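In NumPy this is a one-liner (NUM_OF_INPUTS is 46 here):

```python
# y: NUM_OF_INPUTS x NUM_OF_INPUTS identity matrix (46 x 46)
y = np.identity(NUM_OF_INPUTS)
```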
To initialize all the weights, we drew samples from a uniform distribution over $[-0.5, 0.5]$.
$w$ represents the weight matrix between the input and the hidden layer. Its size is the number of hidden nodes by the length of each input vector, which is 20x64 in this case.
$$
w =
\begin{bmatrix}
w_{11} & w_{12} & \cdots & w_{1\ 64}\\
w_{21} & w_{22} & \cdots & w_{2\ 64}\\
\vdots & \vdots & \ddots & \vdots\\
w_{20\ 1} & w_{20\ 2} & \cdots & w_{20\ 64}
\end{bmatrix}
$$
```python
# w: NUM_OF_HIDDEN_NODES x length of each input vector (20 x 64)
w = np.random.uniform(-0.5, 0.5, (NUM_OF_HIDDEN_NODES, x.shape[1]))
```
$b$ represents the bias vector for the hidden nodes. Its size is 1 by the number of hidden nodes, which is 1x20 in this case.
$$
b =
\begin{bmatrix}
b_1 & b_2 & \cdots & b_{20}
\end{bmatrix}
$$
```python
# b: 1 x NUM_OF_HIDDEN_NODES (NumPy shape (20,), which broadcasts as a row)
b = np.random.uniform(-0.5, 0.5, NUM_OF_HIDDEN_NODES)
```
$u$ represents the weight matrix between the hidden layer and the output. Its size is the number of hidden nodes by the number of inputs (one output node per character), which is 20x46 in this case.
$$
u =
\begin{bmatrix}
u_{11} & u_{12} & \cdots & u_{1\ 46}\\
u_{21} & u_{22} & \cdots & u_{2\ 46}\\
\vdots & \vdots & \ddots & \vdots\\
u_{20\ 1} & u_{20\ 2} & \cdots & u_{20\ 46}
\end{bmatrix}
$$
```python
# u: NUM_OF_HIDDEN_NODES x NUM_OF_INPUTS (20 x 46)
u = np.random.uniform(-0.5, 0.5, (NUM_OF_HIDDEN_NODES, NUM_OF_INPUTS))
```
$ub$ represents the bias vector for the output. Its size is 1 by the number of inputs, which is 1x46 in this case.
$$
ub =
\begin{bmatrix}
ub_1 & ub_2 & \cdots & ub_{46}
\end{bmatrix}
$$
```python
# ub: 1 x NUM_OF_INPUTS (1 x 46)
ub = np.random.uniform(-0.5, 0.5, NUM_OF_INPUTS)
```
We used 10,000 iterations (epochs) and a learning rate of 2.
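As constants these might be declared as follows (the names EPOCHS and LEARNING_RATE are ours, not from the original code):

```python
EPOCHS = 10000     # training iterations over the full character set
LEARNING_RATE = 2  # the lambda in the weight-update equations below
```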
Hidden nodes for each row of input ($x_i$)
$$
\underset{1 \times 20}{hidden} = sigmoid( \underset{1 \times 64}{x_i} \times \underset{64 \times 20}{w^T} + \underset{1 \times 20}{b} )
$$
$$
\underset{1 \times 46}{output} = sigmoid( \underset{1 \times 20}{hidden} \times \underset{20 \times 46}{u} + \underset{1 \times 46}{ub} )
$$
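In code, the forward pass for one training example x[i] might look like this (sigmoid is the activation function defined below):

```python
# Forward pass for one training example x[i]
hidden = sigmoid(x[i] @ w.T + b)   # 1 x 20
output = sigmoid(hidden @ u + ub)  # 1 x 46
```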
Errors for each row of the expected output ($y_i$)
$$
\underset{1 \times 46}{errors} = output - expected = \underset{1 \times 46}{output} - \underset{1 \times 46}{y_i}
$$
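In code (y[i] is the i-th row of the identity matrix):

```python
errors = output - y[i]  # 1 x 46
```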
Sigmoid activation function
$$
F(x) = \sigma(x) = \frac{1}{1 + e^{-x}}
$$
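A direct translation into code:

```python
def sigmoid(x):
    return 1 / (1 + np.exp(-x))
```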
Derivative of sigmoid activation function
$$
F'(x) = \sigma '(x) = \sigma (x) (1 - \sigma (x))
$$
Using the derivative of the sigmoid function,
$$
\underset{1 \times 46}{dErrors} = \underset{1 \times 46}{errors} \odot sigmoidDerivative(\underset{1 \times 46}{output})
$$
```python
def sigmoidDerivative(y):
    # y is already sigmoid(x), so sigma'(x) = sigma(x)(1 - sigma(x)) = y(1 - y)
    return y * (1 - y)
```
Derivative of hidden nodes
$$
\underset{1 \times 20}{dHidden} = \left( \underset{1 \times 46}{dErrors} \times \underset{46 \times 20}{u^T} \right) \odot sigmoidDerivative(\underset{1 \times 20}{hidden})
$$
$$
\underset{20 \times 64}{w_{new}} = \underset{20 \times 64}{w_{old}} - \underset{1 \times 20}{dHidden} \otimes \underset{1 \times 64}{x_i} \odot \lambda
$$
$$
\underset{1 \times 20}{b_{new}} = \underset{1 \times 20}{b_{old}} - \underset{1 \times 20}{dHidden} \odot \lambda
$$
$$
\underset{20 \times 46}{u_{new}} = \underset{20 \times 46}{u_{old}} - \underset{1 \times 20}{hidden} \otimes \underset{1 \times 46}{dErrors} \odot \lambda
$$
$$
\underset{1 \times 46}{ub_{new}} = \underset{1 \times 46}{ub_{old}} - \underset{1 \times 46}{dErrors} \odot \lambda
$$
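Putting the error terms and the four update rules together, one training step for example i might look like the sketch below, where np.outer implements the outer product $\otimes$ and LEARNING_RATE is the constant we introduced above:

```python
# Backward pass and weight update for one training example x[i]
dErrors = errors * sigmoidDerivative(output)           # 1 x 46
dHidden = (dErrors @ u.T) * sigmoidDerivative(hidden)  # 1 x 20

w -= np.outer(dHidden, x[i]) * LEARNING_RATE    # 20 x 64
b -= dHidden * LEARNING_RATE                    # 1 x 20
u -= np.outer(hidden, dErrors) * LEARNING_RATE  # 20 x 46
ub -= dErrors * LEARNING_RATE                   # 1 x 46
```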
Prediction for each row of input
$$
\underset{1 \times 20}{hidden} = sigmoid( \underset{1 \times 64}{x_i} \times \underset{64 \times 20}{w^T} + \underset{1 \times 20}{b})
$$
$$
\underset{1 \times 46}{output} = sigmoid( \underset{1 \times 20}{hidden} \times \underset{20 \times 46}{u} + \underset{1 \times 46}{ub})
$$
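A sketch of prediction over all 46 rows at once, stacking each 1x46 output into a 46x46 matrix (the predict helper is our own wrapper around the equations above):

```python
def predict(x, w, b, u, ub):
    hidden = sigmoid(x @ w.T + b)    # 46 x 20
    return sigmoid(hidden @ u + ub)  # 46 x 46

predictions = predict(x, w, b, u, ub)
# Each row should peak at its own index, i.e. along the diagonal:
print(np.argmax(predictions, axis=1))  # expected: [0, 1, ..., 45]
```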
After training, all of the values on the diagonal of the output matrix are close to 1, and the others are close to 0. Hence, the neural network we built classifies every character correctly.