GitHub - yanniey/computer_vision_toolkit: A repo for my Computer Vision projects using scikit-image, Tensorflow and Keras.

A list of Jupyter notebooks and functions for Computer Vision tasks, using OpenCV, Pillow and Tensorflow

😎

Cheat Sheet: How to debug a Neural Network for Computer Vision problems in Tensorflow

Jupyter Notebooks

Masking & Image Manipulation
- Block views & Pooling (Max/Mean/Median)
- Contour Detection
- Convex Hull
- Edge detection using Roberts, Sobel and Canny edge detectors
Image Descriptors
1. SIFT(Scale Invariant Feature Transform): for detecting blobs (a region of an image that greatly differs from its surrounding areas).
2. SURF: for detecting blobs. Improved from SIFT and uses Fast Hessian algo
3. DAISY descriptors
4. HOG descriptor (Histogram of Oriented Gradients) with non-max suppression NMS (hog_detect_people.py)
5. Harris: for detecting corners (harrisCornerDetection.py)
6. FAST (Features from Accelerated Segment Test): for detecting corners
7. BRIEF: for detecting blobs
8. ORB (Oriented FAST and Rotated BRIEF): for detecting a combination of corners and blobs, uses both FAST and BRIEF (orb_knn.py)
Algo for matching features
1. Brute-force matching (cv2.BFMatcher class, using KNN and ratio test) (orb_knn.py)
2. FLANN-based (Fast Library for Approximate Nearest Neighbors) matching (flann.py & flann_homography.py)
Denoising Filters
1. Total variation filter: based on the principal that signals with noise have high total variation
2. Bilateral filter: good at preserving edges
3. Wavelet denoising filter: good at preserving image quality
Morphological reconstruction
1. Erosion (to find holes in image)
2. Dilation (to find peaks in image)
Segmentation and Transformation
1. Global (e.g. otsu thresholding) vs local thresholding (e.g. cv.adaptiveThreshold): Thresholding: convert grayscale images to binary, or generally to segment objects from the background
2. RAG (Region Adjacency Graph): Used to segment areas of interest from the image. Each region in image is represented as a graph node in RAG, and weight of edge = difference between average colors of pixels in each region
3. Watershed algos (Classic vs. Compact): Treats a grayscale image as a topographical map and finds lines between pixels of equal brightness. These lines are then used to segment the image into regions
4. Transformation algorithms: warp, swirl from skimage
5. Structural similarity index & MSE: measure how two images are different from each other
Dimension Reduction
1. Dictionary Learning
2. Convolution kernels
3. Autoencoders

Toolkit - Image

Files can the found in the folder Toolkit

High Pass Filter(HPF) and Low Pass Filter (LPF) (hpf.py)
Canny edge detection(canny.py)
Find contours (contours.py)
Try alls threshold methods, e.g. itsu, isodata, mean, min (try_all_threshold.py)
RAG thresholding(rag_thresholding.py)
Segmentation with Watershed algos (watershed_classic.py and watershed_compact.py)
Rotate, scale and translate the image (warp.py)
Add noise to image(add_noise.py)
Find similarity between images(MSE, Structural Similarity Index)(ssim.py)
Histogram comparison (histogram_comparison.py using the compareHist function from opencv)
Detecting lines with HoughLines and HoughLinesP, circles with HoughCircles (lineDetection.py,circleDetection.py). Detecting other shapes can be done via combining cv2.findContours and cv2.approxPolyDP
Foreground segmentation with GrabCut
Haar face detection haarFaceDetection.py
Face recognition: generateImages.py and faceRecognition.py (Eigenfaces, Fisherfaces,Local Binary Patterns Histograms)
Homography, i.e. find images that contain a specific icon (icon_matcher folder)
Non-max supression, used for detection with sliding windows where one object may get detected multiple times non_max_suppression.py
Customised object detector with SIFT, Bag of Words(BoW), SVM, sliding window and non-max suppression detector_car_svm.py and detector_car_bow_sliding_window.py
Save and load an SVM detector with svm.save() and svm.load()

Toolkit - Video

Files can the found in the folder Toolkit Video

Object tracking techniques:
1. Background subtraction
  - Basic motion detection using background subtraction basic_motion_detection.py
  - MOG background subtractor mog.py
  - KNN background subtractor knn.py
  - GMG background subtractor gmg.py
2. Histogram back-projection with MeanShift or CamShift meanshift.py, camshift.py
Kalman filters kalman.py, kalman_pedestrian_tracking.py

Toolkit - Neural Network

Files can the found in the folder Toolkit Neural Network

Simple neural network simple_neural_net.py, neural_net_multiple_features.py
Recognizing handwritten MNIST digits with neural network neural_net_MNIST.py. Run test_neural_net_MNIST.py to see the neural net's accuracy
Use the model built from MNIST data on new data detect_and_classify_digits.py
Ways to improve neural net performance:
1. Experiment with the size of your training dataset, the number of hidden nodes, and the number of epochs until you find a peak level of accuracy
2. Modify neural_net_MNIST.create_ann function so that it supports more than one hidden layer
3. Try different activation functions. We have used cv2.ml.ANN_MLP_SIGMOID_SYM, but it isn't the only option; the others include cv2.ml.ANN_MLP_IDENTITY, cv2.ml.ANN_MLP_GAUSSIAN, cv2.ml.ANN_MLP_RELU, and cv2.ml.ANN_MLP_LEAKYRELU
4. Try different training methods. We have used cv2.ml.ANN_MLP_BACKPROP. The other options include cv2.ml.ANN_MLP_RPROP and cv2.ml.ANN_MLP_ANNEAL
Save and load neural network models save_and_load_neural_net.py
Load a deep learning model for tensorflow load_tf_model.py
Detect and classify objects with 3rd party neural net: mobileNet + Single Shot Detector detect_objects_neural_net.py
Detect and classify faces with 3rd party neural nets: detect_faces_neural_net.py
- Face detection using the Caffe model res10_300x300_ssd_iter_140000
- Age and gender detection using the Caffe model age_net and gender_net

Toolkit - Imitate Film Filters

Files can the found in the folder toolkit_film_filters

Emulate the following 4 types of films using curves
- Kodak Portra, a family of films that is optimized for portraits and weddings class BGRPortraCurveFilter in filters.py
- Fuji Provia, a family of general-purpose films class BGRProviaCurveFilter in filters.py
- Fuji Velvia, a family of films that is optimized for landscapes class BGRVelviaCurveFilter in filters.py
- Cross-processing, a nonstandard film processing technique, sometimes used to produce a grungy look in fashion and band photography

Workflow

Edge detection (e.g. Sobel, Canny). May need to convert to grayscale first
Segment detection (e.g. RAG, watershed, GrabCut)
Transformation(rotation, scale, crop,distanceTransform)
- Apply Gaussian blur to remove noise and make the darkness of image more uniform
- Apply threshold to make image stand out from the background, and erosion to make contours free of irregularities
Feature extraction
Feature matching
- Brute Force
- FLANN-based with KNN and ratio test

Facial Detection and Recognition

Haar cascade classifiers
Facial recognition: Eigenfaces, Fisherfaces, Local Binary Pattern Histograms (LBPHs)

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
.idea		.idea
.ipynb_checkpoints		.ipynb_checkpoints
.vscode		.vscode
dataset		dataset
images		images
toolkit		toolkit
toolkit_3dtracking		toolkit_3dtracking
toolkit_film_filters		toolkit_film_filters
toolkit_neural_network		toolkit_neural_network
toolkit_video		toolkit_video
.DS_Store		.DS_Store
.gitignore		.gitignore
10_Histogram_comparison.ipynb		10_Histogram_comparison.ipynb
11_KeypointPreservingAugmentations.ipynb		11_KeypointPreservingAugmentations.ipynb
1_Loading_and_manipulating_image_data_with_scikit-image.ipynb		1_Loading_and_manipulating_image_data_with_scikit-image.ipynb
2_Image_descriptors.ipynb		2_Image_descriptors.ipynb
3_Corner_detection.ipynb		3_Corner_detection.ipynb
4_Denoising_filters.ipynb		4_Denoising_filters.ipynb
5_Morphological_reconstruction.ipynb		5_Morphological_reconstruction.ipynb
6_Thresholding_and_RAG.ipynb		6_Thresholding_and_RAG.ipynb
7_Watershed.ipynb		7_Watershed.ipynb
8_Transformation_algorithms.ipynb		8_Transformation_algorithms.ipynb
9_Structural_similarity_index.ipynb		9_Structural_similarity_index.ipynb
Dimension_Reduction_Autoencoders.ipynb		Dimension_Reduction_Autoencoders.ipynb
Dimension_Reduction_Convolution_Kernels.ipynb		Dimension_Reduction_Convolution_Kernels.ipynb
Dimension_Reduction_Dictionary_learning.ipynb		Dimension_Reduction_Dictionary_learning.ipynb
Feature_Extraction_DAISY.ipynb		Feature_Extraction_DAISY.ipynb
Feature_Extraction_HOG.ipynb		Feature_Extraction_HOG.ipynb
Feature_Extraction_SIFT.ipynb		Feature_Extraction_SIFT.ipynb
OCR_using_Tesseract.ipynb		OCR_using_Tesseract.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A list of Jupyter notebooks and functions for Computer Vision tasks, using OpenCV, Pillow and Tensorflow

Jupyter Notebooks

Toolkit - Image

Toolkit - Video

Toolkit - Neural Network

Toolkit - Imitate Film Filters

Workflow

Facial Detection and Recognition

About

Uh oh!

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

A list of Jupyter notebooks and functions for Computer Vision tasks, using OpenCV, Pillow and Tensorflow

Jupyter Notebooks

Toolkit - Image

Toolkit - Video

Toolkit - Neural Network

Toolkit - Imitate Film Filters

Workflow

Facial Detection and Recognition

About

Resources

Uh oh!

Stars

Watchers

Forks

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages