Fashion MNIST On Steroids

This was the second project for the Machine Learning course on Faculty Of Computer Science.

Problem description

The first part of the project was to train a convolutional neural network using Keras framework to classify images from Fashion MNIST dataset with at least 85% accuracy on the test set. Trained models are in directory models.

fashion_full.h5 is a CNN classifier trained on all 60k train images and tested on 10k images. It has 4 convolution layers, 2 max-pooling layers, 3 dense layers, 5 dropout layers, and 6 batch normalization layers. Achieved 93.52% accuracy. But it's not great for the second part.

fashion_1_64.h5 is trained on 10k train and 10k test images with data augmentation. Achieved 90.22% accuracy. Solid on the second problem.

fashion_dataaug_1.h5 Is same architecture as fashion_full.h5 but with different data augmentation method and only 10k images. Achieved 90.98% accuracy. Best results on the second problem than previous solutions.

The second part of the project was to use the previously trained model to classify multiple clothing items in a single image and draw their bounding boxes. They are always rotated properly but they can be scaled in any way, also there is no overlap between items. Example of test image and solution is given below.

A detailed explanation is in ML D2.pdf

Image preprocessing - OpenCV

In order to extract individual items from a noisy image, I used several computer vision methods from OpenCV library.

Non-local Means Denoising
Inverted Binary Thresholding
Contour extraction

Since images are very low quality there were many useless contours so I ignored those that were inside other contours (problem with sandal images) and ignored very small contours. Then, to get centered clothing items I calculated the center of mass of each valid contour and shifted bounding box coordinates accordingly. The last part was to use cv2.bitwise_not function to invert pixel values because training images have a black background and whitish items.

Prerequisites

Keras
OpenCV
Tensorflow
SciPy
pandas

Usage

To try just run following commands.

git clone https://github.com/mmilunovic/fashion-mnist.git
cd fashion-mnist
python main.py <image_name>

This will produce an image named <image_name>_out.png which should contain bounding boxes and labels for each item.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.vscode		.vscode
models		models
tests		tests
ML D2.pdf		ML D2.pdf
README.md		README.md
main.py		main.py
model.py		model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fashion MNIST On Steroids

Problem description

Image preprocessing - OpenCV

Prerequisites

Usage

About

Releases

Packages

Languages

mmilunovic/fashion-mnist-on-steroids

Folders and files

Latest commit

History

Repository files navigation

Fashion MNIST On Steroids

Problem description

Image preprocessing - OpenCV

Prerequisites

Usage

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages