Template Credit: Adapted from a template made available by Dr. Jason Brownlee of Machine Learning Mastery.
SUMMARY: This project aims to construct a predictive model using a TensorFlow convolutional neural network (CNN) and document the end-to-end steps using a template. The Chest X-Ray Pneumonia dataset is a binary classification situation where we attempt to predict one of the two possible outcomes.
INTRODUCTION: The dataset contains chest X-ray images selected from retrospective cohorts of pediatric patients of one to five years old from Guangzhou Women and Children’s Medical Center, Guangzhou. The image collection is organized into three folders (train, test, val) and contains subfolders for each image category (Pneumonia/Normal). There are 5,863 X-Ray images with various display resolutions in this collection.
From iteration Take1, we trained a simple three-layer CNN model and used the model’s performance as the baseline.
In this Take2 iteration, we will use the same three-layer CNN model from Take1 and apply it to the same photos with higher resolutions (from 640×480 to 1024×768).
ANALYSIS: From iteration Take1, the baseline model’s performance achieved an accuracy score of 100% after 20 epochs using the validation dataset. However, the final model processed the test dataset with an accuracy measurement of only 72.91%.
In this Take2 iteration, the model’s performance achieved an accuracy score of 62.50% after ten epochs using the validation dataset. However, the final model processed the test dataset with an encouraging accuracy measurement of 83.33%.
CONCLUSION: In this iteration, the TensorFlow CNN model appeared to be suitable for modeling this dataset, but we need to experiment with the TensorFlow model to improve its accuracy.
Dataset Used: Chest X-Ray Images (Pneumonia) Dataset
Dataset ML Model: Binary image classification with numerical attributes
Dataset References: https://www.kaggle.com/paultimothymooney/chest-xray-pneumonia, https://data.mendeley.com/datasets/rscbjbr9sj/2, and http://www.cell.com/cell/fulltext/S0092-8674(18)30154-5.
One potential source of performance benchmarks: https://www.kaggle.com/paultimothymooney/chest-xray-pneumonia
The HTML formatted report can be found here on GitHub.