Template Credit: Adapted from a template made available by Dr. Jason Brownlee of Machine Learning Mastery.
SUMMARY: This project aims to construct a predictive model using a TensorFlow convolutional neural network (CNN) and document the end-to-end steps using a template. The PatchCamelyon Grand Challenge dataset is a binary-class classification situation where we attempt to predict one of two possible outcomes.
INTRODUCTION: The PatchCamelyon benchmark is a new and challenging image classification dataset. It consists of 327680 color images (96 x 96px) extracted from histopathologic scans of lymph node sections. Each image is annotated with a binary label indicating presence of metastatic tissue. This dataset provides a useful benchmark for machine learning models that are bigger than CIFAR10 but smaller than ImageNet.
In this Take1 iteration, we will construct a CNN model using a simple three-block VGG architecture and test the model’s performance using a held-out test dataset.
ANALYSIS: In this Take1 iteration, the model’s performance achieved an accuracy score of 79.83% on the validation dataset after 10 epochs. After we apply the final model to the test dataset, the model achieved an accuracy score of 79.00%.
CONCLUSION: In this iteration, the simple three-block VGG CNN model appeared to be suitable for modeling this dataset. We should consider experimenting with TensorFlow for further modeling.
Dataset Used: PatchCamelyon Grand Challenge
Dataset ML Model: Binary-class image classification with numerical attributes
Dataset Reference: https://patchcamelyon.grand-challenge.org/
Potential source of performance benchmarks: https://patchcamelyon.grand-challenge.org/evaluation/challenge/leaderboard/
The HTML formatted report can be found here on GitHub.