Data Validation for Bondora P2P Lending Using Python and TensorFlow Data Validation

SUMMARY: The project aims to construct a data validation flow using TensorFlow Data Validation (TFDV) and document the end-to-end steps using a template. The Bondora P2P Lending dataset is a binary classification situation where we attempt to predict one of the two possible outcomes. INTRODUCTION: The Kaggle dataset owner retrieved this dataset from Bondora, a… Read More

Data Validation for LendingClub Loan Data Using Python and TensorFlow Data Validation

SUMMARY: The project aims to construct a data validation flow using TensorFlow Data Validation (TFDV) and document the end-to-end steps using a template. The Kaggle LendingClub Loan Data dataset is a binary classification situation where we attempt to predict one of the two possible outcomes. INTRODUCTION: The Kaggle dataset owner derived this dataset from the… Read More

Data Validation for Diabetes 130 US Hospitals Using Python and TensorFlow Data Validation

SUMMARY: The project aims to construct a data validation flow using TensorFlow Data Validation (TFDV) and document the end-to-end steps using a template. The Diabetes 130 US Hospitals dataset is a binary classification situation where we attempt to predict one of the two possible outcomes. INTRODUCTION: The data set is the Diabetes 130-US Hospitals for… Read More

Data Validation for Kaggle Tabular Playground Series Apr 2021 Using Python and TensorFlow Data Validation

SUMMARY: The project aims to construct a data validation flow using TensorFlow Data Validation (TFDV) and document the end-to-end steps using a template. The Kaggle Tabular Playground Series Apr 2021 dataset is a binary classification situation where we attempt to predict one of the two possible outcomes. INTRODUCTION: Kaggle wants to provide an approachable environment… Read More

Data Validation for Kaggle Tabular Playground Series Mar 2021 Using Python and TensorFlow Data Validation

SUMMARY: The project aims to construct a data validation flow using TensorFlow Data Validation (TFDV) and document the end-to-end steps using a template. The Kaggle Tabular Playground Series Mar 2021 dataset is a binary classification situation where we attempt to predict one of the two possible outcomes. INTRODUCTION: Kaggle wants to provide an approachable environment… Read More

Data Validation for Chicago Taxi Trips Using Python and TensorFlow Data Validation

SUMMARY: The project aims to construct a data validation flow using TensorFlow Data Validation (TFDV) and document the end-to-end steps using a template. The Chicago Taxi Trips dataset is a regression situation where we attempt to predict the value of a continuous variable. INTRODUCTION: The City of Chicago collects taxi trip data in its role… Read More