House_Price_Iowa_Project

Repo for the Regression House Price competition project:

https://www.kaggle.com/c/iowa-house-prices-regression-techniques/data

Project Overview

Exploratory Data Analysis, outlier identification and data cleaning.
Modelling using Random Forest, CatBoost and XGBoost Regressors.
Hyperparameter tuning using RandomizedSearchCV and GridSearchCV.
Test set predictions.

Code and Resources used

Python Version: 3.8.2

Packages: pandas, NumPy, Matplotlib, Seaborn, scikit-learn, XGBoost, CatBoost

EDA: Exploratory Data Analysis

The EDA shows how the data is distributed and how the different features relate to one another. A few highlights from the graphs displayed:

Data Cleaning

First use the pandas API to clean the training dataset (df1) as follows:

Fill missing numerical values with the feature median
Convert object data into numerical values
Create a binary column with Boolean values that flags missing data

Then wrap the whole process in a preprocess_data(df) function that performs the same transformations.
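
The cleaning steps above could be sketched roughly as follows. This is an illustration, not the notebook's actual code: the `_is_missing` column suffix, the use of category codes for object columns, and the toy data are assumptions made for the example.

```python
import pandas as pd


def preprocess_data(df):
    """Sketch of the cleaning steps (assumed details, see the text above)."""
    df = df.copy()  # leave the caller's DataFrame untouched
    for label in list(df.columns):
        content = df[label]
        has_missing = content.isnull().any()
        if has_missing:
            # Boolean flag column recording where the value was missing
            # (the "_is_missing" suffix is a hypothetical naming choice).
            df[label + "_is_missing"] = content.isnull()
        if pd.api.types.is_numeric_dtype(content):
            if has_missing:
                # Fill numerical gaps with the feature median.
                df[label] = content.fillna(content.median())
        else:
            # Turn object data into integer codes; missing values get
            # code -1 from pandas, so adding 1 maps them to 0.
            df[label] = pd.Categorical(content).codes + 1
    return df


# Toy data for illustration only.
toy = pd.DataFrame({
    "LotArea": [1.0, None, 3.0],
    "Street": ["Pave", None, "Grvl"],
})
cleaned = preprocess_data(toy)
print(cleaned)
```

Because the same function can be applied to both the training and the test data, the two sets end up with the same kinds of transformations, although the flag columns created depend on which columns contain missing values in each set.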

Model Building

Split the data into training and test sets.
Create a fit_and_score(model) function to instantiate different estimators and compare their scores side by side.
Three different estimators: Random Forest Regressor, XGBoost Regressor and CatBoost Regressor.
Hyperparameter tuning using RandomizedSearchCV and GridSearchCV for the two best-performing estimators.
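
A minimal sketch of the modelling steps above, under stated assumptions. Since the target is a sale price, the sketch uses a regressor. The `fit_and_score` signature shown here (a dictionary of models plus the data splits), the synthetic data from `make_regression`, and the search grid are all assumptions for illustration; the notebook's own function and parameter ranges may differ. Only Random Forest is included so the example runs with scikit-learn alone; XGBoost and CatBoost regressors could be added to the same dictionary.

```python
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import RandomizedSearchCV, train_test_split


def fit_and_score(models, X_train, X_valid, y_train, y_valid):
    """Fit each estimator and collect its validation score (R^2)."""
    scores = {}
    for name, model in models.items():
        model.fit(X_train, y_train)
        # For regressors, .score() returns the coefficient of determination.
        scores[name] = model.score(X_valid, y_valid)
    return scores


# Synthetic stand-in for the cleaned house price data.
X, y = make_regression(n_samples=200, n_features=5, noise=10.0, random_state=42)
X_train, X_valid, y_train, y_valid = train_test_split(
    X, y, test_size=0.2, random_state=42
)

models = {
    "Random Forest": RandomForestRegressor(n_estimators=50, random_state=42),
}
scores = fit_and_score(models, X_train, X_valid, y_train, y_valid)
print(scores)

# Randomized search over a small, hypothetical parameter grid.
param_distributions = {
    "n_estimators": [20, 50],
    "max_depth": [None, 5, 10],
    "min_samples_leaf": [1, 2, 4],
}
search = RandomizedSearchCV(
    RandomForestRegressor(random_state=42),
    param_distributions=param_distributions,
    n_iter=4,
    cv=3,
    random_state=42,
)
search.fit(X_train, y_train)
print(search.best_params_)
```

GridSearchCV takes a `param_grid` argument instead of `param_distributions` and tries every combination, so it is usually run on a narrower grid around the best values found by the randomized search.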

Make predictions

Once the best model has been identified, that is, the one with the lowest RMSLE (root mean squared logarithmic error), use it to make predictions on the test data.
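
The metric named above is read here as the root mean squared logarithmic error (RMSLE). A minimal sketch of how it could be computed is shown below; the function name `rmsle` is hypothetical, and the sketch uses log(1 + x), the convention followed by scikit-learn's `mean_squared_log_error`. The prices in the usage example are made up.

```python
import numpy as np


def rmsle(y_true, y_pred):
    """Root mean squared logarithmic error, using log(1 + x)."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    # Square the difference of the log-transformed values, average, take the root.
    return float(np.sqrt(np.mean((np.log1p(y_pred) - np.log1p(y_true)) ** 2)))


# Made-up prices for illustration only.
actual = [200000.0, 150000.0, 320000.0]
predicted = [210000.0, 140000.0, 300000.0]
print(rmsle(actual, predicted))
```

Because the error is measured on the log scale, a given percentage miss on a cheap house weighs about the same as the same percentage miss on an expensive one.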
