Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
22 changes: 22 additions & 0 deletions Pipfile
Original file line number Diff line number Diff line change
@@ -0,0 +1,22 @@
[[source]]
name = "pypi"
url = "https://pypi.org/simple"
verify_ssl = true

[dev-packages]

[packages]
flask = "*"
flask-sqlalchemy = "*"
flask-migrate = "*"
requests = "*"
python-dotenv = "*"
pandas = "*"
numpy = "*"
sklearn = "*"
gunicorn = "*"
psycopg2-binary = "*"
joblib = "*"

[requires]
python_version = "3.7"
363 changes: 363 additions & 0 deletions Pipfile.lock

Large diffs are not rendered by default.

1 change: 1 addition & 0 deletions Procfile
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
web: gunicorn app:app
9 changes: 8 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
@@ -1 +1,8 @@
# Data-Science
# Data-Science

For this project, our goal is to accurately predict the price of a potential listing for AirBnB in the city of Berlin, Germany.

## Notebook

The notebook for this repo 'airbnb_notebook.ipynb' is used to take 'Berlin.csv' which
is taken and cleaned up from the original at https://www.kaggle.com/brittabettendorf/berlin-airbnb-data and turn that into a predictive model for prices of airbnb listings in the Berlin area. This model is persisted to `berlin_model.gz` as a compressed joblib dump (instead of pickle). Next `app.py` reads this model and uses it to make predictions in the web interface, for now arranged as html under `templates/`. `features.json`contains a skeleton JSON of feature variables with default values set to '0', and is used to transform html form input into a format used to do the model prediction. `requirements.txt`, `runtime.txt`, and `Procfile` are all necessary to run on Heroku.
Loading