31 commits
0446107
PedroBranch
Pmgbf Dec 8, 2025
a3e8000
allstars
Zina-1234 Dec 8, 2025
97dd330
allstars_Chiara_upload
chiaralanzi93-create Dec 8, 2025
cd9a2cf
changes
Pmgbf Dec 8, 2025
e92cbc9
DrawDB
Pmgbf Dec 8, 2025
70bea43
MiroDiagram
Pmgbf Dec 8, 2025
793c167
allstars2
Zina-1234 Dec 8, 2025
cd0b84f
Merge pull request #1 from ArrimachNasser/Chiara
Pmgbf Dec 9, 2025
41723b7
Merge pull request #2 from ArrimachNasser/Pedro
Pmgbf Dec 9, 2025
79a0185
Merge pull request #3 from ArrimachNasser/Zina
Pmgbf Dec 9, 2025
1412844
Add files via upload
Pmgbf Dec 9, 2025
1b78f6a
Add files via upload
Pmgbf Dec 9, 2025
7cf809b
Add files via upload
Pmgbf Dec 9, 2025
35606de
final_Chiara_upload
chiaralanzi93-create Dec 9, 2025
b5955ea
final
Zina-1234 Dec 9, 2025
c0969d6
graph
Zina-1234 Dec 9, 2025
d5af0b0
rename file
Zina-1234 Dec 10, 2025
cb80e56
Merge pull request #4 from ArrimachNasser/Zina
Zina-1234 Dec 10, 2025
1999df3
Merge pull request #5 from ArrimachNasser/Chiara
chiaralanzi93-create Dec 10, 2025
207f70e
something
Pmgbf Dec 10, 2025
1f18494
changes
Pmgbf Dec 10, 2025
78ba55d
Merge pull request #6 from ArrimachNasser/Pedro
Pmgbf Dec 10, 2025
fe36f80
resolving conflicts
chiaralanzi93-create Dec 10, 2025
33043f0
day3
Pmgbf Dec 10, 2025
70c819a
Slide upload
chiaralanzi93-create Dec 10, 2025
bf5c116
day 3
Zina-1234 Dec 10, 2025
ccaf094
slide
Zina-1234 Dec 11, 2025
6bd1ed1
Merge pull request #7 from ArrimachNasser/Zina
Zina-1234 Dec 11, 2025
958688a
Merge pull request #8 from ArrimachNasser/Chiara
chiaralanzi93-create Dec 11, 2025
9946369
refresh
Pmgbf Dec 11, 2025
94a9ebd
refresh
Pmgbf Dec 11, 2025
729 changes: 729 additions & 0 deletions .virtual_documents/notebooks/final_pedro.ipynb

Large diffs are not rendered by default.

77 changes: 0 additions & 77 deletions README.md

This file was deleted.

111 changes: 111 additions & 0 deletions README_Zina.md
@@ -0,0 +1,111 @@
# Project overview
This project analyzes **global Michelin-starred restaurants** (1, 2, and 3 stars) to understand patterns in:
- Cuisine type vs Michelin star level
- Geographic distribution across regions and major cities
- Price trends by star rating
- Public reviews and their alignment with Michelin ratings

The datasets were concatenated, cleaned, and enriched with:
- Average price (`price_mean`)
- Review rating and count metrics

# Installation

1. **Clone the repository**:

```bash
git clone https://github.com/ArrimachNasser/first_project.git
cd first_project
```

2. **Install UV**

On macOS or Linux, run:

```bash
curl -LsSf https://astral.sh/uv/install.sh | sh
```

On Windows, open an Anaconda PowerShell Prompt and run:

```powershell
powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"
```

3. **Create an environment**

```bash
uv venv
```

4. **Activate the environment**

`uv venv` creates the environment in a `.venv` folder by default, so the activation scripts live there.

On macOS or Linux with a bash shell, run:

```bash
source .venv/bin/activate
```

On macOS or Linux with a csh/tcsh shell, run:

```bash
source .venv/bin/activate.csh
```

On Windows, run:

```powershell
.venv\Scripts\activate
```

5. **Install dependencies**:

```bash
uv pip install -r requirements.txt
```

# Questions
1. Do certain cuisines have a higher probability of receiving 3 stars?
2. Which regions host the most starred restaurants?
3. How does the average price vary between 1, 2, and 3 stars?
4. Do public reviews align with Michelin star ratings?
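
Questions 1 and 3 come down to simple group statistics. The sketch below shows one way to compute them with the standard library only; the row layout, the sample values, and the function names `three_star_share` and `mean_price_by_stars` are illustrative assumptions, not the notebook's actual code.

```python
from collections import Counter, defaultdict
from statistics import mean

# Hypothetical rows; the real data comes from the cleaned Michelin dataset.
SAMPLE_ROWS = [
    {"cuisine": "french", "stars": 3, "price_mean": 300.0},
    {"cuisine": "french", "stars": 1, "price_mean": 100.0},
    {"cuisine": "japanese", "stars": 1, "price_mean": 80.0},
    {"cuisine": "japanese", "stars": 2, "price_mean": 200.0},
]


def three_star_share(rows):
    """Question 1: fraction of each cuisine's restaurants that hold 3 stars."""
    totals = Counter(row["cuisine"] for row in rows)
    three = Counter(row["cuisine"] for row in rows if row["stars"] == 3)
    return {cuisine: three[cuisine] / totals[cuisine] for cuisine in totals}


def mean_price_by_stars(rows):
    """Question 3: average of `price_mean` for each star level."""
    prices = defaultdict(list)
    for row in rows:
        prices[row["stars"]].append(row["price_mean"])
    return {stars: mean(values) for stars, values in sorted(prices.items())}


shares = three_star_share(SAMPLE_ROWS)
averages = mean_price_by_stars(SAMPLE_ROWS)
print(shares)
print(averages)
```

With real data, a cuisine with very few restaurants can show an extreme share, so it may be worth reporting the restaurant count next to each fraction.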

# Dataset
We used three primary datasets:
- `one-star-michelin-restaurants.csv`
- `two-stars-michelin-restaurants.csv`
- `three-stars-michelin-restaurants.csv`
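
As a rough illustration of how the three files can be combined into one table, the sketch below tags each row with its star level while reading. The in-memory sample texts, their column names, and the helper `load_with_stars` are assumptions made for the example; the real CSV files may have a different layout.

```python
import csv
import io

# Hypothetical in-memory stand-ins for the three CSV files. In the project
# the files would be opened from disk and may have different columns.
SAMPLE_SOURCES = {
    1: "name,city,cuisine,price\nExample Bistro,Example City,French,$$\n",
    2: "name,city,cuisine,price\nExample Table,Example City,Japanese,$$$\n",
    3: "name,city,cuisine,price\nExample Maison,Example Town,French cuisine,$$$$\n",
}


def load_with_stars(sources):
    """Read each CSV text and add a `stars` field to every row."""
    rows = []
    for stars, text in sources.items():
        for row in csv.DictReader(io.StringIO(text)):
            row["stars"] = stars  # keep the star level after concatenation
            rows.append(row)
    return rows


restaurants = load_with_stars(SAMPLE_SOURCES)
print(len(restaurants), [row["stars"] for row in restaurants])
```

Adding the `stars` field before concatenating matters because the star level is otherwise only encoded in the file name.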

**Additional sources / enrichment**:
- MICHELIN Guide, for the restaurants' price-range index
- Online review information retrieved through an API

## Main dataset issues
- Missing values in `price` column for some restaurants
- Inconsistent cuisine naming (e.g., "French cuisine" vs "French")
- Some restaurants incorrectly had five dollar symbols (`$$$$$`)
- Special characters in restaurant names

## Solutions for the dataset issues
- Missing prices: imputed missing values with the **median price** per star level
- Inconsistent cuisine naming: normalized names (lowercasing, stripping spaces, mapping variants to a single category), reducing over 100 cuisine types to ~10 coherent groups
- Duplicates: removed duplicates based on `name + city`
- Major cities: created a `major_city` flag using top city populations
- Special characters: used `unidecode` to strip accents and special characters from restaurant names
- Geographic organization: grouped and ordered suburbs by city and region for clarity
- Symbol inconsistencies: fixed entries with too many `$` symbols and removed extra spaces and hidden characters
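
The cleaning steps above could look roughly like the following. This is a minimal sketch under stated assumptions, not the project's code: it uses the standard library's `unicodedata` as a stand-in for `unidecode`, and the `CUISINE_GROUPS` mapping, the `MAX_DOLLARS` cap of four symbols, and the `price_level` field are hypothetical.

```python
import statistics
import unicodedata

# Hypothetical mapping; the project's real cuisine groups are not listed here.
CUISINE_GROUPS = {"modern french": "french", "classic french": "french"}
MAX_DOLLARS = 4  # assumption: four symbols is the highest valid price level


def strip_accents(text):
    """Stand-in for unidecode: decompose characters and drop accent marks."""
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def normalize_cuisine(raw):
    """Lowercase, trim, drop a trailing ' cuisine', then map known variants."""
    key = raw.strip().lower()
    if key.endswith(" cuisine"):
        key = key[: -len(" cuisine")].strip()
    return CUISINE_GROUPS.get(key, key)


def clean_price(raw):
    """Count '$' symbols (ignoring spaces and hidden characters) and cap them."""
    dollars = min(raw.count("$"), MAX_DOLLARS) if raw else 0
    return dollars or None  # None marks a missing price


def impute_median_by_stars(rows):
    """Fill missing `price_level` values with the median of the same star level."""
    known = {}
    for row in rows:
        if row["price_level"] is not None:
            known.setdefault(row["stars"], []).append(row["price_level"])
    medians = {stars: statistics.median(values) for stars, values in known.items()}
    for row in rows:
        if row["price_level"] is None:
            row["price_level"] = medians.get(row["stars"])
    return rows


def drop_duplicates(rows):
    """Keep the first row for each (name, city) pair, compared case-insensitively."""
    seen, unique = set(), []
    for row in rows:
        key = (row["name"].lower(), row["city"].lower())
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique
```

Imputing within each star level rather than across the whole dataset keeps a missing 3-star price from being pulled down by the far more numerous 1-star restaurants.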

# Conclusions
- New York, Hong Kong, and San Francisco are major culinary and cultural hubs that attract Michelin-starred restaurants.
- New York leads in 1-star and 2-star Michelin restaurants, offering a diverse array of high-quality dining.
- Hong Kong excels in 3-star Michelin establishments, indicating a focus on elite dining experiences.
- Among 3-star Michelin restaurants, the dominant cuisines are contemporary, French, and Chinese.
- Average price increases with star level; 3-star restaurants tend to be the most expensive.
- Among all cuisines, Austrian cuisine is the most expensive on average.

# Next steps
- Predict Michelin star level using machine learning
- Include chef profiles for deeper insights
- Analyze yearly trends using the `year` column
41 changes: 41 additions & 0 deletions Untitled.ipynb
@@ -0,0 +1,41 @@
{
"cells": [
{
"cell_type": "code",
"execution_count": null,
<<<<<<< HEAD
"id": "a858336c-9674-456d-b7f5-bd73f7b63289",
=======
"id": "0f2c3c46-cd21-4e45-802f-204b44620a2b",
>>>>>>> 958688a72a293686bb571ab3125df9e9cb6aaace
"metadata": {},
"outputs": [],
"source": []
}
],
"metadata": {
"kernelspec": {
"display_name": "venv",
"language": "python",
"name": "venv"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
<<<<<<< HEAD
"version": "3.14.0"
=======
"version": "3.13.5"
>>>>>>> 958688a72a293686bb571ab3125df9e9cb6aaace
}
},
"nbformat": 4,
"nbformat_minor": 5
}
Binary file added anaconda_projects/db/project_filebrowser.db
Binary file not shown.