Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
51 commits
Select commit Hold shift + click to select a range
0446107
PedroBranch
Pmgbf Dec 8, 2025
a3e8000
allstars
Zina-1234 Dec 8, 2025
97dd330
allstars_Chiara_upload
chiaralanzi93-create Dec 8, 2025
cd9a2cf
changes
Pmgbf Dec 8, 2025
e92cbc9
DrawDB
Pmgbf Dec 8, 2025
70bea43
MiroDiagram
Pmgbf Dec 8, 2025
793c167
allstars2
Zina-1234 Dec 8, 2025
cd0b84f
Merge pull request #1 from ArrimachNasser/Chiara
Pmgbf Dec 9, 2025
41723b7
Merge pull request #2 from ArrimachNasser/Pedro
Pmgbf Dec 9, 2025
79a0185
Merge pull request #3 from ArrimachNasser/Zina
Pmgbf Dec 9, 2025
1412844
Add files via upload
Pmgbf Dec 9, 2025
1b78f6a
Add files via upload
Pmgbf Dec 9, 2025
7cf809b
Add files via upload
Pmgbf Dec 9, 2025
35606de
final_Chiara_upload
chiaralanzi93-create Dec 9, 2025
b5955ea
final
Zina-1234 Dec 9, 2025
c0969d6
graph
Zina-1234 Dec 9, 2025
d5af0b0
rename file
Zina-1234 Dec 10, 2025
cb80e56
Merge pull request #4 from ArrimachNasser/Zina
Zina-1234 Dec 10, 2025
1999df3
Merge pull request #5 from ArrimachNasser/Chiara
chiaralanzi93-create Dec 10, 2025
207f70e
something
Pmgbf Dec 10, 2025
1f18494
changes
Pmgbf Dec 10, 2025
78ba55d
Merge pull request #6 from ArrimachNasser/Pedro
Pmgbf Dec 10, 2025
fe36f80
resolving conflicts
chiaralanzi93-create Dec 10, 2025
33043f0
day3
Pmgbf Dec 10, 2025
70c819a
Slide upload
chiaralanzi93-create Dec 10, 2025
bf5c116
day 3
Zina-1234 Dec 10, 2025
ccaf094
slide
Zina-1234 Dec 11, 2025
6bd1ed1
Merge pull request #7 from ArrimachNasser/Zina
Zina-1234 Dec 11, 2025
958688a
Merge pull request #8 from ArrimachNasser/Chiara
chiaralanzi93-create Dec 11, 2025
9946369
refresh
Pmgbf Dec 11, 2025
042a97e
Merge pull request #9 from ArrimachNasser/Pedro
chiaralanzi93-create Dec 11, 2025
94a9ebd
refresh
Pmgbf Dec 11, 2025
b577f47
Saving work in progress
chiaralanzi93-create Dec 11, 2025
c91cb1e
Merge branch 'main' into Chiara
chiaralanzi93-create Dec 11, 2025
be7c207
slide upload
chiaralanzi93-create Dec 11, 2025
2854943
final
Pmgbf Dec 11, 2025
9581068
Merge pull request #10 from ArrimachNasser/Pedro
Pmgbf Dec 11, 2025
247e545
final
Pmgbf Dec 11, 2025
57ee7f2
final slides
chiaralanzi93-create Dec 11, 2025
18f0da1
final
Zina-1234 Dec 11, 2025
d062133
Merge pull request #11 from ArrimachNasser/Chiara
Pmgbf Dec 11, 2025
54feedf
Merge pull request #12 from ArrimachNasser/Zina
Pmgbf Dec 11, 2025
f7cd775
final
Pmgbf Dec 11, 2025
885a904
updates
chiaralanzi93-create Dec 12, 2025
6ba07b7
final!!
Zina-1234 Dec 12, 2025
fa4c2e6
Merge pull request #13 from ArrimachNasser/Zina
Zina-1234 Dec 12, 2025
6816ecd
changes
chiaralanzi93-create Dec 12, 2025
f45d24d
Renamed Jupyter notebook
chiaralanzi93-create Dec 12, 2025
1a60cde
Deleted slides/slides_Zina.pdf
chiaralanzi93-create Dec 12, 2025
bd4bdb1
Merge branch 'main' into Chiara
chiaralanzi93-create Dec 12, 2025
00fe81a
Merge pull request #15 from ArrimachNasser/Chiara
chiaralanzi93-create Dec 12, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
712 changes: 712 additions & 0 deletions .virtual_documents/notebooks/final_Zina.ipynb

Large diffs are not rendered by default.

758 changes: 758 additions & 0 deletions .virtual_documents/notebooks/final_pedro.ipynb

Large diffs are not rendered by default.

60 changes: 47 additions & 13 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,12 +1,21 @@
# Project overview
...
This project analyzes **global Michelin-starred restaurants** (1, 2, and 3 stars) to understand patterns in:
- Cuisine type vs Michelin star level
- Geographic distribution across regions and major cities
- Price trends by star rating
- Public reviews and their alignment with Michelin ratings

The datasets were concatenated, cleaned, and enriched with:
- Average price (`price_mean`)
- Review rating and count metrics

# Installation

1. **Clone the repository**:

```bash
git clone https://github.com/YourUsername/repository_name.git
git clone https://github.com/ArrimachNasser/first_project.git

```

2. **Install UV**
Expand Down Expand Up @@ -56,22 +65,47 @@ uv pip install -r requirements.txt
```

# Questions
...
1. Do certain cuisines have a higher probability of receiving 3 stars?
2. Which regions host the most starred restaurants?
3. How does the average price vary between 1, 2, and 3 stars?
4. Do public reviews align with Michelin star ratings?

# Dataset
...
We used three primary datasets:
- `one-star-michelin-restaurants.csv`
- `two-stars-michelin-restaurants.csv`
- `three-stars-michelin-restaurants.csv`

## Main dataset issues
**Additional sources / enrichment**:
- MICHELIN Guide for The restaurants' price ranges index
- Online Review information using API

- ...
- ...
- ...
## Main dataset issues
- Missing values in `price` column for some restaurants
- Inconsistent cuisine naming (e.g., "French cuisine" vs "French")
- Some restaurants incorrectly had five dollar symbols ($$$$$)
- Special characters in restaurant names

## Solutions for the dataset issues
...

# Conclussions
...
- Imputed missing prices with the **median price** per star level
- Normalized cuisine names by lowercase, stripping spaces, and mapping variants to a single category
- Removed duplicates based on `name + city`
- Created a `major_city` flag using top city populations
- Special Characters: Used 'unidecode' to clean restaurant names of accents and special characters
- Inconsistent cuisine naming: Reduced over 100 cuisine types into 10 coherent groups
- Geographic Organization: Grouped and ordered suburbs by city and region for clarity
- Symbol Inconsistencies: Fixed entries with too many $ symbols, removed extra spaces and hidden characters
- Handeling Missing Prices: Filled missing values using per star category.

# Conclusions
- New York, Hong Kong, and San Francisco are major culinary and cultural hubs that attract Michelin-star.
- New York leads in 1-star and 2-star Michelin restaurants, offering a diverse array of high-quality dining.
- Hong Kong, excels in 3-star Michelin establishments, indicating a focus on elite dining experiences.
- Among 3-star Michelin restaurants, the dominant cuisines are contemporary, French, and Chinese.
- Average price increases with star level, most 3-star restaurants tend to be the most expensive
- Among all cuisines, Austrian cuisine is the most expensive on average.

# Next steps
...
- Predict Michelin star level using machine learning
- Include chef profiles for deeper insights
- Analyze yearly trends using the `year` column
77 changes: 77 additions & 0 deletions README_Chiara.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,77 @@
# Project overview
...

# Installation

1. **Clone the repository**:

```bash
git clone https://github.com/YourUsername/repository_name.git
```

2. **Install UV**

If you're a MacOS/Linux user type:

```bash
curl -LsSf https://astral.sh/uv/install.sh | sh
```

If you're a Windows user open an Anaconda Powershell Prompt and type :

```bash
powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"
```

3. **Create an environment**

```bash
uv venv
```

3. **Activate the environment**

If you're a MacOS/Linux user type (if you're using a bash shell):

```bash
source ./venv/bin/activate
```

If you're a MacOS/Linux user type (if you're using a csh/tcsh shell):

```bash
source ./venv/bin/activate.csh
```

If you're a Windows user type:

```bash
.\venv\Scripts\activate
```

4. **Install dependencies**:

```bash
uv pip install -r requirements.txt
```

# Questions
...

# Dataset
...

## Main dataset issues

- ...
- ...
- ...

## Solutions for the dataset issues
...

# Conclussions
...

# Next steps
...
Binary file added anaconda_projects/db/project_filebrowser.db
Binary file not shown.
Loading