This repository contains tools and analyses for the New York City Yellow Taxi Tripdata dataset. The data covers taxi trips in New York City from January 2019 to June 2020 (101.246.797 rows)
This analysis examines patterns and trends in NYC taxi trip data to gain valuable insights into traffic patterns, ride habits, and economic factors.
The dataset consists of:
- Monthly CSV files from January 2019 to June 2020
- A taxi zone lookup table (taxi_zone_lookup.csv)
- Shape files for taxi zones in the "taxi_zones" directory
- Python 3.x
- Pandas
- NumPy
- Matplotlib/Seaborn
- Jupyter Notebook
- PyArrow/Parquet (for efficient data storage)
Please note that the NYC Taxi & Limousine Commission (TLC) is the owner of the original data. Usage should comply with their terms of use.