This is my capstone project. Its purpose is to determine the availability of science fiction movies on various streaming services. The analysis is performed in Python in a Jupyter Notebook, using the pandas, kagglehub, and matplotlib libraries.
vankirk_capstone_project_2025.ipynb: Jupyter Notebook containing the code for data analysis and visualization.
See requirements.txt for the required packages.
- Clone the repository and change into its directory:
    git clone https://github.com/danthreepo/CapstoneProject_2025
    cd CapstoneProject_2025
- Create and activate a virtual environment:
    python -m venv venv
  On Windows:
    .\venv\Scripts\activate
  On macOS or Linux:
    source venv/bin/activate
- Install the required packages:
    pip install pandas kagglehub matplotlib jupyter
- Install Jupyter extensions in Visual Studio Code:
  - Open Visual Studio Code.
  - Go to the Extensions view by clicking the Extensions icon in the Activity Bar on the side of the window or by pressing Ctrl+Shift+X.
  - Search for "Python" and install the extension provided by Microsoft.
  - Search for "Jupyter" and install the extension provided by Microsoft.
- Open the Jupyter Notebook:
  - Launch Visual Studio Code.
  - Open the vankirk_capstone_project_2025.ipynb file.
- Run the Notebook:
  - Execute the cells in the notebook to perform the analysis and generate visualizations.
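If a notebook cell fails with an import error, it can help to confirm that the packages from the install step are visible to the interpreter VS Code has selected. The following is a minimal sketch, not part of the project itself; the helper name `missing_packages` is made up for illustration, and it only checks the three analysis libraries named above.

```python
import importlib.util

# Analysis libraries named in this README.
REQUIRED = ["pandas", "kagglehub", "matplotlib"]


def missing_packages(names):
    """Return the names that cannot be found by the current interpreter."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are installed.")
```

Run it with the virtual environment activated, or paste it into the first cell of the notebook, so that it checks the same environment the notebook uses.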
- Loading Data: The data is loaded using the kagglehub library.
- Data Cleaning: The data is cleaned and preprocessed to ensure accurate analysis.
- Data Visualization: Various visualizations are created using matplotlib to analyze the availability of science fiction movies on different streaming services.
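The three stages above could be sketched roughly as follows. This is an illustration under stated assumptions, not the notebook's actual code: the dataset handle and CSV file name passed to `load_dataset` are placeholders, the column names `title`, `service`, and `genres` are hypothetical, and the cleaning rules shown (drop incomplete rows, trim whitespace, remove duplicates) are just one plausible choice.

```python
import os

import pandas as pd


def load_dataset(handle: str, csv_name: str) -> pd.DataFrame:
    """Download a Kaggle dataset and read one CSV file from it.

    Both arguments are placeholders; this README does not name the dataset.
    """
    import kagglehub  # imported here so the rest of the sketch runs offline

    path = kagglehub.dataset_download(handle)  # local folder with the files
    return pd.read_csv(os.path.join(path, csv_name))


def clean(df: pd.DataFrame) -> pd.DataFrame:
    """Drop incomplete rows, trim whitespace, and remove duplicate listings."""
    df = df.dropna(subset=["title", "service"]).copy()
    df["title"] = df["title"].str.strip()
    df["service"] = df["service"].str.strip()
    return df.drop_duplicates(subset=["title", "service"])


def scifi_counts(df: pd.DataFrame) -> pd.Series:
    """Count science fiction titles per streaming service."""
    is_scifi = df["genres"].fillna("").str.contains(
        "sci-fi|science fiction", case=False, regex=True
    )
    return df.loc[is_scifi, "service"].value_counts().sort_index()


def plot_counts(counts: pd.Series):
    """Draw a bar chart of titles per service with matplotlib."""
    import matplotlib.pyplot as plt

    ax = counts.plot(kind="bar")
    ax.set_xlabel("Streaming service")
    ax.set_ylabel("Science fiction titles")
    plt.tight_layout()
    return ax


# Small made-up sample so the sketch can be tried without downloading anything.
sample = pd.DataFrame(
    {
        "title": ["Alien", "Alien", "Alien ", "Heat", None, "Dune"],
        "service": ["Netflix", "Netflix", "Hulu", "Netflix", "Hulu", "Hulu "],
        "genres": ["Sci-Fi, Horror", "Sci-Fi, Horror", "Science Fiction",
                   "Crime", "Sci-Fi", "Sci-Fi"],
    }
)
print(scifi_counts(clean(sample)))
```

With real data, the result of `load_dataset` would take the place of `sample`, and `plot_counts` would be called on the resulting counts inside the notebook.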
If you would like to contribute to this project, please fork the repository and submit a pull request.
This project is licensed under the MIT License. See the LICENSE file for details.