Ironhack-Data-Analytics:

Final Project

Kate Saslow

Who is "Happier"?

Sentiment Analysis of Tweets comparing my German and American Networks.

Introduction:

For the final project of my Ironhack data analytics course, I conducted an exploratory analysis of my Twitter network. I wrangled, cleaned, structured, and tagged my data using the Twitter API, tweepy, pandas, spaCy, and other tools.

I used the Twitter API to gather tweets from my friends and followers. Through the API, I was able to collect tweets, location, gender, and other helpful information for my analysis. I then filtered the tweets so that the analysis only includes people from Germany (plus Austria and Switzerland) and America (plus Canada).
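The location filter described above could be sketched roughly as follows. This is a minimal illustration and not the code from the notebooks: the keyword lists, the function name `assign_region`, and the sample records are all assumptions. Twitter profile locations are free text, so a simple keyword match like this will miss or misclassify some profiles.

```python
import re

# Hypothetical keyword lists (assumption); the notebooks may use different rules.
REGION_KEYWORDS = {
    "German": ["germany", "deutschland", "berlin", "austria", "österreich",
               "switzerland", "schweiz"],
    "American": ["usa", "united states", "new york", "california", "canada"],
}


def assign_region(location):
    """Return 'German', 'American', or None for a free-text profile location."""
    if not location:
        return None
    text = location.lower()
    for region, keywords in REGION_KEYWORDS.items():
        for keyword in keywords:
            # Whole-word match, so "usa" does not match inside "Jerusalem".
            if re.search(r"\b" + re.escape(keyword) + r"\b", text):
                return region
    return None


# Made-up sample records, for illustration only.
users = [
    {"user": "a", "location": "Berlin, Germany"},
    {"user": "b", "location": "Brooklyn, New York"},
    {"user": "c", "location": "Jerusalem"},
    {"user": "d", "location": ""},
    {"user": "e", "location": "Wien, Österreich"},
]

# Keep only users whose location maps to one of the two networks.
kept = []
for user in users:
    region = assign_region(user["location"])
    if region is not None:
        kept.append({**user, "region": region})
```

Users with an empty or unrecognized location are dropped rather than guessed, which keeps the two groups cleaner at the cost of a smaller sample.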

My hypothesis was that Germans would be more negative overall than Americans, but that the women in my American network would be more negative than the women in my German network. Only the former held true. As a next step, I would like to train my own classification model that picks up on sarcasm and context better, because I suspect that the women I follow (overwhelmingly women in tech) are highly sarcastic on Twitter.
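Checking a hypothesis like this comes down to comparing average sentiment per group. The sketch below shows one way to do that with the standard library. The field names (`network`, `gender`, `polarity`), the helper `mean_polarity`, and the assumption that each tweet already carries a polarity score between -1 and 1 are all hypothetical, and the records are toy values, not results from this project.

```python
from collections import defaultdict
from statistics import mean

# Toy records for illustration only; these are NOT results from the project.
# "polarity" is assumed to be a score in [-1, 1] from some sentiment tool.
tweets = [
    {"network": "German", "gender": "female", "polarity": 0.5},
    {"network": "German", "gender": "male", "polarity": -0.5},
    {"network": "American", "gender": "female", "polarity": 0.25},
    {"network": "American", "gender": "male", "polarity": 0.75},
    {"network": "German", "gender": "female", "polarity": 0.0},
]


def mean_polarity(records, *keys):
    """Average polarity for each combination of the given grouping keys."""
    groups = defaultdict(list)
    for record in records:
        group = tuple(record[key] for key in keys)
        groups[group].append(record["polarity"])
    return {group: mean(scores) for group, scores in groups.items()}


# Overall comparison between the two networks.
by_network = mean_polarity(tweets, "network")

# Finer comparison, e.g. women in one network versus women in the other.
by_network_gender = mean_polarity(tweets, "network", "gender")
```

The same grouping can be done with a pandas `groupby` once the tweets are in a dataframe. A difference in group means alone does not show that the gap is meaningful, so it is worth looking at group sizes and the spread of scores as well.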

Analysis:

Notebooks:
- 1_Twitter-API-Wrangling-the-Data.ipynb
- 2_Cleaning-and-Restructuring-Twitter-Data.ipynb
- 3_Sentiment-Analysis-on-Twitter-Network.ipynb
- 4_Visualizing-the-Twitter-Data.ipynb
- 5_Twitter-Analysis-with-spaCy.ipynb
- 6_Appendix_Training-NLP-Classification-Model.ipynb

Data Folder:
- all tweets wrangled from Twitter API
- all dataframes cleaned, restructured, and used for analysis and visualizations (refer to the individual notebooks for which CSV file is needed at each step)
- tweets processed for NLP

WordClouds Folder:
- all wordclouds generated in "5_Twitter-Analysis-with-spaCy.ipynb"
- wordclouds of frequent words used in American and German tweets
- wordclouds of frequent adjectives used in American and German tweets

Graphs Folder:
- all PNG files created in "4_Visualizing-the-Twitter-Data.ipynb" to analyze and compare the networks
